From: Peter Zijlstra <peterz@infradead.org>
To: Masami Hiramatsu <mhiramat@kernel.org>
Cc: x86@kernel.org, linux-kernel@vger.kernel.org, rostedt@goodmis.org,
	bristot@redhat.com, jbaron@akamai.com, torvalds@linux-foundation.org,
	tglx@linutronix.de, mingo@kernel.org, namit@vmware.com, hpa@zytor.com,
	luto@kernel.org, ard.biesheuvel@linaro.org, jpoimboe@redhat.com,
	jeyu@kernel.org, paulmck@kernel.org, mathieu.desnoyers@efficios.com
Subject: Re: [PATCH v4 12/16] x86/kprobes: Fix ordering
Date: Tue, 22 Oct 2019 12:31:26 +0200
Message-ID: <20191022103126.GN1817@hirez.programming.kicks-ass.net>
In-Reply-To: <20191022103521.3015bc5e128cd68fa645013c@kernel.org>

On Tue, Oct 22, 2019 at 10:35:21AM +0900, Masami Hiramatsu wrote:
> On Fri, 18 Oct 2019 09:35:37 +0200
> Peter Zijlstra <peterz@infradead.org> wrote:
>
> > Kprobes does something like:
> >
> > register:
> >     arch_arm_kprobe()
> >       text_poke(INT3)
> >       /* guarantees nothing, INT3 will become visible at some point, maybe */
> >
> >     kprobe_optimizer()
> >       /* guarantees the bytes after INT3 are unused */
> >       synchronize_rcu_tasks();
> >       text_poke_bp(JMP32);
> >       /* implies IPI-sync, kprobe really is enabled */
> >
> >
> > unregister:
> >     __disarm_kprobe()
> >       unoptimize_kprobe()
> >         text_poke_bp(INT3 + tail);
> >         /* implies IPI-sync, so tail is guaranteed visible */
> >       arch_disarm_kprobe()
> >         text_poke(old);
> >         /* guarantees nothing, old will maybe become visible */
> >
> >     synchronize_rcu()
> >
> >     free-stuff
> Note that this is only for the case of optimized kprobe.
> (On some probe points we can not optimize it)

Of course..

> > Now the problem is that on register, the synchronize_rcu_tasks() does
> > not imply sufficient to guarantee all CPUs have already observed INT3
> > (although in practise this is exceedingly unlikely not to have
> > happened) (similar to how MEMBARRIER_CMD_PRIVATE_EXPEDITED does not
> > imply MEMBARRIER_CMD_PRIVATE_EXPEDITED_SYNC_CORE).
>
> OK, so the sync_core() after int3 is needed to guarantee the probe
> is enabled on each core.

Indeed, AFAIU the old instruction can stay in I$ or micro-ops caches
which are not (fully) cache coherent. So without forcing a serializing
instruction (CPUID, CR3 write, etc..) there is no guarantee.

> > Worse, even if it did, we'd have to do 2 synchronize calls to provide
> > the guarantee we're looking for, the first to ensure INT3 is visible,
> > the second to guarantee nobody is then still using the instruction
> > bytes after INT3.
>
> I think this 2nd guarantee is done by synchronize_rcu() if we
> put sync_core() after int3. synchronize_rcu() ensures that
> all cores once scheduled and all interrupts have done.

Right, with IPI it all works, without IPI it might be that the INT3 is
still visible when we start synchronize_rcu() and thereby violate the
RCU guarantees.

> > --- a/arch/x86/kernel/kprobes/opt.c
> > +++ b/arch/x86/kernel/kprobes/opt.c
> > @@ -444,14 +444,10 @@ void arch_optimize_kprobes(struct list_h
> >  /* Replace a relative jump with a breakpoint (int3). */
> >  void arch_unoptimize_kprobe(struct optimized_kprobe *op)
> >  {
> > -	u8 insn_buff[JMP32_INSN_SIZE];
> > -
> > -	/* Set int3 to first byte for kprobes */
> > -	insn_buff[0] = INT3_INSN_OPCODE;
> > -	memcpy(insn_buff + 1, op->optinsn.copied_insn, DISP32_SIZE);
> > -
> > -	text_poke_bp(op->kp.addr, insn_buff, JMP32_INSN_SIZE,
> > -		     text_gen_insn(JMP32_INSN_OPCODE, op->kp.addr, op->optinsn.insn));
> > +	arch_arm_kprobe(&op->kp);
> > +	text_poke(op->kp.addr + INT3_INSN_SIZE,
> > +		  op->optinsn.copied_insn, DISP32_SIZE);
> > +	text_poke_sync();
> >  }
>
> For this part, I thought it was same as what text_poke_bp() does.
> But, indeed, this looks better (simpler & lighter) than using
> text_poke_bp()...

Indeed. The reason I wrote it as such is that now the
text_poke_bp(.emulate) argument is unused, which I can go remove in a
future patch.
The whole reason I went looking here is that I was going to write a
comment on how this was correct; it seems I've still to do that..

/me adds the two entries to the TODO list.

> So, in total, this looks good to me.
>
> Acked-by: Masami Hiramatsu <mhiramat@kernel.org>

Thanks!
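
For readers following the arch_unoptimize_kprobe() hunk quoted above, here
is a minimal userspace sketch of its byte-level effect only. It is not
kernel code: sim_text_poke(), sim_unoptimize_old(), sim_unoptimize_new()
and sim_schemes_match() are hypothetical names, the "text" is an ordinary
byte array, and the sample instruction bytes are made up. It does not
model concurrency, IPI synchronization, or breakpoint emulation, which are
the actual subject of the patch; it only illustrates that writing INT3
first and the four tail bytes second leaves the same bytes behind as
writing a pre-built five-byte buffer.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* x86 opcodes and sizes; the macro names mirror those used in the patch. */
#define INT3_INSN_OPCODE  0xCC
#define INT3_INSN_SIZE    1
#define JMP32_INSN_OPCODE 0xE9
#define DISP32_SIZE       4
#define JMP32_INSN_SIZE   5

/* Hypothetical stand-in for text_poke(): a plain copy into a byte array. */
static void sim_text_poke(uint8_t *addr, const void *src, size_t len)
{
	memcpy(addr, src, len);
}

/* Removed scheme: build INT3 + tail in a buffer, write all five bytes. */
static void sim_unoptimize_old(uint8_t *addr, const uint8_t *copied_insn)
{
	uint8_t insn_buff[JMP32_INSN_SIZE];

	insn_buff[0] = INT3_INSN_OPCODE;
	memcpy(insn_buff + 1, copied_insn, DISP32_SIZE);
	sim_text_poke(addr, insn_buff, JMP32_INSN_SIZE);
}

/* Added scheme: write INT3 first, then restore the four tail bytes. */
static void sim_unoptimize_new(uint8_t *addr, const uint8_t *copied_insn)
{
	const uint8_t int3 = INT3_INSN_OPCODE;

	sim_text_poke(addr, &int3, INT3_INSN_SIZE);
	sim_text_poke(addr + INT3_INSN_SIZE, copied_insn, DISP32_SIZE);
}

/* Returns 1 when both schemes leave identical bytes starting with INT3. */
int sim_schemes_match(void)
{
	/* Made-up sample data: an optimized JMP32 and the saved tail bytes. */
	const uint8_t jmp[JMP32_INSN_SIZE] = {
		JMP32_INSN_OPCODE, 0x10, 0x20, 0x30, 0x40
	};
	const uint8_t copied_insn[DISP32_SIZE] = { 0x48, 0x89, 0xe5, 0x90 };
	uint8_t a[JMP32_INSN_SIZE], b[JMP32_INSN_SIZE];

	memcpy(a, jmp, sizeof(a));
	memcpy(b, jmp, sizeof(b));
	sim_unoptimize_old(a, copied_insn);
	sim_unoptimize_new(b, copied_insn);

	return memcmp(a, b, sizeof(a)) == 0 && a[0] == INT3_INSN_OPCODE;
}
```

Calling sim_schemes_match() from a test harness returns 1 for the sample
bytes. In the kernel the final bytes are not the whole story: the quoted
hunk follows the two writes with text_poke_sync() because the order and
visibility of those writes on other CPUs is what matters.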