linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
       [not found] <87likkt7u6.fsf@devron.myhome.or.jp>
@ 2012-05-22 18:11 ` OGAWA Hirofumi
  2012-05-22 21:46   ` Steven Rostedt
  0 siblings, 1 reply; 7+ messages in thread
From: OGAWA Hirofumi @ 2012-05-22 18:11 UTC (permalink / raw)
  To: Ingo Molnar, Masami Hiramatsu; +Cc: linux-kernel

[forgot to Cc: lkml, resend]

Hi,

Maybe, nobody using debug patch in atomic_dec_and_test()...  Well,
anyway, how about this?



stop_machine_first is just to see if it is first one or not.  So, there
is no reason to use atomic_dec_and_test(), and makes the value below 0.

I think it is not desirable, because this usage only triggers
atomic_dec_and_test() underflow debug patch. (the patch tests result
of atomic_dec_and_test() is < 0)

So, this uses test_and_clear_bit() instead.

Signed-off-by: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
---

 arch/x86/kernel/alternative.c |    8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff -puN arch/x86/kernel/alternative.c~stop_machine-use-test_and_set_bit arch/x86/kernel/alternative.c
--- linux/arch/x86/kernel/alternative.c~stop_machine-use-test_and_set_bit	2012-05-23 02:48:01.000000000 +0900
+++ linux-hirofumi/arch/x86/kernel/alternative.c	2012-05-23 02:48:01.000000000 +0900
@@ -650,7 +650,7 @@ void *__kprobes text_poke(void *addr, co
  * Cross-modifying kernel text with stop_machine().
  * This code originally comes from immediate value.
  */
-static atomic_t stop_machine_first;
+static unsigned long stop_machine_first;
 static int wrote_text;
 
 struct text_poke_params {
@@ -664,7 +664,7 @@ static int __kprobes stop_machine_text_p
 	struct text_poke_param *p;
 	int i;
 
-	if (atomic_dec_and_test(&stop_machine_first)) {
+	if (test_and_clear_bit(0, &stop_machine_first)) {
 		for (i = 0; i < tpp->nparams; i++) {
 			p = &tpp->params[i];
 			text_poke(p->addr, p->opcode, p->len);
@@ -714,7 +714,7 @@ void *__kprobes text_poke_smp(void *addr
 	p.len = len;
 	tpp.params = &p;
 	tpp.nparams = 1;
-	atomic_set(&stop_machine_first, 1);
+	stop_machine_first = 1;
 	wrote_text = 0;
 	/* Use __stop_machine() because the caller already got online_cpus. */
 	__stop_machine(stop_machine_text_poke, (void *)&tpp, cpu_online_mask);
@@ -736,7 +736,7 @@ void __kprobes text_poke_smp_batch(struc
 {
 	struct text_poke_params tpp = {.params = params, .nparams = n};
 
-	atomic_set(&stop_machine_first, 1);
+	stop_machine_first = 1;
 	wrote_text = 0;
 	__stop_machine(stop_machine_text_poke, (void *)&tpp, cpu_online_mask);
 }
_

-- 
OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-05-22 18:11 ` [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine OGAWA Hirofumi
@ 2012-05-22 21:46   ` Steven Rostedt
  2012-05-23  2:44     ` OGAWA Hirofumi
  0 siblings, 1 reply; 7+ messages in thread
From: Steven Rostedt @ 2012-05-22 21:46 UTC (permalink / raw)
  To: OGAWA Hirofumi; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel

On Wed, May 23, 2012 at 03:11:48AM +0900, OGAWA Hirofumi wrote:
> [forgot to Cc: lkml, resend]
> 
> Hi,
> 
> Maybe, nobody using debug patch in atomic_dec_and_test()...  Well,
> anyway, how about this?

What debug patch?

> 
> 
> 
> stop_machine_first is just to see if it is first one or not.  So, there
> is no reason to use atomic_dec_and_test(), and makes the value below 0.
> 
> I think it is not desirable, because this usage only triggers
> atomic_dec_and_test() underflow debug patch. (the patch tests result
> of atomic_dec_and_test() is < 0)

Well it should only underflow if you have a box with more than 2 billion
CPUs.

> 
> So, this uses test_and_clear_bit() instead.
> 
> Signed-off-by: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
> ---
> 
>  arch/x86/kernel/alternative.c |    8 ++++----
>  1 file changed, 4 insertions(+), 4 deletions(-)
> 
> diff -puN arch/x86/kernel/alternative.c~stop_machine-use-test_and_set_bit arch/x86/kernel/alternative.c
> --- linux/arch/x86/kernel/alternative.c~stop_machine-use-test_and_set_bit	2012-05-23 02:48:01.000000000 +0900
> +++ linux-hirofumi/arch/x86/kernel/alternative.c	2012-05-23 02:48:01.000000000 +0900
> @@ -650,7 +650,7 @@ void *__kprobes text_poke(void *addr, co
>   * Cross-modifying kernel text with stop_machine().
>   * This code originally comes from immediate value.
>   */
> -static atomic_t stop_machine_first;
> +static unsigned long stop_machine_first;

The down side to this is that it adds 4 more bytes on a 64bit
machine. (sizeof(unsigned log) == 8 and sizeof(atomic_t) == 4)

You could probably also set it to -1, and do a atomic_inc_and_test(),
would that also cause the debug to trigger too?

-- Steve

>  static int wrote_text;
>  
>  struct text_poke_params {
> @@ -664,7 +664,7 @@ static int __kprobes stop_machine_text_p
>  	struct text_poke_param *p;
>  	int i;
>  
> -	if (atomic_dec_and_test(&stop_machine_first)) {
> +	if (test_and_clear_bit(0, &stop_machine_first)) {
>  		for (i = 0; i < tpp->nparams; i++) {
>  			p = &tpp->params[i];
>  			text_poke(p->addr, p->opcode, p->len);
> @@ -714,7 +714,7 @@ void *__kprobes text_poke_smp(void *addr
>  	p.len = len;
>  	tpp.params = &p;
>  	tpp.nparams = 1;
> -	atomic_set(&stop_machine_first, 1);
> +	stop_machine_first = 1;
>  	wrote_text = 0;
>  	/* Use __stop_machine() because the caller already got online_cpus. */
>  	__stop_machine(stop_machine_text_poke, (void *)&tpp, cpu_online_mask);
> @@ -736,7 +736,7 @@ void __kprobes text_poke_smp_batch(struc
>  {
>  	struct text_poke_params tpp = {.params = params, .nparams = n};
>  
> -	atomic_set(&stop_machine_first, 1);
> +	stop_machine_first = 1;
>  	wrote_text = 0;
>  	__stop_machine(stop_machine_text_poke, (void *)&tpp, cpu_online_mask);
>  }

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-05-22 21:46   ` Steven Rostedt
@ 2012-05-23  2:44     ` OGAWA Hirofumi
  2012-05-23  4:19       ` OGAWA Hirofumi
  0 siblings, 1 reply; 7+ messages in thread
From: OGAWA Hirofumi @ 2012-05-23  2:44 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel

Steven Rostedt <rostedt@goodmis.org> writes:

> On Wed, May 23, 2012 at 03:11:48AM +0900, OGAWA Hirofumi wrote:
>> [forgot to Cc: lkml, resend]
>> 
>> Hi,
>> 
>> Maybe, nobody using debug patch in atomic_dec_and_test()...  Well,
>> anyway, how about this?
>
> What debug patch?

It is this patch. I got this from -mm (akpm series), I don't know
whether -mm is still using though.

The patch below will detect atomic counter underflows.  This has been
test-driven in the -RT patchset for some time.  qdisc_destroy() triggered
it sometimes (in a seemingly nonfatal way, during device shutdown) - with
DaveM suggesting that it is most likely a bug in the networking code.  So
it would be nice to have this in -mm for some time to validate all atomic
counters on a broader base.

Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Andrew Morton <akpm@osdl.org>
---

Change it to atomic check.

Signed-off-by: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
---

 arch/x86/include/asm/atomic.h      |   13 +++++++++++++
 arch/x86/include/asm/atomic64_32.h |    5 ++++-
 arch/x86/include/asm/atomic64_64.h |   12 ++++++++++++
 3 files changed, 29 insertions(+), 1 deletion(-)

diff -puN arch/x86/include/asm/atomic.h~debug_atomic_t-underflows-atomic arch/x86/include/asm/atomic.h
--- linux/arch/x86/include/asm/atomic.h~debug_atomic_t-underflows-atomic	2012-04-03 17:29:38.000000000 +0900
+++ linux-hirofumi/arch/x86/include/asm/atomic.h	2012-04-03 17:29:38.000000000 +0900
@@ -6,6 +6,18 @@
 #include <asm/processor.h>
 #include <asm/alternative.h>
 #include <asm/cmpxchg.h>
+#include <asm/bug.h>
+
+#define ATOMIC_UNDERFLOW_CHECK(v)	do {				\
+	unsigned char __sf;						\
+	/* if (atomic_read(v) < 0) */					\
+	__asm__ __volatile__("sets %0"					\
+			     : "=qm" (__sf)				\
+			     : /* no input */				\
+			     : "memory");				\
+	WARN(__sf, KERN_ERR "atomic counter underflow: %d\n",		\
+	     atomic_read(v));						\
+} while(0)
 
 /*
  * Atomic operations that C can't guarantee us.  Useful for
@@ -123,6 +135,7 @@ static inline int atomic_dec_and_test(at
 	asm volatile(LOCK_PREFIX "decl %0; sete %1"
 		     : "+m" (v->counter), "=qm" (c)
 		     : : "memory");
+	ATOMIC_UNDERFLOW_CHECK(v);
 	return c != 0;
 }
 
diff -puN arch/x86/include/asm/atomic64_32.h~debug_atomic_t-underflows-atomic arch/x86/include/asm/atomic64_32.h
--- linux/arch/x86/include/asm/atomic64_32.h~debug_atomic_t-underflows-atomic	2012-04-03 17:29:38.000000000 +0900
+++ linux-hirofumi/arch/x86/include/asm/atomic64_32.h	2012-04-03 17:29:38.000000000 +0900
@@ -244,7 +244,10 @@ static inline void atomic64_dec(atomic64
  */
 static inline int atomic64_dec_and_test(atomic64_t *v)
 {
-	return atomic64_dec_return(v) == 0;
+	long long ret = atomic64_dec_return(v);
+	WARN(ret < 0, KERN_ERR "atomic counter underflow: %lld\n",
+	     atomic64_read(v));
+	return ret == 0;
 }
 
 /**
diff -puN arch/x86/include/asm/atomic64_64.h~debug_atomic_t-underflows-atomic arch/x86/include/asm/atomic64_64.h
--- linux/arch/x86/include/asm/atomic64_64.h~debug_atomic_t-underflows-atomic	2012-04-03 17:29:38.000000000 +0900
+++ linux-hirofumi/arch/x86/include/asm/atomic64_64.h	2012-04-03 17:29:38.000000000 +0900
@@ -5,6 +5,17 @@
 #include <asm/alternative.h>
 #include <asm/cmpxchg.h>
 
+#define ATOMIC64_UNDERFLOW_CHECK(v)	do {				\
+	unsigned char __sf;						\
+	/* if (atomic64_read(v) < 0) */					\
+	__asm__ __volatile__("sets %0"					\
+			     : "=qm" (__sf)				\
+			     : /* no input */				\
+			     : "memory");				\
+	WARN(__sf, KERN_ERR "atomic counter underflow: %ld\n",		\
+	     atomic64_read(v));						\
+} while(0)
+
 /* The 64-bit atomic type */
 
 #define ATOMIC64_INIT(i)	{ (i) }
@@ -121,6 +132,7 @@ static inline int atomic64_dec_and_test(
 	asm volatile(LOCK_PREFIX "decq %0; sete %1"
 		     : "=m" (v->counter), "=qm" (c)
 		     : "m" (v->counter) : "memory");
+	ATOMIC64_UNDERFLOW_CHECK(v);
 	return c != 0;
 }
 
_

>> stop_machine_first is just to see if it is first one or not.  So, there
>> is no reason to use atomic_dec_and_test(), and makes the value below 0.
>> 
>> I think it is not desirable, because this usage only triggers
>> atomic_dec_and_test() underflow debug patch. (the patch tests result
>> of atomic_dec_and_test() is < 0)
>
> Well it should only underflow if you have a box with more than 2 billion
> CPUs.

It meant < 0, not underflow INT_MIN.

>> -static atomic_t stop_machine_first;
>> +static unsigned long stop_machine_first;
>
> The down side to this is that it adds 4 more bytes on a 64bit
> machine. (sizeof(unsigned log) == 8 and sizeof(atomic_t) == 4)

Oh, sure. If nobody has interest, unfortunately I will use this as my
local patch...

Thanks.
-- 
OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-05-23  2:44     ` OGAWA Hirofumi
@ 2012-05-23  4:19       ` OGAWA Hirofumi
  2012-05-23  8:43         ` Steven Rostedt
  2012-06-07 12:54         ` Steven Rostedt
  0 siblings, 2 replies; 7+ messages in thread
From: OGAWA Hirofumi @ 2012-05-23  4:19 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel

OGAWA Hirofumi <hirofumi@mail.parknet.co.jp> writes:

>> The down side to this is that it adds 4 more bytes on a 64bit
>> machine. (sizeof(unsigned log) == 8 and sizeof(atomic_t) == 4)

Another patch without additional 4bytes. This simply change
atomic_dec_and_test() to atomic_xchg().

Thanks.
-- 
OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>


stop_machine_first is just to see if it is first one or not.  In this
usage, atomic_dec_and_test() makes value less than 0.

I think it is not desirable, because it only triggers
atomic_dec_and_test() less than 0 debug patch. (the patch tests result
of atomic_dec_and_test() is < 0)

So, this uses atomic_xchg() instead.

Signed-off-by: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
---

 arch/x86/kernel/alternative.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff -puN arch/x86/kernel/alternative.c~stop_machine-use-atomic_xchg arch/x86/kernel/alternative.c
--- linux/arch/x86/kernel/alternative.c~stop_machine-use-atomic_xchg	2012-05-23 13:10:03.000000000 +0900
+++ linux-hirofumi/arch/x86/kernel/alternative.c	2012-05-23 13:10:03.000000000 +0900
@@ -664,7 +664,7 @@ static int __kprobes stop_machine_text_p
 	struct text_poke_param *p;
 	int i;
 
-	if (atomic_dec_and_test(&stop_machine_first)) {
+	if (atomic_xchg(&stop_machine_first, 0)) {
 		for (i = 0; i < tpp->nparams; i++) {
 			p = &tpp->params[i];
 			text_poke(p->addr, p->opcode, p->len);
_

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-05-23  4:19       ` OGAWA Hirofumi
@ 2012-05-23  8:43         ` Steven Rostedt
  2012-06-07 12:54         ` Steven Rostedt
  1 sibling, 0 replies; 7+ messages in thread
From: Steven Rostedt @ 2012-05-23  8:43 UTC (permalink / raw)
  To: OGAWA Hirofumi; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel

On Wed, 2012-05-23 at 13:19 +0900, OGAWA Hirofumi wrote:
> OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>
> 
> 
> stop_machine_first is just to see if it is first one or not.  In this
> usage, atomic_dec_and_test() makes value less than 0.
> 
> I think it is not desirable, because it only triggers
> atomic_dec_and_test() less than 0 debug patch. (the patch tests result
> of atomic_dec_and_test() is < 0)
> 
> So, this uses atomic_xchg() instead.

Acked-by: Steven Rostedt <rostedt@goodmis.org>

-- Steve

> 
> Signed-off-by: OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-05-23  4:19       ` OGAWA Hirofumi
  2012-05-23  8:43         ` Steven Rostedt
@ 2012-06-07 12:54         ` Steven Rostedt
  2012-06-07 13:18           ` OGAWA Hirofumi
  1 sibling, 1 reply; 7+ messages in thread
From: Steven Rostedt @ 2012-06-07 12:54 UTC (permalink / raw)
  To: OGAWA Hirofumi; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel, Andrew Morton

On Wed, 2012-05-23 at 13:19 +0900, OGAWA Hirofumi wrote:
> OGAWA Hirofumi <hirofumi@mail.parknet.co.jp> writes:
> 
> >> The down side to this is that it adds 4 more bytes on a 64bit
> >> machine. (sizeof(unsigned log) == 8 and sizeof(atomic_t) == 4)
> 
> Another patch without additional 4bytes. This simply change
> atomic_dec_and_test() to atomic_xchg().
> 

If nobody picked this up, you might want to resend it with my acked-by.
(The second patch, not the first). As patches added to replies are
usually ignored.

-- Steve



^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine
  2012-06-07 12:54         ` Steven Rostedt
@ 2012-06-07 13:18           ` OGAWA Hirofumi
  0 siblings, 0 replies; 7+ messages in thread
From: OGAWA Hirofumi @ 2012-06-07 13:18 UTC (permalink / raw)
  To: Steven Rostedt; +Cc: Ingo Molnar, Masami Hiramatsu, linux-kernel, Andrew Morton

Steven Rostedt <rostedt@goodmis.org> writes:

> On Wed, 2012-05-23 at 13:19 +0900, OGAWA Hirofumi wrote:
>> OGAWA Hirofumi <hirofumi@mail.parknet.co.jp> writes:
>> 
>> >> The down side to this is that it adds 4 more bytes on a 64bit
>> >> machine. (sizeof(unsigned log) == 8 and sizeof(atomic_t) == 4)
>> 
>> Another patch without additional 4bytes. This simply change
>> atomic_dec_and_test() to atomic_xchg().
>> 
>
> If nobody picked this up, you might want to resend it with my acked-by.
> (The second patch, not the first). As patches added to replies are
> usually ignored.

Thanks. I was going to add that to my personal patchset, I will try it again.
-- 
OGAWA Hirofumi <hirofumi@mail.parknet.co.jp>

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2012-06-07 13:18 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <87likkt7u6.fsf@devron.myhome.or.jp>
2012-05-22 18:11 ` [PATCH] Use test_and_clear_bit() instead atomic_dec_and_test() for stop_machine OGAWA Hirofumi
2012-05-22 21:46   ` Steven Rostedt
2012-05-23  2:44     ` OGAWA Hirofumi
2012-05-23  4:19       ` OGAWA Hirofumi
2012-05-23  8:43         ` Steven Rostedt
2012-06-07 12:54         ` Steven Rostedt
2012-06-07 13:18           ` OGAWA Hirofumi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).