* [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
@ 2015-03-01 15:22 Sergey Senozhatsky
2015-06-15 13:12 ` Peter Zijlstra
0 siblings, 1 reply; 6+ messages in thread
From: Sergey Senozhatsky @ 2015-03-01 15:22 UTC (permalink / raw)
To: Andrew Morton
Cc: Tejun Heo, David S. Miller, Amir Vadai, linux-kernel,
Sergey Senozhatsky, Sergey Senozhatsky
cpumask_next_and() is looking for cpumask_next() in src1 in a loop and
tests if found cpu is also present in src2. remove that loop, perform
cpumask_and() of src1 and src2 first and use that new mask to find
cpumask_next().
Apart from removing while loop, ./bloat-o-meter on x86_64 shows
add/remove: 0/0 grow/shrink: 0/1 up/down: 0/-8 (-8)
function old new delta
cpumask_next_and 62 54 -8
Signed-off-by: Sergey Senozhatsky <sergey.senozhatsky@gmail.com>
---
lib/cpumask.c | 9 +++++----
1 file changed, 5 insertions(+), 4 deletions(-)
diff --git a/lib/cpumask.c b/lib/cpumask.c
index b6513a9..5ab1553 100644
--- a/lib/cpumask.c
+++ b/lib/cpumask.c
@@ -37,10 +37,11 @@ EXPORT_SYMBOL(__next_cpu_nr);
int cpumask_next_and(int n, const struct cpumask *src1p,
const struct cpumask *src2p)
{
- while ((n = cpumask_next(n, src1p)) < nr_cpu_ids)
- if (cpumask_test_cpu(n, src2p))
- break;
- return n;
+ struct cpumask tmp;
+
+ if (cpumask_and(&tmp, src1p, src2p))
+ return cpumask_next(n, &tmp);
+ return nr_cpu_ids;
}
EXPORT_SYMBOL(cpumask_next_and);
--
2.3.1.167.g7f4ba4b
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
2015-03-01 15:22 [PATCH resend] cpumask: don't perform while loop in cpumask_next_and() Sergey Senozhatsky
@ 2015-06-15 13:12 ` Peter Zijlstra
2015-06-15 13:40 ` Borislav Petkov
2015-06-15 14:26 ` Sergey Senozhatsky
0 siblings, 2 replies; 6+ messages in thread
From: Peter Zijlstra @ 2015-06-15 13:12 UTC (permalink / raw)
To: Sergey Senozhatsky
Cc: Andrew Morton, Tejun Heo, David S. Miller, Amir Vadai,
linux-kernel, Sergey Senozhatsky
On Mon, Mar 02, 2015 at 12:22:03AM +0900, Sergey Senozhatsky wrote:
> +++ b/lib/cpumask.c
> @@ -37,10 +37,11 @@ EXPORT_SYMBOL(__next_cpu_nr);
> int cpumask_next_and(int n, const struct cpumask *src1p,
> const struct cpumask *src2p)
> {
> + struct cpumask tmp;
> +
> + if (cpumask_and(&tmp, src1p, src2p))
> + return cpumask_next(n, &tmp);
> + return nr_cpu_ids;
> }
> EXPORT_SYMBOL(cpumask_next_and);
Just ran into this; I though we were not supposed to put cpumasks on the
stack because $BIG. ?!
explain.
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
2015-06-15 13:12 ` Peter Zijlstra
@ 2015-06-15 13:40 ` Borislav Petkov
2015-06-15 14:33 ` Sergey Senozhatsky
2015-06-15 14:26 ` Sergey Senozhatsky
1 sibling, 1 reply; 6+ messages in thread
From: Borislav Petkov @ 2015-06-15 13:40 UTC (permalink / raw)
To: Peter Zijlstra
Cc: Sergey Senozhatsky, Andrew Morton, Tejun Heo, David S. Miller,
Amir Vadai, linux-kernel, Sergey Senozhatsky
On Mon, Jun 15, 2015 at 03:12:21PM +0200, Peter Zijlstra wrote:
> On Mon, Mar 02, 2015 at 12:22:03AM +0900, Sergey Senozhatsky wrote:
>
> > +++ b/lib/cpumask.c
> > @@ -37,10 +37,11 @@ EXPORT_SYMBOL(__next_cpu_nr);
> > int cpumask_next_and(int n, const struct cpumask *src1p,
> > const struct cpumask *src2p)
> > {
> > + struct cpumask tmp;
> > +
> > + if (cpumask_and(&tmp, src1p, src2p))
> > + return cpumask_next(n, &tmp);
> > + return nr_cpu_ids;
> > }
> > EXPORT_SYMBOL(cpumask_next_and);
>
> Just ran into this; I though we were not supposed to put cpumasks on the
> stack because $BIG. ?!
>
> explain.
That's some fat stack with 8K CPUs:
cpumask_next_and:
pushq %rbp #
movq %rsp, %rbp #,
pushq %rbx #
movl %edi, %ebx # n, n
leaq -1040(%rbp), %rdi #, tmp106
subq $1032, %rsp #,
^^^^^
Lovely.
--
Regards/Gruss,
Boris.
ECO tip #101: Trim your mails when you reply.
--
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
2015-06-15 13:12 ` Peter Zijlstra
2015-06-15 13:40 ` Borislav Petkov
@ 2015-06-15 14:26 ` Sergey Senozhatsky
1 sibling, 0 replies; 6+ messages in thread
From: Sergey Senozhatsky @ 2015-06-15 14:26 UTC (permalink / raw)
To: Peter Zijlstra
Cc: Sergey Senozhatsky, Andrew Morton, Tejun Heo, David S. Miller,
Amir Vadai, linux-kernel, Sergey Senozhatsky
Hello,
On (06/15/15 15:12), Peter Zijlstra wrote:
> > +++ b/lib/cpumask.c
> > @@ -37,10 +37,11 @@ EXPORT_SYMBOL(__next_cpu_nr);
> > int cpumask_next_and(int n, const struct cpumask *src1p,
> > const struct cpumask *src2p)
> > {
> > + struct cpumask tmp;
> > +
> > + if (cpumask_and(&tmp, src1p, src2p))
> > + return cpumask_next(n, &tmp);
> > + return nr_cpu_ids;
> > }
> > EXPORT_SYMBOL(cpumask_next_and);
>
> Just ran into this; I though we were not supposed to put cpumasks on the
> stack because $BIG. ?!
Gosh, I didn't think $BIG enough. So, on a _big_ 4096 x86_64 it's like...
64 bytes on stack. That's bad. alloc_cpumask_var()/free_cpumask_var()
version just doesn't look like a win (inlined below) so I guess I'll
ask to revert. It makes sense on smaller systems, but loses on huge
ones.
---
lib/cpumask.c | 14 ++++++++++----
1 file changed, 10 insertions(+), 4 deletions(-)
diff --git a/lib/cpumask.c b/lib/cpumask.c
index 5f62708..95ce89a 100644
--- a/lib/cpumask.c
+++ b/lib/cpumask.c
@@ -16,11 +16,17 @@
int cpumask_next_and(int n, const struct cpumask *src1p,
const struct cpumask *src2p)
{
- struct cpumask tmp;
+ int ret = nr_cpu_ids;
+ cpumask_var_t tmp;
- if (cpumask_and(&tmp, src1p, src2p))
- return cpumask_next(n, &tmp);
- return nr_cpu_ids;
+ if (!alloc_cpumask_var(&tmp, GFP_KERNEL))
+ return ret;
+
+ if (cpumask_and(tmp, src1p, src2p))
+ ret = cpumask_next(n, tmp);
+
+ free_cpumask_var(tmp);
+ return ret;
}
EXPORT_SYMBOL(cpumask_next_and);
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
2015-06-15 13:40 ` Borislav Petkov
@ 2015-06-15 14:33 ` Sergey Senozhatsky
2015-06-15 14:43 ` Borislav Petkov
0 siblings, 1 reply; 6+ messages in thread
From: Sergey Senozhatsky @ 2015-06-15 14:33 UTC (permalink / raw)
To: Borislav Petkov
Cc: Peter Zijlstra, Sergey Senozhatsky, Andrew Morton, Tejun Heo,
David S. Miller, Amir Vadai, linux-kernel, Sergey Senozhatsky
On (06/15/15 15:40), Borislav Petkov wrote:
[..]
>
> That's some fat stack with 8K CPUs:
>
> cpumask_next_and:
> pushq %rbp #
> movq %rsp, %rbp #,
> pushq %rbx #
> movl %edi, %ebx # n, n
> leaq -1040(%rbp), %rdi #, tmp106
> subq $1032, %rsp #,
> ^^^^^
>
> Lovely.
>
Oh, wow, 8K cpus...
No excuse.
-ss
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [PATCH resend] cpumask: don't perform while loop in cpumask_next_and()
2015-06-15 14:33 ` Sergey Senozhatsky
@ 2015-06-15 14:43 ` Borislav Petkov
0 siblings, 0 replies; 6+ messages in thread
From: Borislav Petkov @ 2015-06-15 14:43 UTC (permalink / raw)
To: Sergey Senozhatsky
Cc: Peter Zijlstra, Andrew Morton, Tejun Heo, David S. Miller,
Amir Vadai, linux-kernel, Sergey Senozhatsky
On Mon, Jun 15, 2015 at 11:33:52PM +0900, Sergey Senozhatsky wrote:
> Oh, wow, 8K cpus...
Standard CONFIG_MAXSMP setting. I believe distros set that. At least
SLES on x86_64 does and I'd bet RHEL does that too.
config NR_CPUS
int "Maximum number of CPUs" if SMP && !MAXSMP
range 2 8 if SMP && X86_32 && !X86_BIGSMP
range 2 512 if SMP && !MAXSMP && !CPUMASK_OFFSTACK
range 2 8192 if SMP && !MAXSMP && CPUMASK_OFFSTACK && X86_64
default "1" if !SMP
default "8192" if MAXSMP
^^^^^^
--
Regards/Gruss,
Boris.
ECO tip #101: Trim your mails when you reply.
--
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2015-06-15 14:43 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-03-01 15:22 [PATCH resend] cpumask: don't perform while loop in cpumask_next_and() Sergey Senozhatsky
2015-06-15 13:12 ` Peter Zijlstra
2015-06-15 13:40 ` Borislav Petkov
2015-06-15 14:33 ` Sergey Senozhatsky
2015-06-15 14:43 ` Borislav Petkov
2015-06-15 14:26 ` Sergey Senozhatsky
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).