Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler

linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
@ 2007-03-04 20:35 Al Boldi
  2007-03-04 21:49 ` Con Kolivas
       [not found] ` <200703050834.45712.a1426z@gawab.com>
  0 siblings, 2 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-04 20:35 UTC (permalink / raw)
  To: linux-kernel

Con Kolivas wrote:
> > >> >> >This message is to announce the first general public release of
> > >> >> > the "Rotating Staircase DeadLine" cpu scheduler.

Thanks a lot!

> Just to make it clear. The purpose of this scheduler is at all costs to
> maintain absolute fairness no matter what type of load it is put under.

Great!

> This means that if you heavily load up your machine without the use of
> 'nice' then your interactive tasks _will_ slow down proportionately to the
> amount of cpu you use. So doing make -j4 for example will make any other
> task started in taht presence run precisely 1/5th speed, but they will
> still be responsive, have low latency (and audio shouldn't skip for
> example).

That's just what it did, but when you "nice make -j4", things (gears) start 
to stutter.  Is that due to the staircase?


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 20:35 [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler Al Boldi
@ 2007-03-04 21:49 ` Con Kolivas
       [not found]   ` <45EB45F7.3050208@simon.arlott.org.uk>
  2007-03-04 23:13   ` Willy Tarreau
       [not found] ` <200703050834.45712.a1426z@gawab.com>
  1 sibling, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-04 21:49 UTC (permalink / raw)
  To: Al Boldi, ck list; +Cc: linux-kernel

On Monday 05 March 2007 07:35, Al Boldi wrote:
> Con Kolivas wrote:
> > > >> >> >This message is to announce the first general public release of
> > > >> >> > the "Rotating Staircase DeadLine" cpu scheduler.
>
> Thanks a lot!

You're welcome.
>
> > Just to make it clear. The purpose of this scheduler is at all costs to
> > maintain absolute fairness no matter what type of load it is put under.
>
> Great!
>
> > This means that if you heavily load up your machine without the use of
> > 'nice' then your interactive tasks _will_ slow down proportionately to
> > the amount of cpu you use. So doing make -j4 for example will make any
> > other task started in taht presence run precisely 1/5th speed, but they
> > will still be responsive, have low latency (and audio shouldn't skip for
> > example).
>
> That's just what it did, but when you "nice make -j4", things (gears) start
> to stutter.  Is that due to the staircase?

gears isn't an interactive task. Apart from using it as a background load to 
check for starvation because it loads up the cpu fully (which a gpu intensive 
but otherwise simple app like this should _not_ do) graphics card drivers and 
interrupts and so on, I wouldn't put much credence on gears as anything else. 
However I suspect that gears will still get a fair share of the cpu on RSDL 
which almost never happens on any other scheduler.

P.S. Al, please don't drop ccs from the original thread.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu  scheduler
       [not found]   ` <45EB45F7.3050208@simon.arlott.org.uk>
@ 2007-03-04 22:27     ` Con Kolivas
  2007-03-05 18:29       ` Simon Arlott
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-04 22:27 UTC (permalink / raw)
  To: Simon Arlott; +Cc: ck list, linux-kernel

On Monday 05 March 2007 09:19, Simon Arlott wrote:
> On 04/03/07 21:49, Con Kolivas wrote:
> > On Monday 05 March 2007 07:35, Al Boldi wrote:
> >> Con Kolivas wrote:
> >>> This means that if you heavily load up your machine without the use of
> >>> 'nice' then your interactive tasks _will_ slow down proportionately to
> >>> the amount of cpu you use. So doing make -j4 for example will make any
> >>> other task started in taht presence run precisely 1/5th speed, but they
> >>> will still be responsive, have low latency (and audio shouldn't skip
> >>> for example).
> >>
> >> That's just what it did, but when you "nice make -j4", things (gears)
> >> start to stutter.  Is that due to the staircase?
> >
> > gears isn't an interactive task. Apart from using it as a background load
> > to check for starvation because it loads up the cpu fully (which a gpu
> > intensive but otherwise simple app like this should _not_ do) graphics
> > card drivers and interrupts and so on, I wouldn't put much credence on
> > gears as anything else. However I suspect that gears will still get a
> > fair share of the cpu on RSDL which almost never happens on any other
> > scheduler.
>
> If I run glxgears, thunderbird/firefox become really slow to
> respond/display and cpu usage isn't even at 100%. I had thunderbird lagging
> on keyboard character repeat earlier but can't reproduce that now even with
> glxgears - however firefox lags really badly on keyboard input with
> glxgears running.

Hi Simon

You're hitting a nasty udev bug here that is unrelated to the cpu scheduler 
and almost certainly responsible for your bad behaviour. 

[   39.314953] =================================
[   39.315068] [ INFO: inconsistent lock state ]
[   39.315120] 2.6.21-rc2-git #161
[   39.315167] ---------------------------------
[   39.315217] inconsistent {softirq-on-R} -> {in-softirq-W} usage.
[   39.315271] udevd/1424 [HC0[0]:SC1[2]:HE1:SE0] takes:
[   39.315323]  (&ndev->lock){-+-?}, at: [<b04a57c4>] 
ipv6_add_addr+0x164/0x1e0
[   39.315525] {softirq-on-R} state was registered at:
[   39.315576]   [<b013f8d2>] __lock_acquire+0x622/0xbb0
[   39.315731]   [<b01402e2>] lock_acquire+0x62/0x80
[   39.315883]   [<b050fc55>] _read_lock+0x35/0x50
[   39.316043]   [<b050c1e0>] sctp_v6_copy_addrlist+0x30/0xc0
[   39.316197]   [<b04f9cd2>] sctp_get_local_addr_list+0x32/0x60
[   39.316358]   [<b06de1f0>] sctp_init+0x2c0/0x550
[   39.316516]   [<b06bcc55>] do_initcalls+0x35/0x120
[   39.316687]   [<b06bcd5c>] do_basic_setup+0x1c/0x30
[   39.316840]   [<b06bcdbf>] init+0x3f/0x90
[   39.316994]   [<b0104d87>] kernel_thread_helper+0x7/0x10
[   39.317150]   [<ffffffff>] 0xffffffff
[   39.317300] irq event stamp: 1976
[   39.317348] hardirqs last  enabled at (1976): [<b014087e>] 
debug_check_no_locks_freed+0xbe/0xd0
[   39.317481] hardirqs last disabled at (1975): [<b01407ea>] 
debug_check_no_locks_freed+0x2a/0xd0
[   39.317618] softirqs last  enabled at (1808): [<b0435557>] 
release_sock+0x57/0xb0
[   39.317751] softirqs last disabled at (1853): [<b0125277>] 
do_softirq+0x47/0x50
[   39.317884] 
[   39.317885] other info that might help us debug this:
[   39.317978] 3 locks held by udevd/1424:
[   39.318026]  #0:  (&mm->mmap_sem){----}, at: [<b0116ae6>] 
do_page_fault+0xb6/0x5a0
[   39.318263]  #1:  (&npinfo->poll_lock){-+..}, at: [<b043cf28>] 
net_rx_action+0x158/0x1a0
[   39.318500]  #2:  (&tp->rx_lock){-+..}, at: [<b034ef7e>] 
rtl8139_poll+0x3e/0x120
[   39.318742] 
[   39.318743] stack backtrace:
[   39.318833]  [<b0104f4a>] show_trace_log_lvl+0x1a/0x30
[   39.318924]  [<b0104f72>] show_trace+0x12/0x20
[   39.319009]  [<b0105086>] dump_stack+0x16/0x20
[   39.319094]  [<b013e4d1>] print_usage_bug+0x151/0x160
[   39.319180]  [<b013e9a3>] mark_lock+0x4c3/0x580
[   39.319265]  [<b013f8b0>] __lock_acquire+0x600/0xbb0
[   39.319354]  [<b01402e2>] lock_acquire+0x62/0x80
[   39.319439]  [<b050fff5>] _write_lock+0x35/0x50
[   39.319526]  [<b04a57c4>] ipv6_add_addr+0x164/0x1e0
[   39.319612]  [<b04a76a2>] addrconf_prefix_rcv+0x2b2/0x560
[   39.319707]  [<b04b39b1>] ndisc_router_discovery+0x431/0x600
[   39.319798]  [<b04b4552>] ndisc_rcv+0x92/0xc0
[   39.319883]  [<b04b98b0>] icmpv6_rcv+0x2b0/0x2d0
[   39.319971]  [<b04a4975>] ip6_input+0x1a5/0x3c0
[   39.320057]  [<b04a4f3a>] ip6_mc_input+0x3a/0x60
[   39.320145]  [<b04a45ab>] ipv6_rcv+0x15b/0x380
[   39.320231]  [<b043cc31>] netif_receive_skb+0x271/0x2f0
[   39.320318]  [<b034ebb2>] rtl8139_rx+0x142/0x340
[   39.320405]  [<b034ef99>] rtl8139_poll+0x59/0x120
[   39.320494]  [<b043ce59>] net_rx_action+0x89/0x1a0
[   39.320580]  [<b01251d2>] __do_softirq+0x52/0xb0
[   39.320666]  [<b0125277>] do_softirq+0x47/0x50
[   39.320754]  [<b0125335>] irq_exit+0x75/0x80
[   39.320843]  [<b0106931>] do_IRQ+0x41/0x80
[   39.320929]  [<b0104be2>] common_interrupt+0x2e/0x34
[   39.321016]  [<b0510634>] error_code+0x74/0x7c
[   39.321103]  =======================

Also this patch was actually for 2.6.20 and you seem to have applied it to 
2.6.21-rc2. I haven't even checked that it cleanly applies to that kernel.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 21:49 ` Con Kolivas
       [not found]   ` <45EB45F7.3050208@simon.arlott.org.uk>
@ 2007-03-04 23:13   ` Willy Tarreau
  2007-03-04 23:58     ` Con Kolivas
                       ` (2 more replies)
  1 sibling, 3 replies; 198+ messages in thread
From: Willy Tarreau @ 2007-03-04 23:13 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, ck list, linux-kernel

On Mon, Mar 05, 2007 at 08:49:29AM +1100, Con Kolivas wrote:
(...)
> > That's just what it did, but when you "nice make -j4", things (gears) start
> > to stutter.  Is that due to the staircase?
> 
> gears isn't an interactive task. Apart from using it as a background load to 
> check for starvation because it loads up the cpu fully (which a gpu intensive 
> but otherwise simple app like this should _not_ do) graphics card drivers and 
> interrupts and so on, I wouldn't put much credence on gears as anything else. 
> However I suspect that gears will still get a fair share of the cpu on RSDL 
> which almost never happens on any other scheduler.

Con,

I've now given it a try with HZ=250 on my dual-athlon. It works beautifully.
I also quickly checked that playing mp3 doesn't skip during make -j4, and
that gears runs fairly smoothly, since those are the references people often
use.

But with real work, it's excellent too. When I saturate my CPUs by injecting
HTTP traffic on haproxy, the load is stable and the command line perfectly
responsive, while in the past the load would oscillate and the command line
sometimes stopped to respond for a few seconds.

I've also launched my scheddos program (you may remember, the one we did a
few experiments with). I could not cause any freeze at all. Plain 2.6.20 had
already improved a lot in this area, but above 4 processes/CPU, occasional
short freezes did still occur. This time, even at 100 processes, the system
was rather slow (of course!) but just as expected, and nothing more.

I also tried the good old "dd if=/dev/zero bs=1|...|dd bs=1 of=/dev/null" and
it did not cause any trouble.

I will boot 2.6 slightly more often to test the code under various conditions,
and I will recommend it to a few people I know who tend to switch back to 2.4
after one day full of 2.6 jerkiness.

Overall, you have done a great job !

I hope that more people will give it a try, first to help find possible
remaining bugs, and to pronounce in favour of its inclusion in mainline.

Cheers,
Willy

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 23:13   ` Willy Tarreau
@ 2007-03-04 23:58     ` Con Kolivas
  2007-03-05  0:10     ` [ck] " jos poortvliet
  2007-03-05  1:09     ` Gene Heskett
  2 siblings, 0 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-04 23:58 UTC (permalink / raw)
  To: Willy Tarreau; +Cc: Al Boldi, ck list, linux-kernel

On Monday 05 March 2007 10:13, Willy Tarreau wrote:
> I've now given it a try with HZ=250 on my dual-athlon.

Great, thanks. The HZ should make very little difference, except for slightly 
lower latencies as you increase the HZ.

> It works 
> beautifully. I also quickly checked that playing mp3 doesn't skip during
> make -j4, and that gears runs fairly smoothly, since those are the
> references people often use.

Good. I obviously need to ensure that these remain well managed since I, for 
one, never want linux to suck on the desktop. In your case, I was more 
interested in your real world problems you were having so the rest of your 
report is even more important to me.

> But with real work, it's excellent too. When I saturate my CPUs by
> injecting HTTP traffic on haproxy, the load is stable and the command line
> perfectly responsive, while in the past the load would oscillate and the
> command line sometimes stopped to respond for a few seconds.
>
> I've also launched my scheddos program (you may remember, the one we did a
> few experiments with). I could not cause any freeze at all. Plain 2.6.20
> had already improved a lot in this area, but above 4 processes/CPU,
> occasional short freezes did still occur. This time, even at 100 processes,
> the system was rather slow (of course!) but just as expected, and nothing
> more.
>
> I also tried the good old "dd if=/dev/zero bs=1|...|dd bs=1 of=/dev/null"
> and it did not cause any trouble.
>
> I will boot 2.6 slightly more often to test the code under various
> conditions, and I will recommend it to a few people I know who tend to
> switch back to 2.4 after one day full of 2.6 jerkiness.
>
> Overall, you have done a great job !

Thank you very much. I think you have nicely pointed out some of the real 
world advantages to this design. Those testcases are good for testing average 
latency, fairness, and consistency of both of them under various loads so 
they are excellent real world examples. If you extrapolate those facts to 
server loads you can understand how the fairness and latency can translate 
into advantages anywhere.

> I hope that more people will give it a try, first to help find possible
> remaining bugs, and to pronounce in favour of its inclusion in mainline.

Thanks. I've already had a hundred or so people testing this code (thanks to 
the -ck mailing list people!) without any recent bugs prior to announcing it 
to lkml so I believe it is quite safe for testing.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 23:13   ` Willy Tarreau
  2007-03-04 23:58     ` Con Kolivas
@ 2007-03-05  0:10     ` jos poortvliet
  2007-03-05  1:09     ` Gene Heskett
  2 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-05  0:10 UTC (permalink / raw)
  To: ck; +Cc: Willy Tarreau, Con Kolivas, Al Boldi, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2248 bytes --]

Op Monday 05 March 2007, schreef Willy Tarreau:
> On Mon, Mar 05, 2007 at 08:49:29AM +1100, Con Kolivas wrote:
> (...)
>
> > > That's just what it did, but when you "nice make -j4", things (gears)
> > > start to stutter.  Is that due to the staircase?
> >
> > gears isn't an interactive task. Apart from using it as a background load
> > to check for starvation because it loads up the cpu fully (which a gpu
> > intensive but otherwise simple app like this should _not_ do) graphics
> > card drivers and interrupts and so on, I wouldn't put much credence on
> > gears as anything else. However I suspect that gears will still get a
> > fair share of the cpu on RSDL which almost never happens on any other
> > scheduler.
>
> Con,
>
> I've now given it a try with HZ=250 on my dual-athlon. It works
> beautifully. I also quickly checked that playing mp3 doesn't skip during
> make -j4, and that gears runs fairly smoothly, since those are the
> references people often use.
>
> But with real work, it's excellent too. When I saturate my CPUs by
> injecting HTTP traffic on haproxy, the load is stable and the command line
> perfectly responsive, while in the past the load would oscillate and the
> command line sometimes stopped to respond for a few seconds.
>
> I've also launched my scheddos program (you may remember, the one we did a
> few experiments with). I could not cause any freeze at all. Plain 2.6.20
> had already improved a lot in this area, but above 4 processes/CPU,
> occasional short freezes did still occur. This time, even at 100 processes,
> the system was rather slow (of course!) but just as expected, and nothing
> more.
>
> I also tried the good old "dd if=/dev/zero bs=1|...|dd bs=1 of=/dev/null"
> and it did not cause any trouble.
>
> I will boot 2.6 slightly more often to test the code under various
> conditions, and I will recommend it to a few people I know who tend to
> switch back to 2.4 after one day full of 2.6 jerkiness.
>
> Overall, you have done a great job !
>
> I hope that more people will give it a try, first to help find possible
> remaining bugs, and to pronounce in favour of its inclusion in mainline.
>
> Cheers,
> Willy

Sounds nice... you might want to put this in the -ck wiki: ck.wikia.com/

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 23:13   ` Willy Tarreau
  2007-03-04 23:58     ` Con Kolivas
  2007-03-05  0:10     ` [ck] " jos poortvliet
@ 2007-03-05  1:09     ` Gene Heskett
  2 siblings, 0 replies; 198+ messages in thread
From: Gene Heskett @ 2007-03-05  1:09 UTC (permalink / raw)
  To: linux-kernel; +Cc: Willy Tarreau, Con Kolivas, Al Boldi, ck list

On Sunday 04 March 2007, Willy Tarreau wrote:
>On Mon, Mar 05, 2007 at 08:49:29AM +1100, Con Kolivas wrote:
>(...)
>
>> > That's just what it did, but when you "nice make -j4", things
>> > (gears) start to stutter.  Is that due to the staircase?
>>
>> gears isn't an interactive task. Apart from using it as a background
>> load to check for starvation because it loads up the cpu fully (which
>> a gpu intensive but otherwise simple app like this should _not_ do)
>> graphics card drivers and interrupts and so on, I wouldn't put much
>> credence on gears as anything else. However I suspect that gears will
>> still get a fair share of the cpu on RSDL which almost never happens
>> on any other scheduler.
>
>Con,
>
>I've now given it a try with HZ=250 on my dual-athlon. It works
> beautifully. I also quickly checked that playing mp3 doesn't skip
> during make -j4, and that gears runs fairly smoothly, since those are
> the references people often use.
>
>But with real work, it's excellent too. When I saturate my CPUs by
> injecting HTTP traffic on haproxy, the load is stable and the command
> line perfectly responsive, while in the past the load would oscillate
> and the command line sometimes stopped to respond for a few seconds.
>
>I've also launched my scheddos program (you may remember, the one we did
> a few experiments with). I could not cause any freeze at all. Plain
> 2.6.20 had already improved a lot in this area, but above 4
> processes/CPU, occasional short freezes did still occur. This time,
> even at 100 processes, the system was rather slow (of course!) but just
> as expected, and nothing more.
>
>I also tried the good old "dd if=/dev/zero bs=1|...|dd bs=1
> of=/dev/null" and it did not cause any trouble.
>
>I will boot 2.6 slightly more often to test the code under various
> conditions, and I will recommend it to a few people I know who tend to
> switch back to 2.4 after one day full of 2.6 jerkiness.
>
>Overall, you have done a great job !
>
>I hope that more people will give it a try, first to help find possible
>remaining bugs, and to pronounce in favour of its inclusion in mainline.
>
>Cheers,
>Willy
>
+1 for mainline, this is a HUGE improvement in usability.  End of 
discussion AFAIK.
>-
>To unsubscribe from this list: send the line "unsubscribe linux-kernel"
> in the body of a message to majordomo@vger.kernel.org
>More majordomo info at  http://vger.kernel.org/majordomo-info.html
>Please read the FAQ at  http://www.tux.org/lkml/



-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
       [not found]   ` <20070305060732.GQ30401@nysv.org>
@ 2007-03-05 11:59     ` Al Boldi
  2007-03-05 12:29       ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-05 11:59 UTC (permalink / raw)
  To: Markus Törnqvist; +Cc: Con Kolivas, ck list, linux-kernel

Markus Törnqvist wrote:
> On Mon, Mar 05, 2007 at 08:34:45AM +0300, Al Boldi wrote:
> >Ok, gears is smooth when you run "make -j4", but with "nice make -j4",
> > gears becomes bursty.  This looks like a problem with nice-levels.  In
> > general, looking subjectively at top d.1, procs appear to show jerkiness
> > when nice'd.
>
> Don't use glxgears, please. Ever. Unless you want meaningless gears.
>
> It displays totally erratic behaviour anyway, and does sched_yield (strace
> tells us this) which means IT GIVES UP ITS TIME TO RUN, ie yields the
> cpu to someone else.

I just strace'd it here.  It doesn't show any yield in the mesa-5.0 version.  
Which version are you using?

> >Do you have an objective test-case that can show the even-ness of RSDL in
> >both nice'd and normal scenarios?
>
> A big movie, like DVD-quality, full resolution should do the trick.
> That's what I used ;)

The problem with audio/video is that they usually do buffering, which hides 
scheduler anomalies.

> Debian and Ubuntu at least ship "stress" which I also used and reniced
> in-flight with excellent results, but do note to start off with small
> settings and work up or you might overcommit your box, in a way which
> no scheduler can handle.

Can you give a link?


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-05 11:59     ` [ck] " Al Boldi
@ 2007-03-05 12:29       ` Con Kolivas
       [not found]         ` <200703052123.01095.a1426z@gawab.com>
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-05 12:29 UTC (permalink / raw)
  To: Al Boldi; +Cc: Markus Törnqvist, ck list, linux-kernel

On Monday 05 March 2007 22:59, Al Boldi wrote:
> Markus Törnqvist wrote:
> > On Mon, Mar 05, 2007 at 08:34:45AM +0300, Al Boldi wrote:
> > >Ok, gears is smooth when you run "make -j4", but with "nice make -j4",
> > > gears becomes bursty.  This looks like a problem with nice-levels.  In
> > > general, looking subjectively at top d.1, procs appear to show
> > > jerkiness when nice'd.

I wouldn't place much value on what you can see just from looking at top. 
Gears just isn't an interactive task and just about anything but gears would 
be a better test case since its behaviour varies wildly under different 
combinations of graphics cards, memory bandwidth, cpu and so on. I'm not even 
sure what you're trying to prove with gears. If it's to see "smooth 
behaviour" under load then gears is not the thing to do it with. Maybe 
unbuffered live video on xawtv or something? Perhaps even capturing video 
where you know what the cpu usage will be of the video codec you're using. 
Say you knew xvid at a fixed bitrate required 30% cpu to encode a live video 
then you could see if it did it real time with a make -j2 running assuming 
you don't run out of disk bandwidth and so on. Or something that required 75% 
cpu and you ran a nice -19 make -j4. Or even try interbench and fiddle with 
the nice values and size of the loads since those options exist and see what 
the latencies and %desired cpu are. By default without arguments interbench 
looks for unfair scheduling and finds plenty of it in other scheduler 
designs. Either way your testcase must mean something to you.

> > Don't use glxgears, please. Ever. Unless you want meaningless gears.

That I definitely agree with.

> > It displays totally erratic behaviour anyway, and does sched_yield
> > (strace tells us this) which means IT GIVES UP ITS TIME TO RUN, ie yields
> > the cpu to someone else.
>
> I just strace'd it here.  It doesn't show any yield in the mesa-5.0
> version. Which version are you using?

Curious. On a machine with onboard intel graphics (915 driver), and on an ATI 
r200 dri based driver I can see yields. Yet with the nvidia driver I see no 
yields. This sched_yield seems to be more done by the driver rather than the 
application. That would cause woeful performance. How odd that modern drivers 
out there today are still using this basically defunct unix function. No, I'm 
not interested in getting into a discussion of the "possible" uses of 
sched_yield. That's been done to death many times before.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-04 22:27     ` Con Kolivas
@ 2007-03-05 18:29       ` Simon Arlott
  2007-03-05 21:36         ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Simon Arlott @ 2007-03-05 18:29 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck list, linux-kernel

On 04/03/07 22:27, Con Kolivas wrote:
> On Monday 05 March 2007 09:19, Simon Arlott wrote:
>> If I run glxgears, thunderbird/firefox become really slow to
>> respond/display and cpu usage isn't even at 100%. I had thunderbird lagging
>> on keyboard character repeat earlier but can't reproduce that now even with
>> glxgears - however firefox lags really badly on keyboard input with
>> glxgears running.
> 
> Hi Simon
> 
> You're hitting a nasty udev bug here that is unrelated to the cpu scheduler 
Yes, I've already reported this.

> and almost certainly responsible for your bad behaviour. 
What? How do you make the leap from lockdep output on boot to my experience of 
slow interactive response?

> Also this patch was actually for 2.6.20 and you seem to have applied it to 
> 2.6.21-rc2. I haven't even checked that it cleanly applies to that kernel.

It doesn't, a one line fix from Ingo conflicts (and a comment that was changed).
7355690ead6d61f6344072ae61060f985060da29 [PATCH] sched: fix SMT scheduler bug
72fd4a35a824331d7a0f4168d7576502d95d34b3 [PATCH] Numerous fixes to kernel-doc info in source files.

I have reverted your patch and the laggy behaviour remains when running 
glxgears (in fact it seems a bit worse), I don't know about the thunderbird 
lag while typing since I can't easily reproduce it but I don't recall it ever 
happening before. So, your patch isn't the cause of a serious slowdown when 
running glxgears in the background (although it doesn't improve the situation 
much). It looks like it just slows down X in general rather than using much 
CPU.

-- 
Simon Arlott

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu   scheduler
  2007-03-05 18:29       ` Simon Arlott
@ 2007-03-05 21:36         ` Con Kolivas
  0 siblings, 0 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-05 21:36 UTC (permalink / raw)
  To: Simon Arlott; +Cc: ck list, linux-kernel

On Tuesday 06 March 2007 05:29, Simon Arlott wrote:
> On 04/03/07 22:27, Con Kolivas wrote:
> > On Monday 05 March 2007 09:19, Simon Arlott wrote:
> >> If I run glxgears, thunderbird/firefox become really slow to
> >> respond/display and cpu usage isn't even at 100%. I had thunderbird
> >> lagging on keyboard character repeat earlier but can't reproduce that
> >> now even with glxgears - however firefox lags really badly on keyboard
> >> input with glxgears running.
> >
> > Hi Simon
> >
> > You're hitting a nasty udev bug here that is unrelated to the cpu
> > scheduler
>
> Yes, I've already reported this.
>
> > and almost certainly responsible for your bad behaviour.
>
> What? How do you make the leap from lockdep output on boot to my experience
> of slow interactive response?

In general I've found once the kernel did something funny on boot that not 
much beyond that could be trusted. However a very small period of alarm with 
lockdep, _assuming it was transient and did not happen later_ as you say, 
would not cause problems.

> > Also this patch was actually for 2.6.20 and you seem to have applied it
> > to 2.6.21-rc2. I haven't even checked that it cleanly applies to that
> > kernel.
>
> It doesn't, a one line fix from Ingo conflicts (and a comment that was
> changed). 7355690ead6d61f6344072ae61060f985060da29 [PATCH] sched: fix SMT
> scheduler bug 72fd4a35a824331d7a0f4168d7576502d95d34b3 [PATCH] Numerous
> fixes to kernel-doc info in source files.

Ok, but I was very reluctant to release this code as a patch to -rc2 because 
the potential for all sorts of weird and wonderful fireworks and explosions 
from the -pre release makes it a lousy testing ground for something else. 
It's often hard to differentiate breakage in behaviour in the unstable kernel 
or the patch you're applying. I was even somewhat reluctant to be releasing 
this patch for 2.6.20 since people are mumbling (offlist) that they're 
getting weird stalls on that kernel - however they're afraid to report them 
to lkml since their issues are fuzzy and they know they'll be ask to do git 
bisect on an intermittent problem; git bisect as we know is rocket science 
(or brain surgery if you happen to be a rocket scientist).

> I have reverted your patch and the laggy behaviour remains when running
> glxgears (in fact it seems a bit worse), I don't know about the thunderbird
> lag while typing since I can't easily reproduce it but I don't recall it
> ever happening before. So, your patch isn't the cause of a serious slowdown
> when running glxgears in the background (although it doesn't improve the
> situation much). It looks like it just slows down X in general rather than
> using much CPU.

Which goes back to my original comment about glxgears, only I should have made 
my advice stronger; glxgears should not be used as a benchmarking or test 
tool of any sort period. It displays pretty gears and that's all. On your 
machine, for example, it is not cpu bound from the looks of your description, 
and either loads up the bus or drowns the gpu in ways that delays other gpu 
activity. On other hardware combinations, it does something completely 
different. Given that, it's easy to see how it is impossible to use it as a 
cpu scheduling test of any sort.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
       [not found]         ` <200703052123.01095.a1426z@gawab.com>
@ 2007-03-05 22:10           ` Con Kolivas
  2007-03-06  8:42             ` Xavier Bestel
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-05 22:10 UTC (permalink / raw)
  To: Al Boldi; +Cc: Markus Törnqvist, ck list, linux-kernel

On Tuesday 06 March 2007 05:23, Al Boldi wrote:
> Con Kolivas wrote:
> > Gears just isn't an interactive task and just about anything but gears
> > would be a better test case since its behaviour varies wildly under
> > different combinations of graphics cards, memory bandwidth, cpu and so
> > on.
>
> What about reflect?  It has the same problem.
>
> > I'm not even sure what you're trying to prove with gears.
>
> RSDL fairness.
>
> Ok, try this:
> # gears & gears &
> Both run smoothly.
> # nice gears & nice gears &
> Both run smoothly.
>
> Now:
> # gears & nice -10 gears &
> Both stutter.
>
> But:
> # gears & nice --10 gears &
> Both run smoothly.
>
> Something looks fishy with nice levels.

Hah I just wish gears would go away. If I get hardware where it runs at just 
the right speed it looks like it doesn't move at all. On other hardware the 
wheels go backwards and forwards where the screen refresh rate is just 
perfectly a factor of the frames per second (or something like that). 

This is not a cpu scheduler test and you're inferring that there are cpu 
scheduling artefacts based on an application that has bottlenecks at 
different places depending on the hardware combination. 

To imply something is fishy with nice levels, do a test that _only_ uses cpu 
(and not the bus, memory bandwidth, the gpu and has driver interactions) and 
prove that there is something wrong. What happens to other resources on the 
machine the cpu scheduler has no control over. The -rt tree tries to address 
these factors for example but it's a huge - some would say insurmountable - 
thing to try and manage at all levels on a general purpose operating system. 
The cpu is proportioned out very fairly with rsdl on both quota and latency 
according to nice vs cpu usage. When something is fully cpu bound on rsdl its 
average and maximum latency will be greater than something that only 
intermittently uses cpu. The (somewhat lengthy and not easily digestible) 
docs describe how you can model what these latencies will be.

> Otherwise, your scheduler looks great!

Thanks!

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-05 22:10           ` Con Kolivas
@ 2007-03-06  8:42             ` Xavier Bestel
  2007-03-06 15:15               ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Xavier Bestel @ 2007-03-06  8:42 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, Markus Törnqvist, ck list, linux-kernel

On Tue, 2007-03-06 at 09:10 +1100, Con Kolivas wrote:
> Hah I just wish gears would go away. If I get hardware where it runs at just 
> the right speed it looks like it doesn't move at all. On other hardware the 
> wheels go backwards and forwards where the screen refresh rate is just 
> perfectly a factor of the frames per second (or something like that). 
> 
> This is not a cpu scheduler test and you're inferring that there are cpu 
> scheduling artefacts based on an application that has bottlenecks at 
> different places depending on the hardware combination. 

I'd add that Xorg has its own scheduler (for X11 operations, of course),
that has its own quirks, and chances are that it is the one you're
testing with glxgears. And as Con said, as long as glxgears does more
FPS than your screen refresh rate, its flickering its completely
meaningless: it doesn't even attempt to sync with vblank. Al, you'd
better try with Quake3 or Nexuiz, or even Blender if you want to test 3D
interactivity under load.

	Xav



^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-06  8:42             ` Xavier Bestel
@ 2007-03-06 15:15               ` Al Boldi
  2007-03-11 18:11                 ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-06 15:15 UTC (permalink / raw)
  To: Xavier Bestel; +Cc: Markus Törnqvist, ck list, linux-kernel, Con Kolivas

Xavier Bestel wrote:
> On Tue, 2007-03-06 at 09:10 +1100, Con Kolivas wrote:
> > Hah I just wish gears would go away. If I get hardware where it runs at
> > just the right speed it looks like it doesn't move at all. On other
> > hardware the wheels go backwards and forwards where the screen refresh
> > rate is just perfectly a factor of the frames per second (or something
> > like that).
> >
> > This is not a cpu scheduler test and you're inferring that there are cpu
> > scheduling artefacts based on an application that has bottlenecks at
> > different places depending on the hardware combination.
>
> I'd add that Xorg has its own scheduler (for X11 operations, of course),
> that has its own quirks, and chances are that it is the one you're
> testing with glxgears. And as Con said, as long as glxgears does more
> FPS than your screen refresh rate, its flickering its completely
> meaningless: it doesn't even attempt to sync with vblank. Al, you'd
> better try with Quake3 or Nexuiz, or even Blender if you want to test 3D
> interactivity under load.

Actually, games aren't really usefull to evaluate scheduler performance, due 
to their bursty nature.

OTOH, gears runs full throttle, including any of its bottlenecks. In fact, 
it's the bottlenecks that add to its realism.  It exposes underlying 
scheduler hickups visually, unless buffered by the display-driver, in which 
case you just use the vesa-driver to be sure.

If gears starts to flicker on you, just slow it down with a cpu hog like:

	# while :; do :; done &

Add as many hogs as you need to make the hickups visible.

Again, these hickups are only visible when using uneven nice+ levels.

BTW, another way to show these hickups would be through some kind of a 
cpu/proc timing-tracer.  Do we have something like that?

Thanks!

--
Al

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-06 15:15               ` Al Boldi
@ 2007-03-11 18:11                 ` Al Boldi
  2007-03-11 21:52                   ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-11 18:11 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck list, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 6035 bytes --]

Al Boldi wrote:
> BTW, another way to show these hickups would be through some kind of a
> cpu/proc timing-tracer.  Do we have something like that?

Here is something like a tracer.

Original idea by Chris Friesen, thanks, from this post:
http://marc.theaimsgroup.com/?l=linux-kernel&m=117331003029329&w=4

Try attached chew.c like this:
Boot into /bin/sh.
Run chew in one console.
Run nice chew in another console.
Watch timings.

Console 1: ./chew
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    6 ms
pid 655, prio   0, out for    5 ms

Console 2: nice -10 ./chew
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    5 ms
pid 669, prio  10, out for   65 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    5 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    5 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for   65 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms
pid 669, prio  10, out for    6 ms

Console 2: nice -15 ./chew
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for   95 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for   95 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for   95 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for   95 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    5 ms
pid 673, prio  15, out for    6 ms
pid 673, prio  15, out for    5 ms

Console 2: nice -18 ./chew
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    6 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    6 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    6 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    6 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    6 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms
pid 677, prio  18, out for  113 ms
pid 677, prio  18, out for    5 ms

Console 2: nice -19 ./chew
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms
pid 679, prio  19, out for  119 ms

Now with negative nice:
Console 1: ./chew
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for  125 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for    6 ms
pid 674, prio   0, out for    5 ms
pid 674, prio   0, out for  110 ms

Console 2: nice --20 ./chew
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    5 ms
pid 687, prio -20, out for    6 ms
pid 687, prio -20, out for    6 ms


OTOH, mainline is completely smooth, albeit huge drop-outs.


Thanks!

--
Al



[-- Attachment #2: chew.c --]
[-- Type: text/x-csrc, Size: 1027 bytes --]

/*
 * orignal idea by Chris Friesen.  Thanks.
 */

#include <stdio.h>
#include <sys/time.h>
#include <sys/resource.h>

#define THRESHOLD_USEC 2000

unsigned long long stamp()
{
        struct timeval tv;
        gettimeofday(&tv, 0);
        return (unsigned long long) tv.tv_usec + ((unsigned long long) tv.tv_sec)*1000000;
}

int main()
{
        unsigned long long thresh_ticks = THRESHOLD_USEC;
        unsigned long long cur,last;
        struct timespec ts;

        sched_rr_get_interval(0, &ts);
        printf("pid %d, prio %3d, interval of %d nsec\n", getpid(), getpriority(PRIO_PROCESS, 0), ts.tv_nsec);

        last = stamp();
        while(1) {
                cur = stamp();
                unsigned long long delta = cur-last;
                if (delta > thresh_ticks) {
                        printf("pid %d, prio %3d, out for %4llu ms\n", getpid(), getpriority(PRIO_PROCESS, 0), delta/1000);
                        cur = stamp();
                }
                last = cur;
        }

        return 0;
}

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-11 18:11                 ` Al Boldi
@ 2007-03-11 21:52                   ` Con Kolivas
  2007-03-11 22:12                     ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-11 21:52 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck list, linux-kernel

On Monday 12 March 2007 05:11, Al Boldi wrote:
> Al Boldi wrote:
> > BTW, another way to show these hickups would be through some kind of a
> > cpu/proc timing-tracer.  Do we have something like that?
>
> Here is something like a tracer.
>
> Original idea by Chris Friesen, thanks, from this post:
> http://marc.theaimsgroup.com/?l=linux-kernel&m=117331003029329&w=4
>
> Try attached chew.c like this:
> Boot into /bin/sh.
> Run chew in one console.
> Run nice chew in another console.
> Watch timings.
>
> Console 1: ./chew

> Console 2: nice -10 ./chew

> pid 669, prio  10, out for    5 ms
> pid 669, prio  10, out for   65 ms

One full expiration

> pid 669, prio  10, out for    6 ms
> pid 669, prio  10, out for   65 ms

again

> Console 2: nice -15 ./chew
> pid 673, prio  15, out for    6 ms
> pid 673, prio  15, out for   95 ms

again and so on..

> OTOH, mainline is completely smooth, albeit huge drop-outs.

Heh. That's not much good either is it.

> Thanks!

And thank you! I think I know what's going on now. I think each rotation is 
followed by another rotation before the higher priority task is getting a 
look in in schedule() to even get quota and add it to the runqueue quota. 
I'll try a simple change to see if that helps. Patch coming up shortly.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-11 21:52                   ` Con Kolivas
@ 2007-03-11 22:12                     ` Con Kolivas
  2007-03-12  4:42                       ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-11 22:12 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck list, linux-kernel

On Monday 12 March 2007 08:52, Con Kolivas wrote:
> And thank you! I think I know what's going on now. I think each rotation is
> followed by another rotation before the higher priority task is getting a
> look in in schedule() to even get quota and add it to the runqueue quota.
> I'll try a simple change to see if that helps. Patch coming up shortly.

Can you try the following patch and see if it helps. There's also one minor
preemption logic fix in there that I'm planning on including. Thanks!

---
 kernel/sched.c |   24 +++++++++++-------------
 1 file changed, 11 insertions(+), 13 deletions(-)

Index: linux-2.6.21-rc3-mm2/kernel/sched.c
===================================================================
--- linux-2.6.21-rc3-mm2.orig/kernel/sched.c	2007-03-12 08:47:43.000000000 +1100
+++ linux-2.6.21-rc3-mm2/kernel/sched.c	2007-03-12 09:10:33.000000000 +1100
@@ -96,10 +96,9 @@ unsigned long long __attribute__((weak))
  * provided it is not a realtime comparison.
  */
 #define TASK_PREEMPTS_CURR(p, curr) \
-	(((p)->prio < (curr)->prio) || (((p)->prio == (curr)->prio) && \
+	(((p)->prio < (curr)->prio) || (!rt_task(p) && \
 		((p)->static_prio < (curr)->static_prio && \
-			((curr)->static_prio > (curr)->prio)) && \
-				!rt_task(p)))
+			((curr)->static_prio > (curr)->prio))))
 
 /*
  * This is the time all tasks within the same priority round robin.
@@ -3323,7 +3322,7 @@ static inline void major_prio_rotation(s
  */
 static inline void rotate_runqueue_priority(struct rq *rq)
 {
-	int new_prio_level, remaining_quota;
+	int new_prio_level;
 	struct prio_array *array;
 
 	/*
@@ -3334,7 +3333,6 @@ static inline void rotate_runqueue_prior
 	if (unlikely(sched_find_first_bit(rq->dyn_bitmap) < rq->prio_level))
 		return;
 
-	remaining_quota = rq_quota(rq, rq->prio_level);
 	array = rq->active;
 	if (rq->prio_level > MAX_PRIO - 2) {
 		/* Major rotation required */
@@ -3368,10 +3366,11 @@ static inline void rotate_runqueue_prior
 	}
 	rq->prio_level = new_prio_level;
 	/*
-	 * While we usually rotate with the rq quota being 0, it is possible
-	 * to be negative so we subtract any deficit from the new level.
+	 * As we are merging to a prio_level that may not have anything in
+	 * its quota we add 1 to ensure the tasks get to run in schedule() to
+	 * add their quota to it.
 	 */
-	rq_quota(rq, new_prio_level) += remaining_quota;
+	rq_quota(rq, new_prio_level) += 1;
 }
 
 static void task_running_tick(struct rq *rq, struct task_struct *p)
@@ -3397,12 +3396,11 @@ static void task_running_tick(struct rq 
 	if (!--p->time_slice)
 		task_expired_entitlement(rq, p);
 	/*
-	 * The rq quota can become negative due to a task being queued in
-	 * scheduler without any quota left at that priority level. It is
-	 * cheaper to allow it to run till this scheduler tick and then
-	 * subtract it from the quota of the merged queues.
+	 * We only employ the deadline mechanism if we run over the quota.
+	 * It allows aliasing problems around the scheduler_tick to be
+	 * less harmful.
 	 */
-	if (!rt_task(p) && --rq_quota(rq, rq->prio_level) <= 0) {
+	if (!rt_task(p) && --rq_quota(rq, rq->prio_level) < 0) {
 		if (unlikely(p->first_time_slice))
 			p->first_time_slice = 0;
 		rotate_runqueue_priority(rq);


-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-11 22:12                     ` Con Kolivas
@ 2007-03-12  4:42                       ` Al Boldi
  2007-03-12  4:53                         ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-12  4:42 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck list, linux-kernel

Con Kolivas wrote:
> On Monday 12 March 2007 08:52, Con Kolivas wrote:
> > And thank you! I think I know what's going on now. I think each rotation
> > is followed by another rotation before the higher priority task is
> > getting a look in in schedule() to even get quota and add it to the
> > runqueue quota. I'll try a simple change to see if that helps. Patch
> > coming up shortly.
>
> Can you try the following patch and see if it helps. There's also one
> minor preemption logic fix in there that I'm planning on including.
> Thanks!

Applied on top of v0.28 mainline, and there is no difference.

What's it look like on your machine?


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12  4:42                       ` Al Boldi
@ 2007-03-12  4:53                         ` Con Kolivas
  2007-03-12 11:26                           ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-12  4:53 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck list, linux-kernel

On Monday 12 March 2007 15:42, Al Boldi wrote:
> Con Kolivas wrote:
> > On Monday 12 March 2007 08:52, Con Kolivas wrote:
> > > And thank you! I think I know what's going on now. I think each
> > > rotation is followed by another rotation before the higher priority
> > > task is getting a look in in schedule() to even get quota and add it to
> > > the runqueue quota. I'll try a simple change to see if that helps.
> > > Patch coming up shortly.
> >
> > Can you try the following patch and see if it helps. There's also one
> > minor preemption logic fix in there that I'm planning on including.
> > Thanks!
>
> Applied on top of v0.28 mainline, and there is no difference.
>
> What's it look like on your machine?

The higher priority one always get 6-7ms whereas the lower priority one runs 
6-7ms and then one larger perfectly bound expiration amount. Basically 
exactly as I'd expect. The higher priority task gets precisely RR_INTERVAL 
maximum latency whereas the lower priority task gets RR_INTERVAL min and full 
expiration (according to the virtual deadline) as a maximum. That's exactly 
how I intend it to work. Yes I realise that the max latency ends up being 
longer intermittently on the niced task but that's -in my opinion- perfectly 
fine as a compromise to ensure the nice 0 one always gets low latency.

Eg:
nice 0 vs nice 10

nice 0:
pid 6288, prio   0, out for    7 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms
pid 6288, prio   0, out for    6 ms

nice 10:
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for   66 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms
pid 6290, prio  10, out for    6 ms

exactly as I'd expect. If you want fixed latencies _of niced tasks_ in the 
presence of less niced tasks you will not get them with this scheduler. What 
you will get, though, is a perfectly bound relationship knowing exactly what 
the maximum latency will ever be.

Thanks for the test case. It's interesting and nice that it confirms this 
scheduler works as I expect it to.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12  4:53                         ` Con Kolivas
@ 2007-03-12 11:26                           ` Al Boldi
  2007-03-12 12:52                             ` Con Kolivas
  2007-03-13 15:31                             ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Con Kolivas
  0 siblings, 2 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-12 11:26 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck list, linux-kernel

Con Kolivas wrote:
> On Monday 12 March 2007 15:42, Al Boldi wrote:
> > Con Kolivas wrote:
> > > On Monday 12 March 2007 08:52, Con Kolivas wrote:
> > > > And thank you! I think I know what's going on now. I think each
> > > > rotation is followed by another rotation before the higher priority
> > > > task is getting a look in in schedule() to even get quota and add it
> > > > to the runqueue quota. I'll try a simple change to see if that
> > > > helps. Patch coming up shortly.
> > >
> > > Can you try the following patch and see if it helps. There's also one
> > > minor preemption logic fix in there that I'm planning on including.
> > > Thanks!
> >
> > Applied on top of v0.28 mainline, and there is no difference.
> >
> > What's it look like on your machine?
>
> The higher priority one always get 6-7ms whereas the lower priority one
> runs 6-7ms and then one larger perfectly bound expiration amount.
> Basically exactly as I'd expect. The higher priority task gets precisely
> RR_INTERVAL maximum latency whereas the lower priority task gets
> RR_INTERVAL min and full expiration (according to the virtual deadline) as
> a maximum. That's exactly how I intend it to work. Yes I realise that the
> max latency ends up being longer intermittently on the niced task but
> that's -in my opinion- perfectly fine as a compromise to ensure the nice 0
> one always gets low latency.

I think, it should be possible to spread this max expiration latency across 
the rotation, should it not?


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 11:26                           ` Al Boldi
@ 2007-03-12 12:52                             ` Con Kolivas
  2007-03-12 14:14                               ` Al Boldi
                                                 ` (2 more replies)
  2007-03-13 15:31                             ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Con Kolivas
  1 sibling, 3 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-12 12:52 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck list, linux-kernel

On Monday 12 March 2007 22:26, Al Boldi wrote:
> Con Kolivas wrote:
> > On Monday 12 March 2007 15:42, Al Boldi wrote:
> > > Con Kolivas wrote:
> > > > On Monday 12 March 2007 08:52, Con Kolivas wrote:
> > > > > And thank you! I think I know what's going on now. I think each
> > > > > rotation is followed by another rotation before the higher priority
> > > > > task is getting a look in in schedule() to even get quota and add
> > > > > it to the runqueue quota. I'll try a simple change to see if that
> > > > > helps. Patch coming up shortly.
> > > >
> > > > Can you try the following patch and see if it helps. There's also one
> > > > minor preemption logic fix in there that I'm planning on including.
> > > > Thanks!
> > >
> > > Applied on top of v0.28 mainline, and there is no difference.
> > >
> > > What's it look like on your machine?
> >
> > The higher priority one always get 6-7ms whereas the lower priority one
> > runs 6-7ms and then one larger perfectly bound expiration amount.
> > Basically exactly as I'd expect. The higher priority task gets precisely
> > RR_INTERVAL maximum latency whereas the lower priority task gets
> > RR_INTERVAL min and full expiration (according to the virtual deadline)
> > as a maximum. That's exactly how I intend it to work. Yes I realise that
> > the max latency ends up being longer intermittently on the niced task but
> > that's -in my opinion- perfectly fine as a compromise to ensure the nice
> > 0 one always gets low latency.
>
> I think, it should be possible to spread this max expiration latency across
> the rotation, should it not?

There is a way that I toyed with of creating maps of slots to use for each 
different priority, but it broke the O(1) nature of the virtual deadline 
management. Minimising algorithmic complexity seemed more important to 
maintain than getting slightly better latency spreads for niced tasks. It 
also appeared to be less cache friendly in design. I could certainly try and 
implement it but how much importance are we to place on latency of niced 
tasks? Are you aware of any usage scenario where latency sensitive tasks are 
ever significantly niced in the real world?

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 12:52                             ` Con Kolivas
@ 2007-03-12 14:14                               ` Al Boldi
  2007-03-12 14:58                                 ` [ck] " jos poortvliet
  2007-03-12 18:05                                 ` Con Kolivas
  2007-03-18  1:30                               ` Bill Davidsen
  2007-03-18 10:50                               ` [ck] " jos poortvliet
  2 siblings, 2 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-12 14:14 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck list, linux-kernel

Con Kolivas wrote:
> > > The higher priority one always get 6-7ms whereas the lower priority
> > > one runs 6-7ms and then one larger perfectly bound expiration amount.
> > > Basically exactly as I'd expect. The higher priority task gets
> > > precisely RR_INTERVAL maximum latency whereas the lower priority task
> > > gets RR_INTERVAL min and full expiration (according to the virtual
> > > deadline) as a maximum. That's exactly how I intend it to work. Yes I
> > > realise that the max latency ends up being longer intermittently on
> > > the niced task but that's -in my opinion- perfectly fine as a
> > > compromise to ensure the nice 0 one always gets low latency.
> >
> > I think, it should be possible to spread this max expiration latency
> > across the rotation, should it not?
>
> There is a way that I toyed with of creating maps of slots to use for each
> different priority, but it broke the O(1) nature of the virtual deadline
> management. Minimising algorithmic complexity seemed more important to
> maintain than getting slightly better latency spreads for niced tasks. It
> also appeared to be less cache friendly in design. I could certainly try
> and implement it but how much importance are we to place on latency of
> niced tasks? Are you aware of any usage scenario where latency sensitive
> tasks are ever significantly niced in the real world?

It only takes one negatively nice'd proc to affect X adversely.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 14:14                               ` Al Boldi
@ 2007-03-12 14:58                                 ` jos poortvliet
  2007-03-12 16:37                                   ` michael chang
  2007-03-12 17:41                                   ` Al Boldi
  2007-03-12 18:05                                 ` Con Kolivas
  1 sibling, 2 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-12 14:58 UTC (permalink / raw)
  To: ck; +Cc: Al Boldi, Con Kolivas, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2344 bytes --]

Op Monday 12 March 2007, schreef Al Boldi:
> Con Kolivas wrote:
> > > > The higher priority one always get 6-7ms whereas the lower priority
> > > > one runs 6-7ms and then one larger perfectly bound expiration amount.
> > > > Basically exactly as I'd expect. The higher priority task gets
> > > > precisely RR_INTERVAL maximum latency whereas the lower priority task
> > > > gets RR_INTERVAL min and full expiration (according to the virtual
> > > > deadline) as a maximum. That's exactly how I intend it to work. Yes I
> > > > realise that the max latency ends up being longer intermittently on
> > > > the niced task but that's -in my opinion- perfectly fine as a
> > > > compromise to ensure the nice 0 one always gets low latency.
> > >
> > > I think, it should be possible to spread this max expiration latency
> > > across the rotation, should it not?
> >
> > There is a way that I toyed with of creating maps of slots to use for
> > each different priority, but it broke the O(1) nature of the virtual
> > deadline management. Minimising algorithmic complexity seemed more
> > important to maintain than getting slightly better latency spreads for
> > niced tasks. It also appeared to be less cache friendly in design. I
> > could certainly try and implement it but how much importance are we to
> > place on latency of niced tasks? Are you aware of any usage scenario
> > where latency sensitive tasks are ever significantly niced in the real
> > world?
>
> It only takes one negatively nice'd proc to affect X adversely.

Then, maybe, we should start nicing X again, like we did/had to do until a few 
years ago? Or should we just wait until X gets fixed (after all, development 
goes faster than ever)? Or is this really the scheduler's fault?

> Thanks!
>
> --
> Al
>
> _______________________________________________
> http://ck.kolivas.org/faqs/replying-to-mailing-list.txt
> ck mailing list - mailto: ck@vds.kolivas.org
> http://vds.kolivas.org/mailman/listinfo/ck



-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 14:58                                 ` [ck] " jos poortvliet
@ 2007-03-12 16:37                                   ` michael chang
  2007-03-12 17:41                                   ` Al Boldi
  1 sibling, 0 replies; 198+ messages in thread
From: michael chang @ 2007-03-12 16:37 UTC (permalink / raw)
  To: jos poortvliet; +Cc: ck, Al Boldi, linux-kernel

On 3/12/07, jos poortvliet <jos@mijnkamer.nl> wrote:
> Op Monday 12 March 2007, schreef Al Boldi:
> >
> > It only takes one negatively nice'd proc to affect X adversely.
>
> goes faster than ever)? Or is this really the scheduler's fault?
>

Take this with a grain of salt, but, I don't think this is the
scheduler's _fault_. That said, if the scheduler can fix it, it's not
necessarily a bad thing.

-- 
~Mike
 - Just the crazy copy cat.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 14:58                                 ` [ck] " jos poortvliet
  2007-03-12 16:37                                   ` michael chang
@ 2007-03-12 17:41                                   ` Al Boldi
  1 sibling, 0 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-12 17:41 UTC (permalink / raw)
  To: jos poortvliet, ck; +Cc: Con Kolivas, linux-kernel

jos poortvliet wrote:
> > It only takes one negatively nice'd proc to affect X adversely.
>
> Then, maybe, we should start nicing X again, like we did/had to do until a
> few years ago? Or should we just wait until X gets fixed (after all,
> development goes faster than ever)? Or is this really the scheduler's
> fault?

It's not enough to renice X.  You would have to renice it, and any app that 
needed fixed latency, to the same nice of the negatively nice'd proc, which 
defeats the purpose...


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 14:14                               ` Al Boldi
  2007-03-12 14:58                                 ` [ck] " jos poortvliet
@ 2007-03-12 18:05                                 ` Con Kolivas
  2007-03-12 18:47                                   ` [ck] " jos poortvliet
  1 sibling, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-12 18:05 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck list, linux-kernel

On Tuesday 13 March 2007 01:14, Al Boldi wrote:
> Con Kolivas wrote:
> > > > The higher priority one always get 6-7ms whereas the lower priority
> > > > one runs 6-7ms and then one larger perfectly bound expiration amount.
> > > > Basically exactly as I'd expect. The higher priority task gets
> > > > precisely RR_INTERVAL maximum latency whereas the lower priority task
> > > > gets RR_INTERVAL min and full expiration (according to the virtual
> > > > deadline) as a maximum. That's exactly how I intend it to work. Yes I
> > > > realise that the max latency ends up being longer intermittently on
> > > > the niced task but that's -in my opinion- perfectly fine as a
> > > > compromise to ensure the nice 0 one always gets low latency.
> > >
> > > I think, it should be possible to spread this max expiration latency
> > > across the rotation, should it not?
> >
> > There is a way that I toyed with of creating maps of slots to use for
> > each different priority, but it broke the O(1) nature of the virtual
> > deadline management. Minimising algorithmic complexity seemed more
> > important to maintain than getting slightly better latency spreads for
> > niced tasks. It also appeared to be less cache friendly in design. I
> > could certainly try and implement it but how much importance are we to
> > place on latency of niced tasks? Are you aware of any usage scenario
> > where latency sensitive tasks are ever significantly niced in the real
> > world?
>
> It only takes one negatively nice'd proc to affect X adversely.

I have an idea. Give me some time to code up my idea. Lack of sleep is making 
me very unpleasant.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 18:05                                 ` Con Kolivas
@ 2007-03-12 18:47                                   ` jos poortvliet
  2007-03-12 18:58                                     ` Antonio Vargas
  0 siblings, 1 reply; 198+ messages in thread
From: jos poortvliet @ 2007-03-12 18:47 UTC (permalink / raw)
  To: ck; +Cc: Con Kolivas, Al Boldi, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 1914 bytes --]

Op Monday 12 March 2007, schreef Con Kolivas:
> On Tuesday 13 March 2007 01:14, Al Boldi wrote:
> > Con Kolivas wrote:
> > > > > The higher priority one always get 6-7ms whereas the lower priority
> > > > > one runs 6-7ms and then one larger perfectly bound expiration
> > > > > amount. Basically exactly as I'd expect. The higher priority task
> > > > > gets precisely RR_INTERVAL maximum latency whereas the lower
> > > > > priority task gets RR_INTERVAL min and full expiration (according
> > > > > to the virtual deadline) as a maximum. That's exactly how I intend
> > > > > it to work. Yes I realise that the max latency ends up being longer
> > > > > intermittently on the niced task but that's -in my opinion-
> > > > > perfectly fine as a compromise to ensure the nice 0 one always gets
> > > > > low latency.
> > > >
> > > > I think, it should be possible to spread this max expiration latency
> > > > across the rotation, should it not?
> > >
> > > There is a way that I toyed with of creating maps of slots to use for
> > > each different priority, but it broke the O(1) nature of the virtual
> > > deadline management. Minimising algorithmic complexity seemed more
> > > important to maintain than getting slightly better latency spreads for
> > > niced tasks. It also appeared to be less cache friendly in design. I
> > > could certainly try and implement it but how much importance are we to
> > > place on latency of niced tasks? Are you aware of any usage scenario
> > > where latency sensitive tasks are ever significantly niced in the real
> > > world?
> >
> > It only takes one negatively nice'd proc to affect X adversely.
>
> I have an idea. Give me some time to code up my idea. Lack of sleep is
> making me very unpleasant.

You're excited by RSDL and the positive comments, aren't you? Well, don't 
forget to sleep, sleeping makes ppl smarter you know ;-)

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 18:47                                   ` [ck] " jos poortvliet
@ 2007-03-12 18:58                                     ` Antonio Vargas
  2007-03-19 10:47                                       ` Helge Hafting
  0 siblings, 1 reply; 198+ messages in thread
From: Antonio Vargas @ 2007-03-12 18:58 UTC (permalink / raw)
  To: linux-kernel; +Cc: ck, Al Boldi, Con Kolivas, jos poortvliet

On 3/12/07, jos poortvliet <jos@mijnkamer.nl> wrote:
> Op Monday 12 March 2007, schreef Con Kolivas:
> > On Tuesday 13 March 2007 01:14, Al Boldi wrote:
> > > Con Kolivas wrote:
> > > > > > The higher priority one always get 6-7ms whereas the lower priority
> > > > > > one runs 6-7ms and then one larger perfectly bound expiration
> > > > > > amount. Basically exactly as I'd expect. The higher priority task
> > > > > > gets precisely RR_INTERVAL maximum latency whereas the lower
> > > > > > priority task gets RR_INTERVAL min and full expiration (according
> > > > > > to the virtual deadline) as a maximum. That's exactly how I intend
> > > > > > it to work. Yes I realise that the max latency ends up being longer
> > > > > > intermittently on the niced task but that's -in my opinion-
> > > > > > perfectly fine as a compromise to ensure the nice 0 one always gets
> > > > > > low latency.
> > > > >
> > > > > I think, it should be possible to spread this max expiration latency
> > > > > across the rotation, should it not?
> > > >
> > > > There is a way that I toyed with of creating maps of slots to use for
> > > > each different priority, but it broke the O(1) nature of the virtual
> > > > deadline management. Minimising algorithmic complexity seemed more
> > > > important to maintain than getting slightly better latency spreads for
> > > > niced tasks. It also appeared to be less cache friendly in design. I
> > > > could certainly try and implement it but how much importance are we to
> > > > place on latency of niced tasks? Are you aware of any usage scenario
> > > > where latency sensitive tasks are ever significantly niced in the real
> > > > world?
> > >
> > > It only takes one negatively nice'd proc to affect X adversely.
> >
> > I have an idea. Give me some time to code up my idea. Lack of sleep is
> > making me very unpleasant.
>
> You're excited by RSDL and the positive comments, aren't you? Well, don't
> forget to sleep, sleeping makes ppl smarter you know ;-)
>
>

IIRC, about 2 or three years ago (or maybe on the 2.6.10 timeframe),
there was a patch which managed to pass the interactive from one app
to another when there was a pipe or udp connection between them. This
meant that a marked-as-interactive xterm would, when blocked waiting
for an Xserver response, transfer some of its interactiveness to the
Xserver, and aparently it worked very good for desktop workloads so,
maybe adapting it for this new scheduler would be good.



-- 
Greetz, Antonio Vargas aka winden of network

http://network.amigascne.org/
windNOenSPAMntw@gmail.com
thesameasabove@amigascne.org

Every day, every year
you have to work
you have to study
you have to scene.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice
  2007-03-12 11:26                           ` Al Boldi
  2007-03-12 12:52                             ` Con Kolivas
@ 2007-03-13 15:31                             ` Con Kolivas
  2007-03-13 16:03                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Con Kolivas
  2007-03-14  9:13                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Mike Galbraith
  1 sibling, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-13 15:31 UTC (permalink / raw)
  To: Al Boldi, Andrew Morton, Ingo Molnar; +Cc: ck list, linux-kernel

On Monday 12 March 2007 22:26, Al Boldi wrote:
> I think, it should be possible to spread this max expiration latency across
> the rotation, should it not?

Can you try the attached patch please Al and Mike? It "dithers" the priority
bitmap which tends to fluctuate the latency a lot more but in a cyclical
fashion. This tends to make the max latency bound to a smaller value and
should make it possible to run -nice tasks without killing the latency of the
non niced tasks. Eg you could possibly run X nice -10 at a guess like we used
to in 2.4 days. It's not essential of course, but is a workaround for Mike's
testcase.

Thanks.

---
Modify the priority bitmaps of different nice levels to be dithered
minimising the latency likely when different nice levels are used. This
allows low cpu using relatively niced tasks to still get low latency in the
presence of less niced tasks.

Fix the accounting on -nice levels to not be scaled by HZ.

Signed-off-by: Con Kolivas <kernel@kolivas.org>

---
 kernel/sched.c |   69 +++++++++++++++++++++++++++++++++++++--------------------
 1 file changed, 45 insertions(+), 24 deletions(-)

Index: linux-2.6.21-rc3-mm2/kernel/sched.c
===================================================================
--- linux-2.6.21-rc3-mm2.orig/kernel/sched.c	2007-03-13 23:17:29.000000000 +1100
+++ linux-2.6.21-rc3-mm2/kernel/sched.c	2007-03-14 02:20:37.000000000 +1100
@@ -89,24 +89,34 @@ unsigned long long __attribute__((weak))
 #define SCHED_PRIO(p)		((p)+MAX_RT_PRIO)
 #define MAX_DYN_PRIO		(MAX_PRIO + PRIO_RANGE)
 
-/*
- * Preemption needs to take into account that a low priority task can be
- * at a higher prio due to list merging. Its priority is artificially
- * elevated and it should be preempted if anything higher priority wakes up
- * provided it is not a realtime comparison.
- */
-#define TASK_PREEMPTS_CURR(p, curr) \
-	(((p)->prio < (curr)->prio) || (!rt_task(p) && \
-		((p)->static_prio < (curr)->static_prio && \
-			((curr)->static_prio > (curr)->prio))))
+#define TASK_PREEMPTS_CURR(p, curr)	((p)->prio < (curr)->prio)
 
 /*
  * This is the time all tasks within the same priority round robin.
  * Set to a minimum of 6ms.
  */
-#define RR_INTERVAL		((6 * HZ / 1001) + 1)
+#define __RR_INTERVAL		6
+#define RR_INTERVAL		((__RR_INTERVAL * HZ / 1001) + 1)
 #define DEF_TIMESLICE		(RR_INTERVAL * 20)
 
+/*
+ * This contains a bitmap for each dynamic priority level with empty slots
+ * for the valid priorities each different nice level can have. It allows
+ * us to stagger the slots where differing priorities run in a way that
+ * keeps latency differences between different nice levels at a minimum.
+ * ie, where 0 means a slot for that priority, priority running from left to
+ * right:
+ * nice -20 0000000000000000000000000000000000000000
+ * nice -10 1001000100100010001001000100010010001000
+ * nice   0 1010101010101010101010101010101010101010
+ * nice   5 1101011010110101101011010110101101011011
+ * nice  10 1101110110111011101101110111011011101110
+ * nice  15 1111101111110111111011111011111101111110
+ * nice  19 1111111111111111111011111111111111111111
+  */
+static unsigned long prio_matrix[PRIO_RANGE][BITS_TO_LONGS(PRIO_RANGE)]
+				__read_mostly;
+
 #ifdef CONFIG_SMP
 /*
  * Divide a load by a sched group cpu_power : (load / sg->__cpu_power)
@@ -649,15 +659,6 @@ static inline int task_queued(struct tas
 static inline void set_task_entitlement(struct task_struct *p)
 {
 	__set_bit(USER_PRIO(p->prio), p->bitmap);
-
-	/*
-	 * In the case this task has been part of a merged list that has
-	 * made it to higher priority than it should be, we remove the
-	 * quota from its own priority since it will get a quota at this
-	 * priority.
-	 */
-	if (p->normal_prio < p->static_prio)
-		__set_bit(USER_PRIO(p->static_prio), p->bitmap);
 	p->time_slice = p->quota;
 }
 
@@ -705,7 +706,8 @@ static void dequeue_task(struct task_str
  */
 static inline void task_new_array(struct task_struct *p, struct rq *rq)
 {
-	bitmap_zero(p->bitmap, PRIO_RANGE);
+	bitmap_copy(p->bitmap, prio_matrix[USER_PRIO(p->static_prio)],
+		    PRIO_RANGE);
 	p->rotation = rq->prio_rotation;
 }
 
@@ -833,11 +835,12 @@ static void requeue_task(struct task_str
  */
 static inline unsigned int task_timeslice(struct task_struct *p)
 {
-	unsigned int slice, rr;
+	unsigned int slice;
 
-	slice = rr = p->quota;
+	slice = p->quota;
 	if (likely(!rt_task(p)))
-		slice += (PRIO_RANGE - 1 - TASK_USER_PRIO(p)) * rr;
+		slice += (PRIO_RANGE - 1 - TASK_USER_PRIO(p)) *
+			__RR_INTERVAL / HZ;
 	return slice;
 }
 
@@ -7066,6 +7069,24 @@ void __init sched_init(void)
 	int i, j, k;
 	int highest_cpu = 0;
 
+	for (i = 0; i < PRIO_RANGE; i++) {
+		if (i < 20) {
+			bitmap_zero(prio_matrix[i] , PRIO_RANGE);
+			j = PRIO_RANGE * PRIO_RANGE / (i + 1);
+			for (k = j; k < PRIO_RANGE * PRIO_RANGE; k += j)
+				__set_bit(k / PRIO_RANGE, prio_matrix[i]);
+		} else if (i == 20) {
+			bitmap_fill(prio_matrix[i], PRIO_RANGE);
+			for (k = 1; k < PRIO_RANGE; k += 2)
+				__clear_bit(k, prio_matrix[i]);
+		} else {
+			bitmap_fill(prio_matrix[i], PRIO_RANGE);
+			j = PRIO_RANGE * PRIO_RANGE / (PRIO_RANGE - i + 1);
+			for (k = j; k < PRIO_RANGE * PRIO_RANGE; k += j)
+				__clear_bit(k / PRIO_RANGE, prio_matrix[i]);
+		}
+	}
+
 	for_each_possible_cpu(i) {
 		struct prio_array *array;
 		struct rq *rq;

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1
  2007-03-13 15:31                             ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Con Kolivas
@ 2007-03-13 16:03                               ` Con Kolivas
  2007-03-13 16:08                                 ` Con Kolivas
  2007-03-13 20:58                                 ` Con Kolivas
  2007-03-14  9:13                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Mike Galbraith
  1 sibling, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-13 16:03 UTC (permalink / raw)
  To: Al Boldi; +Cc: Andrew Morton, Ingo Molnar, ck list, linux-kernel

On Wednesday 14 March 2007 02:31, Con Kolivas wrote:
> On Monday 12 March 2007 22:26, Al Boldi wrote:
> > I think, it should be possible to spread this max expiration latency
> > across the rotation, should it not?
>
> Can you try the attached patch please Al and Mike? It "dithers" the
> priority bitmap which tends to fluctuate the latency a lot more but in a
> cyclical fashion. This tends to make the max latency bound to a smaller
> value and should make it possible to run -nice tasks without killing the
> latency of the non niced tasks. Eg you could possibly run X nice -10 at a
> guess like we used to in 2.4 days. It's not essential of course, but is a
> workaround for Mike's testcase.

Oops, one tiny fix. This is a respin of the patch, sorry.
---
Modify the priority bitmaps of different nice levels to be dithered
minimising the latency likely when different nice levels are used. This
allows low cpu using relatively niced tasks to still get low latency in the
presence of less niced tasks.

Fix the accounting on -nice levels to not be scaled by HZ.

Signed-off-by: Con Kolivas <kernel@kolivas.org>

---
 kernel/sched.c |   73 ++++++++++++++++++++++++++++++++++++---------------------
 1 file changed, 47 insertions(+), 26 deletions(-)

Index: linux-2.6.21-rc3-mm2/kernel/sched.c
===================================================================
--- linux-2.6.21-rc3-mm2.orig/kernel/sched.c	2007-03-13 23:17:29.000000000 +1100
+++ linux-2.6.21-rc3-mm2/kernel/sched.c	2007-03-14 03:01:58.000000000 +1100
@@ -89,24 +89,34 @@ unsigned long long __attribute__((weak))
 #define SCHED_PRIO(p)		((p)+MAX_RT_PRIO)
 #define MAX_DYN_PRIO		(MAX_PRIO + PRIO_RANGE)
 
-/*
- * Preemption needs to take into account that a low priority task can be
- * at a higher prio due to list merging. Its priority is artificially
- * elevated and it should be preempted if anything higher priority wakes up
- * provided it is not a realtime comparison.
- */
-#define TASK_PREEMPTS_CURR(p, curr) \
-	(((p)->prio < (curr)->prio) || (!rt_task(p) && \
-		((p)->static_prio < (curr)->static_prio && \
-			((curr)->static_prio > (curr)->prio))))
+#define TASK_PREEMPTS_CURR(p, curr)	((p)->prio < (curr)->prio)
 
 /*
  * This is the time all tasks within the same priority round robin.
  * Set to a minimum of 6ms.
  */
-#define RR_INTERVAL		((6 * HZ / 1001) + 1)
+#define __RR_INTERVAL		6
+#define RR_INTERVAL		((__RR_INTERVAL * HZ / 1001) + 1)
 #define DEF_TIMESLICE		(RR_INTERVAL * 20)
 
+/*
+ * This contains a bitmap for each dynamic priority level with empty slots
+ * for the valid priorities each different nice level can have. It allows
+ * us to stagger the slots where differing priorities run in a way that
+ * keeps latency differences between different nice levels at a minimum.
+ * ie, where 0 means a slot for that priority, priority running from left to
+ * right:
+ * nice -20 0000000000000000000000000000000000000000
+ * nice -10 1001000100100010001001000100010010001000
+ * nice   0 1010101010101010101010101010101010101010
+ * nice   5 1101011010110101101011010110101101011011
+ * nice  10 1101110110111011101101110111011011101110
+ * nice  15 1111101111110111111011111011111101111110
+ * nice  19 1111111111111111111011111111111111111111
+  */
+static unsigned long prio_matrix[PRIO_RANGE][BITS_TO_LONGS(PRIO_RANGE)]
+				__read_mostly;
+
 #ifdef CONFIG_SMP
 /*
  * Divide a load by a sched group cpu_power : (load / sg->__cpu_power)
@@ -649,15 +659,6 @@ static inline int task_queued(struct tas
 static inline void set_task_entitlement(struct task_struct *p)
 {
 	__set_bit(USER_PRIO(p->prio), p->bitmap);
-
-	/*
-	 * In the case this task has been part of a merged list that has
-	 * made it to higher priority than it should be, we remove the
-	 * quota from its own priority since it will get a quota at this
-	 * priority.
-	 */
-	if (p->normal_prio < p->static_prio)
-		__set_bit(USER_PRIO(p->static_prio), p->bitmap);
 	p->time_slice = p->quota;
 }
 
@@ -705,7 +706,8 @@ static void dequeue_task(struct task_str
  */
 static inline void task_new_array(struct task_struct *p, struct rq *rq)
 {
-	bitmap_zero(p->bitmap, PRIO_RANGE);
+	bitmap_copy(p->bitmap, prio_matrix[USER_PRIO(p->static_prio)],
+		    PRIO_RANGE);
 	p->rotation = rq->prio_rotation;
 }
 
@@ -746,7 +748,7 @@ static void recalc_task_prio(struct task
 			task_new_array(p, rq);
 	} else
 		task_new_array(p, rq);
-	search_prio = p->static_prio;
+	search_prio = MAX_RT_PRIO;
 
 	/*
 	 * SCHED_BATCH tasks never start at better priority than any other
@@ -755,7 +757,7 @@ static void recalc_task_prio(struct task
 	 * non SCHED_BATCH tasks of the same nice level.
 	 */
 	if (unlikely(p->policy == SCHED_BATCH))
-		search_prio = max(p->static_prio, rq->prio_level);
+		search_prio = rq->prio_level;
 	queue_prio = SCHED_PRIO(find_next_zero_bit(p->bitmap, PRIO_RANGE,
 		     USER_PRIO(search_prio)));
 	if (queue_prio == MAX_PRIO) {
@@ -833,11 +835,12 @@ static void requeue_task(struct task_str
  */
 static inline unsigned int task_timeslice(struct task_struct *p)
 {
-	unsigned int slice, rr;
+	unsigned int slice;
 
-	slice = rr = p->quota;
+	slice = p->quota;
 	if (likely(!rt_task(p)))
-		slice += (PRIO_RANGE - 1 - TASK_USER_PRIO(p)) * rr;
+		slice += (PRIO_RANGE - 1 - TASK_USER_PRIO(p)) *
+			__RR_INTERVAL / HZ;
 	return slice;
 }
 
@@ -7066,6 +7069,24 @@ void __init sched_init(void)
 	int i, j, k;
 	int highest_cpu = 0;
 
+	for (i = 0; i < PRIO_RANGE; i++) {
+		if (i < 20) {
+			bitmap_zero(prio_matrix[i] , PRIO_RANGE);
+			j = PRIO_RANGE * PRIO_RANGE / (i + 1);
+			for (k = j; k < PRIO_RANGE * PRIO_RANGE; k += j)
+				__set_bit(k / PRIO_RANGE, prio_matrix[i]);
+		} else if (i == 20) {
+			bitmap_fill(prio_matrix[i], PRIO_RANGE);
+			for (k = 1; k < PRIO_RANGE; k += 2)
+				__clear_bit(k, prio_matrix[i]);
+		} else {
+			bitmap_fill(prio_matrix[i], PRIO_RANGE);
+			j = PRIO_RANGE * PRIO_RANGE / (PRIO_RANGE - i + 1);
+			for (k = j; k < PRIO_RANGE * PRIO_RANGE; k += j)
+				__clear_bit(k / PRIO_RANGE, prio_matrix[i]);
+		}
+	}
+
 	for_each_possible_cpu(i) {
 		struct prio_array *array;
 		struct rq *rq;

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with  differential nice -1
  2007-03-13 16:03                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Con Kolivas
@ 2007-03-13 16:08                                 ` Con Kolivas
  2007-03-13 20:58                                 ` Con Kolivas
  1 sibling, 0 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-13 16:08 UTC (permalink / raw)
  To: ck; +Cc: Al Boldi, Andrew Morton, linux-kernel

On Wednesday 14 March 2007 03:03, Con Kolivas wrote:
> On Wednesday 14 March 2007 02:31, Con Kolivas wrote:
> > On Monday 12 March 2007 22:26, Al Boldi wrote:
> > > I think, it should be possible to spread this max expiration latency
> > > across the rotation, should it not?
> >
> > Can you try the attached patch please Al and Mike? It "dithers" the
> > priority bitmap which tends to fluctuate the latency a lot more but in a
> > cyclical fashion. This tends to make the max latency bound to a smaller
> > value and should make it possible to run -nice tasks without killing the
> > latency of the non niced tasks. Eg you could possibly run X nice -10 at a
> > guess like we used to in 2.4 days. It's not essential of course, but is a
> > workaround for Mike's testcase.
>
> Oops, one tiny fix. This is a respin of the patch, sorry.

A few other minor things would need to be updated before this patch is in a 
good enough shape to join the rsdl patches. This one will be good for testing 
though.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1
  2007-03-13 16:03                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Con Kolivas
  2007-03-13 16:08                                 ` Con Kolivas
@ 2007-03-13 20:58                                 ` Con Kolivas
  2007-03-13 23:08                                   ` RSDL development plans Con Kolivas
  1 sibling, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-13 20:58 UTC (permalink / raw)
  To: Al Boldi; +Cc: Andrew Morton, Ingo Molnar, ck list, linux-kernel

On Wednesday 14 March 2007 03:03, Con Kolivas wrote:
> On Wednesday 14 March 2007 02:31, Con Kolivas wrote:
> > On Monday 12 March 2007 22:26, Al Boldi wrote:
> > > I think, it should be possible to spread this max expiration latency
> > > across the rotation, should it not?
> >
> > Can you try the attached patch please Al and Mike? It "dithers" the
> > priority bitmap which tends to fluctuate the latency a lot more but in a
> > cyclical fashion. This tends to make the max latency bound to a smaller
> > value and should make it possible to run -nice tasks without killing the
> > latency of the non niced tasks. Eg you could possibly run X nice -10 at a
> > guess like we used to in 2.4 days. It's not essential of course, but is a
> > workaround for Mike's testcase.
>
> Oops, one tiny fix. This is a respin of the patch, sorry.
> ---

Bah with a bit more sleep under my belt it became clear that I forgot to 
update the expired array in any proper way so this change almost breaks stuff 
at the moment in the shape it's in. Please disregard this change for now 
apart from interest in how I'm tackling the nice issue.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* RSDL development plans
  2007-03-13 20:58                                 ` Con Kolivas
@ 2007-03-13 23:08                                   ` Con Kolivas
  2007-03-16 12:25                                     ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-13 23:08 UTC (permalink / raw)
  To: ck, Ingo Molnar; +Cc: Al Boldi, Andrew Morton, linux-kernel

On Wednesday 14 March 2007 07:58, Con Kolivas wrote:
> On Wednesday 14 March 2007 03:03, Con Kolivas wrote:
> > On Wednesday 14 March 2007 02:31, Con Kolivas wrote:
> > > On Monday 12 March 2007 22:26, Al Boldi wrote:
> > > > I think, it should be possible to spread this max expiration latency
> > > > across the rotation, should it not?
> > >
> > > Can you try the attached patch please Al and Mike? It "dithers" the
> > > priority bitmap which tends to fluctuate the latency a lot more but in
> > > a cyclical fashion. This tends to make the max latency bound to a
> > > smaller value and should make it possible to run -nice tasks without
> > > killing the latency of the non niced tasks. Eg you could possibly run X
> > > nice -10 at a guess like we used to in 2.4 days. It's not essential of
> > > course, but is a workaround for Mike's testcase.
> >
> > Oops, one tiny fix. This is a respin of the patch, sorry.

> Bah with a bit more sleep under my belt it became clear that I forgot to
> update the expired array in any proper way so this change almost breaks
> stuff at the moment in the shape it's in. Please disregard this change for
> now apart from interest in how I'm tackling the nice issue.

The rsdl patches queued up so far are stable and boot fine and are reasonably 
performant on many architectures so I'm quite happy for them to get a run 
in -mm. The changes planned will (as you may have seen on this email thread) 
decrease average latencies across all nice levels, and make differential nice 
levels run better together. This will allow -nice to be used without 
significant latency harm to not niced tasks (as there is presently in rsdl 
and mainline). The change required on top of the patch earlier in this email 
is to make the dynamic bitmap reflect where the tasks will actually be on an 
array swap.

However, I must inform people that I have to arrest the RSDL development for 
at least this week. I have a new and fairly serious neck problem that is 
being exacerbated badly by sitting in front of the computer for any extended 
period.

I suspect the inner workings of RSDL currently are not well understood yet by 
anyone else well enough to hack on it. I'm not at all opposed to someone 
taking up the code at the moment and making the necessary changes I've 
mentioned above in the meantime though if they can get their head around it.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice
  2007-03-13 15:31                             ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Con Kolivas
  2007-03-13 16:03                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Con Kolivas
@ 2007-03-14  9:13                               ` Mike Galbraith
  2007-03-14  9:25                                 ` Con Kolivas
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-14  9:13 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, Andrew Morton, Ingo Molnar, ck list, linux-kernel

On Wed, 2007-03-14 at 02:31 +1100, Con Kolivas wrote:

> Can you try the attached patch please Al and Mike? It "dithers" the priority
> bitmap which tends to fluctuate the latency a lot more but in a cyclical
> fashion. This tends to make the max latency bound to a smaller value and
> should make it possible to run -nice tasks without killing the latency of the
> non niced tasks. Eg you could possibly run X nice -10 at a guess like we used
> to in 2.4 days. It's not essential of course, but is a workaround for Mike's
> testcase.
> 

Oh my.  I thought I was all done staring mindlessly at gforce (chinese
water torture).  Oh well, a few more brain cells dying of boredom won't
kill me I guess ;-)  Will give it a shot.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with  differential nice
  2007-03-14  9:13                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Mike Galbraith
@ 2007-03-14  9:25                                 ` Con Kolivas
  2007-03-14  9:42                                   ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-14  9:25 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Al Boldi, Andrew Morton, Ingo Molnar, ck list, linux-kernel

On Wednesday 14 March 2007 20:13, Mike Galbraith wrote:
> On Wed, 2007-03-14 at 02:31 +1100, Con Kolivas wrote:
> > Can you try the attached patch please Al and Mike? It "dithers" the
> > priority bitmap which tends to fluctuate the latency a lot more but in a
> > cyclical fashion. This tends to make the max latency bound to a smaller
> > value and should make it possible to run -nice tasks without killing the
> > latency of the non niced tasks. Eg you could possibly run X nice -10 at a
> > guess like we used to in 2.4 days. It's not essential of course, but is a
> > workaround for Mike's testcase.
>
> Oh my.  I thought I was all done staring mindlessly at gforce (chinese
> water torture).  Oh well, a few more brain cells dying of boredom won't
> kill me I guess ;-)  Will give it a shot.

No don't. It's buggy and you missed the warning. Boy were you lucky I was 
looking right now.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with  differential nice
  2007-03-14  9:25                                 ` Con Kolivas
@ 2007-03-14  9:42                                   ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-14  9:42 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, Andrew Morton, Ingo Molnar, ck list, linux-kernel

On Wed, 2007-03-14 at 20:25 +1100, Con Kolivas wrote:

> No don't. It's buggy and you missed the warning. Boy were you lucky I was 
> looking right now.

I'll wait for a .31 and construct a 2.6.21-rc3 (isolation) test-tree
from that.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL development plans
  2007-03-13 23:08                                   ` RSDL development plans Con Kolivas
@ 2007-03-16 12:25                                     ` Con Kolivas
  2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
  2007-03-16 13:42                                       ` RSDL development plans Mike Galbraith
  0 siblings, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-16 12:25 UTC (permalink / raw)
  To: ck, Mike Galbraith; +Cc: Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Wednesday 14 March 2007 10:08, Con Kolivas wrote:
> However, I must inform people that I have to arrest the RSDL development
> for at least this week. I have a new and fairly serious neck problem that
> is being exacerbated badly by sitting in front of the computer for any
> extended period.

Thanks everyone for your offlist sympathy you sent me with respect to this 
neck problem. 

There is good news and bad news. The bad news is the nature of this problem is 
that it will get a bit worse before it gets better, and it has not peaked 
yet.

The good news is that in order to stay sane I've found ways of optimising my 
computing position and could work for short stints on the rsdl code. So I'll 
be releasing a new version shortly.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* RSDL v0.31
  2007-03-16 12:25                                     ` Con Kolivas
@ 2007-03-16 13:40                                       ` Con Kolivas
  2007-03-16 15:34                                         ` Mike Galbraith
                                                           ` (2 more replies)
  2007-03-16 13:42                                       ` RSDL development plans Mike Galbraith
  1 sibling, 3 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-16 13:40 UTC (permalink / raw)
  To: ck; +Cc: Mike Galbraith, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

Here are full patches for rsdl 0.31 for various base kernels. A full announce 
with a fresh -mm series will follow...

http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.31.patch
http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31.patch

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL development plans
  2007-03-16 12:25                                     ` Con Kolivas
  2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
@ 2007-03-16 13:42                                       ` Mike Galbraith
  2007-03-16 13:59                                         ` Con Kolivas
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-16 13:42 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Fri, 2007-03-16 at 23:25 +1100, Con Kolivas wrote:

> The good news is that in order to stay sane I've found ways of optimising my 
> computing position and could work for short stints on the rsdl code. So I'll 
> be releasing a new version shortly.

Can you please make an incremental diff as well?  That will make it
easier to see what changes have taken place, and make it easier to
integrate those into hacked-upon trees.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL development plans
  2007-03-16 13:42                                       ` RSDL development plans Mike Galbraith
@ 2007-03-16 13:59                                         ` Con Kolivas
  2007-03-16 14:07                                           ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-16 13:59 UTC (permalink / raw)
  To: Mike Galbraith; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Saturday 17 March 2007 00:42, Mike Galbraith wrote:
> On Fri, 2007-03-16 at 23:25 +1100, Con Kolivas wrote:
> > The good news is that in order to stay sane I've found ways of optimising
> > my computing position and could work for short stints on the rsdl code.
> > So I'll be releasing a new version shortly.
>
> Can you please make an incremental diff as well?  That will make it
> easier to see what changes have taken place, and make it easier to
> integrate those into hacked-upon trees.

They're already there in the subdirs.
ie
http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3/2.6.20.3-rsdl-0.30-0.31.patch
http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3/2.6.21-rc3-rsdl-0.30-0.31.patch
http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2/2.6.21-rc3-mm2-rsdl-0.30-0.31.patch

> 	-Mike

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL development plans
  2007-03-16 13:59                                         ` Con Kolivas
@ 2007-03-16 14:07                                           ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-16 14:07 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 00:59 +1100, Con Kolivas wrote:
> On Saturday 17 March 2007 00:42, Mike Galbraith wrote:
> > On Fri, 2007-03-16 at 23:25 +1100, Con Kolivas wrote:
> > > The good news is that in order to stay sane I've found ways of optimising
> > > my computing position and could work for short stints on the rsdl code.
> > > So I'll be releasing a new version shortly.
> >
> > Can you please make an incremental diff as well?  That will make it
> > easier to see what changes have taken place, and make it easier to
> > integrate those into hacked-upon trees.
> 
> They're already there in the subdirs.
> ie
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3/2.6.20.3-rsdl-0.30-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3/2.6.21-rc3-rsdl-0.30-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2/2.6.21-rc3-mm2-rsdl-0.30-0.31.patch

Ah, got it.  Thanks.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
@ 2007-03-16 15:34                                         ` Mike Galbraith
  2007-03-16 21:13                                           ` Con Kolivas
  2007-03-16 17:12                                         ` AshMilsted
  2007-03-16 21:55                                         ` Al Boldi
  2 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-16 15:34 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 00:40 +1100, Con Kolivas wrote:
> Here are full patches for rsdl 0.31 for various base kernels. A full announce 
> with a fresh -mm series will follow...
> 
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31.patch
> 

It still has trouble with the x/gforce vs two niced encoders scenario.
The previously reported choppiness is still present.  

I suspect that x/gforce landing in the expired array is the trouble, and
that this will never be smooth without some kind of exemption.  I added
some targeted unfairness to .30, and it didn't help much at all.
 
Priorities going all the way to 1 were a surprise.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
  2007-03-16 15:34                                         ` Mike Galbraith
@ 2007-03-16 17:12                                         ` AshMilsted
  2007-03-16 17:41                                           ` Gabriel C
  2007-03-16 21:55                                         ` Al Boldi
  2 siblings, 1 reply; 198+ messages in thread
From: AshMilsted @ 2007-03-16 17:12 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Al Boldi, Mike Galbraith, Andrew Morton, linux-kernel


Con Kolivas writes:

> Here are full patches for rsdl 0.31 for various base kernels. A full announce 
> with a fresh -mm series will follow...
> 
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.31.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31.patch
> 
> -- 
> -ck
> _______________________________________________
> http://ck.kolivas.org/faqs/replying-to-mailing-list.txt
> ck mailing list - mailto: ck@vds.kolivas.org
> http://vds.kolivas.org/mailman/listinfo/ck

Hi, I tried to send this before but I'm not sure it got through.
All I wanted to say was that this version does not improve my
problem with Wengophone (tested with 2.6.21-rc4). Otherwise it
seems to run fine. I won't be able to test anything new for
about a month, but I guess by then it'll be perfect anyway :)

Ash
---------------------------------------------
Free POP3 Email from www.Gawab.com 
Sign up NOW and get your account @gawab.com!!

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 17:12                                         ` AshMilsted
@ 2007-03-16 17:41                                           ` Gabriel C
  0 siblings, 0 replies; 198+ messages in thread
From: Gabriel C @ 2007-03-16 17:41 UTC (permalink / raw)
  To: AshMilsted
  Cc: Con Kolivas, ck, Al Boldi, Mike Galbraith, Andrew Morton, linux-kernel

AshMilsted wrote:
> Con Kolivas writes:
>
>   
>> Here are full patches for rsdl 0.31 for various base kernels. A full announce 
>> with a fresh -mm series will follow...
>>
>> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
>> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.31.patch
>> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31.patch
>>
>> -- 
>> -ck
>>     
>
> Hi, I tried to send this before but I'm not sure it got through.
> All I wanted to say was that this version does not improve my
> problem with Wengophone (tested with 2.6.21-rc4). Otherwise it
>   

Wengophone does not really work fine on 'Linux'  yet with whatever kernel.

The Linux version has a lot problems ( AFAIK all known by the Wengophone 
devels )

> seems to run fine. I won't be able to test anything new for
> about a month, but I guess by then it'll be perfect anyway :)
>
> Ash
> ---------------------------------------------
>   

Gabriel




^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 15:34                                         ` Mike Galbraith
@ 2007-03-16 21:13                                           ` Con Kolivas
  2007-03-16 22:30                                             ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-16 21:13 UTC (permalink / raw)
  To: Mike Galbraith; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Saturday 17 March 2007 02:34, Mike Galbraith wrote:
> On Sat, 2007-03-17 at 00:40 +1100, Con Kolivas wrote:
> > Here are full patches for rsdl 0.31 for various base kernels. A full
> > announce with a fresh -mm series will follow...
> >
> > http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
> > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.
> >31.patch
> > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31
> >.patch
>
> It still has trouble with the x/gforce vs two niced encoders scenario.
> The previously reported choppiness is still present.
>
> I suspect that x/gforce landing in the expired array is the trouble, and
> that this will never be smooth without some kind of exemption.  I added
> some targeted unfairness to .30, and it didn't help much at all.
>
> Priorities going all the way to 1 were a surprise.

It wasn't going to change that case without renicing X. I said that from the 
start to maintain fairness it's the only way to keep a fair design, and give 
more cpu to X. The major difference in this one is the ability to run 
different nice values without killing the latency of the relatively niced 
ones.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
  2007-03-16 15:34                                         ` Mike Galbraith
  2007-03-16 17:12                                         ` AshMilsted
@ 2007-03-16 21:55                                         ` Al Boldi
  2007-03-17  2:51                                           ` Con Kolivas
  2 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-16 21:55 UTC (permalink / raw)
  To: Con Kolivas, ck; +Cc: Mike Galbraith, Ingo Molnar, Andrew Morton, linux-kernel

Con Kolivas wrote:
> Here are full patches for rsdl 0.31 for various base kernels. A full
> announce with a fresh -mm series will follow...
>
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch

Thanks!  It looks much better now.

With X nice'd at -10, and 11 hogs loading the cpu, interactivity looks good 
until the default timeslice/quota is exhausted and slows down.  Maybe 
adjusting this according to nice could help.

It may also be advisable to fix latencies according to nice, and adjust 
timeslices instead.  This may help scaleability a lot, as there are some 
timing sensitive apps that may crash under high load.

Thanks!

--
Al

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 21:13                                           ` Con Kolivas
@ 2007-03-16 22:30                                             ` Mike Galbraith
  2007-03-16 23:05                                               ` [ck] " Dirk Schoebel
                                                                 ` (2 more replies)
  0 siblings, 3 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-16 22:30 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 08:13 +1100, Con Kolivas wrote:
> On Saturday 17 March 2007 02:34, Mike Galbraith wrote:
> > On Sat, 2007-03-17 at 00:40 +1100, Con Kolivas wrote:
> > > Here are full patches for rsdl 0.31 for various base kernels. A full
> > > announce with a fresh -mm series will follow...
> > >
> > > http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
> > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.
> > >31.patch
> > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31
> > >.patch
> >
> > It still has trouble with the x/gforce vs two niced encoders scenario.
> > The previously reported choppiness is still present.
> >
> > I suspect that x/gforce landing in the expired array is the trouble, and
> > that this will never be smooth without some kind of exemption.  I added
> > some targeted unfairness to .30, and it didn't help much at all.
> >
> > Priorities going all the way to 1 were a surprise.
> 
> It wasn't going to change that case without renicing X.

Con.  You are trying to wedge a fair scheduler into an environment where
totally fair simply can not possibly function.

If this is your final answer to the problem space, I am done testing,
and as far as _I_ am concerned, your scheduler is an utter failure.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-16 22:30                                             ` Mike Galbraith
@ 2007-03-16 23:05                                               ` Dirk Schoebel
  2007-03-17  4:24                                               ` Nicholas Miell
  2007-03-17 14:32                                               ` Rik van Riel
  2 siblings, 0 replies; 198+ messages in thread
From: Dirk Schoebel @ 2007-03-16 23:05 UTC (permalink / raw)
  To: ck; +Cc: linux-kernel

[-- Attachment #1: Type: text/plain, Size: 3415 bytes --]

Freitag, 16. März 2007 wrote Mike Galbraith:
> On Sat, 2007-03-17 at 08:13 +1100, Con Kolivas wrote:
> > On Saturday 17 March 2007 02:34, Mike Galbraith wrote:
> > > On Sat, 2007-03-17 at 00:40 +1100, Con Kolivas wrote:
> > > > Here are full patches for rsdl 0.31 for various base kernels. A full
> > > > announce with a fresh -mm series will follow...
> > > >
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.p
> > > >atch
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsd
> > > >l-0. 31.patch
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-
> > > >0.31 .patch
> > >
> > > It still has trouble with the x/gforce vs two niced encoders scenario.
> > > The previously reported choppiness is still present.
> > >
> > > I suspect that x/gforce landing in the expired array is the trouble,
> > > and that this will never be smooth without some kind of exemption.  I
> > > added some targeted unfairness to .30, and it didn't help much at all.
> > >
> > > Priorities going all the way to 1 were a surprise.
> >
> > It wasn't going to change that case without renicing X.
>
> Con.  You are trying to wedge a fair scheduler into an environment where
> totally fair simply can not possibly function.
>
> If this is your final answer to the problem space, I am done testing,
> and as far as _I_ am concerned, your scheduler is an utter failure.
>

I can not let this comment stay like that. I have an AMD X2 4400+ Dual Core 
running Gentoo and now kernel 2.6.21-rc3 with RSDL 0.30 (HZ=300).
Up till now whenever I wanted to watch a movie i had to stop compiling with 
more than one task for the movie to run without skips. When playing games i 
have to renice the game (-15-) or else it would get 'choppy'.
With the new RSDL i compile packages with -j3 (reniced to 15), my wife lets up 
to 8 computations (scientific computations) running at the same time and the 
game and a movie still run without any visible flaws. The only thing i saw 
till now was that the mouse cursor was a little less responsive and scrolling 
in firefox took a little longer. But amarok for music, the movie in mplayer, 
the 3d game, everything went smooth though a load of > 11. This all without 
even renicing anything but the compiles. With mainline kernel already 
watching a movie with this load was impossible.
I used the staircase scheduler before RSDL but even with staircase such 
overload was not possible while watching a movie.
Mike, maybe use higher nice levels for your encoders or just use one. Or maybe 
scheck your memory, i guess if the memory bandwidth is too low there's no 
scheduler which can foresee such thing and react accordingly. Since you have 
a HT system it's just one physical ALU, so everything has to be squeezed onto 
this one ALU, up to a certain degree it works, but not forever. And the lame 
encoders i suppose won't wait that very much and long for their data to get 
delivered from memory so they'll utilize the ALU quite a lot.
Con, continue your scheduler development as it helps many cases which were not 
possible otherwise. I'm amazed of the ability of the scheduler to handle a 5 
times overloaded system without too much hazzle.
Great work Con.

Dirk.

PS: Con, don't stress your neck too much, your health is the only thing you 
have to keep for live.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 21:55                                         ` Al Boldi
@ 2007-03-17  2:51                                           ` Con Kolivas
  2007-03-17  4:40                                             ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-17  2:51 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck, Ingo Molnar, Andrew Morton, linux-kernel

On Saturday 17 March 2007 08:55, Al Boldi wrote:
> Con Kolivas wrote:
> > Here are full patches for rsdl 0.31 for various base kernels. A full
> > announce with a fresh -mm series will follow...
> >
> > http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
>
> Thanks!  It looks much better now.

Thank you. You should find most of your latency concerns you brought up have 
been addressed by this change.

> With X nice'd at -10, and 11 hogs loading the cpu, interactivity looks good
> until the default timeslice/quota is exhausted and slows down.  Maybe
> adjusting this according to nice could help.

Not sure what you mean by that. It's still a fair distribution system, it's 
just that nice -10 has quite a bit more cpu allocated than nice 0. Eventually 
if you throw enough load at it it will slow down too.

> It may also be advisable to fix latencies according to nice, and adjust
> timeslices instead.  This may help scaleability a lot, as there are some
> timing sensitive apps that may crash under high load.

You will find that is the case already with this version. Even under heavy 
load if you were to be running one server niced (say httpd nice 19 in the 
presence of mysql nice 0) the latencies would be drastically reduced compared 
to mainline behaviour. I am aware this becomes an issue for some heavily 
loaded servers because some servers run multithreaded while others do not, 
forcing the admins to nice their multithreaded ones. 

> Thanks!
>
> --
> Al

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 22:30                                             ` Mike Galbraith
  2007-03-16 23:05                                               ` [ck] " Dirk Schoebel
@ 2007-03-17  4:24                                               ` Nicholas Miell
  2007-03-17  5:56                                                 ` Mike Galbraith
  2007-03-17 14:32                                               ` Rik van Riel
  2 siblings, 1 reply; 198+ messages in thread
From: Nicholas Miell @ 2007-03-17  4:24 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Fri, 2007-03-16 at 23:30 +0100, Mike Galbraith wrote:
> On Sat, 2007-03-17 at 08:13 +1100, Con Kolivas wrote:
> > On Saturday 17 March 2007 02:34, Mike Galbraith wrote:
> > > On Sat, 2007-03-17 at 00:40 +1100, Con Kolivas wrote:
> > > > Here are full patches for rsdl 0.31 for various base kernels. A full
> > > > announce with a fresh -mm series will follow...
> > > >
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.20.3-rsdl-0.31.patch
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-sched-rsdl-0.
> > > >31.patch
> > > > http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc3-mm2-rsdl-0.31
> > > >.patch
> > >
> > > It still has trouble with the x/gforce vs two niced encoders scenario.
> > > The previously reported choppiness is still present.
> > >
> > > I suspect that x/gforce landing in the expired array is the trouble, and
> > > that this will never be smooth without some kind of exemption.  I added
> > > some targeted unfairness to .30, and it didn't help much at all.
> > >
> > > Priorities going all the way to 1 were a surprise.
> > 
> > It wasn't going to change that case without renicing X.
> 
> Con.  You are trying to wedge a fair scheduler into an environment where
> totally fair simply can not possibly function.
> 
> If this is your final answer to the problem space, I am done testing,
> and as far as _I_ am concerned, your scheduler is an utter failure.
> 

Sorry, I haven't really been following this thread and now I'm confused.

You're saying that it's somehow the scheduler's fault that X isn't
running with a high enough priority?

-- 
Nicholas Miell <nmiell@comcast.net>


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  2:51                                           ` Con Kolivas
@ 2007-03-17  4:40                                             ` Al Boldi
  2007-03-17  4:57                                               ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-17  4:40 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Andrew Morton, linux-kernel

Con Kolivas wrote:
> On Saturday 17 March 2007 08:55, Al Boldi wrote:
> > With X nice'd at -10, and 11 hogs loading the cpu, interactivity looks
> > good until the default timeslice/quota is exhausted and slows down. 
> > Maybe adjusting this according to nice could help.
>
> Not sure what you mean by that. It's still a fair distribution system,
> it's just that nice -10 has quite a bit more cpu allocated than nice 0.
> Eventually if you throw enough load at it it will slow down too.

I mean #DEF_TIMESLICE seems like an initial quota, which gets reset with each 
major rotation.  Increasing this according to nice may give it more room for 
an interactivity boost, before being expired.  Alternatively, you may just 
reset the rotation, if the task didn't use it's quota within one full major 
rotation, effectively skipping the expiry toll.

> > It may also be advisable to fix latencies according to nice, and adjust
> > timeslices instead.  This may help scaleability a lot, as there are some
> > timing sensitive apps that may crash under high load.
>
> You will find that is the case already with this version. Even under heavy
> load if you were to be running one server niced (say httpd nice 19 in the
> presence of mysql nice 0) the latencies would be drastically reduced
> compared to mainline behaviour. I am aware this becomes an issue for some
> heavily loaded servers because some servers run multithreaded while others
> do not, forcing the admins to nice their multithreaded ones.

The thing is, latencies are currently dependent on the number of tasks in the 
run-queue; i.e. more rq-tasks means higher latencies, yet fixed timeslices 
according to nice.  Just switching this the other way around, by fixing 
latencies according to nice, and adjusting the timeslices depending on 
rq-load, may yield a much more scalable system.

Thanks!

--
Al

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  4:40                                             ` Al Boldi
@ 2007-03-17  4:57                                               ` Con Kolivas
  2007-03-17  5:15                                                 ` Gene Heskett
                                                                   ` (2 more replies)
  0 siblings, 3 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-17  4:57 UTC (permalink / raw)
  To: Al Boldi; +Cc: ck, Ingo Molnar, Andrew Morton, linux-kernel

On Saturday 17 March 2007 15:40, Al Boldi wrote:
> Con Kolivas wrote:
> > On Saturday 17 March 2007 08:55, Al Boldi wrote:
> > > With X nice'd at -10, and 11 hogs loading the cpu, interactivity looks
> > > good until the default timeslice/quota is exhausted and slows down.
> > > Maybe adjusting this according to nice could help.
> >
> > Not sure what you mean by that. It's still a fair distribution system,
> > it's just that nice -10 has quite a bit more cpu allocated than nice 0.
> > Eventually if you throw enough load at it it will slow down too.
>
> I mean #DEF_TIMESLICE seems like an initial quota, which gets reset with
> each major rotation.  Increasing this according to nice may give it more
> room for an interactivity boost, before being expired.  Alternatively, you
> may just reset the rotation, if the task didn't use it's quota within one
> full major rotation, effectively skipping the expiry toll.

DEF_TIMESLICE is a value used for smp balancing and has no effect on quota so 
I doubt you mean that value. The quota you're describing of not resetting is 
something like the sleep average idea of current systems where you accumulate 
bonus points by sleeping when you would be running and redeem them later. 
This is exactly the system I'm avoiding using in rsdl as you'd have to decide 
just how much sleep time it could accumulate, and over how long it would run 
out, and so on. ie that's the interactivity estimator. This is the system 
that destroys any guarantee of cpu percentage, and ends up leading to periods 
of relative starvation, is open to tuning that can either be too short or too 
long depending on the values you chose and so on.

> > > It may also be advisable to fix latencies according to nice, and adjust
> > > timeslices instead.  This may help scaleability a lot, as there are
> > > some timing sensitive apps that may crash under high load.
> >
> > You will find that is the case already with this version. Even under
> > heavy load if you were to be running one server niced (say httpd nice 19
> > in the presence of mysql nice 0) the latencies would be drastically
> > reduced compared to mainline behaviour. I am aware this becomes an issue
> > for some heavily loaded servers because some servers run multithreaded
> > while others do not, forcing the admins to nice their multithreaded ones.
>
> The thing is, latencies are currently dependent on the number of tasks in
> the run-queue; i.e. more rq-tasks means higher latencies, yet fixed
> timeslices according to nice.  Just switching this the other way around, by
> fixing latencies according to nice, and adjusting the timeslices depending
> on rq-load, may yield a much more scalable system.

That is not really feasible to implement. How can you guarantee latencies when 
the system is overloaded? If you have 1000 tasks all trying to get scheduled 
in say 10ms you end up running for only 10 microseconds at a time. That will 
achieve the exact opposite whereby as the load increases the runtime gets 
shorter and shorter till cpu cache trashing and no real work occurs.

> Thanks!
>
> --
> Al

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  4:57                                               ` Con Kolivas
@ 2007-03-17  5:15                                                 ` Gene Heskett
  2007-03-17 13:50                                                 ` Ed Tomlinson
  2007-03-17 16:12                                                 ` Al Boldi
  2 siblings, 0 replies; 198+ messages in thread
From: Gene Heskett @ 2007-03-17  5:15 UTC (permalink / raw)
  To: linux-kernel; +Cc: Con Kolivas, Al Boldi, ck, Ingo Molnar, Andrew Morton

Greetings Con & company;

I built and rebooted to 2.6.20.3-rdsl-0.31 earlier this evening, but 
purposely waited till amanda was well underway to make a report.

The report is that I really really have to work hard to tell that amanda 
is running even though the cpu according to gkrellm is running between 97 
and 99%.

For my loading then, this is as much an improvement over the -ck1 patch as 
it was over the un-patched but same version of the kernel.  FWIW, I'd 
also built a 2.6.20.3-rdsl-0.30 and ran it for a day but it was nearly as 
spastic as no patch.

Did I say I like this yet? :)

Now I'm waiting for 2.6.21-rc4 to make the mirrors & see if tar is still 
broken.  Based on the clues I've been able to find, I bz'd the tar since 
that's a fedora supplied rpm install.  Humm, I just now recalled that I 
have a tarball built tar-1.15.1 on another drive, I was using it when I 
was running FC2, so that might be something else to bisect against, and I 
will, bet on it.

Many thanks Con, this is very nice.  I've only seen one split second when 
the screen was about 2 chars behind my typing.  This is great. :-)

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
When a cow laughs, does milk come out of its nose?

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  4:24                                               ` Nicholas Miell
@ 2007-03-17  5:56                                                 ` Mike Galbraith
  2007-03-17  6:08                                                   ` Mike Galbraith
  2007-03-17  6:26                                                   ` Nicholas Miell
  0 siblings, 2 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17  5:56 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Fri, 2007-03-16 at 21:24 -0700, Nicholas Miell wrote:

> Sorry, I haven't really been following this thread and now I'm confused.
> 
> You're saying that it's somehow the scheduler's fault that X isn't
> running with a high enough priority?

I'm saying that the current scheduler adjusts for interactive loads,
this new one doesn't.  I'm seeing interactivity regressions, and they
are not fixed with nice unless nice is used to maximum effect.  I'm
saying yes, I can lower my expectations, but no I don't want to.

A four line summary is as short as I can make it.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  5:56                                                 ` Mike Galbraith
@ 2007-03-17  6:08                                                   ` Mike Galbraith
  2007-03-17 13:56                                                     ` Ed Tomlinson
                                                                       ` (2 more replies)
  2007-03-17  6:26                                                   ` Nicholas Miell
  1 sibling, 3 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17  6:08 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
world has to adjust to fit this new scheduler.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  5:56                                                 ` Mike Galbraith
  2007-03-17  6:08                                                   ` Mike Galbraith
@ 2007-03-17  6:26                                                   ` Nicholas Miell
  2007-03-17  7:11                                                     ` Mike Galbraith
                                                                       ` (2 more replies)
  1 sibling, 3 replies; 198+ messages in thread
From: Nicholas Miell @ 2007-03-17  6:26 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 06:56 +0100, Mike Galbraith wrote:
> On Fri, 2007-03-16 at 21:24 -0700, Nicholas Miell wrote:
> 
> > Sorry, I haven't really been following this thread and now I'm confused.
> > 
> > You're saying that it's somehow the scheduler's fault that X isn't
> > running with a high enough priority?
> 
> I'm saying that the current scheduler adjusts for interactive loads,
> this new one doesn't.  I'm seeing interactivity regressions, and they
> are not fixed with nice unless nice is used to maximum effect.  I'm
> saying yes, I can lower my expectations, but no I don't want to.
> 
> A four line summary is as short as I can make it.
> 
> 	-Mike

Uh, no. Essentially, the current scheduler works around X's brokenness,
in an often unpredictable manner.

RSDL appears to be completely deterministic, which is a very strong
virtue.

The X people have plans for how to go about fixing this, but until then,
there's no reason to hold up kernel development.

-- 
Nicholas Miell <nmiell@comcast.net>


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  6:26                                                   ` Nicholas Miell
@ 2007-03-17  7:11                                                     ` Mike Galbraith
  2007-03-17  7:25                                                       ` William Lee Irwin III
  2007-03-17 11:48                                                       ` Gene Heskett
  2007-03-17  7:45                                                     ` Ingo Molnar
  2007-03-17  7:56                                                     ` Ingo Molnar
  2 siblings, 2 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17  7:11 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Fri, 2007-03-16 at 23:26 -0700, Nicholas Miell wrote:

> RSDL appears to be completely deterministic, which is a very strong
> virtue.

Yes.  That's why RSDL aroused my curiosity big time.

> The X people have plans for how to go about fixing this, but until then,
> there's no reason to hold up kernel development.

I'm not in a position to hold up development.

On a side note, I wonder how long it's going to take to fix all the
X/client combinations out there.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:11                                                     ` Mike Galbraith
@ 2007-03-17  7:25                                                       ` William Lee Irwin III
  2007-03-17  7:29                                                         ` Nicholas Miell
  2007-03-17 11:48                                                       ` Gene Heskett
  1 sibling, 1 reply; 198+ messages in thread
From: William Lee Irwin III @ 2007-03-17  7:25 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Nicholas Miell, Con Kolivas, ck, Ingo Molnar, Al Boldi,
	Andrew Morton, linux-kernel

On Sat, Mar 17, 2007 at 08:11:57AM +0100, Mike Galbraith wrote:
> On a side note, I wonder how long it's going to take to fix all the
> X/client combinations out there.

AIUI X's clients largely access it via libraries X ships, so the X
update will sweep the vast majority of them in one shot. You'll have
to either run the clients from remote hosts with downrev libraries or
have downrev libraries around (e.g. in chroots) for clients to link to
for the clients not to cooperate.


-- wli

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:25                                                       ` William Lee Irwin III
@ 2007-03-17  7:29                                                         ` Nicholas Miell
  0 siblings, 0 replies; 198+ messages in thread
From: Nicholas Miell @ 2007-03-17  7:29 UTC (permalink / raw)
  To: William Lee Irwin III
  Cc: Mike Galbraith, Con Kolivas, ck, Ingo Molnar, Al Boldi,
	Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 00:25 -0700, William Lee Irwin III wrote:
> On Sat, Mar 17, 2007 at 08:11:57AM +0100, Mike Galbraith wrote:
> > On a side note, I wonder how long it's going to take to fix all the
> > X/client combinations out there.
> 
> AIUI X's clients largely access it via libraries X ships, so the X
> update will sweep the vast majority of them in one shot. You'll have
> to either run the clients from remote hosts with downrev libraries or
> have downrev libraries around (e.g. in chroots) for clients to link to
> for the clients not to cooperate.
> 

The changes will probably be entirely server-side anyway, so stray
ancient libraries won't be a problem.

-- 
Nicholas Miell <nmiell@comcast.net>


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:45                                                     ` Ingo Molnar
@ 2007-03-17  7:44                                                       ` David Lang
  2007-03-17  8:46                                                         ` Mike Galbraith
  2007-03-17  8:23                                                       ` Nicholas Miell
  2007-03-17  8:41                                                       ` RSDL v0.31 Serge Belyshev
  2 siblings, 1 reply; 198+ messages in thread
From: David Lang @ 2007-03-17  7:44 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Nicholas Miell, Mike Galbraith, Con Kolivas, ck, Al Boldi,
	Andrew Morton, Linus Torvalds, linux-kernel

On Sat, 17 Mar 2007, Ingo Molnar wrote:

> * Nicholas Miell <nmiell@comcast.net> wrote:
>
>> The X people have plans for how to go about fixing this, [...]
>
> then we'll first have wait for those X changes to at least be done in a
> minimal manner so that they can be tested for real with RSDL. (is it
> _really_ due to that? Or will X regress forever once we switch to RSDL?)
> We cannot regress the scheduling of a workload as important as "X mixed
> with CPU-intense tasks". And "in theory this should be fixed if X is
> fixed" does not cut it. X is pretty much _the_ most important thing to
> optimize the interactive behavior of a Linux scheduler for. Also,
> paradoxically, it is precisely the improvement of _X_ workloads that
> RSDL argues with.
>
> this regression has to be fixed before RSDL can be merged, simply
> because it is a pretty negative effect that goes beyond any of the
> visible positive improvements that RSDL brings over the current
> scheduler. If it is better to fix X, then X has to be fixed _first_, at
> least in form of a prototype patch that can be _tested_, and then the
> result has to be validated against RSDL.

why isn't niceing X to -10 an acceptable option?

if you overload the box enough things slow down, what scheduler avoids that?

where RSDL 'regresses' is with multiple CPU hog running at once (more then the 
number of real CPU's you have available) at the same priority, with one of them 
being the X server process.

the initial report was that with X + 2 cpu hogs on 1.5 cpu's there's more of a 
slowdown (even with a nice difference of 5 between X and the other processes)

the latest report is that with X and 11 cpu hogs and a nice difference of 10 
things slow down (I don't remember the number of cpu's from this report)

how much of an overload should the scheduler adjust for? a load of 3xcpu's? 
10x cpu's? what would be deemed acceptable?

if the key factor in a scheduler is to be able to run multiple CPU hogs at the 
same time as the X process, why not just check the name of the process running, 
and if it's X give it a nice boost?

while that would be easy to abuse, it would at least be predictable.

if the nice levels don't have enough of an effect, how much of an effect should 
a given nice level have? (con has asked this several times, I haven't seen an 
answer from anyone)

David Lang

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  6:26                                                   ` Nicholas Miell
  2007-03-17  7:11                                                     ` Mike Galbraith
@ 2007-03-17  7:45                                                     ` Ingo Molnar
  2007-03-17  7:44                                                       ` David Lang
                                                                         ` (2 more replies)
  2007-03-17  7:56                                                     ` Ingo Molnar
  2 siblings, 3 replies; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17  7:45 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Mike Galbraith, Con Kolivas, ck, Al Boldi, Andrew Morton,
	Linus Torvalds, linux-kernel

* Nicholas Miell <nmiell@comcast.net> wrote:

> The X people have plans for how to go about fixing this, [...]

then we'll first have wait for those X changes to at least be done in a 
minimal manner so that they can be tested for real with RSDL. (is it 
_really_ due to that? Or will X regress forever once we switch to RSDL?) 
We cannot regress the scheduling of a workload as important as "X mixed 
with CPU-intense tasks". And "in theory this should be fixed if X is 
fixed" does not cut it. X is pretty much _the_ most important thing to 
optimize the interactive behavior of a Linux scheduler for. Also, 
paradoxically, it is precisely the improvement of _X_ workloads that 
RSDL argues with.

this regression has to be fixed before RSDL can be merged, simply 
because it is a pretty negative effect that goes beyond any of the 
visible positive improvements that RSDL brings over the current 
scheduler. If it is better to fix X, then X has to be fixed _first_, at 
least in form of a prototype patch that can be _tested_, and then the 
result has to be validated against RSDL.

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  6:26                                                   ` Nicholas Miell
  2007-03-17  7:11                                                     ` Mike Galbraith
  2007-03-17  7:45                                                     ` Ingo Molnar
@ 2007-03-17  7:56                                                     ` Ingo Molnar
  2007-03-17 11:07                                                       ` [ck] " jos poortvliet
  2 siblings, 1 reply; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17  7:56 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Mike Galbraith, Con Kolivas, ck, Al Boldi, Andrew Morton,
	Linus Torvalds, linux-kernel

* Nicholas Miell <nmiell@comcast.net> wrote:

> > I'm saying that the current scheduler adjusts for interactive loads, 
> > this new one doesn't.  I'm seeing interactivity regressions, and 
> > they are not fixed with nice unless nice is used to maximum effect.  
> > I'm saying yes, I can lower my expectations, but no I don't want to.
> 
> Uh, no. Essentially, the current scheduler works around X's 
> brokenness, in an often unpredictable manner.

No. The two schedulers simply use different heuristics. RSDL uses _less_ 
heuristics, and thus gets some workloads right that the heuristics in 
the current scheduler got wrong. But it also gets some other workloads 
wrong.

so basically, the current scheduler has a built-in "auto-nice" feature, 
while RSDL relies more on manual assignment of nice values.

if you want no heuristics at all you can do it in the current scheduler: 
use SCHED_BATCH on your shell and start up X with that. I'd not mind 
tweaking SCHED_BATCH with an RSDL-alike timeslice quota system.

so it is not at all clear to me that RSDL is indeed an improvement, if 
it does not have comparable auto-nice properties.

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:45                                                     ` Ingo Molnar
  2007-03-17  7:44                                                       ` David Lang
@ 2007-03-17  8:23                                                       ` Nicholas Miell
  2007-03-17  9:42                                                         ` [patch] CFS scheduler: Completely Fair Scheduler / CONFIG_SCHED_FAIR Ingo Molnar
  2007-03-17  8:41                                                       ` RSDL v0.31 Serge Belyshev
  2 siblings, 1 reply; 198+ messages in thread
From: Nicholas Miell @ 2007-03-17  8:23 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Mike Galbraith, Con Kolivas, ck, Al Boldi, Andrew Morton,
	Linus Torvalds, linux-kernel

(sorry for the duplicate Ingo, this time I managed to Repy to All)

On Sat, 2007-03-17 at 08:45 +0100, Ingo Molnar wrote:
> * Nicholas Miell <nmiell@comcast.net> wrote:
> 
> > The X people have plans for how to go about fixing this, [...]
> 
> then we'll first have wait for those X changes to at least be done in a 
> minimal manner so that they can be tested for real with RSDL. (is it 
> _really_ due to that? Or will X regress forever once we switch to RSDL?)

Yes, it's an X problem.

There's two issues, really -- smooth pointer movement or the lack
thereof and the servicing of clients at varying priorities. There's
vague plans floating around about moving all input processing off into a
separate high-priority thread and pretty much no ideas how to deal with
mixed priority clients.

So, the current scheduler works around this brain damage using
heuristics that sort of do the job and sometimes screw things up.

> We cannot regress the scheduling of a workload as important as "X mixed 
> with CPU-intense tasks". And "in theory this should be fixed if X is 
> fixed" does not cut it. X is pretty much _the_ most important thing to 
> optimize the interactive behavior of a Linux scheduler for. Also, 
> paradoxically, it is precisely the improvement of _X_ workloads that 
> RSDL argues with.
> 
> this regression has to be fixed before RSDL can be merged, simply 
> because it is a pretty negative effect that goes beyond any of the 
> visible positive improvements that RSDL brings over the current 
> scheduler. If it is better to fix X, then X has to be fixed _first_, at 
> least in form of a prototype patch that can be _tested_, and then the 
> result has to be validated against RSDL.
> 

RSDL is, above all else, fair. Predictably so.
Hacking around X's stupidity makes it no longer *be* RSDL.

Until they catch up to the early-90s technology-wise, we can just nice
-19 X.

-- 
Nicholas Miell <nmiell@comcast.net>


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:45                                                     ` Ingo Molnar
  2007-03-17  7:44                                                       ` David Lang
  2007-03-17  8:23                                                       ` Nicholas Miell
@ 2007-03-17  8:41                                                       ` Serge Belyshev
  2007-03-17  9:48                                                         ` Con Kolivas
  2 siblings, 1 reply; 198+ messages in thread
From: Serge Belyshev @ 2007-03-17  8:41 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Nicholas Miell, Al Boldi, Mike Galbraith, linux-kernel, ck,
	Andrew Morton, Linus Torvalds

Ingo Molnar <mingo@elte.hu> writes:

> * Nicholas Miell <nmiell@comcast.net> wrote:
>
>> The X people have plans for how to go about fixing this, [...]
>
> [...] Or will X regress forever once we switch to RSDL?) 
> We cannot regress the scheduling of a workload as important as "X mixed 
> with CPU-intense tasks". And "in theory this should be fixed if X is 
> fixed" does not cut it. X is pretty much _the_ most important thing to 
> optimize the interactive behavior of a Linux scheduler for. Also, 
> paradoxically, it is precisely the improvement of _X_ workloads that 
> RSDL argues with.
>
> this regression has to be fixed before RSDL can be merged, [...]

Let me restate the fact, if it wasn't obvious enough, that most people
who tried RSDL (and most of them use desktop systems, me including) never
see any regressions compared to mainline. Quite contrary -- their impressions
were that with RSDL desktop system runs more smoothly, even under fierce load,
which was never possible with mainline scheduler.

(see http://article.gmane.org/gmane.linux.kernel/504068 for a list
of references.)

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:44                                                       ` David Lang
@ 2007-03-17  8:46                                                         ` Mike Galbraith
  2007-03-17 14:09                                                           ` [ck] " Mark Glines
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17  8:46 UTC (permalink / raw)
  To: David Lang
  Cc: Ingo Molnar, Nicholas Miell, Con Kolivas, ck, Al Boldi,
	Andrew Morton, Linus Torvalds, linux-kernel

On Fri, 2007-03-16 at 23:44 -0800, David Lang wrote:

> why isn't niceing X to -10 an acceptable option?

Xorg's priority is only part of the problem.  Every client that needs a
substantial quantity of cpu while a hog is running will also need to be
negative nice, no?

> if you overload the box enough things slow down, what scheduler avoids that?

(Hmm.  What's overload in a multi-tasking multi-threaded world?  I'm
always going to have more tasks available than cpus at some time.  With
KDE, seems to be the norm any time I poke a button)

> where RSDL 'regresses' is with multiple CPU hog running at once (more then the 
> number of real CPU's you have available) at the same priority, with one of them 
> being the X server process.
> 
> the initial report was that with X + 2 cpu hogs on 1.5 cpu's there's more of a 
> slowdown (even with a nice difference of 5 between X and the other processes)

I see interactivity regression with both X and client at nice -10 in the
presence of any cpu hog load.  Maybe a bug lurks.  Maybe it's fairness.

	-Mike

^ permalink raw reply	[flat|nested] 198+ messages in thread

* [patch] CFS scheduler: Completely Fair Scheduler / CONFIG_SCHED_FAIR
  2007-03-17  8:23                                                       ` Nicholas Miell
@ 2007-03-17  9:42                                                         ` Ingo Molnar
  0 siblings, 0 replies; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17  9:42 UTC (permalink / raw)
  To: Nicholas Miell
  Cc: Mike Galbraith, Con Kolivas, Andrew Morton, Linus Torvalds, linux-kernel


* Nicholas Miell <nmiell@comcast.net> wrote:

> > this regression has to be fixed before RSDL can be merged, simply 
> > because it is a pretty negative effect that goes beyond any of the 
> > visible positive improvements that RSDL brings over the current 
> > scheduler. If it is better to fix X, then X has to be fixed _first_, 
> > at least in form of a prototype patch that can be _tested_, and then 
> > the result has to be validated against RSDL.
> 
> RSDL is, above all else, fair. Predictably so.

SCHED_BATCH (an existing feature of the current scheduler) is even 
fairer and even more deterministic than RSDL, because it has _zero_ 
heuristics.

so how about the patch below (against current -git), which adds the 
"CFS, Completely Fair Scheduler" feature? With that you could test your 
upcoming X fixes. (it also adds /proc/sys/kernel/sched_fair so that you 
can compare the fair scheduler against the vanilla scheduler.) It's very 
simple and unintrusive:

 4 files changed, 28 insertions(+)

furthermore, this is just the first step: if CONFIG_SCHED_FAIR becomes 
widespread amongst distributions then we can remove the interactivity 
estimator code altogether, and simplify the code quite a bit.

( NOTE: more improvements are possible as well: right now most
  interactivity calculations are still done even if CONFIG_SCHED_FAIR is
  enabled - that could be improved upon. )

	Ingo

------------------------------>
Subject: [patch] CFS scheduler: Completely Fair Scheduler
From: Ingo Molnar <mingo@elte.hu>

add the CONFIG_SCHED_FAIR option (default: off): this turns the Linux 
scheduler into a completely fair scheduler for SCHED_OTHER tasks: with 
perfect roundrobin scheduling, fair distribution of timeslices combined 
with no interactivity boosting and no heuristics.

a /proc/sys/kernel/sched_fair option is also available to turn
this behavior on/off.

if this option establishes itself amongst leading distributions then we
could in the future remove the interactivity estimator altogether.

Signed-off-by: Ingo Molnar <mingo@elte.hu>
---
 include/linux/sched.h  |    1 +
 kernel/Kconfig.preempt |    9 +++++++++
 kernel/sched.c         |    8 ++++++++
 kernel/sysctl.c        |   10 ++++++++++
 4 files changed, 28 insertions(+)

Index: linux/include/linux/sched.h
===================================================================
--- linux.orig/include/linux/sched.h
+++ linux/include/linux/sched.h
@@ -119,6 +119,7 @@ extern unsigned long avenrun[];		/* Load
 	load += n*(FIXED_1-exp); \
 	load >>= FSHIFT;
 
+extern unsigned int sched_fair;
 extern unsigned long total_forks;
 extern int nr_threads;
 DECLARE_PER_CPU(unsigned long, process_counts);
Index: linux/kernel/Kconfig.preempt
===================================================================
--- linux.orig/kernel/Kconfig.preempt
+++ linux/kernel/Kconfig.preempt
@@ -63,3 +63,12 @@ config PREEMPT_BKL
 	  Say Y here if you are building a kernel for a desktop system.
 	  Say N if you are unsure.
 
+config SCHED_FAIR
+	bool "Completely Fair Scheduler"
+	help
+	  This option turns the Linux scheduler into a completely fair
+	  scheduler. User-space workloads will round-robin fairly, and
+	  they have to be prioritized using nice levels.
+
+	  Say N if you are unsure.
+
Index: linux/kernel/sched.c
===================================================================
--- linux.orig/kernel/sched.c
+++ linux/kernel/sched.c
@@ -4040,6 +4040,10 @@ static inline struct task_struct *find_p
 	return pid ? find_task_by_pid(pid) : current;
 }
 
+#ifdef CONFIG_SCHED_FAIR
+unsigned int sched_fair = 1;
+#endif
+
 /* Actually do priority change: must hold rq lock. */
 static void __setscheduler(struct task_struct *p, int policy, int prio)
 {
@@ -4055,6 +4059,10 @@ static void __setscheduler(struct task_s
 	 */
 	if (policy == SCHED_BATCH)
 		p->sleep_avg = 0;
+#ifdef CONFIG_SCHED_FAIR
+	if (policy == SCHED_NORMAL && sched_fair)
+		p->sleep_avg = 0;
+#endif
 	set_load_weight(p);
 }
 
Index: linux/kernel/sysctl.c
===================================================================
--- linux.orig/kernel/sysctl.c
+++ linux/kernel/sysctl.c
@@ -205,6 +205,16 @@ static ctl_table root_table[] = {
 };
 
 static ctl_table kern_table[] = {
+#ifdef CONFIG_SCHED_FAIR
+	{
+		.ctl_name	= CTL_UNNUMBERED,
+		.procname	= "sched_fair",
+		.data		= &sched_fair,
+		.maxlen		= sizeof(int),
+		.mode		= 0644,
+		.proc_handler	= &proc_dointvec,
+	},
+#endif
 	{
 		.ctl_name	= KERN_PANIC,
 		.procname	= "panic",

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  8:41                                                       ` RSDL v0.31 Serge Belyshev
@ 2007-03-17  9:48                                                         ` Con Kolivas
  2007-03-17  9:58                                                           ` Mike Galbraith
                                                                             ` (2 more replies)
  0 siblings, 3 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-17  9:48 UTC (permalink / raw)
  To: ck
  Cc: Serge Belyshev, Ingo Molnar, Al Boldi, Mike Galbraith,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

On Saturday 17 March 2007 19:41, Serge Belyshev wrote:
> Ingo Molnar <mingo@elte.hu> writes:
> > * Nicholas Miell <nmiell@comcast.net> wrote:
> >> The X people have plans for how to go about fixing this, [...]
> >
> > [...] Or will X regress forever once we switch to RSDL?)
> > We cannot regress the scheduling of a workload as important as "X mixed
> > with CPU-intense tasks". And "in theory this should be fixed if X is
> > fixed" does not cut it. X is pretty much _the_ most important thing to
> > optimize the interactive behavior of a Linux scheduler for. Also,
> > paradoxically, it is precisely the improvement of _X_ workloads that
> > RSDL argues with.
> >
> > this regression has to be fixed before RSDL can be merged, [...]
>
> Let me restate the fact, if it wasn't obvious enough, that most people
> who tried RSDL (and most of them use desktop systems, me including) never
> see any regressions compared to mainline. Quite contrary -- their
> impressions were that with RSDL desktop system runs more smoothly, even
> under fierce load, which was never possible with mainline scheduler.
>
> (see http://article.gmane.org/gmane.linux.kernel/504068 for a list
> of references.)

Well despite being in a drug induced stupor I find I have to reply on this 
thread. Hopefully I'm not doing my code a disservice by doing so. Who knows, 
maybe I make more sense?

The most frustrating part of a discussion of this nature on lkml is that 
earlier information in a thread seems to be long forgotten after a few days 
and all that is left is the one reporter having a problem. That's not to deny 
the one user is having a problem, but when you have a thousand downloads (no 
exaggeration) and only one person remains reporting badness it's frustrating 
that the problem actually comes down to one of semantics rather than a bug 
(will I nice or won't I).

So in an attempt to summarise the situation, what are the advantages of RSDL 
over mainline.

Fairness
Starvation free
Much lower and bound latencies
Deterministic
Better interactivity for the majority of cases.

Now concentrating on the very last aspect since that seems to be the sticking 
point.

I won't try and estimate what percentage is better, but overall it is _far_ 
more, _not_ less. The few scenarios that mainline remains better are 
unpredictable. This is where it gets interesting, because unlike mainline 
which does not have a good solution for the rest of the problems, all it 
takes is to renice X and then you have RSDL outperforming virtually always.

As for SCHED_BATCH on mainline, I think you'll find it is NOT as deterministic 
as you believe, leads to woeful interactivity, and still is starveable (just 
sleep just before your timeslice runs out). That is not a valid solution I'm 
sorry to say.

Despite the claims to the contrary, RSDL does not have _less_ heuristics, it 
does not have _any_. It's purely entitlement based. 

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  9:48                                                         ` Con Kolivas
@ 2007-03-17  9:58                                                           ` Mike Galbraith
  2007-03-17 10:49                                                             ` Mike Galbraith
                                                                               ` (3 more replies)
  2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
  2007-03-17 15:13                                                           ` RSDL v0.31 Mark Hahn
  2 siblings, 4 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17  9:58 UTC (permalink / raw)
  To: Con Kolivas
  Cc: ck, Serge Belyshev, Ingo Molnar, Al Boldi, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 20:48 +1100, Con Kolivas wrote:

> The most frustrating part of a discussion of this nature on lkml is that 
> earlier information in a thread seems to be long forgotten after a few days 
> and all that is left is the one reporter having a problem.

One?  I'm not the only person who reported regression.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  9:58                                                           ` Mike Galbraith
@ 2007-03-17 10:49                                                             ` Mike Galbraith
  2007-03-17 12:05                                                               ` Gene Heskett
  2007-03-17 13:58                                                             ` michael chang
                                                                               ` (2 subsequent siblings)
  3 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 10:49 UTC (permalink / raw)
  To: Con Kolivas
  Cc: ck, Serge Belyshev, Ingo Molnar, Al Boldi, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

Rocks and clubs at work (down boy <whack>, down i say!;).

This is .30 with some targeted unfairness. I seem to be making progress
toward beating it to a bloody but cooperative pulp.  It might be
possible to have my cake and eat it too.  Likely too ugly to live
though.

top - 11:35:50 up 57 min, 12 users,  load average: 5.20, 4.30, 2.57

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
 6599 root      26   0  174m  30m 8028 R   51  3.1   7:08.70 0 Xorg
 7991 root      29   0 18196  14m 5188 R   47  1.4   0:55.70 0 amarok_libvisua
 7995 root      37   5  3720 2444  976 R   44  0.2   0:27.53 1 lame
 7993 root      37   5  3720 2448  976 R   40  0.2   0:46.60 1 lame


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17  7:56                                                     ` Ingo Molnar
@ 2007-03-17 11:07                                                       ` jos poortvliet
  2007-03-17 12:44                                                         ` Ingo Molnar
  2007-03-17 14:04                                                         ` [ck] " Ed Tomlinson
  0 siblings, 2 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-17 11:07 UTC (permalink / raw)
  To: ck
  Cc: Ingo Molnar, Nicholas Miell, Al Boldi, Mike Galbraith,
	linux-kernel, Andrew Morton, Linus Torvalds

[-- Attachment #1: Type: text/plain, Size: 1539 bytes --]

Op Saturday 17 March 2007, schreef Ingo Molnar:
> so it is not at all clear to me that RSDL is indeed an improvement, if
> it does not have comparable auto-nice properties.

Wasn't the point of RSDL to get rid of the auto-nice, because it caused 
starvation, unpredictable behaviour and other problems?

Anyway, I think it's a good thing we keep having a look at mike's problem, but 
it's not clear to me how far he got in solving it. Does the latest patch 
solve the interactivity problem, providing X is niced -10 (or something)??? 

If it does, I think that's the solution - at least until the X ppl fix X 
itself. Distributions can just go back renicing X (they did that before, 
after all), and the biggest problem is fixed. Then all other users can have 
the improvements RSDL offers, the developers can rejoice over the simpler and 
cleaner design and code, and everybody is happy.

If it doesn't solve the problem, more work is in order. I think ignoring a 
clear regression to mainline, no matter how rare, isn't smart. It might 
indicate an underlying problem, and even if it doesn't - you don't want ppl 
complaining the new kernel isn't interactive anymore or something...

/Jos

-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  7:11                                                     ` Mike Galbraith
  2007-03-17  7:25                                                       ` William Lee Irwin III
@ 2007-03-17 11:48                                                       ` Gene Heskett
  1 sibling, 0 replies; 198+ messages in thread
From: Gene Heskett @ 2007-03-17 11:48 UTC (permalink / raw)
  To: linux-kernel
  Cc: Mike Galbraith, Nicholas Miell, Con Kolivas, ck, Ingo Molnar,
	Al Boldi, Andrew Morton

On Saturday 17 March 2007, Mike Galbraith wrote:
>On Fri, 2007-03-16 at 23:26 -0700, Nicholas Miell wrote:
>> RSDL appears to be completely deterministic, which is a very strong
>> virtue.
>
>Yes.  That's why RSDL aroused my curiosity big time.
>
>> The X people have plans for how to go about fixing this, but until
>> then, there's no reason to hold up kernel development.
>
>I'm not in a position to hold up development.
>
>On a side note, I wonder how long it's going to take to fix all the
>X/client combinations out there.

And on yet another side note Mike, I just did a make -j8 for all make 
options in my makeit script, something I don't normally do, and while 
running 2.6.20.3-rdsl-0.31, building 2.6.21-rc4, the machine remained 
100% responsive, worst case keyboard lag that I observed might have been 
200 or 300 milliseconds.  The machine remained usable, which to me is the 
bottom line.  And no, I wasn't running xmms at the time or watching 
tvtime else I'd have awakened the missus.

I'm having a hard time justifying your continual fussing as its obviously 
a huge improvement to me.  What makes your system and loading so much 
different from mine I wonder...

In the meantime I'm a very happy camper about this patch.

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
I prefer rogues to imbeciles because they sometimes take a rest.
		-- Alexandre Dumas, fils

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17  9:48                                                         ` Con Kolivas
  2007-03-17  9:58                                                           ` Mike Galbraith
@ 2007-03-17 11:49                                                           ` Ingo Molnar
  2007-03-17 12:02                                                             ` Con Kolivas
                                                                               ` (2 more replies)
  2007-03-17 15:13                                                           ` RSDL v0.31 Mark Hahn
  2 siblings, 3 replies; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17 11:49 UTC (permalink / raw)
  To: Con Kolivas
  Cc: ck, Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

* Con Kolivas <kernel@kolivas.org> wrote:

> Despite the claims to the contrary, RSDL does not have _less_ 
> heuristics, it does not have _any_. It's purely entitlement based.

RSDL still has heuristics very much, but this time it's hardcoded into 
the design! Let me demonstrate this via a simple experiment.

in the vanilla scheduler, the heuristics are ontop of a fairly basic 
(and fast) scheduler, they are plain visible and thus 'optional'. In 
RSDL, the heuristics are still present but more hidden and more 
engrained into the design.

But it's easy to demonstrate this under RSDL: consider the following two 
scenarios, which implement precisely the same fundamental computing 
workload (everything running on the same, default nice 0 level):

1) a single task runs almost all the time and sleeps about 1 msec every
   100 msecs.

   [ run "while N=1; do N=1; done &" under bash to create such a 
     workload. ]

2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
   msec and passes the 'token' around to the next task in the ring. (in
   essence every task will sleep 9900 msecs before getting another run)

   [ run http://redhat.com/~mingo/scheduler-patches/ring-test.c to
     create this workload. If the 100 tasks default is too much for you 
     then you can run "./ring-test 10" - that will show similar effects. 
   ]

Workload #1 uses 100% of CPU time. Workload #2 uses 99% of CPU time. 
They both do in essence the same thing.

if RSDL had no heuristics at all then if i mixed #1 with #2, both 
workloads would get roughly 50%/50% of the CPU, right? (as happens if i 
mix #1 with #1 - both CPU-intense workloads get half of the CPU)

in reality, in the 'ring workload' case, RSDL will only give about _5%_ 
of CPU time to the #1 CPU-intense task, and will give 95% of CPU time to 
the #2 'ring' of tasks. So the distribution of timeslices is 
significantly unfair!

Why? Because RSDL still has heuristics, just elsewhere and more hidden: 
in the "straightforward CPU intense task" case RSDL will 'penalize' the 
task by depleting its quota for running nearly all the time, in the 
"ring of tasks" case the 100 tasks will each run near their priority 
maximum, fed by 'major epoch' events of RSDL, thus they get 'rewarded' 
for seemingly sleeping alot and spreading things out. So RSDL has 
fundamental unfairness built in as well - it's just different from the 
vanilla scheduler.

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
@ 2007-03-17 12:02                                                             ` Con Kolivas
  2007-03-17 12:23                                                               ` [ck] " jos poortvliet
  2007-03-17 12:28                                                               ` Ingo Molnar
  2007-03-17 12:15                                                             ` jos poortvliet
  2007-03-17 20:41                                                             ` Avi Kivity
  2 siblings, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-17 12:02 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: ck, Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

On Saturday 17 March 2007 22:49, Ingo Molnar wrote:
> * Con Kolivas <kernel@kolivas.org> wrote:
> > Despite the claims to the contrary, RSDL does not have _less_
> > heuristics, it does not have _any_. It's purely entitlement based.
>
> RSDL still has heuristics very much, but this time it's hardcoded into
> the design! Let me demonstrate this via a simple experiment.
>
> in the vanilla scheduler, the heuristics are ontop of a fairly basic
> (and fast) scheduler, they are plain visible and thus 'optional'. In
> RSDL, the heuristics are still present but more hidden and more
> engrained into the design.
>
> But it's easy to demonstrate this under RSDL: consider the following two
> scenarios, which implement precisely the same fundamental computing
> workload (everything running on the same, default nice 0 level):
>
> 1) a single task runs almost all the time and sleeps about 1 msec every
>    100 msecs.
>
>    [ run "while N=1; do N=1; done &" under bash to create such a
>      workload. ]
>
> 2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
>    msec and passes the 'token' around to the next task in the ring. (in
>    essence every task will sleep 9900 msecs before getting another run)
>
>    [ run http://redhat.com/~mingo/scheduler-patches/ring-test.c to
>      create this workload. If the 100 tasks default is too much for you
>      then you can run "./ring-test 10" - that will show similar effects.
>    ]
>
> Workload #1 uses 100% of CPU time. Workload #2 uses 99% of CPU time.
> They both do in essence the same thing.
>
> if RSDL had no heuristics at all then if i mixed #1 with #2, both
> workloads would get roughly 50%/50% of the CPU, right? (as happens if i
> mix #1 with #1 - both CPU-intense workloads get half of the CPU)
>
> in reality, in the 'ring workload' case, RSDL will only give about _5%_
> of CPU time to the #1 CPU-intense task, and will give 95% of CPU time to
> the #2 'ring' of tasks. So the distribution of timeslices is
> significantly unfair!
>
> Why? Because RSDL still has heuristics, just elsewhere and more hidden:
> in the "straightforward CPU intense task" case RSDL will 'penalize' the
> task by depleting its quota for running nearly all the time, in the
> "ring of tasks" case the 100 tasks will each run near their priority
> maximum, fed by 'major epoch' events of RSDL, thus they get 'rewarded'
> for seemingly sleeping alot and spreading things out. So RSDL has
> fundamental unfairness built in as well - it's just different from the
> vanilla scheduler.

We're obviously disagreeing on what heuristics are so call it what you like.

You're simply cashing in on the deep pipes that do kernel work for other 
tasks. You know very well that I dropped the TASK_NONINTERACTIVE flag from 
rsdl which checks that tasks are waiting on pipes and you're exploiting it. 
That's not the RSDL heuristics at work at all, but you're trying to make it 
look like it is the intrinsic RSDL system at work. Putting that flag back in 
is simple enough when I'm not drugged. You could have simply pointed that out 
instead of trying to make my code look responsible. 

For the moment I'll assume you're not simply trying to make my code look bad 
and that you thought there really was an intrinsic design problem, otherwise 
I'd really be unhappy with what was happening to me.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 10:49                                                             ` Mike Galbraith
@ 2007-03-17 12:05                                                               ` Gene Heskett
  2007-03-17 13:36                                                                 ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Gene Heskett @ 2007-03-17 12:05 UTC (permalink / raw)
  To: linux-kernel
  Cc: Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	Al Boldi, Nicholas Miell, Linus Torvalds, Andrew Morton

On Saturday 17 March 2007, Mike Galbraith wrote:
>Rocks and clubs at work (down boy <whack>, down i say!;).
>
>This is .30 with some targeted unfairness. I seem to be making progress

Try -0.31, its better yet.

>toward beating it to a bloody but cooperative pulp.  It might be
>possible to have my cake and eat it too.  Likely too ugly to live
>though.
>
>top - 11:35:50 up 57 min, 12 users,  load average: 5.20, 4.30, 2.57
>
>  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
> 6599 root      26   0  174m  30m 8028 R   51  3.1   7:08.70 0 Xorg
> 7991 root      29   0 18196  14m 5188 R   47  1.4   0:55.70 0
> amarok_libvisua 7995 root      37   5  3720 2444  976 R   44  0.2  
> 0:27.53 1 lame 7993 root      37   5  3720 2448  976 R   40  0.2  
> 0:46.60 1 lame

What X driver are you using Mike?  Are you on a builtin video chipset 
using mobo shared memory?  I'm running the nvidia 9755 driver here with a 
jaton nvidia 6200-256 card here, and YES I KNOW that taints the kernel, 
but x is using on average, 1.3% of the cpu.  Its sitting at a -1 nice and 
it continues to give usable results here in the face of a make -j8 
running in another shell.

With figures (over 50% cpu to X)  like that, its no wonder you're 
squawking about it.

An old phrase comes to mind "Physician, Heal thyself".  Your X is broken 
and you are trying to blame everything but X.  Fix your hardware and 
drivers.

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
Two can Live as Cheaply as One for Half as Long.
		-- Howard Kandel

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
  2007-03-17 12:02                                                             ` Con Kolivas
@ 2007-03-17 12:15                                                             ` jos poortvliet
  2007-03-17 20:41                                                             ` Avi Kivity
  2 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-17 12:15 UTC (permalink / raw)
  To: ck
  Cc: Ingo Molnar, Con Kolivas, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

[-- Attachment #1: Type: text/plain, Size: 3355 bytes --]

Op Saturday 17 March 2007, schreef Ingo Molnar:
> * Con Kolivas <kernel@kolivas.org> wrote:
> > Despite the claims to the contrary, RSDL does not have _less_
> > heuristics, it does not have _any_. It's purely entitlement based.
>
> RSDL still has heuristics very much, but this time it's hardcoded into
> the design! Let me demonstrate this via a simple experiment.
>
> in the vanilla scheduler, the heuristics are ontop of a fairly basic
> (and fast) scheduler, they are plain visible and thus 'optional'. In
> RSDL, the heuristics are still present but more hidden and more
> engrained into the design.
>
> But it's easy to demonstrate this under RSDL: consider the following two
> scenarios, which implement precisely the same fundamental computing
> workload (everything running on the same, default nice 0 level):
>
> 1) a single task runs almost all the time and sleeps about 1 msec every
>    100 msecs.
>
>    [ run "while N=1; do N=1; done &" under bash to create such a
>      workload. ]
>
> 2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
>    msec and passes the 'token' around to the next task in the ring. (in
>    essence every task will sleep 9900 msecs before getting another run)
>
>    [ run http://redhat.com/~mingo/scheduler-patches/ring-test.c to
>      create this workload. If the 100 tasks default is too much for you
>      then you can run "./ring-test 10" - that will show similar effects.
>    ]
>
> Workload #1 uses 100% of CPU time. Workload #2 uses 99% of CPU time.
> They both do in essence the same thing.
>
> if RSDL had no heuristics at all then if i mixed #1 with #2, both
> workloads would get roughly 50%/50% of the CPU, right? (as happens if i
> mix #1 with #1 - both CPU-intense workloads get half of the CPU)

Isn't RSDL fair to each task? So each of the 101 tasks (1 and 2) gets an equal 
share... That doesn't sound unfair to me. if the current scheduler manages to 
give the single task 10 times more cpu to task one, that wouldn't be fair. 
Unless you want to be fair to a single process, no matter how many threads.

> in reality, in the 'ring workload' case, RSDL will only give about _5%_
> of CPU time to the #1 CPU-intense task, and will give 95% of CPU time to
> the #2 'ring' of tasks. So the distribution of timeslices is
> significantly unfair!
>
> Why? Because RSDL still has heuristics, just elsewhere and more hidden:
> in the "straightforward CPU intense task" case RSDL will 'penalize' the
> task by depleting its quota for running nearly all the time, in the
> "ring of tasks" case the 100 tasks will each run near their priority
> maximum, fed by 'major epoch' events of RSDL, thus they get 'rewarded'
> for seemingly sleeping alot and spreading things out. So RSDL has
> fundamental unfairness built in as well - it's just different from the
> vanilla scheduler.

I don't see RSDL having heuristics here - it just gives each task an equal 
share of CPU. The single task gets 1/101th of cpu, equal to all others...

> 	Ingo

I guess I just don't get it, would the current kernel give 50% cpu to the 
single thread, and 0,5% to each of the 100 other threads?!? How would it do 
that? Does it schedule a process with several threads equal to a process 
having 1 thread, so each gets an equal share of cpu?

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:02                                                             ` Con Kolivas
@ 2007-03-17 12:23                                                               ` jos poortvliet
  2007-03-17 17:31                                                                 ` David Schwartz
  2007-03-17 12:28                                                               ` Ingo Molnar
  1 sibling, 1 reply; 198+ messages in thread
From: jos poortvliet @ 2007-03-17 12:23 UTC (permalink / raw)
  To: ck
  Cc: Con Kolivas, Ingo Molnar, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

[-- Attachment #1: Type: text/plain, Size: 4560 bytes --]

Op Saturday 17 March 2007, schreef Con Kolivas:
> On Saturday 17 March 2007 22:49, Ingo Molnar wrote:
> > * Con Kolivas <kernel@kolivas.org> wrote:
> > > Despite the claims to the contrary, RSDL does not have _less_
> > > heuristics, it does not have _any_. It's purely entitlement based.
> >
> > RSDL still has heuristics very much, but this time it's hardcoded into
> > the design! Let me demonstrate this via a simple experiment.
> >
> > in the vanilla scheduler, the heuristics are ontop of a fairly basic
> > (and fast) scheduler, they are plain visible and thus 'optional'. In
> > RSDL, the heuristics are still present but more hidden and more
> > engrained into the design.
> >
> > But it's easy to demonstrate this under RSDL: consider the following two
> > scenarios, which implement precisely the same fundamental computing
> > workload (everything running on the same, default nice 0 level):
> >
> > 1) a single task runs almost all the time and sleeps about 1 msec every
> >    100 msecs.
> >
> >    [ run "while N=1; do N=1; done &" under bash to create such a
> >      workload. ]
> >
> > 2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
> >    msec and passes the 'token' around to the next task in the ring. (in
> >    essence every task will sleep 9900 msecs before getting another run)
> >
> >    [ run http://redhat.com/~mingo/scheduler-patches/ring-test.c to
> >      create this workload. If the 100 tasks default is too much for you
> >      then you can run "./ring-test 10" - that will show similar effects.
> >    ]
> >
> > Workload #1 uses 100% of CPU time. Workload #2 uses 99% of CPU time.
> > They both do in essence the same thing.
> >
> > if RSDL had no heuristics at all then if i mixed #1 with #2, both
> > workloads would get roughly 50%/50% of the CPU, right? (as happens if i
> > mix #1 with #1 - both CPU-intense workloads get half of the CPU)
> >
> > in reality, in the 'ring workload' case, RSDL will only give about _5%_
> > of CPU time to the #1 CPU-intense task, and will give 95% of CPU time to
> > the #2 'ring' of tasks. So the distribution of timeslices is
> > significantly unfair!
> >
> > Why? Because RSDL still has heuristics, just elsewhere and more hidden:
> > in the "straightforward CPU intense task" case RSDL will 'penalize' the
> > task by depleting its quota for running nearly all the time, in the
> > "ring of tasks" case the 100 tasks will each run near their priority
> > maximum, fed by 'major epoch' events of RSDL, thus they get 'rewarded'
> > for seemingly sleeping alot and spreading things out. So RSDL has
> > fundamental unfairness built in as well - it's just different from the
> > vanilla scheduler.
>
> We're obviously disagreeing on what heuristics are so call it what you
> like.
>
> You're simply cashing in on the deep pipes that do kernel work for other
> tasks. You know very well that I dropped the TASK_NONINTERACTIVE flag from
> rsdl which checks that tasks are waiting on pipes and you're exploiting it.
> That's not the RSDL heuristics at work at all, but you're trying to make it
> look like it is the intrinsic RSDL system at work. Putting that flag back
> in is simple enough when I'm not drugged. You could have simply pointed
> that out instead of trying to make my code look responsible.
>
> For the moment I'll assume you're not simply trying to make my code look
> bad and that you thought there really was an intrinsic design problem,
> otherwise I'd really be unhappy with what was happening to me.

Well, re-reading his post, he has a point - one WOULD expect each of these 2 
tasks to have an equal share of CPU, and if RSDL doesn't currently take this 
pipe-thing into account, it might need some fixing. call it heuristics or not 
(after all, how could one NOT say a scheduler uses heuristics of some kind?).

Anyway, relax (you know getting angry won't help you getting better) and 
remember this is email - not exactly a perfet way to communicate, esp in the 
emotional area. I haven't said this anywhere else, as I'm waiting for RSDL to 
be a bit more mature, but I have irritations with it as well - I don't have 
the long full-system stalls I had with staircase (hail RSDL!) but I do have 
more frequent, shorter stalls, when one app doesn't respond for up to 10 
seconds, while others just continue to work. So it's not perfect yet, and 
when I have time, I'll try to find out what's wrong. BTW, nice seems to help, 
but not entirely.

grtz

Jos


[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:02                                                             ` Con Kolivas
  2007-03-17 12:23                                                               ` [ck] " jos poortvliet
@ 2007-03-17 12:28                                                               ` Ingo Molnar
  2007-03-17 12:43                                                                 ` Con Kolivas
  1 sibling, 1 reply; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17 12:28 UTC (permalink / raw)
  To: Con Kolivas
  Cc: ck, Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

* Con Kolivas <kernel@kolivas.org> wrote:

> We're obviously disagreeing on what heuristics are [...]

that could very well be so - it would be helpful if you could provide 
your own rough definition for the term, so that we can agree on how to 
call things?

[ in any case, there's no rush here, please reply at your own pace, as
  your condition allows. I wish you a speedy recovery! ]

> You're simply cashing in on the deep pipes that do kernel work for 
> other tasks. You know very well that I dropped the TASK_NONINTERACTIVE 
> flag from rsdl which checks that tasks are waiting on pipes and you're 
> exploiting it.

Con, i am not 'cashing in' on anything and i'm not 'exploiting' 
anything. The TASK_NONINTERACTIVE flag is totally irrelevant to my 
argument because i was not testing the vanilla scheduler, i was testing 
RSDL. I could have written this test using plain sockets, because i was 
testing RSDL's claim of not having heuristics, i was not testing the 
vanilla scheduler.

I have simply replied to this claim of yours:

> > Despite the claims to the contrary, RSDL does not have _less_ 
> > heuristics, it does not have _any_. [...]

and i showed you a workload under _RSDL_ that clearly shows that RSDL is 
an unfair scheduler too.

my whole point was to counter the myth of 'RSDL has no heuristics'. Of 
course it has heuristics, which results in unfairness. (If it didnt have 
any heuristics that tilt the balance of scheduling towards sleep-intense 
tasks then a default Linux desktop would not be usable at all.)

so the decision is _not_ a puristic "do we want to have heuristics or 
not", the question is a more practical "which heuristics are simpler, 
which heuristics are more flexible, which heuristics result in better 
behavior".

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:28                                                               ` Ingo Molnar
@ 2007-03-17 12:43                                                                 ` Con Kolivas
  2007-03-17 16:34                                                                   ` Ingo Molnar
  2007-03-18  2:13                                                                   ` Bill Davidsen
  0 siblings, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-17 12:43 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: ck, Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

On Saturday 17 March 2007 23:28, Ingo Molnar wrote:
> * Con Kolivas <kernel@kolivas.org> wrote:
> > We're obviously disagreeing on what heuristics are [...]
>
> that could very well be so - it would be helpful if you could provide
> your own rough definition for the term, so that we can agree on how to
> call things?
>
> [ in any case, there's no rush here, please reply at your own pace, as
>   your condition allows. I wish you a speedy recovery! ]
>
> > You're simply cashing in on the deep pipes that do kernel work for
> > other tasks. You know very well that I dropped the TASK_NONINTERACTIVE
> > flag from rsdl which checks that tasks are waiting on pipes and you're
> > exploiting it.
>
> Con, i am not 'cashing in' on anything and i'm not 'exploiting'
> anything. The TASK_NONINTERACTIVE flag is totally irrelevant to my
> argument because i was not testing the vanilla scheduler, i was testing
> RSDL. I could have written this test using plain sockets, because i was
> testing RSDL's claim of not having heuristics, i was not testing the
> vanilla scheduler.
>
> I have simply replied to this claim of yours:
> > > Despite the claims to the contrary, RSDL does not have _less_
> > > heuristics, it does not have _any_. [...]
>
> and i showed you a workload under _RSDL_ that clearly shows that RSDL is
> an unfair scheduler too.
>
> my whole point was to counter the myth of 'RSDL has no heuristics'. Of
> course it has heuristics, which results in unfairness. (If it didnt have
> any heuristics that tilt the balance of scheduling towards sleep-intense
> tasks then a default Linux desktop would not be usable at all.)
>
> so the decision is _not_ a puristic "do we want to have heuristics or
> not", the question is a more practical "which heuristics are simpler,
> which heuristics are more flexible, which heuristics result in better
> behavior".
>
> 	Ingo

Ok but please look at how it appears from my end (illness aside).

I spend 3 years just diddling with scheduler code trying my hardest to find a 
design that fixes a whole swag of problems we still have, and a swag of 
problems we might get with other fixes.

You initially said you were pleased with this design.

..lots of code, testing, bugfixes and good feedback.

Then Mike has one testcase that most other users disagree is worthy of being 
considered a regresssion. You latched onto that and basically called it a 
showstopper in spite of who knows how many other positive things.

Then you quickly produce a counter patch designed to kill off RSDL with a 
config option for mainline.

Then you boldly announce on LKML "is RSDL an "unfair" scheduler too?" with 
some test case you whipped up to try and find fault with the design.

What am I supposed to think? Considering just how many problems I have 
addressed and tried to correct with RSDL succesfully I'm surprised that 
despite your enthusiasm for it initially you have spent the rest of the time 
trying to block it.

Please, either help me (and I'm in no shape to code at the moment despite what 
I have done so far), or say you have no intention of including it. I'm 
risking paralysis just by sitting at the computer right now so I'm dropping 
the code as is at the moment and will leave it up to your better judgement as 
to what to do with it.

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 11:07                                                       ` [ck] " jos poortvliet
@ 2007-03-17 12:44                                                         ` Ingo Molnar
  2007-03-17 13:44                                                           ` jos poortvliet
  2007-03-17 14:04                                                         ` [ck] " Ed Tomlinson
  1 sibling, 1 reply; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17 12:44 UTC (permalink / raw)
  To: jos poortvliet
  Cc: Mike Galbraith, Con Kolivas, linux-kernel, Andrew Morton, Linus Torvalds

* jos poortvliet <jos@mijnkamer.nl> wrote:

> Op Saturday 17 March 2007, schreef Ingo Molnar:
> > so it is not at all clear to me that RSDL is indeed an improvement, 
> > if it does not have comparable auto-nice properties.
> 
> Wasn't the point of RSDL to get rid of the auto-nice, because it 
> caused starvation, unpredictable behaviour and other problems?

it doesnt really get rid of it, it replaces it with another mechanism 
that is fundamentally unfair too.

RSDL has _another_, albeit more hidden "auto-nice" behavior: this time 
expressed not via the plain manipulation of priorities based on the 
sleep average, but expressed via the quota-depletion flux of tasks over 
time, fed into a complex dance of rotating priorities - which 
quota-depletion flux is in essence a sleep average too, just more 
derived and more hardcoded.

or looking at it from another angle, code size:

   text    data     bss     dec     hex filename
  15750      24    6008   21782    5516 sched.o.vanilla
  15960     360    6336   22656    5880 sched.o.rsdl

there's no reduction in complexity, it just moved elsewhere.

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 12:05                                                               ` Gene Heskett
@ 2007-03-17 13:36                                                                 ` Mike Galbraith
  2007-03-17 17:03                                                                   ` Gene Heskett
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 13:36 UTC (permalink / raw)
  To: Gene Heskett
  Cc: linux-kernel, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	Al Boldi, Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 08:05 -0400, Gene Heskett wrote:
> >
> >top - 11:35:50 up 57 min, 12 users,  load average: 5.20, 4.30, 2.57
> >
> >  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
> > 6599 root      26   0  174m  30m 8028 R   51  3.1   7:08.70 0 Xorg
> > 7991 root      29   0 18196  14m 5188 R   47  1.4   0:55.70 0
> > amarok_libvisua 7995 root      37   5  3720 2444  976 R   44  0.2  
> > 0:27.53 1 lame 7993 root      37   5  3720 2448  976 R   40  0.2  
> > 0:46.60 1 lame
> 
> What X driver are you using Mike?  Are you on a builtin video chipset 
> using mobo shared memory?  I'm running the nvidia 9755 driver here with a 
> jaton nvidia 6200-256 card here, and YES I KNOW that taints the kernel, 
> but x is using on average, 1.3% of the cpu.

Xorg is using 50% cpu because I'm asking it to.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 12:44                                                         ` Ingo Molnar
@ 2007-03-17 13:44                                                           ` jos poortvliet
  0 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-17 13:44 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Mike Galbraith, Con Kolivas, linux-kernel, Andrew Morton, Linus Torvalds

[-- Attachment #1: Type: text/plain, Size: 2478 bytes --]

Op Saturday 17 March 2007, schreef Ingo Molnar:
> * jos poortvliet <jos@mijnkamer.nl> wrote:
> > Op Saturday 17 March 2007, schreef Ingo Molnar:
> > > so it is not at all clear to me that RSDL is indeed an improvement,
> > > if it does not have comparable auto-nice properties.
> >
> > Wasn't the point of RSDL to get rid of the auto-nice, because it
> > caused starvation, unpredictable behaviour and other problems?
>
> it doesnt really get rid of it, it replaces it with another mechanism
> that is fundamentally unfair too.
>
> RSDL has _another_, albeit more hidden "auto-nice" behavior: this time
> expressed not via the plain manipulation of priorities based on the
> sleep average, but expressed via the quota-depletion flux of tasks over
> time, fed into a complex dance of rotating priorities - which
> quota-depletion flux is in essence a sleep average too, just more
> derived and more hardcoded.

Hmmm. I wonder, then, does RSDL give an advantage in the areas I mentioned 
(starvation and predictability)? 

RSDL does give equal timeslices (eg equal cpu time) to each process - it's 
just that processes which didn't use their time yet can quickly run, right? 
Now I might not understand things here, but that it sounds more fair, though 
you're more quallified to judge that. So, for me, as an user, it boils down 
to: does this solve problems, and does it introduce problems?

I must say I compare RSDL to staircase, as that's what I'm used to - a bit 
more interactive compared to mainline. RSDL does slightly worse AND slightly 
better - worse in interactivity on heavy loads (8 makes running on my 
dualcore) but it doesn't have the systemwide stalls sometimes occurring on 
both mainline and staircase - though it replaces them with shorter, 
app-specific ones (the worse interactivity I mention).

> or looking at it from another angle, code size:
>
>    text    data     bss     dec     hex filename
>   15750      24    6008   21782    5516 sched.o.vanilla
>   15960     360    6336   22656    5880 sched.o.rsdl
>
> there's no reduction in complexity, it just moved elsewhere.
>
> 	Ingo

-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  4:57                                               ` Con Kolivas
  2007-03-17  5:15                                                 ` Gene Heskett
@ 2007-03-17 13:50                                                 ` Ed Tomlinson
  2007-03-17 16:12                                                 ` Al Boldi
  2 siblings, 0 replies; 198+ messages in thread
From: Ed Tomlinson @ 2007-03-17 13:50 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, ck, Ingo Molnar, Andrew Morton, linux-kernel

On Saturday 17 March 2007 00:57, Con Kolivas wrote:
> On Saturday 17 March 2007 15:40, Al Boldi wrote:
> > Con Kolivas wrote:
> > > On Saturday 17 March 2007 08:55, Al Boldi wrote:
> > > > With X nice'd at -10, and 11 hogs loading the cpu, interactivity looks
> > > > good until the default timeslice/quota is exhausted and slows down.
> > > > Maybe adjusting this according to nice could help.
> > >
> > > Not sure what you mean by that. It's still a fair distribution system,
> > > it's just that nice -10 has quite a bit more cpu allocated than nice 0.
> > > Eventually if you throw enough load at it it will slow down too.
> >
> > I mean #DEF_TIMESLICE seems like an initial quota, which gets reset with
> > each major rotation.  Increasing this according to nice may give it more
> > room for an interactivity boost, before being expired.  Alternatively, you
> > may just reset the rotation, if the task didn't use it's quota within one
> > full major rotation, effectively skipping the expiry toll.
> 
> DEF_TIMESLICE is a value used for smp balancing and has no effect on quota so 
> I doubt you mean that value. The quota you're describing of not resetting is 
> something like the sleep average idea of current systems where you accumulate 
> bonus points by sleeping when you would be running and redeem them later. 
> This is exactly the system I'm avoiding using in rsdl as you'd have to decide 
> just how much sleep time it could accumulate, and over how long it would run 
> out, and so on. ie that's the interactivity estimator. This is the system 
> that destroys any guarantee of cpu percentage, and ends up leading to periods 
> of relative starvation, is open to tuning that can either be too short or too 
> long depending on the values you chose and so on.
> 
> > > > It may also be advisable to fix latencies according to nice, and adjust
> > > > timeslices instead.  This may help scaleability a lot, as there are
> > > > some timing sensitive apps that may crash under high load.
> > >
> > > You will find that is the case already with this version. Even under
> > > heavy load if you were to be running one server niced (say httpd nice 19
> > > in the presence of mysql nice 0) the latencies would be drastically
> > > reduced compared to mainline behaviour. I am aware this becomes an issue
> > > for some heavily loaded servers because some servers run multithreaded
> > > while others do not, forcing the admins to nice their multithreaded ones.
> >
> > The thing is, latencies are currently dependent on the number of tasks in
> > the run-queue; i.e. more rq-tasks means higher latencies, yet fixed
> > timeslices according to nice.  Just switching this the other way around, by
> > fixing latencies according to nice, and adjusting the timeslices depending
> > on rq-load, may yield a much more scalable system.
> 
> That is not really feasible to implement. How can you guarantee latencies when 
> the system is overloaded? If you have 1000 tasks all trying to get scheduled 
> in say 10ms you end up running for only 10 microseconds at a time. That will 
> achieve the exact opposite whereby as the load increases the runtime gets 
> shorter and shorter till cpu cache trashing and no real work occurs.

Con

Think that Al may have a point.  Yes there are cases where it not work well.  How about
reversing the idea?  Set the initial timeslice so it will give good latency with a fairly high 
load.  Increase the timeslice when the load is lower than the load we attempt to guarantee
good latency for.  Idea being to reduce the number of context switches while retaining a
fixed latency.

Ed Tomlinson

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  6:08                                                   ` Mike Galbraith
@ 2007-03-17 13:56                                                     ` Ed Tomlinson
  2007-03-18 19:37                                                     ` Lee Revell
  2007-03-19  2:27                                                     ` David Schwartz
  2 siblings, 0 replies; 198+ messages in thread
From: Ed Tomlinson @ 2007-03-17 13:56 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Nicholas Miell, Con Kolivas, ck, Ingo Molnar, Al Boldi,
	Andrew Morton, linux-kernel

On Saturday 17 March 2007 02:08, Mike Galbraith wrote:
> P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
> world has to adjust to fit this new scheduler.

If a new scheduler has a better 'normal' performance adjusting to its quirks
is fine.  Your testing is important.  We need to understand what needs
to be tweaked and why.  

>From my POV 0.30 is better than mainline - it handles my load(s) better.

Thanks
Ed Tomlinson

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17  9:58                                                           ` Mike Galbraith
  2007-03-17 10:49                                                             ` Mike Galbraith
@ 2007-03-17 13:58                                                             ` michael chang
  2007-03-17 20:55                                                             ` Al Boldi
  2007-03-19 16:03                                                             ` Mark Lord
  3 siblings, 0 replies; 198+ messages in thread
From: michael chang @ 2007-03-17 13:58 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On 3/17/07, Mike Galbraith <efault@gmx.de> wrote:
> On Sat, 2007-03-17 at 20:48 +1100, Con Kolivas wrote:
>
> > The most frustrating part of a discussion of this nature on lkml is that
> > earlier information in a thread seems to be long forgotten after a few days
> > and all that is left is the one reporter having a problem.
>
> One?  I'm not the only person who reported regression.
>

Con is over-simplifying here -- he is saying the number of
regression-reporters is dwarfed by the number of positive responses.
One regression in particular, though, is rather persistent, and we are
unsure of how to solve it in a way that fits the ideals of RSDL.

There has been one class of problems that have been reported against
RSDL (problems with X or some X+GL-based app in the context of
CPU-intensive programs) that has yet to be "resolved", AFAICS. The
possible solutions being brought up (e.g. auto-nice in the kernel) go
against the fundamental reasons and logic behind RSDL.

The latest (RSDL 0.31) is supposed to help with relatively-niced
latency-sensitive programs (which were reported earlier) and there is
some progress, although I believe maybe one or two people are still
reporting issues here. We are making progress in this circumstance
that Con feels is acceptable. (Again, the remaining potential
solutions being proposed so far go against RSDL's fundamental ideals.
If we actually have to use these solutions, then basically we're just
doing the vanilla scheduler all over again -- defeating the purpose of
RSDL in the first place.)

akpm, IIRC, reported an issue on PPC with an early version of RSDL
(presumably a broken one) when he tried to build the first public
release with -mm; we have yet to hear negative feedback from him (on
the -ck list at least) saying this problem persisted with future
releases.

I think the idea is that Con has seen much greater positive response
(particularly with earlier releases, i.e. a week ago), and a lack of
thorough, viable solutions (particularly ones that either come with a
patch or fit in with the current ideal of RSDL) for the complaints
being brought up that he is feeling frustrated. (Con's neck problem is
preventing him from spending too much time coding solutions himself,
and I feel the discussion that is going on here is starting to become
counterproductive in consideration of that.)

I've also seen no one mention the possibility of slowdowns on RSDL
being a placebo effect type thing.  That said, this "regression" on
the "broken code" in X is probably an issue that we do need to at
least watch for. The "average" desktop user, I would assume, is
probably running X. (The question is, whether they would be running
"too many" CPU-intensive programs at the same time. If so, then I
would imagine they're not "average" any more, but I could some
misunderstandings about this.) I think many of us have certain ideals
about the performance of graphical user interfaces (we'd like X to be
super-responsive, all the time, regardless of what processes are
running, with as little impact on the CPU-intensive processes as
possible) although I don't think we're at the level where we can get
there yet. The question is (for the time being) how we're willing to
assign things in this circumstance.

And, as far as I can see, no one appears to have even attempted to
answer Con's question of how much CPU a nice 5 process should get
relative to nice 0, etc., apart from maybe Con himself.

Personally, I think a nice 5 process should get 50% of the CPU time of
a nice 0, but I'm not sure how that would scale to nice 19 or nice
-19.

-- 
-- Michael Chang
~Just the crazy copy cat~

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 11:07                                                       ` [ck] " jos poortvliet
  2007-03-17 12:44                                                         ` Ingo Molnar
@ 2007-03-17 14:04                                                         ` Ed Tomlinson
  1 sibling, 0 replies; 198+ messages in thread
From: Ed Tomlinson @ 2007-03-17 14:04 UTC (permalink / raw)
  To: jos poortvliet
  Cc: ck, Ingo Molnar, Nicholas Miell, Al Boldi, Mike Galbraith,
	linux-kernel, Andrew Morton, Linus Torvalds

On Saturday 17 March 2007 07:07, jos poortvliet wrote:
> Op Saturday 17 March 2007, schreef Ingo Molnar:
> > so it is not at all clear to me that RSDL is indeed an improvement, if
> > it does not have comparable auto-nice properties.
> 
> Wasn't the point of RSDL to get rid of the auto-nice, because it caused 
> starvation, unpredictable behaviour and other problems?
> 
> Anyway, I think it's a good thing we keep having a look at mike's problem, but 
> it's not clear to me how far he got in solving it. Does the latest patch 
> solve the interactivity problem, providing X is niced -10 (or something)??? 
> 
> If it does, I think that's the solution - at least until the X ppl fix X 
> itself. Distributions can just go back renicing X (they did that before, 
> after all), and the biggest problem is fixed. Then all other users can have 
> the improvements RSDL offers, the developers can rejoice over the simpler and 
> cleaner design and code, and everybody is happy.
> 
> If it doesn't solve the problem, more work is in order. I think ignoring a 
> clear regression to mainline, no matter how rare, isn't smart. It might 
> indicate an underlying problem, and even if it doesn't - you don't want ppl 
> complaining the new kernel isn't interactive anymore or something...

Ingo,

The other point to make here is that you only need to nice X if you are heavily
overloading the box.  Here X is NOT niced and RSDL 0.30 is giving me better
performance.

Ed Tomlinson

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17  8:46                                                         ` Mike Galbraith
@ 2007-03-17 14:09                                                           ` Mark Glines
  2007-03-17 14:33                                                             ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Mark Glines @ 2007-03-17 14:09 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: David Lang, Al Boldi, linux-kernel, ck, Andrew Morton,
	Linus Torvalds, Nicholas Miell

On Sat, 17 Mar 2007 09:46:27 +0100
Mike Galbraith <efault@gmx.de> wrote:

> On Fri, 2007-03-16 at 23:44 -0800, David Lang wrote:
> 
> > why isn't niceing X to -10 an acceptable option?
> 
> Xorg's priority is only part of the problem.  Every client that needs
> a substantial quantity of cpu while a hog is running will also need
> to be negative nice, no?

I don't suppose you can be a bit more specific, and define how much CPU
constitutes a "substantial quantity"?  It looks to me like X already got
about half of a CPU.

>  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
> 6599 root      26   0  174m  30m 8028 R   51  3.1   7:08.70 0 Xorg


I'm hoping that actually quantifying this issue will result in a better
understanding of the issue...

Thanks,

Mark

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-16 22:30                                             ` Mike Galbraith
  2007-03-16 23:05                                               ` [ck] " Dirk Schoebel
  2007-03-17  4:24                                               ` Nicholas Miell
@ 2007-03-17 14:32                                               ` Rik van Riel
  2007-03-17 14:43                                                 ` Mike Galbraith
  2007-03-17 15:39                                                 ` Ingo Molnar
  2 siblings, 2 replies; 198+ messages in thread
From: Rik van Riel @ 2007-03-17 14:32 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

Mike Galbraith wrote:

> If this is your final answer to the problem space, I am done testing,
> and as far as _I_ am concerned, your scheduler is an utter failure.

The increased AIM7 throughput (and the other benchmark results)
looked very promising to me.

I wonder what we're doing wrong in the normal scheduler...

-- 
Politics is the struggle between those who want to make their country
the best in the world, and those who believe it already is.  Each group
calls the other unpatriotic.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 14:09                                                           ` [ck] " Mark Glines
@ 2007-03-17 14:33                                                             ` Mike Galbraith
  2007-03-17 14:54                                                               ` Mark Glines
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 14:33 UTC (permalink / raw)
  To: Mark Glines
  Cc: David Lang, Al Boldi, linux-kernel, ck, Andrew Morton,
	Linus Torvalds, Nicholas Miell

On Sat, 2007-03-17 at 07:09 -0700, Mark Glines wrote:

> I don't suppose you can be a bit more specific, and define how much CPU
> constitutes a "substantial quantity"?  It looks to me like X already got
> about half of a CPU.
> 
> >  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
> > 6599 root      26   0  174m  30m 8028 R   51  3.1   7:08.70 0 Xorg
> 

This is a snippet from a hacked up by me version of RSDL.30, not stock.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 14:32                                               ` Rik van Riel
@ 2007-03-17 14:43                                                 ` Mike Galbraith
  2007-03-17 15:39                                                 ` Ingo Molnar
  1 sibling, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 14:43 UTC (permalink / raw)
  To: Rik van Riel
  Cc: Con Kolivas, ck, Ingo Molnar, Al Boldi, Andrew Morton, linux-kernel

On Sat, 2007-03-17 at 10:32 -0400, Rik van Riel wrote:
> Mike Galbraith wrote:
> 
> > If this is your final answer to the problem space, I am done testing,
> > and as far as _I_ am concerned, your scheduler is an utter failure.
> 
> The increased AIM7 throughput (and the other benchmark results)
> looked very promising to me.
> 
> I wonder what we're doing wrong in the normal scheduler...

Yeah, I tried AIM7 with both schedulers, but apparently you need size
mondo hardware.  My poor little box produced identical results.

(I only noticed one other benchmark result, and IIRC it showed a ~5%
drop in throughput, which was attributed to higher context switch rate)

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 14:33                                                             ` Mike Galbraith
@ 2007-03-17 14:54                                                               ` Mark Glines
  2007-03-17 14:58                                                                 ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Mark Glines @ 2007-03-17 14:54 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: David Lang, Al Boldi, linux-kernel, ck, Andrew Morton,
	Linus Torvalds, Nicholas Miell

On Sat, 17 Mar 2007 15:33:41 +0100
Mike Galbraith <efault@gmx.de> wrote:
> > >  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P
> > > COMMAND 6599 root      26   0  174m  30m 8028 R   51  3.1
> > > 7:08.70 0 Xorg
> > 
> 
> This is a snippet from a hacked up by me version of RSDL.30, not
> stock.

Oops.  Thanks, sorry about my confusion.  What does it look like without
your patches?  (I'm not sure if you've already sent this... I can't
find any in the list archives.)

Mark

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 14:54                                                               ` Mark Glines
@ 2007-03-17 14:58                                                                 ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 14:58 UTC (permalink / raw)
  To: Mark Glines
  Cc: David Lang, Al Boldi, linux-kernel, ck, Andrew Morton,
	Linus Torvalds, Nicholas Miell

On Sat, 2007-03-17 at 07:54 -0700, Mark Glines wrote:
> On Sat, 17 Mar 2007 15:33:41 +0100
> Mike Galbraith <efault@gmx.de> wrote:
> > > >  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P
> > > > COMMAND 6599 root      26   0  174m  30m 8028 R   51  3.1
> > > > 7:08.70 0 Xorg
> > > 
> > 
> > This is a snippet from a hacked up by me version of RSDL.30, not
> > stock.
> 
> Oops.  Thanks, sorry about my confusion.  What does it look like without
> your patches?  (I'm not sure if you've already sent this... I can't
> find any in the list archives.)

If you go to the beginning of the thread, I described the test load.

(I don't want to rehash... 'nuff turbulence)

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  9:48                                                         ` Con Kolivas
  2007-03-17  9:58                                                           ` Mike Galbraith
  2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
@ 2007-03-17 15:13                                                           ` Mark Hahn
  2007-03-17 17:22                                                             ` Stephen Clark
  2007-03-19 15:06                                                             ` Chris Friesen
  2 siblings, 2 replies; 198+ messages in thread
From: Mark Hahn @ 2007-03-17 15:13 UTC (permalink / raw)
  To: Con Kolivas; +Cc: linux-kernel

> So in an attempt to summarise the situation, what are the advantages of RSDL
> over mainline.
>
> Fairness

why do you think fairness is good, especially always good?

> Starvation free

even starvation is sometimes a good thing - there's a place for processes
that only use the CPU if it is otherwise idle.  that is, they are
deliberately starved all the rest of the time.

> Much lower and bound latencies

in an average sense?  also, under what circumstances does this actually
matter?  (please don't offer something like RT audio on an overloaded machine-
that's operator error, not something to design for.)

> Deterministic

not a bad thing, but how does this make itself apparent and of value 
to the user?  I think everyone is extremely comfortable with non-determinism
(stemming from networks, caches, interleaved workloads, etc)

> Better interactivity for the majority of cases.

how is this measured?  is this statement really just a reiteration of 
the latency claim?

> Now concentrating on the very last aspect since that seems to be the sticking
> point.

nah, I think the fairness and latency claims are the real issues.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 14:32                                               ` Rik van Riel
  2007-03-17 14:43                                                 ` Mike Galbraith
@ 2007-03-17 15:39                                                 ` Ingo Molnar
  1 sibling, 0 replies; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17 15:39 UTC (permalink / raw)
  To: Rik van Riel
  Cc: Mike Galbraith, Con Kolivas, ck, Al Boldi, Andrew Morton, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 556 bytes --]

* Rik van Riel <riel@redhat.com> wrote:

> The increased AIM7 throughput (and the other benchmark results) looked 
> very promising to me.
> 
> I wonder what we're doing wrong in the normal scheduler...

there's a relatively easy way to figure out whether it's related to the 
interactivity code: try AIM7 with SCHED_BATCH as well, to take most of 
the 'interactivity effects' out of the picture.

build the attached setbatch.c code and do "./setbatch $$" to change the 
shell to SCHED_BATCH (and all its future children will be SCHED_BATCH 
too).

	Ingo

[-- Attachment #2: setbatch.c --]
[-- Type: text/plain, Size: 616 bytes --]

/*
 * Set a given PID to be a SCHED_BATCH process.
 * 
 * Copyright (C) 2002 Ingo Molnar
 */
#include <time.h>
#include <stdio.h>
#include <sched.h>
#include <stdlib.h>
#include <sys/types.h>
#include <linux/unistd.h>

int main (int argc, char **argv)
{
	int pid, ret;
	struct sched_param p;

	p.sched_priority = 0;

	if (argc != 2) {
		printf("usage: setbatch <pid>\n");
		exit(-1);
	}
	pid = atol(argv[1]);

	ret = sched_setscheduler(pid, 3, &p);

	if (ret) {
		printf("could not set pid %d to SCHED_BATCH: err %d.\n", pid, ret);
		return -1;
	}
	printf("pid %d is SCHED_BATCH from now on.\n", pid);
	return 0;
}

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  4:57                                               ` Con Kolivas
  2007-03-17  5:15                                                 ` Gene Heskett
  2007-03-17 13:50                                                 ` Ed Tomlinson
@ 2007-03-17 16:12                                                 ` Al Boldi
  2 siblings, 0 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-17 16:12 UTC (permalink / raw)
  To: Con Kolivas; +Cc: ck, Ingo Molnar, Andrew Morton, linux-kernel

Con Kolivas wrote:
> DEF_TIMESLICE is a value used for smp balancing and has no effect on quota
> so I doubt you mean that value. The quota you're describing of not
> resetting is something like the sleep average idea of current systems
> where you accumulate bonus points by sleeping when you would be running
> and redeem them later. This is exactly the system I'm avoiding using in
> rsdl as you'd have to decide just how much sleep time it could accumulate,
> and over how long it would run out, and so on. ie that's the interactivity
> estimator. This is the system that destroys any guarantee of cpu
> percentage, and ends up leading to periods of relative starvation, is open
> to tuning that can either be too short or too long depending on the values
> you chose and so on.

Ok, but I can clearly see an expiration happening for sleeping tasks, like X.  
It looks like it's climbing a ladder halfway, then it sleeps, and when it 
wakes up, it continues to complete the ladder to expiration.  Couldn't this 
sleep be taken into account, and reset the ladder position accordingly?

> > The thing is, latencies are currently dependent on the number of tasks
> > in the run-queue; i.e. more rq-tasks means higher latencies, yet fixed
> > timeslices according to nice.  Just switching this the other way around,
> > by fixing latencies according to nice, and adjusting the timeslices
> > depending on rq-load, may yield a much more scalable system.
>
> That is not really feasible to implement. How can you guarantee latencies
> when the system is overloaded? If you have 1000 tasks all trying to get
> scheduled in say 10ms you end up running for only 10 microseconds at a
> time. That will achieve the exact opposite whereby as the load increases
> the runtime gets shorter and shorter till cpu cache trashing and no real
> work occurs.

Most of the time we only run a small number of tasks.  And for the case of 
1000's of tasks, you could put in some lower threshold, that would trigger 
an increase of latency, if the timeslice became ridiculously small.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:43                                                                 ` Con Kolivas
@ 2007-03-17 16:34                                                                   ` Ingo Molnar
  2007-03-18  3:23                                                                     ` Bill Davidsen
  2007-03-18  2:13                                                                   ` Bill Davidsen
  1 sibling, 1 reply; 198+ messages in thread
From: Ingo Molnar @ 2007-03-17 16:34 UTC (permalink / raw)
  To: Con Kolivas
  Cc: Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

* Con Kolivas <kernel@kolivas.org> wrote:

> Ok but please look at how it appears from my end (illness aside).

( i really think we should continue this debate after you get better. 
  Everything looks much darker when you are ill! )

> You initially said you were pleased with this design.

I said that 2 years ago about the staircase scheduler and i am still 
saying this about RSDL today. That doesnt make my position automatically 
correct though :-) For example i wrote and maintained the 4g:4g patchset 
for over 2 years and still that was no guarantee of it making sense 
upstream ;) And it was a hell of a lot of work (much uglier and nastier 
work than any scheduler hacking and tuning, believe me), and it was 
thrown away as a cute but unnecessary complication we dont need. So 
what? To me what matters is the path you walk, not the destination you 
reach.

in terms of RSDL design, i like it, still i'm kind of asking myself 
'couldnt something in this direction be done in a much simpler and less 
revolutionary way'? For example couldnt we introduce per-priority level 
timeslice quotas in the current scheme as well, instead of the very 
simplistic and crude STARVATION_LIMIT approach? Furthermore, couldnt we 
make the timeslices become smaller as the runqueue length increases, to 
make starvation less of a problem? It seems like the problem cases with 
the current scheduler arent so much centered around the interactivity 
estimator, it is more that timeslices get distributed too coarsely, 
while RSDL distributes timeslices in a more finegrained way and is thus 
less suspect to starvation under certain workloads.

in any case, regardless the technical picture, i do get nervous when a 
group of people tries to out-shout clearly valid feedback like Mike's, 
and i'll try to balance such effects out a bit. _Of course_ those people 
that are not happy with the current scheduler have a higher likelyhood 
to try out another scheduler and be happy about it - but we should not 
allow that natural bias (which, btw., could easily be the _truth_, so 
i'm not assuming anything) to stand in the way of critical thinking. I 
also get slightly nervous about what appears to be dubious technical 
claims like "this is X's fault and if you rewrite X it will work much 
better so go away and do not stand in the way of progress" or "this new 
scheduler does away with all those ugly heuristics". I'm almost getting 
a reiser4 de ja vu :-/

we should really let this sit through in -mm for at least one more 
kernel iteration, let more folks be exposed to it and keep an eye on 
regression reports. [ An _eye_, not a machine gun ;-) ]

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 13:36                                                                 ` Mike Galbraith
@ 2007-03-17 17:03                                                                   ` Gene Heskett
  2007-03-17 17:37                                                                     ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Gene Heskett @ 2007-03-17 17:03 UTC (permalink / raw)
  To: linux-kernel
  Cc: Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	Al Boldi, Nicholas Miell, Linus Torvalds, Andrew Morton

On Saturday 17 March 2007, Mike Galbraith wrote:
[...]
>Xorg is using 50% cpu because I'm asking it to.

What advantage is that giving you?

>	-Mike

-- 
Cheers, Gene
"There are four boxes to be used in defense of liberty:
 soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
Once upon a time, four AMPHIBIOUS HOG CALLERS attacked a family of
DEFENSELESS, SENSITIVE COIN COLLECTORS and brought DOWN their PROPERTY
VALUES!!

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 15:13                                                           ` RSDL v0.31 Mark Hahn
@ 2007-03-17 17:22                                                             ` Stephen Clark
  2007-03-19 15:06                                                             ` Chris Friesen
  1 sibling, 0 replies; 198+ messages in thread
From: Stephen Clark @ 2007-03-17 17:22 UTC (permalink / raw)
  To: Mark Hahn; +Cc: Con Kolivas, linux-kernel

Mark Hahn wrote:

>>So in an attempt to summarise the situation, what are the advantages of RSDL
>>over mainline.
>>
>>Fairness
>>    
>>
>
>why do you think fairness is good, especially always good?
>
>  
>
>>Starvation free
>>    
>>
>
>even starvation is sometimes a good thing - there's a place for processes
>that only use the CPU if it is otherwise idle.  that is, they are
>deliberately starved all the rest of the time.
>
>  
>
>>Much lower and bound latencies
>>    
>>
>
>in an average sense?  also, under what circumstances does this actually
>matter?  (please don't offer something like RT audio on an overloaded machine-
>that's operator error, not something to design for.)
>
>  
>
>>Deterministic
>>    
>>
>
>not a bad thing, but how does this make itself apparent and of value 
>to the user?  I think everyone is extremely comfortable with non-determinism
>(stemming from networks, caches, interleaved workloads, etc)
>
>  
>
>>Better interactivity for the majority of cases.
>>    
>>
>
>how is this measured?  is this statement really just a reiteration of 
>the latency claim?
>
>  
>
>>Now concentrating on the very last aspect since that seems to be the sticking
>>point.
>>    
>>
>
>nah, I think the fairness and latency claims are the real issues.
>-
>To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
>the body of a message to majordomo@vger.kernel.org
>More majordomo info at  http://vger.kernel.org/majordomo-info.html
>Please read the FAQ at  http://www.tux.org/lkml/
>
>  
>
I guess I wonder what is wrong with the current scheduler? I am running 
2.6.20.2 on a whitebook
laptop with a core 2 duo 1.86ghz 2gb of mem with an intel 945 shared 
memory graphics processor.
I am currently running X with beryl have vncserver connected to my main 
system, which I am
using to write this email, I am running also firefox and both of ingo 
test programs. I also am running
a make -j4 on a kernel rebuild without having any pauses or any problem 
doing anything interactively.

So again what does this new scheduler fix?

Steve

-- 

"They that give up essential liberty to obtain temporary safety, 
deserve neither liberty nor safety."  (Ben Franklin)

"The course of history shows that as a government grows, liberty 
decreases."  (Thomas Jefferson)




^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:23                                                               ` [ck] " jos poortvliet
@ 2007-03-17 17:31                                                                 ` David Schwartz
  0 siblings, 0 replies; 198+ messages in thread
From: David Schwartz @ 2007-03-17 17:31 UTC (permalink / raw)
  To: Linux-Kernel@Vger. Kernel. Org



> -----Original Message-----
> From: linux-kernel-owner@vger.kernel.org
> [mailto:linux-kernel-owner@vger.kernel.org]On Behalf Of jos poortvliet
> Sent: Saturday, March 17, 2007 5:24 AM
> To: ck@vds.kolivas.org
> Cc: Con Kolivas; Ingo Molnar; Al Boldi; Mike Galbraith;
> linux-kernel@vger.kernel.org; Nicholas Miell; Linus Torvalds; Andrew
> Morton
> Subject: Re: [ck] Re: is RSDL an "unfair" scheduler too?
>
>
> Op Saturday 17 March 2007, schreef Con Kolivas:
> > On Saturday 17 March 2007 22:49, Ingo Molnar wrote:
> > > * Con Kolivas <kernel@kolivas.org> wrote:
> > > > Despite the claims to the contrary, RSDL does not have _less_
> > > > heuristics, it does not have _any_. It's purely entitlement based.
> > >
> > > RSDL still has heuristics very much, but this time it's hardcoded into
> > > the design! Let me demonstrate this via a simple experiment.
> > >
> > > in the vanilla scheduler, the heuristics are ontop of a fairly basic
> > > (and fast) scheduler, they are plain visible and thus 'optional'. In
> > > RSDL, the heuristics are still present but more hidden and more
> > > engrained into the design.
> > >
> > > But it's easy to demonstrate this under RSDL: consider the
> following two
> > > scenarios, which implement precisely the same fundamental computing
> > > workload (everything running on the same, default nice 0 level):
> > >
> > > 1) a single task runs almost all the time and sleeps about 1
> msec every
> > >    100 msecs.
> > >
> > >    [ run "while N=1; do N=1; done &" under bash to create such a
> > >      workload. ]
> > >
> > > 2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
> > >    msec and passes the 'token' around to the next task in the
> ring. (in
> > >    essence every task will sleep 9900 msecs before getting
> another run)
> > >
> > >    [ run http://redhat.com/~mingo/scheduler-patches/ring-test.c to
> > >      create this workload. If the 100 tasks default is too
> much for you
> > >      then you can run "./ring-test 10" - that will show
> similar effects.
> > >    ]

> Well, re-reading his post, he has a point - one WOULD expect each
> of these 2
> tasks to have an equal share of CPU, and if RSDL doesn't
> currently take this
> pipe-thing into account, it might need some fixing. call it
> heuristics or not
> (after all, how could one NOT say a scheduler uses heuristics of
> some kind?).

I can't get myself to expect that no matter how hard I try. The scheduler
is, by design, fair to *tasks*. So why would one expect two different
approaches that do the same thing, but with different numbers of tasks, to
each get the same amount of CPU if they compete with each other? One would
expect the approach that uses the most tasks to get more CPU.

DS



^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 17:03                                                                   ` Gene Heskett
@ 2007-03-17 17:37                                                                     ` Mike Galbraith
  2007-03-17 18:23                                                                       ` [ck] " Kacper Wysocki
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 17:37 UTC (permalink / raw)
  To: Gene Heskett
  Cc: linux-kernel, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	Al Boldi, Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 13:03 -0400, Gene Heskett wrote:
> On Saturday 17 March 2007, Mike Galbraith wrote:
> [...]
> >Xorg is using 50% cpu because I'm asking it to.
> 
> What advantage is that giving you?

It's a test scenario.  Read the thread please, I really don't want to
repeat myself endlessly.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 17:37                                                                     ` Mike Galbraith
@ 2007-03-17 18:23                                                                       ` Kacper Wysocki
  2007-03-17 18:45                                                                         ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Kacper Wysocki @ 2007-03-17 18:23 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Gene Heskett, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On 3/17/07, Mike Galbraith <efault@gmx.de> wrote:
> On Sat, 2007-03-17 at 13:03 -0400, Gene Heskett wrote:
> > On Saturday 17 March 2007, Mike Galbraith wrote:
> > [...]
> > >Xorg is using 50% cpu because I'm asking it to.
> >
> > What advantage is that giving you?
>
> It's a test scenario.  Read the thread please, I really don't want to
> repeat myself endlessly.

I've been following "this thread" since Con's .31 announcement - and
the only reference to your test scenario that you've given is that
you're still "having trouble with x/gforce vs two niced encoders",
that you've added "some targeted unfairness", that Con's new scheduler
is "an utter failure" and something about beating it to a bloody pulp.

You haven't detailed what your test actually is or what it's trying to
acheive, nor provided anyone else with the means to reproduce it or
understand any of the behaviour you're seeing. Now if you've done that
in any other thread, consider referencing it instead of worrying about
"repeating yourself endlessly". Otherwise, you're making it pretty
clear that you're just trying to be difficult, rather than being
heard.

Remember, this thread is not only cross-posted, but also exists in a
high-volume mailing list where things aren't as easy to track as in
one's own head.

And for Mark and others who are as confused as I was, this is the
thread that Mike meant to reference:
http://thread.gmane.org/gmane.linux.kernel/503455/focus=6614

Cheers,
 -Kacper

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-17 18:23                                                                       ` [ck] " Kacper Wysocki
@ 2007-03-17 18:45                                                                         ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-17 18:45 UTC (permalink / raw)
  To: Kacper Wysocki
  Cc: Gene Heskett, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On Sat, 2007-03-17 at 19:23 +0100, Kacper Wysocki wrote:

> And for Mark and others who are as confused as I was, this is the
> thread that Mike meant to reference:
> http://thread.gmane.org/gmane.linux.kernel/503455/focus=6614

Nope, with all the back and forth (and noise), I lost track of which
thread was which.  Thanks. 

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
  2007-03-17 12:02                                                             ` Con Kolivas
  2007-03-17 12:15                                                             ` jos poortvliet
@ 2007-03-17 20:41                                                             ` Avi Kivity
  2007-03-18  1:25                                                               ` William Lee Irwin III
  2 siblings, 1 reply; 198+ messages in thread
From: Avi Kivity @ 2007-03-17 20:41 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Con Kolivas, ck, Serge Belyshev, Al Boldi, Mike Galbraith,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

Ingo Molnar wrote:
> * Con Kolivas <kernel@kolivas.org> wrote:
>
>   
>> Despite the claims to the contrary, RSDL does not have _less_ 
>> heuristics, it does not have _any_. It's purely entitlement based.
>>     
>
> RSDL still has heuristics very much, but this time it's hardcoded into 
> the design! Let me demonstrate this via a simple experiment.
>
>
>   
[...]

> But it's easy to demonstrate this under RSDL: consider the following two 
> scenarios, which implement precisely the same fundamental computing 
> workload (everything running on the same, default nice 0 level):
>
> 1) a single task runs almost all the time and sleeps about 1 msec every
>    100 msecs.
>
>    [ run "while N=1; do N=1; done &" under bash to create such a 
>      workload. ]
>
> 2) tasks are in a 'ring' where each runs for 100 msec, sleeps for 1
>    msec and passes the 'token' around to the next task in the ring. (in
>    essence every task will sleep 9900 msecs before getting another run)
>
>
> Workload #1 uses 100% of CPU time. Workload #2 uses 99% of CPU time. 
> They both do in essence the same thing.
>
> if RSDL had no heuristics at all then if i mixed #1 with #2, both 
> workloads would get roughly 50%/50% of the CPU, right? (as happens if i 
> mix #1 with #1 - both CPU-intense workloads get half of the CPU)
>   

Well, the heuristic here is that process == job.  I'm not sure heuristic 
is the right name for it, but it does point out a deficieny.

A cpu-bound process with many threads will overwhelm a cpu-bound single 
threaded threaded process.

A job with many processes will overwhelm a job with a single process.

A user with many jobs can starve a user with a single job.

I don't think the problem here is heuristics, rather that the 
scheduler's manages cpu quotas at the task level rather than at the user 
visible level.  If scheduling were managed at all three hierarchies I 
mentioned ('job' is a bit artificial, but process and user are not) then:

- if N users are contending for the cpu on a multiuser machine, each 
should get just 1/N of available cpu power.  As it is, a user can run a 
few of your #1 workloads (or a make -j 20) and slow every other user down
- your example would work perfectly (if we can communicate to the kernel 
what a job is)
- multi-threaded processes would not get an unfair advantage

-- 
Do not meddle in the internals of kernels, for they are subtle and quick to panic.


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  9:58                                                           ` Mike Galbraith
  2007-03-17 10:49                                                             ` Mike Galbraith
  2007-03-17 13:58                                                             ` michael chang
@ 2007-03-17 20:55                                                             ` Al Boldi
  2007-03-18  6:17                                                               ` Mike Galbraith
  2007-03-19 16:07                                                               ` Mark Lord
  2007-03-19 16:03                                                             ` Mark Lord
  3 siblings, 2 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-17 20:55 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

Mike Galbraith wrote:
> On Sat, 2007-03-17 at 20:48 +1100, Con Kolivas wrote:
> > The most frustrating part of a discussion of this nature on lkml is that
> > earlier information in a thread seems to be long forgotten after a few
> > days and all that is left is the one reporter having a problem.
>
> One?  I'm not the only person who reported regression.

Mike, I'm not saying RSDL is perfect, but v0.31 is by far better than 
mainline.  Try this easy test:

startx with the vesa driver
run reflect from the mesa5.0-demos
load 5 cpu-hogs
start moving the mouse

On my desktop, mainline completely breaks down, and no nicing may rescue.

On RSDL, even without nicing, the desktop is at least useable.

What we need is constructive criticism to improve the situation, either with 
mainline or with RSDL.  And for now RSDL is better.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 20:41                                                             ` Avi Kivity
@ 2007-03-18  1:25                                                               ` William Lee Irwin III
  2007-03-18  1:32                                                                 ` Linus Torvalds
  2007-03-18  5:00                                                                 ` Avi Kivity
  0 siblings, 2 replies; 198+ messages in thread
From: William Lee Irwin III @ 2007-03-18  1:25 UTC (permalink / raw)
  To: Avi Kivity
  Cc: Ingo Molnar, Con Kolivas, ck, Serge Belyshev, Al Boldi,
	Mike Galbraith, linux-kernel, Nicholas Miell, Linus Torvalds,
	Andrew Morton

On Sat, Mar 17, 2007 at 10:41:01PM +0200, Avi Kivity wrote:
> Well, the heuristic here is that process == job.  I'm not sure heuristic 
> is the right name for it, but it does point out a deficieny.
> A cpu-bound process with many threads will overwhelm a cpu-bound single 
> threaded threaded process.
> A job with many processes will overwhelm a job with a single process.
> A user with many jobs can starve a user with a single job.
> I don't think the problem here is heuristics, rather that the 
> scheduler's manages cpu quotas at the task level rather than at the user 
> visible level.  If scheduling were managed at all three hierarchies I 
> mentioned ('job' is a bit artificial, but process and user are not) then:
> - if N users are contending for the cpu on a multiuser machine, each 
> should get just 1/N of available cpu power.  As it is, a user can run a 
> few of your #1 workloads (or a make -j 20) and slow every other user down
> - your example would work perfectly (if we can communicate to the kernel 
> what a job is)
> - multi-threaded processes would not get an unfair advantage

I like this notion very much. I should probably mention pgrp's' typical
association with the notion of "job," at least as far as shells go.

One issue this raises is prioritizing users on a system, threads within
processes, jobs within users, etc. Maybe sessions would make sense, too,
and classes of users, and maybe whatever they call the affairs that pid
namespaces are a part of (someone will doubtless choke on the hierarchy
depth implied here but it doesn't bother me in the least). It's not a
deep or difficult issue. There just needs to be some user API to set the
relative scheduling priorities of all these affairs within the next higher
level of hierarchy, regardless of how many levels of hierarchy (aleph_0?).


-- wli

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 12:52                             ` Con Kolivas
  2007-03-12 14:14                               ` Al Boldi
@ 2007-03-18  1:30                               ` Bill Davidsen
  2007-03-18 10:50                               ` [ck] " jos poortvliet
  2 siblings, 0 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-18  1:30 UTC (permalink / raw)
  To: Con Kolivas; +Cc: Al Boldi, ck list, linux-kernel

Con Kolivas wrote:
> On Monday 12 March 2007 22:26, Al Boldi wrote:
>> Con Kolivas wrote:
>>> On Monday 12 March 2007 15:42, Al Boldi wrote:
>>>> Con Kolivas wrote:
>>>>> On Monday 12 March 2007 08:52, Con Kolivas wrote:
>>>>>> And thank you! I think I know what's going on now. I think each
>>>>>> rotation is followed by another rotation before the higher priority
>>>>>> task is getting a look in in schedule() to even get quota and add
>>>>>> it to the runqueue quota. I'll try a simple change to see if that
>>>>>> helps. Patch coming up shortly.
>>>>> Can you try the following patch and see if it helps. There's also one
>>>>> minor preemption logic fix in there that I'm planning on including.
>>>>> Thanks!
>>>> Applied on top of v0.28 mainline, and there is no difference.
>>>>
>>>> What's it look like on your machine?
>>> The higher priority one always get 6-7ms whereas the lower priority one
>>> runs 6-7ms and then one larger perfectly bound expiration amount.
>>> Basically exactly as I'd expect. The higher priority task gets precisely
>>> RR_INTERVAL maximum latency whereas the lower priority task gets
>>> RR_INTERVAL min and full expiration (according to the virtual deadline)
>>> as a maximum. That's exactly how I intend it to work. Yes I realise that
>>> the max latency ends up being longer intermittently on the niced task but
>>> that's -in my opinion- perfectly fine as a compromise to ensure the nice
>>> 0 one always gets low latency.
>> I think, it should be possible to spread this max expiration latency across
>> the rotation, should it not?
> 
> There is a way that I toyed with of creating maps of slots to use for each 
> different priority, but it broke the O(1) nature of the virtual deadline 
> management. Minimising algorithmic complexity seemed more important to 
> maintain than getting slightly better latency spreads for niced tasks. It 
> also appeared to be less cache friendly in design. I could certainly try and 
> implement it but how much importance are we to place on latency of niced 
> tasks? Are you aware of any usage scenario where latency sensitive tasks are 
> ever significantly niced in the real world?
> 
It depends on how you reconcile "completely fair" and "order of 
magnitude blips in latency." It looks (from the results, not the code) 
as if nice is implemented by round-robin scheduling followed by once in 
a while just not giving the CPU to the nice task for a while. Given the 
smooth nature of the performance otherwise, it's more obvious than if 
you weren't doing such a good job most of the time.

Ugly stands out more on something beautiful!

-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  1:25                                                               ` William Lee Irwin III
@ 2007-03-18  1:32                                                                 ` Linus Torvalds
  2007-03-18  5:24                                                                   ` Willy Tarreau
  2007-03-18  5:00                                                                 ` Avi Kivity
  1 sibling, 1 reply; 198+ messages in thread
From: Linus Torvalds @ 2007-03-18  1:32 UTC (permalink / raw)
  To: William Lee Irwin III
  Cc: Avi Kivity, Ingo Molnar, Con Kolivas, ck, Serge Belyshev,
	Al Boldi, Mike Galbraith, linux-kernel, Nicholas Miell,
	Andrew Morton

On Sat, 17 Mar 2007, William Lee Irwin III wrote:
> 
> One issue this raises is prioritizing users on a system, threads within
> processes, jobs within users, etc.

Doing some "classing" even by just euid might be a good idea. It would 
actually catch X automatically most of the time, because the euid of the X 
server is likely to be root, so even for the "trivial" desktop example, it 
would kind of automatically mean that X would get about 50% of CPU time 
even if you have a hundred user clients, just because that's "fair" by 
euid.

Dunno. I guess a lot of people would like to then manage the classes, 
which would be painful as hell. 

		Linus

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 12:43                                                                 ` Con Kolivas
  2007-03-17 16:34                                                                   ` Ingo Molnar
@ 2007-03-18  2:13                                                                   ` Bill Davidsen
  2007-03-18  3:20                                                                     ` Kasper Sandberg
  2007-03-18  5:37                                                                     ` Mike Galbraith
  1 sibling, 2 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-18  2:13 UTC (permalink / raw)
  To: Con Kolivas
  Cc: Ingo Molnar, Al Boldi, Mike Galbraith, linux-kernel, ck,
	Nicholas Miell, Linus Torvalds, Andrew Morton

Con Kolivas wrote:
> On Saturday 17 March 2007 23:28, Ingo Molnar wrote:
>> * Con Kolivas <kernel@kolivas.org> wrote:
>>> We're obviously disagreeing on what heuristics are [...]
>> that could very well be so - it would be helpful if you could provide
>> your own rough definition for the term, so that we can agree on how to
>> call things?
>>
>> [ in any case, there's no rush here, please reply at your own pace, as
>>   your condition allows. I wish you a speedy recovery! ]
>>
>>> You're simply cashing in on the deep pipes that do kernel work for
>>> other tasks. You know very well that I dropped the TASK_NONINTERACTIVE
>>> flag from rsdl which checks that tasks are waiting on pipes and you're
>>> exploiting it.
>> Con, i am not 'cashing in' on anything and i'm not 'exploiting'
>> anything. The TASK_NONINTERACTIVE flag is totally irrelevant to my
>> argument because i was not testing the vanilla scheduler, i was testing
>> RSDL. I could have written this test using plain sockets, because i was
>> testing RSDL's claim of not having heuristics, i was not testing the
>> vanilla scheduler.
>>
>> I have simply replied to this claim of yours:
>>>> Despite the claims to the contrary, RSDL does not have _less_
>>>> heuristics, it does not have _any_. [...]
>> and i showed you a workload under _RSDL_ that clearly shows that RSDL is
>> an unfair scheduler too.
>>
>> my whole point was to counter the myth of 'RSDL has no heuristics'. Of
>> course it has heuristics, which results in unfairness. (If it didnt have
>> any heuristics that tilt the balance of scheduling towards sleep-intense
>> tasks then a default Linux desktop would not be usable at all.)
>>
>> so the decision is _not_ a puristic "do we want to have heuristics or
>> not", the question is a more practical "which heuristics are simpler,
>> which heuristics are more flexible, which heuristics result in better
>> behavior".
>>
>> 	Ingo
> 
> Ok but please look at how it appears from my end (illness aside).
> 
> I spend 3 years just diddling with scheduler code trying my hardest to find a 
> design that fixes a whole swag of problems we still have, and a swag of 
> problems we might get with other fixes.
> 
> You initially said you were pleased with this design.
> 
> ..lots of code, testing, bugfixes and good feedback.
> 
> Then Mike has one testcase that most other users disagree is worthy of being 
> considered a regresssion. You latched onto that and basically called it a 
> showstopper in spite of who knows how many other positive things.
> 
> Then you quickly produce a counter patch designed to kill off RSDL with a 
> config option for mainline.
> 
> Then you boldly announce on LKML "is RSDL an "unfair" scheduler too?" with 
> some test case you whipped up to try and find fault with the design.

No damn it! He's pointing out that you do have heuristics, they are just 
built into the design. And of course he's whipping up test cases, how 
else can anyone help you find corner cases where it behaves in an 
unexpected or undesirable manner?

I think he's trying to help, please stop taking it personally.
> 
> What am I supposed to think? Considering just how many problems I have 
> addressed and tried to correct with RSDL succesfully I'm surprised that 
> despite your enthusiasm for it initially you have spent the rest of the time 
> trying to block it.
> 
> Please, either help me (and I'm in no shape to code at the moment despite what 
> I have done so far), or say you have no intention of including it. I'm 
> risking paralysis just by sitting at the computer right now so I'm dropping 
> the code as is at the moment and will leave it up to your better judgement as 
> to what to do with it.
> 
Actually I think Ingo has tried to help get it in, that's his patch 
offered for CONFIG_SCHED_FAIR, lets people try it and all.

Now for something constructive... by any chance is Mike running KDE 
instead of GNOME? I only had a short time to play because I had to look 
at another problem in 2.6.21-rc3 (nbd not working), so the test machine 
is in use. But it looked as if behavior was not as smooth with KDE. May 
that thought be useful.

-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  2:13                                                                   ` Bill Davidsen
@ 2007-03-18  3:20                                                                     ` Kasper Sandberg
  2007-03-18  5:37                                                                     ` Mike Galbraith
  1 sibling, 0 replies; 198+ messages in thread
From: Kasper Sandberg @ 2007-03-18  3:20 UTC (permalink / raw)
  To: Bill Davidsen
  Cc: Con Kolivas, Ingo Molnar, Al Boldi, Mike Galbraith, linux-kernel,
	ck, Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 21:13 -0500, Bill Davidsen wrote:
> Con Kolivas wrote:
<snip>
> 
> Now for something constructive... by any chance is Mike running KDE 
> instead of GNOME? I only had a short time to play because I had to look 
> at another problem in 2.6.21-rc3 (nbd not working), so the test machine 
> is in use. But it looked as if behavior was not as smooth with KDE. May 
> that thought be useful.

Now i must say here, i use KDE, and have been testing 0.31, and i have
been observing all the effects, in contrast with vanilla and staircase.
this is on 2.6.20

I do not notice kde being slower, in fact i notice various interactivity
"speedups" compared to mainline.

first one i noticed(because i deliberately tested) was kicker. Kickers
hide function was very very smooth during boot, and still is under load.
It is not entirely as smooth under vanilla during boot(i suspect IO
issue), under staircase it is, however under huge loads, it even is not
smooth under staircase.

then i started my konsole, and the one thing i immediately noticed was
that zsh started instantly, usually i can see/(feel) zsh starting, as in
it takes like 0.2 before my prompt comes. This is simply gone now with
rsdl, behavior used to be the same in vanilla/rsdl.

But the most interresting, and dare i say, completely unexpected things
are much more important.

I have for a long time had issues with tvtime, if i did stuff like move
windows, tvtime would drop frames, or simply hovering javascript stuff
on sites in konqueror, (this seemed to be introduced in 2.6.~5+), cause
in EARLY 2.6 i did not have this problem, but it was the same in
staircase and vanilla. But this is gone completely, tvtime no longer
drops any frames when doing this.

Another thing i noticed, which almost blew my mind as badly as with
tvtime, was with wine, and world of warcraft(and nvidia blob driver, but
this IS what many "desktop" users runs). While loading a level, the
sound no longer skipped. This problem afaik, EVERYBODY which runs wine
+wow has(unless they change the buffer size to ridicoulesly high which
annoys gameplay).

And more playing wow has shown me that rsdl seems to be doing an
extremely good job of not letting other tasks interfere. For example i
have spamasassin going quite very often (every minute, for lots of
accounts), and this usually kills all sorts of high performance opengl
stuff, causing severe stuttering, but with RSDL i only noticed my
framerate dropping, but no strange stuttering as i usually experience,
only lowered fps, which to me is quite natural, as spamasassin now has
to use cpu.

the last of my immediate observations are ktorrent. my ktorrent has LOTS
of open fd's (in fact like ~5-6k), and the application tends to be very
sluggish, but that is not so anymore.

And now for the side effects i have observed.

The only down right "regression" (which i wouldnt even call it, cause
its a NATURAL thing when other stuff uses cpu) is that when i play a
movie in kaffeine, and move the kaffeine window insanely fast all over
the desktop, the audio skipped once, the strange thing is, even if i
keep moving it, it does not skip any more, only the one time when you
start to move it. But 720p h264 does take SOME cpu to decode, so i dont
feel that this is unfair.

i have however with X observed that under high load(720p h264 video
playback, while doing video encoding, ktorrent running, and make
running) that windows take longer time to redraw in X, if i move windows
over each other, but not really a problem.

another effect i have observed is that stuff does not stutter as much
when under load, it simply gets slower.(but i suppose thats what i said
when describing the good effects i have noticed)

it should be noted that i have not reniced a single thing, and this is a
singlecore amd64 2ghz with 1.5gb ram.

> 

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-17 16:34                                                                   ` Ingo Molnar
@ 2007-03-18  3:23                                                                     ` Bill Davidsen
  0 siblings, 0 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-18  3:23 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Con Kolivas, Serge Belyshev, Al Boldi, Mike Galbraith,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton,
	vishpat

Ingo Molnar wrote:

> * Con Kolivas <kernel@kolivas.org> wrote:
> 
>> Ok but please look at how it appears from my end (illness aside).
> 
> ( i really think we should continue this debate after you get better. 
>   Everything looks much darker when you are ill! )
> 
>> You initially said you were pleased with this design.
> 
> I said that 2 years ago about the staircase scheduler and i am still 
> saying this about RSDL today. That doesnt make my position automatically 
> correct though :-) For example i wrote and maintained the 4g:4g patchset 
> for over 2 years and still that was no guarantee of it making sense 
> upstream ;) And it was a hell of a lot of work (much uglier and nastier 
> work than any scheduler hacking and tuning, believe me), and it was 
> thrown away as a cute but unnecessary complication we dont need. So 
> what? To me what matters is the path you walk, not the destination you 
> reach.
> 
> in terms of RSDL design, i like it, still i'm kind of asking myself 
> 'couldnt something in this direction be done in a much simpler and less 
> revolutionary way'? For example couldnt we introduce per-priority level 
> timeslice quotas in the current scheme as well, instead of the very 
> simplistic and crude STARVATION_LIMIT approach? Furthermore, couldnt we 
> make the timeslices become smaller as the runqueue length increases, to 
> make starvation less of a problem? It seems like the problem cases with 
> the current scheduler arent so much centered around the interactivity 
> estimator, it is more that timeslices get distributed too coarsely, 
> while RSDL distributes timeslices in a more finegrained way and is thus 
> less suspect to starvation under certain workloads.

Yes. The "doorknob scheduler" was a scheduler which worked as follows: 
the runnable processes in the system were put in a priority sorted list 
and counted. Then the length of one cycle (turn) was divided by the 
number of processes and that was the timeslice. In the next version 
upper and lower limits were put on the length of a timeslice, so the 
system didn't get eaten by context switches under load or be jerky under 
light load. That worked fairly well.

Then people got into creating unfairness to address what they thought 
were corner cases, the code turned into a plumber's nightmare, and 
occasional jackpot cases created occasional (non-reproducible) hangs.

Finally management noted that the peripherals cost three times as much 
as the CPU, so jobs doing i/o should run first. That made batch run like 
the clappers of hell, and actually didn't do all the bad things you 
might expect. User input was waitio, disk was waitio, response was about 
as good as it could be for that hardware.

===> it might be useful to give high priority to a process going from 
waitio to runable state, once only.

The "doorknob" term came from "everybody gets a turn" and the year was 1970.

Avi Kivity wrote:
> Well, the heuristic here is that process == job. I'm not sure
> heuristic is the right name for it, but it does point out a
> deficieny.
 >
> A cpu-bound process with many threads will overwhelm a cpu-bound
> single threaded threaded process.
 >
 > A job with many processes will overwhelm a job with a single process.
 >
 > A user with many jobs can starve a user with a single job.
 >
> I don't think the problem here is heuristics, rather that the 
> scheduler's manages cpu quotas at the task level rather than at the
> user visible level. If scheduling were managed at all three
> hierarchies I mentioned ('job' is a bit artificial, but process and
> user are not) then:
 >
> - if N users are contending for the cpu on a multiuser machine, each 
> should get just 1/N of available cpu power. As it is, a user can run
> a few of your #1 workloads (or a make -j 20) and slow every other
> user down - your example would work perfectly (if we can communicate
> to the kernel what a job is)
 >
 > - multi-threaded processes would not get an unfair advantage

If we wanted to do this, a job would be defined as all children or 
threads of the oldest parent process with a PPID of one. So if I logged 
on  and did
   make -j4
on a kernel, and someone else did:
   find /var -type f | xargs grep -l zumblegarfe
and someone else was doing:
   foo & mumble & barfe

We would all be equal. That's good! And there would be some recursive 
scheduler which would pick a "job" and then a process, and run it. That 
too is good!

But we have a mail server, and there are 671 threads with a socket and 
POP3 user on each one, and they only get 1/N of a job worth of CPU 
between them, and that sucks rocks off the bottom of the ocean. So 
pretty soon the code gets some "fixes" to make POP3 work, and X work, 
and the code once again becomes plumber's nightmare... Then you start 
playing with process groups to address some of this, and address exec() 
corner cases, and complexity goes way up again. This is NOT an easy problem!

I think Con has done a good job, I think in most cases (heresy) the 
current mainline scheduler does a pretty good job. I'm in favor of 
having several schedulers, because I don't believe any one can match the 
behavior goals of all users.

-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  1:25                                                               ` William Lee Irwin III
  2007-03-18  1:32                                                                 ` Linus Torvalds
@ 2007-03-18  5:00                                                                 ` Avi Kivity
  1 sibling, 0 replies; 198+ messages in thread
From: Avi Kivity @ 2007-03-18  5:00 UTC (permalink / raw)
  To: William Lee Irwin III
  Cc: Ingo Molnar, Con Kolivas, ck, Serge Belyshev, Al Boldi,
	Mike Galbraith, linux-kernel, Nicholas Miell, Linus Torvalds,
	Andrew Morton

William Lee Irwin III wrote:
> On Sat, Mar 17, 2007 at 10:41:01PM +0200, Avi Kivity wrote:
>   
>> Well, the heuristic here is that process == job.  I'm not sure heuristic 
>> is the right name for it, but it does point out a deficieny.
>> A cpu-bound process with many threads will overwhelm a cpu-bound single 
>> threaded threaded process.
>> A job with many processes will overwhelm a job with a single process.
>> A user with many jobs can starve a user with a single job.
>> I don't think the problem here is heuristics, rather that the 
>> scheduler's manages cpu quotas at the task level rather than at the user 
>> visible level.  If scheduling were managed at all three hierarchies I 
>> mentioned ('job' is a bit artificial, but process and user are not) then:
>> - if N users are contending for the cpu on a multiuser machine, each 
>> should get just 1/N of available cpu power.  As it is, a user can run a 
>> few of your #1 workloads (or a make -j 20) and slow every other user down
>> - your example would work perfectly (if we can communicate to the kernel 
>> what a job is)
>> - multi-threaded processes would not get an unfair advantage
>>     
>
> I like this notion very much. I should probably mention pgrp's' typical
> association with the notion of "job," at least as far as shells go.
>
>   

One day I might understand what pgrp, sessions, and all that stuff is.

> One issue this raises is prioritizing users on a system, threads within
> processes, jobs within users, etc. Maybe sessions would make sense, too,
> and classes of users, and maybe whatever they call the affairs that pid
> namespaces are a part of (someone will doubtless choke on the hierarchy
> depth implied here but it doesn't bother me in the least). It's not a
> deep or difficult issue. There just needs to be some user API to set the
> relative scheduling priorities of all these affairs within the next higher
> level of hierarchy, regardless of how many levels of hierarchy (aleph_0?).
>   

I think it follows naturally.

Note that more than the scheduler needs to be taught about this.  The 
page cache and swapper should prevent a user from swapping out too many 
of another user's pages when there is contention for memory; there 
should be per-user quotas for network and disk bandwidth, etc.  Until 
then people who want true multiuser with untrusted users will be forced 
to use ugly hacks like virtualization.  Fortunately it seems the 
container people are addressing at least a part of this.

-- 
Do not meddle in the internals of kernels, for they are subtle and quick to panic.


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  1:32                                                                 ` Linus Torvalds
@ 2007-03-18  5:24                                                                   ` Willy Tarreau
  2007-03-18  5:55                                                                     ` Avi Kivity
                                                                                       ` (2 more replies)
  0 siblings, 3 replies; 198+ messages in thread
From: Willy Tarreau @ 2007-03-18  5:24 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: William Lee Irwin III, Avi Kivity, Ingo Molnar, Con Kolivas, ck,
	Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Andrew Morton

On Sat, Mar 17, 2007 at 06:32:29PM -0700, Linus Torvalds wrote:
> 
> 
> On Sat, 17 Mar 2007, William Lee Irwin III wrote:
> > 
> > One issue this raises is prioritizing users on a system, threads within
> > processes, jobs within users, etc.
> 
> Doing some "classing" even by just euid might be a good idea. It would 
> actually catch X automatically most of the time, because the euid of the X 
> server is likely to be root, so even for the "trivial" desktop example, it 
> would kind of automatically mean that X would get about 50% of CPU time 
> even if you have a hundred user clients, just because that's "fair" by 
> euid.

Warning: all these ideas seem interesting for desktop, but are definitely
not for servers. I found RSDL to be excellent on servers, compared to
mainline in which some services are starving under load. I can understand
that on the desktop people want some unfairness, and I like the pgrp idea
for instance. But this one will certainly fail on servers, or make the
admins get grey hair very soon.

Maybe we're all discussing the problem because we have reached the point
where we need two types of schedulers : one for the desktop and one for
the servers. After all, this is already what is proposed with preempt,
it would make sense provided they share the same core and avoid ifdefs
or unused structure members. Maybe adding OPTIONAL unfairness to RSDL
would help some scenarios, but in any case it is important to retain
the default fairness it provides.

> Dunno. I guess a lot of people would like to then manage the classes, 
> which would be painful as hell. 

Sure ! I wouldn't like people to point the finger on Linux saying "hey
look, they can't write a good scheduler so you have to adjust the knobs
yourself!". I keep in mind that Solaris' scheduler is very good, both
fair and interactive. FreeBSD was good (I haven't tested for a long time).
We should manage to get something good for most usages, and optimize
later for specific uses.

Regards,
Willy

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  2:13                                                                   ` Bill Davidsen
  2007-03-18  3:20                                                                     ` Kasper Sandberg
@ 2007-03-18  5:37                                                                     ` Mike Galbraith
  2007-03-18 10:58                                                                       ` [ck] " jos poortvliet
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  5:37 UTC (permalink / raw)
  To: Bill Davidsen
  Cc: Con Kolivas, Ingo Molnar, Al Boldi, linux-kernel, ck,
	Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 21:13 -0500, Bill Davidsen wrote:

> Now for something constructive... by any chance is Mike running KDE 
> instead of GNOME?

Yes.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  5:24                                                                   ` Willy Tarreau
@ 2007-03-18  5:55                                                                     ` Avi Kivity
  2007-03-19  2:27                                                                       ` David Schwartz
  2007-03-18  6:09                                                                     ` Bill Huey
  2007-03-18  6:26                                                                     ` Mike Galbraith
  2 siblings, 1 reply; 198+ messages in thread
From: Avi Kivity @ 2007-03-18  5:55 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, William Lee Irwin III, Ingo Molnar, Con Kolivas,
	ck, Serge Belyshev, Al Boldi, Mike Galbraith, linux-kernel,
	Nicholas Miell, Andrew Morton

Willy Tarreau wrote:
> On Sat, Mar 17, 2007 at 06:32:29PM -0700, Linus Torvalds wrote:
>   
>> On Sat, 17 Mar 2007, William Lee Irwin III wrote:
>>     
>>> One issue this raises is prioritizing users on a system, threads within
>>> processes, jobs within users, etc.
>>>       
>> Doing some "classing" even by just euid might be a good idea. It would 
>> actually catch X automatically most of the time, because the euid of the X 
>> server is likely to be root, so even for the "trivial" desktop example, it 
>> would kind of automatically mean that X would get about 50% of CPU time 
>> even if you have a hundred user clients, just because that's "fair" by 
>> euid.
>>     
>
> Warning: all these ideas seem interesting for desktop, but are definitely
> not for servers. I found RSDL to be excellent on servers, compared to
> mainline in which some services are starving under load. I can understand
> that on the desktop people want some unfairness, and I like the pgrp idea
> for instance. But this one will certainly fail on servers, or make the
> admins get grey hair very soon.
>   

I didn't suggest adding any unfairness!  I suggested being fair by 
user/job/process instead of being fair by thread (which is actually 
unfair as it favors multi threaded processes over single threaded 
processes).

> Maybe we're all discussing the problem because we have reached the point
> where we need two types of schedulers : one for the desktop and one for
> the servers. After all, this is already what is proposed with preempt,
> it would make sense provided they share the same core and avoid ifdefs
> or unused structure members. Maybe adding OPTIONAL unfairness to RSDL
> would help some scenarios, but in any case it is important to retain
> the default fairness it provides.
>   

I hope not.  I think that reducing the timeslice base, combined with 
renicing X all the way to hell should suffice.


-- 
Do not meddle in the internals of kernels, for they are subtle and quick to panic.


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  5:24                                                                   ` Willy Tarreau
  2007-03-18  5:55                                                                     ` Avi Kivity
@ 2007-03-18  6:09                                                                     ` Bill Huey
  2007-03-18  6:37                                                                       ` Mike Galbraith
  2007-03-19 21:14                                                                       ` Bill Davidsen
  2007-03-18  6:26                                                                     ` Mike Galbraith
  2 siblings, 2 replies; 198+ messages in thread
From: Bill Huey @ 2007-03-18  6:09 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, William Lee Irwin III, Avi Kivity, Ingo Molnar,
	Con Kolivas, ck, Serge Belyshev, Al Boldi, Mike Galbraith,
	linux-kernel, Nicholas Miell, Andrew Morton, Bill Huey (hui)

On Sun, Mar 18, 2007 at 06:24:40AM +0100, Willy Tarreau wrote:
> > Dunno. I guess a lot of people would like to then manage the classes, 
> > which would be painful as hell. 
> 
> Sure ! I wouldn't like people to point the finger on Linux saying "hey
> look, they can't write a good scheduler so you have to adjust the knobs
> yourself!". I keep in mind that Solaris' scheduler is very good, both
> fair and interactive. FreeBSD was good (I haven't tested for a long time).
> We should manage to get something good for most usages, and optimize
> later for specific uses.

Like I've said in a previous email, SGI schedulers have an interactive
term in addition to the normal "nice" values. If RSDL ends up being too
rigid for desktop use, then this might be a good idea to explore in
addition to priority manipulation.

However, it hasn't been completely proven that RSDL can't handle desktop
loads and that needs to be completely explored first. It certain seems
like, from the .jpgs that were posted earlier in the thread regarding mysql
performance, that RSDL seems to have improved performance for those set
ups so it's not universally the case that it sucks for server loads. The
cause of this performance difference has yet to be pinpointed.

Also, bandwidth scheduler like this are a new critical development for
things like the -rt patch. It would benefit greatly if the RSDL basic
mechanisms (RR and deadlines) were to somehow slip into that patch and
be used for a more strict -rt based scheduling class. It would be the basis
for first-class control over process resource usage and would be a first
in Linux or any mainstream kernel.

This would be a powerful addition to Linux as a whole and RSDL should
not be dismissed without these considerations. If it can somehow be
integrated into the kernel with interactivity concerns addressed, then
it would be an all out win for the kernel in both these areas.

bill

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 20:55                                                             ` Al Boldi
@ 2007-03-18  6:17                                                               ` Mike Galbraith
  2007-03-18  6:47                                                                 ` Kasper Sandberg
  2007-03-19 16:07                                                               ` Mark Lord
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  6:17 UTC (permalink / raw)
  To: Al Boldi
  Cc: Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Linus Torvalds, Andrew Morton

On Sat, 2007-03-17 at 23:55 +0300, Al Boldi wrote:

> Mike, I'm not saying RSDL is perfect, but v0.31 is by far better than 
> mainline.  Try this easy test:
> 
> startx with the vesa driver
> run reflect from the mesa5.0-demos
> load 5 cpu-hogs
> start moving the mouse
> 
> On my desktop, mainline completely breaks down, and no nicing may rescue.

(hmm.. i would think that _renicing_ X would help that)

> On RSDL, even without nicing, the desktop is at least useable.

So neither does a good job with this load.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  5:24                                                                   ` Willy Tarreau
  2007-03-18  5:55                                                                     ` Avi Kivity
  2007-03-18  6:09                                                                     ` Bill Huey
@ 2007-03-18  6:26                                                                     ` Mike Galbraith
  2007-03-18  6:54                                                                       ` [ck] " Radoslaw Szkodzinski
  2 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  6:26 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, William Lee Irwin III, Avi Kivity, Ingo Molnar,
	Con Kolivas, ck, Serge Belyshev, Al Boldi, linux-kernel,
	Nicholas Miell, Andrew Morton

On Sun, 2007-03-18 at 06:24 +0100, Willy Tarreau wrote:

> Maybe we're all discussing the problem because we have reached the point
> where we need two types of schedulers : one for the desktop and one for
> the servers. After all, this is already what is proposed with preempt,
> it would make sense provided they share the same core and avoid ifdefs
> or unused structure members. Maybe adding OPTIONAL unfairness to RSDL
> would help some scenarios, but in any case it is important to retain
> the default fairness it provides.

Bingo.


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  6:09                                                                     ` Bill Huey
@ 2007-03-18  6:37                                                                       ` Mike Galbraith
  2007-03-18  7:35                                                                         ` Bill Huey
  2007-03-19 21:14                                                                       ` Bill Davidsen
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  6:37 UTC (permalink / raw)
  To: Bill Huey
  Cc: Willy Tarreau, Linus Torvalds, William Lee Irwin III, Avi Kivity,
	Ingo Molnar, Con Kolivas, ck, Serge Belyshev, Al Boldi,
	linux-kernel, Nicholas Miell, Andrew Morton

On Sat, 2007-03-17 at 23:09 -0700, Bill Huey wrote:

> Like I've said in a previous email, SGI schedulers have an interactive
> term in addition to the normal "nice" values. If RSDL ends up being too
> rigid for desktop use, then this might be a good idea to explore in
> addition to priority manipulation.

I've done that already (ain't perfect yet, maybe never be).  The hard
part is making it automatic, and not ruining the good side of RSDL in
the process.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-18  6:17                                                               ` Mike Galbraith
@ 2007-03-18  6:47                                                                 ` Kasper Sandberg
  2007-03-18  7:08                                                                   ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Kasper Sandberg @ 2007-03-18  6:47 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

On Sun, 2007-03-18 at 07:17 +0100, Mike Galbraith wrote:
> On Sat, 2007-03-17 at 23:55 +0300, Al Boldi wrote:
> 
> > Mike, I'm not saying RSDL is perfect, but v0.31 is by far better than 
> > mainline.  Try this easy test:
> > 
> > startx with the vesa driver
> > run reflect from the mesa5.0-demos
> > load 5 cpu-hogs
> > start moving the mouse
> > 
> > On my desktop, mainline completely breaks down, and no nicing may rescue.
> 
> (hmm.. i would think that _renicing_ X would help that)
> 
> > On RSDL, even without nicing, the desktop is at least useable.
> 
> So neither does a good job with this load.
that sorely depends on what you mean by good job.

It seems like what you call a good job is preserving the speed of the
gui(X + apps which uses it) at _ALL_ costs to other stuff.

> 
> 	-Mike
> 
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
> 


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-18  6:26                                                                     ` Mike Galbraith
@ 2007-03-18  6:54                                                                       ` Radoslaw Szkodzinski
  2007-03-18  7:58                                                                         ` Willy Tarreau
  0 siblings, 1 reply; 198+ messages in thread
From: Radoslaw Szkodzinski @ 2007-03-18  6:54 UTC (permalink / raw)
  To: linux-kernel
  Cc: Willy Tarreau, Al Boldi, Andrew Morton, William Lee Irwin III,
	ck, Avi Kivity, Linus Torvalds, Nicholas Miell

On 3/18/07, Mike Galbraith <efault@gmx.de> wrote:
> On Sun, 2007-03-18 at 06:24 +0100, Willy Tarreau wrote:
>
> > Maybe we're all discussing the problem because we have reached the point
> > where we need two types of schedulers : one for the desktop and one for
> > the servers. After all, this is already what is proposed with preempt,
> > it would make sense provided they share the same core and avoid ifdefs
> > or unused structure members. Maybe adding OPTIONAL unfairness to RSDL
> > would help some scenarios, but in any case it is important to retain
> > the default fairness it provides.
>
> Bingo.
>

Sounds like Staircase's interactive mode switch, except this actually
requires writing additional code.

The per-user system would also be nice for servers, provided there are
CPU/disc IO/swapper/... quotas or priorities at least.

All in all, I'd hate to see mldonkey eating 1/3 of CPU time, just
because it runs as another user.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-18  6:47                                                                 ` Kasper Sandberg
@ 2007-03-18  7:08                                                                   ` Mike Galbraith
  2007-03-18  7:22                                                                     ` [ck] " Radoslaw Szkodzinski
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  7:08 UTC (permalink / raw)
  To: Kasper Sandberg
  Cc: Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

On Sun, 2007-03-18 at 07:47 +0100, Kasper Sandberg wrote:

> > So neither does a good job with this load.
> that sorely depends on what you mean by good job.
> 
> It seems like what you call a good job is preserving the speed of the
> gui(X + apps which uses it) at _ALL_ costs to other stuff.

Wrong.  I call a good job giving a _preference_ to the desktop.  I call
rigid fairness impractical for the desktop, and a denial of reality.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:08                                                                   ` Mike Galbraith
@ 2007-03-18  7:22                                                                     ` Radoslaw Szkodzinski
  2007-03-18  7:38                                                                       ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Radoslaw Szkodzinski @ 2007-03-18  7:22 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Kasper Sandberg, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On 3/18/07, Mike Galbraith <efault@gmx.de> wrote:
> On Sun, 2007-03-18 at 07:47 +0100, Kasper Sandberg wrote:
>
> > > So neither does a good job with this load.
> > that sorely depends on what you mean by good job.
> >
> > It seems like what you call a good job is preserving the speed of the
> > gui(X + apps which uses it) at _ALL_ costs to other stuff.
>
> Wrong.  I call a good job giving a _preference_ to the desktop.  I call
> rigid fairness impractical for the desktop, and a denial of reality.

My sound programs (audacity, non-RT) and mplayer disaggree with you. :-)
Not to mention some more mundane stuff like Gajim. (no stall with its
slow PyGTK UI on RSDL)

(Hint: I'm using Xfce, not KDE)

I'd recon KDE regresses because of kioslaves waiting on a pipe
(communication with the app they're doing IO for) and then expiring.
That's why splitting IO from an app isn't exactly smart. It should at
least be ran in an another thread.

A much better approach would be running IO in the context of the app,
but using a common shared library.

Also, Beryl works better with RSDL too.
(blur doesn't "disable" itself sometimes - which is the result of a lag)

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  6:37                                                                       ` Mike Galbraith
@ 2007-03-18  7:35                                                                         ` Bill Huey
  0 siblings, 0 replies; 198+ messages in thread
From: Bill Huey @ 2007-03-18  7:35 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Willy Tarreau, Linus Torvalds, William Lee Irwin III, Avi Kivity,
	Ingo Molnar, Con Kolivas, ck, Serge Belyshev, Al Boldi,
	linux-kernel, Nicholas Miell, Andrew Morton, Bill Huey (hui)

On Sun, Mar 18, 2007 at 07:37:02AM +0100, Mike Galbraith wrote:
> On Sat, 2007-03-17 at 23:09 -0700, Bill Huey wrote:
> 
> > Like I've said in a previous email, SGI schedulers have an interactive
> > term in addition to the normal "nice" values. If RSDL ends up being too
> > rigid for desktop use, then this might be a good idea to explore in
> > addition to priority manipulation.
> 
> I've done that already (ain't perfect yet, maybe never be).  The hard
> part is making it automatic, and not ruining the good side of RSDL in
> the process.

I can't fully qualify what aspects of the X server that's creating this
problem. More experimentation is needed (various display drivers, etc...)
should be played with to see what kind of problematic situations arise.
It's a bit too new with too few users to know what are the specific
problems just yet. Your case is too sparse for it to be an completely
exhaustive exploration of what's failing with this scheduler.

There's a policy decision that needs to be made of whether adding another
term to the scheduler calcuation is blessed or not. My opinion is that
is should be. Meanwhile, we should experiment more with different
configurations.

bill

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:22                                                                     ` [ck] " Radoslaw Szkodzinski
@ 2007-03-18  7:38                                                                       ` Mike Galbraith
  2007-03-18  8:04                                                                         ` Mike Galbraith
                                                                                           ` (3 more replies)
  0 siblings, 4 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  7:38 UTC (permalink / raw)
  To: Radoslaw Szkodzinski
  Cc: Kasper Sandberg, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:

> I'd recon KDE regresses because of kioslaves waiting on a pipe
> (communication with the app they're doing IO for) and then expiring.
> That's why splitting IO from an app isn't exactly smart. It should at
> least be ran in an another thread.

Hm.  Sounds rather a lot like the...
X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
...that I've been getting.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-18  6:54                                                                       ` [ck] " Radoslaw Szkodzinski
@ 2007-03-18  7:58                                                                         ` Willy Tarreau
  2007-03-18  8:45                                                                           ` Avi Kivity
  0 siblings, 1 reply; 198+ messages in thread
From: Willy Tarreau @ 2007-03-18  7:58 UTC (permalink / raw)
  To: Radoslaw Szkodzinski
  Cc: linux-kernel, Al Boldi, Andrew Morton, William Lee Irwin III, ck,
	Avi Kivity, Linus Torvalds, Nicholas Miell

On Sun, Mar 18, 2007 at 07:54:20AM +0100, Radoslaw Szkodzinski wrote:
> On 3/18/07, Mike Galbraith <efault@gmx.de> wrote:
> >On Sun, 2007-03-18 at 06:24 +0100, Willy Tarreau wrote:
> >
> >> Maybe we're all discussing the problem because we have reached the point
> >> where we need two types of schedulers : one for the desktop and one for
> >> the servers. After all, this is already what is proposed with preempt,
> >> it would make sense provided they share the same core and avoid ifdefs
> >> or unused structure members. Maybe adding OPTIONAL unfairness to RSDL
> >> would help some scenarios, but in any case it is important to retain
> >> the default fairness it provides.
> >
> >Bingo.
> >
> 
> Sounds like Staircase's interactive mode switch, except this actually
> requires writing additional code.
> 
> The per-user system would also be nice for servers, provided there are
> CPU/disc IO/swapper/... quotas or priorities at least.

This is too hard to adjust. Imagine what would happen to your hundreds of
apache processes when the "backup" user will start the rsync or tar+gzip,
or when user "root" will start rotating and compressing the logs. Being
able to group processes may be useful on servers, but it should be enabled
on purpose by the admin.

Willy


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:38                                                                       ` Mike Galbraith
@ 2007-03-18  8:04                                                                         ` Mike Galbraith
  2007-03-18  8:20                                                                         ` jimmy bahuleyan
                                                                                           ` (2 subsequent siblings)
  3 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  8:04 UTC (permalink / raw)
  To: Radoslaw Szkodzinski
  Cc: Kasper Sandberg, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On Sun, 2007-03-18 at 08:38 +0100, Mike Galbraith wrote:
> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
> 
> > I'd recon KDE regresses because of kioslaves waiting on a pipe
> > (communication with the app they're doing IO for) and then expiring.
> > That's why splitting IO from an app isn't exactly smart. It should at
> > least be ran in an another thread.
> 
> Hm.  Sounds rather a lot like the...
> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> ...that I've been getting.

P.S.  For those folks who appear to think that I'm in love with mainline
behavior and blissfully ignorant of it's shortcomings.

http://lwn.net/Articles/176635/


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:38                                                                       ` Mike Galbraith
  2007-03-18  8:04                                                                         ` Mike Galbraith
@ 2007-03-18  8:20                                                                         ` jimmy bahuleyan
  2007-03-18  8:34                                                                           ` Mike Galbraith
  2007-03-18  9:57                                                                         ` Kasper Sandberg
  2007-03-18 15:44                                                                         ` Radoslaw Szkodzinski
  3 siblings, 1 reply; 198+ messages in thread
From: jimmy bahuleyan @ 2007-03-18  8:20 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Radoslaw Szkodzinski, Kasper Sandberg, Al Boldi, Andrew Morton,
	linux-kernel, ck, Linus Torvalds, Nicholas Miell

Mike Galbraith wrote:
> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
> 
>> I'd recon KDE regresses because of kioslaves waiting on a pipe
>> (communication with the app they're doing IO for) and then expiring.
>> That's why splitting IO from an app isn't exactly smart. It should at
>> least be ran in an another thread.
> 
> Hm.  Sounds rather a lot like the...
> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> ...that I've been getting.
> 
> 	-Mike

maybe if it is possible to classify program behaviors that cause RSDL to
do bad (relatively) or the mainline scheduler to jitter, we could try
modifying the existing heuristics to get a better default scheduler.

of course, it wouldn't be able to cater to all the workloads and would
meet everybody's definition of optimal. but getting close to optimal in
most cases should be a good enough goal for linux's default sched!

i've been following this thread, and there's been many instances of
'RSDL is gr8' and 'RSDL regresses'.

maybe RSDL isn't the answer. maybe the current mainline sched isn't
either. but RSDL definitely has done *something* right.

What i think is needed is 'why this works here' and 'how to get this
behavior to work with some other possibly conflicting but important
workloads'.

(just my 2c :-)

-jb
-- 
I am professionally trained in computer science, which is to say
(in all seriousness) that I am extremely poorly educated.
                -- Joseph Weizenbaum

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  8:20                                                                         ` jimmy bahuleyan
@ 2007-03-18  8:34                                                                           ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18  8:34 UTC (permalink / raw)
  To: jimmy bahuleyan
  Cc: Radoslaw Szkodzinski, Kasper Sandberg, Al Boldi, Andrew Morton,
	linux-kernel, ck, Linus Torvalds, Nicholas Miell

On Sun, 2007-03-18 at 13:50 +0530, jimmy bahuleyan wrote:

> maybe if it is possible to classify program behaviors that cause RSDL to
> do bad (relatively) or the mainline scheduler to jitter, we could try
> modifying the existing heuristics to get a better default scheduler.
> 
> of course, it wouldn't be able to cater to all the workloads and would
> meet everybody's definition of optimal. but getting close to optimal in
> most cases should be a good enough goal for linux's default sched!
> 
> i've been following this thread, and there's been many instances of
> 'RSDL is gr8' and 'RSDL regresses'.
> 
> maybe RSDL isn't the answer. maybe the current mainline sched isn't
> either. but RSDL definitely has done *something* right.

Agreed.

> What i think is needed is 'why this works here' and 'how to get this
> behavior to work with some other possibly conflicting but important
> workloads'.
> 
> (just my 2c :-)

IMHO, that's worth more than 2c.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-18  7:58                                                                         ` Willy Tarreau
@ 2007-03-18  8:45                                                                           ` Avi Kivity
  0 siblings, 0 replies; 198+ messages in thread
From: Avi Kivity @ 2007-03-18  8:45 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Radoslaw Szkodzinski, linux-kernel, Al Boldi, Andrew Morton,
	William Lee Irwin III, ck, Linus Torvalds, Nicholas Miell

Willy Tarreau wrote:
>> The per-user system would also be nice for servers, provided there are
>> CPU/disc IO/swapper/... quotas or priorities at least.
>>     
>
> This is too hard to adjust. Imagine what would happen to your hundreds of
> apache processes when the "backup" user will start the rsync or tar+gzip,
> or when user "root" will start rotating and compressing the logs. Being
> able to group processes may be useful on servers, but it should be enabled
> on purpose by the admin.
>   

Sure, if implemented, it should default to the old behavior to avoid 
surprises.  Maintenance jobs may have to be niced to avoid getting too 
much cpu.  But it should also make hosting different applications on the 
the server much more predictable and easier (think of your hundreds of 
apache processes swamping an unrelated load when slashdotted).

-- 
error compiling committee.c: too many arguments to function


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:38                                                                       ` Mike Galbraith
  2007-03-18  8:04                                                                         ` Mike Galbraith
  2007-03-18  8:20                                                                         ` jimmy bahuleyan
@ 2007-03-18  9:57                                                                         ` Kasper Sandberg
  2007-03-18 13:57                                                                           ` Avuton Olrich
  2007-03-19 20:47                                                                           ` Bill Davidsen
  2007-03-18 15:44                                                                         ` Radoslaw Szkodzinski
  3 siblings, 2 replies; 198+ messages in thread
From: Kasper Sandberg @ 2007-03-18  9:57 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Radoslaw Szkodzinski, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On Sun, 2007-03-18 at 08:38 +0100, Mike Galbraith wrote:
> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
> 
> > I'd recon KDE regresses because of kioslaves waiting on a pipe
> > (communication with the app they're doing IO for) and then expiring.
> > That's why splitting IO from an app isn't exactly smart. It should at
> > least be ran in an another thread.
> 
> Hm.  Sounds rather a lot like the...
> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> ...that I've been getting.
> 
not really, only X sucks. KDE works atleast as good with rsdl as
vanilla. i dont know how originally said kde works worse, wasnt it just
someone that thought?

> 	-Mike
> 
> 


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 12:52                             ` Con Kolivas
  2007-03-12 14:14                               ` Al Boldi
  2007-03-18  1:30                               ` Bill Davidsen
@ 2007-03-18 10:50                               ` jos poortvliet
  2 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-18 10:50 UTC (permalink / raw)
  To: ck; +Cc: Con Kolivas, Al Boldi, linux-kernel

[-- Attachment #1: Type: text/plain, Size: 2718 bytes --]

Op Sunday 18 March 2007, schreef Con Kolivas:
> On Monday 12 March 2007 22:26, Al Boldi wrote:
> > Con Kolivas wrote:
> > > On Monday 12 March 2007 15:42, Al Boldi wrote:
> > > > Con Kolivas wrote:
> > > > > On Monday 12 March 2007 08:52, Con Kolivas wrote:
> > > > > > And thank you! I think I know what's going on now. I think each
> > > > > > rotation is followed by another rotation before the higher
> > > > > > priority task is getting a look in in schedule() to even get
> > > > > > quota and add it to the runqueue quota. I'll try a simple change
> > > > > > to see if that helps. Patch coming up shortly.
> > > > >
> > > > > Can you try the following patch and see if it helps. There's also
> > > > > one minor preemption logic fix in there that I'm planning on
> > > > > including. Thanks!
> > > >
> > > > Applied on top of v0.28 mainline, and there is no difference.
> > > >
> > > > What's it look like on your machine?
> > >
> > > The higher priority one always get 6-7ms whereas the lower priority one
> > > runs 6-7ms and then one larger perfectly bound expiration amount.
> > > Basically exactly as I'd expect. The higher priority task gets
> > > precisely RR_INTERVAL maximum latency whereas the lower priority task
> > > gets RR_INTERVAL min and full expiration (according to the virtual
> > > deadline) as a maximum. That's exactly how I intend it to work. Yes I
> > > realise that the max latency ends up being longer intermittently on the
> > > niced task but that's -in my opinion- perfectly fine as a compromise to
> > > ensure the nice 0 one always gets low latency.
> >
> > I think, it should be possible to spread this max expiration latency
> > across the rotation, should it not?
>
> There is a way that I toyed with of creating maps of slots to use for each
> different priority, but it broke the O(1) nature of the virtual deadline
> management. Minimising algorithmic complexity seemed more important to
> maintain than getting slightly better latency spreads for niced tasks. It
> also appeared to be less cache friendly in design. I could certainly try
> and implement it but how much importance are we to place on latency of
> niced tasks? Are you aware of any usage scenario where latency sensitive
> tasks are ever significantly niced in the real world?

I do always nice down heavy games, it makes them more smooth...

-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: is RSDL an "unfair" scheduler too?
  2007-03-18  5:37                                                                     ` Mike Galbraith
@ 2007-03-18 10:58                                                                       ` jos poortvliet
  0 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-18 10:58 UTC (permalink / raw)
  To: ck
  Cc: Mike Galbraith, Bill Davidsen, Al Boldi, Andrew Morton,
	linux-kernel, Linus Torvalds, Nicholas Miell

[-- Attachment #1: Type: text/plain, Size: 843 bytes --]

Op Sunday 18 March 2007, schreef Mike Galbraith:
> On Sat, 2007-03-17 at 21:13 -0500, Bill Davidsen wrote:
> > Now for something constructive... by any chance is Mike running KDE
> > instead of GNOME?
>
> Yes.
>
> 	-Mike

Well, then, it might indeed be the KIOslave/pipe stuff. I experience sometimes 
horrible behaviour in certain apps who use a lot of KIO slaves (konqueror, 
kontact). Is there a solution to this? Cuz if there isn't, the majority of 
the Linux Desktops are going to regress with RSDL...

-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  9:57                                                                         ` Kasper Sandberg
@ 2007-03-18 13:57                                                                           ` Avuton Olrich
  2007-03-19 20:47                                                                           ` Bill Davidsen
  1 sibling, 0 replies; 198+ messages in thread
From: Avuton Olrich @ 2007-03-18 13:57 UTC (permalink / raw)
  To: Kasper Sandberg
  Cc: Mike Galbraith, Radoslaw Szkodzinski, Al Boldi, Andrew Morton,
	linux-kernel, ck, Linus Torvalds, Nicholas Miell

On 3/18/07, Kasper Sandberg <lkml@metanurb.dk> wrote:
> not really, only X sucks. KDE works atleast as good with rsdl as
> vanilla. i dont know how originally said kde works worse, wasnt it just
> someone that thought?

Couldn't agree more, been using RSDL+KDE for a week now, and as far as
I'm concerned I will be sticking with this until it goes to mainline,
or mainline exhibits better behaviour. The fact of the matter is I was
always unsure why windows 'feels' so much better than linux. RSDL
makes it feel like windows, all the time, no matter what's going on.
I'm really miffed when anyone talks about regressions, because I have
scenarios that will completely lock up linux for 10s or so on my
athlon 4200x2, due to [really] poorly designed apps that I need to
run. RSDL seems to work right though it; it works faster, mouse feels
better, browser scrolls smoothly, can't make the sound skip, video is
fluid, opening a term is instant, boot up is faster, all which are a
step up from mainline. That's the facts. IME I have seen nothing but
good with RSDL.

-- 
avuton
--
 Anyone who quotes me in their sig is an idiot. -- Rusty Russell.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  7:38                                                                       ` Mike Galbraith
                                                                                           ` (2 preceding siblings ...)
  2007-03-18  9:57                                                                         ` Kasper Sandberg
@ 2007-03-18 15:44                                                                         ` Radoslaw Szkodzinski
  2007-03-18 16:09                                                                           ` jos poortvliet
  3 siblings, 1 reply; 198+ messages in thread
From: Radoslaw Szkodzinski @ 2007-03-18 15:44 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Kasper Sandberg, Al Boldi, Andrew Morton, linux-kernel, ck,
	Linus Torvalds, Nicholas Miell

On 3/18/07, Mike Galbraith <efault@gmx.de> wrote:
> Hm.  Sounds rather a lot like the...
> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> ...that I've been getting.

Blah. Nothing's perfect. Especially not computer programs.

Still, it's not a smart decision on KDE's part.
It will break a lot of scheduling decisions, esp. you can't use IO priorities.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18 15:44                                                                         ` Radoslaw Szkodzinski
@ 2007-03-18 16:09                                                                           ` jos poortvliet
  0 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-18 16:09 UTC (permalink / raw)
  To: ck
  Cc: Radoslaw Szkodzinski, Mike Galbraith, Al Boldi, linux-kernel,
	Andrew Morton, Linus Torvalds, Kasper Sandberg, Nicholas Miell

[-- Attachment #1: Type: text/plain, Size: 815 bytes --]

Op Sunday 18 March 2007, schreef Radoslaw Szkodzinski:
> On 3/18/07, Mike Galbraith <efault@gmx.de> wrote:
> > Hm.  Sounds rather a lot like the...
> > X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> > ...that I've been getting.
>
> Blah. Nothing's perfect. Especially not computer programs.
>
> Still, it's not a smart decision on KDE's part.
> It will break a lot of scheduling decisions, esp. you can't use IO
> priorities. 

Well, if somebody could explain what they should do instead of what they're 
doing, I can contact them - the libraries for KDE 4 aren't in feature freeze 
yet (they will be, though) so they can solve the problem(s). The KIO 
infrastructure is ATM under a redesign, so please, if you know what they 
should do/are doing wrong, speak up!

grtz

Jos

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  6:08                                                   ` Mike Galbraith
  2007-03-17 13:56                                                     ` Ed Tomlinson
@ 2007-03-18 19:37                                                     ` Lee Revell
  2007-03-18 19:55                                                       ` Mike Galbraith
  2007-03-18 22:45                                                       ` Szonyi Calin
  2007-03-19  2:27                                                     ` David Schwartz
  2 siblings, 2 replies; 198+ messages in thread
From: Lee Revell @ 2007-03-18 19:37 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Nicholas Miell, Con Kolivas, ck, Ingo Molnar, Al Boldi,
	Andrew Morton, linux-kernel

On 3/17/07, Mike Galbraith <efault@gmx.de> wrote:
> P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
> world has to adjust to fit this new scheduler.

I have never seen X run nearly as smooth as our favorite proprietary
OS on similar spec hardware with ANY scheduler.

Lee

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-18 19:37                                                     ` Lee Revell
@ 2007-03-18 19:55                                                       ` Mike Galbraith
  2007-03-18 22:45                                                       ` Szonyi Calin
  1 sibling, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-18 19:55 UTC (permalink / raw)
  To: Lee Revell
  Cc: Nicholas Miell, Con Kolivas, ck, Ingo Molnar, Al Boldi,
	Andrew Morton, linux-kernel

On Sun, 2007-03-18 at 15:37 -0400, Lee Revell wrote:
> On 3/17/07, Mike Galbraith <efault@gmx.de> wrote:
> > P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
> > world has to adjust to fit this new scheduler.
> 
> I have never seen X run nearly as smooth as our favorite proprietary
> OS on similar spec hardware with ANY scheduler.

Heh, I _have_, but they do have an edge.  It's a lot easier when you're
more or less a single user single tasking OS.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-18 19:37                                                     ` Lee Revell
  2007-03-18 19:55                                                       ` Mike Galbraith
@ 2007-03-18 22:45                                                       ` Szonyi Calin
  1 sibling, 0 replies; 198+ messages in thread
From: Szonyi Calin @ 2007-03-18 22:45 UTC (permalink / raw)
  To: Lee Revell
  Cc: Mike Galbraith, Nicholas Miell, Con Kolivas, ck, Ingo Molnar,
	Al Boldi, Andrew Morton, linux-kernel

On Sun, 18 Mar 2007, Lee Revell wrote:

> On 3/17/07, Mike Galbraith <efault@gmx.de> wrote:
>> P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
>> world has to adjust to fit this new scheduler.
>
> I have never seen X run nearly as smooth as our favorite proprietary
> OS on similar spec hardware with ANY scheduler.
>

i have never seen Windows (or were you talking about Mac OSX ?) run 
smooth. Win2k (scheduler) is almost usable if your computer is very fast 
but on common hardware every version of windows for me was a joke. or
maybe you have a special version ;) [1]

I don't run KDE or Gnome in linux so ... maybe that's the problem ;)

[1] And no, i don't consider waiting 2-5-20-50 seconds for a program to 
start a feature. YMMV

--

"frate, trezeste-te, aici nu-i razboiul stelelor"
 				Radu R. pe offtopic at lug.ro

^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: is RSDL an "unfair" scheduler too?
  2007-03-18  5:55                                                                     ` Avi Kivity
@ 2007-03-19  2:27                                                                       ` David Schwartz
  2007-03-19 13:27                                                                         ` Radoslaw Szkodzinski
  2007-03-19 15:25                                                                         ` Avi Kivity
  0 siblings, 2 replies; 198+ messages in thread
From: David Schwartz @ 2007-03-19  2:27 UTC (permalink / raw)
  To: Linux-Kernel@Vger. Kernel. Org

> I didn't suggest adding any unfairness!  I suggested being fair by
> user/job/process instead of being fair by thread (which is actually
> unfair as it favors multi threaded processes over single threaded
> processes).

Wouldn't that be unfair because it favors multi-user approaches over
single-user approaches with the same number of processes?

Consider two otherwise equivalent web server designs. They both use a helper
process owned by the user who owns the file the web server is sending. One
does a lot of work in the helper process, the other does very little. A
"fair by user" scheduler would give the approach that puts more work in the
helper process more CPU than the one that puts little work in the helper
process.

Being fair by user builds lots of assumptions into the scheduler. When
they're not true, the scheduler becomes sub-optimal. For example, consider a
web server that runs two very important tools, 'foo' and 'bar'. Rather than
running them as root, they run as users 'foo' and 'bar' for security. "Fair
to user" would mean that just because most other people are using 'foo', I
get less CPU when I try to use 'foo', because the OS doesn't know the "real
user", just the fake user who owns the process -- a security decision that
has no relationship to fairness. This would be handled perfectly by a "fair
to process" approach.

As for favoring multi-threaded processes over single-threaded processes,
sometimes that's what you want. Consider two servers, one using thread per
job the other using process per job. Does it make sense to give the "process
per job" server as much CPU to do a single task as the "thread per job"
server gets for all the clients it's dealing with?

It's really more important that the scheduler be tunable and predictable.
That way, we can tell it what we want and get it. But the scheduler cannot
read our minds.

DS

^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: RSDL v0.31
  2007-03-17  6:08                                                   ` Mike Galbraith
  2007-03-17 13:56                                                     ` Ed Tomlinson
  2007-03-18 19:37                                                     ` Lee Revell
@ 2007-03-19  2:27                                                     ` David Schwartz
  2007-03-19  6:21                                                       ` Mike Galbraith
  2 siblings, 1 reply; 198+ messages in thread
From: David Schwartz @ 2007-03-19  2:27 UTC (permalink / raw)
  To: Linux-Kernel@Vger. Kernel. Org

> P.S.  "utter failure" was too harsh.  What sticks in my craw is that the
> world has to adjust to fit this new scheduler.
>
> 	-Mike

Even when it's totally clear that this scheduler is doing what you asked it
do while the old one wasn't? It still bothers you that now you have to ask
for what you want rather than asking for what happens to give you what you
want?

> Wrong.  I call a good job giving a _preference_ to the desktop.  I call
> rigid fairness impractical for the desktop, and a denial of reality.

Assuming you *want* that. It's possible that the desktop may not be
particularly important and the machine may be doing much more important
server work with critical latency issues. So if you want that, you have to
ask for it.

Again, your complaint is that the other server gave you what you wanted even
when you didn't ask for it. That's great for you but totally sucks for the
majority of other people who want something else.

DS

^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: RSDL v0.31
  2007-03-19  2:27                                                     ` David Schwartz
@ 2007-03-19  6:21                                                       ` Mike Galbraith
  2007-03-19  6:59                                                         ` Willy Tarreau
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-19  6:21 UTC (permalink / raw)
  To: davids; +Cc: Linux-Kernel@Vger. Kernel. Org

On Sun, 2007-03-18 at 19:27 -0700, David Schwartz wrote:

> > Wrong.  I call a good job giving a _preference_ to the desktop.  I call
> > rigid fairness impractical for the desktop, and a denial of reality.
> 
> Assuming you *want* that. It's possible that the desktop may not be
> particularly important and the machine may be doing much more important
> server work with critical latency issues. So if you want that, you have to
> ask for it.

Amusing argument ;-) I doubt that there are many admins ripping and
encoding CDs on their employers critical production servers.

> Again, your complaint is that the other server gave you what you wanted even
> when you didn't ask for it. That's great for you but totally sucks for the
> majority of other people who want something else.

I don't presume to speak for the majority...

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19  6:21                                                       ` Mike Galbraith
@ 2007-03-19  6:59                                                         ` Willy Tarreau
  0 siblings, 0 replies; 198+ messages in thread
From: Willy Tarreau @ 2007-03-19  6:59 UTC (permalink / raw)
  To: Mike Galbraith; +Cc: davids, Linux-Kernel@Vger. Kernel. Org

On Mon, Mar 19, 2007 at 07:21:47AM +0100, Mike Galbraith wrote:
> On Sun, 2007-03-18 at 19:27 -0700, David Schwartz wrote:
> 
> > > Wrong.  I call a good job giving a _preference_ to the desktop.  I call
> > > rigid fairness impractical for the desktop, and a denial of reality.
> > 
> > Assuming you *want* that. It's possible that the desktop may not be
> > particularly important and the machine may be doing much more important
> > server work with critical latency issues. So if you want that, you have to
> > ask for it.
> 
> Amusing argument ;-) I doubt that there are many admins ripping and
> encoding CDs on their employers critical production servers.

I've known one at least, he said he was ensuring the shiny new dual athlons
were stable enough for production ;-) But that does not make a rule.

Cheers,
Willy


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler
  2007-03-12 18:58                                     ` Antonio Vargas
@ 2007-03-19 10:47                                       ` Helge Hafting
  0 siblings, 0 replies; 198+ messages in thread
From: Helge Hafting @ 2007-03-19 10:47 UTC (permalink / raw)
  To: Antonio Vargas; +Cc: linux-kernel, ck, Al Boldi, Con Kolivas, jos poortvliet

Antonio Vargas wrote:
>
> IIRC, about 2 or three years ago (or maybe on the 2.6.10 timeframe),
> there was a patch which managed to pass the interactive from one app
> to another when there was a pipe or udp connection between them. This
> meant that a marked-as-interactive xterm would, when blocked waiting
> for an Xserver response, transfer some of its interactiveness to the
> Xserver, and aparently it worked very good for desktop workloads so,
> maybe adapting it for this new scheduler would be good.
And it was dropped because of some very nasty side effect,
probably a DOS opportunity.

Helge Hafting


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-19  2:27                                                                       ` David Schwartz
@ 2007-03-19 13:27                                                                         ` Radoslaw Szkodzinski
  2007-03-19 18:30                                                                           ` David Lang
  2007-03-19 15:25                                                                         ` Avi Kivity
  1 sibling, 1 reply; 198+ messages in thread
From: Radoslaw Szkodzinski @ 2007-03-19 13:27 UTC (permalink / raw)
  To: davids; +Cc: Linux-Kernel@Vger. Kernel. Org

On 3/19/07, David Schwartz <davids@webmaster.com> wrote:
>
> > I didn't suggest adding any unfairness!  I suggested being fair by
> > user/job/process instead of being fair by thread (which is actually
> > unfair as it favors multi threaded processes over single threaded
> > processes).
>
> Wouldn't that be unfair because it favors multi-user approaches over
> single-user approaches with the same number of processes?

Not necessarily. Use GID rotations too.
And if you can't assign these properly, it's your own/your distro's fault.

> Consider two otherwise equivalent web server designs. They both use a helper
> process owned by the user who owns the file the web server is sending. One
> does a lot of work in the helper process, the other does very little. A
> "fair by user" scheduler would give the approach that puts more work in the
> helper process more CPU than the one that puts little work in the helper
> process.

Indeed, it's a drawback. Though a configurable one.

> Being fair by user builds lots of assumptions into the scheduler. When
> they're not true, the scheduler becomes sub-optimal. For example, consider a
> web server that runs two very important tools, 'foo' and 'bar'. Rather than
> running them as root, they run as users 'foo' and 'bar' for security. "Fair
> to user" would mean that just because most other people are using 'foo', I
> get less CPU when I try to use 'foo', because the OS doesn't know the "real
> user", just the fake user who owns the process -- a security decision that
> has no relationship to fairness. This would be handled perfectly by a "fair
> to process" approach.

Then, use a group quota. But checking that will be slower, and that
overhead might kill other gains.

> As for favoring multi-threaded processes over single-threaded processes,
> sometimes that's what you want.

Not on desktop.

Typical multi-threaded workloads:
- apache
- some P2P clients
- some audio servers/applications (small number of threads)

Single-threaded is much more popular.

> Consider two servers,
^ ^ ^

Well, aren't we discussing desktops?
Server admins can fine-tune the rights and CPU quotas per group.

> one using thread per
> job the other using process per job. Does it make sense to give the "process
> per job" server as much CPU to do a single task as the "thread per job"
> server gets for all the clients it's dealing with?

Not necessarily. You see, the processes themselves are schedulable,
the threads aren't.

>
> It's really more important that the scheduler be tunable and predictable.

This kind of scheduler, yes. Except it's much more tunable than a
simple fair or unfair scheduler, and much more suited to real-time
applications.

> That way, we can tell it what we want and get it. But the scheduler cannot
> read our minds.

That's why the per-user or per-group part would have to be optional.
It just doesn't make much sense on single-user desktops.
Then, RSDL design or even RSDL+bonus could be used.

The bonus part would have to be really simple, e.g. priority
inheritance for pipes, startup priority boost for nice 0 tasks.
(warning - fork bombs. :P )
No sleep estimator. Maybe these would suffice?

The interactive bonus would be disabled by default, same as
per-user/per-group scheduling.

Some syscalls would have to be added, maybe using LSM framework?

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 15:13                                                           ` RSDL v0.31 Mark Hahn
  2007-03-17 17:22                                                             ` Stephen Clark
@ 2007-03-19 15:06                                                             ` Chris Friesen
  1 sibling, 0 replies; 198+ messages in thread
From: Chris Friesen @ 2007-03-19 15:06 UTC (permalink / raw)
  To: Mark Hahn; +Cc: Con Kolivas, linux-kernel

Just so you know the context, I'm coming at this from the point of view 
of an embedded call server designer.

Mark Hahn wrote:
> why do you think fairness is good, especially always good?

Fairness is good because it promotes predictability.  See the 
"deterministic" section below.

> even starvation is sometimes a good thing - there's a place for processes
> that only use the CPU if it is otherwise idle.  that is, they are
> deliberately starved all the rest of the time.

If you have nice 19 be sufficiently low priority, then the difference 
between "using cpu if otherwise idle" and "gets a little bit of cpu even 
if not totally idle" is unimportant.

Starvation is a very *bad* thing when you don't want it.

>> Much lower and bound latencies

> in an average sense?  also, under what circumstances does this actually
> matter?  (please don't offer something like RT audio on an overloaded 
> machine- that's operator error, not something to design for.)

In my environment, latency *matters*.  If a packet doesn't get processed 
in time, you drop it.  With mainline it can be quite tricky to tune the 
latency, especially when you don't want to resort to soft realtime 
because you don't entirely trust the code thats running (because it came 
from a third party vendor).

>> Deterministic

> not a bad thing, but how does this make itself apparent and of value to 
> the user?  I think everyone is extremely comfortable with non-determinism
> (stemming from networks, caches, interleaved workloads, etc)

Determinism is really important.  It almost doesn't matter what the 
behaviour is, as long as we can predict it.  We model the system to 
determine how to tweak the system (niceness, sched policy, etc.), as 
well as what performance numbers we can advertise.  If the system is 
non-deterministic, it makes this modelling extremely difficult--you end 
up having to give up significant performance due to worst-case spikes.

If the system is deterministic, it makes it much easier to predict its 
actions.

Chris

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-19  2:27                                                                       ` David Schwartz
  2007-03-19 13:27                                                                         ` Radoslaw Szkodzinski
@ 2007-03-19 15:25                                                                         ` Avi Kivity
  2007-03-19 16:06                                                                           ` Helge Hafting
  1 sibling, 1 reply; 198+ messages in thread
From: Avi Kivity @ 2007-03-19 15:25 UTC (permalink / raw)
  To: davids; +Cc: Linux-Kernel@Vger. Kernel. Org

[what happened to the 'To' header?]

David Schwartz wrote:
>> I didn't suggest adding any unfairness!  I suggested being fair by
>> user/job/process instead of being fair by thread (which is actually
>> unfair as it favors multi threaded processes over single threaded
>> processes).
>>     
>
> Wouldn't that be unfair because it favors multi-user approaches over
> single-user approaches with the same number of processes?
>
> Consider two otherwise equivalent web server designs. They both use a helper
> process owned by the user who owns the file the web server is sending. One
> does a lot of work in the helper process, the other does very little. A
> "fair by user" scheduler would give the approach that puts more work in the
> helper process more CPU than the one that puts little work in the helper
> process.
>   

A fairly contrived example, but I see your point.  Of course any system 
can be broken.  I think that user-level scheduling is good for real 
multi user systems, where 'user' means a person, not an artificial 
entity.  It's also good for a multi application server, where typically 
each service runs (or can be made to run) as a separate user.

> Being fair by user builds lots of assumptions into the scheduler. When
> they're not true, the scheduler becomes sub-optimal. For example, consider a
> web server that runs two very important tools, 'foo' and 'bar'. Rather than
> running them as root, they run as users 'foo' and 'bar' for security. "Fair
> to user" would mean that just because most other people are using 'foo', I
> get less CPU when I try to use 'foo', because the OS doesn't know the "real
> user", just the fake user who owns the process -- a security decision that
> has no relationship to fairness. This would be handled perfectly by a "fair
> to process" approach.
>   

Perhaps we need a scheduling class instead, defaulting to each user 
being in its own class, and system processes in another (or maybe more 
than one) class.  Root can configure the relative priorities of the 
various classes according to preferences.

> As for favoring multi-threaded processes over single-threaded processes,
> sometimes that's what you want. Consider two servers, one using thread per
> job the other using process per job. Does it make sense to give the "process
> per job" server as much CPU to do a single task as the "thread per job"
> server gets for all the clients it's dealing with?
>
>   

That's why I wanted the job abstraction, which currently isn't 
communicated to the kernel (or at least not well).  Within a user's 
quota, jobs are scheduled fairly.

> It's really more important that the scheduler be tunable and predictable.
> That way, we can tell it what we want and get it. But the scheduler cannot
> read our minds.
>   

For multiuser systems, it also has to provide predictable response to 
unpredictable loads.  RSDL accomplishes part of this by removing 
heuristics.  User-level scheduling does more by limiting the impact of a 
single user to 1/N of the system's cpu capacity, and similarly limits 
the impact of a single job to 1/number_of_active_jobs_for_this_user.

-- 
error compiling committee.c: too many arguments to function


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17  9:58                                                           ` Mike Galbraith
                                                                               ` (2 preceding siblings ...)
  2007-03-17 20:55                                                             ` Al Boldi
@ 2007-03-19 16:03                                                             ` Mark Lord
  3 siblings, 0 replies; 198+ messages in thread
From: Mark Lord @ 2007-03-19 16:03 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Con Kolivas, ck, Serge Belyshev, Ingo Molnar, Al Boldi,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

Mike Galbraith wrote:
> On Sat, 2007-03-17 at 20:48 +1100, Con Kolivas wrote:
> 
>> The most frustrating part of a discussion of this nature on lkml is that 
>> earlier information in a thread seems to be long forgotten after a few days 
>> and all that is left is the one reporter having a problem.
> 
> One?  I'm not the only person who reported regression.

Ditto here.  I'm not the only one after all!
Reverted back to stock scheduler, and desktop is interactive again.

-ml

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-19 15:25                                                                         ` Avi Kivity
@ 2007-03-19 16:06                                                                           ` Helge Hafting
  2007-03-19 16:37                                                                             ` Avi Kivity
  0 siblings, 1 reply; 198+ messages in thread
From: Helge Hafting @ 2007-03-19 16:06 UTC (permalink / raw)
  To: Avi Kivity; +Cc: davids, Linux-Kernel@Vger. Kernel. Org

Avi Kivity wrote:
>
> A fairly contrived example, but I see your point.  Of course any 
> system can be broken.  I think that user-level scheduling is good for 
> real multi user systems, where 'user' means a person, not an 
> artificial entity.  It's also good for a multi application server, 
> where typically each service runs (or can be made to run) as a 
> separate user.
For a not so contrived example, look at email delivery.  Some mailservers do
all work as root (or some fixed email user)

Some servers will switch to the UID of the user receiving the message, 
limiting the
damage in case of buffer overflow etc. A fair amount of work is then done
as that user - running the message through virus/spam-checks and
then perhaps procmail.

Helge Hafting

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-17 20:55                                                             ` Al Boldi
  2007-03-18  6:17                                                               ` Mike Galbraith
@ 2007-03-19 16:07                                                               ` Mark Lord
  2007-03-19 16:26                                                                 ` Xavier Bestel
  2007-03-19 20:53                                                                 ` Al Boldi
  1 sibling, 2 replies; 198+ messages in thread
From: Mark Lord @ 2007-03-19 16:07 UTC (permalink / raw)
  To: Al Boldi
  Cc: Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

Al Boldi wrote:
>..
> Mike, I'm not saying RSDL is perfect, but v0.31 is by far better than 
> mainline.  Try this easy test:
> 
> startx with the vesa driver
> run reflect from the mesa5.0-demos
> load 5 cpu-hogs
> start moving the mouse
> 
> On my desktop, mainline completely breaks down, and no nicing may rescue.
> 
> On RSDL, even without nicing, the desktop is at least useable.

I use a simpler, far more common (for lkml participants) workload:

Dell notebook, single P-M-2GHz, ATI X300, open source X.org:
(1) build a kernel in one window with "make -j$((NUMBER_OF_CPUS + 1))".
(2) try to read email and/or surf in Firefox/Thunderbird.

Stock scheduler wins easily, no contest.

Cheers

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 16:07                                                               ` Mark Lord
@ 2007-03-19 16:26                                                                 ` Xavier Bestel
  2007-03-19 16:36                                                                   ` Mark Lord
  2007-03-19 20:53                                                                 ` Al Boldi
  1 sibling, 1 reply; 198+ messages in thread
From: Xavier Bestel @ 2007-03-19 16:26 UTC (permalink / raw)
  To: Mark Lord
  Cc: Al Boldi, Mike Galbraith, Con Kolivas, ck, Serge Belyshev,
	Ingo Molnar, linux-kernel, Nicholas Miell, Linus Torvalds,
	Andrew Morton

On Mon, 2007-03-19 at 12:07 -0400, Mark Lord wrote:
> Dell notebook, single P-M-2GHz, ATI X300, open source X.org:
> (1) build a kernel in one window with "make -j$((NUMBER_OF_CPUS + 1))".
> (2) try to read email and/or surf in Firefox/Thunderbird.
> 
> Stock scheduler wins easily, no contest.

What happens when you renice X ?

	Xav



^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 16:26                                                                 ` Xavier Bestel
@ 2007-03-19 16:36                                                                   ` Mark Lord
  2007-03-19 16:43                                                                     ` Xavier Bestel
  0 siblings, 1 reply; 198+ messages in thread
From: Mark Lord @ 2007-03-19 16:36 UTC (permalink / raw)
  To: Xavier Bestel
  Cc: Al Boldi, Mike Galbraith, Con Kolivas, ck, Serge Belyshev,
	Ingo Molnar, linux-kernel, Nicholas Miell, Linus Torvalds,
	Andrew Morton

Xavier Bestel wrote:
> On Mon, 2007-03-19 at 12:07 -0400, Mark Lord wrote:
>> Dell notebook, single P-M-2GHz, ATI X300, open source X.org:
>> (1) build a kernel in one window with "make -j$((NUMBER_OF_CPUS + 1))".
>> (2) try to read email and/or surf in Firefox/Thunderbird.
>>
>> Stock scheduler wins easily, no contest.
> 
> What happens when you renice X ?

Dunno -- not necessary with the stock scheduler.
Nicing the "make" helped with RSDL, though.
But again, the stock scheduler "just works" in that regard.

I agree with Ingo -- the auto-renice feature is something
very useful for desktops, and is missing from RSDL.

Cheers

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-19 16:06                                                                           ` Helge Hafting
@ 2007-03-19 16:37                                                                             ` Avi Kivity
  0 siblings, 0 replies; 198+ messages in thread
From: Avi Kivity @ 2007-03-19 16:37 UTC (permalink / raw)
  To: Helge Hafting; +Cc: davids, Linux-Kernel@Vger. Kernel. Org

Helge Hafting wrote:
> Avi Kivity wrote:
>>
>> A fairly contrived example, but I see your point.  Of course any 
>> system can be broken.  I think that user-level scheduling is good for 
>> real multi user systems, where 'user' means a person, not an 
>> artificial entity.  It's also good for a multi application server, 
>> where typically each service runs (or can be made to run) as a 
>> separate user.
> For a not so contrived example, look at email delivery.  Some 
> mailservers do
> all work as root (or some fixed email user)
>
> Some servers will switch to the UID of the user receiving the message, 
> limiting the
> damage in case of buffer overflow etc. A fair amount of work is then done
> as that user - running the message through virus/spam-checks and
> then perhaps procmail.
>

Actually that makes some sense with user level scheduling - delivering 
email is charged to the recipient instead of to the system.  But I agree 
it's a surprising side effect and if this is ever implemented it should 
be optional.

-- 
error compiling committee.c: too many arguments to function


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 16:36                                                                   ` Mark Lord
@ 2007-03-19 16:43                                                                     ` Xavier Bestel
  2007-03-20  3:11                                                                       ` Linus Torvalds
  0 siblings, 1 reply; 198+ messages in thread
From: Xavier Bestel @ 2007-03-19 16:43 UTC (permalink / raw)
  To: Mark Lord
  Cc: Al Boldi, Mike Galbraith, Con Kolivas, ck, Serge Belyshev,
	Ingo Molnar, linux-kernel, Nicholas Miell, Linus Torvalds,
	Andrew Morton

On Mon, 2007-03-19 at 12:36 -0400, Mark Lord wrote:
> Xavier Bestel wrote:
> > On Mon, 2007-03-19 at 12:07 -0400, Mark Lord wrote:
> >> Dell notebook, single P-M-2GHz, ATI X300, open source X.org:
> >> (1) build a kernel in one window with "make -j$((NUMBER_OF_CPUS +
> 1))".
> >> (2) try to read email and/or surf in Firefox/Thunderbird.
> >>
> >> Stock scheduler wins easily, no contest.
> > 
> > What happens when you renice X ?
> 
> Dunno -- not necessary with the stock scheduler.

Could you try something like renice -10 $(pidof Xorg) ?

	Xav



^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-19 13:27                                                                         ` Radoslaw Szkodzinski
@ 2007-03-19 18:30                                                                           ` David Lang
  0 siblings, 0 replies; 198+ messages in thread
From: David Lang @ 2007-03-19 18:30 UTC (permalink / raw)
  To: Radoslaw Szkodzinski; +Cc: davids, Linux-Kernel@Vger. Kernel. Org

On Mon, 19 Mar 2007, Radoslaw Szkodzinski wrote:

>
>> Consider two servers,
> ^ ^ ^
>
> Well, aren't we discussing desktops?
> Server admins can fine-tune the rights and CPU quotas per group.
>

how many multi-user desktops are there? most desktops that I have seen run just 
about everything as a single user. I know that on mine, I don't want the 
updatedb process that runs as 'nobody' out of cron to have the same percentage 
of cpu as all the processes running as my userid.

David Lang

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-18  9:57                                                                         ` Kasper Sandberg
  2007-03-18 13:57                                                                           ` Avuton Olrich
@ 2007-03-19 20:47                                                                           ` Bill Davidsen
  2007-03-20 10:19                                                                             ` jos poortvliet
  2007-03-21  8:58                                                                             ` Kasper Sandberg
  1 sibling, 2 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-19 20:47 UTC (permalink / raw)
  To: Kasper Sandberg
  Cc: Mike Galbraith, Radoslaw Szkodzinski, Al Boldi, Andrew Morton,
	linux-kernel, ck, Linus Torvalds, Nicholas Miell

Kasper Sandberg wrote:
> On Sun, 2007-03-18 at 08:38 +0100, Mike Galbraith wrote:
>> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
>>
>>> I'd recon KDE regresses because of kioslaves waiting on a pipe
>>> (communication with the app they're doing IO for) and then expiring.
>>> That's why splitting IO from an app isn't exactly smart. It should at
>>> least be ran in an another thread.
>> Hm.  Sounds rather a lot like the...
>> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
>> ...that I've been getting.
>>
> not really, only X sucks. KDE works atleast as good with rsdl as
> vanilla. i dont know how originally said kde works worse, wasnt it just
> someone that thought?
> 
It was probably me, and I had the opinion that KDE is not as smooth as 
GNOME with RSDL. I haven't had time to measure, but using for daily 
stuff for about an hour each way hasn't changed my opinion. Every once 
in a while KDE will KLUNK to a halt for 200-300ms doing mundane stuff 
like redrawing a page, scrolling, etc. I don't see it with GNOME.

-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 16:07                                                               ` Mark Lord
  2007-03-19 16:26                                                                 ` Xavier Bestel
@ 2007-03-19 20:53                                                                 ` Al Boldi
  2007-03-20 19:50                                                                   ` Artur Skawina
  1 sibling, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-19 20:53 UTC (permalink / raw)
  To: Mark Lord
  Cc: Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Linus Torvalds, Andrew Morton

Mark Lord wrote:
> Al Boldi wrote:
> >..
> > Mike, I'm not saying RSDL is perfect, but v0.31 is by far better than
> > mainline.  Try this easy test:
> >
> > startx with the vesa driver
> > run reflect from the mesa5.0-demos
> > load 5 cpu-hogs
> > start moving the mouse
> >
> > On my desktop, mainline completely breaks down, and no nicing may
> > rescue.
> >
> > On RSDL, even without nicing, the desktop is at least useable.
>
> I use a simpler, far more common (for lkml participants) workload:
>
> Dell notebook, single P-M-2GHz, ATI X300, open source X.org:
> (1) build a kernel in one window with "make -j$((NUMBER_OF_CPUS + 1))".
> (2) try to read email and/or surf in Firefox/Thunderbird.
>
> Stock scheduler wins easily, no contest.

Try this on RSDL:

--- sched.bak.c	2007-03-16 23:07:23.000000000 +0300
+++ sched.c	2007-03-19 23:49:40.000000000 +0300
@@ -938,7 +938,11 @@ static void activate_task(struct task_st
 				     (now - p->timestamp) >> 20);
 	}
 
-	p->quota = rr_quota(p);
+	/*
+	 * boost factor hardcoded to 5; adjust to your liking
+	 * higher means more likely to DoS
+	 */
+	p->quota = rr_quota(p) + (((now - p->timestamp) >> 20) * 5);
 	p->prio = effective_prio(p);
 	p->timestamp = now;
 	__activate_task(p, rq);


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: is RSDL an "unfair" scheduler too?
  2007-03-18  6:09                                                                     ` Bill Huey
  2007-03-18  6:37                                                                       ` Mike Galbraith
@ 2007-03-19 21:14                                                                       ` Bill Davidsen
  1 sibling, 0 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-19 21:14 UTC (permalink / raw)
  To: linux-kernel; +Cc: ck

Bill Huey (hui) wrote:
> On Sun, Mar 18, 2007 at 06:24:40AM +0100, Willy Tarreau wrote:
>>> Dunno. I guess a lot of people would like to then manage the classes, 
>>> which would be painful as hell. 
>> Sure ! I wouldn't like people to point the finger on Linux saying "hey
>> look, they can't write a good scheduler so you have to adjust the knobs
>> yourself!". I keep in mind that Solaris' scheduler is very good, both
>> fair and interactive. FreeBSD was good (I haven't tested for a long time).
>> We should manage to get something good for most usages, and optimize
>> later for specific uses.
> 
> Like I've said in a previous email, SGI schedulers have an interactive
> term in addition to the normal "nice" values. If RSDL ends up being too
> rigid for desktop use, then this might be a good idea to explore in
> addition to priority manipulation.
> 
> However, it hasn't been completely proven that RSDL can't handle desktop
> loads and that needs to be completely explored first. It certain seems
> like, from the .jpgs that were posted earlier in the thread regarding mysql
> performance, that RSDL seems to have improved performance for those set
> ups so it's not universally the case that it sucks for server loads. The
> cause of this performance difference has yet to be pinpointed.

I would say that RSDL is probably a bit better than default for server 
use, although if the server starves for CPU interactive processing at 
the console becomes leisurely indeed. The only thing I would like to 
address is the order of magnitude blips in latency of nice processes, 
which may be solved by playing with time slices. Con hasn't really 
commented on that (or I haven't read down to it).

> 
> Also, bandwidth scheduler like this are a new critical development for
> things like the -rt patch. It would benefit greatly if the RSDL basic
> mechanisms (RR and deadlines) were to somehow slip into that patch and
> be used for a more strict -rt based scheduling class. It would be the basis
> for first-class control over process resource usage and would be a first
> in Linux or any mainstream kernel.

I don't think that RSDL and -rt should be merged, but that's for Ingo 
and Con to discuss. I would love to see RSDL in mainline as soon as it 
is practical, marked as EXPERIMENTAL.
> 
> This would be a powerful addition to Linux as a whole and RSDL should
> not be dismissed without these considerations. If it can somehow be
> integrated into the kernel with interactivity concerns addressed, then
> it would be an all out win for the kernel in both these areas.
> 
I don't think there are a lot of places where it underperforms the 
default scheduler, and it avoids a lot of jackpot cases where an 
overloaded system really bogs down. I would like to see more varied 
testing before any changes are made, unless a simple change would 
improve consistency of latency.

-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 16:43                                                                     ` Xavier Bestel
@ 2007-03-20  3:11                                                                       ` Linus Torvalds
  2007-03-20  6:11                                                                         ` Willy Tarreau
                                                                                           ` (2 more replies)
  0 siblings, 3 replies; 198+ messages in thread
From: Linus Torvalds @ 2007-03-20  3:11 UTC (permalink / raw)
  To: Xavier Bestel
  Cc: Mark Lord, Al Boldi, Mike Galbraith, Con Kolivas, ck,
	Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

On Mon, 19 Mar 2007, Xavier Bestel wrote:
> > >> Stock scheduler wins easily, no contest.
> > > 
> > > What happens when you renice X ?
> > 
> > Dunno -- not necessary with the stock scheduler.
> 
> Could you try something like renice -10 $(pidof Xorg) ?

Could you try something as simple and accepting that maybe this is a 
problem?

Quite frankly, I was *planning* on merging RSDL very early after 2.6.21, 
but there is one thing that has turned me completely off the whole thing:

 - the people involved seem to be totally unwilling to even admit there 
   might be a problem.

This is like alcoholism. If you cannot admit that you might have a 
problem, you'll never get anywhere. And quite frankly, the RSDL proponents 
seem to be in denial ("we're always better", "it's your problem if the old 
scheduler works better", "just one report of old scheduler being better").

And the thing is, if people aren't even _willing_ to admit that there may 
be issues, there's *no*way*in*hell* I will merge it even for testing. 
Because the whole and only point of merging RSDL was to see if it could 
replace the old scheduler, and the most important feature in that case is 
not whether it is perfect, BUT WHETHER ANYBODY IS INTERESTED IN TRYING TO 
FIX THE INEVITABLE PROBLEMS!

See?

Can you people not see that the way you're doing that "RSDL is perfect" 
chorus in the face of people who report problems, you're just making it 
totally unrealistic that it will *ever* get merged.

So unless somebody steps up to the plate and actually *talks* about the 
problem reports, and admits that maybe RSDL will need some tweaking, I'm 
not going to merge it.

Because there is just _one_ thing that is more important than code - and 
that is the willingness to fix the code...

		Linus

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  3:11                                                                       ` Linus Torvalds
@ 2007-03-20  6:11                                                                         ` Willy Tarreau
  2007-03-20  8:03                                                                           ` Mike Galbraith
                                                                                             ` (2 more replies)
  2007-03-20 10:26                                                                         ` [ck] " jos poortvliet
  2007-03-20 13:22                                                                         ` Mark Lord
  2 siblings, 3 replies; 198+ messages in thread
From: Willy Tarreau @ 2007-03-20  6:11 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Xavier Bestel, Mark Lord, Al Boldi, Mike Galbraith, Con Kolivas,
	ck, Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

On Mon, Mar 19, 2007 at 08:11:55PM -0700, Linus Torvalds wrote:
> Quite frankly, I was *planning* on merging RSDL very early after 2.6.21, 
> but there is one thing that has turned me completely off the whole thing:
> 
>  - the people involved seem to be totally unwilling to even admit there 
>    might be a problem.
> 
> This is like alcoholism. If you cannot admit that you might have a 
> problem, you'll never get anywhere. And quite frankly, the RSDL proponents 
> seem to be in denial ("we're always better", "it's your problem if the old 
> scheduler works better", "just one report of old scheduler being better").
> 
> And the thing is, if people aren't even _willing_ to admit that there may 
> be issues, there's *no*way*in*hell* I will merge it even for testing. 
> Because the whole and only point of merging RSDL was to see if it could 
> replace the old scheduler, and the most important feature in that case is 
> not whether it is perfect, BUT WHETHER ANYBODY IS INTERESTED IN TRYING TO 
> FIX THE INEVITABLE PROBLEMS!

Linus, you're unfair with Con. He initially was on this position, and lately
worked with Mike by proposing changes to try to improve his X responsiveness.
But he's ill right now and cannot touch the keyboard, so only his supporters
speak for him, and as you know, speech is not code and does not fix problems.

Leave him a week or so to relieve and let's see what he can propose. Hopefully
a week away from the keyboard will help him think with a more general approach.
Also, Mike has already modified the code a bit to get better experience.

Also, while I don't agree with starting to renice X to get something usable,
it seems real that there's something funny on Mike's system which makes it
behave particularly strangely when combined with RSDL, because other people
in comparable tests (including me) have found X perfectly smooth even with
loads in the tens or even hundreds. I really suspect that we will find a bug
in RSDL which triggers the problem and that this fix will help discover
another problem on Mike's hardware which was not triggered by mainline.

Regards,
Willy

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  6:11                                                                         ` Willy Tarreau
@ 2007-03-20  8:03                                                                           ` Mike Galbraith
  2007-03-21 14:57                                                                             ` Mike Galbraith
  2007-03-20  9:03                                                                           ` Xavier Bestel
  2007-03-20 15:31                                                                           ` Linus Torvalds
  2 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-20  8:03 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, Xavier Bestel, Mark Lord, Al Boldi, Con Kolivas,
	ck, Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

On Tue, 2007-03-20 at 07:11 +0100, Willy Tarreau wrote:

> Also, while I don't agree with starting to renice X to get something usable,
> it seems real that there's something funny on Mike's system which makes it
> behave particularly strangely when combined with RSDL, because other people
> in comparable tests (including me) have found X perfectly smooth even with
> loads in the tens or even hundreds. I really suspect that we will find a bug
> in RSDL which triggers the problem and that this fix will help discover
> another problem on Mike's hardware which was not triggered by mainline.

I don't _think_ there's anything funny in my system, and Con said it was
the expected behavior with my testcase, but I won't rule it out.

Moving right along to the bugs part, I hope others are looking as well,
and not only talking.

One area that looks pretty fishy to me is cross-cpu wakeups and task
migration.  p->rotation appears to lose all meaning when you cross the
cpu boundary, and try_to_wake_up()is using that information in the
cross-cpu case.  In pull_task() OTOH, it checks to see if the task ran
on the remote cpu (at all, hmm), and if so tags the task accordingly.
It is not immediately obvious to me why this would be a good thing
though, because quotas of one runqueue don't appear to have any relation
to quotas of some other runqueue.  (i'm going to it that this old
information is meaningless)

	-Mike

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  6:11                                                                         ` Willy Tarreau
  2007-03-20  8:03                                                                           ` Mike Galbraith
@ 2007-03-20  9:03                                                                           ` Xavier Bestel
  2007-03-20 12:31                                                                             ` Artur Skawina
  2007-03-21  7:50                                                                             ` Ingo Molnar
  2007-03-20 15:31                                                                           ` Linus Torvalds
  2 siblings, 2 replies; 198+ messages in thread
From: Xavier Bestel @ 2007-03-20  9:03 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, Mark Lord, Al Boldi, Mike Galbraith, Con Kolivas,
	ck, Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

On Tue, 2007-03-20 at 07:11 +0100, Willy Tarreau wrote:
> I don't agree with starting to renice X to get something usable

X looks very special to me: it's a big userspace driver, the primary
task handling user interaction on the desktop, and on some OS the part
responsible for moving the mouse pointer and interacting with windows is
even implemented as an interrupt handler, and that for sure provides for
smooth user experience even on very low-end hardware. Why not compensate
for X design by prioritizing it a bit ?
If RSDL + reniced X makes for a better desktop than sotck kernel + X, on
all kind of workloads, it's good to know.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-19 20:47                                                                           ` Bill Davidsen
@ 2007-03-20 10:19                                                                             ` jos poortvliet
  2007-03-21  8:58                                                                             ` Kasper Sandberg
  1 sibling, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-20 10:19 UTC (permalink / raw)
  To: ck
  Cc: Bill Davidsen, Kasper Sandberg, Al Boldi, Mike Galbraith,
	linux-kernel, Andrew Morton, Linus Torvalds, Nicholas Miell

[-- Attachment #1: Type: text/plain, Size: 1694 bytes --]

Op Tuesday 20 March 2007, schreef Bill Davidsen:
> Kasper Sandberg wrote:
> > On Sun, 2007-03-18 at 08:38 +0100, Mike Galbraith wrote:
> >> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
> >>> I'd recon KDE regresses because of kioslaves waiting on a pipe
> >>> (communication with the app they're doing IO for) and then expiring.
> >>> That's why splitting IO from an app isn't exactly smart. It should at
> >>> least be ran in an another thread.
> >>
> >> Hm.  Sounds rather a lot like the...
> >> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> >> ...that I've been getting.
> >
> > not really, only X sucks. KDE works atleast as good with rsdl as
> > vanilla. i dont know how originally said kde works worse, wasnt it just
> > someone that thought?
>
> It was probably me, and I had the opinion that KDE is not as smooth as
> GNOME with RSDL. I haven't had time to measure, but using for daily
> stuff for about an hour each way hasn't changed my opinion. Every once
> in a while KDE will KLUNK to a halt for 200-300ms doing mundane stuff
> like redrawing a page, scrolling, etc. I don't see it with GNOME.

yeah, here too... sometimes even longer (and I have a dualcore, 3gb ram, 
damnit!)

-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-20  3:11                                                                       ` Linus Torvalds
  2007-03-20  6:11                                                                         ` Willy Tarreau
@ 2007-03-20 10:26                                                                         ` jos poortvliet
  2007-03-20 13:22                                                                         ` Mark Lord
  2 siblings, 0 replies; 198+ messages in thread
From: jos poortvliet @ 2007-03-20 10:26 UTC (permalink / raw)
  To: ck
  Cc: Linus Torvalds, Xavier Bestel, Al Boldi, Andrew Morton,
	Mike Galbraith, linux-kernel, Mark Lord, Nicholas Miell

[-- Attachment #1: Type: text/plain, Size: 3190 bytes --]

Op Tuesday 20 March 2007, schreef Linus Torvalds:
> On Mon, 19 Mar 2007, Xavier Bestel wrote:
> > > >> Stock scheduler wins easily, no contest.
> > > >
> > > > What happens when you renice X ?
> > >
> > > Dunno -- not necessary with the stock scheduler.
> >
> > Could you try something like renice -10 $(pidof Xorg) ?
>
> Could you try something as simple and accepting that maybe this is a
> problem?
>
> Quite frankly, I was *planning* on merging RSDL very early after 2.6.21,
> but there is one thing that has turned me completely off the whole thing:
>
>  - the people involved seem to be totally unwilling to even admit there
>    might be a problem.
>
> This is like alcoholism. If you cannot admit that you might have a
> problem, you'll never get anywhere. And quite frankly, the RSDL proponents
> seem to be in denial ("we're always better", "it's your problem if the old
> scheduler works better", "just one report of old scheduler being better").
>
> And the thing is, if people aren't even _willing_ to admit that there may
> be issues, there's *no*way*in*hell* I will merge it even for testing.
> Because the whole and only point of merging RSDL was to see if it could
> replace the old scheduler, and the most important feature in that case is
> not whether it is perfect, BUT WHETHER ANYBODY IS INTERESTED IN TRYING TO
> FIX THE INEVITABLE PROBLEMS!

Con simply isn't available right now, but you're right. RSDL isn't ready yet, 
imho, there seem to be some regressions (and I'm bitten by them, too). But if 
con's past behaviour says anything about how he's going to behave in the 
future (and according to my psych prof it's the most reliable predictor ;-)), 
I'm pretty sure he'll jump on this when he's healthy again. He's gone through 
great lengths to fix problems with staircase, no matter how obscure, so I see 
no reason why he wouldn't do the same for RSDL... Though scheduler problems 
can be extremely hard to reproduce on other hardware.

> See?
>
> Can you people not see that the way you're doing that "RSDL is perfect"
> chorus in the face of people who report problems, you're just making it
> totally unrealistic that it will *ever* get merged.
>
> So unless somebody steps up to the plate and actually *talks* about the
> problem reports, and admits that maybe RSDL will need some tweaking, I'm
> not going to merge it.
>
> Because there is just _one_ thing that is more important than code - and
> that is the willingness to fix the code...
>
> 		Linus
> _______________________________________________
> http://ck.kolivas.org/faqs/replying-to-mailing-list.txt
> ck mailing list - mailto: ck@vds.kolivas.org
> http://vds.kolivas.org/mailman/listinfo/ck



-- 
Disclaimer:

Alles wat ik doe denk en zeg is gebaseerd op het wereldbeeld wat ik nu heb. 
Ik ben niet verantwoordelijk voor wijzigingen van de wereld, of het beeld wat 
ik daarvan heb, noch voor de daaruit voortvloeiende gedragingen van mezelf. 
Alles wat ik zeg is aardig bedoeld, tenzij expliciet vermeld.

Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html

[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  9:03                                                                           ` Xavier Bestel
@ 2007-03-20 12:31                                                                             ` Artur Skawina
  2007-03-20 19:16                                                                               ` Artur Skawina
  2007-03-21  7:50                                                                             ` Ingo Molnar
  1 sibling, 1 reply; 198+ messages in thread
From: Artur Skawina @ 2007-03-20 12:31 UTC (permalink / raw)
  To: Xavier Bestel
  Cc: Willy Tarreau, Linus Torvalds, Mark Lord, Al Boldi,
	Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

Xavier Bestel wrote:
> On Tue, 2007-03-20 at 07:11 +0100, Willy Tarreau wrote:
>> I don't agree with starting to renice X to get something usable
> 
> X looks very special to me: it's a big userspace driver, the primary
> task handling user interaction on the desktop, and on some OS the part
> responsible for moving the mouse pointer and interacting with windows is
> even implemented as an interrupt handler, and that for sure provides for
> smooth user experience even on very low-end hardware. Why not compensate
> for X design by prioritizing it a bit ?
> If RSDL + reniced X makes for a better desktop than sotck kernel + X, on
> all kind of workloads, it's good to know.

No, running X at a different priority than its clients is not really
a good idea. If it isn't immediately obvious why try something like
this:

mkdir /tmp/tempdir
cd /tmp/tempdir
for i in `seq -w 1 10000` ; do touch
longfilenamexxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx$i
; done
nice --20 xterm &
xterm &
nice -20 xterm &

then do "time ls -l ." in each xterm.

This is what i get on UP 2.6.20+RSDL.31 w/ X at nice 0:
-20: 0m0.244s user   0m0.156s system   0m3.113s elapsed   12.84% CPU
  0: 0m0.216s user   0m0.168s system   0m2.801s elapsed   13.70% CPU
 19: 0m0.188s user   0m0.196s system   0m3.268s elapsed   11.75% CPU

I just made this simple example up and it doesn't show the problem
too well, but you can already see the ~10% performance drop. It's
actually worse in practice, because for some apps the increased
amount of rendering is clearly visible; text areas scroll
line-by-line, content is incrementally redrawn several times etc.
This happens because an X server running at a higher priority than a
client will often get scheduled immediately after some x11 traffic
arrives; when the process priorities are equal usually the client
gets a chance to supply some more data. IOW by renicing the server
you make X almost synchronous.

This isn't specific to RSDL - it happens w/ any cpu scheduler; and
while the effects of less extreme prio differences (ie -5 instead of
-20 etc) may be less visible i also doubt they will help much.

A better approach to X interactivity might be allowing the server to
use (part of) the clients timeslice, but it's not trivial -- you'd
only want to do that when the client is waiting for a reply and you
almost never want to preempt the client just because the server
received some data.

As to RSDL - it seems to work great for desktop use and feels better
than mainline. However top output under 100% load (eg kernel
compilation) looks like below -- the %CPU error seems a bit high...

Tasks:  97 total,   6 running,  91 sleeping,   0 stopped,   0 zombie
Cpu(s): 81.7% us, 18.3% sy,  0.0% ni,  0.0% id,  0.0% wa,  0.0% hi,
 0.0% si
  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND

 7566 root      17   0  9196 4108 1188 R  3.0  0.8   0:00.09 cc1

 7499 root      11   0  1952  924  648 S  0.3  0.2   0:00.01 make

12279 root       1   0  5556 2928 2064 S  0.3  0.6   0:00.83 xterm

31510 root       1   0  2152 1100  840 R  0.3  0.2   0:00.25 top

    1 root       1   0  1584   88   60 S  0.0  0.0   0:00.30 init

artur

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  3:11                                                                       ` Linus Torvalds
  2007-03-20  6:11                                                                         ` Willy Tarreau
  2007-03-20 10:26                                                                         ` [ck] " jos poortvliet
@ 2007-03-20 13:22                                                                         ` Mark Lord
  2007-03-20 15:16                                                                           ` Ray Lee
  2 siblings, 1 reply; 198+ messages in thread
From: Mark Lord @ 2007-03-20 13:22 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Xavier Bestel, Al Boldi, Mike Galbraith, Con Kolivas, ck,
	Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

Linus Torvalds wrote:
> 
> Quite frankly, I was *planning* on merging RSDL very early after 2.6.21, 
> but there is one thing that has turned me completely off the whole thing:
> 
>  - the people involved seem to be totally unwilling to even admit there 
>    might be a problem.

Not to mention that it seems to only be tested thus far
by a very vocal and supportive core.  It needs much wider
exposure for much longer before risking it in mainline.
It likely will get there, eventually, just not yet.

I've droppped it from my machine -- interactive response is much
more important for my primary machine right now.

I believe Ingo's much simpler hack produces as good/bad results
as this RSDL thingie, and with one important extra:
it can be switched on/off at runtime.

----->forwarded message:

Subject: [patch] CFS scheduler: Completely Fair Scheduler
From: Ingo Molnar <mingo@elte.hu>

add the CONFIG_SCHED_FAIR option (default: off): this turns the Linux 
scheduler into a completely fair scheduler for SCHED_OTHER tasks: with 
perfect roundrobin scheduling, fair distribution of timeslices combined 
with no interactivity boosting and no heuristics.

a /proc/sys/kernel/sched_fair option is also available to turn
this behavior on/off.

if this option establishes itself amongst leading distributions then we
could in the future remove the interactivity estimator altogether.

Signed-off-by: Ingo Molnar <mingo@elte.hu>
---
 include/linux/sched.h  |    1 +
 kernel/Kconfig.preempt |    9 +++++++++
 kernel/sched.c         |    8 ++++++++
 kernel/sysctl.c        |   10 ++++++++++
 4 files changed, 28 insertions(+)

Index: linux/include/linux/sched.h
===================================================================
--- linux.orig/include/linux/sched.h
+++ linux/include/linux/sched.h
@@ -119,6 +119,7 @@ extern unsigned long avenrun[];		/* Load
 	load += n*(FIXED_1-exp); \
 	load >>= FSHIFT;
 
+extern unsigned int sched_fair;
 extern unsigned long total_forks;
 extern int nr_threads;
 DECLARE_PER_CPU(unsigned long, process_counts);
Index: linux/kernel/Kconfig.preempt
===================================================================
--- linux.orig/kernel/Kconfig.preempt
+++ linux/kernel/Kconfig.preempt
@@ -63,3 +63,12 @@ config PREEMPT_BKL
 	  Say Y here if you are building a kernel for a desktop system.
 	  Say N if you are unsure.
 
+config SCHED_FAIR
+	bool "Completely Fair Scheduler"
+	help
+	  This option turns the Linux scheduler into a completely fair
+	  scheduler. User-space workloads will round-robin fairly, and
+	  they have to be prioritized using nice levels.
+
+	  Say N if you are unsure.
+
Index: linux/kernel/sched.c
===================================================================
--- linux.orig/kernel/sched.c
+++ linux/kernel/sched.c
@@ -4040,6 +4040,10 @@ static inline struct task_struct *find_p
 	return pid ? find_task_by_pid(pid) : current;
 }
 
+#ifdef CONFIG_SCHED_FAIR
+unsigned int sched_fair = 1;
+#endif
+
 /* Actually do priority change: must hold rq lock. */
 static void __setscheduler(struct task_struct *p, int policy, int prio)
 {
@@ -4055,6 +4059,10 @@ static void __setscheduler(struct task_s
 	 */
 	if (policy == SCHED_BATCH)
 		p->sleep_avg = 0;
+#ifdef CONFIG_SCHED_FAIR
+	if (policy == SCHED_NORMAL && sched_fair)
+		p->sleep_avg = 0;
+#endif
 	set_load_weight(p);
 }
 
Index: linux/kernel/sysctl.c
===================================================================
--- linux.orig/kernel/sysctl.c
+++ linux/kernel/sysctl.c
@@ -205,6 +205,16 @@ static ctl_table root_table[] = {
 };
 
 static ctl_table kern_table[] = {
+#ifdef CONFIG_SCHED_FAIR
+	{
+		.ctl_name	= CTL_UNNUMBERED,
+		.procname	= "sched_fair",
+		.data		= &sched_fair,
+		.maxlen		= sizeof(int),
+		.mode		= 0644,
+		.proc_handler	= &proc_dointvec,
+	},
+#endif
 	{
 		.ctl_name	= KERN_PANIC,
 		.procname	= "panic",

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 13:22                                                                         ` Mark Lord
@ 2007-03-20 15:16                                                                           ` Ray Lee
  2007-03-20 15:20                                                                             ` Mark Lord
  2007-03-21  8:55                                                                             ` Kasper Sandberg
  0 siblings, 2 replies; 198+ messages in thread
From: Ray Lee @ 2007-03-20 15:16 UTC (permalink / raw)
  To: Mark Lord
  Cc: Linus Torvalds, Xavier Bestel, Al Boldi, Mike Galbraith,
	Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Andrew Morton

On 3/20/07, Mark Lord <lkml@rtr.ca> wrote:
> I've droppped it from my machine -- interactive response is much
> more important for my primary machine right now.

Help out with a data point? Are you running KDE as well? If you are,
then it looks like the common denominator that RSDL is handling poorly
is client-server communication. (KDE's KIO slaves in this case, but X
in general.)

If so, one would hope that a variation on Linus's 2.5.63 pipe wakeup
pass-the-interactivity idea could work here. The problem with that
original patch, IIRC, was that a couple of tasks could bounce their
interactivity bonus back and forth and thereby starve others. Which
might be expected given there was no 'decaying' of the interactivity
bonus, which means you can make a feedback loop.

Anyway, looks like processes that do A -> B -> A communication chains
are getting penalized under RSDL. In which case, perhaps I can make a
test case that exhibits the problem without having to have the same
graphics card or desktop as you.

Ray

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 15:16                                                                           ` Ray Lee
@ 2007-03-20 15:20                                                                             ` Mark Lord
  2007-03-21  8:55                                                                             ` Kasper Sandberg
  1 sibling, 0 replies; 198+ messages in thread
From: Mark Lord @ 2007-03-20 15:20 UTC (permalink / raw)
  To: ray-gmail
  Cc: Linus Torvalds, Xavier Bestel, Al Boldi, Mike Galbraith,
	Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Andrew Morton

Ray Lee wrote:
> On 3/20/07, Mark Lord <lkml@rtr.ca> wrote:
>> I've droppped it from my machine -- interactive response is much
>> more important for my primary machine right now.
> 
> Help out with a data point? Are you running KDE as well? 

Yes, KDE.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  6:11                                                                         ` Willy Tarreau
  2007-03-20  8:03                                                                           ` Mike Galbraith
  2007-03-20  9:03                                                                           ` Xavier Bestel
@ 2007-03-20 15:31                                                                           ` Linus Torvalds
  2007-03-20 18:08                                                                             ` Al Boldi
                                                                                               ` (2 more replies)
  2 siblings, 3 replies; 198+ messages in thread
From: Linus Torvalds @ 2007-03-20 15:31 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Xavier Bestel, Mark Lord, Al Boldi, Mike Galbraith, Con Kolivas,
	ck, Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

[-- Attachment #1: Type: TEXT/PLAIN, Size: 1504 bytes --]

On Tue, 20 Mar 2007, Willy Tarreau wrote:
> 
> Linus, you're unfair with Con. He initially was on this position, and lately
> worked with Mike by proposing changes to try to improve his X responsiveness.

I was not actually so much speaking about Con, as about a lot of the 
tone in general here. And yes, it's not been entirely black and white. I 
was very happy to see the "try this patch" email from Al Boldi - not 
because I think that patch per se was necessarily the right fix (I have no 
idea), but simply because I think that's the kind of mindset we need to 
have.

Not a lot of people really *like* the old scheduler, but it's been tweaked 
over the years to try to avoid some nasty behaviour. I'm really hoping 
that RSDL would be a lot better (and by all accounts it has the potential 
for that), but I think it's totally naïve to expect that it won't need 
some tweaking too.

So I'll happily still merge RSDL right after 2.6.21 (and it won't even be 
a config option - if we want to make it good, we need to make sure 
*everybody* tests it), but what I want to see is that "can do" spirit wrt 
tweaking for issues that come up.

Because let's face it - nothing is ever perfect. Even a really nice 
conceptual idea always ends up hitting the "but in real life, things are 
ugly and complex, and we've depended on behaviour X in the past and can't 
change it, so we need some tweaking for problem Y".

And everything is totally fixable - at least as long as people are willing 
to!

		Linus

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 15:31                                                                           ` Linus Torvalds
@ 2007-03-20 18:08                                                                             ` Al Boldi
  2007-03-21  8:22                                                                             ` Keith Duthie
  2007-03-28 23:43                                                                             ` Bill Davidsen
  2 siblings, 0 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-20 18:08 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Willy Tarreau, Xavier Bestel, Mark Lord, Mike Galbraith,
	Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Andrew Morton

Linus Torvalds wrote:
> I was very happy to see the "try this patch" email from Al Boldi - not
> because I think that patch per se was necessarily the right fix (I have no
> idea), 

Well, it wasn't really meant as a fix, but rather to point out that 
interactivity boosting is possible with RSDL.

It probably needs a lot more work, but just this one-liner gives an 
unbelievable ia boost.

> but simply because I think that's the kind of mindset we need to have.

Thanks.

> Not a lot of people really *like* the old scheduler, but it's been tweaked
> over the years to try to avoid some nasty behaviour. I'm really hoping
> that RSDL would be a lot better (and by all accounts it has the potential
> for that), but I think it's totally naïve to expect that it won't need
> some tweaking too.

Aside from ia boosting, I think fixed latencies per nice levels may be 
desirable, when physically possible, to allow for more deterministic 
scheduling.

> So I'll happily still merge RSDL right after 2.6.21 (and it won't even be
> a config option - if we want to make it good, we need to make sure
> *everybody* tests it), but what I want to see is that "can do" spirit wrt
> tweaking for issues that come up.
>
> Because let's face it - nothing is ever perfect. Even a really nice
> conceptual idea always ends up hitting the "but in real life, things are
> ugly and complex, and we've depended on behaviour X in the past and can't
> change it, so we need some tweaking for problem Y".
>
> And everything is totally fixable - at least as long as people are willing
> to!

Agreed.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 12:31                                                                             ` Artur Skawina
@ 2007-03-20 19:16                                                                               ` Artur Skawina
  0 siblings, 0 replies; 198+ messages in thread
From: Artur Skawina @ 2007-03-20 19:16 UTC (permalink / raw)
  Cc: Con Kolivas, linux-kernel

> than mainline. However top output under 100% load (eg kernel
> compilation) looks like below -- the %CPU error seems a bit high...

ignore that - i didn't notice ccache was involved.


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-19 20:53                                                                 ` Al Boldi
@ 2007-03-20 19:50                                                                   ` Artur Skawina
  2007-03-21  4:15                                                                     ` Al Boldi
  0 siblings, 1 reply; 198+ messages in thread
From: Artur Skawina @ 2007-03-20 19:50 UTC (permalink / raw)
  To: Al Boldi; +Cc: linux-kernel

Al Boldi wrote:
> --- sched.bak.c	2007-03-16 23:07:23.000000000 +0300
> +++ sched.c	2007-03-19 23:49:40.000000000 +0300
> @@ -938,7 +938,11 @@ static void activate_task(struct task_st
>  				     (now - p->timestamp) >> 20);
>  	}
>  
> -	p->quota = rr_quota(p);
> +	/*
> +	 * boost factor hardcoded to 5; adjust to your liking
> +	 * higher means more likely to DoS
> +	 */
> +	p->quota = rr_quota(p) + (((now - p->timestamp) >> 20) * 5);
>  	p->prio = effective_prio(p);
>  	p->timestamp = now;
>  	__activate_task(p, rq);

i've tried this and it lasted only a few minutes -- i was seeing
mouse cursor stalls lasting almost 1s. i/o bound tasks starving X?
After reverting the patch everything is smooth again.

artur


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 19:50                                                                   ` Artur Skawina
@ 2007-03-21  4:15                                                                     ` Al Boldi
  2007-03-21 17:24                                                                       ` Artur Skawina
  0 siblings, 1 reply; 198+ messages in thread
From: Al Boldi @ 2007-03-21  4:15 UTC (permalink / raw)
  To: Artur Skawina; +Cc: linux-kernel

Artur Skawina wrote:
> Al Boldi wrote:
> > --- sched.bak.c	2007-03-16 23:07:23.000000000 +0300
> > +++ sched.c	2007-03-19 23:49:40.000000000 +0300
> > @@ -938,7 +938,11 @@ static void activate_task(struct task_st
> >  				     (now - p->timestamp) >> 20);
> >  	}
> >
> > -	p->quota = rr_quota(p);
> > +	/*
> > +	 * boost factor hardcoded to 5; adjust to your liking
> > +	 * higher means more likely to DoS
> > +	 */
> > +	p->quota = rr_quota(p) + (((now - p->timestamp) >> 20) * 5);
> >  	p->prio = effective_prio(p);
> >  	p->timestamp = now;
> >  	__activate_task(p, rq);
>
> i've tried this and it lasted only a few minutes -- i was seeing
> mouse cursor stalls lasting almost 1s. i/o bound tasks starving X?
> After reverting the patch everything is smooth again.

This patch wasn't really meant for production, as any sleeping background 
proc turned cpu-hog may DoS the system.

If you like to play with this, then you probably want to at least reset the 
quota in its expiration.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  9:03                                                                           ` Xavier Bestel
  2007-03-20 12:31                                                                             ` Artur Skawina
@ 2007-03-21  7:50                                                                             ` Ingo Molnar
  2007-03-21 10:43                                                                               ` David Schwartz
  1 sibling, 1 reply; 198+ messages in thread
From: Ingo Molnar @ 2007-03-21  7:50 UTC (permalink / raw)
  To: Xavier Bestel
  Cc: Willy Tarreau, Linus Torvalds, Mark Lord, Al Boldi,
	Mike Galbraith, Con Kolivas, ck, Serge Belyshev, linux-kernel,
	Nicholas Miell, Andrew Morton

* Xavier Bestel <xavier.bestel@free.fr> wrote:

> On Tue, 2007-03-20 at 07:11 +0100, Willy Tarreau wrote:
> > I don't agree with starting to renice X to get something usable
>
> [...] Why not compensate for X design by prioritizing it a bit ?

there were multiple attempts with renicing X under the vanilla 
scheduler, and they were utter failures most of the time. _More_ people 
complained about interactivity issues _after_ X has been reniced to -5 
(or -10) than people complained about "nice 0" interactivity issues to 
begin with.

The vanilla scheduler's auto-nice feature rewards _behavior_, so it gets 
X right most of the time. The fundamental issue is that sometimes X is 
very interactive - we boost it then, there's lots of scheduling but nice 
low latencies. Sometimes it's a hog - we penalize it then and things 
start to batch up more and we get out of the overload situation faster. 
That's the case even if all you care about is desktop performance.

no doubt it's hard to get the auto-nice thing right, but one thing is 
clear: currently RSDL causes problems in areas that worked well in the 
vanilla scheduler for a long time, so RSDL needs to improve. RSDL should 
not lure itself into the false promise of 'just renice X statically'. It 
wont work. (You might want to rewrite X's request scheduling - but if so 
then i'd like to see that being done _first_, because i just dont trust 
such 10-mile-distance problem analysis.)

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 15:31                                                                           ` Linus Torvalds
  2007-03-20 18:08                                                                             ` Al Boldi
@ 2007-03-21  8:22                                                                             ` Keith Duthie
  2007-03-28 23:43                                                                             ` Bill Davidsen
  2 siblings, 0 replies; 198+ messages in thread
From: Keith Duthie @ 2007-03-21  8:22 UTC (permalink / raw)
  To: linux-kernel
  Cc: Willy Tarreau, Xavier Bestel, Mark Lord, Al Boldi,
	Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	Nicholas Miell, Andrew Morton, Linus Torvalds

Another data point: I'm getting stalls in mplayer. I'm assuming the stalls
occur when procmail runs messages through spamprobe, as the system is
otherwise idle. The stalls continue to occur (and I'm not sure that they
aren't worse) when X and/or mplayer are reniced to negative nice levels.

This is on a dual core amd64 system running 2.6.20.3 with rsdl 0.31.
Admittedly I'm also running the nvidia binary driver with X.

-- 
The universe hates you, but don't worry - it's nothing personal.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 15:16                                                                           ` Ray Lee
  2007-03-20 15:20                                                                             ` Mark Lord
@ 2007-03-21  8:55                                                                             ` Kasper Sandberg
  1 sibling, 0 replies; 198+ messages in thread
From: Kasper Sandberg @ 2007-03-21  8:55 UTC (permalink / raw)
  To: ray-gmail
  Cc: Mark Lord, Linus Torvalds, Xavier Bestel, Al Boldi,
	Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Tue, 2007-03-20 at 08:16 -0700, Ray Lee wrote:
> On 3/20/07, Mark Lord <lkml@rtr.ca> wrote:
> > I've droppped it from my machine -- interactive response is much
> > more important for my primary machine right now.
> 
> Help out with a data point? Are you running KDE as well? If you are,
> then it looks like the common denominator that RSDL is handling poorly
> is client-server communication. (KDE's KIO slaves in this case, but X
> in general.)

im not experiencing any problems with KDE. if anything ktorrent seems to
be going a teeny tiny bit smoother, though its nothing i can back up
with data.

now i havent tested ALL kioslaves yet, but stuff like sftp, fish, tar
and such works just as good.


> 
> If so, one would hope that a variation on Linus's 2.5.63 pipe wakeup
> pass-the-interactivity idea could work here. The problem with that
> original patch, IIRC, was that a couple of tasks could bounce their
> interactivity bonus back and forth and thereby starve others. Which
> might be expected given there was no 'decaying' of the interactivity
> bonus, which means you can make a feedback loop.
> 
> Anyway, looks like processes that do A -> B -> A communication chains
> are getting penalized under RSDL. In which case, perhaps I can make a
> test case that exhibits the problem without having to have the same
> graphics card or desktop as you.
An easy-to-reproduce testcase would be good.

> 
> Ray
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
> 


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [ck] Re: RSDL v0.31
  2007-03-19 20:47                                                                           ` Bill Davidsen
  2007-03-20 10:19                                                                             ` jos poortvliet
@ 2007-03-21  8:58                                                                             ` Kasper Sandberg
  1 sibling, 0 replies; 198+ messages in thread
From: Kasper Sandberg @ 2007-03-21  8:58 UTC (permalink / raw)
  To: Bill Davidsen
  Cc: Mike Galbraith, Radoslaw Szkodzinski, Al Boldi, Andrew Morton,
	linux-kernel, ck, Linus Torvalds, Nicholas Miell

On Mon, 2007-03-19 at 16:47 -0400, Bill Davidsen wrote:
> Kasper Sandberg wrote:
> > On Sun, 2007-03-18 at 08:38 +0100, Mike Galbraith wrote:
> >> On Sun, 2007-03-18 at 08:22 +0100, Radoslaw Szkodzinski wrote:
> >>
> >>> I'd recon KDE regresses because of kioslaves waiting on a pipe
> >>> (communication with the app they're doing IO for) and then expiring.
> >>> That's why splitting IO from an app isn't exactly smart. It should at
> >>> least be ran in an another thread.
> >> Hm.  Sounds rather a lot like the...
> >> X sucks, fix X and RSDL will rock your world.  RSDL is perfect.
> >> ...that I've been getting.
> >>
> > not really, only X sucks. KDE works atleast as good with rsdl as
> > vanilla. i dont know how originally said kde works worse, wasnt it just
> > someone that thought?
> > 
> It was probably me, and I had the opinion that KDE is not as smooth as 
> GNOME with RSDL. I haven't had time to measure, but using for daily 
> stuff for about an hour each way hasn't changed my opinion. Every once 
> in a while KDE will KLUNK to a halt for 200-300ms doing mundane stuff 
> like redrawing a page, scrolling, etc. I don't see it with GNOME.

umm, could you try to find something that always does it, so i can try
to reproduce? cause i dont really hit any such thing, and i only have a
2ghz amd64

> 


^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: RSDL v0.31
  2007-03-21  7:50                                                                             ` Ingo Molnar
@ 2007-03-21 10:43                                                                               ` David Schwartz
  2007-03-28 23:37                                                                                 ` Bill Davidsen
  0 siblings, 1 reply; 198+ messages in thread
From: David Schwartz @ 2007-03-21 10:43 UTC (permalink / raw)
  To: Linux-Kernel@Vger. Kernel. Org

> there were multiple attempts with renicing X under the vanilla
> scheduler, and they were utter failures most of the time. _More_ people
> complained about interactivity issues _after_ X has been reniced to -5
> (or -10) than people complained about "nice 0" interactivity issues to
> begin with.

Unfortunately, nicing X is not going to work. It causes X to pre-empt any
local process that tries to batch requests to it, defeating the batching.
What you really want is X to get scheduled after the client pauses in
sending data to it or has sent more than a certain amount. It seems kind of
crazy to put such login in a scheduler.

Perhaps when one process unblocks another, you put that other process at the
head of the run queue but don't pre-empt the currently running process. That
way, the process can continue to batch requests, but X's maximum latency
delay will be the quantum of the client program.

> The vanilla scheduler's auto-nice feature rewards _behavior_, so it gets
> X right most of the time. The fundamental issue is that sometimes X is
> very interactive - we boost it then, there's lots of scheduling but nice
> low latencies. Sometimes it's a hog - we penalize it then and things
> start to batch up more and we get out of the overload situation faster.
> That's the case even if all you care about is desktop performance.
>
> no doubt it's hard to get the auto-nice thing right, but one thing is
> clear: currently RSDL causes problems in areas that worked well in the
> vanilla scheduler for a long time, so RSDL needs to improve. RSDL should
> not lure itself into the false promise of 'just renice X statically'. It
> wont work. (You might want to rewrite X's request scheduling - but if so
> then i'd like to see that being done _first_, because i just dont trust
> such 10-mile-distance problem analysis.)

I am hopeful that there exists a heuristic that both improves this problem
and is also inherently fair. If that's true, then such a heuristic can be
added to RSDL without damaging its properties and without requiring any
special settings. Perhaps longer-term latency benefits to processes that
have yielded in the past?

I think there are certain circumstances, however, where it is inherently
reasonable to insist that 'nice' be used. If you want a CPU-starved task to
get more than 1/X of the CPU, where X is the number of CPU-starved tasks,
you should have to ask for that. If you want one CPU-starved task to get
better latency than other CPU-starved tasks, you should have to ask for
that.

Fundamentally, the scheduler cannot do it by itself. You can create cases
where the load is precisely identical and one person wants X and another
person wants Y. The scheduler cannot know what's important to you.

DS

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20  8:03                                                                           ` Mike Galbraith
@ 2007-03-21 14:57                                                                             ` Mike Galbraith
  2007-03-21 16:02                                                                               ` Peter Zijlstra
       [not found]                                                                               ` <20070321161147.54c7a727@localhost>
  0 siblings, 2 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-21 14:57 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Linus Torvalds, Xavier Bestel, Mark Lord, Al Boldi, Con Kolivas,
	ck, Serge Belyshev, Ingo Molnar, linux-kernel, Nicholas Miell,
	Andrew Morton

On Tue, 2007-03-20 at 09:03 +0100, Mike Galbraith wrote:

> Moving right along to the bugs part, I hope others are looking as well,
> and not only talking.
> 
> One area that looks pretty fishy to me is cross-cpu wakeups and task
> migration.  p->rotation appears to lose all meaning when you cross the
> cpu boundary, and try_to_wake_up()is using that information in the
> cross-cpu case.  In pull_task() OTOH, it checks to see if the task ran
> on the remote cpu (at all, hmm), and if so tags the task accordingly.

Doing the same in try_to_wake_up()delivered a counter intuitive result.
I expected sleeping tasks to suffer a bit, because when a task wakes up
on a different cpu, the chance of it being in the same rotation is
practically nil, so it would be issued a new quota when it hit
recalc_task_prio() and begin a new walk down the stairs.  In the case
where it's is told that the awakening task is running in the same
rotation (as is done in pull_task, and with the patchlet below), since
p->array isn't NULLed any more when the task is dequeued, there would be
an array (last it was queued in), there's going to be time_slice (see no
way 0 time_slice can happen, and nothing good would happen in
task_running_tick() if it could), and since per instrumentation nobody
is ever overrunning runqueue quota, it should just continue to march
down the stairs, and receive less bandwidth than the full restart.

What happened is below.

'f' is a progglet which sleeps a bit and burns a bit, duration depending
on argument given. 'sh' is a shell 100% hog.  In this scenario, the
argument was set such that 'f' used right at 50% cpu.  All are started
at the same time, and I froze top when the first 'f' reached 1:00.

virgin 2.6.21-rc3-rsdl-smp
top - 13:52:50 up 7 min, 12 users,  load average: 3.45, 2.89, 1.51

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
 6560 root      31   0  2892 1236 1032 R   82  0.1   1:50.24 1 sh
 6558 root      28   0  1428  276  228 S   42  0.0   1:00.09 1 f
 6557 root      30   0  1424  280  228 R   35  0.0   1:00.25 0 f
 6559 root      39   0  1424  276  228 R   33  0.0   0:58.36 0 f
 6420 root      23   0  2372 1068  764 R    3  0.1   0:04.68 0 top

patched as below 2.6.21-rc3-rsdl-smp
top - 14:09:28 up 6 min, 12 users,  load average: 3.52, 2.70, 1.29

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
 6517 root      38   0  2892 1240 1032 R   59  0.1   1:31.12 1 sh
 6515 root      24   0  1424  280  228 R   51  0.0   1:00.10 0 f
 6514 root      37   0  1428  280  228 R   42  0.0   1:00.58 1 f
 6516 root      24   0  1428  280  228 R   41  0.0   1:00.01 0 f
 6430 root      23   0  2372 1056  764 R    2  0.1   0:05.53 0 top

--- kernel/sched.c.org	2007-03-15 07:04:51.000000000 +0100
+++ kernel/sched.c	2007-03-21 13:55:22.000000000 +0100
@@ -1416,7 +1416,8 @@ static int try_to_wake_up(struct task_st
 	if (cpu == this_cpu) {
 		schedstat_inc(rq, ttwu_local);
 		goto out_set_cpu;
-	}
+	} else if (p->rotation == cpu_rq(cpu)->prio_rotation)
+		p->rotation = cpu_rq(this_cpu)->prio_rotation;

 	for_each_domain(this_cpu, sd) {
 		if (cpu_isset(cpu, sd->span)) {

Same test with virgin 2.6.20.3-smp for reference.
top - 14:46:10 up 18 min, 12 users,  load average: 3.70, 1.89, 1.07

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  P COMMAND
 6529 root      15   0  1424  280  228 S   54  0.0   1:00.26 1 f
 6530 root      15   0  1428  280  228 R   50  0.0   0:59.03 0 f
 6531 root      15   0  1424  280  228 R   48  0.0   0:59.29 1 f
 6532 root      25   0  2892 1240 1032 R   40  0.1   1:00.54 0 sh
 6457 root      15   0  2380 1056  764 R    1  0.1   0:02.34 1 top

I was more than a bit surprised that mainline did this well, considering
that the proggy was one someone posted long time ago to demonstrate
starvation issues with the interactivity estimator.  (source not
available unfortunately, was apparently still on my old PIII box along
with the one Willy posted when I installed opensuse 10.2 on it.  damn.
trivial thing though)

	-Mike

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21 14:57                                                                             ` Mike Galbraith
@ 2007-03-21 16:02                                                                               ` Peter Zijlstra
  2007-03-21 17:06                                                                                 ` Mike Galbraith
  2007-03-22  7:07                                                                                 ` Mike Galbraith
       [not found]                                                                               ` <20070321161147.54c7a727@localhost>
  1 sibling, 2 replies; 198+ messages in thread
From: Peter Zijlstra @ 2007-03-21 16:02 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Willy Tarreau, Linus Torvalds, Xavier Bestel, Mark Lord,
	Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Wed, 2007-03-21 at 15:57 +0100, Mike Galbraith wrote:

> 'f' is a progglet which sleeps a bit and burns a bit, duration depending
> on argument given. 'sh' is a shell 100% hog.  In this scenario, the
> argument was set such that 'f' used right at 50% cpu.  All are started
> at the same time, and I froze top when the first 'f' reached 1:00.

May one enquire how much CPU the mythical 'f' uses when ran alone? Just
to get a gauge for the numbers?




^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21 16:02                                                                               ` Peter Zijlstra
@ 2007-03-21 17:06                                                                                 ` Mike Galbraith
  2007-03-22  7:07                                                                                 ` Mike Galbraith
  1 sibling, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-21 17:06 UTC (permalink / raw)
  To: Peter Zijlstra
  Cc: Willy Tarreau, Linus Torvalds, Xavier Bestel, Mark Lord,
	Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Wed, 2007-03-21 at 17:02 +0100, Peter Zijlstra wrote:
> On Wed, 2007-03-21 at 15:57 +0100, Mike Galbraith wrote:
> 
> > 'f' is a progglet which sleeps a bit and burns a bit, duration depending
> > on argument given. 'sh' is a shell 100% hog.  In this scenario, the
> > argument was set such that 'f' used right at 50% cpu.  All are started
> > at the same time, and I froze top when the first 'f' reached 1:00.
> 
> May one enquire how much CPU the mythical 'f' uses when ran alone? Just
> to get a gauge for the numbers?

Right at 50%

	-Mike

(mythical?  i can send you the binary if you want)


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
       [not found]                                                                               ` <20070321161147.54c7a727@localhost>
@ 2007-03-21 17:07                                                                                 ` Mike Galbraith
  2007-03-22  4:49                                                                                   ` Willy Tarreau
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-21 17:07 UTC (permalink / raw)
  To: Paolo Ornati
  Cc: Willy Tarreau, Linus Torvalds, Xavier Bestel, Mark Lord,
	Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Wed, 2007-03-21 at 16:11 +0100, Paolo Ornati wrote:
> On Wed, 21 Mar 2007 15:57:44 +0100
> Mike Galbraith <efault@gmx.de> wrote:
> 
> > I was more than a bit surprised that mainline did this well, considering
> > that the proggy was one someone posted long time ago to demonstrate
> > starvation issues with the interactivity estimator.  (source not
> > available unfortunately, was apparently still on my old PIII box along
> > with the one Willy posted when I installed opensuse 10.2 on it.  damn.
> > trivial thing though)
> 
> This one?  :)

No, but that one went to bit heaven too ;-)

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21  4:15                                                                     ` Al Boldi
@ 2007-03-21 17:24                                                                       ` Artur Skawina
  0 siblings, 0 replies; 198+ messages in thread
From: Artur Skawina @ 2007-03-21 17:24 UTC (permalink / raw)
  To: Al Boldi; +Cc: linux-kernel, Con Kolivas

Al Boldi wrote:
> Artur Skawina wrote:
>> Al Boldi wrote:
>>> -	p->quota = rr_quota(p);
>>> +	/*
>>> +	 * boost factor hardcoded to 5; adjust to your liking
>>> +	 * higher means more likely to DoS
>>> +	 */
>>> +	p->quota = rr_quota(p) + (((now - p->timestamp) >> 20) * 5);

>> mouse cursor stalls lasting almost 1s. i/o bound tasks starving X?
>> After reverting the patch everything is smooth again.
> 
> This patch wasn't really meant for production, as any sleeping background 
> proc turned cpu-hog may DoS the system.
> 
> If you like to play with this, then you probably want to at least reset the 
> quota in its expiration.

well, the problem is that i can't reproduce the problem :) I tried
the patch because i suspected it could introduce regressions, and it
did. Maybe with some tuning a reasonable compromise could be found,
but first we need to know what to tune for... Does anybody have a
simple reproducible way to show the scheduling regressions of RSDL
vs mainline? ie one that does not involve (or at least is
independent of) specific X drivers, binary apps etc. Some reports
mentioned MP, is UP less susceptible?

I've now tried a -j2 kernel compilation on UP and in the not niced
case X interactivity suffers, which i guess is to be expected when
you have ~5 processes competing for one cpu (2*(cc+as)+X). "nice -5"
helps a bit, but does not eliminate the effect completely. Obviously
the right solution is to nice the makes, but i think the scheduler
could do better, at least in the case of almost idle X (once you
start moving windows etc it becomes a cpuhog just as the the
compiler). I'll look into this, maybe there's a way to prioritize
often sleeping tasks which can not be abused.
Another thing is the nice levels; right now "nice -10" means ~35%
and "nice -19" gives ~5% cpu; that's probably 2..5 times too much.

artur

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21 17:07                                                                                 ` Mike Galbraith
@ 2007-03-22  4:49                                                                                   ` Willy Tarreau
  2007-03-22  7:14                                                                                     ` Mike Galbraith
  0 siblings, 1 reply; 198+ messages in thread
From: Willy Tarreau @ 2007-03-22  4:49 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Paolo Ornati, Linus Torvalds, Xavier Bestel, Mark Lord, Al Boldi,
	Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Andrew Morton

On Wed, Mar 21, 2007 at 06:07:33PM +0100, Mike Galbraith wrote:
> On Wed, 2007-03-21 at 16:11 +0100, Paolo Ornati wrote:
> > On Wed, 21 Mar 2007 15:57:44 +0100
> > Mike Galbraith <efault@gmx.de> wrote:
> > 
> > > I was more than a bit surprised that mainline did this well, considering
> > > that the proggy was one someone posted long time ago to demonstrate
> > > starvation issues with the interactivity estimator.  (source not
> > > available unfortunately, was apparently still on my old PIII box along
> > > with the one Willy posted when I installed opensuse 10.2 on it.  damn.
> > > trivial thing though)
> > 
> > This one?  :)
> 
> No, but that one went to bit heaven too ;-)

Mike, if you need my old scheddos, I can resend it to you as well as to
any people working on the scheduler and asking for it. Although trivial,
I'm a bit reluctant to publish it to the whole world because I suspect
that distros based on older kernels are still vulnerable and the fixes
may not be easy. Anyway, it has absolutely no effect on non-interactive
schedulers.

Regards,
Willy


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21 16:02                                                                               ` Peter Zijlstra
  2007-03-21 17:06                                                                                 ` Mike Galbraith
@ 2007-03-22  7:07                                                                                 ` Mike Galbraith
  2007-03-22  9:18                                                                                   ` Ingo Molnar
  2007-03-22 22:50                                                                                   ` Con Kolivas
  1 sibling, 2 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-22  7:07 UTC (permalink / raw)
  To: Peter Zijlstra
  Cc: Willy Tarreau, Linus Torvalds, Xavier Bestel, Mark Lord,
	Al Boldi, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

This is a rather long message, and isn't directed at anyone in
particular, it's for others who may be digging into their own problems
with RSDL, and for others (if any other than Con exist) who understand
RSDL well enough to tell me if I'm missing something.  Anyone who's not
interested in RSDL's gizzard hit 'D' now.

On Wed, 2007-03-21 at 17:02 +0100, Peter Zijlstra wrote:
> On Wed, 2007-03-21 at 15:57 +0100, Mike Galbraith wrote:
> 
> > 'f' is a progglet which sleeps a bit and burns a bit, duration depending
> > on argument given. 'sh' is a shell 100% hog.  In this scenario, the
> > argument was set such that 'f' used right at 50% cpu.  All are started
> > at the same time, and I froze top when the first 'f' reached 1:00.
> 
> May one enquire how much CPU the mythical 'f' uses when ran alone? Just
> to get a gauge for the numbers?

Actually, the numbers are an interesting curiosity point, but not as
interesting as the fact that the deadline mechanism isn't kicking in.

>From task_running_tick():
	/*
	 * Accounting is performed by both the task and the runqueue. This
	 * allows frequently sleeping tasks to get their proper quota of
	 * cpu as the runqueue will have their quota still available at
	 * the appropriate priority level. It also means frequently waking
	 * tasks that might miss the scheduler_tick() will get forced down
	 * priority regardless.
	 */
	if (!--p->time_slice)
		task_expired_entitlement(rq, p);
	/*
	 * We only employ the deadline mechanism if we run over the quota.
	 * It allows aliasing problems around the scheduler_tick to be
	 * less harmful.
	 */
	if (!rt_task(p) && --rq_quota(rq, rq->prio_level) < 0) {
		if (unlikely(p->first_time_slice))
			p->first_time_slice = 0;
		rotate_runqueue_priority(rq);
		set_tsk_need_resched(p);
	}

The reason for ticking both runqueue and task is that you can't sample a
say 100KHz information stream at 1KHz and reproduce that information
accurately.  IOW, task time slices "blur" at high switch frequency, you
can't always hit tasks, so you hit what you _can_ hit every sample, the
runqueue, to minimize the theoretical effects of time slice theft.
(I've instrumented this before, and caught fast movers stealing 10s of
milliseconds in extreme cases.)  Generally speaking, statistics even
things out very much, the fast mover eventually gets hit, and pays a
full tick for his sub-tick dip in the pool, so in practice it's not a
great big hairy deal.

If you can accept that tasks can and do dodge the tick, an imbalance
between runqueue quota and task quota must occur.  It isn't happening
here, and the reason appears to be bean counting error, tasks migrate
but their quotas don't follow.  The first time a task is queued at any
priority, quota is allocated, task goes to sleep, quota on departed
runqueue stays behind, task awakens on a different runqueue, allocate
more quota, repeat.  For migration, there's twist, if you pull an
expired task, expired tasks don't have a quota yet, so they shouldn't
screw up bean counting.

>From pull_task():
	/*
	 * If this task has already been running on src_rq this priority
	 * cycle, make the new runqueue think it has been on its cycle
	 */
	if (p->rotation == src_rq->prio_rotation)
		p->rotation = this_rq->prio_rotation;

The intent here is clearly that this task continue on the new cpu as if
nothing has happened.  However, when the task was dequeued, p->array was
left as it was, points to the last place it was queued.  Stale data.

>From recalc_task_prio(), which is called by enqueue_task():
static void recalc_task_prio(struct task_struct *p, struct rq *rq)
{
	struct prio_array *array = rq->active;
	int queue_prio, search_prio;

	if (p->rotation == rq->prio_rotation) {
		if (p->array == array) {
			if (p->time_slice && rq_quota(rq, p->prio))
				return;
		} else if (p->array == rq->expired) {
			queue_expired(p, rq);
			return;
		} else
			task_new_array(p, rq);
	} else
		task_new_array(p, rq);
	search_prio = p->static_prio;

p->rotation was set to this runqueue's prio_rotation, but p->array is
stale, still points to the old cpu's runqueue, so...

static inline void task_new_array(struct task_struct *p, struct rq *rq)
{
	bitmap_zero(p->bitmap, PRIO_RANGE);
	p->rotation = rq->prio_rotation;
}

p->bitmap is the history of all priorities where this task has been
allocated a quota.  Here, that history is erased, so the task can't
continue it's staircase walk.  It is instead given a new runqueue quota
and time_slice (didn't it just gain ticks?).  Now, what if a cross-cpu
wakeup or migrating task _didn't_ have a stale array pointer, rather it
pointed to the current active array, _and_ there were no quota available
at it's priority?  It didn't bring any of it's old quota with it after
all.  In that case, it would fall through as well, and get the full new
time_slice and runqueue quota treatment.

The next logical step if the above is correct is obvious.  If it's not,
really bad things are going to happen here shortly :)

	-Mike (broom and dustpan at the ready)

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  4:49                                                                                   ` Willy Tarreau
@ 2007-03-22  7:14                                                                                     ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-22  7:14 UTC (permalink / raw)
  To: Willy Tarreau
  Cc: Paolo Ornati, Linus Torvalds, Xavier Bestel, Mark Lord, Al Boldi,
	Con Kolivas, ck, Serge Belyshev, Ingo Molnar, linux-kernel,
	Nicholas Miell, Andrew Morton

On Thu, 2007-03-22 at 05:49 +0100, Willy Tarreau wrote:

> Mike, if you need my old scheddos, I can resend it to you as well as to
> any people working on the scheduler and asking for it. Although trivial,
> I'm a bit reluctant to publish it to the whole world because I suspect
> that distros based on older kernels are still vulnerable and the fixes
> may not be easy. Anyway, it has absolutely no effect on non-interactive
> schedulers.

Sure.  I'm really irked that I lost most of my collection of posted
exploits.  I prefer to test with the same widget the poster tested
with.  

(in this particular case, it's not _very_ important which one i test
with, i'm exploring my RSDL troubles with sleepers in general...
duration matters though, need to be fairly short/short burn.)

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  7:07                                                                                 ` Mike Galbraith
@ 2007-03-22  9:18                                                                                   ` Ingo Molnar
  2007-03-22  9:34                                                                                     ` Mike Galbraith
  2007-03-22 22:03                                                                                     ` Con Kolivas
  2007-03-22 22:50                                                                                   ` Con Kolivas
  1 sibling, 2 replies; 198+ messages in thread
From: Ingo Molnar @ 2007-03-22  9:18 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, Con Kolivas, ck, Serge Belyshev,
	linux-kernel, Nicholas Miell, Andrew Morton


* Mike Galbraith <efault@gmx.de> wrote:

> Actually, the numbers are an interesting curiosity point, but not as 
> interesting as the fact that the deadline mechanism isn't kicking in.

it's not just the scheduling accounting being off, RSDL also seems to be 
accessing stale data here:

> >From pull_task():
> 	/*
> 	 * If this task has already been running on src_rq this priority
> 	 * cycle, make the new runqueue think it has been on its cycle
> 	 */
> 	if (p->rotation == src_rq->prio_rotation)
> 		p->rotation = this_rq->prio_rotation;
> 
> The intent here is clearly that this task continue on the new cpu as
> if nothing has happened.  However, when the task was dequeued,
> p->array was left as it was, points to the last place it was queued.
> Stale data.

it might point to a hot-unplugged CPU's runqueue as well. Which might 
work accidentally, but we want this fixed nevertheless.

	Ingo

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  9:18                                                                                   ` Ingo Molnar
@ 2007-03-22  9:34                                                                                     ` Mike Galbraith
  2007-03-22  9:41                                                                                       ` Mike Galbraith
  2007-03-22 22:03                                                                                     ` Con Kolivas
  1 sibling, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-22  9:34 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, Con Kolivas, ck, Serge Belyshev,
	linux-kernel, Nicholas Miell, Andrew Morton

On Thu, 2007-03-22 at 10:18 +0100, Ingo Molnar wrote:
> * Mike Galbraith <efault@gmx.de> wrote:
> 
> > Actually, the numbers are an interesting curiosity point, but not as 
> > interesting as the fact that the deadline mechanism isn't kicking in.
> 
> it's not just the scheduling accounting being off, RSDL also seems to be 
> accessing stale data here:
> 
> > >From pull_task():
> > 	/*
> > 	 * If this task has already been running on src_rq this priority
> > 	 * cycle, make the new runqueue think it has been on its cycle
> > 	 */
> > 	if (p->rotation == src_rq->prio_rotation)
> > 		p->rotation = this_rq->prio_rotation;
> > 
> > The intent here is clearly that this task continue on the new cpu as
> > if nothing has happened.  However, when the task was dequeued,
> > p->array was left as it was, points to the last place it was queued.
> > Stale data.
> 
> it might point to a hot-unplugged CPU's runqueue as well. Which might 
> work accidentally, but we want this fixed nevertheless.

Erk!  I mentioned to Con offline that I've seen RSDL bring up only one
of my two (halves of a) penguins a couple three times out of a zillion
boots.  Maybe that's why?

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  9:34                                                                                     ` Mike Galbraith
@ 2007-03-22  9:41                                                                                       ` Mike Galbraith
  0 siblings, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-22  9:41 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, Con Kolivas, ck, Serge Belyshev,
	linux-kernel, Nicholas Miell, Andrew Morton

On Thu, 2007-03-22 at 10:34 +0100, Mike Galbraith wrote:

> Erk!

bzzt.  singletasking brain :)


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  9:18                                                                                   ` Ingo Molnar
  2007-03-22  9:34                                                                                     ` Mike Galbraith
@ 2007-03-22 22:03                                                                                     ` Con Kolivas
  1 sibling, 0 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-22 22:03 UTC (permalink / raw)
  To: Ingo Molnar
  Cc: Mike Galbraith, Peter Zijlstra, Willy Tarreau, Linus Torvalds,
	Xavier Bestel, Mark Lord, Al Boldi, ck, Serge Belyshev,
	linux-kernel, Nicholas Miell, Andrew Morton

All code reviews are most welcome indeed!

On Thursday 22 March 2007 20:18, Ingo Molnar wrote:
> * Mike Galbraith <efault@gmx.de> wrote:
> > Actually, the numbers are an interesting curiosity point, but not as
> > interesting as the fact that the deadline mechanism isn't kicking in.
>
> it's not just the scheduling accounting being off, RSDL also seems to be

I'll look at that when I have time.

> accessing stale data here:
> > >From pull_task():
> >
> > 	/*
> > 	 * If this task has already been running on src_rq this priority
> > 	 * cycle, make the new runqueue think it has been on its cycle
> > 	 */
> > 	if (p->rotation == src_rq->prio_rotation)
> > 		p->rotation = this_rq->prio_rotation;
> >
> > The intent here is clearly that this task continue on the new cpu as
> > if nothing has happened.  However, when the task was dequeued,
> > p->array was left as it was, points to the last place it was queued.
> > Stale data.

I don't think this is a problem because immediately after this in pull_task it 
calls enqueue_task() which always updates p->array in recalc_task_prio(). 
Every enqueue_task always calls recalc_task_prio on non-rt tasks so the array 
should always be set no matter where the entry point to scheduling is from 
unless I have a logic error in setting the p->array in recalc_task_prio() or 
there is another path to schedule() that I've not accounted for by making 
sure recalc_task_prio is done.

> it might point to a hot-unplugged CPU's runqueue as well. Which might
> work accidentally, but we want this fixed nevertheless.

The hot unplugged cpu's prio_rotation will be examined, and then it sets the 
prio_rotation from this runqueue's value. That shouldn't lead to any more 
problems than setting the timestamp based on the hot unplug cpus timestamp 
lower down also in pull_task()

p->timestamp = (p->timestamp - src_rq->most_recent_timestamp) +  
this_rq->most_recent_timestamp;

Thanks for looking!

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22  7:07                                                                                 ` Mike Galbraith
  2007-03-22  9:18                                                                                   ` Ingo Molnar
@ 2007-03-22 22:50                                                                                   ` Con Kolivas
  2007-03-23  4:39                                                                                     ` Mike Galbraith
  1 sibling, 1 reply; 198+ messages in thread
From: Con Kolivas @ 2007-03-22 22:50 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

Thanks for taking the time to actually look at the code. All audits are most 
welcome!.

On Thursday 22 March 2007 18:07, Mike Galbraith wrote:
> This is a rather long message, and isn't directed at anyone in
> particular, it's for others who may be digging into their own problems
> with RSDL, and for others (if any other than Con exist) who understand
> RSDL well enough to tell me if I'm missing something.  Anyone who's not
> interested in RSDL's gizzard hit 'D' now.
>
> On Wed, 2007-03-21 at 17:02 +0100, Peter Zijlstra wrote:
> > On Wed, 2007-03-21 at 15:57 +0100, Mike Galbraith wrote:
> > > 'f' is a progglet which sleeps a bit and burns a bit, duration
> > > depending on argument given. 'sh' is a shell 100% hog.  In this
> > > scenario, the argument was set such that 'f' used right at 50% cpu. 
> > > All are started at the same time, and I froze top when the first 'f'
> > > reached 1:00.
> >
> > May one enquire how much CPU the mythical 'f' uses when ran alone? Just
> > to get a gauge for the numbers?
>
> Actually, the numbers are an interesting curiosity point, but not as
> interesting as the fact that the deadline mechanism isn't kicking in.
>
> >From task_running_tick():
>
> 	/*
> 	 * Accounting is performed by both the task and the runqueue. This
> 	 * allows frequently sleeping tasks to get their proper quota of
> 	 * cpu as the runqueue will have their quota still available at
> 	 * the appropriate priority level. It also means frequently waking
> 	 * tasks that might miss the scheduler_tick() will get forced down
> 	 * priority regardless.
> 	 */
> 	if (!--p->time_slice)
> 		task_expired_entitlement(rq, p);
> 	/*
> 	 * We only employ the deadline mechanism if we run over the quota.
> 	 * It allows aliasing problems around the scheduler_tick to be
> 	 * less harmful.
> 	 */
> 	if (!rt_task(p) && --rq_quota(rq, rq->prio_level) < 0) {
> 		if (unlikely(p->first_time_slice))
> 			p->first_time_slice = 0;
> 		rotate_runqueue_priority(rq);
> 		set_tsk_need_resched(p);
> 	}
>
> The reason for ticking both runqueue and task is that you can't sample a
> say 100KHz information stream at 1KHz and reproduce that information
> accurately.  IOW, task time slices "blur" at high switch frequency, you
> can't always hit tasks, so you hit what you _can_ hit every sample, the
> runqueue, to minimize the theoretical effects of time slice theft.
> (I've instrumented this before, and caught fast movers stealing 10s of
> milliseconds in extreme cases.)  Generally speaking, statistics even
> things out very much, the fast mover eventually gets hit, and pays a
> full tick for his sub-tick dip in the pool, so in practice it's not a
> great big hairy deal.
>
> If you can accept that tasks can and do dodge the tick, an imbalance
> between runqueue quota and task quota must occur.  It isn't happening
> here, and the reason appears to be bean counting error, tasks migrate
> but their quotas don't follow.  The first time a task is queued at any
> priority, quota is allocated, task goes to sleep, quota on departed
> runqueue stays behind, task awakens on a different runqueue, allocate
> more quota, repeat.  For migration, there's twist, if you pull an
> expired task, expired tasks don't have a quota yet, so they shouldn't
> screw up bean counting.

I had considered the quota not migrating to the new runqueue but basically it 
screws up the "set quota once and deadline only kicks in if absolutely 
necessary" policy. Migration means some extra quota is left behind on the 
runqueue it left from. It is never a huge extra quota and is reset on major 
rotation which occurs very frequently on rsdl. If I was to carry the quota 
over I would need to deduct p->time_slice from the source runqueue's quota, 
and add it to the target runqueue's quota. The problem there is that once the 
time_slice has been handed out to a task it is my position that I no longer 
trust the task to keep its accounting right and may well have exhausted all 
its quota from the source runqueue and is pulling quota away from tasks that 
haven't used theirs yet.

See below for more on updating prio rotation and adding quota to new runqueue.

>
> >From pull_task():
>
> 	/*
> 	 * If this task has already been running on src_rq this priority
> 	 * cycle, make the new runqueue think it has been on its cycle
> 	 */
> 	if (p->rotation == src_rq->prio_rotation)
> 		p->rotation = this_rq->prio_rotation;
>
> The intent here is clearly that this task continue on the new cpu as if
> nothing has happened.  However, when the task was dequeued, p->array was
> left as it was, points to the last place it was queued.  Stale data.
>
> >From recalc_task_prio(), which is called by enqueue_task():
>
> static void recalc_task_prio(struct task_struct *p, struct rq *rq)
> {
> 	struct prio_array *array = rq->active;
> 	int queue_prio, search_prio;
>
> 	if (p->rotation == rq->prio_rotation) {
> 		if (p->array == array) {
> 			if (p->time_slice && rq_quota(rq, p->prio))
> 				return;
> 		} else if (p->array == rq->expired) {
> 			queue_expired(p, rq);
> 			return;
> 		} else
> 			task_new_array(p, rq);
> 	} else
> 		task_new_array(p, rq);
> 	search_prio = p->static_prio;
> p->rotation was set to this runqueue's prio_rotation, but p->array is
> stale, still points to the old cpu's runqueue, so...
>
> static inline void task_new_array(struct task_struct *p, struct rq *rq)
> {
> 	bitmap_zero(p->bitmap, PRIO_RANGE);
> 	p->rotation = rq->prio_rotation;
> }
>
> p->bitmap is the history of all priorities where this task has been
> allocated a quota.  Here, that history is erased, so the task can't
> continue it's staircase walk.  It is instead given a new runqueue quota
> and time_slice (didn't it just gain ticks?).  Now, what if a cross-cpu
> wakeup or migrating task _didn't_ have a stale array pointer, rather it
> pointed to the current active array, _and_ there were no quota available
> at it's priority?  It didn't bring any of it's old quota with it after
> all.  In that case, it would fall through as well, and get the full new
> time_slice and runqueue quota treatment.

Cross cpu migrating task can't have p->array pointing to the new runqueue's 
active array by any means, but fork and friends could. The other point about 
cross cpu history having the wrong effect though is most valid. Good 
spotting! While it's unlikely this could cause an oops... you never know so 
hopefully fixing this will fix Andy's bug.

> The next logical step if the above is correct is obvious.  If it's not,
> really bad things are going to happen here shortly :)

Now to figure out some meaningful cheap way of improving this accounting.

> 	-Mike (broom and dustpan at the ready)

Thanks again!

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-22 22:50                                                                                   ` Con Kolivas
@ 2007-03-23  4:39                                                                                     ` Mike Galbraith
  2007-03-23  5:59                                                                                       ` Con Kolivas
  0 siblings, 1 reply; 198+ messages in thread
From: Mike Galbraith @ 2007-03-23  4:39 UTC (permalink / raw)
  To: Con Kolivas
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Fri, 2007-03-23 at 09:50 +1100, Con Kolivas wrote:

> Now to figure out some meaningful cheap way of improving this accounting.

The accounting is easy iff tick resolution is good enough, the deadline
mechanism is harder.  I did the "quota follows task" thing, but nothing
good happens.  That just ensured that the deadline mechanism kicks in
constantly because tick theft is a fact of tick-based life.  A
reasonable fudge factor would help, but...

I see problems wrt with trying to implement the deadline mechanism.

As implemented, it can't identify who is doing the stealing (which
happens constantly, even if userland if 100% hog) because of tick
resolution accounting.  If you can't identify the culprit, you can't
enforce the quota, and quotas which are not enforced are, strictly
speaking, not quotas.  At tick time, you can only close the barn door
after the cow has been stolen, and the thief can theoretically visit
your barn an infinite number of times while you aren't watching the
door.  ("don't blink" scenarios, and tick is backward-assward blink)

You can count nanoseconds in schedule, and store the actual usage, but
then you still have the problem of inaccuracies in sched_clock() from
cross-cpu wakeup and migration.  Cross-cpu wakeups happen quite a lot.
If sched_clock() _were_ absolutely accurate, you wouldn't need the
runqueue deadline mechanism, because at slice tick time you can see
everything you will ever see without moving enforcement directly into
the most critical of paths.

IMHO, unless it can be demonstrated that timeslice theft is a problem
with a real-life scenario, you'd be better off dropping the queue
ticking.  Time slices are a deadline mechanism, and in practice the god
of randomness ensures that even fast movers do get caught often enough
to make ticking tasks sufficient.

(that was a very long-winded reply to one sentence because I spent a lot
of time looking into this very subject and came to the conclusion that
you can't get there from here.  fwiw, ymmv and all that of course;)

> Thanks again!

You're welcome.

	-Mike

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-23  4:39                                                                                     ` Mike Galbraith
@ 2007-03-23  5:59                                                                                       ` Con Kolivas
  2007-03-23  6:11                                                                                         ` Mike Galbraith
  2007-03-23 12:17                                                                                         ` Mike Galbraith
  0 siblings, 2 replies; 198+ messages in thread
From: Con Kolivas @ 2007-03-23  5:59 UTC (permalink / raw)
  To: Mike Galbraith
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Friday 23 March 2007 15:39, Mike Galbraith wrote:
> On Fri, 2007-03-23 at 09:50 +1100, Con Kolivas wrote:
> > Now to figure out some meaningful cheap way of improving this accounting.
>
> The accounting is easy iff tick resolution is good enough, the deadline
> mechanism is harder.  I did the "quota follows task" thing, but nothing
> good happens.  That just ensured that the deadline mechanism kicks in
> constantly because tick theft is a fact of tick-based life.  A
> reasonable fudge factor would help, but...
>
> I see problems wrt with trying to implement the deadline mechanism.
>
> As implemented, it can't identify who is doing the stealing (which
> happens constantly, even if userland if 100% hog) because of tick
> resolution accounting.  If you can't identify the culprit, you can't
> enforce the quota, and quotas which are not enforced are, strictly
> speaking, not quotas.  At tick time, you can only close the barn door
> after the cow has been stolen, and the thief can theoretically visit
> your barn an infinite number of times while you aren't watching the
> door.  ("don't blink" scenarios, and tick is backward-assward blink)
>
> You can count nanoseconds in schedule, and store the actual usage, but
> then you still have the problem of inaccuracies in sched_clock() from
> cross-cpu wakeup and migration.  Cross-cpu wakeups happen quite a lot.
> If sched_clock() _were_ absolutely accurate, you wouldn't need the
> runqueue deadline mechanism, because at slice tick time you can see
> everything you will ever see without moving enforcement directly into
> the most critical of paths.
>
> IMHO, unless it can be demonstrated that timeslice theft is a problem
> with a real-life scenario, you'd be better off dropping the queue
> ticking.  Time slices are a deadline mechanism, and in practice the god
> of randomness ensures that even fast movers do get caught often enough
> to make ticking tasks sufficient.
>
> (that was a very long-winded reply to one sentence because I spent a lot
> of time looking into this very subject and came to the conclusion that
> you can't get there from here.  fwiw, ymmv and all that of course;)
>
> > Thanks again!
>
> You're welcome.

The deadline mechanism is easy to hit and works. Try printk'ing it. There is 
some leeway to take tick accounting into the equation and I don't believe 
nanosecond resolution is required at all for this (how much leeway would you 
give then ;)). Eventually there is nothing to stop us using highres timers 
(blessed if they work as planned everywhere eventually) to do the events and 
do away with scheduler_tick entirely. For now ticks works fine; a reasonable 
estimate for smp migration will suffice (patch forthcoming).

-- 
-ck

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-23  5:59                                                                                       ` Con Kolivas
@ 2007-03-23  6:11                                                                                         ` Mike Galbraith
  2007-03-23 12:17                                                                                         ` Mike Galbraith
  1 sibling, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-23  6:11 UTC (permalink / raw)
  To: Con Kolivas
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Fri, 2007-03-23 at 16:59 +1100, Con Kolivas wrote:

> The deadline mechanism is easy to hit and works. Try printk'ing it.

Hm.  I did (.30), and it didn't in an hours time doing this and that.
After I did the take your quota with you, it did kick in.  Lots.

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-23  5:59                                                                                       ` Con Kolivas
  2007-03-23  6:11                                                                                         ` Mike Galbraith
@ 2007-03-23 12:17                                                                                         ` Mike Galbraith
  1 sibling, 0 replies; 198+ messages in thread
From: Mike Galbraith @ 2007-03-23 12:17 UTC (permalink / raw)
  To: Con Kolivas
  Cc: Peter Zijlstra, Willy Tarreau, Linus Torvalds, Xavier Bestel,
	Mark Lord, Al Boldi, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

On Fri, 2007-03-23 at 16:59 +1100, Con Kolivas wrote:
> 
> The deadline mechanism is easy to hit and works. Try printk'ing it.

I tried rc4-rsdl.33, and in a log that's 782kb, there is only one
instance of an overrun, which I created.  On my box, it's dead code.  

	-Mike


^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-21 10:43                                                                               ` David Schwartz
@ 2007-03-28 23:37                                                                                 ` Bill Davidsen
  2007-03-29  7:10                                                                                   ` David Schwartz
  0 siblings, 1 reply; 198+ messages in thread
From: Bill Davidsen @ 2007-03-28 23:37 UTC (permalink / raw)
  To: davids; +Cc: Linux-Kernel@Vger. Kernel. Org

David Schwartz wrote:
>> there were multiple attempts with renicing X under the vanilla
>> scheduler, and they were utter failures most of the time. _More_ people
>> complained about interactivity issues _after_ X has been reniced to -5
>> (or -10) than people complained about "nice 0" interactivity issues to
>> begin with.
> 
> Unfortunately, nicing X is not going to work. It causes X to pre-empt any
> local process that tries to batch requests to it, defeating the batching.
> What you really want is X to get scheduled after the client pauses in
> sending data to it or has sent more than a certain amount. It seems kind of
> crazy to put such login in a scheduler.
> 
> Perhaps when one process unblocks another, you put that other process at the
> head of the run queue but don't pre-empt the currently running process. That
> way, the process can continue to batch requests, but X's maximum latency
> delay will be the quantum of the client program.

In general I think that's the right idea. See below for more...
> 
>> The vanilla scheduler's auto-nice feature rewards _behavior_, so it gets
>> X right most of the time. The fundamental issue is that sometimes X is
>> very interactive - we boost it then, there's lots of scheduling but nice
>> low latencies. Sometimes it's a hog - we penalize it then and things
>> start to batch up more and we get out of the overload situation faster.
>> That's the case even if all you care about is desktop performance.
>>
>> no doubt it's hard to get the auto-nice thing right, but one thing is
>> clear: currently RSDL causes problems in areas that worked well in the
>> vanilla scheduler for a long time, so RSDL needs to improve. RSDL should
>> not lure itself into the false promise of 'just renice X statically'. It
>> wont work. (You might want to rewrite X's request scheduling - but if so
>> then i'd like to see that being done _first_, because i just dont trust
>> such 10-mile-distance problem analysis.)
> 
> I am hopeful that there exists a heuristic that both improves this problem
> and is also inherently fair. If that's true, then such a heuristic can be
> added to RSDL without damaging its properties and without requiring any
> special settings. Perhaps longer-term latency benefits to processes that
> have yielded in the past?
> 
> I think there are certain circumstances, however, where it is inherently
> reasonable to insist that 'nice' be used. If you want a CPU-starved task to
> get more than 1/X of the CPU, where X is the number of CPU-starved tasks,
> you should have to ask for that. If you want one CPU-starved task to get
> better latency than other CPU-starved tasks, you should have to ask for
> that.

I agree for giving a process more than a fair share, but I don't think 
"latency" is the best term for what you describe later. If you think of 
latency as the time between a process unblocking and the time when it 
gets CPU, that is a more traditional interpretation. I'm not really sure 
latency and CPU-starved are compatible.

I would like to see processes at the head of the queue (for latency) 
which were blocked for long term events, keyboard input, network input, 
mouse input, etc. Then processes blocked for short term events like 
disk, then processes which exhausted their time slice. This helps 
latency and responsiveness, while keeping all processes running.

A variation is to give those processes at the head of the queue short
> 
> Fundamentally, the scheduler cannot do it by itself. You can create cases
> where the load is precisely identical and one person wants X and another
> person wants Y. The scheduler cannot know what's important to you.
> 
> DS
> 
> 


-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-20 15:31                                                                           ` Linus Torvalds
  2007-03-20 18:08                                                                             ` Al Boldi
  2007-03-21  8:22                                                                             ` Keith Duthie
@ 2007-03-28 23:43                                                                             ` Bill Davidsen
  2 siblings, 0 replies; 198+ messages in thread
From: Bill Davidsen @ 2007-03-28 23:43 UTC (permalink / raw)
  To: Linus Torvalds
  Cc: Willy Tarreau, Xavier Bestel, Mark Lord, Al Boldi,
	Mike Galbraith, Con Kolivas, ck, Serge Belyshev, Ingo Molnar,
	linux-kernel, Nicholas Miell, Andrew Morton

Linus Torvalds wrote:
> 
> On Tue, 20 Mar 2007, Willy Tarreau wrote:
>> Linus, you're unfair with Con. He initially was on this position, and lately
>> worked with Mike by proposing changes to try to improve his X responsiveness.
> 
> I was not actually so much speaking about Con, as about a lot of the 
> tone in general here. And yes, it's not been entirely black and white. I 
> was very happy to see the "try this patch" email from Al Boldi - not 
> because I think that patch per se was necessarily the right fix (I have no 
> idea), but simply because I think that's the kind of mindset we need to 
> have.
> 
> Not a lot of people really *like* the old scheduler, but it's been tweaked 
> over the years to try to avoid some nasty behaviour. I'm really hoping 
> that RSDL would be a lot better (and by all accounts it has the potential 
> for that), but I think it's totally naïve to expect that it won't need 
> some tweaking too.
> 
> So I'll happily still merge RSDL right after 2.6.21 (and it won't even be 
> a config option - if we want to make it good, we need to make sure 
> *everybody* tests it), but what I want to see is that "can do" spirit wrt 
> tweaking for issues that come up.
> 
May I suggest that if you want proper testing that it not only should be 
a config option but a boot time option as well? Otherwise people will be 
comparing an old scheduler with an RSDL kernel, and they will diverge as 
time goes on.

More people would be willing to reboot and test on a similar load than 
will keep two versions of the kernel around. And if you get people 
testing RSDL against a vendor kernel which might be hacked, it will be 
even less meaningful.

Please consider the benefits of making RSDL the default scheduler, and 
leaving people with the old scheduler with an otherwise identical kernel 
as a fair and meaningful comparison.

There, that's a technical argument ;-)

> Because let's face it - nothing is ever perfect. Even a really nice 
> conceptual idea always ends up hitting the "but in real life, things are 
> ugly and complex, and we've depended on behaviour X in the past and can't 
> change it, so we need some tweaking for problem Y".
> 
> And everything is totally fixable - at least as long as people are willing 
> to!
> 
> 		Linus


-- 
Bill Davidsen <davidsen@tmr.com>
   "We have more to fear from the bungling of the incompetent than from
the machinations of the wicked."  - from Slashdot


^ permalink raw reply	[flat|nested] 198+ messages in thread

* RE: RSDL v0.31
  2007-03-28 23:37                                                                                 ` Bill Davidsen
@ 2007-03-29  7:10                                                                                   ` David Schwartz
  2007-03-29  7:34                                                                                     ` Nick Piggin
  0 siblings, 1 reply; 198+ messages in thread
From: David Schwartz @ 2007-03-29  7:10 UTC (permalink / raw)
  To: Linux-Kernel@Vger. Kernel. Org

Bill Davidsen wrote:

> I agree for giving a process more than a fair share, but I don't think
> "latency" is the best term for what you describe later. If you think of
> latency as the time between a process unblocking and the time when it
> gets CPU, that is a more traditional interpretation. I'm not really sure
> latency and CPU-starved are compatible.

For CPU-starvation, I think 'nice' is always going to be the fix. If you
want a process to get more than its 'fair share' of the CPU, you have to ask
for that. I think the scheduler should be fair by default.

However, cleverness in the scheduler with latency can make things better
without being unfair to anyone. It's perfectly fair for a task that has been
blocked for awhile to pre-empt a CPU-limited task when it unblocks.

What I'm arguing is that if your task is CPU-limited and the scheduler is
fair, that's your fault -- nice it. If your task is suffering from poor
latency, and it's using less than its fair share of the CPU (because it is
not CPU-limited), that is something the scheduler can be smarter about.

Two things that I think can help improve interactivity without breaking
fairness are:

1) Keep a longer-term history of tasks that have yielded the CPU so that
they can be more likely to pre-empt when they are unblocked by I/O. (The
improved accounting accuracy may go a long way towards doing this. I
personally like exponential decay measurements of CPU usage.)

2) Be smart about things like pipes. When one process unblocks another
through a pipe, socket, or the like, do not pre-empt (this defeats batching
and blows out caches needlessly), but do try to schedule the unblocked
process soon. Don't penalize one process for unblocking another, that's a
good thing for it to do.

I believe that the process of making schedulers smarter and fairer (and
fixing bugs in them) will get us to a place where interactivity is superb
without sacrificing fairness among tasks at equal static priority.

Honestly, I have always been against aggressive pre-emption. I think as CPUs
get faster and timeslices get shorter, it makes less and less sense. In many
cases you are better off just making the task ready-to-run and allowing its
higher dynamic priority to make it next. I strongly believe this for cases
where the running task unblocked the other task. (I think in too many cases,
you blow out the caches and force a context switch on a task that was just a
few hundred instructions short of being finished with what it was doing as
you punish it for getting useful work done.)

DS

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: RSDL v0.31
  2007-03-29  7:10                                                                                   ` David Schwartz
@ 2007-03-29  7:34                                                                                     ` Nick Piggin
  0 siblings, 0 replies; 198+ messages in thread
From: Nick Piggin @ 2007-03-29  7:34 UTC (permalink / raw)
  To: davids; +Cc: Linux-Kernel@Vger. Kernel. Org

David Schwartz wrote:
> Bill Davidsen wrote:
> 
> 
>>I agree for giving a process more than a fair share, but I don't think
>>"latency" is the best term for what you describe later. If you think of
>>latency as the time between a process unblocking and the time when it
>>gets CPU, that is a more traditional interpretation. I'm not really sure
>>latency and CPU-starved are compatible.
> 
> 
> For CPU-starvation, I think 'nice' is always going to be the fix. If you
> want a process to get more than its 'fair share' of the CPU, you have to ask
> for that. I think the scheduler should be fair by default.
> 
> However, cleverness in the scheduler with latency can make things better
> without being unfair to anyone. It's perfectly fair for a task that has been
> blocked for awhile to pre-empt a CPU-limited task when it unblocks.
> 
> What I'm arguing is that if your task is CPU-limited and the scheduler is
> fair, that's your fault -- nice it. If your task is suffering from poor
> latency, and it's using less than its fair share of the CPU (because it is
> not CPU-limited), that is something the scheduler can be smarter about.

Agreed. That's what I've been saying for years (since early 2.6 when we had
all those scheduler troubles and I started nicksched).

> Honestly, I have always been against aggressive pre-emption. I think as CPUs
> get faster and timeslices get shorter, it makes less and less sense. In many

I think scheduler timeslices actually shouldn't really be getting shorter.
While I found it is quite easy to get good interactivity with a pretty
dumb scheduler and tiny timeslices (at least until load ramps up enough
that the "off-time" for your critical processes builds up too much), I
think we want to aim for large timeslices. CPU caches are still getting
bigger, and I don't think misses are getting cheaper (especially if you
consider multi core). Also, the energy cost of a memory access is much
higher even if hardware or software is able to hide the latency.

-- 
SUSE Labs, Novell Inc.

^ permalink raw reply	[flat|nested] 198+ messages in thread

* Re: [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1
@ 2007-03-13 19:54 Al Boldi
  0 siblings, 0 replies; 198+ messages in thread
From: Al Boldi @ 2007-03-13 19:54 UTC (permalink / raw)
  To: linux-kernel; +Cc: ck

Con Kolivas wrote:
> On Wednesday 14 March 2007 03:03, Con Kolivas wrote:
> > On Wednesday 14 March 2007 02:31, Con Kolivas wrote:
> > > On Monday 12 March 2007 22:26, Al Boldi wrote:
> > > > I think, it should be possible to spread this max expiration latency
> > > > across the rotation, should it not?
> > >
> > > Can you try the attached patch please Al and Mike? It "dithers" the
> > > priority bitmap which tends to fluctuate the latency a lot more but in
> > > a cyclical fashion. This tends to make the max latency bound to a
> > > smaller value and should make it possible to run -nice tasks without
> > > killing the latency of the non niced tasks. Eg you could possibly run
> > > X nice -10 at a guess like we used to in 2.4 days. It's not essential
> > > of course, but is a workaround for Mike's testcase.
> >
> > Oops, one tiny fix. This is a respin of the patch, sorry.
>
> A few other minor things would need to be updated before this patch is in
> a good enough shape to join the rsdl patches. This one will be good for
> testing though.

Applied against v0.30 mainline.  It only works on prio +16 to +19.


Thanks!

--
Al


^ permalink raw reply	[flat|nested] 198+ messages in thread

end of thread, other threads:[~2007-03-29  7:34 UTC | newest]

Thread overview: 198+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2007-03-04 20:35 [ANNOUNCE] RSDL completely fair starvation free interactive cpu scheduler Al Boldi
2007-03-04 21:49 ` Con Kolivas
     [not found]   ` <45EB45F7.3050208@simon.arlott.org.uk>
2007-03-04 22:27     ` Con Kolivas
2007-03-05 18:29       ` Simon Arlott
2007-03-05 21:36         ` Con Kolivas
2007-03-04 23:13   ` Willy Tarreau
2007-03-04 23:58     ` Con Kolivas
2007-03-05  0:10     ` [ck] " jos poortvliet
2007-03-05  1:09     ` Gene Heskett
     [not found] ` <200703050834.45712.a1426z@gawab.com>
     [not found]   ` <20070305060732.GQ30401@nysv.org>
2007-03-05 11:59     ` [ck] " Al Boldi
2007-03-05 12:29       ` Con Kolivas
     [not found]         ` <200703052123.01095.a1426z@gawab.com>
2007-03-05 22:10           ` Con Kolivas
2007-03-06  8:42             ` Xavier Bestel
2007-03-06 15:15               ` Al Boldi
2007-03-11 18:11                 ` Al Boldi
2007-03-11 21:52                   ` Con Kolivas
2007-03-11 22:12                     ` Con Kolivas
2007-03-12  4:42                       ` Al Boldi
2007-03-12  4:53                         ` Con Kolivas
2007-03-12 11:26                           ` Al Boldi
2007-03-12 12:52                             ` Con Kolivas
2007-03-12 14:14                               ` Al Boldi
2007-03-12 14:58                                 ` [ck] " jos poortvliet
2007-03-12 16:37                                   ` michael chang
2007-03-12 17:41                                   ` Al Boldi
2007-03-12 18:05                                 ` Con Kolivas
2007-03-12 18:47                                   ` [ck] " jos poortvliet
2007-03-12 18:58                                     ` Antonio Vargas
2007-03-19 10:47                                       ` Helge Hafting
2007-03-18  1:30                               ` Bill Davidsen
2007-03-18 10:50                               ` [ck] " jos poortvliet
2007-03-13 15:31                             ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Con Kolivas
2007-03-13 16:03                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Con Kolivas
2007-03-13 16:08                                 ` Con Kolivas
2007-03-13 20:58                                 ` Con Kolivas
2007-03-13 23:08                                   ` RSDL development plans Con Kolivas
2007-03-16 12:25                                     ` Con Kolivas
2007-03-16 13:40                                       ` RSDL v0.31 Con Kolivas
2007-03-16 15:34                                         ` Mike Galbraith
2007-03-16 21:13                                           ` Con Kolivas
2007-03-16 22:30                                             ` Mike Galbraith
2007-03-16 23:05                                               ` [ck] " Dirk Schoebel
2007-03-17  4:24                                               ` Nicholas Miell
2007-03-17  5:56                                                 ` Mike Galbraith
2007-03-17  6:08                                                   ` Mike Galbraith
2007-03-17 13:56                                                     ` Ed Tomlinson
2007-03-18 19:37                                                     ` Lee Revell
2007-03-18 19:55                                                       ` Mike Galbraith
2007-03-18 22:45                                                       ` Szonyi Calin
2007-03-19  2:27                                                     ` David Schwartz
2007-03-19  6:21                                                       ` Mike Galbraith
2007-03-19  6:59                                                         ` Willy Tarreau
2007-03-17  6:26                                                   ` Nicholas Miell
2007-03-17  7:11                                                     ` Mike Galbraith
2007-03-17  7:25                                                       ` William Lee Irwin III
2007-03-17  7:29                                                         ` Nicholas Miell
2007-03-17 11:48                                                       ` Gene Heskett
2007-03-17  7:45                                                     ` Ingo Molnar
2007-03-17  7:44                                                       ` David Lang
2007-03-17  8:46                                                         ` Mike Galbraith
2007-03-17 14:09                                                           ` [ck] " Mark Glines
2007-03-17 14:33                                                             ` Mike Galbraith
2007-03-17 14:54                                                               ` Mark Glines
2007-03-17 14:58                                                                 ` Mike Galbraith
2007-03-17  8:23                                                       ` Nicholas Miell
2007-03-17  9:42                                                         ` [patch] CFS scheduler: Completely Fair Scheduler / CONFIG_SCHED_FAIR Ingo Molnar
2007-03-17  8:41                                                       ` RSDL v0.31 Serge Belyshev
2007-03-17  9:48                                                         ` Con Kolivas
2007-03-17  9:58                                                           ` Mike Galbraith
2007-03-17 10:49                                                             ` Mike Galbraith
2007-03-17 12:05                                                               ` Gene Heskett
2007-03-17 13:36                                                                 ` Mike Galbraith
2007-03-17 17:03                                                                   ` Gene Heskett
2007-03-17 17:37                                                                     ` Mike Galbraith
2007-03-17 18:23                                                                       ` [ck] " Kacper Wysocki
2007-03-17 18:45                                                                         ` Mike Galbraith
2007-03-17 13:58                                                             ` michael chang
2007-03-17 20:55                                                             ` Al Boldi
2007-03-18  6:17                                                               ` Mike Galbraith
2007-03-18  6:47                                                                 ` Kasper Sandberg
2007-03-18  7:08                                                                   ` Mike Galbraith
2007-03-18  7:22                                                                     ` [ck] " Radoslaw Szkodzinski
2007-03-18  7:38                                                                       ` Mike Galbraith
2007-03-18  8:04                                                                         ` Mike Galbraith
2007-03-18  8:20                                                                         ` jimmy bahuleyan
2007-03-18  8:34                                                                           ` Mike Galbraith
2007-03-18  9:57                                                                         ` Kasper Sandberg
2007-03-18 13:57                                                                           ` Avuton Olrich
2007-03-19 20:47                                                                           ` Bill Davidsen
2007-03-20 10:19                                                                             ` jos poortvliet
2007-03-21  8:58                                                                             ` Kasper Sandberg
2007-03-18 15:44                                                                         ` Radoslaw Szkodzinski
2007-03-18 16:09                                                                           ` jos poortvliet
2007-03-19 16:07                                                               ` Mark Lord
2007-03-19 16:26                                                                 ` Xavier Bestel
2007-03-19 16:36                                                                   ` Mark Lord
2007-03-19 16:43                                                                     ` Xavier Bestel
2007-03-20  3:11                                                                       ` Linus Torvalds
2007-03-20  6:11                                                                         ` Willy Tarreau
2007-03-20  8:03                                                                           ` Mike Galbraith
2007-03-21 14:57                                                                             ` Mike Galbraith
2007-03-21 16:02                                                                               ` Peter Zijlstra
2007-03-21 17:06                                                                                 ` Mike Galbraith
2007-03-22  7:07                                                                                 ` Mike Galbraith
2007-03-22  9:18                                                                                   ` Ingo Molnar
2007-03-22  9:34                                                                                     ` Mike Galbraith
2007-03-22  9:41                                                                                       ` Mike Galbraith
2007-03-22 22:03                                                                                     ` Con Kolivas
2007-03-22 22:50                                                                                   ` Con Kolivas
2007-03-23  4:39                                                                                     ` Mike Galbraith
2007-03-23  5:59                                                                                       ` Con Kolivas
2007-03-23  6:11                                                                                         ` Mike Galbraith
2007-03-23 12:17                                                                                         ` Mike Galbraith
     [not found]                                                                               ` <20070321161147.54c7a727@localhost>
2007-03-21 17:07                                                                                 ` Mike Galbraith
2007-03-22  4:49                                                                                   ` Willy Tarreau
2007-03-22  7:14                                                                                     ` Mike Galbraith
2007-03-20  9:03                                                                           ` Xavier Bestel
2007-03-20 12:31                                                                             ` Artur Skawina
2007-03-20 19:16                                                                               ` Artur Skawina
2007-03-21  7:50                                                                             ` Ingo Molnar
2007-03-21 10:43                                                                               ` David Schwartz
2007-03-28 23:37                                                                                 ` Bill Davidsen
2007-03-29  7:10                                                                                   ` David Schwartz
2007-03-29  7:34                                                                                     ` Nick Piggin
2007-03-20 15:31                                                                           ` Linus Torvalds
2007-03-20 18:08                                                                             ` Al Boldi
2007-03-21  8:22                                                                             ` Keith Duthie
2007-03-28 23:43                                                                             ` Bill Davidsen
2007-03-20 10:26                                                                         ` [ck] " jos poortvliet
2007-03-20 13:22                                                                         ` Mark Lord
2007-03-20 15:16                                                                           ` Ray Lee
2007-03-20 15:20                                                                             ` Mark Lord
2007-03-21  8:55                                                                             ` Kasper Sandberg
2007-03-19 20:53                                                                 ` Al Boldi
2007-03-20 19:50                                                                   ` Artur Skawina
2007-03-21  4:15                                                                     ` Al Boldi
2007-03-21 17:24                                                                       ` Artur Skawina
2007-03-19 16:03                                                             ` Mark Lord
2007-03-17 11:49                                                           ` is RSDL an "unfair" scheduler too? Ingo Molnar
2007-03-17 12:02                                                             ` Con Kolivas
2007-03-17 12:23                                                               ` [ck] " jos poortvliet
2007-03-17 17:31                                                                 ` David Schwartz
2007-03-17 12:28                                                               ` Ingo Molnar
2007-03-17 12:43                                                                 ` Con Kolivas
2007-03-17 16:34                                                                   ` Ingo Molnar
2007-03-18  3:23                                                                     ` Bill Davidsen
2007-03-18  2:13                                                                   ` Bill Davidsen
2007-03-18  3:20                                                                     ` Kasper Sandberg
2007-03-18  5:37                                                                     ` Mike Galbraith
2007-03-18 10:58                                                                       ` [ck] " jos poortvliet
2007-03-17 12:15                                                             ` jos poortvliet
2007-03-17 20:41                                                             ` Avi Kivity
2007-03-18  1:25                                                               ` William Lee Irwin III
2007-03-18  1:32                                                                 ` Linus Torvalds
2007-03-18  5:24                                                                   ` Willy Tarreau
2007-03-18  5:55                                                                     ` Avi Kivity
2007-03-19  2:27                                                                       ` David Schwartz
2007-03-19 13:27                                                                         ` Radoslaw Szkodzinski
2007-03-19 18:30                                                                           ` David Lang
2007-03-19 15:25                                                                         ` Avi Kivity
2007-03-19 16:06                                                                           ` Helge Hafting
2007-03-19 16:37                                                                             ` Avi Kivity
2007-03-18  6:09                                                                     ` Bill Huey
2007-03-18  6:37                                                                       ` Mike Galbraith
2007-03-18  7:35                                                                         ` Bill Huey
2007-03-19 21:14                                                                       ` Bill Davidsen
2007-03-18  6:26                                                                     ` Mike Galbraith
2007-03-18  6:54                                                                       ` [ck] " Radoslaw Szkodzinski
2007-03-18  7:58                                                                         ` Willy Tarreau
2007-03-18  8:45                                                                           ` Avi Kivity
2007-03-18  5:00                                                                 ` Avi Kivity
2007-03-17 15:13                                                           ` RSDL v0.31 Mark Hahn
2007-03-17 17:22                                                             ` Stephen Clark
2007-03-19 15:06                                                             ` Chris Friesen
2007-03-17  7:56                                                     ` Ingo Molnar
2007-03-17 11:07                                                       ` [ck] " jos poortvliet
2007-03-17 12:44                                                         ` Ingo Molnar
2007-03-17 13:44                                                           ` jos poortvliet
2007-03-17 14:04                                                         ` [ck] " Ed Tomlinson
2007-03-17 14:32                                               ` Rik van Riel
2007-03-17 14:43                                                 ` Mike Galbraith
2007-03-17 15:39                                                 ` Ingo Molnar
2007-03-16 17:12                                         ` AshMilsted
2007-03-16 17:41                                           ` Gabriel C
2007-03-16 21:55                                         ` Al Boldi
2007-03-17  2:51                                           ` Con Kolivas
2007-03-17  4:40                                             ` Al Boldi
2007-03-17  4:57                                               ` Con Kolivas
2007-03-17  5:15                                                 ` Gene Heskett
2007-03-17 13:50                                                 ` Ed Tomlinson
2007-03-17 16:12                                                 ` Al Boldi
2007-03-16 13:42                                       ` RSDL development plans Mike Galbraith
2007-03-16 13:59                                         ` Con Kolivas
2007-03-16 14:07                                           ` Mike Galbraith
2007-03-14  9:13                               ` [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice Mike Galbraith
2007-03-14  9:25                                 ` Con Kolivas
2007-03-14  9:42                                   ` Mike Galbraith
2007-03-13 19:54 [PATCH] [RSDL-0.30] sched: rsdl improve latencies with differential nice -1 Al Boldi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).