All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl
@ 2014-08-06  9:08 xiaofeng.yan
  2014-08-06 10:39 ` Juri Lelli
  2014-08-12 14:52 ` Ingo Molnar
  0 siblings, 2 replies; 4+ messages in thread
From: xiaofeng.yan @ 2014-08-06  9:08 UTC (permalink / raw)
  To: mingo, peterz, juri.lelli, linux-kernel; +Cc: xiaofeng.yan2012, xiaofeng.yan

It could be wrong for the precision of runtime and deadline
when the precision is within microsecond level. For example:
Task runtime deadline period
 P1   200us   500us   500us

This case need enbale HRTICK feature by the next command
PC#echo "HRTICK" > /sys/kernel/debug/sched_features
PC#trace-cmd record -e sched_switch &
PC#./schedtool -E -t 200000:500000 -e ./test

Some of runtime and deadline run with millisecond level by
reading kernershark. Some pieces of trace.dat are as follows:
(remove some irrelevant information)
<idle>-0   157.603157: sched_switch: :R ==> 2481:4294967295: test
test-2481  157.603203: sched_switch:  2481:R ==> 0:120: swapper/2
<idle>-0   157.605657: sched_switch:  :R ==> 2481:4294967295: test
test-2481  157.608183: sched_switch:  2481:R ==> 2483:120: trace-cmd
trace-cmd-2483 157.609656: sched_switch:2483:R==>2481:4294967295: test

We can get the runtime from the information at some point.
runtime = 157.605657 - 157.608183
runtime = 0.002526(2.526ms)
The correct runtime should be less than or equal to 200us at some point.

The problem is caused by a conditional judgment "delta > 10000".
Because no hrtimer start up to control the runtime when runtime is less than 10us.
So the process will continue to run until tick-period coming.

Move the code with the limit of the least time slice
from hrtick_start_fair() to hrtick_start() because
EDF schedule class also need this function in start_hrtick_dl().

To fix this problem, we call hrtimer_start() unconditionally in start_hrtick_dl(),
and make sure schedule slice won't be smaller than 10us in hrtimer_start().

Signed-off-by: Xiaofeng Yan <xiaofeng.yan@huawei.com>
Reviewed-by:   Peter Zijlstra <peterz@infradead.org>
Reviewed-by:   Li Zefan <lizefan@huawei.com>
---
 kernel/sched/core.c     |    8 +++++++-
 kernel/sched/deadline.c |    5 +----
 kernel/sched/fair.c     |    8 --------
 3 files changed, 8 insertions(+), 13 deletions(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 1211575..53514ba 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -449,8 +449,14 @@ static void __hrtick_start(void *arg)
 void hrtick_start(struct rq *rq, u64 delay)
 {
 	struct hrtimer *timer = &rq->hrtick_timer;
-	ktime_t time = ktime_add_ns(timer->base->get_time(), delay);
+	ktime_t time;
 
+	/*
+	 * Don't schedule slices shorter than 10000ns, that just
+	 * doesn't make sense and can cause timer DoS.
+	 */
+	s64 delta = max_t(s64, delay, 10000LL);
+	time = ktime_add_ns(timer->base->get_time(), delta);
 	hrtimer_set_expires(timer, time);
 
 	if (rq == this_rq()) {
diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
index 255ce13..ce52d07 100644
--- a/kernel/sched/deadline.c
+++ b/kernel/sched/deadline.c
@@ -997,10 +997,7 @@ static void check_preempt_curr_dl(struct rq *rq, struct task_struct *p,
 #ifdef CONFIG_SCHED_HRTICK
 static void start_hrtick_dl(struct rq *rq, struct task_struct *p)
 {
-	s64 delta = p->dl.dl_runtime - p->dl.runtime;
-
-	if (delta > 10000)
-		hrtick_start(rq, p->dl.runtime);
+	hrtick_start(rq, p->dl.runtime);
 }
 #endif
 
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index bfa3c86..0d6b3e6 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -3892,14 +3892,6 @@ static void hrtick_start_fair(struct rq *rq, struct task_struct *p)
 				resched_curr(rq);
 			return;
 		}
-
-		/*
-		 * Don't schedule slices shorter than 10000ns, that just
-		 * doesn't make sense. Rely on vruntime for fairness.
-		 */
-		if (rq->curr != p)
-			delta = max_t(s64, 10000LL, delta);
-
 		hrtick_start(rq, delta);
 	}
 }
-- 
1.7.9.5


^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl
  2014-08-06  9:08 [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl xiaofeng.yan
@ 2014-08-06 10:39 ` Juri Lelli
  2014-08-12 14:52 ` Ingo Molnar
  1 sibling, 0 replies; 4+ messages in thread
From: Juri Lelli @ 2014-08-06 10:39 UTC (permalink / raw)
  To: xiaofeng.yan, mingo, peterz, juri.lelli, linux-kernel; +Cc: xiaofeng.yan2012

Hi all,

On 06/08/14 10:08, xiaofeng.yan wrote:
> It could be wrong for the precision of runtime and deadline
> when the precision is within microsecond level. For example:
> Task runtime deadline period
>  P1   200us   500us   500us
> 
> This case need enbale HRTICK feature by the next command
> PC#echo "HRTICK" > /sys/kernel/debug/sched_features
> PC#trace-cmd record -e sched_switch &
> PC#./schedtool -E -t 200000:500000 -e ./test
> 
> Some of runtime and deadline run with millisecond level by
> reading kernershark. Some pieces of trace.dat are as follows:
> (remove some irrelevant information)
> <idle>-0   157.603157: sched_switch: :R ==> 2481:4294967295: test
> test-2481  157.603203: sched_switch:  2481:R ==> 0:120: swapper/2
> <idle>-0   157.605657: sched_switch:  :R ==> 2481:4294967295: test
> test-2481  157.608183: sched_switch:  2481:R ==> 2483:120: trace-cmd
> trace-cmd-2483 157.609656: sched_switch:2483:R==>2481:4294967295: test
> 
> We can get the runtime from the information at some point.
> runtime = 157.605657 - 157.608183
> runtime = 0.002526(2.526ms)
> The correct runtime should be less than or equal to 200us at some point.
> 
> The problem is caused by a conditional judgment "delta > 10000".
> Because no hrtimer start up to control the runtime when runtime is less than 10us.
> So the process will continue to run until tick-period coming.
> 
> Move the code with the limit of the least time slice
> from hrtick_start_fair() to hrtick_start() because
> EDF schedule class also need this function in start_hrtick_dl().
> 
> To fix this problem, we call hrtimer_start() unconditionally in start_hrtick_dl(),
> and make sure schedule slice won't be smaller than 10us in hrtimer_start().
> 
> Signed-off-by: Xiaofeng Yan <xiaofeng.yan@huawei.com>
> Reviewed-by:   Peter Zijlstra <peterz@infradead.org>
> Reviewed-by:   Li Zefan <lizefan@huawei.com>

For what concerns sched/deadline bits,

Acked-by: Juri Lelli <juri.lelli@arm.com>

Thanks!

- Juri

> ---
>  kernel/sched/core.c     |    8 +++++++-
>  kernel/sched/deadline.c |    5 +----
>  kernel/sched/fair.c     |    8 --------
>  3 files changed, 8 insertions(+), 13 deletions(-)
> 
> diff --git a/kernel/sched/core.c b/kernel/sched/core.c
> index 1211575..53514ba 100644
> --- a/kernel/sched/core.c
> +++ b/kernel/sched/core.c
> @@ -449,8 +449,14 @@ static void __hrtick_start(void *arg)
>  void hrtick_start(struct rq *rq, u64 delay)
>  {
>  	struct hrtimer *timer = &rq->hrtick_timer;
> -	ktime_t time = ktime_add_ns(timer->base->get_time(), delay);
> +	ktime_t time;
>  
> +	/*
> +	 * Don't schedule slices shorter than 10000ns, that just
> +	 * doesn't make sense and can cause timer DoS.
> +	 */
> +	s64 delta = max_t(s64, delay, 10000LL);
> +	time = ktime_add_ns(timer->base->get_time(), delta);
>  	hrtimer_set_expires(timer, time);
>  
>  	if (rq == this_rq()) {
> diff --git a/kernel/sched/deadline.c b/kernel/sched/deadline.c
> index 255ce13..ce52d07 100644
> --- a/kernel/sched/deadline.c
> +++ b/kernel/sched/deadline.c
> @@ -997,10 +997,7 @@ static void check_preempt_curr_dl(struct rq *rq, struct task_struct *p,
>  #ifdef CONFIG_SCHED_HRTICK
>  static void start_hrtick_dl(struct rq *rq, struct task_struct *p)
>  {
> -	s64 delta = p->dl.dl_runtime - p->dl.runtime;
> -
> -	if (delta > 10000)
> -		hrtick_start(rq, p->dl.runtime);
> +	hrtick_start(rq, p->dl.runtime);
>  }
>  #endif
>  
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index bfa3c86..0d6b3e6 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -3892,14 +3892,6 @@ static void hrtick_start_fair(struct rq *rq, struct task_struct *p)
>  				resched_curr(rq);
>  			return;
>  		}
> -
> -		/*
> -		 * Don't schedule slices shorter than 10000ns, that just
> -		 * doesn't make sense. Rely on vruntime for fairness.
> -		 */
> -		if (rq->curr != p)
> -			delta = max_t(s64, 10000LL, delta);
> -
>  		hrtick_start(rq, delta);
>  	}
>  }
> 


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl
  2014-08-06  9:08 [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl xiaofeng.yan
  2014-08-06 10:39 ` Juri Lelli
@ 2014-08-12 14:52 ` Ingo Molnar
  2014-08-15  8:38   ` xiaofeng.yan
  1 sibling, 1 reply; 4+ messages in thread
From: Ingo Molnar @ 2014-08-12 14:52 UTC (permalink / raw)
  To: xiaofeng.yan; +Cc: mingo, peterz, juri.lelli, linux-kernel, xiaofeng.yan2012


* xiaofeng.yan <xiaofeng.yan@huawei.com> wrote:

> It could be wrong for the precision of runtime and deadline
> when the precision is within microsecond level. For example:
> Task runtime deadline period
>  P1   200us   500us   500us
> 
> This case need enbale HRTICK feature by the next command
>
> PC#echo "HRTICK" > /sys/kernel/debug/sched_features
> PC#trace-cmd record -e sched_switch &
> PC#./schedtool -E -t 200000:500000 -e ./test
> 
> Some of runtime and deadline run with millisecond level by
> reading kernershark. Some pieces of trace.dat are as follows:
> (remove some irrelevant information)
> <idle>-0   157.603157: sched_switch: :R ==> 2481:4294967295: test
> test-2481  157.603203: sched_switch:  2481:R ==> 0:120: swapper/2
> <idle>-0   157.605657: sched_switch:  :R ==> 2481:4294967295: test
> test-2481  157.608183: sched_switch:  2481:R ==> 2483:120: trace-cmd
> trace-cmd-2483 157.609656: sched_switch:2483:R==>2481:4294967295: test
> 
> We can get the runtime from the information at some point.
> runtime = 157.605657 - 157.608183
> runtime = 0.002526(2.526ms)
> The correct runtime should be less than or equal to 200us at some point.
> 
> The problem is caused by a conditional judgment "delta > 10000".
> Because no hrtimer start up to control the runtime when runtime is less than 10us.
> So the process will continue to run until tick-period coming.
> 
> Move the code with the limit of the least time slice
> from hrtick_start_fair() to hrtick_start() because
> EDF schedule class also need this function in start_hrtick_dl().
> 
> To fix this problem, we call hrtimer_start() unconditionally in start_hrtick_dl(),
> and make sure schedule slice won't be smaller than 10us in hrtimer_start().
> 
> Signed-off-by: Xiaofeng Yan <xiaofeng.yan@huawei.com>
> Reviewed-by:   Peter Zijlstra <peterz@infradead.org>
> Reviewed-by:   Li Zefan <lizefan@huawei.com>

The whole changelog is very hard to read and isn't proper English, nor 
is it truly explanatory. Could you please fix the changelog, or bounce 
it to someone who will fix it for you?

Thanks,

	Ingo

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl
  2014-08-12 14:52 ` Ingo Molnar
@ 2014-08-15  8:38   ` xiaofeng.yan
  0 siblings, 0 replies; 4+ messages in thread
From: xiaofeng.yan @ 2014-08-15  8:38 UTC (permalink / raw)
  To: Ingo Molnar; +Cc: mingo, peterz, juri.lelli, linux-kernel, xiaofeng.yan2012

On 2014/8/12 22:52, Ingo Molnar wrote:
> * xiaofeng.yan <xiaofeng.yan@huawei.com> wrote:
>
>> It could be wrong for the precision of runtime and deadline
>> when the precision is within microsecond level. For example:
>> Task runtime deadline period
>>   P1   200us   500us   500us
>>
>> This case need enbale HRTICK feature by the next command
>>
>> PC#echo "HRTICK" > /sys/kernel/debug/sched_features
>> PC#trace-cmd record -e sched_switch &
>> PC#./schedtool -E -t 200000:500000 -e ./test
>>
>> Some of runtime and deadline run with millisecond level by
>> reading kernershark. Some pieces of trace.dat are as follows:
>> (remove some irrelevant information)
>> <idle>-0   157.603157: sched_switch: :R ==> 2481:4294967295: test
>> test-2481  157.603203: sched_switch:  2481:R ==> 0:120: swapper/2
>> <idle>-0   157.605657: sched_switch:  :R ==> 2481:4294967295: test
>> test-2481  157.608183: sched_switch:  2481:R ==> 2483:120: trace-cmd
>> trace-cmd-2483 157.609656: sched_switch:2483:R==>2481:4294967295: test
>>
>> We can get the runtime from the information at some point.
>> runtime = 157.605657 - 157.608183
>> runtime = 0.002526(2.526ms)
>> The correct runtime should be less than or equal to 200us at some point.
>>
>> The problem is caused by a conditional judgment "delta > 10000".
>> Because no hrtimer start up to control the runtime when runtime is less than 10us.
>> So the process will continue to run until tick-period coming.
>>
>> Move the code with the limit of the least time slice
>> from hrtick_start_fair() to hrtick_start() because
>> EDF schedule class also need this function in start_hrtick_dl().
>>
>> To fix this problem, we call hrtimer_start() unconditionally in start_hrtick_dl(),
>> and make sure schedule slice won't be smaller than 10us in hrtimer_start().
>>
>> Signed-off-by: Xiaofeng Yan <xiaofeng.yan@huawei.com>
>> Reviewed-by:   Peter Zijlstra <peterz@infradead.org>
>> Reviewed-by:   Li Zefan <lizefan@huawei.com>
> The whole changelog is very hard to read and isn't proper English, nor
> is it truly explanatory. Could you please fix the changelog, or bounce
> it to someone who will fix it for you?
>
> Thanks,
>
> 	Ingo

Thanks for your reply. I will fix my change log with proper English.
Thanks
Yan
> .
>



^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2014-08-15  8:38 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2014-08-06  9:08 [PATCH v3] sched/deadline: overrun could happen in start_hrtick_dl xiaofeng.yan
2014-08-06 10:39 ` Juri Lelli
2014-08-12 14:52 ` Ingo Molnar
2014-08-15  8:38   ` xiaofeng.yan

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.