linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] Remove spurious cputimer restart and eliminate its drift
@ 2013-04-17 17:41 Olivier Langlois
  2013-04-19 13:27 ` Frederic Weisbecker
  0 siblings, 1 reply; 3+ messages in thread
From: Olivier Langlois @ 2013-04-17 17:41 UTC (permalink / raw)
  To: tglx, peterz, mingo; +Cc: linux-kernel

Move the call to stop_process_timers() in order to:

1. It catches the exceptionnal case where it would be
   started without arming any timers in posix_cpu_timer_set()
 2. You could end up with no process timers for a short
    period of time when they find themselves in the firing
    list. Let 1 tick to periodic timers to have the
    opportunity to get rearmed.

Also forbids the cputimer to drift ahead of its process clock by
blocking its update when a tick occurs while a autoreaping task
is currently in do_exit() between the call to release_task() and
its final call to schedule().

Any task stats update after having called release_task() will
be lost because they are added to the global process stats located
in the signal struct from release_task().

Ideally, you should postpone the release_task() call after the
final context switch to get all the stats added but this is
more complex to achieve.

Signed-off-by: Olivier Langlois <olivier@trillion01.com>
---
 kernel/posix-cpu-timers.c | 44 +++++++++++++++++++++++++-------------------
 kernel/sched/fair.c       | 10 +++++++++-
 2 files changed, 34 insertions(+), 20 deletions(-)

diff --git a/kernel/posix-cpu-timers.c b/kernel/posix-cpu-timers.c
index 8fd709c..7cecdb9 100644
--- a/kernel/posix-cpu-timers.c
+++ b/kernel/posix-cpu-timers.c
@@ -1130,8 +1130,6 @@ static void check_process_timers(struct task_struct *tsk,
 	sig->cputime_expires.prof_exp = prof_expires;
 	sig->cputime_expires.virt_exp = virt_expires;
 	sig->cputime_expires.sched_exp = sched_expires;
-	if (task_cputime_zero(&sig->cputime_expires))
-		stop_process_timers(sig);
 }
 
 /*
@@ -1238,34 +1236,42 @@ static inline int task_cputime_expired(const struct task_cputime *sample,
  */
 static inline int fastpath_timer_check(struct task_struct *tsk)
 {
-	struct signal_struct *sig;
-	cputime_t utime, stime;
+	struct signal_struct *sig = tsk->signal;
 
-	task_cputime(tsk, &utime, &stime);
+	if (sig->cputimer.running) {
+		if (likely(!task_cputime_zero(&sig->cputime_expires))) {
+			struct task_cputime group_sample;
+
+			raw_spin_lock(&sig->cputimer.lock);
+			group_sample = sig->cputimer.cputime;
+			raw_spin_unlock(&sig->cputimer.lock);
+
+			if (task_cputime_expired(&group_sample, &sig->cputime_expires))
+				return 1;
+		} else
+		/*
+		 * Stopping the process timer here has 2 benefits.
+		 *
+		 * 1. It catches the exceptionnal case where it would be
+		 *   started without arming any timers in posix_cpu_timer_set()
+		 * 2. You could end up with no process timers for a short
+		 *    period of time when they find themselves in the firing
+		 *    list. Let 1 tick to periodic timers to have the
+		 *    opportunity to get rearmed.
+		 */
+			stop_process_timers(sig);
+	}
 
 	if (!task_cputime_zero(&tsk->cputime_expires)) {
 		struct task_cputime task_sample = {
-			.utime = utime,
-			.stime = stime,
 			.sum_exec_runtime = tsk->se.sum_exec_runtime
 		};
+		task_cputime(tsk, &task_sample.utime, &task_sample.stime);
 
 		if (task_cputime_expired(&task_sample, &tsk->cputime_expires))
 			return 1;
 	}
 
-	sig = tsk->signal;
-	if (sig->cputimer.running) {
-		struct task_cputime group_sample;
-
-		raw_spin_lock(&sig->cputimer.lock);
-		group_sample = sig->cputimer.cputime;
-		raw_spin_unlock(&sig->cputimer.lock);
-
-		if (task_cputime_expired(&group_sample, &sig->cputime_expires))
-			return 1;
-	}
-
 	return 0;
 }
 
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 7a33e59..52d7b10 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -708,7 +708,15 @@ static void update_curr(struct cfs_rq *cfs_rq)
 
 		trace_sched_stat_runtime(curtask, delta_exec, curr->vruntime);
 		cpuacct_charge(curtask, delta_exec);
-		account_group_exec_runtime(curtask, delta_exec);
+		/*
+		 * Do not update the cputimer if the task is already released by
+		 * release_task().
+		 *
+		 * it would preferable to defer the autoreap release_task
+		 * after the last context switch but harder to do.
+		 */
+		if (likely(curtask->sighand))
+			account_group_exec_runtime(curtask, delta_exec);
 	}
 
 	account_cfs_rq_runtime(cfs_rq, delta_exec);
-- 
1.8.2.1





^ permalink raw reply related	[flat|nested] 3+ messages in thread

* Re: [PATCH] Remove spurious cputimer restart and eliminate its drift
  2013-04-17 17:41 [PATCH] Remove spurious cputimer restart and eliminate its drift Olivier Langlois
@ 2013-04-19 13:27 ` Frederic Weisbecker
  2013-04-19 13:29   ` Frederic Weisbecker
  0 siblings, 1 reply; 3+ messages in thread
From: Frederic Weisbecker @ 2013-04-19 13:27 UTC (permalink / raw)
  To: Olivier Langlois; +Cc: tglx, peterz, mingo, linux-kernel, Andrew Morton

2013/4/17 Olivier Langlois <olivier@trillion01.com>:
> Move the call to stop_process_timers() in order to:
>
> 1. It catches the exceptionnal case where it would be
>    started without arming any timers in posix_cpu_timer_set()

Oh I see now. fastpath_timer_check() sees the sig->cputimer.running
but return 0 because the timer is 0. So sig->cputimer.running is never
cleared.

>  2. You could end up with no process timers for a short
>     period of time when they find themselves in the firing
>     list. Let 1 tick to periodic timers to have the
>     opportunity to get rearmed.

Not sure what you mean. Is it that window in check_process_timers()
when expired timers are moved to the firing list, until we call
stop_process_timers() ?

>
> Also forbids the cputimer to drift ahead of its process clock by
> blocking its update when a tick occurs while a autoreaping task
> is currently in do_exit() between the call to release_task() and
> its final call to schedule().

Ah, I suggest you move this to a seperate patch to ease the review.
Plus that deserve some discussion on its own because there should be
no more posix cpu timers at that time.

>
> Any task stats update after having called release_task() will
> be lost because they are added to the global process stats located
> in the signal struct from release_task().
>
> Ideally, you should postpone the release_task() call after the
> final context switch to get all the stats added but this is
> more complex to achieve.

Yep.

>
> Signed-off-by: Olivier Langlois <olivier@trillion01.com>
> ---
>  kernel/posix-cpu-timers.c | 44 +++++++++++++++++++++++++-------------------
>  kernel/sched/fair.c       | 10 +++++++++-
>  2 files changed, 34 insertions(+), 20 deletions(-)
>
> diff --git a/kernel/posix-cpu-timers.c b/kernel/posix-cpu-timers.c
> index 8fd709c..7cecdb9 100644
> --- a/kernel/posix-cpu-timers.c
> +++ b/kernel/posix-cpu-timers.c
> @@ -1130,8 +1130,6 @@ static void check_process_timers(struct task_struct *tsk,
>         sig->cputime_expires.prof_exp = prof_expires;
>         sig->cputime_expires.virt_exp = virt_expires;
>         sig->cputime_expires.sched_exp = sched_expires;
> -       if (task_cputime_zero(&sig->cputime_expires))
> -               stop_process_timers(sig);
>  }
>
>  /*
> @@ -1238,34 +1236,42 @@ static inline int task_cputime_expired(const struct task_cputime *sample,
>   */
>  static inline int fastpath_timer_check(struct task_struct *tsk)
>  {
> -       struct signal_struct *sig;
> -       cputime_t utime, stime;
> +       struct signal_struct *sig = tsk->signal;
>
> -       task_cputime(tsk, &utime, &stime);
> +       if (sig->cputimer.running) {
> +               if (likely(!task_cputime_zero(&sig->cputime_expires))) {
> +                       struct task_cputime group_sample;
> +
> +                       raw_spin_lock(&sig->cputimer.lock);
> +                       group_sample = sig->cputimer.cputime;
> +                       raw_spin_unlock(&sig->cputimer.lock);
> +
> +                       if (task_cputime_expired(&group_sample, &sig->cputime_expires))
> +                               return 1;
> +               } else
> +               /*
> +                * Stopping the process timer here has 2 benefits.
> +                *
> +                * 1. It catches the exceptionnal case where it would be
> +                *   started without arming any timers in posix_cpu_timer_set()

I wonder if this shouldn't be fixed in posix_cpu_timer_set() instead.
I have the feeling we are working here around the bug of another
function.

> +                * 2. You could end up with no process timers for a short
> +                *    period of time when they find themselves in the firing
> +                *    list. Let 1 tick to periodic timers to have the
> +                *    opportunity to get rearmed.
> +                */

So that needs to be clarified.

> +                       stop_process_timers(sig);
> +       }
>
>         if (!task_cputime_zero(&tsk->cputime_expires)) {
>                 struct task_cputime task_sample = {
> -                       .utime = utime,
> -                       .stime = stime,
>                         .sum_exec_runtime = tsk->se.sum_exec_runtime
>                 };
> +               task_cputime(tsk, &task_sample.utime, &task_sample.stime);

That too needs to be in another patch. It's a bug that concerns full
dynticks cputime accounting :)

>
>                 if (task_cputime_expired(&task_sample, &tsk->cputime_expires))
>                         return 1;
>         }
>
> -       sig = tsk->signal;
> -       if (sig->cputimer.running) {
> -               struct task_cputime group_sample;
> -
> -               raw_spin_lock(&sig->cputimer.lock);
> -               group_sample = sig->cputimer.cputime;
> -               raw_spin_unlock(&sig->cputimer.lock);
> -
> -               if (task_cputime_expired(&group_sample, &sig->cputime_expires))
> -                       return 1;
> -       }
> -
>         return 0;
>  }
>
> diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
> index 7a33e59..52d7b10 100644
> --- a/kernel/sched/fair.c
> +++ b/kernel/sched/fair.c
> @@ -708,7 +708,15 @@ static void update_curr(struct cfs_rq *cfs_rq)
>
>                 trace_sched_stat_runtime(curtask, delta_exec, curr->vruntime);
>                 cpuacct_charge(curtask, delta_exec);
> -               account_group_exec_runtime(curtask, delta_exec);
> +               /*
> +                * Do not update the cputimer if the task is already released by
> +                * release_task().
> +                *
> +                * it would preferable to defer the autoreap release_task
> +                * after the last context switch but harder to do.
> +                */
> +               if (likely(curtask->sighand))
> +                       account_group_exec_runtime(curtask, delta_exec);

So that part wants another patch too :)

Thanks, interesting patches!

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [PATCH] Remove spurious cputimer restart and eliminate its drift
  2013-04-19 13:27 ` Frederic Weisbecker
@ 2013-04-19 13:29   ` Frederic Weisbecker
  0 siblings, 0 replies; 3+ messages in thread
From: Frederic Weisbecker @ 2013-04-19 13:29 UTC (permalink / raw)
  To: Olivier Langlois; +Cc: tglx, peterz, mingo, linux-kernel, Andrew Morton

2013/4/19 Frederic Weisbecker <fweisbec@gmail.com>:
>>         if (!task_cputime_zero(&tsk->cputime_expires)) {
>>                 struct task_cputime task_sample = {
>> -                       .utime = utime,
>> -                       .stime = stime,
>>                         .sum_exec_runtime = tsk->se.sum_exec_runtime
>>                 };
>> +               task_cputime(tsk, &task_sample.utime, &task_sample.stime);
>
> That too needs to be in another patch. It's a bug that concerns full
> dynticks cputime accounting :)

Oops, misread, it's just moving the function, I thought it was
missing. Nevermind.

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2013-04-19 13:29 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-04-17 17:41 [PATCH] Remove spurious cputimer restart and eliminate its drift Olivier Langlois
2013-04-19 13:27 ` Frederic Weisbecker
2013-04-19 13:29   ` Frederic Weisbecker

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).