linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: do_wait fix for 2.6.10-rc1
       [not found]         ` <41AA9F21.9060408@in.ibm.com>
@ 2004-11-29 11:53           ` Klaus Dittrich
  0 siblings, 0 replies; only message in thread
From: Klaus Dittrich @ 2004-11-29 11:53 UTC (permalink / raw)
  To: lkml; +Cc: Sripathi Kodi, Klaus Dittrich, Linus Torvalds

[-- Attachment #1: Type: text/plain, Size: 993 bytes --]

Sripathi Kodi wrote:

> Can you test this patch please?
>
>> -- 
>> Regards Klaus
>
>
> Hi Klaus,
>
> I have tested your patch against my testcase and it seems to be alright.
>
> My testcase is right here in my first post on LKML about this: 
> http://lkml.org/lkml/2004/11/5/34! The essence of the testcase is two 
> threads racing in waitpid to collect the exit status of a child. 
> Sometimes the thread that loses this race gets stuck in waitpid 
> forever unless manually interrupted. I had been able to recreate this 
> problem only on a multiprocessor machine. The essence of the fix was 
> to make the thread that loses this race not to set 'flag =1'. The 
> patch you sent me does not alter this behavior, so it works alright.
>
> Thanks and regards,
> Sripathi.

I encountered a bug in waitpid after Sripathi's patch
was applied.
See lkml
"kernel reports kill too late? Thu Nov 18 2004 - 07:36:48 EST"

Attached is a patch for do_wait() that may fix both problems.

--
Regards Klaus

[-- Attachment #2: 2patch-exit.c --]
[-- Type: text/plain, Size: 937 bytes --]

--- kernel/exit.ORIG.c	2004-11-24 17:20:32.000000000 +0100
+++ kernel/exit.c	2004-11-26 17:38:36.000000000 +0100
@@ -1316,6 +1316,9 @@ static long do_wait(pid_t pid, int optio
 	DECLARE_WAITQUEUE(wait, current);
 	struct task_struct *tsk;
 	int flag, retval;
+	struct task_struct *p;
+	struct list_head *_p;
+	int ret;
 
 	add_wait_queue(&current->wait_chldexit,&wait);
 repeat:
@@ -1324,9 +1327,6 @@ repeat:
 	read_lock(&tasklist_lock);
 	tsk = current;
 	do {
-		struct task_struct *p;
-		struct list_head *_p;
-		int ret;
 
 		list_for_each(_p,&tsk->children) {
 			p = list_entry(_p,struct task_struct,sibling);
@@ -1377,10 +1377,11 @@ repeat:
 						goto end;
 					break;
 				}
-				flag = 1;
 check_continued:
-				if (!unlikely(options & WCONTINUED))
+				if (!unlikely(options & WCONTINUED)) {
+					flag = 1;
 					continue;
+				}
 				retval = wait_task_continued(
 					p, (options & WNOWAIT),
 					infop, stat_addr, ru);

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2004-11-29 11:56 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20041120132403.GA2062@xeon2.local.here>
     [not found] ` <41A0738B.1070100@in.ibm.com>
     [not found]   ` <41A1AD33.8080108@xeon2.local.here>
     [not found]     ` <41A58B51.7060600@in.ibm.com>
     [not found]       ` <41A763A1.6090806@xeon2.local.here>
     [not found]         ` <41AA9F21.9060408@in.ibm.com>
2004-11-29 11:53           ` do_wait fix for 2.6.10-rc1 Klaus Dittrich

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).