All of lore.kernel.org
 help / color / mirror / Atom feed
* 2.5.27: Software Suspend failure / JFS errors
@ 2002-07-21 12:29 axel
  2002-07-21 14:42 ` axel
  0 siblings, 1 reply; 9+ messages in thread
From: axel @ 2002-07-21 12:29 UTC (permalink / raw)
  To: linux-kernel

Hi,

I invoked software suspend with kernel 2.5.27 and get the following messages
from kernel:

Stopping tasks: ========================
 stopping tasks failed (3 tasks remaining)
Suspend failed: Not all processes stopped!
Restarting tasks...<6> Strange, jfsIO not stopped
 Strange, jfsCommit not stopped
 Strange, jfsSync not stopped
 done

Afterwards, I have full cpu utilization of the JFS kernel threads:

CPU states:   1.0% user,  99.0% system,   0.0% nice,   0.0% idle
Mem:    126284K total,    97112K used,    29172K free,        0K buffers
Swap:   289160K total,        0K used,   289160K free,    53224K cached

  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
    7 root      25   0     0    0     0 RW   36.6  0.0   3:56 jfsIO
    8 root      25   0     0    0     0 RW   30.6  0.0   3:56 jfsCommit
    9 root      25   0     0    0     0 RW   29.6  0.0   3:56 jfsSync
  361 root      15   0   972  972   768 R     0.9  0.7   0:00 top
  235 axel      15   0  3064 3064  1928 R     0.0  2.4   0:00 sawfish
  244 axel      15   0  5004 5004  3724 R     0.0  3.9   0:00 panel
  248 axel      15   0  4508 4508  3324 R     0.0  3.5   0:03 gkrellm

And constant activity of VM:

   procs                      memory    swap          io     system
cpu
 r  b  w   swpd   free   buff  cache  si  so    bi    bo   in    cs  us  sy
id
 0  0  3      0  29284      0  53224   0   0    61     6 1051   106   2  89
10
 0  0  2      0  29284      0  53224   0   0     0     0 1006    94   0 100
0
 0  0  2      0  29284      0  53224   0   0     0     0 1006    75   1  99
0
 0  0  2      0  29284      0  53224   0   0     0     0 1006    79   1  99
0
 0  0  2      0  29284      0  53224   0   0     0     0 1006    88   1  99
0

I used to have problems with JFS anyway when unpacking big tar archives. The
the system gives an oops and locks up a short while after. The process it is
stuck in is JFSCommit.
I tried latest 2.4 and 2.5, always had the same problems. Strangely JFS
causes no problems at all when I uses the kernel my partitions were
formatted with. That is slackware kernel 2.4.18, jfs 1.0.18. Any kernel
later with jfs versions higher causes these JFSCommit freezes.

I will send an oops report of JFS later.

Regards,
Axel Siebenwirth

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-21 12:29 2.5.27: Software Suspend failure / JFS errors axel
@ 2002-07-21 14:42 ` axel
  2002-07-23 14:54   ` Dave Kleikamp
  0 siblings, 1 reply; 9+ messages in thread
From: axel @ 2002-07-21 14:42 UTC (permalink / raw)
  To: linux-kernel; +Cc: jfs-discussion, shaggy

Hi!

This oops occurred during build of gcc..
Kernel 2.4.19-rc2-ac2.
About the same happens with 2.5.27. I will post an oops of jfsCommit of
2.5.27 as soon as I get one.

ksymoops 2.4.5 on i686 2.4.19-rc2-ac2.  Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.19-rc2-ac2/ (default)
     -m /boot/System.map-2.4.19-rc2-ac2 (specified)

Unable to handle kernel NULL pointer dereference at virtual address 00000018
*pde = 00000000
Oops: 0002
CPU:    0
EIP:    0010:[<c018b565>]       Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
EFLAGS: 00010202
eax: c7e5c000 ebx: c8802490 ecx: 00000000 edx: 00000000
esi: c8802490 edi: c880cf58 ebp: c7dbd980 esp: c7e5df58
ds: 0018 es: 0018 ss: 0018
Process jfsCommit (pid: 8, stackpage=c7e5d000)
Stack: 000000b1 c0190800 00000000 00000000 00000000 00000286 00000000
00000040
       c7e5e000 c0118486 c7e5dfa8 c7e5c000 c8802490 c8802490 c7e5c000
00000001
       c0190fb3 c8802490 c7e5c000 00000246 c8802490 c01911db c8802490
c7e5c000
Call Trace: [<c0190800>] [<c0118486>] [<c0190fb3>] [<c01911db>] [<c0105000>]
            [<c010739e>] [<c0191080>]
Code: ff 41 18 85 d2 74 34 31 c0 0f ab 41 14 19 c0 85 c0 74 09 b8


>>EIP; c018b565 <hold_metapage+15/70>   <=====

>>eax; c7e5c000 <_end+7b3e314/85ff314>
>>ebx; c8802490 <_end+84e47a4/85ff314>
>>esi; c8802490 <_end+84e47a4/85ff314>
>>edi; c880cf58 <_end+84ef26c/85ff314>
>>ebp; c7dbd980 <_end+7a9fc94/85ff314>
>>esp; c7e5df58 <_end+7b4026c/85ff314>

Trace; c0190800 <txUpdateMap+2c0/2d0>
Trace; c0118486 <schedule+1a6/310>
Trace; c0190fb3 <txLazyCommit+23/f0>
Trace; c01911db <jfs_lazycommit+15b/250>
Trace; c0105000 <_stext+0/0>
Trace; c010739e <kernel_thread+2e/40>
Trace; c0191080 <jfs_lazycommit+0/250>

Code;  c018b565 <hold_metapage+15/70>
00000000 <_EIP>:
Code;  c018b565 <hold_metapage+15/70>   <=====
   0:   ff 41 18                  incl   0x18(%ecx)   <=====
Code;  c018b568 <hold_metapage+18/70>
   3:   85 d2                     test   %edx,%edx
Code;  c018b56a <hold_metapage+1a/70>
   5:   74 34                     je     3b <_EIP+0x3b> c018b5a0
<hold_metapage+50/70>
Code;  c018b56c <hold_metapage+1c/70>
   7:   31 c0                     xor    %eax,%eax
Code;  c018b56e <hold_metapage+1e/70>
   9:   0f ab 41 14               bts    %eax,0x14(%ecx)
Code;  c018b572 <hold_metapage+22/70>
   d:   19 c0                     sbb    %eax,%eax
Code;  c018b574 <hold_metapage+24/70>
   f:   85 c0                     test   %eax,%eax
Code;  c018b576 <hold_metapage+26/70>
  11:   74 09                     je     1c <_EIP+0x1c> c018b581
<hold_metapage+31/70>
Code;  c018b578 <hold_metapage+28/70>
  13:   b8 00 00 00 00            mov    $0x0,%eax


Regards,
Axel Siebenwirth

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-21 14:42 ` axel
@ 2002-07-23 14:54   ` Dave Kleikamp
  2002-07-23 15:06     ` [Jfs-discussion] " Christoph Hellwig
  2002-07-25 22:27     ` axel
  0 siblings, 2 replies; 9+ messages in thread
From: Dave Kleikamp @ 2002-07-23 14:54 UTC (permalink / raw)
  To: axel, linux-kernel; +Cc: jfs-discussion

On Sunday 21 July 2002 09:42, axel@hh59.org wrote:
> This oops occurred during build of gcc..
> Kernel 2.4.19-rc2-ac2.
> About the same happens with 2.5.27. I will post an oops of jfsCommit
> of 2.5.27 as soon as I get one.

I just built gcc on 2.4.19-rc3 + latest JFS and didn't have a problem.  
I'll repeat it on 2.4.19-rc2-ac2, but there shouldn't be more than a 
comsmetic difference in the JFS code.  I haven't tried 2.5.27 yet.

> ksymoops 2.4.5 on i686 2.4.19-rc2-ac2.  Options used
  --- ksymoops output deleted ---
>
> Trace; c0190800 <txUpdateMap+2c0/2d0>
> Trace; c0118486 <schedule+1a6/310>
> Trace; c0190fb3 <txLazyCommit+23/f0>
> Trace; c01911db <jfs_lazycommit+15b/250>
> Trace; c0105000 <_stext+0/0>
> Trace; c010739e <kernel_thread+2e/40>
> Trace; c0191080 <jfs_lazycommit+0/250>
>
> Code;  c018b565 <hold_metapage+15/70>
> 00000000 <_EIP>:
> Code;  c018b565 <hold_metapage+15/70>   <=====
>    0:   ff 41 18                  incl   0x18(%ecx)   <=====
> Code;  c018b568 <hold_metapage+18/70>
>    3:   85 d2                     test   %edx,%edx

It looks like tlck->mp was null in txUpdateMap, and hold_metapage was 
called with the null pointer.  I haven't seen this before, but I am 
looking at the code to see if I can figure out how it may have 
happened.  I'm guessing that you have built the kernel without 
CONFIG_JFS_DEBUG set.  If I'm right, can you set this before you try to 
stress JFS again.  It may help find the problem earlier.

> Regards,
> Axel Siebenwirth

Thanks,
Shaggy
-- 
David Kleikamp
IBM Linux Technology Center


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [Jfs-discussion] Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-23 14:54   ` Dave Kleikamp
@ 2002-07-23 15:06     ` Christoph Hellwig
  2002-07-23 16:15       ` Dave Kleikamp
  2002-07-29 18:39       ` Pavel Machek
  2002-07-25 22:27     ` axel
  1 sibling, 2 replies; 9+ messages in thread
From: Christoph Hellwig @ 2002-07-23 15:06 UTC (permalink / raw)
  To: Dave Kleikamp; +Cc: axel, linux-kernel, jfs-discussion

On Tue, Jul 23, 2002 at 09:54:35AM -0500, Dave Kleikamp wrote:
> On Sunday 21 July 2002 09:42, axel@hh59.org wrote:
> > This oops occurred during build of gcc..
> > Kernel 2.4.19-rc2-ac2.
> > About the same happens with 2.5.27. I will post an oops of jfsCommit
> > of 2.5.27 as soon as I get one.
> 
> I just built gcc on 2.4.19-rc3 + latest JFS and didn't have a problem.  
> I'll repeat it on 2.4.19-rc2-ac2, but there shouldn't be more than a 
> comsmetic difference in the JFS code.  I haven't tried 2.5.27 yet.

As I read 'Software Suspend' in the subject I guess it's swsusp fault.
Swsusp needs magic flags for kernel threads which no one has added to
JFS yet.


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [Jfs-discussion] Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-23 15:06     ` [Jfs-discussion] " Christoph Hellwig
@ 2002-07-23 16:15       ` Dave Kleikamp
  2002-07-23 16:20         ` Christoph Hellwig
  2002-07-29 18:39       ` Pavel Machek
  1 sibling, 1 reply; 9+ messages in thread
From: Dave Kleikamp @ 2002-07-23 16:15 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: axel, linux-kernel, jfs-discussion

On Tuesday 23 July 2002 10:06, Christoph Hellwig wrote:
> As I read 'Software Suspend' in the subject I guess it's swsusp
> fault. Swsusp needs magic flags for kernel threads which no one has
> added to JFS yet.

I understood the swsusp to be an unrelated issue.  Is swsusp even 
available in a 2.4 kernel?

I believe to fix the swsusp problem, the kernel threads need to test 
(current->flags & PF_FREEZE), and if set call 
refrigerator(PF_IOTHREAD).

-- 
David Kleikamp
IBM Linux Technology Center


^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [Jfs-discussion] Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-23 16:15       ` Dave Kleikamp
@ 2002-07-23 16:20         ` Christoph Hellwig
  0 siblings, 0 replies; 9+ messages in thread
From: Christoph Hellwig @ 2002-07-23 16:20 UTC (permalink / raw)
  To: Dave Kleikamp; +Cc: axel, linux-kernel, jfs-discussion

On Tue, Jul 23, 2002 at 11:15:14AM -0500, Dave Kleikamp wrote:
> On Tuesday 23 July 2002 10:06, Christoph Hellwig wrote:
> > As I read 'Software Suspend' in the subject I guess it's swsusp
> > fault. Swsusp needs magic flags for kernel threads which no one has
> > added to JFS yet.
> 
> I understood the swsusp to be an unrelated issue.  Is swsusp even 
> available in a 2.4 kernel?

There is a 2.4 patch and it was merged in -ac for some period.

> I believe to fix the swsusp problem, the kernel threads need to test 
> (current->flags & PF_FREEZE), and if set call 
> refrigerator(PF_IOTHREAD).

I think so.  (although I have to admit that I don't care for it)

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: JFS errors
  2002-07-23 14:54   ` Dave Kleikamp
  2002-07-23 15:06     ` [Jfs-discussion] " Christoph Hellwig
@ 2002-07-25 22:27     ` axel
  2002-07-26 14:04       ` Dave Kleikamp
  1 sibling, 1 reply; 9+ messages in thread
From: axel @ 2002-07-25 22:27 UTC (permalink / raw)
  To: Dave Kleikamp; +Cc: linux-kernel, jfs-discussion

Hi Dave!

On Tue, 23 Jul 2002, Dave Kleikamp wrote:

> happened.  I'm guessing that you have built the kernel without 
> CONFIG_JFS_DEBUG set.  If I'm right, can you set this before you try to 
> stress JFS again.  It may help find the problem earlier.

No, it's built with JFS_DEBUG. That was the first thing I compiled into a
new kernel when I first encountered this.
How can it help you? Shall I provide info from /proc/fs/jfs after oops
occured?
Oops itself I have to handcopy each time. Hard work! ;) But I guess I can
access /proc tree.

Axel Siebenwirth

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: JFS errors
  2002-07-25 22:27     ` axel
@ 2002-07-26 14:04       ` Dave Kleikamp
  0 siblings, 0 replies; 9+ messages in thread
From: Dave Kleikamp @ 2002-07-26 14:04 UTC (permalink / raw)
  To: axel; +Cc: linux-kernel, jfs-discussion

> No, it's built with JFS_DEBUG. That was the first thing I compiled into a
> new kernel when I first encountered this.

I'll take another look at the oops.  My initial thought was that if I was
right in my assumptions, a dereference in an ASSERT statement would have
caused a trap slightly earlier than the one you hit.  Without debug, the
ASSERT is compiled out.

> How can it help you?

If it's already on, it won't provide any more help.  There was just a
chance that if it wasn't on, it might have caught something earlier.

> Shall I provide info from /proc/fs/jfs after oops
> occured?

I doubt anything there would be useful.

> Oops itself I have to handcopy each time. Hard work! ;) But I guess I can
> access /proc tree.

The oops was helpful, and I'll need to take a closer look at the code.  I'll
let you know if I want you to try anything else.

Thanks for the feedback.

Shaggy
-- 
David Kleikamp
IBM Linux Technology Center

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [Jfs-discussion] Re: 2.5.27: Software Suspend failure / JFS errors
  2002-07-23 15:06     ` [Jfs-discussion] " Christoph Hellwig
  2002-07-23 16:15       ` Dave Kleikamp
@ 2002-07-29 18:39       ` Pavel Machek
  1 sibling, 0 replies; 9+ messages in thread
From: Pavel Machek @ 2002-07-29 18:39 UTC (permalink / raw)
  To: Christoph Hellwig, Dave Kleikamp, axel, linux-kernel, jfs-discussion

Hi!

> > > This oops occurred during build of gcc..
> > > Kernel 2.4.19-rc2-ac2.
> > > About the same happens with 2.5.27. I will post an oops of jfsCommit
> > > of 2.5.27 as soon as I get one.
> > 
> > I just built gcc on 2.4.19-rc3 + latest JFS and didn't have a problem.  
> > I'll repeat it on 2.4.19-rc2-ac2, but there shouldn't be more than a 
> > comsmetic difference in the JFS code.  I haven't tried 2.5.27 yet.
> 
> As I read 'Software Suspend' in the subject I guess it's swsusp fault.
> Swsusp needs magic flags for kernel threads which no one has added to
> JFS yet.

Hehe. Really someone should add if (current->flags & PF_FREEZE) refrigerator(); 
at the right place of JFS threads. I don't have JFS installed so it is hard
for me to do that, sorry.
								Pavel
-- 
Philips Velo 1: 1"x4"x8", 300gram, 60, 12MB, 40bogomips, linux, mutt,
details at http://atrey.karlin.mff.cuni.cz/~pavel/velo/index.html.


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2002-07-29 17:45 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2002-07-21 12:29 2.5.27: Software Suspend failure / JFS errors axel
2002-07-21 14:42 ` axel
2002-07-23 14:54   ` Dave Kleikamp
2002-07-23 15:06     ` [Jfs-discussion] " Christoph Hellwig
2002-07-23 16:15       ` Dave Kleikamp
2002-07-23 16:20         ` Christoph Hellwig
2002-07-29 18:39       ` Pavel Machek
2002-07-25 22:27     ` axel
2002-07-26 14:04       ` Dave Kleikamp

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.