All of lore.kernel.org
 help / color / mirror / Atom feed
* [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-16  3:38 ` JiSheng Zhang
  0 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-16  3:38 UTC (permalink / raw)
  To: linux-kernel; +Cc: linux-mm, stable, gregkh

Hi,

I triggered a failure in an fs test with fsx-linux from ltp. It seems that
fsx-linux failed at mmap->write sequence.

Tested kernel is 2.6.27.12 and 2.6.27.39
Tested file system: ext3, tmpfs.
IMHO, it impacts all file systems.

Some fsx-linux log is:

READ BAD DATA: offset = 0x2771b, size = 0xa28e
OFFSET  GOOD    BAD     RANGE
0x287e0 0x35c9  0x15a9     0x80
operation# (mod 256) for the bad datamay be 21
...
7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
 ******WWWW
7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
 ***RRRR***
Correct content saved for comparison
...

^ permalink raw reply	[flat|nested] 21+ messages in thread

* [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-16  3:38 ` JiSheng Zhang
  0 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-16  3:38 UTC (permalink / raw)
  To: linux-kernel; +Cc: linux-mm, stable, gregkh

Hi,

I triggered a failure in an fs test with fsx-linux from ltp. It seems that
fsx-linux failed at mmap->write sequence.

Tested kernel is 2.6.27.12 and 2.6.27.39
Tested file system: ext3, tmpfs.
IMHO, it impacts all file systems.

Some fsx-linux log is:

READ BAD DATA: offset = 0x2771b, size = 0xa28e
OFFSET  GOOD    BAD     RANGE
0x287e0 0x35c9  0x15a9     0x80
operation# (mod 256) for the bad datamay be 21
...
7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
 ******WWWW
7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
 ***RRRR***
Correct content saved for comparison
...

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-16  3:38 ` JiSheng Zhang
@ 2009-11-17  1:56   ` Greg KH
  -1 siblings, 0 replies; 21+ messages in thread
From: Greg KH @ 2009-11-17  1:56 UTC (permalink / raw)
  To: JiSheng Zhang; +Cc: linux-kernel, linux-mm, stable

On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> Hi,
> 
> I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> fsx-linux failed at mmap->write sequence.
> 
> Tested kernel is 2.6.27.12 and 2.6.27.39

Does this work on any kernel you have tested?  Or is it a regression?

> Tested file system: ext3, tmpfs.
> IMHO, it impacts all file systems.
> 
> Some fsx-linux log is:
> 
> READ BAD DATA: offset = 0x2771b, size = 0xa28e
> OFFSET  GOOD    BAD     RANGE
> 0x287e0 0x35c9  0x15a9     0x80
> operation# (mod 256) for the bad datamay be 21
> ...
> 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>  ******WWWW
> 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
>  ***RRRR***
> Correct content saved for comparison
> ...

Are you sure that the LTP is correct?  It wouldn't be the first time it
wasn't...

thanks,

greg k-h

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-17  1:56   ` Greg KH
  0 siblings, 0 replies; 21+ messages in thread
From: Greg KH @ 2009-11-17  1:56 UTC (permalink / raw)
  To: JiSheng Zhang; +Cc: linux-kernel, linux-mm, stable

On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> Hi,
> 
> I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> fsx-linux failed at mmap->write sequence.
> 
> Tested kernel is 2.6.27.12 and 2.6.27.39

Does this work on any kernel you have tested?  Or is it a regression?

> Tested file system: ext3, tmpfs.
> IMHO, it impacts all file systems.
> 
> Some fsx-linux log is:
> 
> READ BAD DATA: offset = 0x2771b, size = 0xa28e
> OFFSET  GOOD    BAD     RANGE
> 0x287e0 0x35c9  0x15a9     0x80
> operation# (mod 256) for the bad datamay be 21
> ...
> 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>  ******WWWW
> 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
>  ***RRRR***
> Correct content saved for comparison
> ...

Are you sure that the LTP is correct?  It wouldn't be the first time it
wasn't...

thanks,

greg k-h

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-17  1:56   ` Greg KH
@ 2009-11-17 11:07     ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-17 11:07 UTC (permalink / raw)
  To: Greg KH; +Cc: linux-kernel, linux-mm, stable

Hi Greg,

2009/11/17 Greg KH <gregkh@suse.de>:
>>
>> Tested kernel is 2.6.27.12 and 2.6.27.39
>
> Does this work on any kernel you have tested?  Or is it a regression?

I have tested on both 2.6.27.12 and 2.6.27.39, fsx-linux all failed.

>
>> Tested file system: ext3, tmpfs.
>> IMHO, it impacts all file systems.
>>
>> Some fsx-linux log is:
>>
>> READ BAD DATA: offset = 0x2771b, size = 0xa28e
>> OFFSET  GOOD    BAD     RANGE
> Are you sure that the LTP is correct?  It wouldn't be the first time it
> wasn't...

hmmm, I read the source again, IMHO it is correct.

>
> thanks,
>
> greg k-h
>

One more findings: If I add "return" at the beginning of domapwrite,
no fail found yet.

Regards,
Jisheng

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-17 11:07     ` JiSheng Zhang
  0 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-17 11:07 UTC (permalink / raw)
  To: Greg KH; +Cc: linux-kernel, linux-mm, stable

Hi Greg,

2009/11/17 Greg KH <gregkh@suse.de>:
>>
>> Tested kernel is 2.6.27.12 and 2.6.27.39
>
> Does this work on any kernel you have tested?  Or is it a regression?

I have tested on both 2.6.27.12 and 2.6.27.39, fsx-linux all failed.

>
>> Tested file system: ext3, tmpfs.
>> IMHO, it impacts all file systems.
>>
>> Some fsx-linux log is:
>>
>> READ BAD DATA: offset = 0x2771b, size = 0xa28e
>> OFFSET  GOOD    BAD     RANGE
> Are you sure that the LTP is correct?  It wouldn't be the first time it
> wasn't...

hmmm, I read the source again, IMHO it is correct.

>
> thanks,
>
> greg k-h
>

One more findings: If I add "return" at the beginning of domapwrite,
no fail found yet.

Regards,
Jisheng

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-17  1:56   ` Greg KH
@ 2009-11-17 12:36     ` Chris Mason
  -1 siblings, 0 replies; 21+ messages in thread
From: Chris Mason @ 2009-11-17 12:36 UTC (permalink / raw)
  To: Greg KH; +Cc: JiSheng Zhang, linux-kernel, linux-mm, stable, jack

On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > Hi,
> > 
> > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > fsx-linux failed at mmap->write sequence.
> > 
> > Tested kernel is 2.6.27.12 and 2.6.27.39
> 
> Does this work on any kernel you have tested?  Or is it a regression?
> 
> > Tested file system: ext3, tmpfs.
> > IMHO, it impacts all file systems.
> > 
> > Some fsx-linux log is:
> > 
> > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > OFFSET  GOOD    BAD     RANGE
> > 0x287e0 0x35c9  0x15a9     0x80
> > operation# (mod 256) for the bad datamay be 21
> > ...
> > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> >  ******WWWW
> > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> >  ***RRRR***
> > Correct content saved for comparison
> > ...
> 
> Are you sure that the LTP is correct?  It wouldn't be the first time it
> wasn't...

I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
something here in ext3, does 2.6.32-rc work?

-chris


^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-17 12:36     ` Chris Mason
  0 siblings, 0 replies; 21+ messages in thread
From: Chris Mason @ 2009-11-17 12:36 UTC (permalink / raw)
  To: Greg KH; +Cc: JiSheng Zhang, linux-kernel, linux-mm, stable, jack

On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > Hi,
> > 
> > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > fsx-linux failed at mmap->write sequence.
> > 
> > Tested kernel is 2.6.27.12 and 2.6.27.39
> 
> Does this work on any kernel you have tested?  Or is it a regression?
> 
> > Tested file system: ext3, tmpfs.
> > IMHO, it impacts all file systems.
> > 
> > Some fsx-linux log is:
> > 
> > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > OFFSET  GOOD    BAD     RANGE
> > 0x287e0 0x35c9  0x15a9     0x80
> > operation# (mod 256) for the bad datamay be 21
> > ...
> > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> >  ******WWWW
> > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> >  ***RRRR***
> > Correct content saved for comparison
> > ...
> 
> Are you sure that the LTP is correct?  It wouldn't be the first time it
> wasn't...

I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
something here in ext3, does 2.6.32-rc work?

-chris

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-17 12:36     ` Chris Mason
@ 2009-11-17 19:06       ` Jan Kara
  -1 siblings, 0 replies; 21+ messages in thread
From: Jan Kara @ 2009-11-17 19:06 UTC (permalink / raw)
  To: Chris Mason; +Cc: Greg KH, JiSheng Zhang, linux-kernel, linux-mm, stable, jack

On Tue 17-11-09 07:36:22, Chris Mason wrote:
> On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > Hi,
> > > 
> > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > fsx-linux failed at mmap->write sequence.
> > > 
> > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > 
> > Does this work on any kernel you have tested?  Or is it a regression?
> > 
> > > Tested file system: ext3, tmpfs.
> > > IMHO, it impacts all file systems.
> > > 
> > > Some fsx-linux log is:
> > > 
> > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > OFFSET  GOOD    BAD     RANGE
> > > 0x287e0 0x35c9  0x15a9     0x80
> > > operation# (mod 256) for the bad datamay be 21
> > > ...
> > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > >  ******WWWW
> > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > >  ***RRRR***
> > > Correct content saved for comparison
> > > ...
  Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
for a while on 2.6.27.21 and didn't hit the problem yet.

> > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > wasn't...
> 
> I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> something here in ext3, does 2.6.32-rc work?
  Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
be interesting.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-17 19:06       ` Jan Kara
  0 siblings, 0 replies; 21+ messages in thread
From: Jan Kara @ 2009-11-17 19:06 UTC (permalink / raw)
  To: Chris Mason; +Cc: Greg KH, JiSheng Zhang, linux-kernel, linux-mm, stable, jack

On Tue 17-11-09 07:36:22, Chris Mason wrote:
> On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > Hi,
> > > 
> > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > fsx-linux failed at mmap->write sequence.
> > > 
> > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > 
> > Does this work on any kernel you have tested?  Or is it a regression?
> > 
> > > Tested file system: ext3, tmpfs.
> > > IMHO, it impacts all file systems.
> > > 
> > > Some fsx-linux log is:
> > > 
> > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > OFFSET  GOOD    BAD     RANGE
> > > 0x287e0 0x35c9  0x15a9     0x80
> > > operation# (mod 256) for the bad datamay be 21
> > > ...
> > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > >  ******WWWW
> > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > >  ***RRRR***
> > > Correct content saved for comparison
> > > ...
  Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
for a while on 2.6.27.21 and didn't hit the problem yet.

> > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > wasn't...
> 
> I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> something here in ext3, does 2.6.32-rc work?
  Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
be interesting.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-17 19:06       ` Jan Kara
@ 2009-11-18 14:17         ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-18 14:17 UTC (permalink / raw)
  To: Jan Kara
  Cc: Chris Mason, Greg KH, linux-kernel, linux-mm, stable, jack, rmk,
	linux-arm

On Tue, 17 Nov 2009 20:06:35 +0100
Jan Kara <jack@suse.cz> wrote:

> On Tue 17-11-09 07:36:22, Chris Mason wrote:
> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > > Hi,
> > > > 
> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > > fsx-linux failed at mmap->write sequence.
> > > > 
> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > > 
> > > Does this work on any kernel you have tested?  Or is it a regression?
> > > 
> > > > Tested file system: ext3, tmpfs.
> > > > IMHO, it impacts all file systems.
> > > > 
> > > > Some fsx-linux log is:
> > > > 
> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > > OFFSET  GOOD    BAD     RANGE
> > > > 0x287e0 0x35c9  0x15a9     0x80
> > > > operation# (mod 256) for the bad datamay be 21
> > > > ...
> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > > >  ******WWWW
> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > > >  ***RRRR***
> > > > Correct content saved for comparison
> > > > ...
>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
> for a while on 2.6.27.21 and didn't hit the problem yet.

I forget to mention that the test were done on an arm board with 64M ram. 
I have tested fsx-linux again on pc, it seems that failure go away.

> 
> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > > wasn't...
> > 
> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> > something here in ext3, does 2.6.32-rc work?
>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
> be interesting.

Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
On my pc box, there's no failure so far.

> 
> 								Honza

I found this via google:
http://marc.info/?t=118026315000001&r=1&w=2

I even tried the code from
http://marc.info/?l=linux-arch&m=118030601701617&w=2
I got mostly:
firstfirstfirst
firstfirstfirst
firstfirstfirst


No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the 
flush_dcache_page() call unconditional in do_generic_mapping_read.
This behavior is different from what I read from the mail thread above.

> void do_generic_mapping_read(struct address_space *mapping,
>                              struct file_ra_state *_ra,
>                              struct file *filp,
>                              loff_t *ppos,
>                              read_descriptor_t *desc,
>                              read_actor_t actor)
> {
> ...
>                 /* If users can be writing to this page using arbitrary
>                  * virtual addresses, take care about potential aliasing
>                  * before reading the page on the kernel side.
>                  */
>                 if (1 || mapping_writably_mapped(mapping))
>                         flush_dcache_page(page);

Then I run fsx-linux after the above modification, fsx-linux failed all 
the same both on tmpfs and ext3

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-18 14:17         ` JiSheng Zhang
  0 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-18 14:17 UTC (permalink / raw)
  To: Jan Kara
  Cc: Chris Mason, Greg KH, linux-kernel, linux-mm, stable, rmk, linux-arm

On Tue, 17 Nov 2009 20:06:35 +0100
Jan Kara <jack@suse.cz> wrote:

> On Tue 17-11-09 07:36:22, Chris Mason wrote:
> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > > Hi,
> > > > 
> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > > fsx-linux failed at mmap->write sequence.
> > > > 
> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > > 
> > > Does this work on any kernel you have tested?  Or is it a regression?
> > > 
> > > > Tested file system: ext3, tmpfs.
> > > > IMHO, it impacts all file systems.
> > > > 
> > > > Some fsx-linux log is:
> > > > 
> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > > OFFSET  GOOD    BAD     RANGE
> > > > 0x287e0 0x35c9  0x15a9     0x80
> > > > operation# (mod 256) for the bad datamay be 21
> > > > ...
> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > > >  ******WWWW
> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > > >  ***RRRR***
> > > > Correct content saved for comparison
> > > > ...
>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
> for a while on 2.6.27.21 and didn't hit the problem yet.

I forget to mention that the test were done on an arm board with 64M ram. 
I have tested fsx-linux again on pc, it seems that failure go away.

> 
> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > > wasn't...
> > 
> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> > something here in ext3, does 2.6.32-rc work?
>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
> be interesting.

Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
On my pc box, there's no failure so far.

> 
> 								Honza

I found this via google:
http://marc.info/?t=118026315000001&r=1&w=2

I even tried the code from
http://marc.info/?l=linux-arch&m=118030601701617&w=2
I got mostly:
firstfirstfirst
firstfirstfirst
firstfirstfirst


No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the 
flush_dcache_page() call unconditional in do_generic_mapping_read.
This behavior is different from what I read from the mail thread above.

> void do_generic_mapping_read(struct address_space *mapping,
>                              struct file_ra_state *_ra,
>                              struct file *filp,
>                              loff_t *ppos,
>                              read_descriptor_t *desc,
>                              read_actor_t actor)
> {
> ...
>                 /* If users can be writing to this page using arbitrary
>                  * virtual addresses, take care about potential aliasing
>                  * before reading the page on the kernel side.
>                  */
>                 if (1 || mapping_writably_mapped(mapping))
>                         flush_dcache_page(page);

Then I run fsx-linux after the above modification, fsx-linux failed all 
the same both on tmpfs and ext3

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Fwd: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-18 14:17         ` JiSheng Zhang
  (?)
@ 2009-11-19  5:56         ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-19  5:56 UTC (permalink / raw)
  To: linux-arm-kernel

---------- Forwarded message ----------
From: JiSheng Zhang <jszhang3@gmail.com>
Date: 2009/11/18
Subject: Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
To: Jan Kara <jack@suse.cz>
??? Chris Mason <chris.mason@oracle.com>, Greg KH <gregkh@suse.de>,
linux-kernel at vger.kernel.org, linux-mm at kvack.org, stable at kernel.org,
jack at suse.cz, rmk at arm.linux.org.uk, linux-arm at lists.infradead.org


On Tue, 17 Nov 2009 20:06:35 +0100
Jan Kara <jack@suse.cz> wrote:

> On Tue 17-11-09 07:36:22, Chris Mason wrote:
> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > > Hi,
> > > >
> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > > fsx-linux failed at mmap->write sequence.
> > > >
> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > >
> > > Does this work on any kernel you have tested?  Or is it a regression?
> > >
> > > > Tested file system: ext3, tmpfs.
> > > > IMHO, it impacts all file systems.
> > > >
> > > > Some fsx-linux log is:
> > > >
> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > > OFFSET  GOOD    BAD     RANGE
> > > > 0x287e0 0x35c9  0x15a9     0x80
> > > > operation# (mod 256) for the bad datamay be 21
> > > > ...
> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > > >  ******WWWW
> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > > >  ***RRRR***
> > > > Correct content saved for comparison
> > > > ...
>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
> for a while on 2.6.27.21 and didn't hit the problem yet.

I forget to mention that the test were done on an arm board with 64M ram.
I have tested fsx-linux again on pc, it seems that failure go away.

>
> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > > wasn't...
> >
> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> > something here in ext3, does 2.6.32-rc work?
>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
> be interesting.

Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
On my pc box, there's no failure so far.

>
>                                                               Honza

I found this via google:
http://marc.info/?t=118026315000001&r=1&w=2

I even tried the code from
http://marc.info/?l=linux-arch&m=118030601701617&w=2
I got mostly:
firstfirstfirst
firstfirstfirst
firstfirstfirst


No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the
flush_dcache_page() call unconditional in do_generic_mapping_read.
This behavior is different from what I read from the mail thread above.

> void do_generic_mapping_read(struct address_space *mapping,
>                              struct file_ra_state *_ra,
>                              struct file *filp,
>                              loff_t *ppos,
>                              read_descriptor_t *desc,
>                              read_actor_t actor)
> {
> ...
>                 /* If users can be writing to this page using arbitrary
>                  * virtual addresses, take care about potential aliasing
>                  * before reading the page on the kernel side.
>                  */
>                 if (1 || mapping_writably_mapped(mapping))
>                         flush_dcache_page(page);

Then I run fsx-linux after the above modification, fsx-linux failed all
the same both on tmpfs and ext3

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-18 14:17         ` JiSheng Zhang
@ 2009-11-19 14:43           ` Jan Kara
  -1 siblings, 0 replies; 21+ messages in thread
From: Jan Kara @ 2009-11-19 14:43 UTC (permalink / raw)
  To: JiSheng Zhang
  Cc: Jan Kara, Chris Mason, Greg KH, linux-kernel, linux-mm, stable,
	rmk, linux-arm

On Wed 18-11-09 22:17:56, JiSheng Zhang wrote:
> On Tue, 17 Nov 2009 20:06:35 +0100
> Jan Kara <jack@suse.cz> wrote:
> 
> > On Tue 17-11-09 07:36:22, Chris Mason wrote:
> > > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > > > Hi,
> > > > > 
> > > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > > > fsx-linux failed at mmap->write sequence.
> > > > > 
> > > > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > > > 
> > > > Does this work on any kernel you have tested?  Or is it a regression?
> > > > 
> > > > > Tested file system: ext3, tmpfs.
> > > > > IMHO, it impacts all file systems.
> > > > > 
> > > > > Some fsx-linux log is:
> > > > > 
> > > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > > > OFFSET  GOOD    BAD     RANGE
> > > > > 0x287e0 0x35c9  0x15a9     0x80
> > > > > operation# (mod 256) for the bad datamay be 21
> > > > > ...
> > > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > > > >  ******WWWW
> > > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > > > >  ***RRRR***
> > > > > Correct content saved for comparison
> > > > > ...
> >   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
> > for a while on 2.6.27.21 and didn't hit the problem yet.
> 
> I forget to mention that the test were done on an arm board with 64M ram. 
> I have tested fsx-linux again on pc, it seems that failure go away.
>
> > > > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > > > wasn't...
> > > 
> > > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> > > something here in ext3, does 2.6.32-rc work?
> >   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
> > so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
> > be interesting.
> 
> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
> On my pc box, there's no failure so far.
  OK, so it's either ARM specific or it's triggered by low amount of
available memory (you might want to try testing your PC with mem=64M).

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-19 14:43           ` Jan Kara
  0 siblings, 0 replies; 21+ messages in thread
From: Jan Kara @ 2009-11-19 14:43 UTC (permalink / raw)
  To: JiSheng Zhang
  Cc: Jan Kara, Chris Mason, Greg KH, linux-kernel, linux-mm, stable,
	rmk, linux-arm

On Wed 18-11-09 22:17:56, JiSheng Zhang wrote:
> On Tue, 17 Nov 2009 20:06:35 +0100
> Jan Kara <jack@suse.cz> wrote:
> 
> > On Tue 17-11-09 07:36:22, Chris Mason wrote:
> > > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
> > > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
> > > > > Hi,
> > > > > 
> > > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
> > > > > fsx-linux failed at mmap->write sequence.
> > > > > 
> > > > > Tested kernel is 2.6.27.12 and 2.6.27.39
> > > > 
> > > > Does this work on any kernel you have tested?  Or is it a regression?
> > > > 
> > > > > Tested file system: ext3, tmpfs.
> > > > > IMHO, it impacts all file systems.
> > > > > 
> > > > > Some fsx-linux log is:
> > > > > 
> > > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
> > > > > OFFSET  GOOD    BAD     RANGE
> > > > > 0x287e0 0x35c9  0x15a9     0x80
> > > > > operation# (mod 256) for the bad datamay be 21
> > > > > ...
> > > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
> > > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
> > > > >  ******WWWW
> > > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
> > > > >  ***RRRR***
> > > > > Correct content saved for comparison
> > > > > ...
> >   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
> > for a while on 2.6.27.21 and didn't hit the problem yet.
> 
> I forget to mention that the test were done on an arm board with 64M ram. 
> I have tested fsx-linux again on pc, it seems that failure go away.
>
> > > > Are you sure that the LTP is correct?  It wouldn't be the first time it
> > > > wasn't...
> > > 
> > > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
> > > something here in ext3, does 2.6.32-rc work?
> >   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
> > so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
> > be interesting.
> 
> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
> On my pc box, there's no failure so far.
  OK, so it's either ARM specific or it's triggered by low amount of
available memory (you might want to try testing your PC with mem=64M).

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-18 14:17         ` JiSheng Zhang
@ 2009-11-19 15:25           ` Russell King - ARM Linux
  -1 siblings, 0 replies; 21+ messages in thread
From: Russell King - ARM Linux @ 2009-11-19 15:25 UTC (permalink / raw)
  To: JiSheng Zhang
  Cc: Jan Kara, Greg KH, linux-kernel, linux-mm, stable, Chris Mason,
	linux-arm

On Wed, Nov 18, 2009 at 10:17:56PM +0800, JiSheng Zhang wrote:
> I forget to mention that the test were done on an arm board with 64M ram. 
> I have tested fsx-linux again on pc, it seems that failure go away.

Could provide a full bug report please, as in:

- CPU type
- is it a SMP CPU
- are you running a SMP kernel
- board type

All the above can be provided by supplying the kernel boot messages
(preferred)

- the storage peripheral being used for this test
- is DMA being used for this periperal
- any additional block layers (eg, lvm, dm, md)
- filesystem type

Plus, please cc suspected ARM problems to the ARM _kernel_ mailing list.

Thanks.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-19 15:25           ` Russell King - ARM Linux
  0 siblings, 0 replies; 21+ messages in thread
From: Russell King - ARM Linux @ 2009-11-19 15:25 UTC (permalink / raw)
  To: JiSheng Zhang
  Cc: Jan Kara, Greg KH, linux-kernel, linux-mm, stable, Chris Mason,
	linux-arm

On Wed, Nov 18, 2009 at 10:17:56PM +0800, JiSheng Zhang wrote:
> I forget to mention that the test were done on an arm board with 64M ram. 
> I have tested fsx-linux again on pc, it seems that failure go away.

Could provide a full bug report please, as in:

- CPU type
- is it a SMP CPU
- are you running a SMP kernel
- board type

All the above can be provided by supplying the kernel boot messages
(preferred)

- the storage peripheral being used for this test
- is DMA being used for this periperal
- any additional block layers (eg, lvm, dm, md)
- filesystem type

Plus, please cc suspected ARM problems to the ARM _kernel_ mailing list.

Thanks.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-18 14:17         ` JiSheng Zhang
@ 2009-11-20  3:41           ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-20  3:41 UTC (permalink / raw)
  To: Jan Kara
  Cc: Chris Mason, Greg KH, linux-kernel, linux-mm, stable, rmk, linux-arm

Hi,

Russell King wrote
>- CPU type
ARM926EJ-S
>- is it a SMP CPU
no. UP
>- are you running a SMP kernel
no
>- board type
an soc


>- the storage peripheral being used for this test
memory and harddrive
>- is DMA being used for this periperal
for memory, DMA? for harddrive, yes
>- any additional block layers (eg, lvm, dm, md)
no
>- filesystem type
tmpfs and ext3

2009/11/18 JiSheng Zhang <jszhang3@gmail.com>:
> On Tue, 17 Nov 2009 20:06:35 +0100
> Jan Kara <jack@suse.cz> wrote:
>
>> On Tue 17-11-09 07:36:22, Chris Mason wrote:
>> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
>> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
>> > > > Hi,
>> > > >
>> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
>> > > > fsx-linux failed at mmap->write sequence.
>> > > >
>> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
>> > >
>> > > Does this work on any kernel you have tested?  Or is it a regression?
>> > >
>> > > > Tested file system: ext3, tmpfs.
>> > > > IMHO, it impacts all file systems.
>> > > >
>> > > > Some fsx-linux log is:
>> > > >
>> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
>> > > > OFFSET  GOOD    BAD     RANGE
>> > > > 0x287e0 0x35c9  0x15a9     0x80
>> > > > operation# (mod 256) for the bad datamay be 21
>> > > > ...
>> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
>> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>> > > >  ******WWWW
>> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
>> > > >  ***RRRR***
>> > > > Correct content saved for comparison
>> > > > ...
>>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
>> for a while on 2.6.27.21 and didn't hit the problem yet.
>
> I forget to mention that the test were done on an arm board with 64M ram.
> I have tested fsx-linux again on pc, it seems that failure go away.
>
>>
>> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
>> > > wasn't...
>> >
>> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
>> > something here in ext3, does 2.6.32-rc work?
>>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
>> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
>> be interesting.
>
> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
> On my pc box, there's no failure so far.
>
>>
>>                                                               Honza
>
> I found this via google:
> http://marc.info/?t=118026315000001&r=1&w=2
>
> I even tried the code from
> http://marc.info/?l=linux-arch&m=118030601701617&w=2
> I got mostly:
> firstfirstfirst
> firstfirstfirst
> firstfirstfirst
>
>
> No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the
> flush_dcache_page() call unconditional in do_generic_mapping_read.
> This behavior is different from what I read from the mail thread above.
>
>> void do_generic_mapping_read(struct address_space *mapping,
>>                              struct file_ra_state *_ra,
>>                              struct file *filp,
>>                              loff_t *ppos,
>>                              read_descriptor_t *desc,
>>                              read_actor_t actor)
>> {
>> ...
>>                 /* If users can be writing to this page using arbitrary
>>                  * virtual addresses, take care about potential aliasing
>>                  * before reading the page on the kernel side.
>>                  */
>>                 if (1 || mapping_writably_mapped(mapping))
>>                         flush_dcache_page(page);
>
> Then I run fsx-linux after the above modification, fsx-linux failed all
> the same both on tmpfs and ext3

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
@ 2009-11-20  3:41           ` JiSheng Zhang
  0 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-20  3:41 UTC (permalink / raw)
  To: Jan Kara
  Cc: Chris Mason, Greg KH, linux-kernel, linux-mm, stable, rmk, linux-arm

Hi,

Russell King wrote
>- CPU type
ARM926EJ-S
>- is it a SMP CPU
no. UP
>- are you running a SMP kernel
no
>- board type
an soc


>- the storage peripheral being used for this test
memory and harddrive
>- is DMA being used for this periperal
for memory, DMA? for harddrive, yes
>- any additional block layers (eg, lvm, dm, md)
no
>- filesystem type
tmpfs and ext3

2009/11/18 JiSheng Zhang <jszhang3@gmail.com>:
> On Tue, 17 Nov 2009 20:06:35 +0100
> Jan Kara <jack@suse.cz> wrote:
>
>> On Tue 17-11-09 07:36:22, Chris Mason wrote:
>> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
>> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
>> > > > Hi,
>> > > >
>> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
>> > > > fsx-linux failed at mmap->write sequence.
>> > > >
>> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
>> > >
>> > > Does this work on any kernel you have tested?  Or is it a regression?
>> > >
>> > > > Tested file system: ext3, tmpfs.
>> > > > IMHO, it impacts all file systems.
>> > > >
>> > > > Some fsx-linux log is:
>> > > >
>> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
>> > > > OFFSET  GOOD    BAD     RANGE
>> > > > 0x287e0 0x35c9  0x15a9     0x80
>> > > > operation# (mod 256) for the bad datamay be 21
>> > > > ...
>> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
>> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>> > > >  ******WWWW
>> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
>> > > >  ***RRRR***
>> > > > Correct content saved for comparison
>> > > > ...
>>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
>> for a while on 2.6.27.21 and didn't hit the problem yet.
>
> I forget to mention that the test were done on an arm board with 64M ram.
> I have tested fsx-linux again on pc, it seems that failure go away.
>
>>
>> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
>> > > wasn't...
>> >
>> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
>> > something here in ext3, does 2.6.32-rc work?
>>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
>> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
>> be interesting.
>
> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
> On my pc box, there's no failure so far.
>
>>
>>                                                               Honza
>
> I found this via google:
> http://marc.info/?t=118026315000001&r=1&w=2
>
> I even tried the code from
> http://marc.info/?l=linux-arch&m=118030601701617&w=2
> I got mostly:
> firstfirstfirst
> firstfirstfirst
> firstfirstfirst
>
>
> No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the
> flush_dcache_page() call unconditional in do_generic_mapping_read.
> This behavior is different from what I read from the mail thread above.
>
>> void do_generic_mapping_read(struct address_space *mapping,
>>                              struct file_ra_state *_ra,
>>                              struct file *filp,
>>                              loff_t *ppos,
>>                              read_descriptor_t *desc,
>>                              read_actor_t actor)
>> {
>> ...
>>                 /* If users can be writing to this page using arbitrary
>>                  * virtual addresses, take care about potential aliasing
>>                  * before reading the page on the kernel side.
>>                  */
>>                 if (1 || mapping_writably_mapped(mapping))
>>                         flush_dcache_page(page);
>
> Then I run fsx-linux after the above modification, fsx-linux failed all
> the same both on tmpfs and ext3

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Fwd: [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-20  3:41           ` JiSheng Zhang
  (?)
@ 2009-11-20  3:41           ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-20  3:41 UTC (permalink / raw)
  To: linux-arm-kernel

---------- Forwarded message ----------
From: JiSheng Zhang <jszhang3@gmail.com>
Date: 2009/11/20
Subject: Re: [BUG]2.6.27.y some contents lost after writing to mmaped file
To: Jan Kara <jack@suse.cz>
??? Chris Mason <chris.mason@oracle.com>, Greg KH <gregkh@suse.de>,
linux-kernel at vger.kernel.org, linux-mm at kvack.org, stable at kernel.org,
rmk at arm.linux.org.uk, linux-arm at lists.infradead.org


Hi,

Russell King wrote
>- CPU type
ARM926EJ-S
>- is it a SMP CPU
no. UP
>- are you running a SMP kernel
no
>- board type
an soc


>- the storage peripheral being used for this test
memory and harddrive
>- is DMA being used for this periperal
for memory, DMA? for harddrive, yes
>- any additional block layers (eg, lvm, dm, md)
no
>- filesystem type
tmpfs and ext3

2009/11/18 JiSheng Zhang <jszhang3@gmail.com>:
> On Tue, 17 Nov 2009 20:06:35 +0100
> Jan Kara <jack@suse.cz> wrote:
>
>> On Tue 17-11-09 07:36:22, Chris Mason wrote:
>> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
>> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
>> > > > Hi,
>> > > >
>> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
>> > > > fsx-linux failed at mmap->write sequence.
>> > > >
>> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
>> > >
>> > > Does this work on any kernel you have tested?  Or is it a regression?
>> > >
>> > > > Tested file system: ext3, tmpfs.
>> > > > IMHO, it impacts all file systems.
>> > > >
>> > > > Some fsx-linux log is:
>> > > >
>> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
>> > > > OFFSET  GOOD    BAD     RANGE
>> > > > 0x287e0 0x35c9  0x15a9     0x80
>> > > > operation# (mod 256) for the bad datamay be 21
>> > > > ...
>> > > > 7828: 1257514978.306753 READ     0x23dba thru 0x25699 (0x18e0 bytes)
>> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>> > > >  ******WWWW
>> > > > 7830: 1257514978.307504 READ     0x2771b thru 0x319a8 (0xa28e bytes)
>> > > >  ***RRRR***
>> > > > Correct content saved for comparison
>> > > > ...
>>   Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
>> for a while on 2.6.27.21 and didn't hit the problem yet.
>
> I forget to mention that the test were done on an arm board with 64M ram.
> I have tested fsx-linux again on pc, it seems that failure go away.
>
>>
>> > > Are you sure that the LTP is correct?  It wouldn't be the first time it
>> > > wasn't...
>> >
>> > I'm afraid fsx usually finds bugs.  I thought Jan Kara recently fixed
>> > something here in ext3, does 2.6.32-rc work?
>>   Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
>> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
>> be interesting.
>
> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
> On my pc box, there's no failure so far.
>
>>
>>                                                               Honza
>
> I found this via google:
> http://marc.info/?t=118026315000001&r=1&w=2
>
> I even tried the code from
> http://marc.info/?l=linux-arch&m=118030601701617&w=2
> I got mostly:
> firstfirstfirst
> firstfirstfirst
> firstfirstfirst
>
>
> No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the
> flush_dcache_page() call unconditional in do_generic_mapping_read.
> This behavior is different from what I read from the mail thread above.
>
>> void do_generic_mapping_read(struct address_space *mapping,
>>                              struct file_ra_state *_ra,
>>                              struct file *filp,
>>                              loff_t *ppos,
>>                              read_descriptor_t *desc,
>>                              read_actor_t actor)
>> {
>> ...
>>                 /* If users can be writing to this page using arbitrary
>>                  * virtual addresses, take care about potential aliasing
>>                  * before reading the page on the kernel side.
>>                  */
>>                 if (1 || mapping_writably_mapped(mapping))
>>                         flush_dcache_page(page);
>
> Then I run fsx-linux after the above modification, fsx-linux failed all
> the same both on tmpfs and ext3

^ permalink raw reply	[flat|nested] 21+ messages in thread

* [BUG]2.6.27.y some contents lost after writing to mmaped file
  2009-11-20  3:41           ` JiSheng Zhang
  (?)
  (?)
@ 2009-11-20  3:43           ` JiSheng Zhang
  -1 siblings, 0 replies; 21+ messages in thread
From: JiSheng Zhang @ 2009-11-20  3:43 UTC (permalink / raw)
  To: linux-arm-kernel

oops, I forgot to cc linux-arm-kernel

2009/11/20 JiSheng Zhang <jszhang3@gmail.com>:
> Hi,
>
> Russell King wrote
>>- CPU type
> ARM926EJ-S
>>- is it a SMP CPU
> no. UP
>>- are you running a SMP kernel
> no
>>- board type
> an soc
>
>
>>- the storage peripheral being used for this test
> memory and harddrive
>>- is DMA being used for this periperal
> for memory, DMA? for harddrive, yes
>>- any additional block layers (eg, lvm, dm, md)
> no
>>- filesystem type
> tmpfs and ext3
>
> 2009/11/18 JiSheng Zhang <jszhang3@gmail.com>:
>> On Tue, 17 Nov 2009 20:06:35 +0100
>> Jan Kara <jack@suse.cz> wrote:
>>
>>> On Tue 17-11-09 07:36:22, Chris Mason wrote:
>>> > On Mon, Nov 16, 2009 at 05:56:55PM -0800, Greg KH wrote:
>>> > > On Mon, Nov 16, 2009 at 11:38:57AM +0800, JiSheng Zhang wrote:
>>> > > > Hi,
>>> > > >
>>> > > > I triggered a failure in an fs test with fsx-linux from ltp. It seems that
>>> > > > fsx-linux failed at mmap->write sequence.
>>> > > >
>>> > > > Tested kernel is 2.6.27.12 and 2.6.27.39
>>> > >
>>> > > Does this work on any kernel you have tested? ?Or is it a regression?
>>> > >
>>> > > > Tested file system: ext3, tmpfs.
>>> > > > IMHO, it impacts all file systems.
>>> > > >
>>> > > > Some fsx-linux log is:
>>> > > >
>>> > > > READ BAD DATA: offset = 0x2771b, size = 0xa28e
>>> > > > OFFSET ?GOOD ? ?BAD ? ? RANGE
>>> > > > 0x287e0 0x35c9 ?0x15a9 ? ? 0x80
>>> > > > operation# (mod 256) for the bad datamay be 21
>>> > > > ...
>>> > > > 7828: 1257514978.306753 READ ? ? 0x23dba thru 0x25699 (0x18e0 bytes)
>>> > > > 7829: 1257514978.306899 MAPWRITE 0x27eeb thru 0x2a516 (0x262c bytes)
>>> > > > ?******WWWW
>>> > > > 7830: 1257514978.307504 READ ? ? 0x2771b thru 0x319a8 (0xa28e bytes)
>>> > > > ?***RRRR***
>>> > > > Correct content saved for comparison
>>> > > > ...
>>> ? Hmm, how long does it take to reproduce? I'm running fsx-linux on tmpfs
>>> for a while on 2.6.27.21 and didn't hit the problem yet.
>>
>> I forget to mention that the test were done on an arm board with 64M ram.
>> I have tested fsx-linux again on pc, it seems that failure go away.
>>
>>>
>>> > > Are you sure that the LTP is correct? ?It wouldn't be the first time it
>>> > > wasn't...
>>> >
>>> > I'm afraid fsx usually finds bugs. ?I thought Jan Kara recently fixed
>>> > something here in ext3, does 2.6.32-rc work?
>>> ? Yeah, fsx usually finds bugs. Note that he sees the problem also on tmpfs
>>> so it's not ext3 problem. Anyway, trying to reproduce with 2.6.32-rc? would
>>> be interesting.
>>
>> Currently the arm board doesn't support 2.6.32-rc. But I test with 2.6.32-rc7
>> On my pc box, there's no failure so far.
>>
>>>
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? Honza
>>
>> I found this via google:
>> http://marc.info/?t=118026315000001&r=1&w=2
>>
>> I even tried the code from
>> http://marc.info/?l=linux-arch&m=118030601701617&w=2
>> I got mostly:
>> firstfirstfirst
>> firstfirstfirst
>> firstfirstfirst
>>
>>
>> No change after pass "MS_SYNC|MS_INVALIDATE" to msync and make the
>> flush_dcache_page() call unconditional in do_generic_mapping_read.
>> This behavior is different from what I read from the mail thread above.
>>
>>> void do_generic_mapping_read(struct address_space *mapping,
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?struct file_ra_state *_ra,
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?struct file *filp,
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?loff_t *ppos,
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?read_descriptor_t *desc,
>>> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?read_actor_t actor)
>>> {
>>> ...
>>> ? ? ? ? ? ? ? ? /* If users can be writing to this page using arbitrary
>>> ? ? ? ? ? ? ? ? ?* virtual addresses, take care about potential aliasing
>>> ? ? ? ? ? ? ? ? ?* before reading the page on the kernel side.
>>> ? ? ? ? ? ? ? ? ?*/
>>> ? ? ? ? ? ? ? ? if (1 || mapping_writably_mapped(mapping))
>>> ? ? ? ? ? ? ? ? ? ? ? ? flush_dcache_page(page);
>>
>> Then I run fsx-linux after the above modification, fsx-linux failed all
>> the same both on tmpfs and ext3
>

^ permalink raw reply	[flat|nested] 21+ messages in thread

end of thread, other threads:[~2009-11-20  3:43 UTC | newest]

Thread overview: 21+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-11-16  3:38 [BUG]2.6.27.y some contents lost after writing to mmaped file JiSheng Zhang
2009-11-16  3:38 ` JiSheng Zhang
2009-11-17  1:56 ` Greg KH
2009-11-17  1:56   ` Greg KH
2009-11-17 11:07   ` JiSheng Zhang
2009-11-17 11:07     ` JiSheng Zhang
2009-11-17 12:36   ` Chris Mason
2009-11-17 12:36     ` Chris Mason
2009-11-17 19:06     ` Jan Kara
2009-11-17 19:06       ` Jan Kara
2009-11-18 14:17       ` JiSheng Zhang
2009-11-18 14:17         ` JiSheng Zhang
2009-11-19  5:56         ` Fwd: " JiSheng Zhang
2009-11-19 14:43         ` Jan Kara
2009-11-19 14:43           ` Jan Kara
2009-11-19 15:25         ` Russell King - ARM Linux
2009-11-19 15:25           ` Russell King - ARM Linux
2009-11-20  3:41         ` JiSheng Zhang
2009-11-20  3:41           ` JiSheng Zhang
2009-11-20  3:41           ` Fwd: " JiSheng Zhang
2009-11-20  3:43           ` JiSheng Zhang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.