linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs
@ 2005-01-19  1:22 Daniel McNeil
  2005-01-19  2:31 ` Sami Farin
  0 siblings, 1 reply; 5+ messages in thread
From: Daniel McNeil @ 2005-01-19  1:22 UTC (permalink / raw)
  To: Andrew Morton; +Cc: linux-aio, Linux Kernel Mailing List

Andrew,

This is a patch to generic_file_buffered_write() to correctly
handle partial O_DIRECT writes (because of unallocated blocks)
when there is more than 1 iovec.  Without this patch, the code is
writing the wrong iovec (it writes the first iovec a 2nd time).

Included is a test program dio_bug.c that shows the problem by:
	writing 4k to offset 4k
	writing 4k to offset 12k
	writing 8k to offset 4k
The result is that 8k write writes the 1st 4k of the buffer twice.

$ rm f; ./dio_bug f
wrong value offset 8k expected 0x33 got 0x11
wrong value offset 10k expected 0x44 got 0x22

with patch
$ rm f; ./dio_bug f

Here's the patch:

--- linux-2.6.10.orig/mm/filemap.c	2005-01-18 15:32:52.531207134 -0800
+++ linux-2.6.10/mm/filemap.c	2005-01-18 15:32:09.252319333 -0800
@@ -1908,7 +1908,16 @@ generic_file_buffered_write(struct kiocb
 
 	pagevec_init(&lru_pvec, 0);
 
-	buf = iov->iov_base + written;	/* handle partial DIO write */
+	/*
+	 * handle partial DIO write.  Adjust cur_iov if needed.
+	 */
+	if (likely(nr_segs == 1))
+		buf = iov->iov_base + written;
+	else {
+		filemap_set_next_iovec(&cur_iov, &iov_base, written);
+		buf = iov->iov_base + iov_base;
+	}
+
 	do {
 		unsigned long index;
 		unsigned long offset;
 

Here is the test program:
#define _GNU_SOURCE
#include <sys/types.h>
#include <unistd.h>
#include <stdlib.h>
#include <fcntl.h>
#include <sys/uio.h>

main(int argc, char **argv)
{
	int fd;
	char *buf;
	int i;
	struct iovec v[2];

	fd = open(argv[1], O_DIRECT|O_RDWR|O_CREAT, 0666);

	if (fd < 0) {
		perror("open");
		exit(1);
	}

	buf = valloc(8192);

	lseek(fd, 0x1000, SEEK_SET);
	memset(buf, 0x11, 2048);
	memset(buf+2048, 0x22, 2048);
	i = write(fd, buf, 4096);	/* 4k write of 0x11 and 0x22 at 4k */

	lseek(fd, 0x3000, SEEK_SET);
	memset(buf, 0x55, 2048);
	memset(buf+2048, 0x66, 2048);
	i = write(fd, buf, 4096);	/* 4k write of 0x55 and 0x66 at 12k */

	lseek(fd, 0x1000, SEEK_SET);
	i = read(fd, buf, 4096);
	memset(buf+4096, 0x33 , 2048);
	memset(buf+4096+2048, 0x44 , 2048);

	v[0].iov_base = buf;
	v[0].iov_len = 4096;
	v[1].iov_base = buf + 4096;
	v[1].iov_len = 4096;
	lseek(fd, 0x1000, SEEK_SET);
	i = writev(fd, v, 2);	/* 8k write of 0x11, 0x22, 0x33, 0x44 at 4k */

	lseek(fd, 0x2000, SEEK_SET);
	i = read(fd, buf, 4096);
	if (buf[0] != 0x33)
		printf("wrong value offset 8k expected 0x33 got 0x%x\n",
			buf[0]);
	if (buf[2048] != 0x44)
		printf("wrong value offset 10k expected 0x44 got 0x%x\n",
			buf[2048]);
	
}



^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs
  2005-01-19  1:22 [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs Daniel McNeil
@ 2005-01-19  2:31 ` Sami Farin
  2005-01-19 16:55   ` Daniel McNeil
  0 siblings, 1 reply; 5+ messages in thread
From: Sami Farin @ 2005-01-19  2:31 UTC (permalink / raw)
  To: Linux Kernel Mailing List

On Tue, Jan 18, 2005 at 05:22:44PM -0800, Daniel McNeil wrote:
> Andrew,
> 
> This is a patch to generic_file_buffered_write() to correctly
> handle partial O_DIRECT writes (because of unallocated blocks)
> when there is more than 1 iovec.  Without this patch, the code is
> writing the wrong iovec (it writes the first iovec a 2nd time).
> 
> Included is a test program dio_bug.c that shows the problem by:
> 	writing 4k to offset 4k
> 	writing 4k to offset 12k
> 	writing 8k to offset 4k
> The result is that 8k write writes the 1st 4k of the buffer twice.
> 
> $ rm f; ./dio_bug f
> wrong value offset 8k expected 0x33 got 0x11
> wrong value offset 10k expected 0x44 got 0x22
> 
> with patch
> $ rm f; ./dio_bug f

I have Linux 2.6.10-ac9 + bio clone memory corruption -patch,
and dio_bug does not give errors (without your patch).

-- 

^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs
  2005-01-19  2:31 ` Sami Farin
@ 2005-01-19 16:55   ` Daniel McNeil
  2005-01-19 17:28     ` Sami Farin
  0 siblings, 1 reply; 5+ messages in thread
From: Daniel McNeil @ 2005-01-19 16:55 UTC (permalink / raw)
  To: Sami Farin; +Cc: Linux Kernel Mailing List

On Tue, 2005-01-18 at 18:31, Sami Farin wrote:
> On Tue, Jan 18, 2005 at 05:22:44PM -0800, Daniel McNeil wrote:
> > Andrew,
> > 
> > This is a patch to generic_file_buffered_write() to correctly
> > handle partial O_DIRECT writes (because of unallocated blocks)
> > when there is more than 1 iovec.  Without this patch, the code is
> > writing the wrong iovec (it writes the first iovec a 2nd time).
> > 
> > Included is a test program dio_bug.c that shows the problem by:
> > 	writing 4k to offset 4k
> > 	writing 4k to offset 12k
> > 	writing 8k to offset 4k
> > The result is that 8k write writes the 1st 4k of the buffer twice.
> > 
> > $ rm f; ./dio_bug f
> > wrong value offset 8k expected 0x33 got 0x11
> > wrong value offset 10k expected 0x44 got 0x22
> > 
> > with patch
> > $ rm f; ./dio_bug f
> 
> I have Linux 2.6.10-ac9 + bio clone memory corruption -patch,
> and dio_bug does not give errors (without your patch).

I should have mentioned that my testing was on ext3 with 4k
block size.   The bio clone patch might affect this by merging
the i/o into a single iovec.  Here's an updated test program
that uses 2 different buffers allocated separately that might
prevent the merging.  See if this works on your system.

#define _GNU_SOURCE
#include <sys/types.h>
#include <unistd.h>
#include <stdlib.h>
#include <fcntl.h>
#include <sys/uio.h>

main(int argc, char **argv)
{
	int fd;
	char *buf;
	char *buf2;
	int i;
	struct iovec v[2];

	fd = open(argv[1], O_DIRECT|O_RDWR|O_CREAT, 0666);

	if (fd < 0) {
		perror("open");
		exit(1);
	}

	buf = valloc(8192);
	buf2 = valloc(8192);

	lseek(fd, 0x1000, SEEK_SET);
	memset(buf, 0x11, 2048);
	memset(buf+2048, 0x22, 2048);
	i = write(fd, buf, 4096);	/* 4k write of 0x11 and 0x22 at 4k */

	lseek(fd, 0x3000, SEEK_SET);
	memset(buf, 0x55, 2048);
	memset(buf+2048, 0x66, 2048);
	i = write(fd, buf, 4096);	/* 4k write of 0x55 and 0x66 at 12k */

	lseek(fd, 0x1000, SEEK_SET);
	i = read(fd, buf, 4096);
	memset(buf2, 0x33 , 2048);
	memset(buf2+2048, 0x44 , 2048);

	v[0].iov_base = buf;
	v[0].iov_len = 4096;
	v[1].iov_base = buf2;
	v[1].iov_len = 4096;
	lseek(fd, 0x1000, SEEK_SET);
	i = writev(fd, v, 2);	/* 8k write of 0x11, 0x22, 0x33, 0x44 at 4k */

	lseek(fd, 0x2000, SEEK_SET);
	i = read(fd, buf, 4096);
	if (buf[0] != 0x33)
		printf("wrong value offset 8k expected 0x33 got 0x%x\n",
			buf[0]);
	if (buf[2048] != 0x44)
		printf("wrong value offset 10k expected 0x44 got 0x%x\n",
			buf[2048]);
	
}

Thanks,

Daniel


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs
  2005-01-19 16:55   ` Daniel McNeil
@ 2005-01-19 17:28     ` Sami Farin
  2005-01-19 21:06       ` Daniel McNeil
  0 siblings, 1 reply; 5+ messages in thread
From: Sami Farin @ 2005-01-19 17:28 UTC (permalink / raw)
  To: Linux Kernel Mailing List

On Wed, Jan 19, 2005 at 08:55:40AM -0800, Daniel McNeil wrote:
> On Tue, 2005-01-18 at 18:31, Sami Farin wrote:
...
> > I have Linux 2.6.10-ac9 + bio clone memory corruption -patch,
> > and dio_bug does not give errors (without your patch).
> 
> I should have mentioned that my testing was on ext3 with 4k
> block size.   The bio clone patch might affect this by merging
> the i/o into a single iovec.  Here's an updated test program
> that uses 2 different buffers allocated separately that might
> prevent the merging.  See if this works on your system.

I have reiserfs... and this version does not give errors, either. 

-- 


^ permalink raw reply	[flat|nested] 5+ messages in thread

* Re: [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs
  2005-01-19 17:28     ` Sami Farin
@ 2005-01-19 21:06       ` Daniel McNeil
  0 siblings, 0 replies; 5+ messages in thread
From: Daniel McNeil @ 2005-01-19 21:06 UTC (permalink / raw)
  To: Sami Farin; +Cc: Linux Kernel Mailing List

On Wed, 2005-01-19 at 09:28, Sami Farin wrote:
> On Wed, Jan 19, 2005 at 08:55:40AM -0800, Daniel McNeil wrote:
> > On Tue, 2005-01-18 at 18:31, Sami Farin wrote:
> ...
> > > I have Linux 2.6.10-ac9 + bio clone memory corruption -patch,
> > > and dio_bug does not give errors (without your patch).
> > 
> > I should have mentioned that my testing was on ext3 with 4k
> > block size.   The bio clone patch might affect this by merging
> > the i/o into a single iovec.  Here's an updated test program
> > that uses 2 different buffers allocated separately that might
> > prevent the merging.  See if this works on your system.
> 
> I have reiserfs... and this version does not give errors, either. 

Reisefs must handle allocations differently.  The bug only
shows up if there is an unallocated hole and a O_DIRECT write with
multiple iovecs writes partially into the allocated space 
and completes at least 1 iovec worth of data and then finishes
the i/o using buffer i/o to allocate space for the hole and write
the remaining data.

Daniel



^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2005-01-19 21:06 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2005-01-19  1:22 [PATCH - 2.6.10] generic_file_buffered_write handle partial DIO writes with multiple iovecs Daniel McNeil
2005-01-19  2:31 ` Sami Farin
2005-01-19 16:55   ` Daniel McNeil
2005-01-19 17:28     ` Sami Farin
2005-01-19 21:06       ` Daniel McNeil

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).