From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1424662Ab2LGWCq (ORCPT ); Fri, 7 Dec 2012 17:02:46 -0500 Received: from mx1.redhat.com ([209.132.183.28]:19063 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753493Ab2LGWCp (ORCPT ); Fri, 7 Dec 2012 17:02:45 -0500 Message-ID: <50C26778.7000500@redhat.com> Date: Fri, 07 Dec 2012 17:02:32 -0500 From: Ric Wheeler User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:17.0) Gecko/17.0 Thunderbird/17.0 MIME-Version: 1.0 To: "Theodore Ts'o" , Chris Mason , Chris Mason , Linus Torvalds , Ingo Molnar , Christoph Hellwig , Martin Steigerwald , Linux Kernel Mailing List , Dave Chinner , linux-fsdevel Subject: Re: [PATCH, 3.7-rc7, RESEND] fs: revert commit bbdd6808 to fallocate UAPI References: <201212051148.28039.Martin@lichtvoll.de> <20121206120532.GA14100@infradead.org> <20121207011628.GB16373@gmail.com> <50C22923.90102@redhat.com> <20121207190306.GB14972@shiny> <20121207204325.GC29435@thunk.org> <20121207210932.GA25713@shiny> <50C262AE.5060701@redhat.com> <20121207215727.GF29435@thunk.org> In-Reply-To: <20121207215727.GF29435@thunk.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 12/07/2012 04:57 PM, Theodore Ts'o wrote: > On Fri, Dec 07, 2012 at 04:42:06PM -0500, Ric Wheeler wrote: >> The other things that I think we should try would be to convert over >> larger chunks as we discussed on the list back in the summer (just >> because the user writes 4KB does not mean that we cannot flip over >> 1MB and zero that). > Writing a megabyte is not free. If you assume that your HDD has a > sustained write throughput of 100-125 MB/s, writing a megabyte will > take 8-10ms. It might be a win if you amortize it over a large number > of writes, but it doesn't help your 99.9 percentile latency numbers. > (99.9 percentile latency numbers matters because eventually you'll > have a user request which hits multiple serial long latency > operations, and then the delay looks **really** user visible.) > > - Ted Writing 4KB at a time to a disk cost XX units of time. Writing to the same sector (especially for a HDD), cost XX units + a small amount. I suggest that we try it out. For SSD's, much better to use specific HW offload commands if possible like WRITE_SAME (zeroed) or UNMAP/TRIM to get that performance boost since no actual data is moved... ric