Date: Tue, 7 Jan 2020 14:32:33 +0100
From: Jan Kara
To: Pali Rohár
Cc: linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org,
    linux-ntfs-dev@lists.sourceforge.net, linux-cifs@vger.kernel.org,
    Alexander Viro, Luis de Bethencourt, Salah Triki, Steve French,
    OGAWA Hirofumi, Greg Kroah-Hartman, Thomas Gleixner, David Sterba,
    Dave Kleikamp, Anton Altaparmakov, Jan Kara, "Theodore Y. Ts'o",
    Eric Sandeen, Namjae Jeon, Pavel Machek, Christoph Hellwig
Subject: Re: Unification of filesystem encoding options
Message-ID: <20200107133233.GC25547@quack2.suse.cz>
References: <20200102211855.gg62r7jshp742d6i@pali>
In-Reply-To: <20200102211855.gg62r7jshp742d6i@pali>
X-Mailing-List: linux-fsdevel@vger.kernel.org

On Thu 02-01-20 22:18:55, Pali Rohár wrote:
> 1) Unify mount options for specifying charset.
>
> Currently all filesystems except msdos and hfsplus have mount option
> iocharset=. hfsplus has nls= and msdos does not
> implement re-encoding support. Plus vfat, udf and isofs have broken
> iocharset=utf8 option (but working utf8 option). And ntfs has deprecated
> iocharset= option.
>
> I would suggest following changes for unification:
>
> * Add a new alias iocharset= for hfsplus which would do same as nls=
> * Make iocharset=utf8 option for vfat, udf and isofs to do same as utf8
> * Un-deprecate iocharset= option for ntfs
>
> This would cause that all filesystems would have iocharset=
> option which would work for any charset, including iocharset=utf8.
> And it would fix also broken iocharset=utf8 for vfat, udf and isofs.

Makes sense to me.

> 2) Add support for Unicode code points above U+FFFF for filesystems
> befs, hfs, hfsplus, jfs and ntfs, so iocharset=utf8 option would work
> also with filenames in userspace which would be 4 bytes long UTF-8.

Also looks good, but when doing this, I'd suggest we extend NLS to support
full UTF-8 rather than implementing it by hand like, e.g., we did for UDF.

> 3) Add support for iocharset= and codepage= options for msdos
> filesystem. It shares lot of parts of code with vfat driver.

I guess this is for msdos filesystem maintainers to decide.

								Honza
-- 
Jan Kara
SUSE Labs, CR
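
To illustrate what point 2 involves, here is a minimal sketch of the conversion that any "full UTF-8" support has to perform for filesystems that store names as 16-bit code units: a surrogate pair on disk becomes one code point above U+FFFF, which in turn becomes a 4-byte UTF-8 sequence in userspace. This is only an illustration in Python, not the kernel NLS code or the UDF implementation; the function names `decode_utf16_units` and `encode_utf8` are hypothetical, and passing unpaired surrogates through unchanged is an assumption made for brevity.

```python
def decode_utf16_units(units):
    """Turn a list of 16-bit code units into a list of Unicode code points.

    A high surrogate (0xD800-0xDBFF) followed by a low surrogate
    (0xDC00-0xDFFF) is combined into one code point above U+FFFF.
    Assumption: unpaired surrogates are passed through unchanged.
    """
    code_points = []
    i = 0
    while i < len(units):
        unit = units[i]
        has_next = i + 1 < len(units)
        if 0xD800 <= unit <= 0xDBFF and has_next and 0xDC00 <= units[i + 1] <= 0xDFFF:
            low = units[i + 1]
            code_points.append(0x10000 + ((unit - 0xD800) << 10) + (low - 0xDC00))
            i += 2
        else:
            code_points.append(unit)
            i += 1
    return code_points


def encode_utf8(cp):
    """Encode a single code point (0..0x10FFFF) as UTF-8 bytes."""
    if cp < 0x80:
        return bytes([cp])
    if cp < 0x800:
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp < 0x10000:
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    # Code points above U+FFFF need the 4-byte form.
    return bytes([0xF0 | (cp >> 18),
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


# An on-disk name made of "a" followed by the surrogate pair for U+1F600.
on_disk = [0x0061, 0xD83D, 0xDE00]
name = b"".join(encode_utf8(cp) for cp in decode_utf16_units(on_disk))
```

A converter limited to U+FFFF would treat the two surrogates as separate characters; handling the pair in one shared place is the reason to put this into NLS rather than into each filesystem.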