[RFC] Filesystem name storage (Was: A Great Idea (tm) about reimplementing NLS.)

On Jun 15, 2005, at 21:55:04, Patrick McFarland wrote:

On Wednesday 15 June 2005 05:13 am, Denis Vlasenko wrote:
I do not understand how this is going to look from userspaceperspective.
Can you give examples how this will work?
IMHO, he means that the userspace would only see Unicode filenames,and theuserspace could only give Unicode names back to the kernel. Thekernel, using
this global NLS layer would translate back and forth, and the userland
wouldn't know about it.
Its basically the only sane way to approach the problem of gettingthe entire
Linux community to convert to Unicode.

Would the following system for filenames resolve most of the issuespeople

are raising:

First load charset tables into the kernel. These would be stored infiles inuserspace and could be easily updated, renamed, deleted, etc. Such atablewould always be a translation from Unicode <=> Charset. A kernelwith thissystem built in would understand natively "raw", "utf8", "utf16", and"utf32",

anything else would need loaded charset tables.

The following mount options would available:
  nls_raw=(0|1)  [default 1]:

This would cause Linux to pass all chars through unmolested.This modeworks well on multiuser systems where users want to use theirown NLStools, or where the whole system uses UTF-8, including thefilesystems.This is backwards compatible with the way Linux currentlypresents most(all?) filesystems. If the options "nls_disk" or "nls_user" areused,

    then this option is forced to be zero.
  nls_disk=<string-charset>

This specifies the underlying charset which should be used onthe disk

    or filesystem itself.  This may be "negotiate" for any filesystems

which support NLS *and* can identify which charset is in use.Built inoptions are "utf8", "utf16", and "utf32". Defaults to"negotiate" if

    available otherwise "utf8", but only defaults if "nls_raw" is 0.
  nls_user=<string-charset>

This specifies the charset which should be presented to theuser. This

    may be used to allow a backwards compatibility (IE: A program wants

ISO8859-1, but the admin wants the underlying filesystem to useUTF-8.Built in options are "utf8", "utf16", and "utf32". Defaults to"utf8"

    if "nls_raw" is 0.

The end result is that specifying either nls_disk or nls_user willturn on

automatic NLS conversion, with the unspecified nls_ option being utf8.

If these options are used on bind mounts, they should override theunderlyingfilesystem's mount options (Instead of stacking). This will allowthe admin

to specify:

# mount -t ext3 -o nls_disk=utf8,nls_user=utf8 /dev/hdb /mnt

# mount --bind -o nls_disk=utf8,nls_user=iso8850-1 /mnt/mail /var/spool/mail


if he/she wants to provide backwards compatibility with a legacy mail
spooling program.  Note: A part of each translation table would be an

entry for "Unspecified character", such that any UTF-8 character notmappedin the table could be translated to a sane default, such as '?'. Ifnamescollide under such translation, the kernel would need a way to keeptrack ofthe collisions (Appended numbers?) and properly re-resolve them whenasked.


Cheers,
Kyle Moffett

-----BEGIN GEEK CODE BLOCK-----
Version: 3.12
GCM/CS/IT/U d- s++: a18 C++++>$ UB/L/X/*++++(+)>$ P+++(++++)>$
L++++(+++) E W++(+) N+++(++) o? K? w--- O? M++ V? PS+() PE+(-) Y+

PGP+++ t+(+++) 5 X R? tv-(--) b++++(++) DI+ D+ G e->++++$ h!*()>++$r !y?(-)

------END GEEK CODE BLOCK------

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [email protected]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Follow-Ups:
- Re: [RFC] Filesystem name storage (Was: A Great Idea (tm) about reimplementing NLS.)
  - From: Lukasz Stelmach <[email protected]>

References:
- A Great Idea (tm) about reimplementing NLS.
  - From: Alexey Zaytsev <[email protected]>
- Re: A Great Idea (tm) about reimplementing NLS.
  - From: Denis Vlasenko <[email protected]>
- Re: A Great Idea (tm) about reimplementing NLS.
  - From: Patrick McFarland <[email protected]>

Prev by Date: [PATCH linux-2.6.12-rc6-mm1] blk: cfq_find_next_crq fix
Next by Date: Re: [patch 2.6.12-rc3] Adds persistent entryies using request_firmware_nowaitManuel Estrada Sainz <[email protected]>,
Previous by thread: Re: A Great Idea (tm) about reimplementing NLS.
Next by thread: Re: [RFC] Filesystem name storage (Was: A Great Idea (tm) about reimplementing NLS.)
Index(es):
- Date
- Thread

[Index of Archives] [Kernel Newbies] [Netfilter] [Bugtraq] [Photo] [Stuff] [Gimp] [Yosemite News] [MIPS Linux] [ARM Linux] [Linux Security] [Linux RAID] [Video 4 Linux] [Linux for the blind] [Linux Resources]