[OpenAFS-devel] AFS vs UNICODE

Dave Botsch botsch@cnf.cornell.edu
Tue, 6 May 2008 18:25:04 -0400


> 
> (1) The same text in Unicode can be represented by different sequences 
> of characters.  As a result you could have client A and client B both 
> create a file with the same name that can not be visually distinguished 
> by the end user.   Now which one do you open?

If everyone is using UTF-8 encoding, does the above problem still exist? And
by UTF-8, I mean the "real" UTF-8 encoding, not one of the many variants
(which would mean that the oafs client would have to do some translation)?

> 
> (2) Since the directory lookups are performed using a hash table, a file 
> with the name being searched for might exist but it cannot be found 
> because the input to the hash function on client B is different than the 
> input used to create the entry on client A.

It would be different because of #1 above?

> 
> Storing file names as opaque octet sequences is broken in other ways. 
> Depending on the character set used on the client the file name might or 
> might not be representable since the octet sequence contains no 
> indication whether the sequence is CP437, CP850, CP1252, ISO Latin-1,
> ISO-Latin-9, UTF-7, UTF-8, etc.
> 

So, if we know what sequence we're using...?

How do local filesystems handle this? I might very well create a file on my
ext3 filesystem usbkey with my locale set to ISO Latin-1 then try to access
it from another box with the charset set to UTF-16 (or something completely
different). Or maybe I named the file using some non-arabic character set?

I've read that Java uses a "modified UTF-8" which can cause there to be 6
instead of 4 octets per character... how does this not break other
applications from being able to access the files on the local filesystem?

> 
> 
> _______________________________________________
> OpenAFS-devel mailing list
> OpenAFS-devel@openafs.org
> https://lists.openafs.org/mailman/listinfo/openafs-devel
> 

-- 
********************************
David William Botsch
Programmer/Analyst
CNF Computing
botsch@cnf.cornell.edu
********************************