Why en_US.UTF-8 locale consider a < A?
Baptiste Daroussin
bapt at freebsd.org
Thu Mar 9 10:03:28 UTC 2017
On Wed, Mar 08, 2017 at 04:59:47PM +0100, Matthias Apitz wrote:
> El día Wednesday, March 08, 2017 a las 12:51:11AM -0800, Xin Li escribió:
>
> >
> >
> > On 3/8/17 00:40, Baptiste Daroussin wrote:
> > >> Is this result correct? It matches some Debian behavior but not macOS
> > >> behavior.
> > >
> > > Yes the result is correct, macOS does not have unicode collation if you want to
> > > match the macos behaviour you have to set LC_COLLATE=C
> >
> > Thanks, I also found this https://www.cl.cam.ac.uk/~mgk25/unicode.html
> > just for the record if someone else hits the same issue.
>
> I recently came across with a related problem and have two questions
> (unresolved until now):
>
> 1.
> Using sort, reading the man page of it, it should be sufficient to
> set LC_COLLATE correctly. It seems that setting LANG (or unsetting it)
> changes the sort Order, why?
This has been answered by someone else already.
>
> 2.
> Speaking about German Umlauts, should they be treated as their normal
> letters, i.e. 'ä' is like 'a', as one can read in Wiki, or how they are
> sorted exactly?
I don't know the details for this particular case, but we do take the data from
cldr (http://cldr.unicode.org/), so if you check there you will have your answer
Best regards,
Bapt
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: not available
URL: <http://lists.freebsd.org/pipermail/freebsd-hackers/attachments/20170309/c6f78647/attachment.sig>
More information about the freebsd-hackers
mailing list