tr(1) and LANG=de_DE.UTF-8
Matthias Apitz
guru at unixarea.de
Sun Nov 1 17:16:28 UTC 2015
El día Sunday, November 01, 2015 a las 04:21:25PM +0000, Christian Weisgerber escribió:
> On 2015-10-29, Matthias Apitz <guru at unixarea.de> wrote:
>
> > I know that the man page of tr(1) contains a hint about the LANG and
> > environment(7), but would not expect that this means that I can't change
> > a single byte, octal given value, only for the reason that \357 is not a valid
> > Unicode code point.
> >
> > Any ideas/comments on this?
>
> POSIX jabbers something about "multi-character collating elements",
> which seems related, but I don't understand it.
I think, the truth is (as always) in the man page, which reads from
tr(1):
...
In the first synopsis form, the characters in string1 are translated
into the characters in string2
...
i.e. it speaks about 'characters' and not 'bytes' and the char given by
me was not a valid UTF-8 character.
matthias
--
Matthias Apitz, ✉ guru at unixarea.de, 🌐 http://www.unixarea.de/ ☎ +49-176-38902045
More information about the freebsd-questions
mailing list