CLDR import for src/share/*def definitions
Edwin Groothuis
edwin at mavetju.org
Wed Jul 1 22:37:28 UTC 2009
I have been playing with the CLDR database to see if I can get the
monetary, time, messages and numerical definitions right. The CLDR
is in UTF-8, I use iconv to translate to other charactersets.
So far most of it is fine, except (subset of issues):
- A couple of languages are not known (es_FR, es_IT)
- A couple of languages have a different abbrevation
no_NO -> nb_NO nn_NO
*_YU -> *_RS
- A couple of charactersets are not known to iconv:
(CP1131 ISCII-DECV)
- A couple of translations went wrong:
Writing to fi_FI in ISO8859-1
Could not convert currency_symbol from UTF-8 to ISO8859-1
- It is not clear what the difference between "Long month names (as
in a date)" and "Long month names (without case ending)" is. (could
be my language problem :-)
The biggest problem so far is not a technical: WHich data is more
authoritative - The one in the CLDR database or the one we have
collected over the years from various sources and people?
Another problem I'm facing is that there is little documentation
on what the format of the *def/ files is, it is mostly a UTSL
approach in lib/libc/locale, but that doesn't show me neither if I
can safely replace (for example in uk_UA)
# yesstr
-<E2><D0><DA>
+<E2><D0><DA>:<E2>:<C2><B0><BA>:<C2>:yes:y:YES:Y
So euhm... Is there anybody who wants to give their opinion or
wisdom about things, please speak up, I need it :-)
Edwin
--
Edwin Groothuis Website: http://www.mavetju.org/
edwin at mavetju.org Weblog: http://www.mavetju.org/weblog/
More information about the freebsd-i18n
mailing list