Ctype patch for review
Taku YAMAMOTO
taku at tackymt.homeip.net
Tue Sep 18 19:12:10 PDT 2007
On Mon, 17 Sep 2007 21:16:33 +0400
Andrey Chernov <ache at nagual.pp.ru> wrote:
> On Tue, Sep 18, 2007 at 02:01:00AM +0900, YAMAMOTO, Taku wrote:
> > Checking for __mb_cur_max is not enough for certain locales.
> > For example, SJIS has following range for JIS X0201 (a.k.a. HALFWIDTH KANA).
> >
> > /*
> > * JIS X201
> > */
> > PUNCT 0xa1-0xa5
> > SPACE 0xa0
> > BLANK 0xa0
> > SPECIAL 0xa1-0xdf
> > PHONOGRAM 0xa6-0xdf
> > SWIDTH1 0xa0-0xdf
>
> I don't understand your remark. MSKanji have __mb_cur_max = 2 and so those
> ranges are wchar_t ranges. My patch restrict unsigned char ranges only.
These characters ARE single byte.
The problem is that a byte >= 0x80 does not always mean it composes a
multi-byte character in that locale.
--
-|-__ YAMAMOTO, Taku
| __ < <taku at tackymt.homeip.net>
- A chicken is an egg's way of producing more eggs. -
More information about the freebsd-i18n
mailing list