[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range"
- Reply: bugzilla-noreply_a_freebsd.org: "[Bug 272386] The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range"
- Go to: [ bottom of page ] [ top of archives ] [ this month ]
Date: Wed, 05 Jul 2023 14:51:52 UTC
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=272386 Bug ID: 272386 Summary: The iconv converter from EUC-JP to UTF-8 accepts second and third bytes outside of the valid range Product: Base System Version: 13.2-RELEASE Hardware: Any OS: Any Status: New Severity: Affects Only Me Priority: --- Component: bin Assignee: bugs@FreeBSD.org Reporter: bruno@clisp.org The structure of the EUC-JP encoding is explained in https://en.wikipedia.org/wiki/Extended_Unix_Code#EUC-JP . The second byte of a two- or three-bytes sequence must be in the range 0xA1..0xFE for the sequence to be valid. The third byte of a three-bytes sequence must be in the range 0xA1..0xFE for the sequence to be valid. So, bytes in the range 0x00..0x7F are only valid as the first byte. The FreeBSD 13.2 converter from EUC-JP to UTF-8 accepts bytes < 0x80 in these positions. This is not helpful, because it breaks detection of encodings by applications. How to reproduce: $ cc -Wall -o table-from table-from.c $ ./table-from EUC-JP > EUC-JP.TXT Attached are the actual and the expected EUC-JP.TXT. -- You are receiving this mail because: You are the assignee for the bug.