[Bug 212125] korean/man-doc - Encoding errors in manual pages
bugzilla-noreply at freebsd.org
Thu Aug 25 01:22:13 UTC 2016
https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=212125
Bug ID: 212125
Summary: korean/man-doc - Encoding errors in manual pages
Product: Ports & Packages
Version: Latest
Hardware: Any
OS: Any
Status: New
Severity: Affects Only Me
Priority: ---
Component: Individual Port(s)
Assignee: freebsd-ports-bugs at FreeBSD.org
Reporter: benkasminbullock at gmail.com
The Korean-language manual pages contain multiple encoding errors. Running the
following Perl script on them flags 35 files that iconv cannot convert cleanly
from EUC-KR to UTF-8:
-----
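use warnings;
use strict;
use utf8;
use FindBin '$Bin';

# Compressed manual pages in the man1/, man2/, ... directories next to this script.
my @files = <$Bin/man*/*.gz>;
# Scratch file that receives the converted output.
my $fakefile = 'guff';
for my $file (@files) {
    # Using /dev/null results in "Inappropriate ioctl for device" errors
    # from iconv.
    # The status is that of iconv, the last command in the pipeline;
    # non-zero means the input could not be converted from EUC-KR.
    my $status = system ("gzcat '$file' | iconv -f EUC-KR -t UTF-8 > $fakefile");
    if ($status) {
        print "Bad file $file.\n";
    }
}
unlink $fakefile or die $!;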
-----
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/gcc.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/gdb.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/patch.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/ps.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/sed.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/systat.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/tcsh.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/top.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/troff.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/kldload.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/mount.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/unmount.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/siginterrupt.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tempnam.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tmpfile.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tmpnam.3.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/atkbd.4.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/psm.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/speaker.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/spkr.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/vinum.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/vn.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/fstab.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/printcap.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/rc.conf.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/termcap.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/terminfo.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man7/groff.7.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man7/tuning.7.gz.
iconv: warning: invalid characters: 2
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/dhclient.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/fixmount.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/mountd.8.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/moused.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/sysinstall.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/vinum.8.gz.
-----
I confirmed by hand that the Korean gcc manual page does contain an error:
line 1875 contains the bytes \x82 and \xC1, which do not map to any EUC-KR
character. I don't speak Korean, so I cannot suggest which characters were
intended here. I have not individually confirmed the encoding errors in the
other pages.
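
For anyone who wants to locate the offending bytes in the other pages, the
following is a minimal sketch, not part of the original report. It is written
in Python rather than Perl and assumes that Python's built-in "euc_kr" codec
is an acceptable stand-in for iconv's EUC-KR tables (the two may disagree on
edge cases). The function names find_invalid_euckr and scan_gz are
hypothetical, chosen only for illustration.

```python
import gzip


def find_invalid_euckr(data: bytes):
    """Return (line_number, byte_offset, byte_value) for each byte that
    Python's euc_kr codec refuses to decode.

    Assumption: Python's "euc_kr" codec approximates iconv's EUC-KR.
    """
    problems = []
    pos = 0
    while pos < len(data):
        try:
            # Decode the remainder; success means no further errors.
            data[pos:].decode("euc_kr")
            break
        except UnicodeDecodeError as err:
            # err.start is relative to the slice, so add pos back.
            bad = pos + err.start
            # Lines are 1-based: count newlines before the bad byte.
            line_no = data.count(b"\n", 0, bad) + 1
            problems.append((line_no, bad, data[bad]))
            # Resume just past the rejected byte.
            pos = bad + 1
    return problems


def scan_gz(path: str):
    """Decompress a .gz manual page and scan it (hypothetical helper)."""
    with gzip.open(path, "rb") as handle:
        return find_invalid_euckr(handle.read())
```

Calling scan_gz on one of the files listed above, for example gcc.1.gz,
should return a list of line numbers and byte values that can then be
inspected by someone who reads Korean. An empty list means the codec accepted
the whole file.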
--
You are receiving this mail because:
You are the assignee for the bug.