[Bug 212125] korean/man-doc - Encoding errors in manual pages

bugzilla-noreply at freebsd.org
Thu Aug 25 01:22:13 UTC 2016


https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=212125

            Bug ID: 212125
           Summary: korean/man-doc - Encoding errors in manual pages
           Product: Ports & Packages
           Version: Latest
          Hardware: Any
                OS: Any
            Status: New
          Severity: Affects Only Me
          Priority: ---
         Component: Individual Port(s)
          Assignee: freebsd-ports-bugs at FreeBSD.org
          Reporter: benkasminbullock at gmail.com

The Korean-language manual pages contain multiple encoding errors. Running the
following Perl script from the directory that holds the man* subdirectories
reports 35 files that iconv cannot convert cleanly from EUC-KR to UTF-8:

-----

use warnings;
use strict;
use utf8;
use FindBin '$Bin';
my @files = <$Bin/man*/*.gz>;
my $fakefile = 'guff';
for my $file (@files) {

    # Using /dev/null results in "Inappropriate ioctl for device" errors
    # from iconv.
    my $status = system ("gzcat $file | iconv -f \"EUC-KR\" -t \"UTF-8\" > $fakefile");
    if ($status) {
        print "Bad file $file.\n";
    }
}
unlink $fakefile or die $!;

-----

iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/gcc.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/gdb.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/patch.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/ps.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/sed.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/systat.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/tcsh.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/top.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man1/troff.1.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/kldload.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/mount.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man2/unmount.2.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/siginterrupt.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tempnam.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tmpfile.3.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man3/tmpnam.3.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/atkbd.4.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/psm.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/speaker.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/spkr.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/vinum.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man4/vn.4.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/fstab.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/printcap.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/rc.conf.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/termcap.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man5/terminfo.5.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man7/groff.7.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man7/tuning.7.gz.
iconv: warning: invalid characters: 2
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/dhclient.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/fixmount.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/mountd.8.gz.
iconv: warning: invalid characters: 1
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/moused.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/sysinstall.8.gz.
iconv: iconv(): Illegal byte sequence
Bad file /usr/home/ben/projects/manpages/manserver/man/ko/man8/vinum.8.gz.

-----
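
The same check can probably be done inside Perl, which avoids both the
temporary file and the /dev/null problem noted in the script's comment. The
following is only a sketch: the helper name gz_is_valid_euckr is made up for
illustration, and it assumes that Perl's Encode module and iconv agree on what
counts as valid EUC-KR, which may not hold for every byte sequence.

```perl
use strict;
use warnings;
use Encode qw(decode);
use IO::Uncompress::Gunzip qw(gunzip $GunzipError);

# Hypothetical helper, not part of the original script.
# Returns 1 if the gzipped file at $path decodes cleanly as EUC-KR, else 0.
sub gz_is_valid_euckr {
    my ($path) = @_;
    my $bytes;
    # Decompress the whole file into memory; manual pages are small.
    gunzip($path => \$bytes)
        or die "gunzip failed for $path: $GunzipError\n";
    # FB_CROAK makes decode() die on the first byte sequence it cannot map.
    # decode() may modify its input when a CHECK value is given; $bytes is
    # a private copy, so that is harmless here.
    return eval { decode('euc-kr', $bytes, Encode::FB_CROAK); 1 } ? 1 : 0;
}

# Check every file named on the command line.
for my $file (@ARGV) {
    print "Bad file $file.\n" unless gz_is_valid_euckr($file);
}
```

It would be run with the compressed pages as arguments, for example with
man*/*.gz expanded by the shell.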

I confirmed by hand that the Korean gcc manual page does contain an error:
line 1875 contains the bytes \x82 and \xC1, which are not valid EUC-KR. I don't
speak Korean, so I cannot suggest which characters were intended there. I have
not confirmed the encoding errors in the other pages.
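
For anyone who wants to locate the offending bytes in the remaining pages, a
rough sketch follows. The function name find_bad_euckr is hypothetical. It
performs a structural check only: a byte is accepted if it is ASCII or part of
a pair in which both bytes lie in 0xA1-0xFE. iconv may reject more than this
(for example unassigned code points), so a clean result here does not prove a
file is valid. Only the first problem on each line is reported, because byte
pairing after an error is unreliable.

```perl
use strict;
use warnings;

# Hypothetical helper: scan raw bytes for the first position on each line
# that cannot be EUC-KR. Returns a list of [line number, byte offset within
# the line, hex dump of up to two bytes starting at that offset].
sub find_bad_euckr {
    my ($bytes) = @_;
    my @bad;
    my $line_no = 0;
    for my $line (split /\n/, $bytes, -1) {
        $line_no++;
        pos($line) = 0;
        # Consume valid units (ASCII or a two-byte pair) until one fails;
        # the /c flag keeps pos() at the point of failure.
        1 while $line =~ /\G(?:[\x00-\x7F]|[\xA1-\xFE][\xA1-\xFE])/gc;
        my $offset = pos($line) // 0;
        next if $offset == length $line;    # whole line consumed cleanly
        my $hex = join ' ',
            map { sprintf '%02X', ord($_) } split //, substr($line, $offset, 2);
        push @bad, [ $line_no, $offset, $hex ];
    }
    return @bad;
}

# Report problems in each (already decompressed) file named on the command line.
for my $file (@ARGV) {
    open my $fh, '<:raw', $file or die "Cannot open $file: $!\n";
    my $bytes = do { local $/; <$fh> };
    close $fh;
    printf "%s line %d, offset %d: %s\n", $file, @$_
        for find_bad_euckr($bytes // '');
}
```

The input must be the decompressed page, for example the output of gzcat
saved to a file.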
