converting UTF-8 to HTML

Lars Eighner lars at larseighner.com
Sun Apr 22 14:30:26 UTC 2012


On Sun, 22 Apr 2012, Matthew Seaman wrote:

> On 22/04/2012 10:17, Erik Nørgaard wrote:
>> UTF-8 is variable with, ascii characters are stored as single bytes (not
>> sure about iso-8859-1) while other characters are stored as two byte chars.
>
> ascii uses the low 128 values that you can assign to an unsigned char,
> ie. those where the high-order bit is zero.
>
> Programming a text-only display to assume
> everything is UTF-8 would be quite viable, and backwardly compatible
> with ascii-only displays.


The hardware doesn't exist to display UTF-8 characters in text MODE.  The
whole point of avoiding GUIs is rasterized and GUI fonts cannot put 4000
characters on a screen as legibly as VGA does (not to mention the
performance hit the rasterization and GUIs deliver).

One look at recent Linux distributions which make it all but impossible to
reach text MODE because they had the thought that sticking a rasterized
white-on-black font on the screen (via yet another kernel module) would be
"just as good" as VGA should amply demonstrate the point.  Yeah, you need
that crap if you are running a server in Outer Fubaristan where there are 38
languages written in 49 different alphabets -- but crippling text mode is
not worth while for most people, especially people who work in text.


-- 
Lars Eighner
http://www.larseighner.com/index.html
8800 N IH35 APT 1191 AUSTIN TX 78753-5266


More information about the freebsd-questions mailing list