Re: Revised language on: ISO/IEC 10646 as Document Character Set

Terry Allen (terry@ora.com)
Thu, 11 May 95 12:20:14 EDT

Bert:
>I know there are many more characters
then there are in Unicode, but wouldn't it suffice to define character
entities if/when they're needed? For some alphabets (Glagolithic, for
example), such entities already exist.

I agree with the sentiment, but at the moment you can't declare
those entities in HTML because whatever doctype decl you send,
with an internal subset declaring them, should be discarded
and replaced with the doctype decl supplied by the spec (which
is okay by me ad interim).

Unicode for HTML is being put forward as a practical engineering
solution for most cases, and we shouldn't worry too much about
marginal ones. Over in MIMESGML these are exactly the issues
we have to take on, because there's no way 10646 is going to
be declared as the document charset for SGML over the Net; it will
be one possibility, but only one among many.

Regards,

-- 
Terry Allen  (terry@ora.com)   O'Reilly & Associates, Inc.
Editor, Digital Media Group    101 Morris St.
			       Sebastopol, Calif., 95472
occasional column at:  http://gnn.com/meta/imedia/webworks/allen/

A Davenport Group sponsor. For information on the Davenport Group see ftp://ftp.ora.com/pub/davenport/README.html or http://www.ora.com/davenport/README.html