Re: HTML Character Representation/Transmission Model

Gavin Nicol (gtn@ebt.com)
Tue, 11 Apr 95 12:43:46 EDT

>Gavin's attempts to use an SGML declaration which admitted non 8859-1
>characters to use for markup would have to be deferred to a later date.

Actually, I proposed using the ISO 8859-1 repertoire version of ERCS,
or we could also use the ISO 646 repertoire version. I will not push
this because this is a large change (and using the canonical ERCS is
even larger), of dubious long-term benefit.

>The real effects of this change would be to:
>
>(1) rationalize the use of numeric character references in a universal
>fashion (at least normatively speaking)

Yes.

>(2) provide a significant growth path to HTML applications that wish
>to begin exploiting non-Western European (8859-1) language capabilities,
>and do so in a standard fashion

Yes.

>(3) facilitate the use of DSSSL Lite and DSSSL (ISO/IEC 10179) which
>requires that all characters be expressable in terms of ISO/IEC 10646

Ahh. You caught this...

>(4) provide more consistency with newly developed national standards;
>e.g.,
>
> JIS X 0221 = Japanese National Standard based on ISO/IEC 10646

New information. This looks like an interesting development!

>(5) finally, this change would *not* necessarily change current
>behavior or practice

YES! This is a key feature of the solution. We get a good growth path
with little pain.

----
So is my central idea is reasonable? Can we think about putting it
into 2.1? Dan?