I think all these references have been cited on the list before, but I'll
post them anyway.
RFC1641 "Using Unicode with MIME"
and
RFC1642 "A Mail-Safe Transformation Format of Unicode"
are cited in RFC1700 (Assigned Numbers) as the reference for some Unicode
character set names.
There is a "Unicode Home Page" at:
<URL:http://www.stonehand.com/unicode.html>.
It has _some_ version of the Unicode standard (I think), a glossary and a FAQ.
The page at:
<URL:http://www.stonehand.com/unicode/standard/principles.html>
looks especially interesting as it discusses various encodings of Unicode.
I don't know what ERCS is either. If I was guessing I'd say Extended
"Reference Concrete Syntax" (a la SGML).
And looking back in December I find:
Dec 12 Gavin wrote:
>There is a proposal going to SGML Open from a fellow in Australia that
>might be of interest to you. The proposal outlines an Extended
>Concrete Synatax that defines a 16bit CHARSET.
>
>The core concept is that at the lowest level in the parser, you have a
>"normalizer" which converts from the data storage format into the
>document character set. This is roughly akin to my proposal, but
>generalises it so that it *should* be possible to mix encodings, and
>character sets, and let the normaliser take care of all the nasty
>details.
I'm guessing the prior paper Gavin refers to is:
"A truly multilingual WWW" sent to http-wg and html-wg 12/22/94 and archived at:
http://www.acl.lanl.gov/HTML_WG/html-wg-94q4.messages/0576.html
Last round there were more follow-ups on the http-wg list than the html-wg list.
--- Albert Lunde Albert-Lunde@nwu.edu