Re: How to specify 10646 as document char set

Glenn Adams (glenn@stonehand.com)
Mon, 1 May 95 13:10:55 EDT

From: erik@netscape.com (Erik van der Poel)
Date: Mon, 01 May 95 09:43:25 -0700

>Other additional changes can be made in the future that will facilitiate
>greater use of 10646, e.g., the use of characters outside of the ASCII
>repertoire for markup.

Can you give examples of such markup (I'm curious).

For example, you may wish to extend SEPCHAR as follows:

"NO-BREAK-SPACE" SEPCHAR 160
"EN-QUAD" SEPCHAR 8192
"EM-QUAD" SEPCHAR 8193
"EN-SPACE" SEPCHAR 8194
"EM-SPACE" SEPCHAR 8195
"THREE-PER-EM-SPACE" SEPCHAR 8196
"FOUR-PER-EM-SPACE" SEPCHAR 8197
"SIX-PER-EM-SPACE" SEPCHAR 8198
"FIGURE-SPACE" SEPCHAR 8199
"PUNCTUATION-SPACE" SEPCHAR 8200
"THIN-SPACE" SEPCHAR 8201
"HAIR-SPACE" SEPCHAR 8202
"ZERO-WIDTH-SPACE" SEPCHAR 8203
"IDEOGRAPHIC-SPACE" SEPCHAR 12288
"ZERO-WIDTH-NO-BREAK-SPACE" SEPCHAR 65279

You might wish to allow localization of names so that you can use
non-ASCII characters to specify element type names, attribute names,
name tokens used as attribute values, etc. Such an extension, though
highly desirable for SGML in general, probably won't be an issue for
HTML.

Glenn