There's another case, too: try out the following in various browsers:
é é. érunon
Netscape, for example, renders érunon as e'runon, where e' is
a properly accented e. Given that the universe of character entities
is supposed to be closed in HTML, and the particular values it
encompasses now, that's a reasonable mode of error
recovery (probably fortuitous, but anyway).
But I agree with Dan that error recovery is for the developer to
work out, not something that goes in the spec.
However, looking to the future and the above case, there exist
ISO character entity names that are truncations of longer ones:
iso-num.gml:<!ENTITY plus SDATA "[plus ]"--=plus sign B:-- >
iso-num.gml:<!ENTITY plusmn SDATA "[plusmn]"--/pm B: =plus-or-minus sign-->
so one might want to consider that possibility when planning error
recovery.
Regards,
-- Terry Allen (terry@ora.com) O'Reilly & Associates, Inc. Editor, Digital Media Group 101 Morris St. Sebastopol, Calif., 95472A Davenport Group sponsor. For information on the Davenport Group see ftp://ftp.ora.com/pub/davenport/README.html or http://www.ora.com/davenport/README.html