An HTML document that you receive may have À or it may very
well have À in it. How will your system process À? The
document character set does include the "Right Part of Latin Alphabet
Nr. 1" after all.
For the purposes of HTML, the Added Latin 1 entity set was not meant
to extend the expressive capability of the language, but only to
provide mnemonics, and to facilitate 7-bit clean transmission.
I am more confused by the moment...
What exactly is standardized by the ISO Added Latin 1 entity set?
The entity names, the SDATA entities in the public text, the bindings
between them, or all of the above?
I thought it was just the names; i.e. if a DTD includes, by reference,
the ISO Added latin 1 entity set, then À was valid markup --
but there is no telling how it is represented in the ESIS.
I thought the public text was intended to be edited locally for
each SGML system; that the SDATA "[agrave ]" thingies were supposed
to be replaced by SDATA "esc-my-typesetter's-agrave-thingy" on
a per-system basis.
Is it the case, rather, that all SGML systems (that intend to support
the ISO entity sets) are supposed to recognize an SDATA entity of
"[agrave ]" as some sort of special character?
Please explain. I've seen numerous discussions on comp.text.sgml, and
I think they have served only to confuse me.
Dan
>> >CDATA terminated by any ETAGO
>> >-----------------------------
>i'd suggest
>
><!ENTITY % literal "CDATA"
> -- historical, non-conforming parsing mode where
> the only markup signal is the first end tag (for
> any element) encountered and this terminates the
> <literal> element
> -->
>
>of course, this assumes you want to describe what SGML will do.
>if the comment is supposed to be describing what browsers do,
>then ignore my suggestion.
It's supposed to be describing what browsers do, so I guess I'll
ignore this suggestion. There wouldn't be anything "non-conforming"
about a content model where ETAGO delimiter-in-context was the only
markup signal. That's just plain CDATA. This is "a historical,
non-conforming parsing mode..."
Strictly speaking, this isn't allowed in "An SGML Application Conforming
to ..." so it's really just a "Note to implementors." Perhaps I'll
color it that way more explicitly.
Dan