... a couple of good examples removed for brevity
> Example 3:
>
> <!doctype html PUBLIC "-//IETF//DTD HTML Experimental//">
> <html>
> <head>
> <title>blah</title>
> <style>blah...</style>
> <!-- <STYLE> unknown; don't infer </HEAD><BODY>; ignore content -->
> </head>
> <body>
> <newel>blah</newel>
> <!-- <NEWEL> unknown, but <BODY> has been seen; include content -->

If I understand this proposal correctly, it fails for new non-content head tags.
In the example above, you assume that even though the browser doesn't know
<STYLE>, it knows to match it with </STYLE> and ignore the content in between.
Suppose you have:
<!doctype html PUBLIC "-//IETF//DTD HTML Experimental//">
<html>
<head>
<title>blah</title>
<newtag>blah...
<!-- <newtag> unknown; how do you know how far ahead to look for
</newtag> before giving up and inferring </HEAD><BODY>
to make "blah..." the start of the body of the document? -->
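
To make the difficulty concrete, here is a minimal Python sketch of one way a
browser might try to decide. It is only an illustration under my own
assumptions: the function name classify_unknown_head_tag, the lookahead limit,
and the two result labels are hypothetical and are not part of the proposal.
It shows that the same unknown tag is classified differently depending on an
arbitrary limit.

```python
import re


def classify_unknown_head_tag(doc: str, tag: str, lookahead: int) -> str:
    """Guess how to treat an unknown tag met while still inside <HEAD>.

    Hypothetical heuristic: scan at most `lookahead` characters past the
    start tag for a matching end tag. If one is found, treat the element
    as head content to be skipped; otherwise give up and infer
    </HEAD><BODY>.
    """
    start = re.search(rf"<{re.escape(tag)}\b[^>]*>", doc, re.IGNORECASE)
    if start is None:
        raise ValueError(f"no <{tag}> start tag in document")

    # Only the next `lookahead` characters are examined.
    window = doc[start.end():start.end() + lookahead]
    end = re.search(rf"</{re.escape(tag)}\s*>", window, re.IGNORECASE)
    return "skip-as-head" if end else "infer-body"


# A container whose end tag is 50 characters away.
container = "<head><title>blah</title><newtag>" + "x" * 50 + "</newtag></head>"
# The case from the example above: no end tag ever appears.
no_end_tag = "<head><title>blah</title><newtag>blah..."

print(classify_unknown_head_tag(container, "newtag", 10))     # infer-body
print(classify_unknown_head_tag(container, "newtag", 100))    # skip-as-head
print(classify_unknown_head_tag(no_end_tag, "newtag", 1000))  # infer-body
```

With a small limit, a genuine container is wrongly treated as the start of the
body; with a large one, the browser must buffer that much input before it can
render anything. No fixed limit is right for every new tag.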
Eric