Converting preformatted documents to HTML

Eelco van Asperen <evas@cs.few.eur.nl>
From: Eelco van Asperen <evas@cs.few.eur.nl>
Date: Wed, 5 Aug 92 16:51:21 +0200
Message-id: <9208051451.AA20447@slc01.cs.few.eur.nl>
To: www-talk@nxoc01.cern.ch
Subject: Converting preformatted documents to HTML

In our HTML files, I frequently want to refer the reader to man-pages
or the GNU documentation. So, I wrote two small scripts to convert
man-pages and GNU Info files to HTML and added a bit of code to
the HTTPD daemon to call these filters if it found a .man or a .info
file. With this, one can write f.e. 
	"<A HREF=http://kaa.cs.few.eur.nl/info/gcc/gcc.info>GCC</A>"
to refer to the (extensive) documentation for the GNU C compiler.
So far, so good.

The Info files are the GNU equivalent of man pages and they
are complete formatted documents that contain references. 
The problem I've run into is that HTML does not offer a mode in 
which all formatting is preserved *and* tags are recognised.
The LISTING mode does preserve the formatting but does not see the 
hypertext references; the NORMAL mode does recognise the references 
but destroys the formatting (indentation, line breaks). 
Switching from LISTING to NORMAL mode (and back) in the middle of 
a paragraph is not feasible either because that will cause a blank line 
to be inserted at that point.

It would be handy to have a mode that does preserve formatting and
recognizes SGML tags since there are no doubt other hypertext formats
that could be converted to HTML this way.

Thoughts anyone ?

Eelco van Asperen.           || Erasmus University Rotterdam
internet: evas@cs.few.eur.nl || Department of Computer Science, room H4-32
New, Improved, a disclaimer: || PObox 1738, 3000 DR Rotterdam, The Netherlands
*** publishing private email is not allowed by the author of this message. ***