Re: Future of meta-indices: site indexing proposal and Perl script

Dave Raggett <dsr@hplb.hpl.hp.com>
Errors-To: listmaster@www0.cern.ch
Date: Wed, 23 Mar 1994 10:37:47 --100
Message-id: <9403230934.AA24410@dragget.hpl.hp.com>
Errors-To: listmaster@www0.cern.ch
Reply-To: dsr@hplb.hpl.hp.com
Originator: www-talk@info.cern.ch
Sender: www-talk@www0.cern.ch
Precedence: bulk
From: Dave Raggett <dsr@hplb.hpl.hp.com>
To: Multiple recipients of list <www-talk@www0.cern.ch>
Subject: Re: Future of meta-indices: site indexing proposal and Perl script
X-Listprocessor-Version: 6.0c -- ListProcessor by Anastasios Kotsikonas
Content-Length: 1319
Kevin Altis writes:

> you should be able to extract particular sections or pages of
> any document. So, you could retrieve page 3-4 of a PostScript
> document, paragraph three on page 4 of a Word document, a rectangle
> from 10,15 to 108,112 of a GIF image, etc.

and John C. Mallery writes:

> It would be nice if the document fragment could be specified
> stuff between two arbitrary points in a document, rather than
> relying exclusively on everything to be named.

Some of the HyTime concepts may be useful here. HyTime extends
SGML's reference capability to accommodate those cases in which
no unique identifier exists in the document's name space. Authors
can create a location address element that pairs the object in
question with a unique local identifier. There are three kinds
of addressing:

    a)  by name (using ID attributes)
    b)  by position in a coordinate space (the GIF example)
    c)  by semantic construct (e.g. the 3rd polygon on the left)

A summary of these capabilities can be found in the November '91
issue of Communications of the ACM (Volume 34, number 11).

--
Best wishes,

Dave Raggett

-----------------------------------------------------------------------------
Hewlett Packard Laboratories,           +44 272 228046
Bristol, England                        dsr@hplb.hpl.hp.com