Equivalent URLS. Was: searchable index of the web
Tim Berners-Lee <timbl@www3.cern.ch>
Date: Thu, 1 Jul 93 14:30:18 +0200
From: Tim Berners-Lee <timbl@www3.cern.ch>
Message-id: <9307011230.AA06103@www3.cern.ch>
To: marca@ncsa.uiuc.edu (Marc Andreessen)
Subject: Equivalent URLS. Was: searchable index of the web
Cc: sanders@bsdi.com, www-talk@nxoc01.cern.ch
Reply-To: timbl@nxoc01.cern.ch
Status: RO
>> which means it's not the same. Marc, when doing annotations and
>> checking the "visited" list maybe you should ignore :80 on http:
>> servers?
>
>Yup. And 70 for Gopher servers. And WAIS gateways should be
>equivalenced. And trailing periods in machine names should be
>ignored. And trailing slashes in directory names. And...
>
>Big problem; hopefully solved by URN's; planning on patching the hell
>out of libwww2 for Mosaic 2.0 to try to do as much as possible though.
I have changed libwww 2.10 to strip the trailing "." from host names
and the default port (:80 for http:, :70 for gopher:) whenever an
HTParse() of a URL is done (i.e. all the time). This should cure those two.
In the end it should also mean that URLs stop being quoted in these
forms, as editors will strip them out.
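
For illustration only, the stripping described above might be sketched as follows. This is not the libwww code and does not use HTParse(); it is a rough Python equivalent built on the standard urllib.parse module. The function name strip_redundant_forms and the example.org hosts are hypothetical, and the sketch assumes URLs without user names or IPv6 literals.

```python
from urllib.parse import urlsplit, urlunsplit

# Default ports named in the message; other schemes are left untouched.
DEFAULT_PORTS = {"http": 80, "gopher": 70}


def strip_redundant_forms(url: str) -> str:
    """Drop a trailing '.' on the host and a default :80 / :70 port.

    Hypothetical helper. Note that urlsplit().hostname also lowercases
    the host, and that any user:password part is discarded here.
    """
    parts = urlsplit(url)

    host = parts.hostname or ""
    if host.endswith("."):
        host = host[:-1]  # "www.example.org." -> "www.example.org"

    port = parts.port
    if port is not None and DEFAULT_PORTS.get(parts.scheme) == port:
        port = None  # the scheme's default port carries no information

    netloc = host if port is None else f"{host}:{port}"
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


if __name__ == "__main__":
    for example in (
        "http://www.example.org.:80/a/b.html",
        "gopher://gopher.example.org:70/11/",
        "http://www.example.org:8080/",
    ):
        print(example, "->", strip_redundant_forms(example))
```

A port is removed only when it matches the default for that URL's own scheme, so :80 on a gopher: URL or :8080 on an http: URL is kept as written.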
Tim