Chronological Awareness Tools: CHRONO and Meta-CHRONO

Lee Li-Jen Chen
Knowledge Science Institute
University of Calgary
Alberta, Canada T2N 1N4

lchen@cpsc.ucalgary.ca http://ksi.cpsc.ucalgary.ca/KSI/

Abstract

This paper introduces CHRONO and Meta-CHRONO tools for managing chronological awareness at websites. The paper acts as a tutorial about how to set up CHRONO for a website. CHRONO is an HTTPD server-side system which generates chronological listings of web pages that have been changed recently at specific sites. It provides a basic awareness-support that lets visitors of a web site (e.g., members of a group, or other Net surfers) see which web pages have been modified since their last visit. Currently, the CHRONO system is implemented on a UNIX platform. CHRONO presents visitors with an HTML document that lists the titles of web pages at the site in reverse chronological order. This chronological listing also functions as a collection of hyper links to the listed web pages (Chen & Gaines, 1996). Meta-CHRONO is another awareness tool which collects CHRONO listings from related websites, then generates a collective chronological awareness listing reflecting the changes at those websites.

User Interface and Its Functionality

The user interface of the CHRONO system is straightforward and intuitive for web users. It looks like an automated what's new page to the users. From the list, the visitors are able to tell at a glance what documents have been modified or created recently. They can also scroll down the list to check those older documents at the site. Because the titles of the listed pages also act as hyper links to the actual web pages, visitors can simply click and jump to the relevant pages of interests (Chen & Gaines, 1996b).

Chronological hyper links presented in the CHRONO listings provide the visitors the means to access the newly modified or created pages. This time-line (or history) dimensionality complements the functionality of the associative memory characteristic found in typical hyper links which join related information.

The time-line dimension allows frequent visitors of a web site an immediate awareness on what have been changed since their latest visit. The changes may reflect some web pages in which they have been previously interested or they may show some pages that the visitors have never seen before but now appeal to them. Hence this chronological browsing characteristic is analog to spatial (subject-category) browsing characteristic that library patrons have often experienced when looking for books on open book-shelves (i.e., accidentally finding other relevant books near the books that they were looking for originally). Unlike a manually updated What's New page in which the users have to rely on timely updates made by a web master (or by the document authors), CHRONO provides the time-line dimension to the users automatically, in a reliable and periodic fashion.

CHRONO Current System Usage

The CHRONO system has been running at the Department of Computer Science at the University of Calgary since March, 1995. It currently services seven web locations: two research units and five individuals. People associated with the two research units: Knowledge Science Institute and GroupLab periodically have utilized the system to check on new developments of each other (both within group and between groups).

Figure 1 Screen Snapshot of CHRONO

From a preliminary examination of the HTTPD access_log of the CHRONO system site and from talking with individual group members, we have found that the chronological listings of the five personal sites have offered other group members more focused chronological awareness about these five individuals' working patterns. Occasionally, some people had discovered new projects the targeted individuals were working on that they were not previously aware. This positive usage experience of personal chronological listings suggests to us that there is a need to further examine the effectiveness of providing different sub-groupings of chronological listings to group members.

Meta-CHRONO

Meta-CHRONO is recent addition to support special interest community awareness for distributed research teams and groups (Gaines, Shaw & Chen, 1996).. This meta-level awareness tool collects CHRONO listings from related websites, then it generates a collective chronological awareness listing reflecting the changes at those websites. This allows visitors to become aware recent modifications of changes at all registered websites via a centralized Meta-CHRONO index-listing. This meta-level awareness support of CHRONO listings is a natural extension from Miller's living systems theory (Miller, 1978) and the CyberOrganism framework (Chen & Gaines, 1997).

Related Chronological Awareness Systems

There are three other systems that provide support for chronological awareness based on different designs and implementations (Chen & Gaines, 1996):

WebWatch (for WIN3.1 or WIN95)
The URL-minder: Your Own Personal Web Robot! (UNIX)
Katipo: A Web Lurker (Mac OS)

Getting and Installing CHRONO for a Website

For UNIX System (SunOS Release 4.1.4 and SGI IRIX 5.3)

This section describes how to install and run CHRONO Version 1 on the UNIX platform. First step describes how to compile CHRONO; second step illustrates how to run CHRONO; and third step shows how to setup crontab to run CHRONO automatically in periodic fashion.

Step 1: Compile the CHRONO program

Download the CHRONO source code for compiling or skip this step by obtaining a pre-compiled SUN OS compressed binary executable and SGI IRIX binrary executable (compressed) for a specific platform.

The source code "chrono.c" can be readily compile on the UNIX platform (SunOS Release 4.1.4 and SGI IRIX 5.3) without modification. It should be relatively easy to compile on the other UNIX versions.

The executable can be compiled under the GNU GCC like the following:

        [dk chrono 1 ]> gcc -o chrono chrono.c

The executable can also be compiled under the normal cc like the following:

        [dk chrono 1 ]> cc -o chrono chrono.c

If nothing goes wrong, an executable named chrono should be created.

Step 2: Running CHRONO manually

To initiate CHRONO on a specific www directory designate for indexing. The following sub-steps test run and demonstrate basic ideas of how CHRONO works.

The CHRONO program takes (at least ) 3 parameters for arguments (the 4^th one is an optional parameter to be discuss later in the advance usage section).

chrono <dir> <outputfile-name> <http path> [index patterns file]

<dir>

the www directory where the indexing starts

<outputfile-name>

the name of the outputting HTML file

<http path>
an http path name this is the entry point to the directory specified in the <dir>

[index patterns file]
the name of file containing string patterns for index matching. If not specified, CHRONO will match files with ".htm" and ".HTM" patterns.

For example: the following statement is used to create an index webpage named KSI-chrono.html of a web directory named /ksi/www/ where the URL: http://ksi.cpsc.ucalgary.ca/ refers to:

        [www 2] > chrono /ksi/www/ KSI-chrono.html http://ksi.cpsc.ucalgary.ca/

CHRONO will recursively traverse the directory /ksi/www/ can create a reverse chronological index webpage called KSI-chrono.html in the current directory

NOTE: Remember to chmod a+r KSI-chrono.html to make it readable via world wide web. For example if the current directory is referable via URL: http://ksi.cpsc.ucalgary.ca/ the http://ksi.cpsc.ucalgary.ca/KSI-chrono.html will point to the CHRONO page for /ksi/www/ website.

Step 3: Setup crontab to run CHRONO automatically

The above step describes how CHRONO can be executed to create a CHRONO webpage. The following sub-steps illustrate how crontab (see detail explanation of crontab by doing a manual page on UNIX: i.e., man crontab) can setup an automatic periodic schedule for running CHRONO and creating CHRONO webpage(s).

Sub-Step 3.1: Crontab statement

crontab is used to register any command line statement for automatic periodic execution by the UNIX system.

The following crontab format is used to execute a specific command line every hour, at half of an hour:

        30 * * * * <command line>

Therefore if one wants to execute a CHRONO program like the one in stage 2, one just substitutes the command statement in the <command line> field.

IMPORTANT NOTICE: the command line has to take the whole path names for the CHRONO program and corresponding files; therefore, if the CHRONO program is resided in /ksi/chrono/ one has to specific the whole path name, e.g.,

        15 * * * * /ksi/chrono/chrono /ksi/www/ /ksi/www/KSI-chrono.html http://ksi.cpsc.ucalgary.ca/

The above crontab statement will create a CHRONO webpage (named KSI-chrono.html) of http://ksi.cpsc.ucalgary.ca/ (under directory /ksi/www/)and put the webpage inside the /ksi/www/ directory. The specific CHRONO webpage will be updated every hour at quarter after. (note: remember to chmod the KSI-Chrono.html to be readable by others after it is created first time).

Sub-Step 3.2: Resister Crontab

To register for SGI IRIX:

[dk chrono 7 ]> crontab _sgi.chrono

where _sgi.chrono is an text file with crontab statement(s).

To edit a crontab schedule file (FOR SUN OS), type:

        [dk chrono 7 ]> crontab -e

that will bring up the editing mode, once in the mode, one can start to enter the crontab statement. Usually the default editor for crontab is vi, one can use one's favorite editor (e.g., emacs) to create a crontab file (e.g., _spuds) in advance then loaded it in while one is in crontab -e mode in vi (key sequence: 'ESC' ':' 'r _spuds' 'CTRL-x') . Or you can set the UNIX environment variable VISUAL or EDITOR to a favorite editor.

For example, here is an example crontab file named _spuds

There are two other useful crontab switches:

crontab -l List the user's crontab file

crontab -r Remove the current user's crontab file from the crontab directory.

Usually, one uses crontab -l to examine the current schedule of cron daemon; if one wants to modified or create a new CHRONO index, one can edit a crontab file (e.g., _spuds) before one uses crontab -e. Once one is satisfied with the crontab file, one can simply use crontab -r to remove (clear) the existing cron schedule and use crontab -e to load the newly modified crontab file. By keeping a crontab file and editing it, the organization of the CHRONO webpages will be relatively simple.

Summary of Basic Installation

Once the CHRONO program has been compiled successfully, CHRONO can be use to index a www directory in reverse chronological fashion to produce a CHRONO webpage. CHRONO takes 3 parameters: (1) the path of the www directory to be indexed; (2) the resulting CHRONO webpage file name; and (3) the http URL of the indexed www directory.

Using crontab, an automatic, periodic command-schedule can be setup to generate automatic CHRONO webpages. The important point is that the CHRONO command and its parameters need to be specified by their whole path-names.

Finally, to make the generated CHRONO webpages readable by others, by using the chmod a+r <file name(s)> command.

After one gets CHRONO running, one can start to experimenting with advance usage features: e.g., matching patterns, CHRONO.CONTROL files, etc.

CHRONO Advance Usage

This section describes how CHRONO indexing patterns can be changed. For example, a website maintainer may wish certain sub-directories or files not to be indexed by CHRONO. He or she may also want to index additional file formats than HTML (e.g., GIF, JPEG).

Changing the Default Global Index Patterns

By default CHRONO will recursively index every HTML files under the directory specific by the <dir> parameter.

chrono <dir> <outputfile-name> <http path> [index patterns file]

There is an optional index pattern file [index patterns file] for CHRONO that allows web master to overwrite the default index patterns for deciding what to be indexed. The absence of this file will result in default patterns which match any file with .htm and .HTM (default) string patterns. For examples, files such as test.html, MORE.HTM, and OK.HTM.also.match will be indexed by CHRONO.

One can specific a global control file, for example, CHRONO.PATTERN:

        #INCL#
        .htm
        .HTM
        .gif
        #EXCL#
        photo
        .HTM.

The keywords #INCL# and #EXCL# signal to CHRONO the followings line(s) will be inclusion (file) pattern(s) and exclusion (file and directory) pattern(s).

If a file or directory name satisfies one of the inclusion patterns it is indexed unless it also satisfies one of the exclusion patterns.

Therefore the above example add one more inclusion file pattern .gif so now any GIF files with the .gif as extension will be matched. And CHRONO will avoid indexing any files or directories containing string patterns like photo or .HTM. (recursively) under www directory <dir>. Notice with such exclusion patterns, file OK.HTM.also.match will no longer be indexed.

NOTE

If the specified index pattern file exist but is null (with size of 0 byte), CHRONO will result in another default which matches all files without regards to any patterns, therefore everything will be matched and be indexed.

Changing the Individual Index Patterns

One can place a control file, named "CHRONO.LOCAL", in any webpage directory (and its sub-directories). Unlike the optional index pattern file described earlier (which the effect is global), the CHRONO.LOCAL file specific exclusion index patterns that are only local to the current directory which the file is placed.

This control file will be used by CHRONO whenever it tries to index that webpage directory; so that, any sub-directory or file names that have watched the string patterns specified in the control file will not be indexed. For examples, if we have a webpage directory containing the following subdirectories and files:

        Agenda0.html
        CHRONO.LOCAL
        private_subdir/
        public_subdir/
        agenda1.html
        agenda2.html
        agenda3.html
        another_world.html
        ok-file.html
        www_agenda/
        www3/

Considered if we have two string patterns, "public_subdir" and "agenda" which we want to use for filtering out subdir or file name patterns. We will write a CHRONO.LOCAL file containing both string patterns, (note that each string pattern is listed in a seperate line; and there is no extra space in front or behind each of pattern string). For example, if CHRONO.LOCAL contains the following two lines (with two patterns for matching).

public_subdir
agenda

With the control file, CHRONO.LOCAL, in place, CHRONO will omit the following files and subdirectories for indexing:

        public_subdir/
        agenda1.html
        agenda2.html
        agenda3.html
        www_agenda/

And the following files and subdirectories will be indexed by CHRONO (note the string pattern is case sensitive, hence "Agenda0.html" won't be matched by string pattern "agenda"):

        Agenda0.html
        public_subdir/
        another_world.html
        ok-file.html
        www3/

Please note that the effects of the omission control file is only local to the directory. So if there are some files called "agenda-4.html" and "agenda-5.html" under the "public_subdir" sub directory, and one wants to omit them for indexing, one needs to place the string pattern "agenda" in another CHRONO.LOCAL file in the "public_subdir" sub directory.

Omitting the whole directory for indexing

One can tell the CHRONO not to index a specific directory (consequently all its subdirectories) by place a null (0 byte size) control file, names "CHRONO.LOCAL" in the directory.

        Agenda0.html
        CHRONO.LOCAL
        private_subdir/
        public_subdir/
        agenda1.html
        agenda2.html
        agenda3.html
        another_world.html
        ok-file.html
        www_agenda/

For example, if one places the control file CHRONO.LOCAL with size of 0 byte in the above example directory, nothing in and under the directory will be indexed. The effect of this null control file, "CHRONO.LOCAL", basically stops CHRONO from traversing down in any of its subdirectories; therefore, nothing under private_subdir and private_subdir will be indexed either.

Questions and Suggestions?

To join the CHRONO Mailing List (fairly low traffic) for discussion about questions, problems, announcements, suggestions, future ports to different platforms, send a message with the body "subscribe" to chrono-request@cpsc.ucalgary.ca

References

Chen, L. L.-J. and Gaines, B. R. (1996). Knowledge Acquisition Processes in Internet Communities. Proceedings of KAW96, Banff, AB, pp. 43-1~43-18. http://ksi.cpsc.ucalgary.ca/KAW/KAW96/chen/ka-chen-gaines.html

Chen, L. L.-J. and Gaines, B. R. (1996b). Methodological Issues in Studying and Supporting Awareness on the World Wide Web. Proceedings of WebNet96. Association for the Advancement of Computing in Education, Charlottesville, VA. http://ksi.cpsc.ucalgary.ca/articles/WN96/WN96Chrono/WN96Chrono.html

Chen, L. L.-J. and Gaines, B. R. (1997). A CyberOrganism Model for Awareness in Collaborative Communities on the Internet. International Journal of Intelligent Systems, (in press). http://www.cpsc.ucalgary.ca/~lchen/current/ijis/

Gaines, B. R., Shaw, L. G., and Chen, L. L.-J. (1996). Utility, Usability, and Likeability: dimensions of the net and web. Proceedings of WebNet96. Association for the Advancement of Computing in Education. http://ksi.cpsc.ucalgary.ca/articles/WN96/WN96HF/WN96HF.html

Miller, J. G. (1978) Living Systems. McGraw Hill, New York, NY

NetMind (1995). The URL-Minder: Your Own Personal Web Robot. NetMind. http://www.netmind.com/URL-minder/URL-minder.html

Newberry, M. (1995). Katipo--a Web Lurker. Victoria University of Wellington, New Zealand. http://www.vuw.ac.nz/~newbery/Katipo.html

Specter (1995). WebWatch. Specter Communications. http://www.specter.com/

[ Knowledge Acquisition Workshop '96, Banff, Alberta, Canada, Nov. 9 - 14, 1996 ]

<dir>	the www directory where the indexing starts
<outputfile-name>	the name of the outputting HTML file
<http path>	an http path name this is the entry point to the directory specified in the <dir>
[index patterns file]	the name of file containing string patterns for index matching. If not specified, CHRONO will match files with ".htm" and ".HTM" patterns.

crontab -l	List the user's crontab file
crontab -r	Remove the current user's crontab file from the crontab directory.