STARTS
Stanford Protocol Proposal for Internet Search and Retrieval

Reference Implementation


Source Description

The reference implementation includes two source collections.

Computer Science Technical Reports

A database of documents that are drawn from the NCSTRL technical reports provided by the Cornell Computer Science Department. These documents are indexed using freeWAIS waisindex with the following fields:

  1. id - The identifier of the document.
  2. dm - The entry date of the document in ISO 8601 format.
  3. ti - The title of the document.
  4. au - The author(s) of the document.
  5. ab - The abstract of the document.
  6. bd - The body (text) of the document.

Linux Newsgroup Archives

A subset of the archives for the newsgroup comp.sys.linux.announce drawn from sunsite. These documents are indexed using freeWAIS waisindex with the following fields:

  1. id - The identifier of the document.
  2. dt - The date of the news message.
  3. fr - The sender ("from" field) of the message.
  4. sb - The subject of the message.
  5. tx - The text (body) of the message.

Send questions to help@ncstrl.org