CS 502
Architecture of Web Information Systems
Spring 2003

Readings and References

General Readings on Digital Libraries and Distributed Information

Books

Online Periodicals

Some useful web sites

XML Reference Books

Examples of Digital Libraries and Electronic Publishing

Cornell University

The rest of the world

Readings

The subject of the course is a dynamic area.  Most of the material in the course is the result of research and implementation over the past 3-5 years.  Fortunately almost all of this work is available through papers on the open-source Web.  Readings are assigned for each week's discussion section as listed in the schedule below.  The content in each week's section is linked to that presented in coincident lectures, listed in the syllabus.

Students are expected to approach each week's readings critically.  Are the ideas sound?  What are the alternatives and trade-offs?   How well do the ideas fit into the larger information context?  What are the barriers to success: technical, social, legal, and economic. Weekly sections are meant to be a forum for discussing these critical reactions, driven by student participation and NOT by instructor or teaching assistant presentations.  The amount of section participation and the degree to which it represents critical evaluation of the readings is an important criteria of grading, as detailed below. 

Reaction papers, which are due every three weeks, are another vehicle for critically evaluating readings.  The reaction paper assignments are structured as follows: you should cover at least two closely related papers relevant to the current section of the course.  One of the papers should be from the course syllabus (assigned for either the current discussion section or the one immediately preceding).  Another should be a related paper that you discover via another method such as references in the papers you have read, searching on Google, ResearchIndex, CiteSeer, or the like, or via the library gateway.  Think of finding this paper as a mini resource discovery exercise.  You should then write approximately 2-3 pages in which you address the following points:

Reaction papers should not just be summaries of the papers you read; most of your text should be focused on synthesis of the underlying ideas, and your own perspective on the papers. Reaction papers should be done individually (i.e. not in groups). 

Submission procedure for reaction papers is as follows:

NOTE: Discussions dates that are in bold correspond to the due date/time of a review paper.  .

Date Topic and Readings
Mon. 1/27 From libraries to the Web: points on a spectrum
Mon. 2/3 Content and documents: from physical to digital
  • S. Payette, C. Lagoze, Value-Added Surrogates for Distributed Content, D-Lib Magazine, June 2000, http://www.dlib.org/dlib/june00/payette/06payette.html.
  • R. Kahn and R. Wilensky, A Framework for Distributed Digital Object Services, Corporation for National Research Initiatives, Reston, Working Paper cnri.dlib/tn95-01, 1995. http://www.cnri.reston.va.us/k-w.html.
  • P. Dourish, W.K. Edwards, A. LaMarca, et. al, Extending Document Mangement Systems with User-Specific Active Properties, ACM Transactions on Information Systems, 18(2), April 2000 (Available in the ACM Digital Library on the CU library gateway).
Mon. 2/10 Documents and data, humans and machines
  • Bosak, J. and Bray, T., “XML and the Second-Generation Web,” Scientific American(May), 1999. http://www.sciam.com/article.cfm?articleID=0008C786-91DB-1CD6-B4A8809EC588EEDF
  • A. Renear, D. Dubin, C.M. Sperberg-MacQueen, C. Huitfeldt, Towards a Semantics for XML Markup, Proceedings of the 2002 ACM symposium on Document engineering (Available in the ACM Digital Library on the CU library gateway).
  • A. Salminen and F. W. Tompa, Requirments for XML Document Database Systems, Proceedings of the 2001 ACM symposium on Document engineering (Available in the ACM Digital Library on the CU library gateway).
Mon. 2/17 Metadata: changing contexts
Mon. 2/24
Mon. 3/3 Federation Architectures (Dienst, OAI, etc.)
Mon. 3/10 FEDORA, SODA, multi-valent documents

BREAK

Date Topic
Mon. 3/24 Web search engines
Mon. 3/31 Automated digital libraries
Mon. 4/7 Preservation and longevity issues and approaches

 

Mon. 4/14 Special FEDORA Help Session for Project 2
Mon. 4/21 Scholarly publishing
Mon. 4/28 Copyright, trust, privacy, authenticity

[CS 502 Home Page]

Carl Lagoze
(lagoze@cs.cornell.edu)
Last changed: April 26, 2003