The subject of the course is a dynamic area. Most of the material in
the course is the result of research and implementation over the past 3-5 years.
Fortunately almost all of this work is available through papers on the
open-source Web. Readings are assigned for each week's discussion section
as listed in the schedule below. The content in each week's section is
linked to that presented in coincident lectures, listed in the
syllabus.
Students are expected to approach each week's readings critically. Are
the ideas sound? What are the alternatives and trade-offs? How
well do the ideas fit into the larger information context? What are the
barriers to success: technical, social, legal, and economic. Weekly sections
are meant to be a forum for discussing these critical reactions, driven by
student participation and NOT by instructor or teaching assistant presentations.
The amount of section participation and the degree to which it represents
critical evaluation of the readings is an important criteria of grading, as detailed
below.
Reaction papers, which are due every three weeks, are another vehicle for critically evaluating readings.
The reaction paper assignments are structured as follows: you should cover at
least two closely related papers relevant to the current section of the course.
One of the papers should be from the course syllabus (assigned for either the
current discussion section or the one immediately preceding). Another should be a
related paper that you discover via another method such as references in the
papers you have read, searching on Google, ResearchIndex, CiteSeer, or the like,
or via the library gateway. Think of finding this paper as a mini resource
discovery exercise. You should then write approximately 2-3 pages in which
you address the following points:
Reaction papers should not just be summaries of the papers you read; most of
your text should be focused on synthesis of the underlying ideas, and your own
perspective on the papers. Reaction papers should be done individually (i.e. not
in groups).
Date |
Topic and Readings |
Mon. 1/27 |
From libraries to the Web: points on a spectrum
|
Mon. 2/3 |
Content and documents: from physical to digital
- S. Payette, C. Lagoze, Value-Added Surrogates for Distributed
Content, D-Lib Magazine, June 2000,
http://www.dlib.org/dlib/june00/payette/06payette.html.
- R. Kahn and R. Wilensky, A Framework for Distributed Digital Object
Services, Corporation for National Research Initiatives, Reston,
Working Paper cnri.dlib/tn95-01, 1995.
http://www.cnri.reston.va.us/k-w.html.
- P. Dourish, W.K. Edwards, A. LaMarca, et. al, Extending Document
Mangement Systems with User-Specific Active Properties, ACM
Transactions on Information Systems, 18(2), April 2000 (Available in the
ACM Digital Library on the CU library gateway).
|
Mon. 2/10 |
Documents and data, humans and machines
- Bosak, J. and Bray, T., “XML and the Second-Generation Web,”
Scientific American(May), 1999.
http://www.sciam.com/article.cfm?articleID=0008C786-91DB-1CD6-B4A8809EC588EEDF
- A. Renear, D. Dubin, C.M. Sperberg-MacQueen, C. Huitfeldt, Towards
a Semantics for XML Markup, Proceedings of the 2002 ACM symposium on
Document engineering (Available in the ACM Digital Library on the CU
library gateway).
- A. Salminen and F. W. Tompa, Requirments for XML Document Database
Systems, Proceedings of the 2001 ACM symposium on Document engineering
(Available in the ACM Digital Library on the CU library gateway).
|
Mon. 2/17 |
Metadata: changing contexts
- D. Levy, “Cataloging in the Digital Order,” presented at The Second
Annual Conference on the Theory and Practice of Digital Libraries, 1995.
http://www.csdl.tamu.edu/DL95/papers/levy/levy.html#RTFToC8.
- Dublin Core Metadata Element Set, Version 1.1: Reference Description, http://www.dublincore.org/documents/1999/07/02/dces/.
- C. Lagoze, Keeping Dublin Core Simple: Cross Domain Discovery or
Research Description?, January 2001,
http://www.dlib.org/dlib/january01/lagoze/01lagoze.html.
- C. Lagoze, Business Unusual, How "event awareness" may breath life
into the catalog, November 2000,
http://www.cs.cornell.edu/lagoze/papers/lagozelc.pdf
|
Mon. 2/24 |
- Semantic Web: from data to knowledge
T. Berners-Lee, et. al., The Semantic Web, Scientific American, May
2001,
http://www.sciam.com/2001/0501issue/0501berners-lee.html.
- A. K. McCallum, K. Nigam, J. Rennie, Automating the Construction of
Internet Portals with Machine Learning, Information Retrieval Journal,
Volume 3, 2000,
http://citeseer.nj.nec.com/mccallum00automating.html. This is a long
paper. You can skim section 3 and not get too wrapped up in the
math. You should read this paper more with an eye towards the
techniques for automatic classification and structural extraction).
- Mitra, P., G. Wiederhold, et al. (2001). A scalable framework for the
interoperation of information sources. 1st International Semantic Web
Working Symposium (SWWS '01), Palo Alto,
http://www.semanticweb.org/SWWS/program/full/paper51.pdf
|
Mon. 3/3 |
Federation Architectures (Dienst, OAI, etc.)
|
Mon. 3/10 |
FEDORA, SODA, multi-valent documents
- Payette, Sandra and Thorton Staples, "The Mellon Fedora Project:
Digital Library Architecture Meets XML and Web Services," European
Conference on Research and Advanced Technology for Digital Libraries,
Rome, Italy, September 2002.
http://www.fedora.info/documents/ecdl2002final.pdf
- Phelps, T. A. and R. Wilensky (2001). The Multivalent Browser: A
Platform for New Ideas. ACM Document Engineering, Atlanta,
http://www.cs.berkeley.edu/~phelps/Multivalent/papers/doceng2001.pdf
- Maly, K., M. L. Nelson, et al. (1999). "Smart Objects, Dumb Archives:
A User-Centric, Layered Digital Library Framework." D-Lib Magazine March.
http://www.dlib.org/dlib/march99/maly/03maly.html
|
Date |
Topic |
Mon. 3/24 |
Web search engines
- S. Brin and L. Page, The Anatomy of a Large-Scale Hypertextual
Search Engine, 1998,
http://www-db.stanford.edu/pub/papers/google.pdf
- S. R. Kumar, et. al., The web as a graph, presented at
Nineteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database
Systems, Dallas, 2000,
http://citeseer.nj.nec.com/290635.html
- A. Heydon and M. Najork, A Scalable, Extensible Web Crawler,
World Wide Web, December, 1999,
http://www.research.compaq.com/SRC/mercator/papers/www/paper.html
|
Mon. 3/31 |
Automated digital libraries
- Arms, W. Y. (2000). "Automated Digital Libraries: How Effectively Can
Computers Be Used for the Skilled Tasks of Professional Librarianship?"
D-Lib Magazine 6(7/9),
http://www.dlib.org/dlib/july00/arms/07arms.html
- V. Kluev, Compiling Document Collections from the Internet,
SIGIR Forum, 2000,
http://www.acm.org/sigir/forum/F2000/Kluev00.pdf
- Soumen Chakrabarti, Martin van den Berg, Byron Dom, Focused
Crawling: a New Approach to Topic-specific Web Resource Discovery,
1999, WWW8,
http://www8.org/w8-papers/5a-search-query/crawling/index.html
- D. Bergmark, Collection Synthesis, 2002,
http://mercator.comm.nsdlib.org/CollectionBuilding/bergmark-paper.pdf
|
Mon. 4/7 |
Preservation and longevity issues and approaches
- Rothenberg, J. Avoiding Technological Quicksand: Finding a Viable
Technical Foundation for Digital Preservation, Council on Library and
Information Resources,
http://www.clir.org/pubs/reports/rothenberg/contents.html
- B. Kahle, Archiving the Internet, Scientific American, March,
1997,
http://www.archive.org/sciam_article.html
- A. Kenney, et. al, Preservation Risk Management for Web Resources,
D-Lib, 2002,
http://www.dlib.org/dlib/january02/kenney/01kenney.html
|
Mon. 4/14 |
Special FEDORA Help Session for Project 2 |
Mon. 4/21 |
Scholarly publishing
- To Publish and Perish, Policy Perspectives, 7 (4), 1998.
http://www.arl.org/scomm/pew/pewrept.html
- Principles for Emerging Systems of Scholarly Publishing, 2000
http://www.arl.org/scomm/tempe.html.
- S. Harnad, Free at Last: The Future of Peer-Reviewed Journals,
D-Lib Magazine, 5 (12), 1999.
http://www.dlib.org/dlib/december99/12harnad.html
- M. Smith, DSpace: An Open Source Dynamic Digital Repository,
D-Lib Magazine, 9 (1), 2003, http://www.dlib.org/dlib/january03/smith/01smith.html
- S. Hitchcock, et. al., Open Citation Linking, D-Lib Magazine,
8(10), 2002, http://www.dlib.org/dlib/october02/hitchcock/10hitchcock
|
Mon. 4/28 |
Copyright, trust, privacy, authenticity
- Charles C. Mann, "Who will own your next good idea?",
Atlantic Monthly, September, 1998, <http://www.theatlantic.com/issues/98sep/copy.htm>
- Lynch, C. (2001). The Battle to Define the Future of the Book in the
Digital World. First Monday. 6.
http://www.firstmonday.dk/issues/issue6_6/lynch/index.html
- Lynch, C. A. (2001). "When Documents Deceive: Trust and Provenance as
New Factors in Information Retrieval in a Tangled Web." Journal of the
American Society of Information Science and Technology 52(1): 12-17,
http://www.cs.ucsd.edu/~rik/others/lynch-trust-jasis00.pdf
- Hirtle, P. B. (2000). Archival Authenticity in a Digital Age.
Authenticity in a Digital Environment, Washington, D.C., Council on
Library and Information Resources.,
http://www.clir.org/pubs/reports/pub92/hirtle.html.
|