CS 779 - Seminar of Web Searching and Mining
Fall 2005
This seminar will provide a venue for interaction among different people at Cornell working on Web search and mining and facilitate the exchange of programs, algorithms, and methods developed in different projects. It will focus on special topics and discuss selected papers from recent Web-related conferences such as the World Wide Web Conference, Hypertext Conference, and the Semantic Web Conference, as well as results from ongoing web-related research projects at Cornell. Hopefully the seminar will encourage new research at Cornell in this exciting area.
Schedule: Wednesday 3-4
Location: 5126 Upson (NOTE: On September 14 we will meet in 5130 Upson)
Format: Throughout the semester we'll choose a number of special topics depending on the interest of the group. We'll devote each meeting to a discussion of one or two papers relevant to the topic. Participants will share responsibilities for presenting papers and leading discussion about them.
Credits: 3
Grading: S/U
Organizers: Pavel Dmitriev and Carl Lagoze
Special Topic I : The Semantic Web: In his 1998 Semantic Web Roadmap, Tim Berners-Lee (inventor of the Web) introduced the semantic Web as follows:
We'll spend some time examining whether the semantic web is actually necessary, useful, practical, sound, etc. There is lots of work on the semantic web in a variety of areas including databases, logics, languages, etc so there are lots of papers available.
| Date | Readings | Comments |
| 09/07 |
|
There are a lot of awful introductions to the semantic web. These two publications have a reasonable amount of integrity and avoid a good bit of the hype. I (Carl) will give a semi-lecture on what the semantic web is and what are some of the research areas within it. |
| 09/14 |
|
This is a nice paper detailing the formal foundations and design decisions underlying OWL, the ontology language for the semantic web. Reading and discussing it should help us understand distinctions between the semantic web and other knowledge representation research. |
| 09/21 |
|
Both of these papers deal with general theme of populating the semantic web with minimal human annotation. The first is a recent survey paper. The second presents research work in automated semantic annotation. |
| 09/28 |
|
Ontologies play a major role in the semantic web. This is a pre-SM paper that provides an excellent foundation on ontology design and why it is important. |
| 10/05 |
|
This chapter in Broekstra's thesis has a nice classification of requirements for storing and querying RDF-based graphs. |
| 10/12 |
|
There is considerable interest in web services in commercial, scientific, and other areas. Semantic web technology provides some promise of automatic matchmaking of services to needs. This paper describes OWL-S, one of the more promising activities in this area. |
| 10/19 |
|
Selcuk will give a practice talk in preparation for his conference talk at ICKM 2005. |
| 10/26 |
|
Recent research by one of the leading researchers in trust, reputation, and the semantic web (and social networks in general). This is a shorter version of a journal article at http://trust.mindswap.org/papers/toit.pdf. |
| 11/02 |
|
We'll stay in the realm of trust. This is well-known paper that extents PageRank techniques into the area of global trust. |
| 11/09 |
|
Any discussion about trust needs to include the issue of intentional deception. One area where this is prevalent is "link spamming", or trying to influence the ranking of search engines. This is a recent paper in this area. |
| 11/16 |
|
Pavel will present a paper that he will present at a conference in January. |
| 11/23 | No Seminar - Thanksgiving Break |
Last Update: Carl Lagoze 11/07/05