java.lang.Object
  |
  +--uk.ac.soton.harvester.harness
harness is a driver class that creates a citation harvesting object and applies it to a specific data file.
Constructor Summary
harness()

Method Summary
(package private) static int getCitations(java.lang.String pdfFile, java.lang.String articleId, java.lang.String xmlFile, java.lang.String[] options)
    getCitations performs citation harvesting on a specific data file.
static void main(java.lang.String[] args)
    main is the driver for the citation processing.
Constructor Detail
public harness()

Method Detail
static int getCitations(java.lang.String pdfFile, java.lang.String articleId, java.lang.String xmlFile, java.lang.String[] options) throws java.lang.Exception
pdfFile - the data file to be interpreted. This is not in fact a file in PDF format, but an intermediate XML format produced directly from the PDF by an independent program (currently bpe5).
articleId - a string which uniquely identifies the article. The string will conform to the pattern PP-JJ-STUFF, where PP is a two-letter publisher code, JJ is a two-letter journal code and STUFF is an uninterpreted article code. The substring PP-JJ uniquely identifies a particular journal.
xmlFile - the name of a file to which XML data will be written, containing the citation data extracted from the article. The XML conforms to the Ingenta DTD.
options - an array of strings (just the arguments passed to the main method) which contains a list of hints on how best to parse the article.

public static void main(java.lang.String[] args) throws java.lang.Exception
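
As an illustration of the articleId convention described for getCitations, the following is a minimal sketch of splitting an identifier of the form PP-JJ-STUFF. The class ArticleIdSketch and its methods split and journalKey are hypothetical and not part of the harness class. The sketch also assumes the two-letter codes are ASCII letters, which the description does not state.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper, not part of the harness API. */
public final class ArticleIdSketch {

    // Assumption: "two-letter" means two ASCII letters. STUFF is left
    // uninterpreted, so anything after the second hyphen is accepted.
    private static final Pattern ID =
            Pattern.compile("([A-Za-z]{2})-([A-Za-z]{2})-(.+)");

    /** Returns {publisher code, journal code, article code}. */
    static String[] split(String articleId) {
        Matcher m = ID.matcher(articleId);
        if (!m.matches()) {
            throw new IllegalArgumentException(
                    "Not of the form PP-JJ-STUFF: " + articleId);
        }
        return new String[] { m.group(1), m.group(2), m.group(3) };
    }

    /** Returns the PP-JJ prefix, which identifies the journal. */
    static String journalKey(String articleId) {
        String[] parts = split(articleId);
        return parts[0] + "-" + parts[1];
    }

    public static void main(String[] args) {
        String id = "AB-CD-12345"; // made-up identifier for illustration
        System.out.println(journalKey(id)); // prints AB-CD
        System.out.println(split(id)[2]);   // prints 12345
    }
}
```

Because STUFF is uninterpreted, the sketch keeps any further hyphens inside the article code rather than splitting on them.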