A C D E F G H I L M N O P R S T U V X

A

AttributeMarker - interface uk.ac.soton.harvester.AttributeMarker.
AttributeMarker is an interface that models the phenomenon of an object that knows how to look for a particular attribute of a citation (title, author, volume etc).
authb - Variable in class uk.ac.soton.harvester.DeciterState
authb and authe store the beginning and end offsets of the authors sequence of the citation.
authe - Variable in class uk.ac.soton.harvester.DeciterState
authb and authe store the beginning and end offsets of the authors sequence of the citation.
author - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
AUTHORS - Static variable in class uk.ac.soton.harvester.Deciter
AUTHORS is the index of the object in the AttributeMarkers array that recognises the position of the authors in the citation string.

C

CitationOutput - class uk.ac.soton.harvester.CitationOutput.
CitationOutput s the superclass of each class that puts the citation data onto the specified printwriter in the one specific format requested by doTXT, doHTML or doXML.
CitationOutput() - Constructor for class uk.ac.soton.harvester.CitationOutput
Empty constructor to make an uninitialised object easily from the dynamic loading code.
CitationOutput(DeciterState) - Constructor for class uk.ac.soton.harvester.CitationOutput
Standard constructor that simply stores the deciter state.

D

d - Variable in class uk.ac.soton.harvester.EntityEncoder
d provides a reverse lookup from character number to entity name
d - Variable in class uk.ac.soton.harvester.EntityReader
d provides a lookup from entity name to character number
d - Variable in class uk.ac.soton.harvester.EntityWriter
d provides a reverse lookup from character number to entity name
date - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
DATE - Static variable in class uk.ac.soton.harvester.Deciter
DATE is the index of the object in the AttributeMarkers array that recognises the position of the date in the citation string.
dateb - Variable in class uk.ac.soton.harvester.DeciterState
dateb and datee store the beginning and end offsets of the year substring of the citation.
datee - Variable in class uk.ac.soton.harvester.DeciterState
dateb and datee store the beginning and end offsets of the year substring of the citation.
DEBUG(String) - Static method in class uk.ac.soton.harvester.Utils
DEBUG is a convenience method for producing debugging output.
Deciter - class uk.ac.soton.harvester.Deciter.
deciter class does all the significant work in decoding a set of citations
Deciter(String, String[]) - Constructor for class uk.ac.soton.harvester.Deciter
Constructor sets the value of the article ID and extracts the hints and flags from the array of options passed on the command line.
DeciterState - class uk.ac.soton.harvester.DeciterState.
deciterstate class defines an object that holds all the state of the deciter especially the hints, the marker offsets.
DeciterState(String, String[]) - Constructor for class uk.ac.soton.harvester.DeciterState
 
detag(String) - Static method in class uk.ac.soton.harvester.CitationOutput
detag removes tags from an HTML-style string.
detag(String) - Static method in class uk.ac.soton.harvester.Utils
detag removes tags from an HTML-style string.
digb - Variable in class uk.ac.soton.harvester.DeciterState
digb and dige store the beginning and end offsets of the initial numbering string of the citation.
dige - Variable in class uk.ac.soton.harvester.DeciterState
digb and dige store the beginning and end offsets of the initial numbering string of the citation.
DoAuthors - class uk.ac.soton.harvester.DoAuthors.
doAuthors recognises the author sequence in the citations.
DoAuthors() - Constructor for class uk.ac.soton.harvester.DoAuthors
 
DoAuthors2 - class uk.ac.soton.harvester.DoAuthors2.
DoAuthors2 performs a specific tidying-up function after the authors have been found in a standard way.
DoAuthors2() - Constructor for class uk.ac.soton.harvester.DoAuthors2
 
doAuthorsHint1(DeciterState, int) - Method in class uk.ac.soton.harvester.DoAuthors
doAuthorsHint1 controls the relatively simple (and uncommon) situation where a list of authors is written like "Carr LA and JM Cook."
doAuthorsPlain(DeciterState, int) - Method in class uk.ac.soton.harvester.DoAuthors
doAuthorsPlain controls the gamut of author name list possibilities.
documentid - Variable in class uk.ac.soton.harvester.DeciterState
documentid holds the id which is passed to the harvester from "The System".
DoDate - class uk.ac.soton.harvester.DoDate.
doDate recognises the occurence of the year of publication.
DoDate() - Constructor for class uk.ac.soton.harvester.DoDate
 
dodecite_simple(String, String, String, PrintWriter) - Method in class uk.ac.soton.harvester.Deciter
dodecite_simple handles the whole deciting process for a single citation (sub)entry.
dodecite(String, String, String, PrintWriter) - Method in class uk.ac.soton.harvester.Deciter
dodecite handles the whole deciting process for a single citation entry.
doHTML - Variable in class uk.ac.soton.harvester.DeciterState
doHTML is one of a group of booleans that control the format of deciter's output: text, HTML or XML.
doit(BufferedReader, PrintWriter) - Method in class uk.ac.soton.harvester.Deciter
doit initialises the citation harvesting process by setting up the debugging stream, storing the document id, creating an entity encoder if necessary and calling the readLoop to process all the citations.
DoNothing - class uk.ac.soton.harvester.DoNothing.
DoNothing is a null recogniser.
DoNothing() - Constructor for class uk.ac.soton.harvester.DoNothing
 
DoNumbering - class uk.ac.soton.harvester.DoNumbering.
doNumbering recognises the (possible) numbering scheme prefix for the citations.
DoNumbering() - Constructor for class uk.ac.soton.harvester.DoNumbering
 
DoPageRange - class uk.ac.soton.harvester.DoPageRange.
doPageRange recognises the occurrence of the article's page range.
DoPageRange() - Constructor for class uk.ac.soton.harvester.DoPageRange
 
DoPlace - class uk.ac.soton.harvester.DoPlace.
doPlace attempts to recognise the place of publication of a book citation from the misc section.
DoPlace() - Constructor for class uk.ac.soton.harvester.DoPlace
 
DoPublication - class uk.ac.soton.harvester.DoPublication.
doPublication recognises the occurrence of the journal title.
DoPublication() - Constructor for class uk.ac.soton.harvester.DoPublication
 
DoPublisher - class uk.ac.soton.harvester.DoPublisher.
doPublisher attempts to recognise the publisher of a book citation as a string that immediately precedes the place string in the citation.
DoPublisher() - Constructor for class uk.ac.soton.harvester.DoPublisher
 
doReadLoop(BufferedReader, PrintWriter) - Method in class uk.ac.soton.harvester.Deciter
doReadLoop performs a read loop, reading a line from the input, and processing and printing it to the output.
DoTidyUpMisc - class uk.ac.soton.harvester.DoTidyUpMisc.
tidyUpMisc ensures that the misc section doesn't start with the end of the title section's markup.
DoTidyUpMisc() - Constructor for class uk.ac.soton.harvester.DoTidyUpMisc
 
DoTitle - class uk.ac.soton.harvester.DoTitle.
doTitle recognises the extent of the article title.
DoTitle() - Constructor for class uk.ac.soton.harvester.DoTitle
 
doTXT - Variable in class uk.ac.soton.harvester.DeciterState
doTXT is one of a group of booleans that control the format of deciter's output: text, HTML or XML.
DoVolAndIssue - class uk.ac.soton.harvester.DoVolAndIssue.
doVolAndIssue uses the settings of miscb and misce (arranged by doPageRange) and looks inside that range for the journal volume and issue substrings.
DoVolAndIssue() - Constructor for class uk.ac.soton.harvester.DoVolAndIssue
 
DoWhiteSpace - class uk.ac.soton.harvester.DoWhiteSpace.
doWhiteSpace skips any leading whitespace.
DoWhiteSpace() - Constructor for class uk.ac.soton.harvester.DoWhiteSpace
 
doXML - Variable in class uk.ac.soton.harvester.DeciterState
doXML is one of a group of booleans that control the format of deciter's output: text, HTML or XML.
DoXXXId - class uk.ac.soton.harvester.DoXXXId.
doXXXId tries to recognise the appearance of an XXX eprint identifier within a string.
DoXXXId() - Constructor for class uk.ac.soton.harvester.DoXXXId
 
ds - Variable in class uk.ac.soton.harvester.CitationOutput
state of the deciter, including the attribute markers used to define the output strings.

E

ee - Static variable in class uk.ac.soton.harvester.Utils
ee is an entity encoder object which contains the mapping from (non-)ASCII to ISO-Latin1 entity names.
encode(String) - Method in class uk.ac.soton.harvester.EntityEncoder
encode encodes any unusual characters in a string as ISOLAtin-1 entities.
endofdate - Variable in class uk.ac.soton.harvester.DeciterState
endofdate stores the offset of the first significant character the year substring was matched.
EntityEncoder - class uk.ac.soton.harvester.EntityEncoder.
EntityEncoder is a convenience class that allows the deciter class to directly code entity strings without using an EntityWriter.
EntityEncoder() - Constructor for class uk.ac.soton.harvester.EntityEncoder
 
EntityReader - class uk.ac.soton.harvester.EntityReader.
EntityReader extends the behaviour of BufferedReader so that any ISO-Latin-1 entities are replaced by their ASCII/Unicode characters.
EntityReader(Reader) - Constructor for class uk.ac.soton.harvester.EntityReader
The main constructor allows an EntityReader to be based on any kind of Reader.
EntityWriter - class uk.ac.soton.harvester.EntityWriter.
EntityWriter extends the behaviour of PrintWriter so that any non-ASCII characters are printed by their ISO-Latin-1 entity names.
EntityWriter(BufferedWriter) - Constructor for class uk.ac.soton.harvester.EntityWriter
The main constructor allows an EntityWriter to be based on any kind of BufferedWriter.
entLookup(String) - Method in class uk.ac.soton.harvester.EntityReader
entLookup is a wrapper function which guarantees a char for an entity name.
entName(char) - Method in class uk.ac.soton.harvester.EntityEncoder
entName is a wrapper function which guarantees a safe name for a character position.
entName(char) - Method in class uk.ac.soton.harvester.EntityWriter
entName is a wrapper function which guarantees a safe name for a character position.
entString(String) - Method in class uk.ac.soton.harvester.EntityReader
entString decodes any unusual characters in a string from ISOLAtin-1 entities.
entString(String) - Method in class uk.ac.soton.harvester.EntityWriter
entString encodes any unusual characters in a string as ISOLAtin-1 entities.
extended - Variable in class uk.ac.soton.harvester.DeciterState
extended is a debugging relic which controls whether the original author string is emitted along with the rest of the XML output for immediacy of comparison.
EXTRA - Static variable in class uk.ac.soton.harvester.Deciter
EXTRA is the index of the object in the AttributeMarkers array that recognises the position of any extra features (e.g.

F

firstAuthor - Variable in class uk.ac.soton.harvester.DeciterState
firstAuthor stores the first named author from the splitAuthor() method for subsequent use in a multiCite situation.
firstNameFirstHint - Variable in class uk.ac.soton.harvester.DeciterState
firstNameFirstHint declares that the citation style tends to put the first name before the surname, at least after the initial author has been dealt with (surnames always come first for first authors so that you can see the primary sort key).

G

getArticleId() - Method in class uk.ac.soton.harvester.OUTHarvester
getArticleId accessor method for the id
getCitations(String, String, String, String[]) - Static method in class uk.ac.soton.harvester.harness
getCitations performs citation harvesting on a specific data file.
getInputFile() - Method in class uk.ac.soton.harvester.OUTHarvester
getInputFile accessor method for inFile

H

harness - class uk.ac.soton.harvester.harness.
harness is a driver class that creates a citation harvesting object and applies it to a specific data file.
harness() - Constructor for class uk.ac.soton.harvester.harness
 
harvest(BufferedWriter) - Method in class uk.ac.soton.harvester.OUTHarvester
Create the appropriate readers and writers, make a deciter object, set its hints/options and invoke its doit method to make it to do the dirty work.
hint_Author1 - Variable in class uk.ac.soton.harvester.DeciterState
hint_Author1 declares that a very simple scheme for recognising the extent of an author sequence is in force.
HTMLOutput - class uk.ac.soton.harvester.HTMLOutput.
HTMLOutput puts the citation data onto the specified printwriter in HTMLformat.
HTMLOutput(DeciterState) - Constructor for class uk.ac.soton.harvester.HTMLOutput
 

I

iciSWe(String, String) - Static method in class uk.ac.soton.harvester.Utils
iciSWe "ignore case of initial" version of startsWith used to make "Del " and "del " match.
iciSWp(String, String) - Static method in class uk.ac.soton.harvester.Utils
iciSWp is the same as iciSWe except it looks for punctuation instead of a space.
id - Variable in class uk.ac.soton.harvester.OUTHarvester
id holds the identifier of the current article
in - Variable in class uk.ac.soton.harvester.OUTHarvester
in simply holds a reference to the input reader.
inFile - Variable in class uk.ac.soton.harvester.OUTHarvester
inFile holds the file to which the input reader is to be connected.
isBook(DeciterState) - Static method in class uk.ac.soton.harvester.Utils
isBook is a utility method that encapsulates a naive heuristic (oh, alright then, hack) for determining whether the citation was to a book/thesis or not.
isDash(char) - Static method in class uk.ac.soton.harvester.Utils
isDash recognises the characters from all the character sets which could correspond to a "dash".
isInitial(String) - Static method in class uk.ac.soton.harvester.Utils
isInitial checks to see whether the current word is in fact an inital / a set of initials as opposed to a surname.
ISOLatHashTable - class uk.ac.soton.harvester.ISOLatHashTable.
ISOLatHashTable provides a hash table which is already filled in with a mapping between the ISOLatin-1 entity names and the character positions by which they are represented.
ISOLatHashTable() - Constructor for class uk.ac.soton.harvester.ISOLatHashTable
Default constructor creates the pre-filled HashTable.
ISOLatRevHashTable - class uk.ac.soton.harvester.ISOLatRevHashTable.
ISOLatRevHashTable is the inverse of ISOLatHashTable, and provides a hash table which is already filled in with an inverse mapping between the ISOLatin-1 entity names and the character positions by which they are represented.
ISOLatRevHashTable() - Constructor for class uk.ac.soton.harvester.ISOLatRevHashTable
Default constructor creates the pre-filled HashTable.
isProceedings(DeciterState) - Static method in class uk.ac.soton.harvester.Utils
isProceedings is a utility method that encapsulates a naive heuristic (oh, alright then, hack) for determining whether the citation was to a conf/workshop proceedings
iss - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
issb - Variable in class uk.ac.soton.harvester.DeciterState
issb and isse store the beginning and end offsets of the year substring of the citation.
isse - Variable in class uk.ac.soton.harvester.DeciterState
issb and isse store the beginning and end offsets of the year substring of the citation.

L

line - Variable in class uk.ac.soton.harvester.DeciterState
line contains the whole citation input line from which the fields are eventually teased.
lowerCaseNameComponent(String) - Static method in class uk.ac.soton.harvester.Utils
lowerCaseNameComponent recognises those words which start with a lowercase letter which are in fact parts of names.
lowercaseOrHyphen(String, int) - Static method in class uk.ac.soton.harvester.Utils
lowercaseOrHyphen is a utility method that recognises valid characters (ie [a-z-]) within an XXX eprint article identifier.

M

main(String[]) - Static method in class uk.ac.soton.harvester.harness
main is the driver for the citation processing.
markAttribute(DeciterState, int) - Method in interface uk.ac.soton.harvester.AttributeMarker
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoAuthors
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoAuthors2
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoDate
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoNothing
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoNumbering
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoPageRange
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoPlace
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoPublication
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoPublisher
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoTidyUpMisc
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoTitle
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoVolAndIssue
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoWhiteSpace
 
markAttribute(DeciterState, int) - Method in class uk.ac.soton.harvester.DoXXXId
 
maxi - Variable in class uk.ac.soton.harvester.DeciterState
maxi is the maximum valid offset that can be used with the charAt() method of the string which is the current line.
MDashCiteSep - Variable in class uk.ac.soton.harvester.DeciterState
MDashCiteSep is the 3-emdash sytring which is used to separate some forms of citation (see multiCiteMDashHint).
misc - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
miscb - Variable in class uk.ac.soton.harvester.DeciterState
miscb and misce store the beginning and end offsets of the miscellaneous (unused and unrecognised) substring of the citation.
misce - Variable in class uk.ac.soton.harvester.DeciterState
miscb and misce store the beginning and end offsets of the miscellaneous (unused and unrecognised) substring of the citation.
multiCiteMDashHint - Variable in class uk.ac.soton.harvester.DeciterState
multiCiteMDashHint declares that the citations of a single author may appear to be grouped together as a single entry.
multiCiteSharesAuthorHint - Variable in class uk.ac.soton.harvester.DeciterState
multiCiteSharesAuthorHint declares that the citations of a single author may be grouped together as a single entry.

N

N_AMS - Static variable in class uk.ac.soton.harvester.Deciter
N_AMS is the number of AttributeMarkers that are used.
nCites - Variable in class uk.ac.soton.harvester.DeciterState
nCites holds the number of citations processed for the current article.
noForenameHint - Variable in class uk.ac.soton.harvester.DeciterState
noForeNameHint declares that it is unlinkely that a forename will be given with the surname.
notAuthor - Variable in class uk.ac.soton.harvester.DeciterState
notAuthor is the first potential author-string token which seems to not be an author name.
NUMBERING - Static variable in class uk.ac.soton.harvester.Deciter
NUMBERING is the index of the object in the AttributeMarkers array that recognises any initial preporocessing before the recognition proper gets underway.

O

options - Variable in class uk.ac.soton.harvester.OUTHarvester
options holds the set of hints that control the deciter process.
OUTHarvester - class uk.ac.soton.harvester.OUTHarvester.
The OUTHarvester class sets up the environment for the real citation processing.
OUTHarvester() - Constructor for class uk.ac.soton.harvester.OUTHarvester
 
output(PrintWriter) - Method in class uk.ac.soton.harvester.CitationOutput
output renders the citation onto an output file according to some format.
output(PrintWriter) - Method in class uk.ac.soton.harvester.HTMLOutput
 
output(PrintWriter) - Method in class uk.ac.soton.harvester.TXTOutput
 
output(PrintWriter) - Method in class uk.ac.soton.harvester.XMLOutput
 

P

pagb - Variable in class uk.ac.soton.harvester.DeciterState
pagb and page store the beginning and end offsets of the page range substring of the citation.
page - Variable in class uk.ac.soton.harvester.DeciterState
pagb and page store the beginning and end offsets of the page range substring of the citation.
PAGERANGE - Static variable in class uk.ac.soton.harvester.Deciter
PAGERANGE is the index of the object in the AttributeMarkers array that recognises the position of the pagerange in the citation string.
pages - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
PCDATA(String) - Method in class uk.ac.soton.harvester.EntityEncoder
PCDATA is just an alias for encode.
PCDATA(String) - Static method in class uk.ac.soton.harvester.Utils
PCDATA is a convenience method to access the entity encoder.
place - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
PLACE - Static variable in class uk.ac.soton.harvester.Deciter
PLACE is the index of the object in the AttributeMarkers array that recognises the position of the place of publication in the citation string.
placeb - Variable in class uk.ac.soton.harvester.DeciterState
placeb and placee store the beginning and end offsets of the place name if this citation corresponds to a book.
placee - Variable in class uk.ac.soton.harvester.DeciterState
placeb and placee store the beginning and end offsets of the place name if this citation corresponds to a book.
post(PrintWriter) - Method in class uk.ac.soton.harvester.CitationOutput
post allows the outputter to emit a footer or postamble for the article citations as a body.
post(PrintWriter) - Method in class uk.ac.soton.harvester.XMLOutput
 
POSTPROCESS - Static variable in class uk.ac.soton.harvester.Deciter
POSTPROCESS is the index of the object in the AttributeMarkers array that performs any subsequent postprocessing and rationalisation of the marker values.
pre(PrintWriter) - Method in class uk.ac.soton.harvester.CitationOutput
pre allows the outputter to emit a header or preamble for the article citations as a body.
pre(PrintWriter) - Method in class uk.ac.soton.harvester.XMLOutput
 
prepare() - Method in class uk.ac.soton.harvester.CitationOutput
prepare is a method used by the subclasses 'output' methods to prepare the attribute strings from the attribute markers in the deciter state.
PREPROCESS - Static variable in class uk.ac.soton.harvester.Deciter
PREPROCESS is the index of the object in the AttributeMarkers array that performs any initial preprocessing before the recognition proper gets underway.
println(String) - Method in class uk.ac.soton.harvester.EntityWriter
println prints an entity-encoded version of its parameter.
pub - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
pubb - Variable in class uk.ac.soton.harvester.DeciterState
pubb and pube store the beginning and end offsets of the publication (ie journal) substring of the citation.
pube - Variable in class uk.ac.soton.harvester.DeciterState
pubb and pube store the beginning and end offsets of the publication (ie journal) substring of the citation.
PUBLICATION - Static variable in class uk.ac.soton.harvester.Deciter
PUBLICATION is the index of the object in the AttributeMarkers array that recognises the position of the journal title in the citation string.
publish - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
PUBLISH - Static variable in class uk.ac.soton.harvester.Deciter
PUBLISH is the index of the object in the AttributeMarkers array that recognises the position of the publisher in the citation string.
publishb - Variable in class uk.ac.soton.harvester.DeciterState
 
publishe - Variable in class uk.ac.soton.harvester.DeciterState
 

R

readLine() - Method in class uk.ac.soton.harvester.EntityReader
 
rest - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.

S

setArticleId(String) - Method in class uk.ac.soton.harvester.OUTHarvester
setArticleId sets the article id
setAttributeMarker(int, AttributeMarker) - Method in class uk.ac.soton.harvester.Deciter
setAttributeMarker allows the recogniser for a particular attribute to be changed.
setAttributeMarker(int, String) - Method in class uk.ac.soton.harvester.Deciter
setAttributeMarker allows the recogniser for a particular attribute to be changed.
setAttributeMarker(String, String) - Method in class uk.ac.soton.harvester.Deciter
a version of setAttributeMarker which is useful for argv.
setCitationOutput(CitationOutput) - Method in class uk.ac.soton.harvester.Deciter
setCitationOutput specifies the citation output object.
setCitationOutput(String) - Method in class uk.ac.soton.harvester.Deciter
setCitationOutput specifies the citation output object.
setDebugging(boolean) - Static method in class uk.ac.soton.harvester.Utils
setDebugging controls whether DEBUG messages are printed or not.
setDeciterState(DeciterState) - Method in class uk.ac.soton.harvester.CitationOutput
 
setInputFile(File) - Method in class uk.ac.soton.harvester.OUTHarvester
setInputFile sets the value of inFile and tries to open a BufferedReader on that file.
setNewCitation(String) - Method in class uk.ac.soton.harvester.DeciterState
 
setOptions(String[]) - Method in class uk.ac.soton.harvester.OUTHarvester
store the options (i.e. processing hints) for the deciter to use and change its behaviour.
split_multiCitation(String, String, String, PrintWriter) - Method in class uk.ac.soton.harvester.Deciter
split_multiCitation If significant citation material is found to be left over with a multiCite hint in operation, it may be assumed that another citation occurrence has been found and dodecite may be called recursively.
splitAuthors(String, DeciterState, PrintWriter) - Static method in class uk.ac.soton.harvester.CitationOutput
splitAuthors is one of the most complex methods of the deciter class.
splitPages(String, PrintWriter) - Method in class uk.ac.soton.harvester.CitationOutput
splitPages outputs the identified page range on the given output stream.
substring(String, int, int) - Static method in class uk.ac.soton.harvester.Utils
This is just a safe version of substring

T

tit - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
titb - Variable in class uk.ac.soton.harvester.DeciterState
titb and tite store the beginning and end offsets of the title substring of the citation.
tite - Variable in class uk.ac.soton.harvester.DeciterState
titb and tite store the beginning and end offsets of the title substring of the citation.
TITLE - Static variable in class uk.ac.soton.harvester.Deciter
TITLE is the index of the object in the AttributeMarkers array that recognises the position of the title in the citation string.
toInitials(String) - Static method in class uk.ac.soton.harvester.Utils
toInitials turns a set of "forenames" to an appropriate set of separated, correctly delimited initials.
TXTOutput - class uk.ac.soton.harvester.TXTOutput.
TXTOutput puts the citation data onto the specified printwriter in plain ASCII text format.
TXTOutput(DeciterState) - Constructor for class uk.ac.soton.harvester.TXTOutput
 

U

Utils - class uk.ac.soton.harvester.Utils.
Utils is a place for miscellaneous utility methods to try to control class bloat!
Utils() - Constructor for class uk.ac.soton.harvester.Utils
 

V

vol - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
volb - Variable in class uk.ac.soton.harvester.DeciterState
volb and vole store the beginning and end offsets of the volume substring of the citation.
vole - Variable in class uk.ac.soton.harvester.DeciterState
volb and vole store the beginning and end offsets of the volume substring of the citation.
VOLUMEISSUE - Static variable in class uk.ac.soton.harvester.Deciter
VOLUMEISSUE is the index of the object in the AttributeMarkers array that recognises the position of the volume and issue in the citation string.

X

XMLOutput - class uk.ac.soton.harvester.XMLOutput.
XMLOutput puts the citation data onto the specified printwriter in XML format.
XMLOutput(DeciterState) - Constructor for class uk.ac.soton.harvester.XMLOutput
 
xxxb - Variable in class uk.ac.soton.harvester.DeciterState
xxxb and xxxe store the beginning and end offsets of the XXX id string of the citation.
xxxe - Variable in class uk.ac.soton.harvester.DeciterState
xxxb and xxxe store the beginning and end offsets of the XXX id string of the citation.
xxxHint - Variable in class uk.ac.soton.harvester.DeciterState
xxxHint states that the article is from the XXX archive, ie is a physics preprint publication.
xxxid - Variable in class uk.ac.soton.harvester.CitationOutput
The citation attribute strings are filled in by the prepare method from the data in the attribute markers stored in the deciter state.
xxxId(String) - Static method in class uk.ac.soton.harvester.Utils
xxxId recognises strings which are XXX citation ids.

A C D E F G H I L M N O P R S T U V X