public class Crawler extends ACrawler
Constructor and Description |
---|
Crawler(MainTab caller,
java.lang.String authorFilter,
java.lang.String titleFilter,
int transLev,
int dbThreadLimit,
int ppThreadLimit)
Sets up the crawler.
|
Modifier and Type | Method and Description |
---|---|
protected void |
crawl()
Gets publication database list, sets up search URL and starts the crawl
for all of them.
|
void |
run()
Calls crawl(), and measures running time in 'time' field.
|
getTime, interrupt, launch, scheduleCrawlers, waitForCrawlers
activeCount, checkAccess, clone, countStackFrames, currentThread, destroy, dumpStack, enumerate, getAllStackTraces, getContextClassLoader, getDefaultUncaughtExceptionHandler, getId, getName, getPriority, getStackTrace, getState, getThreadGroup, getUncaughtExceptionHandler, holdsLock, interrupted, isAlive, isDaemon, isInterrupted, join, join, join, resume, setContextClassLoader, setDaemon, setDefaultUncaughtExceptionHandler, setName, setPriority, setUncaughtExceptionHandler, sleep, sleep, start, stop, stop, suspend, toString, yield
public Crawler(MainTab caller, java.lang.String authorFilter, java.lang.String titleFilter, int transLev, int dbThreadLimit, int ppThreadLimit)
caller
- The caller MainTab object, that Crawler will notify at the end.authorFilter
- Search for this author.titleFilter
- Filter for this title.transLev
- 0: only search results, 1: referrer publications also 2: referrer of referrers also will be grabbed.dbThreadLimit
- Maximum count of PubListCrawler threads = databases crawled at the same time.ppThreadLimit
- Maximum count of PubPageCrawler threads = publication pages crawled at the same time.public void run()
ACrawler