java.lang.Object
  nz.ac.waikato.mcennis.rat.crawler.filter.BlockPreviousSite
public class BlockPreviousSite
Refuses to parse any previously seen site, identifying sites by their URL alone.
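The behaviour described above can be pictured as a set of already-seen URLs that is consulted before each retrieval. The following is only a minimal sketch under that assumption: `SeenUrlFilterSketch` is a hypothetical stand-in written for illustration and is not the source of `BlockPreviousSite`. In particular, whether the real `check` also records the URL is not stated here.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in for illustration only; not the BlockPreviousSite source.
class SeenUrlFilterSketch {
    // URLs that have already been submitted to the filter.
    private final Set<String> seen = new HashSet<>();

    // Returns true when the URL has not been seen yet, i.e. it may be retrieved.
    boolean check(String site) {
        return !seen.contains(site);
    }

    // Records the URL as seen without retrieving it.
    void load(String site) {
        seen.add(site);
    }

    public static void main(String[] args) {
        SeenUrlFilterSketch filter = new SeenUrlFilterSketch();
        String url = "http://example.org/a";
        System.out.println(filter.check(url)); // true: not seen yet
        filter.load(url);
        System.out.println(filter.check(url)); // false: now blocked
    }
}
```

Sites are compared as plain strings in this sketch, so two different spellings of the same address count as different sites.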
| Constructor Summary | |
|---|---|
| BlockPreviousSite() | |
| Method Summary | | |
|---|---|---|
| void | add(java.lang.String site) | Synonym for load(String site). |
| void | build(java.util.HashSet list, boolean not) | Builds a new filter. |
| boolean | check(java.lang.String site) | Returns whether the URL this string represents should be retrieved. |
| boolean | check(java.lang.String site, Properties parameters) | Returns whether the URL this string represents should be retrieved, given the parameters provided. |
| void | load(java.lang.String site) | Submits the given site to the filter chain without retrieving it. |
| void | load(java.lang.String site, Properties parameters) | Submits the given site and parameter combination to the filter chain without retrieving it. |
| BlockPreviousSite | prototype() | Creates a new default version of this class that shares no data with this instance except static variables. |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public BlockPreviousSite()
| Method Detail |
|---|
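public boolean check(java.lang.String site)
Description copied from interface: CrawlerFilter
Returns whether the URL this string represents should be retrieved.
Specified by: check in interface CrawlerFilter
Parameters:
site - URL of the site to be retrieved
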
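public boolean check(java.lang.String site,
Properties parameters)
Description copied from interface: CrawlerFilter
Returns whether the URL this string represents should be retrieved, given the parameters provided.
Specified by: check in interface CrawlerFilter
Parameters:
site - URL to be retrieved
parameters - parameters governing the retrieval
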
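To show where a `check` call might sit in a crawler, here is a hedged sketch of a crawl loop over an in-memory link graph. Everything in it (`CrawlLoopSketch`, `crawl`, the `links` map) is hypothetical and not part of the RAT API. A plain `HashSet` plays the role of the filter's state, with the `contains` test standing in for `check(site)` and the `add` call standing in for `load(site)`.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical crawl loop for illustration; none of these names belong to the RAT API.
class CrawlLoopSketch {
    // Visits the pages reachable from start, consulting the "filter" before each retrieval.
    static List<String> crawl(String start, Map<String, List<String>> links) {
        Set<String> seen = new HashSet<>();          // stands in for the filter's state
        Deque<String> frontier = new ArrayDeque<>(); // sites waiting to be considered
        List<String> retrieved = new ArrayList<>();  // sites that passed the filter, in order
        frontier.add(start);
        while (!frontier.isEmpty()) {
            String site = frontier.poll();
            if (seen.contains(site)) {               // analogue of check(site) returning false
                continue;
            }
            seen.add(site);                          // analogue of load(site)
            retrieved.add(site);
            for (String next : links.getOrDefault(site, List.of())) {
                frontier.add(next);
            }
        }
        return retrieved;
    }

    public static void main(String[] args) {
        Map<String, List<String>> links = Map.of(
            "a", List.of("b", "c"),
            "b", List.of("a", "c"),
            "c", List.of("a"));
        System.out.println(crawl("a", links)); // [a, b, c]
    }
}
```

Without the seen-site test, the cycle between `a`, `b`, and `c` would keep the loop running forever, which is the situation a previous-site filter is meant to prevent.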
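public void build(java.util.HashSet list,
boolean not)
Builds a new filter.
Parameters:
list - HashSet containing the sites to skip
not - selects whether the blocked sites or the new sites are skipped

public void add(java.lang.String site)
Synonym for load(String site).
Parameters:
site - site to be skipped

public void load(java.lang.String site)
Description copied from interface: CrawlerFilter
Submits the given site to the filter chain without retrieving it.
Specified by: load in interface CrawlerFilter
Parameters:
site - URL to be added
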
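The `not` flag of `build` inverts the sense of the list. The sketch below shows one plausible reading, and the mapping is an assumption rather than something this page confirms: with `not == false` the listed sites are refused, and with `not == true` only the listed sites are accepted. `InvertibleBlockListSketch` is a hypothetical class written for illustration.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration of an invertible block list; not the RAT implementation.
class InvertibleBlockListSketch {
    private Set<String> listed = new HashSet<>();
    private boolean not = false;

    // Replaces the filter's state with a copy of the given list and the inversion flag.
    void build(Set<String> list, boolean not) {
        this.listed = new HashSet<>(list);
        this.not = not;
    }

    // ASSUMPTION: not == false refuses listed sites; not == true accepts only listed sites.
    boolean check(String site) {
        return listed.contains(site) == not;
    }

    public static void main(String[] args) {
        InvertibleBlockListSketch filter = new InvertibleBlockListSketch();
        filter.build(Set.of("http://example.org/old"), false);
        System.out.println(filter.check("http://example.org/old")); // false
        System.out.println(filter.check("http://example.org/new")); // true
        filter.build(Set.of("http://example.org/old"), true);
        System.out.println(filter.check("http://example.org/old")); // true
        System.out.println(filter.check("http://example.org/new")); // false
    }
}
```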
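public void load(java.lang.String site,
Properties parameters)
Description copied from interface: CrawlerFilter
Submits the given site and parameter combination to the filter chain without retrieving it.
Specified by: load in interface CrawlerFilter
Parameters:
site - URL to be added

public BlockPreviousSite prototype()
Description copied from interface: CrawlerFilter
Creates a new default version of this class that shares no data with this instance except static variables.
Specified by: prototype in interface CrawlerFilter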
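The `prototype()` contract, a fresh default instance that shares no instance data with its source, can be sketched as follows. `PrototypeFilterSketch` is a hypothetical class, and the seen-URL set is an assumed internal detail used only to make the independence of the two instances visible.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration of the prototype contract; not the RAT implementation.
class PrototypeFilterSketch {
    // Per-instance state; a new prototype starts with its own empty set.
    private final Set<String> seen = new HashSet<>();

    void load(String site) {
        seen.add(site);
    }

    boolean check(String site) {
        return !seen.contains(site);
    }

    // Returns a new default instance rather than a copy of this one.
    PrototypeFilterSketch prototype() {
        return new PrototypeFilterSketch();
    }

    public static void main(String[] args) {
        PrototypeFilterSketch original = new PrototypeFilterSketch();
        original.load("http://example.org/a");
        PrototypeFilterSketch fresh = original.prototype();
        System.out.println(original.check("http://example.org/a")); // false
        System.out.println(fresh.check("http://example.org/a"));    // true
    }
}
```

Returning a default instance instead of a clone lets a caller obtain a filter of the same type for a separate crawl without inheriting the sites the first crawl has already seen.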