nz.ac.waikato.mcennis.rat.crawler.filter
Class BlockPreviousSiteReference

java.lang.Object
  extended by nz.ac.waikato.mcennis.rat.crawler.filter.BlockPreviousSiteReference
All Implemented Interfaces:
CrawlerFilter

public class BlockPreviousSiteReference
extends java.lang.Object
implements CrawlerFilter

Blocks previously seen sites, but only when they were requested with the same parameter settings.
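
The rule above can be illustrated with a minimal sketch. This is not the toolkit's implementation: `SeenSiteSketch` is a hypothetical stand-in, `java.util.Properties` is assumed in place of the toolkit's own `Properties` type (its package is not shown on this page), and it is an assumption that `check` records each combination it is asked about.

```java
import java.util.HashSet;
import java.util.Properties;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in, NOT the RAT class: remembers site + parameter
// combinations and blocks only exact repeats.
class SeenSiteSketch {
    private final Set<String> seen = new HashSet<String>();

    // Build a key that does not depend on the order the properties were set in.
    private static String key(String site, Properties parameters) {
        StringBuilder sb = new StringBuilder(site);
        for (String name : new TreeSet<String>(parameters.stringPropertyNames())) {
            sb.append('\n').append(name).append('=').append(parameters.getProperty(name));
        }
        return sb.toString();
    }

    // ASSUMPTION: true means "retrieve", and the combination is recorded as seen.
    boolean check(String site, Properties parameters) {
        return seen.add(key(site, parameters));
    }

    // Record a combination without asking whether it should be retrieved.
    void load(String site, Properties parameters) {
        seen.add(key(site, parameters));
    }

    public static void main(String[] args) {
        Properties depthOne = new Properties();
        depthOne.setProperty("depth", "1");
        Properties depthTwo = new Properties();
        depthTwo.setProperty("depth", "2");

        SeenSiteSketch filter = new SeenSiteSketch();
        System.out.println(filter.check("http://example.org/", depthOne)); // true: first visit
        System.out.println(filter.check("http://example.org/", depthOne)); // false: exact repeat
        System.out.println(filter.check("http://example.org/", depthTwo)); // true: new parameters
    }
}
```

In this sketch the same URL is retrieved again whenever its parameters differ, which is the distinction the class description draws.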


Constructor Summary
BlockPreviousSiteReference()
           
 
Method Summary
 void build(java.util.HashSet list, boolean not)
          Builds a new filter from the given set of site parsing requests and a flag indicating whether the filter is negated.
 boolean check(java.lang.String site)
          Determines whether the URL this string represents should be retrieved.
 boolean check(java.lang.String site, Properties parameters)
          Determines whether the URL this string represents should be retrieved, given the parameters provided.
 void load(java.lang.String site)
          Submit the given site to the filter chain without retrieving it.
 void load(java.lang.String site, Properties properties)
          Submit the given site - parameter combination to the filter chain without retrieving it.
 BlockPreviousSiteReference prototype()
          Creates a new default instance of this class that shares no data with the original except static variables.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BlockPreviousSiteReference

public BlockPreviousSiteReference()
Method Detail

check

public boolean check(java.lang.String site)
Description copied from interface: CrawlerFilter
Determines whether the URL this string represents should be retrieved.

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL of the site to be retrieved
Returns:
true if the URL should be retrieved, false otherwise

check

public boolean check(java.lang.String site,
                     Properties parameters)
Description copied from interface: CrawlerFilter
Determines whether the URL this string represents should be retrieved, given the parameters provided.

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL to be retrieved
parameters - parameters governing the retrieval
Returns:
true if the URL should be retrieved with these parameters, false otherwise

build

public void build(java.util.HashSet list,
                  boolean not)
Builds a new filter from the given set of site parsing requests and a flag indicating whether the filter is negated. A null HashSet retains the previous HashSet.

Parameters:
list - HashSet of site references to skip
not - whether the references in the list are skipped, or new references are skipped instead
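
A minimal sketch of how `build` might combine the set and the negation flag follows. `BuildSketch` is a hypothetical stand-in, not the toolkit's class. The page does not say which value of `not` selects which behaviour, so the mapping below (false skips listed sites, true skips unlisted ones) is an assumption, as is the use of `HashSet<String>` in place of the raw `HashSet`.

```java
import java.util.HashSet;

// Hypothetical stand-in, NOT the RAT class.
class BuildSketch {
    private HashSet<String> blocked = new HashSet<String>();
    private boolean not = false;

    // A null set keeps the previously supplied set, as the description states.
    void build(HashSet<String> list, boolean not) {
        if (list != null) {
            this.blocked = list;
        }
        this.not = not;
    }

    // ASSUMPTION: not == false skips listed sites; not == true skips unlisted sites.
    boolean check(String site) {
        boolean listed = blocked.contains(site);
        return not ? listed : !listed;
    }

    public static void main(String[] args) {
        HashSet<String> sites = new HashSet<String>();
        sites.add("http://example.org/a");

        BuildSketch filter = new BuildSketch();
        filter.build(sites, false);
        System.out.println(filter.check("http://example.org/a")); // false: listed, so skipped
        System.out.println(filter.check("http://example.org/b")); // true: not listed

        filter.build(null, true); // null keeps the same set, only the flag changes
        System.out.println(filter.check("http://example.org/a")); // true
        System.out.println(filter.check("http://example.org/b")); // false
    }
}
```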

load

public void load(java.lang.String site,
                 Properties properties)
Description copied from interface: CrawlerFilter
Submit the given site - parameter combination to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added
properties - parameters associated with the URL

load

public void load(java.lang.String site)
Description copied from interface: CrawlerFilter
Submit the given site to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added

prototype

public BlockPreviousSiteReference prototype()
Description copied from interface: CrawlerFilter
Creates a new default instance of this class that shares no data with the original except static variables.

Specified by:
prototype in interface CrawlerFilter
Returns:
new filter of the same class as the parent
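
The contract described for `prototype` can be sketched as follows. `PrototypeSketch` and its `hasSeen` helper are hypothetical and only illustrate the idea that the returned filter starts empty and does not share instance state with the filter it was created from.

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical stand-in, NOT the RAT class.
class PrototypeSketch {
    // Per-instance state: a fresh prototype must not share this set.
    private final Set<String> seen = new HashSet<String>();

    void load(String site) {
        seen.add(site);
    }

    // Hypothetical helper used only to inspect the sketch's state.
    boolean hasSeen(String site) {
        return seen.contains(site);
    }

    // Returns a new default instance of the same class with its own empty state.
    PrototypeSketch prototype() {
        return new PrototypeSketch();
    }

    public static void main(String[] args) {
        PrototypeSketch original = new PrototypeSketch();
        original.load("http://example.org/");

        PrototypeSketch fresh = original.prototype();
        System.out.println(original.hasSeen("http://example.org/")); // true
        System.out.println(fresh.hasSeen("http://example.org/"));    // false
    }
}
```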
