nz.ac.waikato.mcennis.rat.crawler.filter
Class DomainRestriction

java.lang.Object
  extended by nz.ac.waikato.mcennis.rat.crawler.filter.DomainRestriction
All Implemented Interfaces:
CrawlerFilter

public class DomainRestriction
extends java.lang.Object
implements CrawlerFilter

Filter that rejects all URLs not from the given domain


Constructor Summary
DomainRestriction()
           
 
Method Summary
 void build(java.lang.String domain, boolean not)
          Creates a filter that schedules parsing only if the hostname belongs to the given domain.
 boolean check(java.lang.String site)
          Should the URL this string represents be retrieved
 boolean check(java.lang.String site, Properties parameters)
          Should the URL this string represents be retrieved, given the parameters provided
 void load(java.lang.String site)
          Submit the given site to the filter chain without retrieving it.
 void load(java.lang.String site, Properties parameters)
          Submit the given site - parameter combination to the filter chain without retrieving it.
 DomainRestriction prototype()
          Creates a new default version of this class with no common data excepting static variables
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

DomainRestriction

public DomainRestriction()
Method Detail

check

public boolean check(java.lang.String site)
Description copied from interface: CrawlerFilter
Should the URL this string represents be retrieved

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL of the site to be retrieved
Returns:
retrieve or not retrieve

check

public boolean check(java.lang.String site,
                     Properties parameters)
Description copied from interface: CrawlerFilter
Should the URL this string represents be retrieved, given the parameters provided

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL to be retrieved
parameters - parameters governing the retrieval
Returns:
retrieve or not retrieve

build

public void build(java.lang.String domain,
                  boolean not)
Creates a filter that schedules parsing only if the hostname belongs to the given domain.

Parameters:
domain - hostname suffix to be accepted
not - invert selection to anything but the given domain.

load

public void load(java.lang.String site)
Description copied from interface: CrawlerFilter
Submit the given site to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added

load

public void load(java.lang.String site,
                 Properties parameters)
Description copied from interface: CrawlerFilter
Submit the given site - parameter combination to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added

prototype

public DomainRestriction prototype()
Description copied from interface: CrawlerFilter
Creates a new default version of this class with no common data excepting static variables

Specified by:
prototype in interface CrawlerFilter
Returns:
new filter of the same class as the parent

Get Relational Analysis Toolkit at SourceForge.net. Fast, secure and Free Open Source software downloads