nz.ac.waikato.mcennis.rat.crawler.filter
Class HyperGraphSnowball

java.lang.Object
  extended by nz.ac.waikato.mcennis.rat.crawler.filter.HyperGraphSnowball
All Implemented Interfaces:
CrawlerFilter

public class HyperGraphSnowball
extends java.lang.Object
implements CrawlerFilter

Implements a Hypergraph extension to the Snowball sampling method. See Wasserman and Faust 95 for the original method. No publication yet on this sampling method. The parameters read by this filter are:


Constructor Summary
HyperGraphSnowball()
           
 
Method Summary
 void build(int mode, int total)
          Create a new hypergraph snowball with maximum link depth regarldess of relation and maximum link depth by the target relation.
 boolean check(java.lang.String site)
          Should the URL this string represents be retrieved
 boolean check(java.lang.String site, Properties parameters)
          Should the URL this string represents be retrieved, given the parameters provided
 void load(java.lang.String site)
          Submit the given site to the filter chain without retrieving it.
 void load(java.lang.String site, Properties parameters)
          Submit the given site - parameter combination to the filter chain without retrieving it.
 HyperGraphSnowball prototype()
          Creates a new default version of this class with no common data excepting static variables
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

HyperGraphSnowball

public HyperGraphSnowball()
Method Detail

check

public boolean check(java.lang.String site)
Description copied from interface: CrawlerFilter
Should the URL this string represents be retrieved

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL of the site to be retrieved
Returns:
retrieve or not retrieve

check

public boolean check(java.lang.String site,
                     Properties parameters)
Description copied from interface: CrawlerFilter
Should the URL this string represents be retrieved, given the parameters provided

Specified by:
check in interface CrawlerFilter
Parameters:
site - URL to be retrieved
parameters - parameters governing the retrieval
Returns:
retrieve or not retrieve

build

public void build(int mode,
                  int total)
Create a new hypergraph snowball with maximum link depth regarldess of relation and maximum link depth by the target relation. These values are set by parameter.

Parameters:
mode - maximum link depth in the target relation
total - maximum link depth

load

public void load(java.lang.String site)
Description copied from interface: CrawlerFilter
Submit the given site to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added

load

public void load(java.lang.String site,
                 Properties parameters)
Description copied from interface: CrawlerFilter
Submit the given site - parameter combination to the filter chain without retrieving it.

Specified by:
load in interface CrawlerFilter
Parameters:
site - URL to be added

prototype

public HyperGraphSnowball prototype()
Description copied from interface: CrawlerFilter
Creates a new default version of this class with no common data excepting static variables

Specified by:
prototype in interface CrawlerFilter
Returns:
new filter of the same class as the parent

Get Relational Analysis Toolkit at SourceForge.net. Fast, secure and Free Open Source software downloads