Package nz.ac.waikato.mcennis.rat.crawler.filter

Interface Summary
CrawlerFilter Interface for all algorithms implementing decisions on whether or not a given site or site - parameter combination should be scheduled for retrieval.
 

Class Summary
AndFilter Filter designed to return sites that match all of its sub-filters
AndFilterTest  
BlockPreviousSite refuse to parse all previously seen sites by their URL alone
BlockPreviousSiteReference Block previously seen sites only if they have the same parameter settings
BlockPreviousSiteReferenceTest  
BlockPreviousSiteTest  
CrawlerFilterFactory Factory class for classes implementing the CrawlerFilter interface.
CrawlerFilterFactoryTest  
DomainRestriction Filter that rejects all URLs not from the given domain
DomainRestrictionTest  
HyperGraphSnowball Implements a Hypergraph extension to the Snowball sampling method.
HyperGraphSnowballTest  
NullFilter Creates a filter that passes everything for parsing
NullFilterTest  
OrFilter Performs an Or set operation across the filter's sub-filters in order to determine whether or not to retrieve a given URL.
OrFilterTest  
SiteMatch This filter approves retrieval for any URL that matched the given regular expression.
SiteMatchTest  
SNASnowball Implements the Snowball sampling algorithm from Social Network Analysis Handbook by Wasserman and Faust 95.
SNASnowballTest  
StopCount Implements a filter with a global stop count that ceases retrieval after the given number of pages have been scheduled for retrieval.
StopCountTest  
XorFilter Creates a filter that performs an XOr operation upon the return values of the two sub-filters.
XorFilterTest  
 


Get Relational Analysis Toolkit at SourceForge.net. Fast, secure and Free Open Source software downloads