org.apache.manifoldcf.crawler.connectors.webcrawler
Class WebcrawlerConnector.FindPreferredRedirectionHandler

java.lang.Object
  extended by org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.FindHandler
      extended by org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.FindPreferredRedirectionHandler
All Implemented Interfaces:
IDiscoveredLinkHandler, IRedirectionHandler
Enclosing class:
WebcrawlerConnector

protected class WebcrawlerConnector.FindPreferredRedirectionHandler
extends WebcrawlerConnector.FindHandler
implements IRedirectionHandler

This class handles redirections encountered during state transitions.


Field Summary
protected  java.util.regex.Pattern redirectionURIPattern
           
 
Fields inherited from class org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.FindHandler
parentURI, targetURI
 
Constructor Summary
WebcrawlerConnector.FindPreferredRedirectionHandler(java.lang.String parentURI, java.util.regex.Pattern redirectionURIPattern)
           
 
Method Summary
 void noteDiscoveredLink(java.lang.String rawURL)
          Overrides noteDiscoveredLink in WebcrawlerConnector.FindHandler.
 
Methods inherited from class org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.FindHandler
getTargetURI
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

redirectionURIPattern

protected java.util.regex.Pattern redirectionURIPattern
Constructor Detail

WebcrawlerConnector.FindPreferredRedirectionHandler

public WebcrawlerConnector.FindPreferredRedirectionHandler(java.lang.String parentURI,
                                                           java.util.regex.Pattern redirectionURIPattern)
Method Detail

noteDiscoveredLink

public void noteDiscoveredLink(java.lang.String rawURL)
                        throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Overrides noteDiscoveredLink in WebcrawlerConnector.FindHandler.

Specified by:
noteDiscoveredLink in interface IDiscoveredLinkHandler
Overrides:
noteDiscoveredLink in class WebcrawlerConnector.FindHandler
Parameters:
rawURL - the raw discovered URL. It may be relative, malformed, or otherwise unsuitable for use until it has been converted to its final form.
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
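
Example (illustrative only):
This page does not describe how noteDiscoveredLink uses parentURI and redirectionURIPattern, so the following is a minimal standalone sketch of one plausible reading, not the connector's actual code. It assumes that each raw URL is resolved against the parent URI, that the first resolved URL matching the pattern is kept as the target, and that later links are ignored. The class name PreferredRedirectionSketch is hypothetical, and java.net.URI resolution stands in for whatever URL normalization the connector really performs.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.regex.Pattern;

// Hypothetical stand-in for the handler; it does not depend on ManifoldCF.
final class PreferredRedirectionSketch {
    private final String parentURI;
    private final Pattern redirectionURIPattern;
    // Stays null until a usable redirection target has been seen.
    private String targetURI = null;

    PreferredRedirectionSketch(String parentURI, Pattern redirectionURIPattern) {
        this.parentURI = parentURI;
        this.redirectionURIPattern = redirectionURIPattern;
    }

    // Assumed behavior: keep the first link that resolves cleanly and matches the pattern.
    void noteDiscoveredLink(String rawURL) {
        if (targetURI != null || rawURL == null) {
            return; // a target was already chosen, or there is nothing to examine
        }
        String resolved;
        try {
            // Turn a possibly relative link into an absolute one.
            resolved = new URI(parentURI).resolve(new URI(rawURL)).toString();
        } catch (URISyntaxException e) {
            return; // malformed links are skipped in this sketch
        }
        // A null pattern is treated as "accept anything" (an assumption).
        if (redirectionURIPattern == null || redirectionURIPattern.matcher(resolved).find()) {
            targetURI = resolved;
        }
    }

    String getTargetURI() {
        return targetURI;
    }
}
```

With a parent URI of http://example.com/app/page.html and the pattern /login\.jsp$, the sketch skips a malformed link and a non-matching link, then keeps the first link that resolves to a matching URL. The sketch uses Matcher.find() so that the pattern may match any part of the resolved URL; whether the real handler requires a full match is not stated here.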