org.apache.manifoldcf.crawler.connectors.webcrawler
Class WebcrawlerConnector.FindHandler
java.lang.Object
org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.FindHandler
- All Implemented Interfaces:
- IDiscoveredLinkHandler
- Direct Known Subclasses:
- WebcrawlerConnector.FindHTMLFormHandler, WebcrawlerConnector.FindHTMLHrefHandler, WebcrawlerConnector.FindPreferredRedirectionHandler, WebcrawlerConnector.FindRedirectionHandler
- Enclosing class:
- WebcrawlerConnector
protected class WebcrawlerConnector.FindHandler
- extends java.lang.Object
- implements IDiscoveredLinkHandler
This class is used to discover links in a session login context.
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
parentURI
protected java.lang.String parentURI
targetURI
protected java.lang.String targetURI
WebcrawlerConnector.FindHandler
public WebcrawlerConnector.FindHandler(java.lang.String parentURI)
noteDiscoveredLink
public void noteDiscoveredLink(java.lang.String rawURL)
throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
- Inform the world of a discovered link.
- Specified by:
noteDiscoveredLink in interface IDiscoveredLinkHandler
- Parameters:
rawURL - the raw discovered URL. This may be relative, malformed, or otherwise unsuitable for use until its final form is achieved.
- Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
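Because rawURL may be relative or malformed, a handler of this kind generally has to resolve it against parentURI before recording a target. The following is a minimal sketch of that idea, not the actual ManifoldCF implementation. LinkHandlerSketch and FindHandlerSketch are hypothetical stand-ins for IDiscoveredLinkHandler and FindHandler, the "first usable link wins" policy is an assumption, and ManifoldCFException is left out so the example compiles on its own with only java.net.URI.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical stand-in for IDiscoveredLinkHandler. The real interface
// declares ManifoldCFException, omitted here to keep the sketch self-contained.
interface LinkHandlerSketch {
    void noteDiscoveredLink(String rawURL);
}

// Illustrative handler: remembers the first link that resolves to an
// absolute URI against the parent document's URI.
class FindHandlerSketch implements LinkHandlerSketch {
    protected final String parentURI;
    protected String targetURI = null;

    FindHandlerSketch(String parentURI) {
        this.parentURI = parentURI;
    }

    @Override
    public void noteDiscoveredLink(String rawURL) {
        // Assumption: once a target has been found, later links are ignored.
        if (targetURI != null || rawURL == null) {
            return;
        }
        try {
            // Resolve a possibly relative link against the parent URI.
            URI resolved = new URI(parentURI).resolve(new URI(rawURL.trim())).normalize();
            if (resolved.isAbsolute()) {
                targetURI = resolved.toString();
            }
        } catch (URISyntaxException e) {
            // Malformed link: skip it and wait for the next discovered link.
        }
    }

    public String getTargetURI() {
        return targetURI;
    }

    public static void main(String[] args) {
        FindHandlerSketch handler = new FindHandlerSketch("http://example.com/login/index.html");
        handler.noteDiscoveredLink("ht tp://bad url"); // skipped: not a valid URI
        handler.noteDiscoveredLink("next.html");       // relative link, resolved against the parent
        System.out.println(handler.getTargetURI());
    }
}
```

In this sketch the subclasses listed above would differ only in which links they accept; the real subclasses may apply additional filtering that this page does not describe.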
getTargetURI
public java.lang.String getTargetURI()