org.apache.manifoldcf.crawler.connectors.webcrawler
Interface IDiscoveredLinkHandler

All Known Subinterfaces:
IHTMLHandler, IRedirectionHandler, IXMLHandler
All Known Implementing Classes:
WebcrawlerConnector.FindHandler, WebcrawlerConnector.FindHTMLFormHandler, WebcrawlerConnector.FindHTMLHrefHandler, WebcrawlerConnector.FindPreferredRedirectionHandler, WebcrawlerConnector.FindRedirectionHandler, WebcrawlerConnector.ProcessActivityHTMLHandler, WebcrawlerConnector.ProcessActivityLinkHandler, WebcrawlerConnector.ProcessActivityRedirectionHandler, WebcrawlerConnector.ProcessActivityXMLHandler

public interface IDiscoveredLinkHandler

This interface describes the functionality needed by a link extractor to note a discovered link.


Method Summary
 void noteDiscoveredLink(java.lang.String rawURL)
          Inform the world of a discovered link.
 

Method Detail

noteDiscoveredLink

void noteDiscoveredLink(java.lang.String rawURL)
                        throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Inform the world of a discovered link.

Parameters:
rawURL - the raw discovered URL. It may be relative, malformed, or otherwise unsuitable for use until it has been converted to its final form.
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
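
Example (illustrative sketch only; not taken from the ManifoldCF sources). To compile on its own, the block declares local stand-ins for IDiscoveredLinkHandler and ManifoldCFException. CollectingLinkHandler, getRawLinks, resolveOrNull, the null check, and the example.com URLs are hypothetical names and choices introduced for this illustration. The sketch shows a handler that records raw links exactly as reported, and one possible way a caller might later turn a raw link into a final form with java.net.URI.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;

// Stand-in for org.apache.manifoldcf.core.interfaces.ManifoldCFException,
// declared locally so this sketch compiles without ManifoldCF on the classpath.
class ManifoldCFException extends Exception {
    ManifoldCFException(String message) {
        super(message);
    }
}

// Local copy of the interface as documented on this page.
interface IDiscoveredLinkHandler {
    void noteDiscoveredLink(String rawURL) throws ManifoldCFException;
}

// Hypothetical handler: stores every raw link as reported, without validating it.
class CollectingLinkHandler implements IDiscoveredLinkHandler {
    private final List<String> rawLinks = new ArrayList<>();

    @Override
    public void noteDiscoveredLink(String rawURL) throws ManifoldCFException {
        // Assumption for this sketch: a null link is treated as an error.
        if (rawURL == null) {
            throw new ManifoldCFException("rawURL must not be null");
        }
        rawLinks.add(rawURL);
    }

    List<String> getRawLinks() {
        return rawLinks;
    }

    // Hypothetical helper: resolve a raw link against the URL of the document
    // it was found in. Returns null if the raw link cannot be parsed as a URI.
    static URI resolveOrNull(String documentURL, String rawURL) {
        try {
            return URI.create(documentURL).resolve(rawURL);
        } catch (IllegalArgumentException e) {
            // Thrown by URI.create and URI.resolve(String) for malformed input.
            return null;
        }
    }
}

public class LinkHandlerSketch {
    public static void main(String[] args) throws ManifoldCFException {
        CollectingLinkHandler handler = new CollectingLinkHandler();

        // A link extractor would make calls like these while parsing a page.
        handler.noteDiscoveredLink("../about.html");
        handler.noteDiscoveredLink("ht tp://bad url");

        String documentURL = "http://example.com/docs/index.html";
        for (String raw : handler.getRawLinks()) {
            URI resolved = CollectingLinkHandler.resolveOrNull(documentURL, raw);
            System.out.println(raw + " -> " + (resolved == null ? "(unusable)" : resolved));
        }
    }
}
```

The handler deliberately accepts relative and malformed input, since the parameter description above says the raw URL may be either; any conversion to a usable form is left to a later step.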