org.apache.manifoldcf.crawler.connectors.webcrawler
Class WebcrawlerConnector.ProcessActivityLinkHandler

java.lang.Object
  extended by org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.ProcessActivityLinkHandler
All Implemented Interfaces:
IDiscoveredLinkHandler
Direct Known Subclasses:
WebcrawlerConnector.ProcessActivityHTMLHandler, WebcrawlerConnector.ProcessActivityRedirectionHandler, WebcrawlerConnector.ProcessActivityXMLHandler
Enclosing class:
WebcrawlerConnector

protected class WebcrawlerConnector.ProcessActivityLinkHandler
extends java.lang.Object
implements IDiscoveredLinkHandler

This class is the handler for links that get added into a IProcessActivity object.


Field Summary
protected  org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities
           
protected  java.lang.String contextDescription
           
protected  java.lang.String documentIdentifier
           
protected  WebcrawlerConnector.DocumentURLFilter filter
           
protected  java.lang.String linkType
           
 
Constructor Summary
WebcrawlerConnector.ProcessActivityLinkHandler(java.lang.String documentIdentifier, org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities, WebcrawlerConnector.DocumentURLFilter filter, java.lang.String contextDescription, java.lang.String linkType)
          Constructor.
 
Method Summary
 void noteDiscoveredLink(java.lang.String rawURL)
          Inform the world of a discovered link.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

documentIdentifier

protected java.lang.String documentIdentifier

activities

protected org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities

filter

protected WebcrawlerConnector.DocumentURLFilter filter

contextDescription

protected java.lang.String contextDescription

linkType

protected java.lang.String linkType
Constructor Detail

WebcrawlerConnector.ProcessActivityLinkHandler

public WebcrawlerConnector.ProcessActivityLinkHandler(java.lang.String documentIdentifier,
                                                      org.apache.manifoldcf.crawler.interfaces.IProcessActivity activities,
                                                      WebcrawlerConnector.DocumentURLFilter filter,
                                                      java.lang.String contextDescription,
                                                      java.lang.String linkType)
Constructor.

Method Detail

noteDiscoveredLink

public void noteDiscoveredLink(java.lang.String rawURL)
                        throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Inform the world of a discovered link.

Specified by:
noteDiscoveredLink in interface IDiscoveredLinkHandler
Parameters:
rawURL - is the raw discovered url. This may be relative, malformed, or otherwise unsuitable for use until final form is acheived.
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException