org.apache.manifoldcf.crawler.connectors.webcrawler
Interface IHTMLHandler

All Superinterfaces:
IDiscoveredLinkHandler, IMetaTagHandler
All Known Implementing Classes:
WebcrawlerConnector.FindHTMLFormHandler, WebcrawlerConnector.FindHTMLHrefHandler, WebcrawlerConnector.ProcessActivityHTMLHandler

public interface IHTMLHandler
extends IDiscoveredLinkHandler, IMetaTagHandler

This interface describes the functionality needed by an HTML processor in order to handle an HTML document.


Method Summary
 void noteAHREF(java.lang.String rawURL)
          Note discovered href
 void noteFormEnd()
          Note the end of a form
 void noteFormInput(java.util.Map inputAttributes)
          Note an input tag
 void noteFormStart(java.util.Map formAttributes)
          Note the start of a form
 void noteFRAMESRC(java.lang.String rawURL)
          Note discovered FRAME SRC
 void noteIMGSRC(java.lang.String rawURL)
          Note discovered IMG SRC
 void noteLINKHREF(java.lang.String rawURL)
          Note discovered href
 
Methods inherited from interface org.apache.manifoldcf.crawler.connectors.webcrawler.IDiscoveredLinkHandler
noteDiscoveredLink
 
Methods inherited from interface org.apache.manifoldcf.crawler.connectors.webcrawler.IMetaTagHandler
noteMetaTag
 

Method Detail

noteFormStart

void noteFormStart(java.util.Map formAttributes)
                   throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note the start of a form

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteFormInput

void noteFormInput(java.util.Map inputAttributes)
                   throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note an input tag

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteFormEnd

void noteFormEnd()
                 throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note the end of a form

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteAHREF

void noteAHREF(java.lang.String rawURL)
               throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note discovered href

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteLINKHREF

void noteLINKHREF(java.lang.String rawURL)
                  throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note discovered href

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteIMGSRC

void noteIMGSRC(java.lang.String rawURL)
                throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note discovered IMG SRC

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

noteFRAMESRC

void noteFRAMESRC(java.lang.String rawURL)
                  throws org.apache.manifoldcf.core.interfaces.ManifoldCFException
Note discovered FRAME SRC

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException