org.apache.manifoldcf.crawler.connectors.webcrawler
Class WebcrawlerConnector.OuterContextClass

java.lang.Object
  extended by org.apache.manifoldcf.agents.common.XMLContext
      extended by org.apache.manifoldcf.crawler.connectors.webcrawler.WebcrawlerConnector.OuterContextClass
Enclosing class:
WebcrawlerConnector

protected class WebcrawlerConnector.OuterContextClass
extends org.apache.manifoldcf.agents.common.XMLContext

This class handles the outermost XML context for the feed document.


Field Summary
protected  java.lang.String documentURI
          The document URI
protected  IXMLHandler handler
          The link handler
protected  int outerTagCount
          The number of valid feed signals seen so far
 
Fields inherited from class org.apache.manifoldcf.agents.common.XMLContext
localname, namespaceURI, previousContext, qname, theseAttributes, theStream
 
Constructor Summary
WebcrawlerConnector.OuterContextClass(org.apache.manifoldcf.agents.common.XMLStream theStream, java.lang.String documentURI, IXMLHandler handler)
           
 
Method Summary
protected  org.apache.manifoldcf.agents.common.XMLContext beginTag(java.lang.String namespaceURI, java.lang.String localName, java.lang.String qName, org.xml.sax.Attributes atts)
          Handle the start of a tag by selecting the correct second-level parsing context
 void checkIfValidFeed()
          Check whether the feed was valid
protected  void endTag()
          Handle the end of a tag
 
Methods inherited from class org.apache.manifoldcf.agents.common.XMLContext
characters, cleanup, endDocument, endElement, getAttribute, getLocalname, getNamespaceURI, getQname, startDocument, startElement, tagCleanup, tagContents
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

outerTagCount

protected int outerTagCount
The number of valid feed signals seen so far


documentURI

protected java.lang.String documentURI
The document URI


handler

protected IXMLHandler handler
The link handler

Constructor Detail

WebcrawlerConnector.OuterContextClass

public WebcrawlerConnector.OuterContextClass(org.apache.manifoldcf.agents.common.XMLStream theStream,
                                             java.lang.String documentURI,
                                             IXMLHandler handler)
Method Detail

checkIfValidFeed

public void checkIfValidFeed()
Check whether the feed was valid
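
The idea of counting "valid feed signals" on the outermost tag and checking the count afterwards can be illustrated with plain SAX. This is a minimal sketch, not the connector's implementation: the class OuterTagCounter, the method looksLikeFeed, and the set of recognized root tags (rss, feed, rdf:RDF) are assumptions made for illustration, and treating malformed XML as "not a feed" is a design choice of the sketch.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

/** Hypothetical stand-in for illustration; not part of ManifoldCF. */
class OuterTagCounter extends DefaultHandler {
  private int depth = 0;          // current element nesting depth
  private int outerTagCount = 0;  // number of feed-like outermost tags seen

  @Override
  public void startElement(String uri, String localName, String qName, Attributes atts) {
    // Only the outermost element (depth 0) can signal a feed.
    if (depth == 0 && isFeedRoot(qName)) {
      outerTagCount++;
    }
    depth++;
  }

  @Override
  public void endElement(String uri, String localName, String qName) {
    depth--;
  }

  // Assumed root tags: RSS 2.0, Atom, and RSS 1.0 (RDF).
  private static boolean isFeedRoot(String qName) {
    return qName.equals("rss") || qName.equals("feed") || qName.equals("rdf:RDF");
  }

  /** True if at least one feed signal was seen on the outermost tag. */
  boolean isValidFeed() {
    return outerTagCount > 0;
  }

  /** Parses the XML text and reports whether its root element looks like a feed. */
  static boolean looksLikeFeed(String xml) {
    try {
      // The default factory is not namespace-aware, so qName carries the raw tag name.
      SAXParser parser = SAXParserFactory.newInstance().newSAXParser();
      OuterTagCounter counter = new OuterTagCounter();
      parser.parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), counter);
      return counter.isValidFeed();
    } catch (SAXException e) {
      return false;  // malformed XML is treated as "not a feed" in this sketch
    } catch (ParserConfigurationException | IOException e) {
      throw new IllegalStateException(e);
    }
  }
}
```

A caller would parse the whole document first and only then ask whether it was a feed, which mirrors the separation between tag handling and a final validity check.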


beginTag

protected org.apache.manifoldcf.agents.common.XMLContext beginTag(java.lang.String namespaceURI,
                                                                  java.lang.String localName,
                                                                  java.lang.String qName,
                                                                  org.xml.sax.Attributes atts)
                                                           throws org.apache.manifoldcf.core.interfaces.ManifoldCFException,
                                                                  org.apache.manifoldcf.agents.interfaces.ServiceInterruption
Handle the start of a tag by selecting the correct second-level parsing context

Overrides:
beginTag in class org.apache.manifoldcf.agents.common.XMLContext
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
org.apache.manifoldcf.agents.interfaces.ServiceInterruption
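
The pattern of an outer context choosing a second-level context when a tag begins can be sketched without the library. Everything below is hypothetical: ContextSketch, Context, OuterContext, NamedContext, and the token-based driver are simplified stand-ins, not the XMLContext or XMLStream API, and the convention that a null return means "no specialized context" is an assumption of the sketch.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

/** Hypothetical miniature of a context-switching parser; not ManifoldCF's XMLContext. */
class ContextSketch {

  /** A parsing context; beginTag may hand back a child context for the new element. */
  static class Context {
    Context beginTag(String qName) {
      return null;  // assumed convention: null means "no specialized context"
    }

    void endTag() {
      // nothing to do by default
    }
  }

  /** Outermost context: decides which second-level context handles the root tag. */
  static class OuterContext extends Context {
    private final List<String> events;

    OuterContext(List<String> events) {
      this.events = events;
    }

    @Override
    Context beginTag(String qName) {
      if (qName.equals("rss") || qName.equals("feed")) {
        events.add("switch:" + qName);
        return new NamedContext(qName, events);
      }
      return super.beginTag(qName);
    }
  }

  /** Second-level context that records when its own element ends. */
  static class NamedContext extends Context {
    private final String name;
    private final List<String> events;

    NamedContext(String name, List<String> events) {
      this.name = name;
      this.events = events;
    }

    @Override
    void endTag() {
      events.add("end:" + name);
    }
  }

  /** Drives the contexts over a flat token list: "<x" opens element x, ">" closes the current one. */
  static List<String> run(String... tokens) {
    List<String> events = new ArrayList<>();
    Deque<Context> stack = new ArrayDeque<>();
    stack.push(new OuterContext(events));
    for (String token : tokens) {
      if (token.startsWith("<")) {
        Context child = stack.peek().beginTag(token.substring(1));
        // Fall back to a do-nothing context when no specialized one is returned.
        stack.push(child != null ? child : new Context());
      } else {
        stack.pop().endTag();
      }
    }
    return events;
  }
}
```

Keeping one context object per open element means each level only needs to know about its direct children, which is why the outer context can stay small.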

endTag

protected void endTag()
               throws org.apache.manifoldcf.core.interfaces.ManifoldCFException,
                      org.apache.manifoldcf.agents.interfaces.ServiceInterruption
Handle the end of a tag

Overrides:
endTag in class org.apache.manifoldcf.agents.common.XMLContext
Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
org.apache.manifoldcf.agents.interfaces.ServiceInterruption