java.lang.Object
  org.apache.manifoldcf.crawler.connectors.webcrawler.DataCache
public class DataCache
This class caches the data fetched for a specific URL. The data is fetched early and kept, so that (1) an accurate data length can be found, and (2) a version checksum can be computed.
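The idea of fetching once to obtain both a length and a checksum can be illustrated with a minimal sketch. This is not the DataCache implementation: the class `SpoolSketch`, its `Spooled` result type, the use of a temporary file, and the choice of SHA-256 are all assumptions made for illustration. It reads a stream a single time, keeps the bytes on disk, and records the byte count and digest along the way.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Hypothetical sketch; none of these names come from ManifoldCF.
public class SpoolSketch {

    /** Where the bytes were kept, how many there were, and their hex digest. */
    static final class Spooled {
        final Path file;
        final long length;
        final String checksum;

        Spooled(Path file, long length, String checksum) {
            this.file = file;
            this.length = length;
            this.checksum = checksum;
        }
    }

    /** Reads the stream once, copying it to a temp file while counting and hashing. */
    static Spooled spool(InputStream in) throws IOException, NoSuchAlgorithmException {
        // SHA-256 is an assumption; the source does not say which checksum is used.
        MessageDigest digest = MessageDigest.getInstance("SHA-256");
        Path file = Files.createTempFile("datacache-sketch", ".tmp");
        long length = 0L;
        try (OutputStream out = Files.newOutputStream(file)) {
            byte[] buffer = new byte[8192];
            int n;
            while ((n = in.read(buffer)) != -1) {
                out.write(buffer, 0, n);
                digest.update(buffer, 0, n);
                length += n;
            }
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest()) {
            hex.append(String.format("%02x", b));
        }
        return new Spooled(file, length, hex.toString());
    }

    public static void main(String[] args) throws Exception {
        byte[] body = "<html>hello</html>".getBytes(StandardCharsets.UTF_8);
        Spooled s = spool(new ByteArrayInputStream(body));
        System.out.println(s.length + " bytes, checksum " + s.checksum);
        Files.delete(s.file); // clean up the kept copy once it is no longer needed
    }
}
```

Because the length is counted from the bytes actually read, it does not depend on a Content-Length header being present or correct.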
| Nested Class Summary | |
|---|---|
| protected static class | DataCache.DocumentData - This class represents everything we need to know about a document that's getting passed from the getDocumentVersions() phase to the processDocuments() phase. |
| Field Summary | |
|---|---|
| static java.lang.String | _rcsid |
| protected java.util.HashMap | cacheData |
| Constructor Summary | |
|---|---|
| DataCache() | Constructor. |
| Method Summary | |
|---|---|
| java.lang.String | addData(org.apache.manifoldcf.crawler.interfaces.IVersionActivity activities, java.lang.String documentIdentifier, IThrottledConnection connection) - Add a data entry into the cache. |
| void | deleteData(java.lang.String documentIdentifier) - Delete specified item of data. |
| java.lang.String | getContentType(java.lang.String documentIdentifier) - Get the content type. |
| java.io.InputStream | getData(java.lang.String documentIdentifier) - Fetch binary data entry from the cache. |
| long | getDataLength(java.lang.String documentIdentifier) - Fetch binary data length. |
| java.lang.String | getReferralURI(java.lang.String documentIdentifier) - Get the referral URI. |
| int | getResponseCode(java.lang.String documentIdentifier) - Get the response code. |
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final java.lang.String _rcsid
protected java.util.HashMap cacheData
| Constructor Detail |
|---|
public DataCache()
| Method Detail |
|---|
public java.lang.String addData(org.apache.manifoldcf.crawler.interfaces.IVersionActivity activities,
java.lang.String documentIdentifier,
IThrottledConnection connection)
throws org.apache.manifoldcf.core.interfaces.ManifoldCFException,
org.apache.manifoldcf.agents.interfaces.ServiceInterruption

Parameters:
documentIdentifier - is the document identifier (url).
connection - is the connection, upon which a fetch has been done that needs to be cached.

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException
org.apache.manifoldcf.agents.interfaces.ServiceInterruption

public int getResponseCode(java.lang.String documentIdentifier)

Parameters:
documentIdentifier - is the document identifier.

public java.lang.String getContentType(java.lang.String documentIdentifier)

Parameters:
documentIdentifier - is the document identifier.

public java.lang.String getReferralURI(java.lang.String documentIdentifier)

Parameters:
documentIdentifier - is the document identifier.

public long getDataLength(java.lang.String documentIdentifier)

Parameters:
documentIdentifier - is the document identifier.

public java.io.InputStream getData(java.lang.String documentIdentifier)
throws org.apache.manifoldcf.core.interfaces.ManifoldCFException

Parameters:
documentIdentifier - is the document identifier (url).

Throws:
org.apache.manifoldcf.core.interfaces.ManifoldCFException

public void deleteData(java.lang.String documentIdentifier)

Parameters:
documentIdentifier - is the document identifier (url).
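
The add, read, and delete lifecycle suggested by the methods above can be sketched with a simplified in-memory stand-in. Everything here is hypothetical: `CacheLifecycleSketch`, `MiniCache`, and `Entry` are illustration-only names, the stand-in takes raw bytes instead of an IThrottledConnection, and the behavior for a missing identifier (zero length, null stream) is an assumption rather than documented DataCache behavior.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; this is not the ManifoldCF DataCache class.
public class CacheLifecycleSketch {

    /** Stand-in for one cached fetch result. */
    static final class Entry {
        final byte[] data;
        final int responseCode;
        final String contentType;

        Entry(byte[] data, int responseCode, String contentType) {
            this.data = data;
            this.responseCode = responseCode;
            this.contentType = contentType;
        }
    }

    /** In-memory cache keyed by document identifier (URL). */
    static final class MiniCache {
        private final Map<String, Entry> entries = new HashMap<>();

        void add(String documentIdentifier, byte[] data, int responseCode, String contentType) {
            entries.put(documentIdentifier, new Entry(data, responseCode, contentType));
        }

        int getResponseCode(String documentIdentifier) {
            Entry e = entries.get(documentIdentifier);
            return e == null ? -1 : e.responseCode; // -1 for "unknown" is an assumption
        }

        String getContentType(String documentIdentifier) {
            Entry e = entries.get(documentIdentifier);
            return e == null ? null : e.contentType;
        }

        long getDataLength(String documentIdentifier) {
            Entry e = entries.get(documentIdentifier);
            return e == null ? 0L : e.data.length; // assumed behavior for a missing entry
        }

        InputStream getData(String documentIdentifier) {
            Entry e = entries.get(documentIdentifier);
            return e == null ? null : new ByteArrayInputStream(e.data);
        }

        void deleteData(String documentIdentifier) {
            entries.remove(documentIdentifier);
        }
    }

    public static void main(String[] args) throws Exception {
        MiniCache cache = new MiniCache();
        String url = "http://example.com/";

        // Earlier phase: the fetched content is cached under its identifier.
        cache.add(url, "hello".getBytes(StandardCharsets.UTF_8), 200, "text/html");

        // Later phase: the cached content is read back without a second fetch.
        System.out.println(cache.getResponseCode(url) + " " + cache.getContentType(url)
            + " " + cache.getDataLength(url) + " bytes");
        try (InputStream in = cache.getData(url)) {
            System.out.println("first byte: " + (char) in.read());
        }

        // Once the document has been handled, its entry is released.
        cache.deleteData(url);
    }
}
```

Deleting the entry after use keeps the cache from growing with every document seen in a crawl.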