public interface IJobManager
This manager deals with jobs. Each job is associated with a repository connection and has a number of scheduling options: starting every n hours/days/weeks/months, starting on specific dates, or running "continuously" (which in effect establishes a priority queue based on modification frequency). The job itself also specifies "seeds" (starting points), which are the places where scanning begins. NOTE WELL: Every job is incremental. This means that, as part of the process of ingesting, the job checks for deletions among all the documents that it has scanned in the past.
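The incremental behavior noted above amounts to a set difference: any document scanned on a previous pass that does not show up on the current pass is a candidate for deletion. The following is a minimal sketch of that idea using plain Java collections. The class and method names are hypothetical, and this is not a description of how ManifoldCF implements the check internally.

```java
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not part of the ManifoldCF API.
class IncrementalScanSketch {
    // Returns the identifiers seen on a previous pass but not on the current one.
    static Set<String> findDeleted(Set<String> previouslyScanned, Set<String> currentlySeen) {
        Set<String> deleted = new TreeSet<>(previouslyScanned);
        deleted.removeAll(currentlySeen);
        return deleted;
    }

    public static void main(String[] args) {
        Set<String> previous = Set.of("doc-a", "doc-b", "doc-c");
        Set<String> current = Set.of("doc-a", "doc-c", "doc-d");
        System.out.println(findDeleted(previous, current)); // prints [doc-b]
    }
}
```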
| Field Summary | |
|---|---|
| static java.lang.String | _rcsid |
| static int | ACTION_REMOVE |
| static int | ACTION_RESCAN |
| static int | DOCSTATE_NEVERPROCESSED |
| static int | DOCSTATE_PREVIOUSLYPROCESSED |
| static int | DOCSTATUS_DELETING |
| static int | DOCSTATUS_EXPIRING |
| static int | DOCSTATUS_INACTIVE |
| static int | DOCSTATUS_PROCESSING |
| static int | DOCSTATUS_READYFOREXPIRATION |
| static int | DOCSTATUS_READYFORPROCESSING |
| static int | DOCSTATUS_WAITINGFOREVER |
| static int | DOCSTATUS_WAITINGFOREXPIRATION |
| static int | DOCSTATUS_WAITINGFORPROCESSING |
| Method Summary | |
|---|---|
boolean |
addDocument(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String docIDHash,
java.lang.String docID,
java.lang.String parentIdentifierHash,
java.lang.String relationshipType,
int hopcountMethod,
java.lang.String[] dataNames,
java.lang.Object[][] dataValues,
long currentTime,
double priority,
java.lang.String[] prereqEventNames)
Add a document to the queue. |
boolean[] |
addDocuments(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String[] docIDs,
java.lang.String parentIdentifierHash,
java.lang.String relationshipType,
int hopcountMethod,
java.lang.String[][] dataNames,
java.lang.Object[][][] dataValues,
long currentTime,
double[] priorities,
java.lang.String[][] prereqEventNames)
Add documents to the queue in bulk. |
boolean[] |
addDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String[] docIDs,
boolean overrideSchedule,
int hopcountMethod,
long currentTime,
double[] documentPriorities,
java.lang.String[][] prereqEventNames)
Add an initial set of documents to the queue. |
void |
addRemainingDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
int hopcountMethod)
Add an initial set of remaining documents to the queue. |
boolean |
beginEventSequence(java.lang.String eventName)
Begin an event sequence. |
boolean |
carrydownChangeDocument(DocumentDescription documentDescription,
long currentTime,
double docPriority)
Requeue a document because of carrydown changes. |
boolean[] |
carrydownChangeDocumentMultiple(DocumentDescription[] documentDescriptions,
long currentTime,
double[] docPriorities)
Requeue a document set because of carrydown changes. |
boolean |
checkIfOutputReference(java.lang.String connectionName)
See if there's a reference to an output connection name. |
boolean |
checkIfReference(java.lang.String connectionName)
See if there's a reference to a connection name. |
boolean |
checkJobActive(java.lang.Long jobID)
Verify that a specific job is indeed still active. |
boolean |
checkJobBusy(java.lang.Long jobID)
Verify whether a job is still processing documents or no longer has any outstanding active documents. |
void |
completeEventSequence(java.lang.String eventName)
Complete an event sequence. |
IJobDescription |
createJob()
Create a new job. |
void |
deinstall()
Uninstall the job manager's tables. |
void |
deleteIngestedDocumentIdentifiers(DocumentDescription[] identifiers)
Delete ingested document identifiers (as part of deleting the owning job). |
void |
deleteJob(java.lang.Long id)
Delete a job. |
void |
deleteJobsReadyForDelete()
Delete jobs in need of being deleted (which are marked "ready for delete"). |
void |
doneDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
boolean isPartial,
int hopcountMethod)
Signal that a seeding pass has been done. |
boolean |
errorAbort(java.lang.Long jobID,
java.lang.String errorText)
Abort a running job due to a fatal error condition. |
void |
exportConfiguration(java.io.OutputStream os)
Export configuration. |
boolean[] |
findHopCounts(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String linkType,
int limit,
int hopcountMethod)
Get the specified hop counts, with the limit as described. |
IJobDescription[] |
findJobsForConnection(java.lang.String connectionName)
Get the job IDs associated with a given connection name. |
DocumentDescription[] |
finishDocuments(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] parentIdentifierHashes,
int hopcountMethod)
Complete adding child documents to the queue, for a set of documents. |
void |
finishJobAborts(long timestamp,
java.util.ArrayList abortJobs)
Complete the sequence that aborts jobs and makes them runnable again. |
void |
finishJobs()
Put all eligible jobs in the "shutting down" state. |
IResultSet |
genDocumentStatus(java.lang.String connectionName,
StatusFilterCriteria filterCriteria,
SortOrder sortOrder,
int startRow,
int rowCount)
Run a 'document status' report. |
IResultSet |
genQueueStatus(java.lang.String connectionName,
StatusFilterCriteria filterCriteria,
SortOrder sortOrder,
BucketDescription idBucketDescription,
int startRow,
int rowCount)
Run a 'queue status' report. |
IJobDescription[] |
getAllJobs()
Load a sorted list of job descriptions. |
java.lang.String[] |
getAllSeeds(java.lang.Long jobID)
Get all the current seeds. |
JobStatus[] |
getAllStatus()
Get a list of all jobs, and their status information. |
DocumentSetAndFlags |
getExpiredDocuments(int n,
long currentTime)
Get up to the next n documents to be expired. |
JobStatus[] |
getFinishedJobs()
Get a list of completed jobs, and their statistics. |
JobStartRecord[] |
getJobsReadyForDelete()
Get the list of jobs that are ready for deletion. |
JobStartRecord[] |
getJobsReadyForInactivity()
Find the list of jobs that need to have their connectors notified of job completion. |
JobStartRecord[] |
getJobsReadyForSeeding(long currentTime)
Get the list of jobs that are ready for seeding. |
JobStartRecord[] |
getJobsReadyForStartup()
Get the list of jobs that are ready for startup. |
DocumentDescription[] |
getNextAlreadyProcessedReprioritizationDocuments(long currentTime,
int n)
Get a list of already-processed documents to reprioritize. |
DocumentSetAndFlags |
getNextCleanableDocuments(int n,
long currentTime)
Get list of cleanable document descriptions. |
DocumentDescription[] |
getNextDeletableDocuments(int n,
long currentTime)
Get list of deletable document descriptions. |
DocumentDescription[] |
getNextDocuments(int n,
long currentTime,
long interval,
BlockingDocuments blockingDocuments,
PerformanceStatistics statistics,
DepthStatistics scanRecord)
Get up to the next n document(s) to be fetched and processed. |
DocumentDescription[] |
getNextNotYetProcessedReprioritizationDocuments(long currentTime,
int n)
Get a list of not-yet-processed documents to reprioritize. |
JobStatus[] |
getRunningJobs()
Get a list of running jobs. |
JobStatus |
getStatus(java.lang.Long jobID)
Get the status of a job. |
void |
importConfiguration(java.io.InputStream is)
Import configuration. |
void |
inactivateJob(java.lang.Long jobID)
Inactivate a job, from the notification state. |
void |
install()
Install the job manager's tables. |
IJobDescription |
load(java.lang.Long id)
Load a job for editing. |
IJobDescription |
load(java.lang.Long id,
boolean readOnly)
Load a job. |
void |
manualAbort(java.lang.Long jobID)
Manually abort a running job. |
void |
manualAbortRestart(java.lang.Long jobID)
Manually restart a running job. |
void |
manualStart(java.lang.Long jobID)
Manually start a job. |
void |
markDocumentCompleted(DocumentDescription documentDescription)
Note completion of a document's processing by a job thread. |
void |
markDocumentCompletedMultiple(DocumentDescription[] documentDescriptions)
Note completion of processing of a set of documents by a job thread. |
DocumentDescription[] |
markDocumentDeleted(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
DocumentDescription documentDescription,
int hopcountMethod)
Note deletion as a result of a job thread processing a document. |
DocumentDescription[] |
markDocumentDeletedMultiple(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
DocumentDescription[] documentDescriptions,
int hopcountMethod)
Note deletion as a result of a job thread processing a set of documents. |
void |
noteConnectionChange(java.lang.String connectionName)
Note a change in connection configuration. |
void |
noteConnectorDeregistration(java.lang.String[] connectionNames)
Note the deregistration of a connector used by the specified connections. |
void |
noteConnectorRegistration(java.lang.String[] connectionNames)
Note the registration of a connector used by the specified connections. |
void |
noteJobDeleteStarted(java.lang.Long jobID,
long startTime)
Note job delete started. |
void |
noteJobSeeded(java.lang.Long jobID,
long startTime)
Note job seeded. |
void |
noteJobStarted(java.lang.Long jobID,
long startTime)
Note job started. |
void |
noteOutputConnectionChange(java.lang.String connectionName)
Note a change in output connection configuration. |
void |
noteOutputConnectorDeregistration(java.lang.String[] connectionNames)
Note the deregistration of an output connector used by the specified connections. |
void |
noteOutputConnectorRegistration(java.lang.String[] connectionNames)
Note the registration of an output connector used by the specified connections. |
void |
pauseJob(java.lang.Long jobID)
Pause a job. |
void |
prepareDeleteScan(java.lang.Long jobID)
Prepare for a delete scan. |
void |
prepareForStart()
Reset the job queue immediately before starting up. |
void |
prepareFullScan(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
int hopcountMethod)
Prepare for a full scan. |
void |
prepareIncrementalScan(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
int hopcountMethod)
Prepare for an incremental scan. |
void |
requeueDocument(DocumentDescription documentDescription,
java.lang.Long executeTime,
int action)
Requeue a document for further processing in the future. |
void |
requeueDocumentMultiple(DocumentDescription[] documentDescriptions,
java.lang.Long[] executeTimes,
int[] actions)
Requeue a set of documents for further processing in the future. |
void |
resetCleaningDocument(DocumentDescription documentDescription,
long checkTime)
Reset a cleaning document back to its former state. |
void |
resetCleaningDocumentMultiple(DocumentDescription[] documentDescriptions,
long checkTime)
Reset a set of cleaning documents for further processing in the future. |
void |
resetDeleteStartupWorkerStatus()
Reset as part of restoring delete startup threads. |
void |
resetDeletingDocument(DocumentDescription documentDescription,
long checkTime)
Reset a deleting document back to its former state. |
void |
resetDeletingDocumentMultiple(DocumentDescription[] documentDescriptions,
long checkTime)
Reset a set of deleting documents for further processing in the future. |
void |
resetDocCleanupWorkerStatus()
Reset as part of restoring doc cleanup threads. |
void |
resetDocDeleteWorkerStatus()
Reset as part of restoring doc delete threads. |
void |
resetDocument(DocumentDescription documentDescription,
long executeTime,
int action,
long failTime,
int failCount)
Reset an active document back to its former state. |
void |
resetDocumentMultiple(DocumentDescription[] documentDescriptions,
long executeTime,
int action,
long failTime,
int failCount)
Reset documents for further processing in the future. |
void |
resetDocumentWorkerStatus()
Reset as part of restoring document worker threads. |
void |
resetJobs(long currentTime,
java.util.ArrayList resetJobs)
Reset eligible jobs either back to the "inactive" state, or make them active again. |
void |
resetJobSchedule(java.lang.Long jobID)
Reset job schedule. |
void |
resetNotificationWorkerStatus()
Reset as part of restoring notification threads. |
void |
resetNotifyJob(java.lang.Long jobID)
Reset a job that is notifying back to "ready for notify" state. |
void |
resetSeedingWorkerStatus()
Reset as part of restoring seeding threads. |
void |
resetSeedJob(java.lang.Long jobID)
Reset a seeding job back to "active" state. |
void |
resetStartDeleteJob(java.lang.Long jobID)
Reset a job starting for delete back to "ready for delete" state. |
void |
resetStartupJob(java.lang.Long jobID)
Reset a starting job back to "ready for startup" state. |
void |
resetStartupWorkerStatus()
Reset as part of restoring startup threads. |
void |
restartJob(java.lang.Long jobID)
Restart a paused job. |
java.lang.String[] |
retrieveParentData(java.lang.Long jobID,
java.lang.String docIDHash,
java.lang.String dataName)
Retrieve specific parent data for a given document. |
CharacterInput[] |
retrieveParentDataAsFiles(java.lang.Long jobID,
java.lang.String docIDHash,
java.lang.String dataName)
Retrieve specific parent data for a given document. |
void |
save(IJobDescription jobDescription)
Save a job. |
void |
startJobs(long currentTime,
java.util.ArrayList unwaitList)
Start jobs based on schedule. |
void |
waitJobs(long currentTime,
java.util.ArrayList waitList)
Put active or paused jobs in wait state, if they've exceeded their window. |
void |
writeDocumentPriorities(long currentTime,
DocumentDescription[] descriptions,
double[] priorities)
Save a set of document priorities. |
| Field Detail |
|---|
static final java.lang.String _rcsid
static final int ACTION_RESCAN
static final int ACTION_REMOVE
static final int DOCSTATE_NEVERPROCESSED
static final int DOCSTATE_PREVIOUSLYPROCESSED
static final int DOCSTATUS_INACTIVE
static final int DOCSTATUS_PROCESSING
static final int DOCSTATUS_EXPIRING
static final int DOCSTATUS_DELETING
static final int DOCSTATUS_READYFORPROCESSING
static final int DOCSTATUS_READYFOREXPIRATION
static final int DOCSTATUS_WAITINGFORPROCESSING
static final int DOCSTATUS_WAITINGFOREXPIRATION
static final int DOCSTATUS_WAITINGFOREVER
| Method Detail |
|---|
void install()
throws ManifoldCFException
ManifoldCFException
void deinstall()
throws ManifoldCFException
ManifoldCFException
void exportConfiguration(java.io.OutputStream os)
throws java.io.IOException,
ManifoldCFException
java.io.IOException
ManifoldCFException
void importConfiguration(java.io.InputStream is)
throws java.io.IOException,
ManifoldCFException
java.io.IOException
ManifoldCFException
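Because both configuration methods above work on plain streams, configuration can be copied from one manager to another through an in-memory buffer. The sketch below shows that round trip. `ConfigStore` and `InMemoryConfigStore` are hypothetical stand-ins that mirror only the two stream signatures listed here so that the example runs without ManifoldCF; the real methods also declare `ManifoldCFException`, and the actual export format is not specified in this document.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;

class ConfigRoundTripSketch {
    // Copies configuration from one store to another through an in-memory buffer.
    static void copy(ConfigStore source, ConfigStore target) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        source.exportConfiguration(buffer);
        try (InputStream in = new ByteArrayInputStream(buffer.toByteArray())) {
            target.importConfiguration(in);
        }
    }

    public static void main(String[] args) throws IOException {
        InMemoryConfigStore a = new InMemoryConfigStore("job=nightly-crawl");
        InMemoryConfigStore b = new InMemoryConfigStore("");
        copy(a, b);
        System.out.println(b.config); // prints job=nightly-crawl
    }
}

// Hypothetical stand-in exposing only the two stream methods listed above.
// The real methods also declare ManifoldCFException, omitted here for brevity.
interface ConfigStore {
    void exportConfiguration(OutputStream os) throws IOException;
    void importConfiguration(InputStream is) throws IOException;
}

// Toy in-memory implementation; the string payload is an invented format.
class InMemoryConfigStore implements ConfigStore {
    String config;

    InMemoryConfigStore(String config) {
        this.config = config;
    }

    public void exportConfiguration(OutputStream os) throws IOException {
        os.write(config.getBytes(StandardCharsets.UTF_8));
    }

    public void importConfiguration(InputStream is) throws IOException {
        config = new String(is.readAllBytes(), StandardCharsets.UTF_8);
    }
}
```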
IJobDescription[] getAllJobs()
throws ManifoldCFException
ManifoldCFException
IJobDescription createJob()
throws ManifoldCFException
ManifoldCFException
void deleteJob(java.lang.Long id)
throws ManifoldCFException
id - is the job's identifier. This method will purge all the records belonging to the job from the database, as well as remove all documents indexed by the job from the index.
ManifoldCFException
IJobDescription load(java.lang.Long id)
throws ManifoldCFException
id - is the job's identifier.
ManifoldCFException
IJobDescription load(java.lang.Long id,
boolean readOnly)
throws ManifoldCFException
id - is the job's identifier.
readOnly - is true if a read-only object is desired.
ManifoldCFException
void save(IJobDescription jobDescription)
throws ManifoldCFException
jobDescription - is the job description.
ManifoldCFException
boolean checkIfReference(java.lang.String connectionName)
throws ManifoldCFException
connectionName - is the name of the connection.
ManifoldCFException
boolean checkIfOutputReference(java.lang.String connectionName)
throws ManifoldCFException
connectionName - is the name of the connection.
ManifoldCFException
IJobDescription[] findJobsForConnection(java.lang.String connectionName)
throws ManifoldCFException
connectionName - is the name of the connection.
ManifoldCFException
void prepareForStart()
throws ManifoldCFException
ManifoldCFException
void resetDocumentWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetSeedingWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetDocDeleteWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetDocCleanupWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetDeleteStartupWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetNotificationWorkerStatus()
throws ManifoldCFException
ManifoldCFException
void resetStartupWorkerStatus()
throws ManifoldCFException
ManifoldCFException
DocumentDescription[] getNextAlreadyProcessedReprioritizationDocuments(long currentTime,
int n)
throws ManifoldCFException
currentTime - is the current time stamp for this prioritization pass. Avoid picking up any documents that are labeled with this timestamp or after.
n - is the maximum number of document descriptions desired.
ManifoldCFException
DocumentDescription[] getNextNotYetProcessedReprioritizationDocuments(long currentTime,
int n)
throws ManifoldCFException
currentTime - is the current time stamp for this prioritization pass. Avoid picking up any documents that are labeled with this timestamp or after.
n - is the maximum number of document descriptions desired.
ManifoldCFException
void writeDocumentPriorities(long currentTime,
DocumentDescription[] descriptions,
double[] priorities)
throws ManifoldCFException
currentTime - is the time in milliseconds since epoch.
descriptions - are the document descriptions.
priorities - are the desired priorities.
ManifoldCFException
DocumentSetAndFlags getExpiredDocuments(int n,
long currentTime)
throws ManifoldCFException
n - is the maximum number of records desired.
currentTime - is the current time.
ManifoldCFException
DocumentDescription[] getNextDocuments(int n,
long currentTime,
long interval,
BlockingDocuments blockingDocuments,
PerformanceStatistics statistics,
DepthStatistics scanRecord)
throws ManifoldCFException
n - is the number of documents desired.
currentTime - is the current time; some fetches do not occur until a specific time.
interval - is the number of milliseconds that this set of documents should represent (for throttling).
blockingDocuments - is the place to record documents that were encountered, are eligible for reprioritization, but could not be queued due to throttling considerations.
statistics - are the current performance statistics per connection, which are used to balance the queue stuffing so that individual connections are not overwhelmed.
scanRecord - retains the bins from all documents encountered from the query, even those that were skipped due to being overcommitted.
ManifoldCFException
boolean checkJobActive(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job identifier.
ManifoldCFException
boolean checkJobBusy(java.lang.Long jobID)
throws ManifoldCFException
ManifoldCFException
void markDocumentCompletedMultiple(DocumentDescription[] documentDescriptions)
throws ManifoldCFException
documentDescriptions - are the description objects for the documents that were processed.
ManifoldCFException
void markDocumentCompleted(DocumentDescription documentDescription)
throws ManifoldCFException
documentDescription - is the description object for the document that was processed.
ManifoldCFException
DocumentDescription[] markDocumentDeletedMultiple(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
DocumentDescription[] documentDescriptions,
int hopcountMethod)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that were processed.
hopcountMethod - is either accurate, nodelete, or neverdelete.
ManifoldCFException
DocumentDescription[] markDocumentDeleted(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
DocumentDescription documentDescription,
int hopcountMethod)
throws ManifoldCFException
documentDescription - is the description object for the document that was processed.
hopcountMethod - is either accurate, nodelete, or neverdelete.
ManifoldCFException
boolean[] carrydownChangeDocumentMultiple(DocumentDescription[] documentDescriptions,
long currentTime,
double[] docPriorities)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that have had their parent carrydown information changed.
docPriorities - are the document priorities to assign to the documents, if needed.
ManifoldCFException
boolean carrydownChangeDocument(DocumentDescription documentDescription,
long currentTime,
double docPriority)
throws ManifoldCFException
documentDescription - is the description object for the document that has had its parent carrydown information changed.
docPriority - is the document priority to assign to the document, if needed.
ManifoldCFException
void requeueDocumentMultiple(DocumentDescription[] documentDescriptions,
java.lang.Long[] executeTimes,
int[] actions)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that were processed.
executeTimes - are the times that the documents should be rescanned. Null indicates "never".
actions - are what should be done when the time arrives. Choices are ACTION_RESCAN or ACTION_REMOVE.
ManifoldCFException
void requeueDocument(DocumentDescription documentDescription,
java.lang.Long executeTime,
int action)
throws ManifoldCFException
documentDescription - is the description object for the document that was processed.
executeTime - is the time that the document should be rescanned. Null indicates "never".
action - is what should be done when the time arrives. Choices are ACTION_RESCAN or ACTION_REMOVE.
ManifoldCFException
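The requeue methods above take a nullable `Long` execute time (null meaning "never") and, in the multiple form, parallel arrays of times and actions. The following sketch shows one way a caller might assemble those arguments. The helper class and its method names are hypothetical, and the numeric constant values are placeholders for illustration; real code would reference the constants defined on `IJobManager`.

```java
import java.util.Arrays;

// Hypothetical helper for building requeue arguments; not part of ManifoldCF.
class RequeueArgsSketch {
    // Placeholder values; the real constants are defined on IJobManager.
    static final int ACTION_RESCAN = 0;
    static final int ACTION_REMOVE = 1;

    // Builds the executeTimes array: currentTime + interval, or null ("never")
    // for entries that have no interval.
    static Long[] executeTimes(long currentTime, Long[] intervalsMillis) {
        Long[] times = new Long[intervalsMillis.length];
        for (int i = 0; i < intervalsMillis.length; i++) {
            times[i] = (intervalsMillis[i] == null)
                ? null
                : Long.valueOf(currentTime + intervalsMillis[i].longValue());
        }
        return times;
    }

    // Builds the actions array, parallel to executeTimes.
    static int[] actions(boolean[] removeWhenDue) {
        int[] result = new int[removeWhenDue.length];
        for (int i = 0; i < removeWhenDue.length; i++) {
            result[i] = removeWhenDue[i] ? ACTION_REMOVE : ACTION_RESCAN;
        }
        return result;
    }

    public static void main(String[] args) {
        Long[] times = executeTimes(1_000L, new Long[] {500L, null});
        int[] acts = actions(new boolean[] {false, true});
        System.out.println(Arrays.toString(times)); // prints [1500, null]
        System.out.println(Arrays.toString(acts));  // prints [0, 1]
    }
}
```

Keeping the two arrays the same length as the `documentDescriptions` array is the caller's job in this sketch; the source does not say how a length mismatch is handled.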
void resetDocumentMultiple(DocumentDescription[] documentDescriptions,
long executeTime,
int action,
long failTime,
int failCount)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that were processed.
executeTime - is the time that the documents should be rescanned.
failTime - is the time beyond which hard failure should occur.
failCount - is the number of permitted failures before a hard error is signalled.
ManifoldCFException
void resetDocument(DocumentDescription documentDescription,
long executeTime,
int action,
long failTime,
int failCount)
throws ManifoldCFException
documentDescription - is the description object for the document that was processed.
executeTime - is the time that the document should be rescanned.
failTime - is the time that the document should be considered to have failed, if it has not been successfully read until then.
failCount - is the number of permitted failures before a hard error is signalled.
ManifoldCFException
void resetDeletingDocumentMultiple(DocumentDescription[] documentDescriptions,
long checkTime)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that were processed.
checkTime - is the minimum time for the next cleaning attempt.
ManifoldCFException
void resetDeletingDocument(DocumentDescription documentDescription,
long checkTime)
throws ManifoldCFException
documentDescription - is the description object for the document that was cleaned.
checkTime - is the minimum time for the next cleaning attempt.
ManifoldCFException
void resetCleaningDocument(DocumentDescription documentDescription,
long checkTime)
throws ManifoldCFException
documentDescription - is the description object for the document that was cleaned.
checkTime - is the minimum time for the next cleaning attempt.
ManifoldCFException
void resetCleaningDocumentMultiple(DocumentDescription[] documentDescriptions,
long checkTime)
throws ManifoldCFException
documentDescriptions - is the set of description objects for the documents that were cleaned.
checkTime - is the minimum time for the next cleaning attempt.
ManifoldCFException
boolean[] addDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String[] docIDs,
boolean overrideSchedule,
int hopcountMethod,
long currentTime,
double[] documentPriorities,
java.lang.String[][] prereqEventNames)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
docIDHashes - are the hashes of the local document identifiers (primary key).
docIDs - are the local document identifiers.
overrideSchedule - is true if any existing document schedule should be overridden.
hopcountMethod - is either accurate, nodelete, or neverdelete.
currentTime - is the current time in milliseconds since epoch.
documentPriorities - are the document priorities corresponding to the document identifiers.
prereqEventNames - are the events that must be completed before each document can be processed.
ManifoldCFException
void addRemainingDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
docIDHashes - are the hash values of the local document identifiers.
hopcountMethod - is either accurate, nodelete, or neverdelete.
ManifoldCFException
void doneDocumentsInitial(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
boolean isPartial,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
isPartial - is set if the seeds provided are only a partial list. Some connectors cannot supply a full list of seeds on every seeding iteration; this acknowledges that limitation.
hopcountMethod - describes how to handle deletions for hopcount purposes.
ManifoldCFException
boolean beginEventSequence(java.lang.String eventName)
throws ManifoldCFException
eventName - is the name of the event.
ManifoldCFException
void completeEventSequence(java.lang.String eventName)
throws ManifoldCFException
eventName - is the name of the event.
ManifoldCFException
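Since an event sequence has separate begin and complete calls, a caller would normally want the complete call to run even when the work in between fails. The sketch below shows that pairing with `try`/`finally`. `EventManager` and `InMemoryEventManager` are hypothetical stand-ins that copy only the two signatures above (without `ManifoldCFException`), and the sketch assumes that a `false` return from `beginEventSequence` means the sequence could not be started, which this document does not state.

```java
import java.util.HashSet;
import java.util.Set;

class EventSequenceSketch {
    // Runs the work inside an event sequence.
    // Returns false without running the work if the sequence could not be started.
    static boolean runInSequence(EventManager mgr, String eventName, Runnable work) {
        if (!mgr.beginEventSequence(eventName)) {
            return false; // assumption: false means the sequence was not started
        }
        try {
            work.run();
        } finally {
            // Always complete the sequence, even if the work throws.
            mgr.completeEventSequence(eventName);
        }
        return true;
    }

    public static void main(String[] args) {
        EventManager mgr = new InMemoryEventManager();
        boolean ran = runInSequence(mgr, "login", () -> System.out.println("working"));
        System.out.println(ran); // prints true
    }
}

// Hypothetical stand-in mirroring the two event methods above.
interface EventManager {
    boolean beginEventSequence(String eventName);
    void completeEventSequence(String eventName);
}

// Toy implementation that tracks active event names in memory.
class InMemoryEventManager implements EventManager {
    private final Set<String> active = new HashSet<>();

    public boolean beginEventSequence(String eventName) {
        return active.add(eventName);
    }

    public void completeEventSequence(String eventName) {
        active.remove(eventName);
    }
}
```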
boolean[] findHopCounts(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String linkType,
int limit,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
docIDHashes - is the set of document hashes to find the hopcount for.
linkType - is the kind of link to find the hopcount for.
limit - is the limit, beyond which a negative distance may be returned.
hopcountMethod - describes how to handle deletions for hopcount purposes.
ManifoldCFException
java.lang.String[] getAllSeeds(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job identifier.
ManifoldCFException
boolean addDocument(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String docIDHash,
java.lang.String docID,
java.lang.String parentIdentifierHash,
java.lang.String relationshipType,
int hopcountMethod,
java.lang.String[] dataNames,
java.lang.Object[][] dataValues,
long currentTime,
double priority,
java.lang.String[] prereqEventNames)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
docIDHash - is the local document identifier hash value.
parentIdentifierHash - is the optional parent identifier hash value for this document. Pass null if none.
relationshipType - is the optional link type between this document and its parent. Pass null if there is no relationship with a parent.
hopcountMethod - is either accurate, nodelete, or neverdelete.
dataNames - are the names of the data to carry down to the child from this parent.
dataValues - are the values to carry down to the child from this parent, corresponding to dataNames above. If CharacterInput objects are passed in here, it is the caller's responsibility to clean these up.
currentTime - is the time in milliseconds since epoch that will be recorded for this operation.
priority - is the desired document priority for the document.
prereqEventNames - are the events that must be completed before the document can be processed.
ManifoldCFException
boolean[] addDocuments(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] docIDHashes,
java.lang.String[] docIDs,
java.lang.String parentIdentifierHash,
java.lang.String relationshipType,
int hopcountMethod,
java.lang.String[][] dataNames,
java.lang.Object[][][] dataValues,
long currentTime,
double[] priorities,
java.lang.String[][] prereqEventNames)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
docIDHashes - are the hashes of the local document identifiers.
docIDs - are the local document identifiers.
parentIdentifierHash - is the optional parent identifier hash of these documents. Pass null if none.
relationshipType - is the optional link type between this document and its parent. Pass null if there is no relationship with a parent.
hopcountMethod - is either accurate, nodelete, or neverdelete.
dataNames - are the names of the data to carry down to the child from this parent.
dataValues - are the values to carry down to the child from this parent, corresponding to dataNames above. If CharacterInput objects are passed in here, it is the caller's responsibility to clean these up.
currentTime - is the time in milliseconds since epoch that will be recorded for this operation.
priorities - are the desired document priorities for the documents.
prereqEventNames - are the events that must be completed before each document can be processed.
ManifoldCFException
DocumentDescription[] finishDocuments(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
java.lang.String[] parentIdentifierHashes,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job identifier.
legalLinkTypes - is the set of legal link types that this connector generates.
parentIdentifierHashes - are the hashes of the document identifiers for whom child link extraction just took place.
hopcountMethod - describes how to handle deletions for hopcount purposes.
ManifoldCFException
java.lang.String[] retrieveParentData(java.lang.Long jobID,
java.lang.String docIDHash,
java.lang.String dataName)
throws ManifoldCFException
jobID - is the job identifier.
docIDHash - is the hash of the document identifier.
dataName - is the kind of data to retrieve.
ManifoldCFException
CharacterInput[] retrieveParentDataAsFiles(java.lang.Long jobID,
java.lang.String docIDHash,
java.lang.String dataName)
throws ManifoldCFException
jobID - is the job identifier.
docIDHash - is the document identifier hash value.
dataName - is the kind of data to retrieve.
ManifoldCFException
void manualStart(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the ID of the job to start.
ManifoldCFException
void manualAbort(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job to abort.
ManifoldCFException
void manualAbortRestart(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job to abort.
ManifoldCFException
void pauseJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job identifier to pause.
ManifoldCFException
void restartJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job identifier to restart.
ManifoldCFException
void resetJobSchedule(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job identifier.
ManifoldCFException
void startJobs(long currentTime,
java.util.ArrayList unwaitList)
throws ManifoldCFException
currentTime - is the current time in milliseconds since epoch.
unwaitList - is filled in with the set of job IDs that were resumed (as Long objects).
ManifoldCFException
void waitJobs(long currentTime,
java.util.ArrayList waitList)
throws ManifoldCFException
currentTime - is the current time in milliseconds since epoch.
waitList - is filled in with the set of job IDs that were put into a wait state (as Long objects).
ManifoldCFException
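The startJobs and waitJobs entries above both take a list that the callee fills in rather than returning a value. The following is a minimal sketch of that out-parameter pattern only. The JobResumer interface, the ToyResumer class, and the "window opens at" rule are hypothetical stand-ins invented for illustration and are not the ManifoldCF implementation; the sketch also uses ArrayList<Long> where the documented signature declares a raw java.util.ArrayList.

```java
import java.util.ArrayList;
import java.util.List;

public class UnwaitListSketch {
    /** Hypothetical stand-in with the same parameter shape as startJobs; not the real interface. */
    interface JobResumer {
        void startJobs(long currentTime, ArrayList<Long> unwaitList);
    }

    /** Toy rule (an assumption): a job is resumed once its window-open time has passed. */
    static class ToyResumer implements JobResumer {
        private final long[][] jobs; // each entry: {jobID, windowOpensAtMillis}

        ToyResumer(long[][] jobs) {
            this.jobs = jobs;
        }

        @Override
        public void startJobs(long currentTime, ArrayList<Long> unwaitList) {
            for (long[] job : jobs) {
                if (job[1] <= currentTime) {
                    // The callee reports each resumed job by appending its ID to the caller's list.
                    unwaitList.add(job[0]);
                }
            }
        }
    }

    /** Calls the toy resumer and returns the IDs it reported as resumed. */
    static List<Long> resumedAt(long currentTime) {
        JobResumer resumer = new ToyResumer(new long[][] {{1L, 1000L}, {2L, 5000L}, {3L, 2000L}});
        ArrayList<Long> unwaitList = new ArrayList<>(); // the caller owns the list
        resumer.startJobs(currentTime, unwaitList);
        return unwaitList;
    }

    public static void main(String[] args) {
        System.out.println(resumedAt(2500L)); // prints [1, 3]
    }
}
```

With this shape the caller decides what to do with the reported IDs (for example, logging them), and the method itself can remain void.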
JobStartRecord[] getJobsReadyForSeeding(long currentTime)
throws ManifoldCFException
currentTime - is the current time in milliseconds since epoch.
ManifoldCFException
void resetSeedJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job id.
ManifoldCFException
JobStartRecord[] getJobsReadyForDelete()
throws ManifoldCFException
ManifoldCFException
JobStartRecord[] getJobsReadyForStartup()
throws ManifoldCFException
ManifoldCFException
JobStartRecord[] getJobsReadyForInactivity()
throws ManifoldCFException
ManifoldCFException
void inactivateJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the ID of the job to inactivate.
ManifoldCFException
void resetStartDeleteJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job id.
ManifoldCFException
void resetNotifyJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job id.
ManifoldCFException
void resetStartupJob(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job id.
ManifoldCFException
void prepareDeleteScan(java.lang.Long jobID)
throws ManifoldCFException
jobID - is the job id.
ManifoldCFException
void prepareFullScan(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job id.
legalLinkTypes - are the link types allowed for the job.
hopcountMethod - describes how to handle deletions for hopcount purposes.
ManifoldCFException
void prepareIncrementalScan(java.lang.Long jobID,
java.lang.String[] legalLinkTypes,
int hopcountMethod)
throws ManifoldCFException
jobID - is the job id.
legalLinkTypes - are the link types allowed for the job.
hopcountMethod - describes how to handle deletions for hopcount purposes.
ManifoldCFException
void noteJobDeleteStarted(java.lang.Long jobID,
long startTime)
throws ManifoldCFException
jobID - is the job id.
startTime - is the job start time.
ManifoldCFException
void noteJobStarted(java.lang.Long jobID,
long startTime)
throws ManifoldCFException
jobID - is the job id.
startTime - is the job start time.
ManifoldCFException
void noteJobSeeded(java.lang.Long jobID,
long startTime)
throws ManifoldCFException
jobID - is the job id.
startTime - is the job seed time.
ManifoldCFException
void noteConnectorDeregistration(java.lang.String[] connectionNames)
throws ManifoldCFException
connectionNames - is the set of connection names.
ManifoldCFException
void noteConnectorRegistration(java.lang.String[] connectionNames)
throws ManifoldCFException
connectionNames - is the set of connection names.
ManifoldCFException
void noteConnectionChange(java.lang.String connectionName)
throws ManifoldCFException
ManifoldCFException
void noteOutputConnectorDeregistration(java.lang.String[] connectionNames)
throws ManifoldCFException
connectionNames - is the set of connection names.
ManifoldCFException
void noteOutputConnectorRegistration(java.lang.String[] connectionNames)
throws ManifoldCFException
connectionNames - is the set of connection names.
ManifoldCFException
void noteOutputConnectionChange(java.lang.String connectionName)
throws ManifoldCFException
ManifoldCFException
void deleteJobsReadyForDelete()
throws ManifoldCFException
ManifoldCFException
DocumentDescription[] getNextDeletableDocuments(int n,
long currentTime)
throws ManifoldCFException
n - is the maximum number of documents to return.
currentTime - is the current time; some fetches do not occur until a specific time.
ManifoldCFException
DocumentSetAndFlags getNextCleanableDocuments(int n,
long currentTime)
throws ManifoldCFException
n - is the maximum number of documents to return.
currentTime - is the current time; some fetches do not occur until a specific time.
ManifoldCFException
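The two fetch methods above combine a batch limit (n) with a time gate (currentTime). The sketch below illustrates one way those two parameters could interact, assuming each queued document carries an earliest-fetch time. QueuedDoc, its notBeforeMillis field, and nextDue are hypothetical names introduced for illustration; the real DocumentDescription and DocumentSetAndFlags types are not modeled, and the actual selection logic may differ.

```java
import java.util.ArrayList;
import java.util.List;

public class DueDocumentsSketch {
    /** Hypothetical queue entry: a document hash plus the earliest time it may be fetched. */
    static final class QueuedDoc {
        final String docIDHash;
        final long notBeforeMillis;

        QueuedDoc(String docIDHash, long notBeforeMillis) {
            this.docIDHash = docIDHash;
            this.notBeforeMillis = notBeforeMillis;
        }
    }

    /** Returns at most n hashes, in queue order, whose earliest-fetch time has been reached. */
    static List<String> nextDue(List<QueuedDoc> queue, int n, long currentTime) {
        List<String> result = new ArrayList<>();
        for (QueuedDoc doc : queue) {
            if (result.size() >= n) {
                break; // batch limit reached
            }
            if (doc.notBeforeMillis <= currentTime) {
                result.add(doc.docIDHash); // due now; documents not yet due are skipped
            }
        }
        return result;
    }

    /** Small fixed queue used by the example. */
    static List<QueuedDoc> sampleQueue() {
        List<QueuedDoc> queue = new ArrayList<>();
        queue.add(new QueuedDoc("a", 100L));
        queue.add(new QueuedDoc("b", 900L));
        queue.add(new QueuedDoc("c", 200L));
        queue.add(new QueuedDoc("d", 300L));
        return queue;
    }

    public static void main(String[] args) {
        System.out.println(nextDue(sampleQueue(), 2, 500L)); // prints [a, c]
    }
}
```

In the example, "b" is skipped because it is not yet due at time 500, and "d" is left for a later call because the batch limit of 2 has already been reached.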
void deleteIngestedDocumentIdentifiers(DocumentDescription[] identifiers)
throws ManifoldCFException
identifiers - is the set of document identifiers.
ManifoldCFException
boolean errorAbort(java.lang.Long jobID,
java.lang.String errorText)
throws ManifoldCFException
jobID - is the job to abort.
errorText - is the error text.
ManifoldCFException
void finishJobAborts(long timestamp,
java.util.ArrayList abortJobs)
throws ManifoldCFException
timestamp - is the current time in milliseconds since epoch.
abortJobs - is filled in with the set of IJobDescription objects that were aborted.
ManifoldCFException
void finishJobs()
throws ManifoldCFException
ManifoldCFException
void resetJobs(long currentTime,
java.util.ArrayList resetJobs)
throws ManifoldCFException
currentTime - is the current time in milliseconds since epoch.
resetJobs - is filled in with the set of IJobDescription objects that were reset.
ManifoldCFException
JobStatus getStatus(java.lang.Long jobID)
throws ManifoldCFException
ManifoldCFException
JobStatus[] getAllStatus()
throws ManifoldCFException
ManifoldCFException
JobStatus[] getRunningJobs()
throws ManifoldCFException
ManifoldCFException
JobStatus[] getFinishedJobs()
throws ManifoldCFException
ManifoldCFException
IResultSet genDocumentStatus(java.lang.String connectionName,
StatusFilterCriteria filterCriteria,
SortOrder sortOrder,
int startRow,
int rowCount)
throws ManifoldCFException
connectionName - is the name of the connection.
filterCriteria - are the criteria used to limit the records considered for the report.
sortOrder - is the specified sort order of the final report.
startRow - is the first row to include.
rowCount - is the number of rows to include.
ManifoldCFException
IResultSet genQueueStatus(java.lang.String connectionName,
StatusFilterCriteria filterCriteria,
SortOrder sortOrder,
BucketDescription idBucketDescription,
int startRow,
int rowCount)
throws ManifoldCFException
connectionName - is the name of the connection.
filterCriteria - are the criteria used to limit the records considered for the report.
sortOrder - is the specified sort order of the final report.
idBucketDescription - is the bucket description for generating the identifier class.
startRow - is the first row to include.
rowCount - is the number of rows to include.
ManifoldCFException
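Both report methods above page their results through startRow and rowCount. The sketch below shows that windowing on an in-memory list, as a rough illustration only. It assumes startRow is zero-based, which the descriptions above do not state, and the page helper is a hypothetical name, not part of the documented interface or of IResultSet.

```java
import java.util.ArrayList;
import java.util.List;

public class ReportPagingSketch {
    /**
     * Returns the window of rows that starts at startRow (assumed zero-based)
     * and contains at most rowCount rows. Out-of-range values are clamped.
     */
    static <T> List<T> page(List<T> rows, int startRow, int rowCount) {
        int from = Math.min(Math.max(startRow, 0), rows.size());
        // Use long arithmetic so a very large rowCount cannot overflow the end index.
        int to = (int) Math.min((long) from + Math.max(rowCount, 0), (long) rows.size());
        return new ArrayList<>(rows.subList(from, to));
    }

    public static void main(String[] args) {
        List<String> report = List.of("a", "b", "c", "d", "e");
        System.out.println(page(report, 1, 2));  // prints [b, c]
        System.out.println(page(report, 4, 10)); // prints [e]
    }
}
```

A caller that wants successive pages would advance startRow by rowCount on each request and stop when a page comes back shorter than rowCount.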