| Package | Description |
|---|---|
| org.apache.nutch.parse | |
| org.apache.nutch.segment |
| Modifier and Type | Method and Description |
|---|---|
static ParseText |
ParseText.read(DataInput in) |
| Modifier and Type | Method and Description |
|---|---|
void |
ParseResult.put(String key,
ParseText text,
ParseData data)
Store a result of parsing.
|
void |
ParseResult.put(org.apache.hadoop.io.Text key,
ParseText text,
ParseData data)
Store a result of parsing.
|
| Constructor and Description |
|---|
ParseImpl(ParseText text,
ParseData data) |
ParseImpl(ParseText text,
ParseData data,
boolean isCanonical) |
| Modifier and Type | Method and Description |
|---|---|
boolean |
SegmentMergeFilter.filter(org.apache.hadoop.io.Text key,
CrawlDatum generateData,
CrawlDatum fetchData,
CrawlDatum sigData,
Content content,
ParseData parseData,
ParseText parseText,
Collection<CrawlDatum> linked)
The filtering method which gets all information being merged for a given
key (URL).
|
boolean |
SegmentMergeFilters.filter(org.apache.hadoop.io.Text key,
CrawlDatum generateData,
CrawlDatum fetchData,
CrawlDatum sigData,
Content content,
ParseData parseData,
ParseText parseText,
Collection<CrawlDatum> linked)
Iterates over all
SegmentMergeFilter extensions and if any of them
returns false, it will return false as well. |
Copyright © 2014 The Apache Software Foundation