| Package | Description |
|---|---|
| org.apache.nutch.collection |
Subcollection is a subset of an index.
|
| org.apache.nutch.net | |
| org.apache.nutch.net.urlnormalizer.basic | |
| org.apache.nutch.net.urlnormalizer.pass | |
| org.apache.nutch.net.urlnormalizer.regex | |
| org.apache.nutch.parse | |
| org.apache.nutch.urlfilter.api | |
| org.apache.nutch.urlfilter.automaton |
A url filter plugin based on
dk.brics.automaton Finite-State
Automata for JavaTM.
|
| org.apache.nutch.urlfilter.domain |
A url filter plugin that filters by domain.
|
| org.apache.nutch.urlfilter.domainblacklist | |
| org.apache.nutch.urlfilter.prefix |
A url filter plugin.
|
| org.apache.nutch.urlfilter.regex |
A url filter plugin.
|
| org.apache.nutch.urlfilter.suffix | |
| org.apache.nutch.urlfilter.validator |
A url filter plugin that validates given urls.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilterException |
| Class and Description |
|---|
| URLNormalizer
Interface used to convert URLs to normal form and optionally perform substitutions
|
| Class and Description |
|---|
| URLNormalizer
Interface used to convert URLs to normal form and optionally perform substitutions
|
| Class and Description |
|---|
| URLNormalizer
Interface used to convert URLs to normal form and optionally perform substitutions
|
| Class and Description |
|---|
| URLFilters
Creates and caches
URLFilter implementing plugins. |
| URLNormalizers
This class uses a "chained filter" pattern to run defined normalizers.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
| Class and Description |
|---|
| URLFilter
Interface used to limit which URLs enter Nutch.
|
Copyright © 2014 The Apache Software Foundation