public class NodeDumper
extends org.apache.hadoop.conf.Configured
implements org.apache.hadoop.util.Tool
| Modifier and Type | Class and Description |
|---|---|
| static class | NodeDumper.Dumper: Outputs the hosts or domains with an associated value. |
| static class | NodeDumper.Sorter: Outputs the top URLs sorted in descending order. |
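To make the idea of "hosts or domains with an associated value" concrete, the following is a minimal plain-Java sketch of grouping per-URL scores by host and combining them. It is an illustration only and not the Nutch implementation: the class `HostAggregationSketch`, the enum `Aggregation` with its `SUM` and `MAX` choices, and the method `aggregateByHost` are hypothetical names introduced here, and the real `NodeDumper.Dumper` runs as a Hadoop job rather than over an in-memory map.

```java
import java.net.URI;
import java.util.Map;
import java.util.TreeMap;

// Illustrative stand-in only; none of these names exist in Nutch.
class HostAggregationSketch {

    /** Hypothetical aggregation choices, assumed here for illustration. */
    enum Aggregation { SUM, MAX }

    /** Groups URL scores by host and combines them with the chosen aggregation. */
    static Map<String, Double> aggregateByHost(Map<String, Double> urlScores,
                                               Aggregation aggregation) {
        Map<String, Double> byHost = new TreeMap<>();
        for (Map.Entry<String, Double> entry : urlScores.entrySet()) {
            String host = URI.create(entry.getKey()).getHost();
            if (host == null) {
                continue; // skip URLs that have no parsable host
            }
            if (aggregation == Aggregation.SUM) {
                byHost.merge(host, entry.getValue(), Double::sum);
            } else {
                byHost.merge(host, entry.getValue(), Math::max);
            }
        }
        return byHost;
    }

    public static void main(String[] args) {
        Map<String, Double> scores = new TreeMap<>();
        scores.put("http://example.org/a", 1.0);
        scores.put("http://example.org/b", 2.5);
        scores.put("http://nutch.apache.org/", 4.0);
        // One line per host with its aggregated value.
        aggregateByHost(scores, Aggregation.SUM)
            .forEach((host, value) -> System.out.println(host + "\t" + value));
    }
}
```

Grouping by domain instead of host would only change how the grouping key is derived from each URL; the merge step stays the same.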
| Modifier and Type | Field and Description |
|---|---|
| static org.slf4j.Logger | LOG |
| Constructor and Description |
|---|
| NodeDumper() |
| Modifier and Type | Method and Description |
|---|---|
| void | dumpNodes(org.apache.hadoop.fs.Path webGraphDb, org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type, long topN, org.apache.hadoop.fs.Path output, boolean asEff, org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType, org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType, boolean asSequenceFile): Runs the process to dump the top URLs out to a text file. |
| static void | main(String[] args) |
| int | run(String[] args): Runs the node dumper tool. |
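The `topN` parameter of `dumpNodes` limits the dump to the highest-valued entries. As a rough illustration of that selection, here is a small plain-Java sketch that sorts scores in descending order and keeps the first `topN`. It is not how Nutch does it internally (the tool runs as a Hadoop job over the WebGraph); `TopNSketch` and its `topN` method are hypothetical names used only for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative stand-in only; not part of the Nutch API.
class TopNSketch {

    /** Returns up to topN entries, ordered from highest to lowest score. */
    static List<Map.Entry<String, Double>> topN(Map<String, Double> scores, long topN) {
        List<Map.Entry<String, Double>> entries = new ArrayList<>(scores.entrySet());
        // Sort by value, largest first.
        entries.sort(Map.Entry.<String, Double>comparingByValue().reversed());
        // Clamp the limit to the range [0, entries.size()].
        int limit = (int) Math.max(0L, Math.min(topN, (long) entries.size()));
        return new ArrayList<>(entries.subList(0, limit));
    }

    public static void main(String[] args) {
        Map<String, Double> scores = new TreeMap<>();
        scores.put("http://a.example/", 0.2);
        scores.put("http://b.example/", 0.9);
        scores.put("http://c.example/", 0.5);
        // Print the two highest-scoring URLs, one per line.
        for (Map.Entry<String, Double> entry : topN(scores, 2L)) {
            System.out.println(entry.getKey() + "\t" + entry.getValue());
        }
    }
}
```

Sorting the full list is fine for a small in-memory example; a bounded priority queue would be the usual choice when the input is much larger than `topN`.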
public void dumpNodes(org.apache.hadoop.fs.Path webGraphDb,
org.apache.nutch.scoring.webgraph.NodeDumper.DumpType type,
long topN,
org.apache.hadoop.fs.Path output,
boolean asEff,
org.apache.nutch.scoring.webgraph.NodeDumper.NameType nameType,
org.apache.nutch.scoring.webgraph.NodeDumper.AggrType aggrType,
boolean asSequenceFile)
throws Exception
Parameters:
webGraphDb - The WebGraph from which to pull values.
topN -
output -
Throws:
IOException - If an error occurs while dumping the top values.
Exception

Copyright © 2014 The Apache Software Foundation