public class CommonCrawlFormatFactory
extends java.lang.Object
CommonCrawlFormat
objects (a.k.a. formatter) that map crawled files to CommonCrawl format.Constructor and Description |
---|
CommonCrawlFormatFactory() |
Modifier and Type | Method and Description |
---|---|
static CommonCrawlFormat |
getCommonCrawlFormat(java.lang.String formatType,
Configuration nutchConf,
CommonCrawlConfig config) |
static CommonCrawlFormat |
getCommonCrawlFormat(java.lang.String formatType,
java.lang.String url,
Content content,
Metadata metadata,
Configuration nutchConf,
CommonCrawlConfig config)
Deprecated.
|
public static CommonCrawlFormat getCommonCrawlFormat(java.lang.String formatType, java.lang.String url, Content content, Metadata metadata, Configuration nutchConf, CommonCrawlConfig config) throws java.io.IOException
CommonCrawlFormat
object specifying the type of formatter.formatType
- the type of formatter to be created.url
- the url.content
- the content.metadata
- the metadata.nutchConf
- the configuration.config
- the CommonCrawl output configuration.CommonCrawlFormat
object.java.io.IOException
- If any I/O error occurs.public static CommonCrawlFormat getCommonCrawlFormat(java.lang.String formatType, Configuration nutchConf, CommonCrawlConfig config) throws java.io.IOException
java.io.IOException
Copyright © 2019 The Apache Software Foundation