public class HttpWebClient
extends java.lang.Object
Modifier and Type | Field and Description |
---|---|
static java.lang.ThreadLocal<WebDriver> | threadWebDriver |
Constructor and Description |
---|
HttpWebClient() |
Modifier and Type | Method and Description |
---|---|
static void | cleanUpDriver(WebDriver driver) |
static WebDriver | getDriverForPage(java.lang.String url, Configuration conf) |
static java.lang.String | getHTMLContent(WebDriver driver, Configuration conf) |
static java.lang.String | getHtmlPage(java.lang.String url) |
static java.lang.String | getHtmlPage(java.lang.String url, Configuration conf) Obtains the HTML BODY using the selected Selenium WebDriver. A number of configuration properties in nutch-site.xml determine whether to take screenshots of the rendered pages and persist them as timestamped .png files in HDFS. |
public static WebDriver getDriverForPage(java.lang.String url, Configuration conf)
public static java.lang.String getHTMLContent(WebDriver driver, Configuration conf)
public static void cleanUpDriver(WebDriver driver)
public static java.lang.String getHtmlPage(java.lang.String url, Configuration conf)
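Obtains the HTML BODY using the selected Selenium WebDriver. A number of configuration properties in nutch-site.xml determine whether to take screenshots of the rendered pages and persist them as timestamped .png files in HDFS.
Parameters:
url - the URL to fetch and render
conf - the Configuration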
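The method list suggests an acquire, read, release lifecycle: getDriverForPage obtains a driver, getHTMLContent reads the rendered markup, and cleanUpDriver releases the driver. The following is a minimal sketch of that pattern only, not the actual implementation. PageDriver, the events list, and the method bodies are hypothetical stand-ins rather than Nutch or Selenium types, and how the real threadWebDriver field is populated is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative sketch only; none of these types are the real Nutch or Selenium classes. */
final class DriverLifecycleSketch {

    /** Hypothetical stand-in for a Selenium WebDriver. */
    interface PageDriver {
        String bodyHtml(); // stand-in for reading the rendered BODY
        void quit();       // stand-in for shutting the browser down
    }

    /** Records calls so the order of the lifecycle steps can be inspected. */
    static final List<String> events = new ArrayList<>();

    /** Per-thread holder, loosely mirroring the threadWebDriver field (assumption). */
    static final ThreadLocal<PageDriver> threadDriver = new ThreadLocal<>();

    /** Acquire: create a driver for the URL and remember it for this thread. */
    static PageDriver getDriverForPage(final String url) {
        events.add("open " + url);
        PageDriver driver = new PageDriver() {
            @Override
            public String bodyHtml() {
                events.add("read");
                return "<p>rendered " + url + "</p>";
            }

            @Override
            public void quit() {
                events.add("quit");
            }
        };
        threadDriver.set(driver);
        return driver;
    }

    /** Release: quit the driver and clear the per-thread reference. */
    static void cleanUpDriver(PageDriver driver) {
        if (driver != null) {
            driver.quit();
        }
        threadDriver.remove();
    }

    /** Acquire, read, and always release, even if reading throws. */
    static String getHtmlPage(String url) {
        PageDriver driver = getDriverForPage(url);
        try {
            return driver.bodyHtml();
        } finally {
            cleanUpDriver(driver);
        }
    }

    public static void main(String[] args) {
        System.out.println(getHtmlPage("http://example.com/"));
        System.out.println(events);
    }
}
```

The try/finally block is the point of the sketch: a caller that uses getDriverForPage directly would want the same guarantee that cleanUpDriver runs, so that no browser instance is left behind when rendering fails.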
public static java.lang.String getHtmlPage(java.lang.String url)
Copyright © 2018 The Apache Software Foundation