Interface | Description |
---|---|
LinkExtractorParser |
Interface specifying contract of content parser that aims to extract links
|
Class | Description |
---|---|
BaseParser |
BaseParser is the base class for
LinkExtractorParser
It is advised to make subclasses reusable across parsing, so BaseParser BaseParser.isReusable() returns true by default |
CSSParseExceptionCallback | |
CssParser |
CSS Parser used to extract from CSS files external urls
|
CssParserCacheLoader | |
HTMLParser |
HTMLParser subclasses can parse HTML content to obtain URLs. |
HtmlParsingUtils | |
JsoupBasedHtmlParser |
Parser based on JSOUP
|
LagartoBasedHtmlParser |
Parser based on Lagarto
|
URLCollection |
Collection class designed for handling URLs
|
URLString |
Helper class to allow URLs to be stored in Collections without incurring the
cost of the hostname lookup performed by the URL methods equals() and
hashCode() URL is a final class, so cannot be extended ...
|
Exception | Description |
---|---|
HTMLParseException |
Exception class for use with
HTMLParser classes. |
LinkExtractorParseException |
Exception used with
LinkExtractorParser |
Copyright © 1998-2019 Apache Software Foundation. All Rights Reserved.