| Interface | Description |
|---|---|
| LinkExtractorParser |
Interface specifying contract of content parser that aims to extract links
|
| Class | Description |
|---|---|
| BaseParser |
BaseParser is the base class for
LinkExtractorParser
It is advised to make subclasses reusable across parsing, so BaseParserBaseParser.isReusable() returns true by default |
| CSSParseExceptionCallback | |
| CssParser |
CSS Parser used to extract from CSS files external urls
|
| CssParserCacheLoader | |
| HTMLParser |
HTMLParser subclasses can parse HTML content to obtain URLs. |
| HtmlParsingUtils | |
| JsoupBasedHtmlParser |
Parser based on JSOUP
|
| LagartoBasedHtmlParser |
Parser based on Lagarto
|
| URLCollection |
Collection class designed for handling URLs
|
| URLString |
Helper class to allow URLs to be stored in Collections without incurring the
cost of the hostname lookup performed by the URL methods equals() and
hashCode() URL is a final class, so cannot be extended ...
|
| Exception | Description |
|---|---|
| HTMLParseException |
Exception class for use with
HTMLParser classes. |
| LinkExtractorParseException |
Exception used with
LinkExtractorParser |
Copyright © 1998-2019 Apache Software Foundation. All Rights Reserved.