public class CssParser extends Object implements LinkExtractorParser
Constructor and Description |
---|
CssParser() |
Modifier and Type | Method and Description |
---|---|
Iterator<URL> |
getEmbeddedResourceURLs(String userAgent,
byte[] data,
URL baseUrl,
String encoding)
Get the URLs for all the resources that a browser would automatically
download following the download of the content, that is: images,
stylesheets, javascript files, applets, etc...
|
boolean |
isReusable() |
public Iterator<URL> getEmbeddedResourceURLs(String userAgent, byte[] data, URL baseUrl, String encoding) throws LinkExtractorParseException
LinkExtractorParser
URLs should not appear twice in the returned iterator.
Malformed URLs can be reported to the caller by having the Iterator return the corresponding RL String. Overall problems parsing the html should be reported by throwing an HTMLParseException.
getEmbeddedResourceURLs
in interface LinkExtractorParser
userAgent
- User Agentdata
- Response databaseUrl
- Base URL from which the HTML code was obtainedencoding
- CharsetLinkExtractorParseException
- when extracting the links fails(java.lang.String, byte[], java.net.URL, java.lang.String)
public boolean isReusable()
isReusable
in interface LinkExtractorParser
Copyright © 1998-2019 Apache Software Foundation. All Rights Reserved.