Package | Description |
---|---|
us.codecraft.webmagic.configurable | |
us.codecraft.webmagic.selector | Selectors for page extraction. |
us.codecraft.webmagic.utils | Static utilities of webmagic. |
Modifier and Type | Method and Description |
---|---|
Selector | ExtractRule.getSelector() |

Modifier and Type | Method and Description |
---|---|
void | ExtractRule.setSelector(Selector selector) |
Modifier and Type | Class and Description |
---|---|
class | AndSelector: All selectors are arranged as a pipeline. |
class | BaseElementSelector |
class | CssSelector: CSS selector. |
class | JsonPathSelector: JsonPath selector. Used to extract content from JSON. |
class | LinksSelector: Links selector based on jsoup. |
class | OrSelector: All selectors extract separately, and their results are combined as the final result. |
class | RegexSelector: Regex-based selector. |
class | ReplaceSelector: Replace selector. |
class | SmartContentSelector: Borrowed from https://code.google.com/p/cx-extractor/ |
class | Xpath2Selector: Selector that supports XPath 2.0. Wraps HtmlCleaner and Saxon HE. |
class | XpathSelector: XPath selector based on Xsoup. |
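
The pipeline behavior described for AndSelector and the combine behavior described for OrSelector can be illustrated without the library. The following is a minimal sketch under stated assumptions: `SketchSelector`, `CompositionSketch`, and the `regex`, `and`, and `or` helpers are hypothetical stand-ins written for this example, not WebMagic source, and the real classes may differ in details such as null handling.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical stand-in for a selector; an assumption, not the library's interface.
interface SketchSelector {
    String select(String text);           // first result, or null if nothing matches
    List<String> selectList(String text); // all results
}

final class CompositionSketch {

    /** Regex stand-in: yields capture group 1 of each match. */
    static SketchSelector regex(String pattern) {
        Pattern p = Pattern.compile(pattern);
        return new SketchSelector() {
            @Override public String select(String text) {
                Matcher m = p.matcher(text);
                return m.find() ? m.group(1) : null;
            }
            @Override public List<String> selectList(String text) {
                List<String> out = new ArrayList<>();
                Matcher m = p.matcher(text);
                while (m.find()) out.add(m.group(1));
                return out;
            }
        };
    }

    /** Pipeline: each selector consumes the output of the previous one. */
    static SketchSelector and(SketchSelector... selectors) {
        return new SketchSelector() {
            @Override public String select(String text) {
                String current = text;
                for (SketchSelector s : selectors) {
                    if (current == null) return null; // an earlier stage found nothing
                    current = s.select(current);
                }
                return current;
            }
            @Override public List<String> selectList(String text) {
                List<String> current = new ArrayList<>();
                current.add(text);
                for (SketchSelector s : selectors) {
                    List<String> next = new ArrayList<>();
                    for (String c : current) next.addAll(s.selectList(c));
                    current = next;
                }
                return current;
            }
        };
    }

    /** Combine: every selector runs on the original text; results are concatenated. */
    static SketchSelector or(SketchSelector... selectors) {
        return new SketchSelector() {
            @Override public String select(String text) {
                for (SketchSelector s : selectors) {
                    String r = s.select(text);
                    if (r != null) return r; // first selector that yields a result
                }
                return null;
            }
            @Override public List<String> selectList(String text) {
                List<String> out = new ArrayList<>();
                for (SketchSelector s : selectors) out.addAll(s.selectList(text));
                return out;
            }
        };
    }

    public static void main(String[] args) {
        String html = "<div><a href=\"/a\">A</a><a href=\"/b\">B</a></div><p>note</p>";
        SketchSelector div = regex("<div>(.*?)</div>");
        SketchSelector href = regex("href=\"(.*?)\"");
        SketchSelector para = regex("<p>(.*?)</p>");

        System.out.println(and(div, href).selectList(html)); // prints [/a, /b]
        System.out.println(or(href, para).selectList(html)); // prints [/a, /b, note]
    }
}
```

In the sketch, `and(div, href)` narrows the text to the `div` body before looking for links, while `or(href, para)` runs both selectors against the full input and appends the results in selector order.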
Modifier and Type | Method and Description |
---|---|
static AndSelector | Selectors.and(Selector... selectors) |
static OrSelector | Selectors.or(Selector... selectors) |
Selectable | Selectable.select(Selector selector): Extract by custom selector. |
Selectable | HtmlNode.select(Selector selector) |
Selectable | AbstractSelectable.select(Selector selector) |
protected Selectable | AbstractSelectable.select(Selector selector, List<String> strings) |
String | Html.selectDocument(Selector selector) |
List<String> | Html.selectDocumentForList(Selector selector) |
Selectable | Selectable.selectList(Selector selector): Extract by custom selector. |
Selectable | HtmlNode.selectList(Selector selector) |
Selectable | AbstractSelectable.selectList(Selector selector) |
protected Selectable | AbstractSelectable.selectList(Selector selector, List<String> strings) |
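
The difference between passing a custom selector to `select` and to `selectList` may be easier to see in a small model. This is a sketch only: `MiniSelector`, `MiniSelectable`, and `CsvFieldSelector` are hypothetical names invented for the example, and the assumption that `select` keeps one result per source string while `selectList` keeps all of them is this sketch's own, not a statement about the library's implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; names and behavior are assumptions, not library source.
interface MiniSelector {
    String select(String text);           // first result, or null
    List<String> selectList(String text); // all results
}

final class MiniSelectable {
    private final List<String> sources;

    MiniSelectable(List<String> sources) { this.sources = sources; }

    /** Applies the selector to each source string, keeping at most one result per source. */
    MiniSelectable select(MiniSelector selector) {
        List<String> results = new ArrayList<>();
        for (String s : sources) {
            String r = selector.select(s);
            if (r != null) results.add(r);
        }
        return new MiniSelectable(results);
    }

    /** Applies the selector to each source string, keeping every result. */
    MiniSelectable selectList(MiniSelector selector) {
        List<String> results = new ArrayList<>();
        for (String s : sources) results.addAll(selector.selectList(s));
        return new MiniSelectable(results);
    }

    List<String> all() { return sources; }
}

/** Custom selector: splits comma-separated text into trimmed, non-empty fields. */
final class CsvFieldSelector implements MiniSelector {
    @Override public List<String> selectList(String text) {
        List<String> out = new ArrayList<>();
        for (String part : text.split(",")) {
            String trimmed = part.trim();
            if (!trimmed.isEmpty()) out.add(trimmed);
        }
        return out;
    }

    @Override public String select(String text) {
        List<String> fields = selectList(text);
        return fields.isEmpty() ? null : fields.get(0);
    }
}

final class CustomSelectorSketch {
    public static void main(String[] args) {
        MiniSelectable rows = new MiniSelectable(Arrays.asList("a, b", "c", " , "));
        MiniSelector csv = new CsvFieldSelector();

        System.out.println(rows.select(csv).all());     // prints [a, c]
        System.out.println(rows.selectList(csv).all()); // prints [a, b, c]
    }
}
```

Because both methods return another selectable in this model, calls can be chained, which mirrors the `Selectable` return type listed in the table above.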
Constructor and Description |
---|
AndSelector(Selector... selectors) |
OrSelector(Selector... selectors) |

Constructor and Description |
---|
AndSelector(List<Selector> selectors) |
OrSelector(List<Selector> selectors) |
Modifier and Type | Method and Description |
---|---|
static Selector | ExtractorUtils.getSelector(ExtractBy extractBy) |

Modifier and Type | Method and Description |
---|---|
static List<Selector> | ExtractorUtils.getSelectors(ExtractBy[] extractBies) |
Copyright © 2017. All rights reserved.