| Field and Description |
|---|
| us.codecraft.webmagic.selector.Html.DISABLE_HTML_ENTITY_ESCAPE |
| Method and Description |
|---|
| us.codecraft.webmagic.Spider.downloader(Downloader) |
| us.codecraft.webmagic.utils.UrlUtils.encodeIllegalCharacterInUrl(String) |
| us.codecraft.webmagic.Spider.pipeline(Pipeline) |
| us.codecraft.webmagic.Spider.scheduler(Scheduler) |
| us.codecraft.webmagic.Page.setHtml(Html)
since 0.4.0
The html is parse just when first time of calling
Page.getHtml(), so use Page.setRawText(String) instead. |
| us.codecraft.webmagic.selector.Selectors.xsoup(String) |
| Annotation Type Element and Description |
|---|
| us.codecraft.webmagic.model.annotation.ExtractByUrl.multi
since 0.4.2
|
| us.codecraft.webmagic.model.annotation.ExtractBy.multi
since 0.4.2
|
| us.codecraft.webmagic.model.annotation.ComboExtract.multi
since 0.4.2
|
Copyright © 2017. All rights reserved.