Field and Description |
---|
us.codecraft.webmagic.selector.Html.DISABLE_HTML_ENTITY_ESCAPE |
Method and Description |
---|
us.codecraft.webmagic.Spider.downloader(Downloader) |
us.codecraft.webmagic.utils.UrlUtils.encodeIllegalCharacterInUrl(String) |
us.codecraft.webmagic.Spider.pipeline(Pipeline) |
us.codecraft.webmagic.Spider.scheduler(Scheduler) |
us.codecraft.webmagic.Page.setHtml(Html)
since 0.4.0
The html is parse just when first time of calling
Page.getHtml() , so use Page.setRawText(String) instead. |
us.codecraft.webmagic.selector.Selectors.xsoup(String) |
Annotation Type Element and Description |
---|
us.codecraft.webmagic.model.annotation.ExtractByUrl.multi
since 0.4.2
|
us.codecraft.webmagic.model.annotation.ExtractBy.multi
since 0.4.2
|
us.codecraft.webmagic.model.annotation.ComboExtract.multi
since 0.4.2
|
Copyright © 2017. All rights reserved.