Package | Description |
---|---|
us.codecraft.webmagic | Main class "Spider" and models. |
us.codecraft.webmagic.downloader | The Downloader is the part that downloads web pages and stores them in a Page object. |
us.codecraft.webmagic.selector | Selectors for page extraction. |
Modifier and Type | Method and Description |
---|---|
Html | Page.getHtml(): Gets the HTML content of the page. |
Modifier and Type | Method and Description |
---|---|
void | Page.setHtml(Html html): Deprecated since 0.4.0. The HTML is parsed only on the first call to Page.getHtml(), so use Page.setRawText(String) instead. |
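
The deprecation note describes lazy parsing: the raw text is stored first and turned into an Html object only when Page.getHtml() is first called. The sketch below illustrates that pattern with self-contained stand-in classes. LazyPageSketch, its nested Page and ParsedHtml types, and getParseCount() are hypothetical names invented for this example; they are not WebMagic classes and should not be read as the library's actual implementation. Discarding the cached result when the raw text changes is also an assumption of this sketch.

```java
import java.util.Objects;

/** Illustrative sketch only: hypothetical stand-ins, not WebMagic's classes. */
public class LazyPageSketch {

    /** Stand-in for a parsed document (the real Html class does far more). */
    static final class ParsedHtml {
        private final String text;

        ParsedHtml(String text) {
            this.text = text;
        }

        String text() {
            return text;
        }
    }

    /** Stand-in for Page: keeps the raw text and parses it on first access. */
    static final class Page {
        private String rawText;
        private ParsedHtml html;  // cached result, null until getHtml() runs
        private int parseCount;   // counts how often parsing actually happened

        Page setRawText(String rawText) {
            this.rawText = Objects.requireNonNull(rawText, "rawText");
            this.html = null;     // assumption: new raw text discards the old parse
            return this;
        }

        ParsedHtml getHtml() {
            if (rawText == null) {
                throw new IllegalStateException("setRawText must be called first");
            }
            if (html == null) {   // parse only on the first call
                html = new ParsedHtml(rawText);
                parseCount++;
            }
            return html;
        }

        int getParseCount() {
            return parseCount;
        }
    }

    public static void main(String[] args) {
        Page page = new Page().setRawText("<html><body>hello</body></html>");
        System.out.println(page.getParseCount()); // prints 0: nothing parsed yet
        page.getHtml();
        page.getHtml();
        System.out.println(page.getParseCount()); // prints 1: parsed once, then cached
    }
}
```

Under this pattern, setting the raw text is cheap and the parsing cost is paid only if the HTML is actually requested, which is consistent with the note's advice to prefer Page.setRawText(String) over Page.setHtml(Html).
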
Modifier and Type | Method and Description |
---|---|
Html | AbstractDownloader.download(String url): A simple method to download a URL. |
Html | AbstractDownloader.download(String url, String charset): A simple method to download a URL. |
Modifier and Type | Method and Description |
---|---|
static Html | Html.create(String text) |
Copyright © 2017. All rights reserved.