See: Description
Interface | Description |
---|---|
CollectorPipeline<T> |
Pipeline that can collect and store results.
|
PageModelPipeline<T> |
Implements PageModelPipeline to persistent your page model.
|
Pipeline |
Pipeline is the persistent and offline process part of crawler.
The interface Pipeline can be implemented to customize ways of persistent. |
Class | Description |
---|---|
CollectorPageModelPipeline<T> | |
ConsolePipeline |
Write results in console.
Usually used in test. |
FilePageModelPipeline |
Store results objects (page models) to files in plain format.
Use model.getKey() as file name if the model implements HasKey. Otherwise use SHA1 as file name. |
FilePipeline |
Store results in files.
|
JsonFilePageModelPipeline |
Store results objects (page models) to files in JSON format.
Use model.getKey() as file name if the model implements HasKey. Otherwise use SHA1 as file name. |
JsonFilePipeline |
Store results to files in JSON format.
|
MultiPagePipeline |
A pipeline combines the result in more than one page together.
Used for news and articles containing more than one web page. |
ResultItemsCollectorPipeline |
Copyright © 2017. All rights reserved.