Author:
dlazesz
License:
LGPLv3
Summary:
A crawler program to download content from portals (news, forums, blogs) and convert it to the desired output format according to the configuration.
Latest version:
1.13.0
Required dependencies:
beautifulsoup4 | chardet | lxml | mplogger | pyyaml | ratelimit | requests | urllib3 | warcio | yamale
Optional dependencies:
newspaper3k
Downloads last day:
0
Downloads last week:
31
Downloads last month:
98