PyPI page
Home page
Author:
Divyanshu Kakwani
License:
Summary:
Generate large textual corpora for almost any language by crawling the web
Latest version:
0.2
Required dependencies:
boilerpipe3
|
click
|
htmldate
|
morfessor
|
nltk
|
pandas
|
scrapy
|
scrapyd
|
scrapyd-client
|
tldextract
|
tqdm
Downloads last day:
5
Downloads last week:
23
Downloads last month:
32