PyPI page
Home page
Author:
dlazesz
License:
LGPLv3
Summary:
Map the HTML schema of portals to valid TEI XML with the tags and structures used in them using small manual portal-specific configurations.
Latest version:
1.2.3
Required dependencies:
beautifulsoup4
|
lxml
|
pyyaml
|
warcio
|
webarticlecurator
Optional dependencies:
justext
|
newspaper3k
Downloads last day:
4
Downloads last week:
14
Downloads last month:
43