Author:
None
Summary:
A Python library for intelligent HTML segmentation and ROI extraction. It builds a DOM tree from raw HTML and extracts content-rich regions for efficient web scraping and analysis.
Latest version:
0.9.7
Required dependencies:
attrs | attrs-strict | beautifulsoup4 | lxml | parsel_text | treelib | typer
Optional dependencies:
ipython | prettyprinter | pytest | pytest-cov
Downloads last day:
47
Downloads last week:
311
Downloads last month:
1,498