PyPI Stats

betterhtmlchunking


Author: None
Summary: A Python library for intelligent HTML segmentation and ROI extraction. It builds a DOM tree from raw HTML and extracts content-rich regions for efficient web scraping and analysis.
Latest version: 0.9.7
Required dependencies: attrs | attrs-strict | beautifulsoup4 | lxml | parsel_text | treelib | typer
Optional dependencies: ipython | prettyprinter | pytest | pytest-cov

Downloads last day: 47
Downloads last week: 311
Downloads last month: 1,498
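
The summary above describes building a DOM tree from raw HTML and extracting content-rich regions. The sketch below illustrates that general idea using only the Python standard library. It is not the betterhtmlchunking API: `Node`, `TreeBuilder`, `build_tree`, `content_regions`, and the `max_length` parameter are all hypothetical names chosen for this example, and the library's actual selection strategy may differ.

```python
from dataclasses import dataclass, field
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


@dataclass
class Node:
    """Hypothetical DOM node: a tag, its direct text, and its children."""
    tag: str
    children: list = field(default_factory=list)
    text: str = ""

    def text_length(self) -> int:
        # Total characters of text in this subtree.
        return len(self.text) + sum(c.text_length() for c in self.children)


class TreeBuilder(HTMLParser):
    """Builds a Node tree from HTML using the standard-library parser."""

    def __init__(self):
        super().__init__()
        self.root = Node("[document]")
        self.stack = [self.root]

    def handle_starttag(self, tag, attrs):
        node = Node(tag)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open tag; ignore stray closers.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        self.stack[-1].text += data.strip()


def build_tree(html: str) -> Node:
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def content_regions(node: Node, max_length: int) -> list:
    """Return the largest non-empty subtrees whose text fits in max_length.

    Simplification: when a node is split into its children, the node's own
    direct text is dropped.
    """
    length = node.text_length()
    if length == 0:
        return []
    if length <= max_length or not node.children:
        return [node]
    regions = []
    for child in node.children:
        regions.extend(content_regions(child, max_length))
    return regions


if __name__ == "__main__":
    page = ("<html><body><nav>Home</nav>"
            "<article><p>aaaaaaaaaa</p><p>bbbbbbbbbb</p></article>"
            "<footer></footer></body></html>")
    for region in content_regions(build_tree(page), max_length=20):
        print(region.tag, region.text_length())
```

The recursion keeps a subtree whole whenever its text fits the budget and only descends when it does not, so lowering `max_length` yields smaller, more numerous regions. Empty elements such as the `footer` in the sample are skipped.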