PyPI page
Home page
Author:
None
Summary:
A robust, extensible Python package for synchronous and asynchronous text extraction from PDF, DOCX, DOC, TXT, ZIP, MD, RTF, HTML, and more.
Latest version:
0.2.3
Optional dependencies:
antiword
|
beautifulsoup4
|
lxml
|
markdown
|
mkdocs
|
mkdocs-gen-files
|
mkdocs-material
|
mkdocstrings
|
pymupdf
|
pytest
|
pytest-asyncio
|
pytest-cov
|
python-docx
|
striprtf
Downloads last day:
52
Downloads last week:
199
Downloads last month:
943