Author: None
License: MIT
Summary: PDF, DOCX, and HTML text extraction and normalization for academic papers
Latest version: 2.4.69
Required dependencies: camelot-py | pdfplumber
Optional dependencies: anyio | beautifulsoup4 | httpx | lxml | mammoth | pytest | pytest-anyio | python-docx | rapidfuzz
Downloads last day: 506
Downloads last week: 4,460
Downloads last month: 7,760