Author:
Adit Bajaj
Summary:
A fast PDF extractor; a 200 pages/s alternative to Marker, Docling, PyMuPDF4LLM, and others.
Latest version:
1.1.2
Required dependencies:
ijson | pydantic | rich | shellingham | typer
Optional dependencies:
altair | apted | build | datasets | docling | fibrum-pdf | lxml | mistune | polars | pymupdf | pymupdf4llm | python-levenshtein | rapidfuzz | rich | ruff | scipy | typer
Downloads last day:
279
Downloads last week:
666
Downloads last month:
1,745