Author: Christoph Auer
License: MIT
Summary: SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Latest version: 2.32.0
Required dependencies: accelerate | beautifulsoup4 | certifi | docling-core | docling-ibm-models | docling-parse | easyocr | filetype | huggingface_hub | lxml | marko | ocrmac | onnxruntime | openpyxl | pandas | pillow | pluggy | pydantic | pydantic-settings | pylatexenc | pypdfium2 | python-docx | python-pptx | rapidocr-onnxruntime | requests | rtree | scipy | tqdm | transformers | typer
Optional dependencies: tesserocr
Downloads last day: 2
Downloads last week: 10
Downloads last month: 20