PyPI page
Home page
Author:
Christoph Auer
License:
MIT
Summary:
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Latest version:
2.13.1
Required dependencies:
beautifulsoup4
|
certifi
|
deepsearch-glm
|
docling-core
|
docling-ibm-models
|
docling-parse
|
easyocr
|
filetype
|
google-api-core
|
google-auth
|
google-cloud-vision
|
googleapis-common-protos
|
huggingface_hub
|
lxml
|
marko
|
ocrmac
|
onnxruntime
|
openpyxl
|
pandas
|
pydantic
|
pydantic-settings
|
pypdfium2
|
python-docx
|
python-pptx
|
rapidocr-onnxruntime
|
requests
|
rtree
|
scipy
|
typer
Optional dependencies:
tesserocr
Downloads last day:
1
Downloads last week:
63
Downloads last month:
90