Author:
None
Summary:
Modular version of the Docling package: SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.
Latest version:
2.93.0
Required dependencies:
certifi | docling-core | filetype | mlx-vlm | mlx-whisper | ocrmac | onnxruntime | onnxruntime-gpu | pluggy | pydantic | pydantic-settings | requests | tqdm
Optional dependencies:
accelerate | arelle-release | beautifulsoup4 | defusedxml | docling-core | docling-ibm-models | docling-parse | easyocr | httpx | huggingface-hub | lxml | marko | numba | numpy | openai-whisper | openpyxl | pandas | peft | pillow | playwright | polyfactory | pylatexenc | pypdfium2 | python-docx | python-pptx | qwen-vl-utils | rapidocr | rich | rtree | scikit-image | scipy | tesserocr | torch | torchvision | transformers | tritonclient | typer | websockets
Downloads last day:
44,012
Downloads last week:
232,977
Downloads last month:
291,851