Author:
None
Summary:
Privacy-first document intelligence engine — converts PDFs, DOCX, PPTX, XLSX, and CSV into AI-ready Markdown + structured JSON for RAG pipelines.
Latest version:
0.1.5
Required dependencies:
docling | docling-core | fast-langdetect | langgraph-checkpoint-mongodb | pydantic
Optional dependencies:
anyio | arq | build | chromadb | defusedxml | docxlatex | faiss-cpu | faiss-gpu | fastapi | httpx | langchain | langchain-chroma | langchain-core | langchain-google-genai | langchain-groq | langchain-huggingface | langchain-mongodb | langchain-openai | langgraph | langgraph-checkpoint | llama-index-core | longparser | marker-pdf | motor | mypy | pix2tex | pix2text | pymupdf4llm | pytest | pytest-asyncio | pytest-cov | python-dotenv | python-magic | python-multipart | python-pptx | qdrant-client | redis | ruff | sentence-transformers | spacy | tiktoken | twine | uvicorn
Downloads last day:
9
Downloads last week:
105
Downloads last month:
772