Author:
docparseai
License:
MIT License
Copyright (c) 2025 DocParseAI
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documen...
Summary:
Convert PDF, DOCX, PPTX, CSV, and image documents, as well as URLs such as Medium and Wikipedia pages, to text or Markdown. Extracts text, images, and tables. Supports LLM-based extraction.
Latest version:
0.2.0
Required dependencies:
beautifulsoup4 | comtypes | docx2pdf | google-generativeai | html2text | pandas | pillow | pymupdf | pytesseract | python-docx | python-pptx | tabulate | typing-extensions
Downloads last day:
29
Downloads last week:
200
Downloads last month:
1,084