PyPI page
Home page
Author:
None
License:
MIT License
Copyright (c) 2025 Giuseppe Levi
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated docu...
Summary:
A Python library for extracting text from different types of files (PDF, DOCX, PPTX, XLSX, ODT, ecc.).
Latest version:
0.3.5
Required dependencies:
python-magic
|
python-magic-bin
Optional dependencies:
beautifulsoup4
|
easyocr
|
ebooklib
|
extract-msg
|
lxml
|
markdown
|
odfpy
|
ollama
|
openai-whisper
|
openpyxl
|
pillow
|
pylatexenc
|
pymupdf
|
python-docx
|
python-pptx
|
striprtf
|
xlrd
Downloads last day:
46
Downloads last week:
115
Downloads last month:
424