PyPI page
Home page
Author:
Mykola Melnyk
License:
AGPL-3.0
Summary:
ScaleDP is a library for processing documents and images using Apache Spark and LLMs
Latest version:
0.3.0
Required dependencies:
filelock
|
huggingface-hub
|
imagesize
|
img2pdf
|
levenshtein
|
numpy
|
onnxruntime
|
openai
|
opencv-python
|
pandas
|
pillow
|
pyarrow
|
pyclipper
|
pydantic
|
pymupdf
|
pyspark
|
pytesseract
|
pytest
|
semantic-text-splitter
|
shapely
|
sparkdantic
|
tenacity
|
torch
|
torchvision
Optional dependencies:
easyocr
|
python-doctr
|
sentence-transformers
|
surya-ocr
|
transformers
Downloads last day:
22
Downloads last week:
29
Downloads last month:
475