PyPI page
Home page
Author:
Mykola Melnyk
License:
AGPL-3.0
Summary:
Spark-Pdf is a library for processing documents using Apache Spark
Latest version:
0.1.0rc9
Required dependencies:
imagesize
|
numpy
|
pandas
|
pillow
|
pyarrow
|
pymupdf
|
pyspark
|
pytesseract
|
pytest
Optional dependencies:
torch
|
transformers
Downloads last day:
0
Downloads last week:
28
Downloads last month:
38