Author:
Yuvaraj Kannan
License:
Apache-2.0
Summary:
A fast, layout-aware OCR decision engine for document processing pipelines. Detects whether files truly require OCR before expensive processing, reducing unnecessary OCR calls while preserving extraction reliability.
Latest version:
1.11.0
Required dependencies:
beautifulsoup4 | click | numpy | openpyxl | pdfplumber | pillow | pydantic | python-docx | python-magic | python-pptx
Optional dependencies:
black | matplotlib | mypy | opencv-python-headless | pre-commit | pymupdf | pytest | pytest-cov | ruff | tqdm
Downloads last day:
251
Downloads last week:
650
Downloads last month:
1,475