PyPI page
Home page
Author:
putkoff
License:
MIT
Summary:
A structured pipeline for transforming PDFs into **searchable, metadata-rich, web-ready content**, combining OCR, page-level analysis, metadata generation, and static site scaffolding.
Latest version:
0.0.33
Required dependencies:
abstract_ocr
|
abstract_utilities
|
keybert
|
pymupdf
|
speechrecognition
|
whisper
Downloads last day:
100
Downloads last week:
158
Downloads last month:
629