abstract-pdfs

Author: putkoff
License: MIT
Summary: A structured pipeline for transforming PDFs into searchable, metadata-rich, web-ready content, combining OCR, page-level analysis, metadata generation, and static site scaffolding.
Latest version: 0.0.39
Required dependencies: abstract_ocr | abstract_utilities | keybert | pymupdf | speechrecognition | whisper

Downloads last day: 64
Downloads last week: 1,211
Downloads last month: 2,237