PyPI page
Home page
Author:
None
License:
MIT
Summary:
Data processing pipeline using MLX (scraper, chunker, extractor).
Latest version:
0.1.0
Required dependencies:
beautifulsoup4
|
cchardet
|
cryptography
|
dataclasses-json
|
html2text
|
iso8601
|
lxml
|
mlx
|
mlx-lm
|
numpy
|
pillow
|
playwright
|
progressbar2
|
pybase64
|
pydantic
|
pymupdf
|
pypandoc
|
pytesseract
|
python-docx
|
python-frontmatter
|
python-magic
|
python-pptx
|
python-slugify
|
regex
|
requests
|
requests-html
|
requests-toolbelt
|
tqdm
|
trafilatura
|
url-normalize
|
validators
Optional dependencies:
black
|
isort
|
mypy
|
pytest
|
ruff
Downloads last day:
6
Downloads last week:
83
Downloads last month:
90