PyPI Stats

squeakycleantext


Author: Rehan Fazal
License: MIT
Summary: Text preprocessing & PII anonymization pipeline for NLP/ML: ONNX NER ensemble, language detection, stopword removal, and configurable token replacement.
Latest version: 0.6.1
Required dependencies: beautifulsoup4 | emoji | ftfy | huggingface-hub | lingua-language-detector | numpy | onnxruntime | presidio_anonymizer | regex | stop-words | tokenizers | unidecode
Optional dependencies: coverage | faker | gliclass | gliner | gliner2 | hypothesis | onnxruntime | onnxruntime-gpu | presidio-analyzer | pytest | pytest-cov | pytest-timeout | rapidfuzz | ruff | torch | transformers

Downloads last day: 24
Downloads last week: 715
Downloads last month: 817