Author:
Rehan Fazal
License:
MIT
Summary:
Text preprocessing & PII anonymization pipeline for NLP/ML: ONNX NER ensemble, language detection, stopword removal, and configurable token replacement.
Latest version:
0.6.1
Required dependencies:
beautifulsoup4 | emoji | ftfy | huggingface-hub | lingua-language-detector | numpy | onnxruntime | presidio_anonymizer | regex | stop-words | tokenizers | unidecode
Optional dependencies:
coverage | faker | gliclass | gliner | gliner2 | hypothesis | onnxruntime | onnxruntime-gpu | presidio-analyzer | pytest | pytest-cov | pytest-timeout | rapidfuzz | ruff | torch | transformers
Downloads last day:
24
Downloads last week:
715
Downloads last month:
817