PyPI page
Home page
Author:
Sambhranta Ghosh
Summary:
a modular Python package for cleaning text, categorical, numerical, and datetime data. It offers configurable pipelines with support for preprocessing, typo correction, encoding, imputation, logging, parallel processing, and audit reporting—perfect for data scientists handling messy, real-world datasets in ML workflows.
Latest version:
0.1.13
Required dependencies:
better-profanity
|
contractions
|
emoji
|
joblib
|
nltk
|
numpy
|
pandas
|
python-dateutil
|
python-levenshtein
|
pytz
|
scikit-learn
|
statsmodels
|
textblob
|
thefuzz
|
tqdm
Downloads last day:
7
Downloads last week:
27
Downloads last month:
51