PyPI page
Home page
Author:
Oak Ridge National Laboratory
Summary:
A flexible machine learning data pre-processing pipeline framework.
Latest version:
0.5.0
Required dependencies:
datasets
|
duckdb
|
gensim
|
numpy
|
pandas
|
polars-u64-idx
|
pyarrow
|
scipy
|
tokenizers
|
transformers
Downloads last day:
5
Downloads last week:
5
Downloads last month:
37