Author: None
Summary: Zem: Unified Data Pipeline Framework (ZenML + NeMo Curator + DataJuicer) for multi-domain processing
Latest version: 0.4.0
Required dependencies: click | fastmcp | loguru | mcp | numpy | pandas | pyarrow | pydantic | pyyaml | rich | zenml
Optional dependencies: accelerate | argilla | black | cachetools | dask-cuda | datasketch | einops | faiss-cpu | fastapi | ftfy | h5py | krippendorff | landingai-ade | lhotse | librosa | litellm | matplotlib | mypy | nemo-curator | noisereduce | numpy | onnxruntime | onnxruntime-gpu | openai-whisper | opencv-python | openpyxl | opik | paddleocr | paddlepaddle | pandas | pdfplumber | pesq | pillow | py-data-juicer | pyannote-audio | pyclipper | pydantic-settings | pydub | pyloudnorm | pymupdf | pystoi | pytesseract | pytest | pytest-cov | python-magic | python-multipart | pyvi | ruamel-yaml | ruff | scikit-learn | scipy | sentence-transformers | sentencepiece | shapely | soundfile | tabulate | thop | toml | torch | torchaudio | torchinfo | torchvision | transformers | underthesea | unstructured | uvicorn | vllm
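Because the optional list is long, it can help to see which of those packages are already present in an environment. The following is a minimal sketch using only the standard library. The helper name `installed_versions` and the sample of four names are illustrative assumptions, not part of Zem, and the listing does not say which extras group each package belongs to.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            found[name] = None
    return found


# A hypothetical sample of names taken from the optional-dependency list.
optional = ["torch", "transformers", "nemo-curator", "py-data-juicer"]

for name, ver in installed_versions(optional).items():
    print(f"{name}: {ver or 'not installed'}")
```

The lookup is by distribution name as published on the index (for example `py-data-juicer`), which may differ from the module name used in an `import` statement.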
Downloads last day: 0
Downloads last week: 87
Downloads last month: 162