PyPI Stats

py-data-juicer


Author: SysML Team of Alibaba Tongyi Lab
License: Apache-2.0
Summary: Data Processing for and with Foundation Models.
Latest version: 1.5.1
Required dependencies: av | bs4 | datasets | dep-logic | dill | emoji | fastapi | fsspec | gitpython | httpx | jsonargparse | jsonlines | librosa | loguru | lz4 | matplotlib | mcp | multiprocess | mwparserfromhell | numpy | pandas | pdfplumber | pillow | plotly | psutil | pydantic | pylance | python-docx | rank-bm25 | regex | requests | resampy | samplerate | seaborn | spacy | streamlit | tabulate | tomli | tomli-w | tqdm | uv | wget | wordcloud | zstandard
Optional dependencies: accelerate | audiomentations | bitarray | black | boto3 | build | click | coverage | cudf-cu12 | dashscope | decord | diffusers | docstring-parser | easyocr | einops | fasttext-wheel | ffmpeg-python | fire | flake8-black | ftfy | furo | imagededup | kenlm | label-studio | linkify-it-py | myst-parser | nlpaug | nlpcda | nltk | onnxruntime | openai | opencc | opencv-contrib-python | pre-commit | ptlflow | pyspark | pytest | pytest-cov | qwen-vl-utils | ray | recommonmark | rembg | rouge | s3fs | scenedetect | selectolax | sentencepiece | simhash-pybind | simple-aesthetics-predictor | soundfile | spacy-pkuseg | sphinx | sphinx-autobuild | sphinx-copybutton | tiktoken | timm | toml | torch | torchaudio | torchcodec | transformers | ultralytics | uvloop | vllm | wandb

Downloads last day: 32
Downloads last week: 578
Downloads last month: 2,312
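
The three download counts cover windows of different lengths, so they are easier to compare as per-day averages. The following is a minimal sketch of that arithmetic, not something the listing itself provides: the `downloads` dictionary and the `daily_average` helper are hypothetical names introduced for illustration, and the "last month" window is assumed to span 30 days.

```python
# Download counts copied from the listing above.
downloads = {"last_day": 32, "last_week": 578, "last_month": 2312}


def daily_average(total: int, days: int) -> float:
    """Return the mean number of downloads per day over a window of `days` days."""
    if days <= 0:
        raise ValueError("days must be positive")
    return total / days


weekly_avg = daily_average(downloads["last_week"], 7)
# Assumption: the "last month" window is 30 days long.
monthly_avg = daily_average(downloads["last_month"], 30)

print(f"weekly window:  {weekly_avg:.1f} downloads/day")
print(f"monthly window: {monthly_avg:.1f} downloads/day")
```

Under these assumptions, the last-day figure of 32 sits well below both averages (roughly 83 and 77 downloads per day). A single day is a small sample, so this may reflect ordinary day-to-day variation rather than a trend.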