PyPI page
Home page
Author:
Speech Lab of Alibaba Group
License:
The MIT License
Summary:
Industrial-grade speech recognition: 170x realtime, 50+ languages, speaker diarization, emotion detection.
Latest version:
1.3.9
Required dependencies:
editdistance
|
huggingface_hub
|
hydra-core
|
jaconv
|
jamo
|
jieba
|
kaldiio
|
librosa
|
modelscope
|
numpy
|
omegaconf
|
oss2
|
pyyaml
|
requests
|
safetensors
|
scipy
|
sentencepiece
|
soundfile
|
tensorboardx
|
tiktoken
|
torch_complex
|
tqdm
|
transformers
|
umap_learn
Optional dependencies:
accelerate
|
black
|
commonmark
|
configargparse
|
editdistance
|
einops
|
fairscale
|
flake8
|
flake8-docstrings
|
hacking
|
jinja2
|
jsondiff
|
matplotlib
|
mock
|
nbsphinx
|
openai-whisper
|
pillow
|
pycodestyle
|
pytest
|
pytest-cov
|
pytest-pythonpath
|
pytest-timeouts
|
recommonmark
|
scipy
|
sphinx
|
sphinx-argparse
|
sphinx-markdown-tables
|
sphinx-rtd-theme
|
tiktoken
|
torch_optimizer
|
torchvision
|
transformers
|
transformers_stream_generator
Downloads last day:
11,283
Downloads last week:
85,657
Downloads last month:
344,365