PyPI page
Home page
Author:
Md Imbesat Hassan Rizvi
License:
MIT
Summary:
Named after a spell in the Harry Potter Universe, where it amplies the sound of a speaker. In muggles' terminology, this is a repository of modules for audio and speech processing for and on top of machine learning based tasks such as speech-to-text.
Latest version:
0.1.1
Required dependencies:
datasets
|
google-cloud-speech
|
librosa
|
numpy
|
omegaconf
|
pandas
|
praat-parselmouth
|
pyaudio
|
scipy
|
six
|
torch
|
tqdm
|
transformers
|
webrtcvad
|
wget
Optional dependencies:
fairseq
|
kenlm
|
pyflashlight
|
pykaldi
Downloads last day:
1
Downloads last week:
7
Downloads last month:
19