PyPI page
Home page
Author:
Felipe Rosa, Namastex Labs
License:
BSD-2-Clause
Summary:
Modern speech recognition with word-level timestamps and speaker diarization. Fork of WhisperX with torch 2.6+, pyannote 4.x compatibility.
Latest version:
1.0.6
Required dependencies:
av
|
ctranslate2
|
faster-whisper
|
imageio-ffmpeg
|
nltk
|
numpy
|
nvidia-cudnn-cu12
|
omegaconf
|
pandas
|
pyannote-audio
|
torch
|
torchaudio
|
torchcodec
|
transformers
|
triton
Downloads last day:
0
Downloads last week:
90
Downloads last month:
143