PyPI page
Home page
Author:
None
License:
MIT
Summary:
NVIDIA cuDNN Frontend — Python and C++ Graph API with SOTA attention (SDPA / Flash Attention), MoE grouped GEMM fusions, and FP8/MXFP8 kernels for Hopper and Blackwell GPUs.
Latest version:
1.24.0
Optional dependencies:
apache-tvm-ffi
|
cuda-python
|
nvidia-cutlass-dsl
|
torch
|
torch-c-dlpack-ext
Downloads last day:
78,914
Downloads last week:
725,232
Downloads last month:
3,245,023