PyPI page
Home page
Author:
fused-turboquant Contributors
Summary:
Fused Triton encode/decode kernels for TurboQuant KV cache compression, powered by Randomized Hadamard Transform.
Latest version:
0.1.0
Required dependencies:
numpy
|
scipy
|
torch
|
triton
|
triton-windows
|
vllm
Optional dependencies:
accelerate
|
datasets
|
pytest
|
pytest-benchmark
|
ruff
|
transformers
Downloads last day:
22
Downloads last week:
72
Downloads last month:
150