PyPI page
Home page
Author:
Vivek Varikuti
License:
MIT
Summary:
First open-source implementation of TurboQuant (arXiv 2504.19874) — 4-7x LLM KV cache compression
Latest version:
0.1.0
Required dependencies:
scipy
|
torch
|
transformers
Optional dependencies:
accelerate
|
bitsandbytes
|
pytest
Downloads last day:
4
Downloads last week:
9
Downloads last month:
61