Author: None
License: Apache-2.0
Summary: Single-header LLM inference engine with KV cache compression (7× compression at fp32 parity)
Latest version: 0.13.0
Optional dependencies: build, pytest, twine
Downloads last day: 16
Downloads last week: 204
Downloads last month: 12,771