PyPI page
Home page
Author:
Casper Hansen
License:
MIT
Summary:
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
Latest version:
0.2.7.post1
Required dependencies:
accelerate
|
datasets
|
intel-extension-for-pytorch
|
tokenizers
|
torch
|
transformers
|
triton
|
typing-extensions
|
zstandard
Optional dependencies:
black
|
evaluate
|
griffe-typingdoc
|
lm-eval
|
mkdocs-material
|
mkdocstrings-python
|
protobuf
|
scipy
|
tabulate
Downloads last day:
1,938
Downloads last week:
17,209
Downloads last month:
70,315