Author: OpenMMLab
Summary: A toolset for compressing, deploying, and serving LLMs
Latest version: 0.11.1
Required dependencies: accelerate | aiohttp | cloudpickle | einops | fastapi | fire | mmengine-lite | numpy | nvidia-cublas-cu12 | nvidia-cuda-runtime-cu12 | nvidia-curand-cu12 | nvidia-nccl-cu12 | openai | openai_harmony | partial_json_parser | peft | pillow | prometheus_client | protobuf | pybase64 | pydantic | pyzmq | ray | safetensors | sentencepiece | shortuuid | tiktoken | torch | torchvision | transformers | triton | uvicorn | xgrammar
Optional dependencies: accelerate | aiohttp | cloudpickle | cmake_build_extension | datasets | einops | fastapi | fire | mmengine-lite | numpy | openai | openai_harmony | partial_json_parser | peft | pillow | prometheus_client | protobuf | pybase64 | pybind11 | pydantic | pyzmq | ray | safetensors | sentencepiece | setuptools | shortuuid | tiktoken | timm | torch | torchvision | transformers | transformers_stream_generator | uvicorn | xgrammar
Downloads last day: 2,062
Downloads last week: 13,240
Downloads last month: 47,520