PyPI page
Home page
Author:
vLLM Team
License:
Apache 2.0
Summary:
A high-throughput and memory-efficient inference and serving engine for LLMs
Latest version:
0.3.3
Required dependencies:
cmake
|
cupy-cuda12x
|
fastapi
|
ninja
|
numpy
|
outlines
|
prometheus-client
|
psutil
|
pydantic
|
pynvml
|
ray
|
sentencepiece
|
torch
|
transformers
|
triton
|
uvicorn
|
xformers
Downloads last day:
0
Downloads last week:
27
Downloads last month:
43