PyPI page
Home page
Author:
Leandro von Werra
Summary:
Train transformer language models with reinforcement learning.
Latest version:
0.20.0.dev0
Required dependencies:
accelerate
|
datasets
|
fastapi
|
pydantic
|
requests
|
transformers
|
uvicorn
|
vllm
Optional dependencies:
bitsandbytes
|
deepspeed
|
diffusers
|
joblib
|
liger-kernel
|
llm-blender
|
openai
|
parameterized
|
peft
|
pillow
|
pytest
|
pytest-cov
|
pytest-rerunfailures
|
pytest-xdist
|
scikit-learn
Downloads last day:
4
Downloads last week:
6
Downloads last month:
15