PyPI page
Home page
Author:
Leandro von Werra
License:
Apache 2.0
Summary:
Train transformer language models with reinforcement learning.
Latest version:
0.13.0
Required dependencies:
accelerate
|
datasets
|
deepspeed
|
liger-kernel
|
rich
|
transformers
Optional dependencies:
bitsandbytes
|
diffusers
|
llm-blender
|
mergekit
|
openai
|
parameterized
|
peft
|
pillow
|
pytest
|
pytest-cov
|
pytest-rerunfailures
|
pytest-xdist
|
scikit-learn
Downloads last day:
15,558
Downloads last week:
193,645
Downloads last month:
838,381