PyPI page
Home page
Author:
None
Summary:
Train transformer language models with reinforcement learning.
Latest version:
0.25.1
Required dependencies:
accelerate
|
datasets
|
transformers
Optional dependencies:
bitsandbytes
|
deepspeed
|
fastapi
|
hf-doc-builder
|
joblib
|
liger-kernel
|
llm-blender
|
math-verify
|
num2words
|
openai
|
peft
|
pillow
|
pre-commit
|
pydantic
|
pytest
|
pytest-cov
|
pytest-rerunfailures
|
pytest-xdist
|
requests
|
scikit-learn
|
torchvision
|
uvicorn
|
vllm
Downloads last day:
110,374
Downloads last week:
578,552
Downloads last month:
2,578,095