PyPI Stats

Search

All packages
Top packages

Track packages

mod-trl


PyPI page
Home page
Author: Leandro von Werra
Summary: Train transformer language models with reinforcement learning.
Latest version: 0.20.0.dev0
Required dependencies: accelerate | datasets | fastapi | pydantic | requests | transformers | uvicorn | vllm
Optional dependencies: bitsandbytes | deepspeed | diffusers | joblib | liger-kernel | llm-blender | openai | parameterized | peft | pillow | pytest | pytest-cov | pytest-rerunfailures | pytest-xdist | scikit-learn

Downloads last day: 4
Downloads last week: 6
Downloads last month: 15