PyPI page
Home page
Author:
Dheemanth Manur
Summary:
Hackable RL post-training for LLMs
Latest version:
0.2.0
Required dependencies:
datasets
|
rich
|
safetensors
|
torch
|
transformers
|
vllm
Downloads last day:
12
Downloads last week:
12
Downloads last month:
268