Author: Ryan Parker
Summary: Using GRPO with RLVR, fine-tune LLMs to enhance coding capabilities
Latest version: 0.1.2
Required dependencies: datasets | httpx | pydantic | pydantic-monty | tqdm | trackio | trl | typer
Downloads last day: 0
Downloads last week: 32
Downloads last month: 333