PyPI page
Home page
Author:
Rohan Pandey
License:
MIT
Summary:
Fine-tune LLM agents with online reinforcement learning
Latest version:
0.1.1
Required dependencies:
accelerate
|
bitsandbytes
|
gymnasium
|
peft
|
scipy
|
textworld
|
torch
|
transformers
|
trl
|
wandb
Downloads last day:
1
Downloads last week:
94
Downloads last month:
120