PyPI Stats

Search

All packages
Top packages

Track packages

llamagym


PyPI page
Home page
Author: Rohan Pandey
License: MIT
Summary: Fine-tune LLM agents with online reinforcement learning
Latest version: 0.1.1
Required dependencies: accelerate | bitsandbytes | gymnasium | peft | scipy | textworld | torch | transformers | trl | wandb

Downloads last day: 1
Downloads last week: 94
Downloads last month: 120