PyPI Stats

Search

All packages
Top packages

Track packages

unstable-rl


PyPI page
Home page
Author: None
License: MIT License Copyright (c) 2025 LeonGuertler Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated docum...
Summary: An Async Online Multi-Agent RL library for training reasoning models on TextArena games.
Latest version: 0.2.2
Required dependencies: dm-tree | peft | pynvml | ray | textarena | torch | transformers | trueskill | vllm | wandb

Downloads last day: 20
Downloads last week: 36
Downloads last month: 51