Author:
None
License:
MIT License
Copyright (c) 2025 LeonGuertler
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated docum...
Summary:
An Async Online Multi-Agent RL library for training reasoning models on TextArena games.
Latest version:
0.2.2
Required dependencies:
dm-tree | peft | pynvml | ray | textarena | torch | transformers | trueskill | vllm | wandb
Downloads last day:
20
Downloads last week:
36
Downloads last month:
51