mlx-guided-grpo

Author: None
License: MIT
Summary: Guided Group Relative Policy Optimization (GRPO) training for MLX on Apple Silicon
Latest version: 2.1.1
Required dependencies: mlx | mlx-lm | numpy | pyyaml | tqdm | transformers
Optional dependencies: black | isort | pytest | scikit-learn | wandb

Downloads last day: 3
Downloads last week: 3
Downloads last month: 36
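
Note on the summary: the "group relative" part of GRPO refers to scoring each sampled completion against the other completions drawn for the same prompt, rather than against a learned value baseline. The sketch below illustrates that normalization step only. It is not taken from mlx-guided-grpo: the function name `group_relative_advantages`, the `eps` parameter, and the use of the population standard deviation are assumptions made for illustration, and the package's own implementation may differ.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Return one advantage per completion in a single prompt's group.

    Hypothetical helper (not part of mlx-guided-grpo): each reward is
    centered on the group mean and scaled by the group standard deviation.
    `eps` avoids division by zero when every reward in the group is equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Rewards for four completions sampled from the same prompt (made-up values).
rewards = [1.0, 0.0, 0.5, 0.5]
advantages = group_relative_advantages(rewards)

# Completions above the group mean get a positive advantage, those below get
# a negative one, and those exactly at the mean get zero.
for r, a in zip(rewards, advantages):
    print(f"reward={r:.1f} advantage={a:+.3f}")
```

When every completion in a group receives the same reward, all advantages come out as zero under this scheme, so that group contributes no learning signal.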