PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
- 
            Updated
            Oct 22, 2025 
- Python
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
🏋 Modern open-source fitness coaching platform. Create workout plans, track progress, and access a comprehensive exercise database.
Self hosted FLOSS fitness/workout, nutrition and weight tracker
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Simple and easily configurable grid world environments for reinforcement learning
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
Source codes for the book "Reinforcement Learning: Theory and Python Implementation"
Add a description, image, and links to the gym topic page so that developers can more easily learn about it.
To associate your repository with the gym topic, visit your repo's landing page and select "manage topics."