NAACL '24 (Demo) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
-
Updated
Jun 6, 2024 - Python
NAACL '24 (Demo) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
testing MLP, DQN, PPO, SAC, policy-gradient by snake
Mini RL Lab
This project involves creating a custom Blackjack environment and training an AI using reinforcement learning techniques, specifically Proximal Policy Optimization (PPO) and Deep Q-Network (DQN). The goal is to teach the AI to play Blackjack and achieve the best possible win rate.
Massively Parallel Deep Reinforcement Learning. 🔥
[WIP] RL agent for the SuperTuxKart game.
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Code repository with classical reinforcement learning and deep reinforcement learning methods for Pokémon battles in Pokémon Showdown.
Really Fast End-to-End Jax RL Implementations
Engineer-To-Order (ETO) Graph Neural Scheduling (GNS) Project
AI Models for Playing Super Mario Bros
Pytorch implementation of various distributed reinforcement learning algorithms
Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.
To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."