#

ppo

Here are 630 public repositories matching this topic...

tanyuqian / redco

NAACL '24 (Demo) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference

Updated Jun 6, 2024
Python

Starlight0798 / gymRL

基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)

pytorch dqn gym rl ppo

Updated Jun 6, 2024
Python

jianzhnie / LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

llama ppo dpo chatgpt rlhf qlora qwen mixtral llama3

Updated Jun 6, 2024
Python

xuance

agi-brain / xuance

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Updated Jun 6, 2024
Python

WorldEditor50 / snakeAI

testing MLP, DQN, PPO, SAC, policy-gradient by snake

reinforcement-learning lstm dqn policy-gradient sac ppo snakeai

Updated Jun 5, 2024
C++

itsMyrto / CarRacing-v2-gymnasium

agent deep-reinforcement-learning convolutional-neural-networks deep-q-learning dueling-dqn ppo car-racing-game

Updated Jun 5, 2024
Jupyter Notebook

modelbased / minirllab

Mini RL Lab

reinforcement-learning pytorch beginner-friendly sac gym-environment ppo

Updated Jun 5, 2024
Python

Arena-Rosnav / arena-rosnav

python benchmarking robotics navigation simulation pytorch ros drl ppo

Updated Jun 5, 2024
Python

OctopusMind / RLHF_PPO

ppo算法实现

lora ppo rlhf qwen

Updated Jun 5, 2024
Python

HasancanCakicioglu / Custom-BlackJack-Environment-ReinforcementLearning

This project involves creating a custom Blackjack environment and training an AI using reinforcement learning techniques, specifically Proximal Policy Optimization (PPO) and Deep Q-Network (DQN). The goal is to teach the AI to play Blackjack and achieve the best possible win rate.

reinforcement-learning ai blackjack dqn ppo custom-environment

Updated Jun 5, 2024
Python

AI4Finance-Foundation / ElegantRL

Massively Parallel Deep Reinforcement Learning. 🔥

lightweight reinforcement-learning gae efficient pytorch stable dqn ddpg sac per multiple-gpu ppo a2c td3 model-free-rl drl-pytorch bipedalwalkerhardcore

Updated Jun 5, 2024
Python

notjedi / tuxkart-ai

[WIP] RL agent for the SuperTuxKart game.

reinforcement-learning deep-reinforcement-learning pytorch policy-gradient autoencoder vae rl ppo vae-pytorch

Updated Jun 4, 2024
Python

jianzhnie / deep-rl-toolkit

RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC,A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

deep-reinforcement-learning dqn gym atari ddpg sac actor-critic trpo mujoco ppo td3

Updated Jun 4, 2024
Python

ZJLAB-AMMI / LLM4RL

A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM

reinforcement-learning interaction ppo llm vicuna-7b vicuna-13b

Updated Jun 4, 2024
Python

leolellisr / poke_RL

Code repository with classical reinforcement learning and deep reinforcement learning methods for Pokémon battles in Pokémon Showdown.

game pokemon reinforcement-learning qlearning monte-carlo deep-reinforcement-learning dqn pokemon-showdown reinforce function-approximation double-dqn sarsa-lambda deep-rl ppo ppo2

Updated Jun 4, 2024
Jupyter Notebook

XuTpoKoT / music-shop

db bmstu iu7 sd ppo bmstu-iu7

Updated Jun 3, 2024
Java

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

reinforcement-learning deep-reinforcement-learning reinforcement-learning-algorithms ppo jax

Updated Jun 2, 2024
Python

AnasNeumann / gns

Engineer-To-Order (ETO) Graph Neural Scheduling (GNS) Project

pytorch manufacturing proximal-policy-optimization ppo graphneuralnetwork pytorchgeometric engineer-to-order

Updated Jun 2, 2024
Python

Connor2803 / CITS3001-Mario-Project

AI Models for Playing Super Mario Bros

machine-learning ai pytorch ddqn ppo

Updated Jun 2, 2024
Python

ymg1114 / pytorch-distributed-reinforcement-learning

Pytorch implementation of various distributed reinforcement learning algorithms

reinforcement-learning impala pytorch sac gym-environment ppo v-mpo

Updated Jun 2, 2024
Python

Improve this page

Add a description, image, and links to the ppo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ppo topic, visit your repo's landing page and select "manage topics."