Self-driving car env with PPO algorithm from stable baseline3

Sornsiri.P

Last update: Dec 22, 2022

Related tags

Deep Learning Self-Driving-car

Overview

Self-driving car with RL stable baseline3

Most of the project develop from https://github.com/GerardMaggiolino/Gym-Medium-Post Please check it out!

This project focus on training self-driving car env by implementing PPO algorithm from stable baseline3

Installation

Clone the project

git clone https://github.com/SornsiriP/Self-Driving-car

Then run Gym-Medium-Post/main.py

Update

Wrap env to change observation space from box to RGB image

from simple_driving.resources.wrapper import ProcessFrame84

env = ProcessFrame84(env)

Using PPO with CNN policy instead of TRPO

from stable_baselines3 import PPO

model = PPO('CnnPolicy', env, verbose=1,learning_rate = 0.00025,tensorboard_log="./Simple-driving/",n_steps=10000,batch_size=1000,gamma=0.9995)
model.learn(total_timesteps=150000)

Normalize action space

def map_action(self, action):
  speed_range = [0,1]
  steer_range = [-0.6,0.6]
  new_speed = np.interp(action[0],[-1,1],speed_range)
  new_steer = np.interp(action[0],[-1,1],steer_range)
  return [new_speed, new_steer]

Add limited timestep reset condition

if self.current_step >1000:
  self.current_step = 0
  self.done = True

Normalize distance in reward function

previous_dist_to_goal = np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, self.prev_pos)))
current_dist_to_goal =  np.linalg.norm(tuple(map(lambda i, j: i - j, self.goal, car_ob[0:2])))

Reference

https://github.com/GerardMaggiolino/Gym-Medium-Post

https://www.etedal.net/2020/04/pybullet-panda_3.html

Contributing

Sornsiri Promma

Thanks original project from Gerard Maggiolino

Please make sure to update tests as appropriate.

You might also like...

A clean and robust Pytorch implementation of PPO on continuous action space.

PPO-Continuous-Pytorch I found the current implementation of PPO on continuous action space is whether somewhat complicated or not stable. And this is

56 Dec 16, 2022

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

3k Dec 31, 2022

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Automatic, Readable, Reusable, Extendable Machin is a reinforcement library designed for pytorch. Build status Platform Status Linux Windows Supported

348 Dec 24, 2022

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PyTorch RL Minimal Implementations There are implementations of some reinforcement learning algorithms, whose characteristics are as follow: Less pack

4 Dec 31, 2022

Comments

Project Page Stable Baselines3

Hello,

nice project =)

We would be interested if you could do a pull request on stable-baselines3 where you add your project to the documentation (project section) ;)

opened by araffin 0

Self-driving car env with PPO algorithm from stable baseline3

Related tags

Overview

Self-driving car with RL stable baseline3

Installation

Update

Reference

Contributing

You might also like...

A clean and robust Pytorch implementation of PPO on continuous action space.

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PPO Lagrangian in JAX

Tackling Obstacle Tower Challenge using PPO & A2C combined with ICM.

Self-Supervised Pillar Motion Learning for Autonomous Driving (CVPR 2021)

Graph Self-Attention Network for Learning Spatial-Temporal Interaction Representation in Autonomous Driving

Reinforcement learning for self-driving in a 3D simulation

Comments

Project Page Stable Baselines3

Owner

Sornsiri.P

Open source hardware and software platform to build a small scale self driving car.

Simulation of Self Driving Car

RL algorithm PPO and IRL algorithm AIRL written with Tensorflow.

Stable Neural ODE with Lyapunov-Stable Equilibrium Points for Defending Against Adversarial Attacks

CRISCE: Automatically Generating Critical Driving Scenarios From Car Accident Sketches

This script scrapes and stores the availability of timeslots for Car Driving Test at all RTA Serivce NSW centres in the state.

Genetic Algorithm, Particle Swarm Optimization, Simulated Annealing, Ant Colony Optimization Algorithm,Immune Algorithm, Artificial Fish Swarm Algorithm, Differential Evolution and TSP(Traveling salesman)

PPO is a very popular Reinforcement Learning algorithm at present.

Wordle Env: A Daily Word Environment for Reinforcement Learning

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).