Reinforcement learning

Development of Warehouse Robot Arms for Grasping Objects

Implementation of Deep Deterministic Policy Gradient and Twin Delayed Deep Deterministic Policy Gradient with Hind-sight Experience Replay to control robot arm