Cascade-trained deep reinforcement learning for PID gain optimization in precise joint position control of 6-DoF robotic arm

Padhiyar, Nitin

doi:10.1088/2631-8695/ae090e

Cascade-trained deep reinforcement learning for PID gain optimization in precise joint position control of 6-DoF robotic arm

Source

Engineering Research Express

Date Issued

2025-12-31

Author(s)

Vyas, Dhaval R.

Thakar, Parth S.

Markana, Anilkumar

Padhiyar, Nitin

DOI

10.1088/2631-8695/ae090e

Volume

7

Issue

4

Abstract

The problem of precise joint position tracking has remained as a core challenge for a 6-DoF Cobot arm, especially due to often scenario of an arbitrary waypoint reference trajectory generated due to human interactions. To tackle this problem, we propose a novel cascade training based Deep Reinforcement Learning (DRL) algorithm that tunes the PID controller gains for each joint simultaneously, ensuring accurate positional tracking for all joints. This also addresses the problem of overestimation of control parameters by ensuring that performance criteria are met in a phased manner during the training process. The tuned DRL based PID clearly outperforms the conventional PID control by accurately tracking the arbitrary waypoints given to each joints of the Cobot arm. We show the efficacy of the proposed method through exhaustive simulations and performing quantitative analysis of various key performance criteria like- Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Average Control Effort (ACE) error for the Cobot. The obtained results of DRL-PID control, when compared with its conventional PID counterpart, clearly depict the superiority of the proposed DRL-PID scheme via a cascade training approach. We have also remarked on some trade off and implementation aspects of the proposed control policy for the Cobot based applications. This method has the potential to be applicable to similar complex dynamical systems like a Cobot, where arbitrary reference and human interactions are prime concerns.

Unpaywall

URI

https://repository.iitgn.ac.in/handle/IITG2025/33294

Keywords

6-DoF cobot arm | DRL | mycobot 280 | PID | waypoint trajectory