Deep deterministic policy gradient actor-critic algorithm with supervision controller #73

Emercy44 · 2020-02-19T12:37:07Z

A deep deterministic policy gradient (DDPG) actor-critic algorithm with a supervision controller is proposed. The latent (indirect) supervision can guarantee stable convergence of the control actor network. The conventional controller does not need excellent control performance; its main role is to supply helpful control experience for the actor network, which is then trained on that experience.
Deep deterministic policy gradient actor-critic algorithm with supervision controller.docx
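
For readers without the attachment, the idea above can be sketched roughly as follows. This is not the code from the .docx. It is a minimal illustration under loud assumptions: the plant is a made-up scalar system, the "conventional controller" is a hypothetical proportional law, and linear approximators stand in for the deep actor and critic networks so the example needs only the standard library. All names (`supervisor`, `collect`, `train`) and all hyperparameters are invented for illustration, and the sketch makes no claim about the convergence guarantee described in the post.

```python
import random

def supervisor(x, kp=0.5):
    """Hypothetical conventional controller: a mediocre proportional law."""
    return -kp * x

def step(x, u, b=0.1):
    """Assumed toy plant x' = x + b*u; reward is a negative quadratic cost."""
    x_next = x + b * u
    reward = -(x * x + 0.1 * u * u)
    return x_next, reward

def collect(n_steps, rng, noise=0.3, horizon=20):
    """Fill a replay buffer using only the supervisor plus exploration noise."""
    buffer = []
    x = rng.uniform(-1.0, 1.0)
    for t in range(n_steps):
        if t % horizon == 0:              # start a new episode
            x = rng.uniform(-1.0, 1.0)
        u = supervisor(x) + rng.gauss(0.0, noise)
        x_next, r = step(x, u)
        buffer.append((x, u, r, x_next))
        x = x_next
    return buffer

def features(x, u):
    """Quadratic critic features, so Q(x, u) = w0*x^2 + w1*x*u + w2*u^2."""
    return (x * x, x * u, u * u)

def q_value(w, x, u):
    return sum(wi * fi for wi, fi in zip(w, features(x, u)))

def train(buffer, rng, iters=20000, warmup=2000,
          gamma=0.9, lr_c=0.05, lr_a=0.01, tau=0.01):
    """DDPG-style updates on supervisor experience (linear stand-ins).

    Actor: mu(x) = theta * x. One transition is sampled per update
    instead of a minibatch, to keep the sketch short.
    """
    theta, w = 0.0, [0.0, 0.0, 0.0]
    theta_t, w_t = theta, list(w)         # target actor and target critic
    for i in range(iters):
        x, u, r, x_next = rng.choice(buffer)
        # Critic: one-step TD target built from the target actor and critic.
        y = r + gamma * q_value(w_t, x_next, theta_t * x_next)
        delta = y - q_value(w, x, u)
        w = [wi + lr_c * delta * fi for wi, fi in zip(w, features(x, u))]
        # Actor: deterministic policy gradient dQ/du * dmu/dtheta at u = mu(x),
        # applied only after a critic warm-up (an assumption of this sketch).
        if i >= warmup:
            u_pi = theta * x
            dq_du = w[1] * x + 2.0 * w[2] * u_pi
            theta += lr_a * dq_du * x
        # Soft (Polyak) update of the target parameters.
        theta_t += tau * (theta - theta_t)
        w_t = [wt + tau * (wi - wt) for wt, wi in zip(w_t, w)]
    return theta, w

if __name__ == "__main__":
    rng = random.Random(0)
    replay = collect(2000, rng)
    theta, w = train(replay, rng)
    print("learned actor gain:", theta)
    print("critic weights:", w)
```

The point of the sketch is the data flow: the actor never acts on the plant during collection, so the quality of the stored experience depends only on the supervisor and the added noise. In a deep variant, `theta` and `w` would be replaced by network parameters and the single-sample updates by minibatch gradient steps.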
