You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The deep deterministic policy gradient actor-critic algorithm with supervision controller is proposed. The latent(indirect) supervision can guarantee the stable convergence of the control actor network. The conventional controller doesn’t need excellent control performance. The main role of the conventional controller is to supply helpful control experience for the actor network. Then the actor network is trained by those experience. Deep deterministic policy gradient actor-critic algorithm with supervision controller.docx
The text was updated successfully, but these errors were encountered:
The deep deterministic policy gradient actor-critic algorithm with supervision controller is proposed. The latent(indirect) supervision can guarantee the stable convergence of the control actor network. The conventional controller doesn’t need excellent control performance. The main role of the conventional controller is to supply helpful control experience for the actor network. Then the actor network is trained by those experience.
Deep deterministic policy gradient actor-critic algorithm with supervision controller.docx
The text was updated successfully, but these errors were encountered: