MEDLINE/PubMed Journal Browser Search

Pubmed for Handhelds

PUBMED FOR HANDHELDS

Journal Abstract Search

366 related items for PubMed ID: 32324571

1. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L, Li D, Song T, Xu X.
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
[Abstract] [Full Text] [Related]

2. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
Li L, Li D, Song T, Xu X.
IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
[Abstract] [Full Text] [Related]

3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
Shang Z, Li R, Zheng C, Li H, Cui Y.
IEEE Trans Neural Netw Learn Syst; 2023 Nov 09; PP():. PubMed ID: 37943648
[Abstract] [Full Text] [Related]

4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J, Kurt MN, Wang X.
IEEE Trans Neural Netw Learn Syst; 2024 May 09; 35(5):6654-6666. PubMed ID: 36256721
[Abstract] [Full Text] [Related]

5. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
Li L, Zhu Y.
IEEE Trans Neural Netw Learn Syst; 2024 Apr 15; PP():. PubMed ID: 38619961
[Abstract] [Full Text] [Related]

6. Reinforcement learning solution for HJB equation arising in constrained optimal control problem.
Luo B, Wu HN, Huang T, Liu D.
Neural Netw; 2015 Nov 15; 71():150-8. PubMed ID: 26356598
[Abstract] [Full Text] [Related]

7. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J, Guan Y, Li SE, Ren Y, Sun Q, Cheng B.
IEEE Trans Neural Netw Learn Syst; 2022 Nov 15; 33(11):6584-6598. PubMed ID: 34101599
[Abstract] [Full Text] [Related]

8. Reinforcement learning in continuous time and space.
Doya K.
Neural Comput; 2000 Jan 15; 12(1):219-45. PubMed ID: 10636940
[Abstract] [Full Text] [Related]

9.
; . PubMed ID:
[No Abstract] [Full Text] [Related]

10.
; . PubMed ID:
[No Abstract] [Full Text] [Related]

11. Kernel-based least squares policy iteration for reinforcement learning.
Xu X, Hu D, Lu X.
IEEE Trans Neural Netw; 2007 Jul 15; 18(4):973-92. PubMed ID: 17668655
[Abstract] [Full Text] [Related]

12. Efficient model learning methods for actor-critic control.
Grondman I, Vaandrager M, Buşoniu L, Babuska R, Schuitema E.
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun 15; 42(3):591-602. PubMed ID: 22156998
[Abstract] [Full Text] [Related]

13. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.
Zhao C, Deng N.
Math Biosci Eng; 2024 Jan 15; 21(1):1445-1471. PubMed ID: 38303472
[Abstract] [Full Text] [Related]

14. Mild Policy Evaluation for Offline Actor-Critic.
Huang L, Dong B, Lu J, Zhang W.
IEEE Trans Neural Netw Learn Syst; 2023 Sep 07; PP():. PubMed ID: 37676802
[Abstract] [Full Text] [Related]

15. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
Banerjee C, Chen Z, Noman N.
IEEE Trans Neural Netw Learn Syst; 2024 Mar 07; 35(3):3121-3129. PubMed ID: 35588412
[Abstract] [Full Text] [Related]

16. Implicit incremental natural actor critic algorithm.
Iwaki R, Asada M.
Neural Netw; 2019 Jan 07; 109():103-112. PubMed ID: 30408692
[Abstract] [Full Text] [Related]

17. Deep Deterministic Policy Gradient With Compatible Critic Network.
Wang D, Hu M.
IEEE Trans Neural Netw Learn Syst; 2023 Aug 07; 34(8):4332-4344. PubMed ID: 34653007
[Abstract] [Full Text] [Related]

18. Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems.
Wen G, Xu L, Li B.
IEEE Trans Neural Netw Learn Syst; 2023 Mar 07; 34(3):1291-1303. PubMed ID: 34437076
[Abstract] [Full Text] [Related]

19. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
Modares H, Naghibi Sistani MB, Lewis FL.
ISA Trans; 2013 Sep 07; 52(5):611-21. PubMed ID: 23706414
[Abstract] [Full Text] [Related]

20. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.
Labao AB, Martija MAM, Naval PC.
IEEE Trans Neural Netw Learn Syst; 2021 Mar 07; 32(3):1162-1176. PubMed ID: 32287019
[Abstract] [Full Text] [Related]

Page: [Next] [New Search]