These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
Pubmed for Handhelds
PUBMED FOR HANDHELDS
Journal Abstract Search
366 related items for PubMed ID: 32324571
1. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation. Li L, Li D, Song T, Xu X. IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571 [Abstract] [Full Text] [Related]
2. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction. Li L, Li D, Song T, Xu X. IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664 [Abstract] [Full Text] [Related]
3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions. Shang Z, Li R, Zheng C, Li H, Cui Y. IEEE Trans Neural Netw Learn Syst; 2023 Nov 09; PP():. PubMed ID: 37943648 [Abstract] [Full Text] [Related]
4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning. Zheng J, Kurt MN, Wang X. IEEE Trans Neural Netw Learn Syst; 2024 May 09; 35(5):6654-6666. PubMed ID: 36256721 [Abstract] [Full Text] [Related]
5. Boosting On-Policy Actor-Critic With Shallow Updates in Critic. Li L, Zhu Y. IEEE Trans Neural Netw Learn Syst; 2024 Apr 15; PP():. PubMed ID: 38619961 [Abstract] [Full Text] [Related]
6. Reinforcement learning solution for HJB equation arising in constrained optimal control problem. Luo B, Wu HN, Huang T, Liu D. Neural Netw; 2015 Nov 15; 71():150-8. PubMed ID: 26356598 [Abstract] [Full Text] [Related]
7. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors. Duan J, Guan Y, Li SE, Ren Y, Sun Q, Cheng B. IEEE Trans Neural Netw Learn Syst; 2022 Nov 15; 33(11):6584-6598. PubMed ID: 34101599 [Abstract] [Full Text] [Related]
8. Reinforcement learning in continuous time and space. Doya K. Neural Comput; 2000 Jan 15; 12(1):219-45. PubMed ID: 10636940 [Abstract] [Full Text] [Related]
11. Kernel-based least squares policy iteration for reinforcement learning. Xu X, Hu D, Lu X. IEEE Trans Neural Netw; 2007 Jul 15; 18(4):973-92. PubMed ID: 17668655 [Abstract] [Full Text] [Related]
12. Efficient model learning methods for actor-critic control. Grondman I, Vaandrager M, Buşoniu L, Babuska R, Schuitema E. IEEE Trans Syst Man Cybern B Cybern; 2012 Jun 15; 42(3):591-602. PubMed ID: 22156998 [Abstract] [Full Text] [Related]
13. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems. Zhao C, Deng N. Math Biosci Eng; 2024 Jan 15; 21(1):1445-1471. PubMed ID: 38303472 [Abstract] [Full Text] [Related]
14. Mild Policy Evaluation for Offline Actor-Critic. Huang L, Dong B, Lu J, Zhang W. IEEE Trans Neural Netw Learn Syst; 2023 Sep 07; PP():. PubMed ID: 37676802 [Abstract] [Full Text] [Related]
15. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences. Banerjee C, Chen Z, Noman N. IEEE Trans Neural Netw Learn Syst; 2024 Mar 07; 35(3):3121-3129. PubMed ID: 35588412 [Abstract] [Full Text] [Related]
16. Implicit incremental natural actor critic algorithm. Iwaki R, Asada M. Neural Netw; 2019 Jan 07; 109():103-112. PubMed ID: 30408692 [Abstract] [Full Text] [Related]
17. Deep Deterministic Policy Gradient With Compatible Critic Network. Wang D, Hu M. IEEE Trans Neural Netw Learn Syst; 2023 Aug 07; 34(8):4332-4344. PubMed ID: 34653007 [Abstract] [Full Text] [Related]
18. Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems. Wen G, Xu L, Li B. IEEE Trans Neural Netw Learn Syst; 2023 Mar 07; 34(3):1291-1303. PubMed ID: 34437076 [Abstract] [Full Text] [Related]
19. A policy iteration approach to online optimal control of continuous-time constrained-input systems. Modares H, Naghibi Sistani MB, Lewis FL. ISA Trans; 2013 Sep 07; 52(5):611-21. PubMed ID: 23706414 [Abstract] [Full Text] [Related]
20. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents. Labao AB, Martija MAM, Naval PC. IEEE Trans Neural Netw Learn Syst; 2021 Mar 07; 32(3):1162-1176. PubMed ID: 32287019 [Abstract] [Full Text] [Related] Page: [Next] [New Search]