343 related articles for article (PubMed ID: 32324571)
1. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
[TBL] [Abstract][Full Text] [Related]
2. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
[TBL] [Abstract][Full Text] [Related]
3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
Shang Z; Li R; Zheng C; Li H; Cui Y
IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
[TBL] [Abstract][Full Text] [Related]
4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J; Kurt MN; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
[TBL] [Abstract][Full Text] [Related]
5. Reinforcement learning solution for HJB equation arising in constrained optimal control problem.
Luo B; Wu HN; Huang T; Liu D
Neural Netw; 2015 Nov; 71():150-8. PubMed ID: 26356598
[TBL] [Abstract][Full Text] [Related]
6. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
[TBL] [Abstract][Full Text] [Related]
7. Reinforcement learning in continuous time and space.
Doya K
Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
[TBL] [Abstract][Full Text] [Related]
8. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
[TBL] [Abstract][Full Text] [Related]
9. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
Zhong S; Liu Q; Fu Q
Comput Intell Neurosci; 2016; 2016():4824072. PubMed ID: 27795704
[TBL] [Abstract][Full Text] [Related]
10. Kernel-based least squares policy iteration for reinforcement learning.
Xu X; Hu D; Lu X
IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
[TBL] [Abstract][Full Text] [Related]
11. Efficient model learning methods for actor-critic control.
Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
[TBL] [Abstract][Full Text] [Related]
12. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.
Zhao C; Deng N
Math Biosci Eng; 2024 Jan; 21(1):1445-1471. PubMed ID: 38303472
[TBL] [Abstract][Full Text] [Related]
13. Mild Policy Evaluation for Offline Actor-Critic.
Huang L; Dong B; Lu J; Zhang W
IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802
[TBL] [Abstract][Full Text] [Related]
14. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
Banerjee C; Chen Z; Noman N
IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
[TBL] [Abstract][Full Text] [Related]
15. Implicit incremental natural actor critic algorithm.
Iwaki R; Asada M
Neural Netw; 2019 Jan; 109():103-112. PubMed ID: 30408692
[TBL] [Abstract][Full Text] [Related]
16. Deep Deterministic Policy Gradient With Compatible Critic Network.
Wang D; Hu M
IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4332-4344. PubMed ID: 34653007
[TBL] [Abstract][Full Text] [Related]
17. Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems.
Wen G; Xu L; Li B
IEEE Trans Neural Netw Learn Syst; 2023 Mar; 34(3):1291-1303. PubMed ID: 34437076
[TBL] [Abstract][Full Text] [Related]
18. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
Modares H; Naghibi Sistani MB; Lewis FL
ISA Trans; 2013 Sep; 52(5):611-21. PubMed ID: 23706414
[TBL] [Abstract][Full Text] [Related]
19. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.
Labao AB; Martija MAM; Naval PC
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1162-1176. PubMed ID: 32287019
[TBL] [Abstract][Full Text] [Related]
20. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
Shi D; Guo X; Liu Y; Fan W
Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]