220 related articles for article (PubMed ID: 37030278)
1. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
[TBL] [Abstract][Full Text] [Related]
2. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J; Kurt MN; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
[TBL] [Abstract][Full Text] [Related]
3. Robust Actor-Critic With Relative Entropy Regulating Actor.
Cheng Y; Huang L; Chen CLP; Wang X
IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268
[TBL] [Abstract][Full Text] [Related]
4. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
Shi D; Guo X; Liu Y; Fan W
Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
[TBL] [Abstract][Full Text] [Related]
5. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.
Chen Y; Zhang F; Liu Z
Neural Netw; 2024 Jan; 169():764-777. PubMed ID: 37981458
[TBL] [Abstract][Full Text] [Related]
6. Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.
Wu Y; Liao S; Liu X; Li Z; Lu R
IEEE Trans Neural Netw Learn Syst; 2023 Jul; 34(7):3680-3690. PubMed ID: 34669579
[TBL] [Abstract][Full Text] [Related]
7. The Actor-Dueling-Critic Method for Reinforcement Learning.
Wu M; Gao Y; Jung A; Zhang Q; Du S
Sensors (Basel); 2019 Mar; 19(7):. PubMed ID: 30935035
[TBL] [Abstract][Full Text] [Related]
8. Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning.
Wang Y; Tang C; Wang S; Cheng L; Wang R; Tan M; Hou Z
IEEE Trans Neural Netw Learn Syst; 2022 Aug; 33(8):3741-3752. PubMed ID: 33560993
[TBL] [Abstract][Full Text] [Related]
9. Policy-Gradient and Actor-Critic Based State Representation Learning for Safe Driving of Autonomous Vehicles.
Gupta A; Khwaja AS; Anpalagan A; Guan L; Venkatesh B
Sensors (Basel); 2020 Oct; 20(21):. PubMed ID: 33105863
[TBL] [Abstract][Full Text] [Related]
10. Deep Deterministic Policy Gradient With Compatible Critic Network.
Wang D; Hu M
IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4332-4344. PubMed ID: 34653007
[TBL] [Abstract][Full Text] [Related]
11. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
Banerjee C; Chen Z; Noman N
IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
[TBL] [Abstract][Full Text] [Related]
12. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
[TBL] [Abstract][Full Text] [Related]
13. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
[TBL] [Abstract][Full Text] [Related]
14. Reinforcement learning in continuous time and space.
Doya K
Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
[TBL] [Abstract][Full Text] [Related]
15. Reinforcement learning for automatic quadrilateral mesh generation: A soft actor-critic approach.
Pan J; Huang J; Cheng G; Zeng Y
Neural Netw; 2023 Jan; 157():288-304. PubMed ID: 36375347
[TBL] [Abstract][Full Text] [Related]
16. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
[TBL] [Abstract][Full Text] [Related]
17. Attention-Shared Multi-Agent Actor-Critic-Based Deep Reinforcement Learning Approach for Mobile Charging Dynamic Scheduling in Wireless Rechargeable Sensor Networks.
Jiang C; Wang Z; Chen S; Li J; Wang H; Xiang J; Xiao W
Entropy (Basel); 2022 Jul; 24(7):. PubMed ID: 35885188
[TBL] [Abstract][Full Text] [Related]
18. Multi-agent reinforcement learning with approximate model learning for competitive games.
Park YJ; Cho YS; Kim SB
PLoS One; 2019; 14(9):e0222215. PubMed ID: 31509568
[TBL] [Abstract][Full Text] [Related]
19. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives.
Li C; Lowe R; Ziemke T
Front Neurorobot; 2014; 8():23. PubMed ID: 25324773
[TBL] [Abstract][Full Text] [Related]
20. Equivariant Graph-Representation-Based Actor-Critic Reinforcement Learning for Nanoparticle Design.
Elsborg J; Bhowmik A
J Chem Inf Model; 2023 Jun; 63(12):3731-3741. PubMed ID: 37276140
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]