These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
2. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors. Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599 [TBL] [Abstract][Full Text] [Related]
3. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning. Zheng J; Kurt MN; Wang X IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721 [TBL] [Abstract][Full Text] [Related]
4. Meta attention for Off-Policy Actor-Critic. Huang J; Huang W; Lan L; Wu D Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278 [TBL] [Abstract][Full Text] [Related]
5. Robust Actor-Critic With Relative Entropy Regulating Actor. Cheng Y; Huang L; Chen CLP; Wang X IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268 [TBL] [Abstract][Full Text] [Related]
6. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions. Shang Z; Li R; Zheng C; Li H; Cui Y IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648 [TBL] [Abstract][Full Text] [Related]
7. Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination. Hu Y; Fu J; Wen G IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37948149 [TBL] [Abstract][Full Text] [Related]
8. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation. Li L; Li D; Song T; Xu X IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571 [TBL] [Abstract][Full Text] [Related]
9. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor-Critic with Hindsight Experience Replay. Prianto E; Kim M; Park JH; Bae JH; Kim JS Sensors (Basel); 2020 Oct; 20(20):. PubMed ID: 33086774 [TBL] [Abstract][Full Text] [Related]
10. A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems. Sun Y; Yang B PeerJ Comput Sci; 2024; 10():e2161. PubMed ID: 38983226 [TBL] [Abstract][Full Text] [Related]
11. A Path-Planning Method Based on Improved Soft Actor-Critic Algorithm for Mobile Robots. Zhao T; Wang M; Zhao Q; Zheng X; Gao H Biomimetics (Basel); 2023 Oct; 8(6):. PubMed ID: 37887612 [TBL] [Abstract][Full Text] [Related]
12. ACERAC: Efficient Reinforcement Learning in Fine Time Discretization. Lyskawa J; Wawrzynski P IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):2719-2731. PubMed ID: 35857727 [TBL] [Abstract][Full Text] [Related]
13. Boosting On-Policy Actor-Critic With Shallow Updates in Critic. Li L; Zhu Y IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961 [TBL] [Abstract][Full Text] [Related]
14. Prioritized experience replay based on dynamics priority. Li H; Qian X; Song W Sci Rep; 2024 Mar; 14(1):6014. PubMed ID: 38472457 [TBL] [Abstract][Full Text] [Related]
15. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization. Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289 [TBL] [Abstract][Full Text] [Related]
16. Adaptive Discount Factor for Deep Reinforcement Learning in Continuing Tasks with Uncertainty. Kim M; Kim JS; Choi MS; Park JH Sensors (Basel); 2022 Sep; 22(19):. PubMed ID: 36236366 [TBL] [Abstract][Full Text] [Related]
17. Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning. Sun H; Fu T; Ling Y; He C Sensors (Basel); 2021 Sep; 21(17):. PubMed ID: 34502796 [TBL] [Abstract][Full Text] [Related]
18. Mild Policy Evaluation for Offline Actor-Critic. Huang L; Dong B; Lu J; Zhang W IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802 [TBL] [Abstract][Full Text] [Related]
19. De-Pessimism Offline Reinforcement Learning via Value Compensation. Huang Z; Zhao J; Sun S IEEE Trans Neural Netw Learn Syst; 2024 Aug; PP():. PubMed ID: 39178073 [TBL] [Abstract][Full Text] [Related]