3. Dual Parallel Policy Iteration With Coupled Policy Improvement. Cheng Y; Huang L; Chen CLP; Wang X. IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):4286-4298. PubMed ID: 36094996
4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning. Zheng J; Kurt MN; Wang X. IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
5. Reinforcement learning based temperature control of a fermentation bioreactor for ethanol production. Rajasekhar N; Radhakrishnan TK; Mohamed SN. Biotechnol Bioeng; 2024 Oct; 121(10):3114-3127. PubMed ID: 38938008
6. Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms. Zhang H; Xu J; Zhang J; Liu Q. Comput Intell Neurosci; 2022; 2022():1117781. PubMed ID: 36438689
7. Approximate Policy-Based Accelerated Deep Reinforcement Learning. Wang X; Gu Y; Cheng Y; Liu A; Chen CLP. IEEE Trans Neural Netw Learn Syst; 2020 Jun; 31(6):1820-1830. PubMed ID: 31398131
8. Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment. Chen W; Wong KKL; Long S; Sun Z. Entropy (Basel); 2022 Mar; 24(4):. PubMed ID: 35455103
9. Adaptive control for circulating cooling water system using deep reinforcement learning. Xu J; Li H; Zhang Q. PLoS One; 2024; 19(7):e0307767. PubMed ID: 39047030
10. Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces. Shi J; Li K; Piao C; Gao J; Chen L. Sensors (Basel); 2023 Aug; 23(16):. PubMed ID: 37631658
11. Distributional generative adversarial imitation learning with reproducing kernel generalization. Zhou Y; Lu M; Liu X; Che Z; Xu Z; Tang J; Zhang Y; Peng Y; Peng Y. Neural Netw; 2023 Aug; 165():43-59. PubMed ID: 37276810
12. Learning Intention-Aware Policies in Deep Reinforcement Learning. Zhao T; Wu S; Li G; Chen Y; Niu G; Sugiyama M. Neural Comput; 2023 Sep; 35(10):1657-1677. PubMed ID: 37523456
13. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors. Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B. IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
14. Human-in-the-Loop Reinforcement Learning in Continuous-Action Space. Luo B; Wu Z; Zhou F; Wang BC. IEEE Trans Neural Netw Learn Syst; 2024 Nov; 35(11):15735-15744. PubMed ID: 37418406
15. Path planning of mobile robot based on improved TD3 algorithm in dynamic environment. Li P; Chen D; Wang Y; Zhang L; Zhao S. Heliyon; 2024 Jun; 10(11):e32167. PubMed ID: 38912483
16. Systematic Performance Evaluation of Reinforcement Learning Algorithms Applied to Wastewater Treatment Control Optimization. Croll HC; Ikuma K; Ong SK; Sarkar S. Environ Sci Technol; 2023 Nov; 57(46):18382-18390. PubMed ID: 37405782
17. Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning. Jang S; Kim HI. Sensors (Basel); 2022 Aug; 22(15):. PubMed ID: 35957399
18. An immediate-return reinforcement learning for the atypical Markov decision processes. Pan Z; Wen G; Tan Z; Yin S; Hu X. Front Neurorobot; 2022; 16():1012427. PubMed ID: 36582302
19. Asynchronous Episodic Deep Deterministic Policy Gradient: Toward Continuous Control in Computationally Complex Environments. Zhang Z; Chen J; Chen Z; Li W. IEEE Trans Cybern; 2021 Feb; 51(2):604-613. PubMed ID: 31902788
20. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning. Yang Z; Qu H; Fu M; Hu W; Zhao Y. IEEE Trans Cybern; 2023 Mar; 53(3):1499-1510. PubMed ID: 34478393