These tools will no longer be maintained as of December 31, 2024. Contact NLM Customer Service if you have questions.
112 related articles for article (PubMed ID: 36166566)
21. Dynamical Hyperparameter Optimization via Deep Reinforcement Learning in Tracking. Dong X; Shen J; Wang W; Shao L; Ling H; Porikli F. IEEE Trans Pattern Anal Mach Intell; 2021 May; 43(5):1515-1529. PubMed ID: 31796388
22. Training a perceptron in a discrete weight space. Rosen-Zvi M; Kanter I. Phys Rev E Stat Nonlin Soft Matter Phys; 2001 Oct; 64(4 Pt 2):046109. PubMed ID: 11690092
23. Sampling Efficient Deep Reinforcement Learning Through Preference-Guided Stochastic Exploration. Huang W; Zhang C; Wu J; He X; Zhang J; Lv C. IEEE Trans Neural Netw Learn Syst; 2023 Oct; PP():. PubMed ID: 37788189
24. A novel Q-learning algorithm based on improved whale optimization algorithm for path planning. Li Y; Wang H; Fan J; Geng Y. PLoS One; 2022; 17(12):e0279438. PubMed ID: 36574399
25. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors. Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B. IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
26. The Actor-Dueling-Critic Method for Reinforcement Learning. Wu M; Gao Y; Jung A; Zhang Q; Du S. Sensors (Basel); 2019 Mar; 19(7):. PubMed ID: 30935035
27. Improved Q-Learning Algorithm Based on Approximate State Matching in Agricultural Plant Protection Environment. Sun F; Wang X; Zhang R. Entropy (Basel); 2021 Jun; 23(6):. PubMed ID: 34207944
28. Dynamic sparse coding-based value estimation network for deep reinforcement learning. Zhao H; Li Z; Su W; Xie S. Neural Netw; 2023 Nov; 168():180-193. PubMed ID: 37757726
29. [Standard technical specifications for methacholine chloride (Methacholine) bronchial challenge test (2023)]. Zhonghua Jie He He Hu Xi Za Zhi; 2024 Feb; 47(2):101-119. PubMed ID: 38309959
30. Revisiting random walk based sampling in networks: evasion of burn-in period and frequent regenerations. Avrachenkov K; Borkar VS; Kadavankandy A; Sreedharan JK. Comput Soc Netw; 2018; 5(1):4. PubMed ID: 29578546
31. Establishment and Implementation of Potential Fluid Therapy Balance Strategies for ICU Sepsis Patients Based on Reinforcement Learning. Su L; Li Y; Liu S; Zhang S; Zhou X; Weng L; Su M; Du B; Zhu W; Long Y. Front Med (Lausanne); 2022; 9():766447. PubMed ID: 35492326
32. Estimators of the local false discovery rate designed for small numbers of tests. Padilla M; Bickel DR. Stat Appl Genet Mol Biol; 2012 Oct; 11(5):4. PubMed ID: 23079518
33. A delay-robust method for enhanced real-time reinforcement learning. Xia B; Sun H; Yuan B; Li Z; Liang B; Wang X. Neural Netw; 2025 Jan; 181():106769. PubMed ID: 39395235
34. [Mathematical models of decision making and learning]. Ito M; Doya K. Brain Nerve; 2008 Jul; 60(7):791-8. PubMed ID: 18646619
35. Double Robust Efficient Estimators of Longitudinal Treatment Effects: Comparative Performance in Simulations and a Case Study. Tran L; Yiannoutsos C; Wools-Kaloustian K; Siika A; van der Laan M; Petersen M. Int J Biostat; 2019 Feb; 15(2):. PubMed ID: 30811344
36. Multi Pseudo Q-Learning-Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles. Shi W; Song S; Wu C; Chen CLP. IEEE Trans Neural Netw Learn Syst; 2019 Dec; 30(12):3534-3546. PubMed ID: 30602426
37. Improvement of Reinforcement Learning With Supermodularity. Meng Y; Shi F; Tang L; Sun D. IEEE Trans Neural Netw Learn Syst; 2023 Sep; 34(9):5298-5309. PubMed ID: 37027690
38. Cooperative modular reinforcement learning for large discrete action space problem. Ming F; Gao F; Liu K; Zhao C. Neural Netw; 2023 Apr; 161():281-296. PubMed ID: 36774866
39. Optimization of Molecules via Deep Reinforcement Learning. Zhou Z; Kearnes S; Li L; Zare RN; Riley P. Sci Rep; 2019 Jul; 9(1):10752. PubMed ID: 31341196
40. Overestimation and Underestimation Biases in Photon Mapping with Non-Constant Kernels. Garcia Hernandez RJ; Ureña C; Poch J; Sbert M. IEEE Trans Vis Comput Graph; 2014 Oct; 20(10):1441-50. PubMed ID: 26357390