These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
126 related articles for article (PubMed ID: 38804929)
1. Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations. Gillman E; Rose DC; Garrahan JP Phys Rev Lett; 2024 May; 132(19):197301. PubMed ID: 38804929 [TBL] [Abstract][Full Text] [Related]
2. A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems. Sun Y; Yang B PeerJ Comput Sci; 2024; 10():e2161. PubMed ID: 38983226 [TBL] [Abstract][Full Text] [Related]
3. Partial Policy-Based Reinforcement Learning for Anatomical Landmark Localization in 3D Medical Images. Abdullah Al W; Yun ID IEEE Trans Med Imaging; 2020 Apr; 39(4):1245-1255. PubMed ID: 31603816 [TBL] [Abstract][Full Text] [Related]
4. Meta attention for Off-Policy Actor-Critic. Huang J; Huang W; Lan L; Wu D Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278 [TBL] [Abstract][Full Text] [Related]
5. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning. Zheng J; Kurt MN; Wang X IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721 [TBL] [Abstract][Full Text] [Related]
6. Meta-Reinforcement Learning With Dynamic Adaptiveness Distillation. Hu H; Huang G; Li X; Song S IEEE Trans Neural Netw Learn Syst; 2023 Mar; 34(3):1454-1464. PubMed ID: 34464267 [TBL] [Abstract][Full Text] [Related]
7. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems. Zhao C; Deng N Math Biosci Eng; 2024 Jan; 21(1):1445-1471. PubMed ID: 38303472 [TBL] [Abstract][Full Text] [Related]
8. Reinforcement learning in continuous time and space. Doya K Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940 [TBL] [Abstract][Full Text] [Related]
9. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors. Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599 [TBL] [Abstract][Full Text] [Related]
10. Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network. Wu Y; Liao S; Liu X; Li Z; Lu R IEEE Trans Neural Netw Learn Syst; 2023 Jul; 34(7):3680-3690. PubMed ID: 34669579 [TBL] [Abstract][Full Text] [Related]
11. Ensemble algorithms in reinforcement learning. Wiering MA; van Hasselt H IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380 [TBL] [Abstract][Full Text] [Related]
12. Boosting On-Policy Actor-Critic With Shallow Updates in Critic. Li L; Zhu Y IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961 [TBL] [Abstract][Full Text] [Related]
13. A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots. Mou H; Xue J; Liu J; Feng Z; Li Q; Zhang J Biomimetics (Basel); 2023 Dec; 8(8):. PubMed ID: 38132555 [TBL] [Abstract][Full Text] [Related]
14. Sample efficient reinforcement learning with active learning for molecular design. Dodds M; Guo J; Löhr T; Tibo A; Engkvist O; Janet JP Chem Sci; 2024 Mar; 15(11):4146-4160. PubMed ID: 38487235 [TBL] [Abstract][Full Text] [Related]
15. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Joel D; Niv Y; Ruppin E Neural Netw; 2002; 15(4-6):535-47. PubMed ID: 12371510 [TBL] [Abstract][Full Text] [Related]
16. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation. Li L; Li D; Song T; Xu X IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571 [TBL] [Abstract][Full Text] [Related]
17. The Wisdom of the Crowd: Reliable Deep Reinforcement Learning Through Ensembles of Q-Functions. Elliott DL; Anderson C IEEE Trans Neural Netw Learn Syst; 2023 Jan; 34(1):43-51. PubMed ID: 34185651 [TBL] [Abstract][Full Text] [Related]
18. ACERAC: Efficient Reinforcement Learning in Fine Time Discretization. Lyskawa J; Wawrzynski P IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):2719-2731. PubMed ID: 35857727 [TBL] [Abstract][Full Text] [Related]
19. Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units. Yu C; Ren G; Dong Y BMC Med Inform Decis Mak; 2020 Jul; 20(Suppl 3):124. PubMed ID: 32646412 [TBL] [Abstract][Full Text] [Related]
20. Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework. Kubo Y; Chalmers E; Luczak A Front Comput Neurosci; 2022; 16():980613. PubMed ID: 36082305 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]