These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
119 related articles for article (PubMed ID: 39240734)
1. Diversifying Policies With Non-Markov Dispersion to Expand the Solution Space. Qu B; Cao X; Chang Y; Tsang IW; Ong YS IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):11392-11408. PubMed ID: 39240734 [TBL] [Abstract][Full Text] [Related]
2. Hierarchical approximate policy iteration with binary-tree state space decomposition. Xu X; Liu C; Yang SX; Hu D IEEE Trans Neural Netw; 2011 Dec; 22(12):1863-77. PubMed ID: 21990333 [TBL] [Abstract][Full Text] [Related]
3. Sequence Decision Transformer for Adaptive Traffic Signal Control. Zhao R; Hu H; Li Y; Fan Y; Gao F; Gao Z Sensors (Basel); 2024 Sep; 24(19):. PubMed ID: 39409242 [TBL] [Abstract][Full Text] [Related]
4. MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning. Da Silva FL; Glatt R; Costa AHR IEEE Trans Cybern; 2019 Feb; 49(2):567-579. PubMed ID: 29990289 [TBL] [Abstract][Full Text] [Related]
5. Deep reinforcement learning navigation via decision transformer in autonomous driving. Ge L; Zhou X; Li Y; Wang Y Front Neurorobot; 2024; 18():1338189. PubMed ID: 38566892 [TBL] [Abstract][Full Text] [Related]
6. A delay-robust method for enhanced real-time reinforcement learning. Xia B; Sun H; Yuan B; Li Z; Liang B; Wang X Neural Netw; 2025 Jan; 181():106769. PubMed ID: 39395235 [TBL] [Abstract][Full Text] [Related]
7. Sample Efficient Deep Reinforcement Learning With Online State Abstraction and Causal Transformer Model Prediction. Lan Y; Xu X; Fang Q; Hao J IEEE Trans Neural Netw Learn Syst; 2024 Nov; 35(11):16574-16588. PubMed ID: 37581972 [TBL] [Abstract][Full Text] [Related]
8. A Hybrid Online Off-Policy Reinforcement Learning Agent Framework Supported by Transformers. Villarrubia-Martin EA; Rodriguez-Benitez L; Jimenez-Linares L; Muñoz-Valero D; Liu J Int J Neural Syst; 2023 Dec; 33(12):2350065. PubMed ID: 37857407 [TBL] [Abstract][Full Text] [Related]
9. Parameterized MDPs and Reinforcement Learning Problems-A Maximum Entropy Principle-Based Framework. Srivastava A; Salapaka SM IEEE Trans Cybern; 2022 Sep; 52(9):9339-9351. PubMed ID: 34406959 [TBL] [Abstract][Full Text] [Related]
11. On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm. Hwang U; Hong S IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619960 [TBL] [Abstract][Full Text] [Related]
12. Stochastic abstract policies: generalizing knowledge to improve reinforcement learning. Koga ML; Freire V; Costa AH IEEE Trans Cybern; 2015 Jan; 45(1):77-88. PubMed ID: 24835233 [TBL] [Abstract][Full Text] [Related]
13. Safe Reinforcement Learning With Dual Robustness. Li Z; Hu C; Wang Y; Yang Y; Li SE IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):10876-10890. PubMed ID: 39146157 [TBL] [Abstract][Full Text] [Related]
14. An immediate-return reinforcement learning for the atypical Markov decision processes. Pan Z; Wen G; Tan Z; Yin S; Hu X Front Neurorobot; 2022; 16():1012427. PubMed ID: 36582302 [TBL] [Abstract][Full Text] [Related]
15. Discovering and Exploiting Sparse Rewards in a Learned Behavior Space. Paolo G; Coninx M; Laflaquière A; Doncieux S Evol Comput; 2024 Sep; 32(3):275-305. PubMed ID: 37793063 [TBL] [Abstract][Full Text] [Related]
16. MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning. Li Q; Peng Z; Feng L; Zhang Q; Xue Z; Zhou B IEEE Trans Pattern Anal Mach Intell; 2023 Mar; 45(3):3461-3475. PubMed ID: 35830412 [TBL] [Abstract][Full Text] [Related]
17. Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning. Zhao R; Chen Z; Fan Y; Li Y; Gao F Sensors (Basel); 2024 Jun; 24(13):. PubMed ID: 39000919 [TBL] [Abstract][Full Text] [Related]
18. Markov decision processes: a tool for sequential decision making under uncertainty. Alagoz O; Hsu H; Schaefer AJ; Roberts MS Med Decis Making; 2010; 30(4):474-83. PubMed ID: 20044582 [TBL] [Abstract][Full Text] [Related]
19. Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model. Wang B; Yan Y; Fan J Adv Neural Inf Process Syst; 2021 Dec; 34():16671-16685. PubMed ID: 36168331 [TBL] [Abstract][Full Text] [Related]
20. Reinforcement learning for intensive care medicine: actionable clinical insights from novel approaches to reward shaping and off-policy model evaluation. Roggeveen LF; Hassouni AE; de Grooth HJ; Girbes ARJ; Hoogendoorn M; Elbers PWG; Intensive Care Med Exp; 2024 Mar; 12(1):32. PubMed ID: 38526681 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]