BIOMARKERS

Molecular Biopsy of Human Tumors: a resource for Precision Medicine

119 related articles for article (PubMed ID: 39240734)

1. Diversifying Policies With Non-Markov Dispersion to Expand the Solution Space.
Qu B; Cao X; Chang Y; Tsang IW; Ong YS
IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):11392-11408. PubMed ID: 39240734

2. Hierarchical approximate policy iteration with binary-tree state space decomposition.
Xu X; Liu C; Yang SX; Hu D
IEEE Trans Neural Netw; 2011 Dec; 22(12):1863-77. PubMed ID: 21990333

3. Sequence Decision Transformer for Adaptive Traffic Signal Control.
Zhao R; Hu H; Li Y; Fan Y; Gao F; Gao Z
Sensors (Basel); 2024 Sep; 24(19):. PubMed ID: 39409242

4. MOO-MDP: An Object-Oriented Representation for Cooperative Multiagent Reinforcement Learning.
Da Silva FL; Glatt R; Costa AHR
IEEE Trans Cybern; 2019 Feb; 49(2):567-579. PubMed ID: 29990289

5. Deep reinforcement learning navigation via decision transformer in autonomous driving.
Ge L; Zhou X; Li Y; Wang Y
Front Neurorobot; 2024; 18():1338189. PubMed ID: 38566892

6. A delay-robust method for enhanced real-time reinforcement learning.
Xia B; Sun H; Yuan B; Li Z; Liang B; Wang X
Neural Netw; 2025 Jan; 181():106769. PubMed ID: 39395235

7. Sample Efficient Deep Reinforcement Learning With Online State Abstraction and Causal Transformer Model Prediction.
Lan Y; Xu X; Fang Q; Hao J
IEEE Trans Neural Netw Learn Syst; 2024 Nov; 35(11):16574-16588. PubMed ID: 37581972

8. A Hybrid Online Off-Policy Reinforcement Learning Agent Framework Supported by Transformers.
Villarrubia-Martin EA; Rodriguez-Benitez L; Jimenez-Linares L; Muñoz-Valero D; Liu J
Int J Neural Syst; 2023 Dec; 33(12):2350065. PubMed ID: 37857407

9. Parameterized MDPs and Reinforcement Learning Problems-A Maximum Entropy Principle-Based Framework.
Srivastava A; Salapaka SM
IEEE Trans Cybern; 2022 Sep; 52(9):9339-9351. PubMed ID: 34406959

10. MDPs with Non-Deterministic Policies.
Fard MM; Pineau J
Adv Neural Inf Process Syst; 2009; 21():1065-1073. PubMed ID: 21625292

11. On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm.
Hwang U; Hong S
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619960

12. Stochastic abstract policies: generalizing knowledge to improve reinforcement learning.
Koga ML; Freire V; Costa AH
IEEE Trans Cybern; 2015 Jan; 45(1):77-88. PubMed ID: 24835233

13. Safe Reinforcement Learning With Dual Robustness.
Li Z; Hu C; Wang Y; Yang Y; Li SE
IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):10876-10890. PubMed ID: 39146157

14. An immediate-return reinforcement learning for the atypical Markov decision processes.
Pan Z; Wen G; Tan Z; Yin S; Hu X
Front Neurorobot; 2022; 16():1012427. PubMed ID: 36582302

15. Discovering and Exploiting Sparse Rewards in a Learned Behavior Space.
Paolo G; Coninx M; Laflaquière A; Doncieux S
Evol Comput; 2024 Sep; 32(3):275-305. PubMed ID: 37793063

16. MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning.
Li Q; Peng Z; Feng L; Zhang Q; Xue Z; Zhou B
IEEE Trans Pattern Anal Mach Intell; 2023 Mar; 45(3):3461-3475. PubMed ID: 35830412

17. Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning.
Zhao R; Chen Z; Fan Y; Li Y; Gao F
Sensors (Basel); 2024 Jun; 24(13):. PubMed ID: 39000919

18. Markov decision processes: a tool for sequential decision making under uncertainty.
Alagoz O; Hsu H; Schaefer AJ; Roberts MS
Med Decis Making; 2010; 30(4):474-83. PubMed ID: 20044582

19. Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model.
Wang B; Yan Y; Fan J
Adv Neural Inf Process Syst; 2021 Dec; 34():16671-16685. PubMed ID: 36168331

20. Reinforcement learning for intensive care medicine: actionable clinical insights from novel approaches to reward shaping and off-policy model evaluation.
Roggeveen LF; Hassouni AE; de Grooth HJ; Girbes ARJ; Hoogendoorn M; Elbers PWG
Intensive Care Med Exp; 2024 Mar; 12(1):32. PubMed ID: 38526681
