Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

145 related articles for article (PubMed ID: 23741197)

1. Linear Fitted-Q Iteration with Multiple Reward Functions.
Lizotte DJ; Bowling M; Murphy SA
J Mach Learn Res; 2012 Nov; 13(Nov):3253-3295. PubMed ID: 23741197
[TBL] [Abstract][Full Text] [Related]

2. Multi-Objective Markov Decision Processes for Data-Driven Decision Support.
Lizotte DJ; Laber EB
J Mach Learn Res; 2016; 17():. PubMed ID: 28018133
[TBL] [Abstract][Full Text] [Related]

3. Finite-approximation-error-based discrete-time iterative adaptive dynamic programming.
Wei Q; Wang FY; Liu D; Yang X
IEEE Trans Cybern; 2014 Dec; 44(12):2820-33. PubMed ID: 25265640
[TBL] [Abstract][Full Text] [Related]

4. Continuous-Time Fitted Value Iteration for Robust Policies.
Lutter M; Belousov B; Mannor S; Fox D; Garg A; Peters J
IEEE Trans Pattern Anal Mach Intell; 2023 May; 45(5):5534-5548. PubMed ID: 36260585
[TBL] [Abstract][Full Text] [Related]

5. An XGBoost-Based Fitted Q Iteration for Finding the Optimal STI Strategies for HIV Patients.
Yu Y; Tran H
IEEE Trans Neural Netw Learn Syst; 2022 Jun; PP():. PubMed ID: 35653445
[TBL] [Abstract][Full Text] [Related]

6. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning.
Zhang L; Peng Y; Yang W; Zhang Z
IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315
[TBL] [Abstract][Full Text] [Related]

7. Optimization of anemia treatment in hemodialysis patients via reinforcement learning.
Escandell-Montero P; Chermisi M; Martínez-Martínez JM; Gómez-Sanchis J; Barbieri C; Soria-Olivas E; Mari F; Vila-Francés J; Stopper A; Gatti E; Martín-Guerrero JD
Artif Intell Med; 2014 Sep; 62(1):47-60. PubMed ID: 25091172
[TBL] [Abstract][Full Text] [Related]

8. Impaired Expected Value Computations Coupled With Overreliance on Stimulus-Response Learning in Schizophrenia.
Hernaus D; Gold JM; Waltz JA; Frank MJ
Biol Psychiatry Cogn Neurosci Neuroimaging; 2018 Nov; 3(11):916-926. PubMed ID: 29735404
[TBL] [Abstract][Full Text] [Related]

9. Striatal dopamine, reward, and decision making in schizophrenia.
Deserno L; Schlagenhauf F; Heinz A
Dialogues Clin Neurosci; 2016 Mar; 18(1):77-89. PubMed ID: 27069382
[TBL] [Abstract][Full Text] [Related]

10. Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems.
Wei Q; Liu D; Yang X
IEEE Trans Neural Netw Learn Syst; 2015 Apr; 26(4):866-79. PubMed ID: 25751877
[TBL] [Abstract][Full Text] [Related]

11. Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
Yu C; Liu J; Zhao H
BMC Med Inform Decis Mak; 2019 Apr; 19(Suppl 2):57. PubMed ID: 30961594
[TBL] [Abstract][Full Text] [Related]

12. Robust Inverse Q-Learning for Continuous-Time Linear Systems in Adversarial Environments.
Lian B; Xue W; Lewis FL; Chai T
IEEE Trans Cybern; 2022 Dec; 52(12):13083-13095. PubMed ID: 34403352
[TBL] [Abstract][Full Text] [Related]

13. Normative decision rules in changing environments.
Barendregt NW; Gold JI; Josić K; Kilpatrick ZP
Elife; 2022 Oct; 11():. PubMed ID: 36282065
[TBL] [Abstract][Full Text] [Related]

14. Value iteration for streaming data on a continuous space with gradient method in an RKHS.
Liu J; Xu W; Wang Y; Lian H
Neural Netw; 2023 Sep; 166():437-445. PubMed ID: 37566954
[TBL] [Abstract][Full Text] [Related]

15. Negative symptoms and the failure to represent the expected reward value of actions: behavioral and computational modeling evidence.
Gold JM; Waltz JA; Matveeva TM; Kasanova Z; Strauss GP; Herbener ES; Collins AG; Frank MJ
Arch Gen Psychiatry; 2012 Feb; 69(2):129-38. PubMed ID: 22310503
[TBL] [Abstract][Full Text] [Related]

16. Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning.
Gao X; Li X; Liu Q; Li Z; Yang F; Luan T
Sensors (Basel); 2022 Jun; 22(12):. PubMed ID: 35746364
[TBL] [Abstract][Full Text] [Related]

17. Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems.
Wei Q; Liu D; Lin H
IEEE Trans Cybern; 2016 Mar; 46(3):840-53. PubMed ID: 26552103
[TBL] [Abstract][Full Text] [Related]

18. Composition of web services using Markov decision processes and dynamic programming.
Uc-Cetina V; Moo-Mena F; Hernandez-Ucan R
ScientificWorldJournal; 2015; 2015():545308. PubMed ID: 25874247
[TBL] [Abstract][Full Text] [Related]

19. Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors.
Wei Q; Li B; Song R
IEEE Trans Neural Netw Learn Syst; 2018 Apr; 29(4):1226-1238. PubMed ID: 28362617
[TBL] [Abstract][Full Text] [Related]

20. Sensorimotor learning biases choice behavior: a learning neural field model for decision making.
Klaes C; Schneegans S; Schöner G; Gail A
PLoS Comput Biol; 2012; 8(11):e1002774. PubMed ID: 23166483
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]