These tools will no longer be maintained as of December 31, 2024. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

145 related articles for article (PubMed ID: 23741197)

  • 1. Linear Fitted-Q Iteration with Multiple Reward Functions.
    Lizotte DJ; Bowling M; Murphy SA
    J Mach Learn Res; 2012 Nov; 13(Nov):3253-3295. PubMed ID: 23741197

  • 2. Multi-Objective Markov Decision Processes for Data-Driven Decision Support.
    Lizotte DJ; Laber EB
    J Mach Learn Res; 2016; 17():. PubMed ID: 28018133

  • 3. Finite-approximation-error-based discrete-time iterative adaptive dynamic programming.
    Wei Q; Wang FY; Liu D; Yang X
    IEEE Trans Cybern; 2014 Dec; 44(12):2820-33. PubMed ID: 25265640

  • 4. Continuous-Time Fitted Value Iteration for Robust Policies.
    Lutter M; Belousov B; Mannor S; Fox D; Garg A; Peters J
    IEEE Trans Pattern Anal Mach Intell; 2023 May; 45(5):5534-5548. PubMed ID: 36260585

  • 5. An XGBoost-Based Fitted Q Iteration for Finding the Optimal STI Strategies for HIV Patients.
    Yu Y; Tran H
    IEEE Trans Neural Netw Learn Syst; 2022 Jun; PP():. PubMed ID: 35653445

  • 6. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning.
    Zhang L; Peng Y; Yang W; Zhang Z
    IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315

  • 7. Optimization of anemia treatment in hemodialysis patients via reinforcement learning.
    Escandell-Montero P; Chermisi M; Martínez-Martínez JM; Gómez-Sanchis J; Barbieri C; Soria-Olivas E; Mari F; Vila-Francés J; Stopper A; Gatti E; Martín-Guerrero JD
    Artif Intell Med; 2014 Sep; 62(1):47-60. PubMed ID: 25091172

  • 8. Impaired Expected Value Computations Coupled With Overreliance on Stimulus-Response Learning in Schizophrenia.
    Hernaus D; Gold JM; Waltz JA; Frank MJ
    Biol Psychiatry Cogn Neurosci Neuroimaging; 2018 Nov; 3(11):916-926. PubMed ID: 29735404

  • 9. Striatal dopamine, reward, and decision making in schizophrenia.
    Deserno L; Schlagenhauf F; Heinz A
    Dialogues Clin Neurosci; 2016 Mar; 18(1):77-89. PubMed ID: 27069382

  • 10. Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems.
    Wei Q; Liu D; Yang X
    IEEE Trans Neural Netw Learn Syst; 2015 Apr; 26(4):866-79. PubMed ID: 25751877

  • 11. Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
    Yu C; Liu J; Zhao H
    BMC Med Inform Decis Mak; 2019 Apr; 19(Suppl 2):57. PubMed ID: 30961594

  • 12. Robust Inverse Q-Learning for Continuous-Time Linear Systems in Adversarial Environments.
    Lian B; Xue W; Lewis FL; Chai T
    IEEE Trans Cybern; 2022 Dec; 52(12):13083-13095. PubMed ID: 34403352

  • 13. Normative decision rules in changing environments.
    Barendregt NW; Gold JI; Josić K; Kilpatrick ZP
    Elife; 2022 Oct; 11():. PubMed ID: 36282065

  • 14. Value iteration for streaming data on a continuous space with gradient method in an RKHS.
    Liu J; Xu W; Wang Y; Lian H
    Neural Netw; 2023 Sep; 166():437-445. PubMed ID: 37566954

  • 15. Negative symptoms and the failure to represent the expected reward value of actions: behavioral and computational modeling evidence.
    Gold JM; Waltz JA; Matveeva TM; Kasanova Z; Strauss GP; Herbener ES; Collins AG; Frank MJ
    Arch Gen Psychiatry; 2012 Feb; 69(2):129-38. PubMed ID: 22310503

  • 16. Multi-Agent Decision-Making Modes in Uncertain Interactive Traffic Scenarios via Graph Convolution-Based Deep Reinforcement Learning.
    Gao X; Li X; Liu Q; Li Z; Yang F; Luan T
    Sensors (Basel); 2022 Jun; 22(12):. PubMed ID: 35746364

  • 17. Value Iteration Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems.
    Wei Q; Liu D; Lin H
    IEEE Trans Cybern; 2016 Mar; 46(3):840-53. PubMed ID: 26552103

  • 18. Composition of web services using Markov decision processes and dynamic programming.
    Uc-Cetina V; Moo-Mena F; Hernandez-Ucan R
    ScientificWorldJournal; 2015; 2015():545308. PubMed ID: 25874247

  • 19. Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors.
    Wei Q; Li B; Song R
    IEEE Trans Neural Netw Learn Syst; 2018 Apr; 29(4):1226-1238. PubMed ID: 28362617

  • 20. Sensorimotor learning biases choice behavior: a learning neural field model for decision making.
    Klaes C; Schneegans S; Schöner G; Gail A
    PLoS Comput Biol; 2012; 8(11):e1002774. PubMed ID: 23166483
