BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine

210 articles related to the article with PubMed ID 19885962

  • 1. Reinforcement learning in supply chains.
    Valluri A; North MJ; Macal CM
    Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962

  • 2. Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks.
    Anderson CW; Young PM; Buehner MR; Knight JN; Bush KA; Hittle DC
    IEEE Trans Neural Netw; 2007 Jul; 18(4):993-1002. PubMed ID: 17668656

  • 3. Online learning of shaping rewards in reinforcement learning.
    Grześ M; Kudenko D
    Neural Netw; 2010 May; 23(4):541-50. PubMed ID: 20116208

  • 4. Decentralized learning in Markov games.
    Vrancx P; Verbeeck K; Nowé A
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387

  • 5. Autonomous reinforcement learning with experience replay.
    Wawrzyński P; Tanwani AK
    Neural Netw; 2013 May; 41:156-67. PubMed ID: 23237972

  • 6. Individualization of pharmacological anemia management using reinforcement learning.
    Gaweda AE; Muezzinoglu MK; Aronoff GR; Jacobs AA; Zurada JM; Brier ME
    Neural Netw; 2005; 18(5-6):826-34. PubMed ID: 16109475

  • 7. Parameter-exploring policy gradients.
    Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
    Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118

  • 8. Human-level control through deep reinforcement learning.
    Mnih V; Kavukcuoglu K; Silver D; Rusu AA; Veness J; Bellemare MG; Graves A; Riedmiller M; Fidjeland AK; Ostrovski G; Petersen S; Beattie C; Sadik A; Antonoglou I; King H; Kumaran D; Wierstra D; Legg S; Hassabis D
    Nature; 2015 Feb; 518(7540):529-33. PubMed ID: 25719670

  • 9. An information-theoretic analysis of return maximization in reinforcement learning.
    Iwata K
    Neural Netw; 2011 Dec; 24(10):1074-81. PubMed ID: 21665429

  • 10. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
    Hachiya H; Akiyama T; Sugiyama M; Peters J
    Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050

  • 11. A graph-based evolutionary algorithm: Genetic Network Programming (GNP) and its extension using reinforcement learning.
    Mabu S; Hirasawa K; Hu J
    Evol Comput; 2007; 15(3):369-98. PubMed ID: 17705783

  • 12. A computational neural model of goal-directed utterance selection.
    Klein M; Kamp H; Palm G; Doya K
    Neural Netw; 2010 Jun; 23(5):592-606. PubMed ID: 20116973

  • 13. Efficient model learning methods for actor-critic control.
    Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
    IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998

  • 14. Self-organizing neural networks integrating domain knowledge and reinforcement learning.
    Teng TH; Tan AH; Zurada JM
    IEEE Trans Neural Netw Learn Syst; 2015 May; 26(5):889-902. PubMed ID: 25881365

  • 15. Interacting with an artificial partner: modeling the role of emotional aspects.
    Cattinelli I; Goldwurm M; Borghese NA
    Biol Cybern; 2008 Dec; 99(6):473-89. PubMed ID: 18813942

  • 16. Emergent Solutions to High-Dimensional Multitask Reinforcement Learning.
    Kelly S; Heywood MI
    Evol Comput; 2018; 26(3):347-380. PubMed ID: 29932363

  • 17. A new criterion using information gain for action selection strategy in reinforcement learning.
    Iwata K; Ikeda K; Sakai H
    IEEE Trans Neural Netw; 2004 Jul; 15(4):792-9. PubMed ID: 15461073

  • 18. Optimization of anemia treatment in hemodialysis patients via reinforcement learning.
    Escandell-Montero P; Chermisi M; Martínez-Martínez JM; Gómez-Sanchis J; Barbieri C; Soria-Olivas E; Mari F; Vila-Francés J; Stopper A; Gatti E; Martín-Guerrero JD
    Artif Intell Med; 2014 Sep; 62(1):47-60. PubMed ID: 25091172

  • 19. Auditory learning: a developmental method.
    Zhang Y; Weng J; Hwang WS
    IEEE Trans Neural Netw; 2005 May; 16(3):601-16. PubMed ID: 15940990

  • 20. Value-directed human behavior analysis from video using partially observable Markov decision processes.
    Hoey J; Little JJ
    IEEE Trans Pattern Anal Mach Intell; 2007 Jul; 29(7):1118-32. PubMed ID: 17496372
