BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

289 related articles for article (PubMed ID: 20116208)

  • 1. Online learning of shaping rewards in reinforcement learning.
    Grześ M; Kudenko D
    Neural Netw; 2010 May; 23(4):541-50. PubMed ID: 20116208
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Optimal control in microgrid using multi-agent reinforcement learning.
    Li FD; Wu M; He Y; Chen X
    ISA Trans; 2012 Nov; 51(6):743-51. PubMed ID: 22824135
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Reinforcement learning in supply chains.
    Valluri A; North MJ; Macal CM
    Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Posterior weighted reinforcement learning with state uncertainty.
    Larsen T; Leslie DS; Collins EJ; Bogacz R
    Neural Comput; 2010 May; 22(5):1149-79. PubMed ID: 20100078
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Efficient model learning methods for actor-critic control.
    Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
    IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Decentralized learning in Markov games.
    Vrancx P; Verbeeck K; Nowé A
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Kernel-based least squares policy iteration for reinforcement learning.
    Xu X; Hu D; Lu X
    IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Model-based reinforcement learning under concurrent schedules of reinforcement in rodents.
    Huh N; Jo S; Kim H; Sul JH; Jung MW
    Learn Mem; 2009 May; 16(5):315-23. PubMed ID: 19403794
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Artificial intelligence framework for simulating clinical decision-making: a Markov decision process approach.
    Bennett CC; Hauser K
    Artif Intell Med; 2013 Jan; 57(1):9-19. PubMed ID: 23287490
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Parameter-exploring policy gradients.
    Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
    Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
    [TBL] [Abstract][Full Text] [Related]  

  • 11. MOSAIC for multiple-reward environments.
    Sugimoto N; Haruno M; Doya K; Kawato M
    Neural Comput; 2012 Mar; 24(3):577-606. PubMed ID: 22168558
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Autonomous reinforcement learning with experience replay.
    Wawrzyński P; Tanwani AK
    Neural Netw; 2013 May; 41():156-67. PubMed ID: 23237972
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
    Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
    Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
    [TBL] [Abstract][Full Text] [Related]  

  • 14. A spiking neural network model of an actor-critic learning agent.
    Potjans W; Morrison A; Diesmann M
    Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
    Hachiya H; Akiyama T; Sugiayma M; Peters J
    Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Context transfer in reinforcement learning using action-value functions.
    Mousavi A; Nadjar Araabi B; Nili Ahmadabadi M
    Comput Intell Neurosci; 2014; 2014():428567. PubMed ID: 25610457
    [TBL] [Abstract][Full Text] [Related]  

  • 17. A spiking neural model for stable reinforcement of synapses based on multiple distal rewards.
    O'Brien MJ; Srinivasa N
    Neural Comput; 2013 Jan; 25(1):123-56. PubMed ID: 23020112
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback.
    Tan AH; Lu N; Xiao D
    IEEE Trans Neural Netw; 2008 Feb; 19(2):230-44. PubMed ID: 18269955
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Online learning control using adaptive critic designs with sparse kernel machines.
    Xu X; Hou Z; Lian C; He H
    IEEE Trans Neural Netw Learn Syst; 2013 May; 24(5):762-75. PubMed ID: 24808426
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Theory meets pigeons: the influence of reward-magnitude on discrimination-learning.
    Rose J; Schmidt R; Grabemann M; Güntürkün O
    Behav Brain Res; 2009 Mar; 198(1):125-9. PubMed ID: 19041347
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 15.