BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

362 related articles for article (PubMed ID: 29932363)

  • 1. Emergent Solutions to High-Dimensional Multitask Reinforcement Learning.
    Kelly S; Heywood MI
    Evol Comput; 2018; 26(3):347-380. PubMed ID: 29932363

  • 2. Human-level control through deep reinforcement learning.
    Mnih V; Kavukcuoglu K; Silver D; Rusu AA; Veness J; Bellemare MG; Graves A; Riedmiller M; Fidjeland AK; Ostrovski G; Petersen S; Beattie C; Sadik A; Antonoglou I; King H; Kumaran D; Wierstra D; Legg S; Hassabis D
    Nature; 2015 Feb; 518(7540):529-33. PubMed ID: 25719670

  • 3. Multiagent cooperation and competition with deep reinforcement learning.
    Tampuu A; Matiisen T; Kodelja D; Kuzovkin I; Korjus K; Aru J; Aru J; Vicente R
    PLoS One; 2017; 12(4):e0172395. PubMed ID: 28380078

  • 4. Model-based reinforcement learning for partially observable games with sampling-based state estimation.
    Fujita H; Ishii S
    Neural Comput; 2007 Nov; 19(11):3051-87. PubMed ID: 17883349

  • 5. Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting.
    Christodoulou C; Cleanthous A
    Chin J Physiol; 2010 Dec; 53(6):447-53. PubMed ID: 21793357

  • 6. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.
    Sandholm TW; Crites RH
    Biosystems; 1996; 37(1-2):147-66. PubMed ID: 8924633

  • 7. Ensemble algorithms in reinforcement learning.
    Wiering MA; van Hasselt H
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380

  • 8. Modeling of autonomous problem solving process by dynamic construction of task models in multiple tasks environment.
    Ohigashi Y; Omori T
    Neural Netw; 2006 Oct; 19(8):1169-80. PubMed ID: 16989982

  • 9. MOSAIC for multiple-reward environments.
    Sugimoto N; Haruno M; Doya K; Kawato M
    Neural Comput; 2012 Mar; 24(3):577-606. PubMed ID: 22168558

  • 10. Reinforcement learning in supply chains.
    Valluri A; North MJ; Macal CM
    Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962

  • 11. Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game.
    Patel D; Hazan H; Saunders DJ; Siegelmann HT; Kozma R
    Neural Netw; 2019 Dec; 120:108-115. PubMed ID: 31500931

  • 12. A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game.
    Masuda N; Ohtsuki H
    Bull Math Biol; 2009 Nov; 71(8):1818-50. PubMed ID: 19479310

  • 13. Reinforcement learning algorithms for robotic navigation in dynamic environments.
    Yen GG; Hickey TW
    ISA Trans; 2004 Apr; 43(2):217-30. PubMed ID: 15098582

  • 14. Robust reinforcement learning.
    Morimoto J; Doya K
    Neural Comput; 2005 Feb; 17(2):335-59. PubMed ID: 15720771

  • 15. Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators.
    Baddeley B
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):950-6. PubMed ID: 18632383

  • 16. Multiagent reinforcement learning: spiking and nonspiking agents in the iterated Prisoner's Dilemma.
    Vassiliades V; Cleanthous A; Christodoulou C
    IEEE Trans Neural Netw; 2011 Apr; 22(4):639-53. PubMed ID: 21421435

  • 17. Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.
    Ezaki T; Horita Y; Takezawa M; Masuda N
    PLoS Comput Biol; 2016 Jul; 12(7):e1005034. PubMed ID: 27438888

  • 18. A spiking neural network model of an actor-critic learning agent.
    Potjans W; Morrison A; Diesmann M
    Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231

  • 19. Human-level performance in 3D multiplayer games with population-based reinforcement learning.
    Jaderberg M; Czarnecki WM; Dunning I; Marris L; Lever G; Castañeda AG; Beattie C; Rabinowitz NC; Morcos AS; Ruderman A; Sonnerat N; Green T; Deason L; Leibo JZ; Silver D; Hassabis D; Kavukcuoglu K; Graepel T
    Science; 2019 May; 364(6443):859-865. PubMed ID: 31147514

  • 20. Reinforcement learning in multidimensional environments relies on attention mechanisms.
    Niv Y; Daniel R; Geana A; Gershman SJ; Leong YC; Radulescu A; Wilson RC
    J Neurosci; 2015 May; 35(21):8145-57. PubMed ID: 26019331
