362 related articles for PubMed ID 29932363
1. Emergent Solutions to High-Dimensional Multitask Reinforcement Learning.
Kelly S; Heywood MI
Evol Comput; 2018; 26(3):347-380. PubMed ID: 29932363
2. Human-level control through deep reinforcement learning.
Mnih V; Kavukcuoglu K; Silver D; Rusu AA; Veness J; Bellemare MG; Graves A; Riedmiller M; Fidjeland AK; Ostrovski G; Petersen S; Beattie C; Sadik A; Antonoglou I; King H; Kumaran D; Wierstra D; Legg S; Hassabis D
Nature; 2015 Feb; 518(7540):529-33. PubMed ID: 25719670
3. Multiagent cooperation and competition with deep reinforcement learning.
Tampuu A; Matiisen T; Kodelja D; Kuzovkin I; Korjus K; Aru J; Aru J; Vicente R
PLoS One; 2017; 12(4):e0172395. PubMed ID: 28380078
4. Model-based reinforcement learning for partially observable games with sampling-based state estimation.
Fujita H; Ishii S
Neural Comput; 2007 Nov; 19(11):3051-87. PubMed ID: 17883349
5. Spiking neural networks with different reinforcement learning (RL) schemes in a multiagent setting.
Christodoulou C; Cleanthous A
Chin J Physiol; 2010 Dec; 53(6):447-53. PubMed ID: 21793357
6. Multiagent reinforcement learning in the Iterated Prisoner's Dilemma.
Sandholm TW; Crites RH
Biosystems; 1996; 37(1-2):147-66. PubMed ID: 8924633
7. Ensemble algorithms in reinforcement learning.
Wiering MA; van Hasselt H
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380
8. Modeling of autonomous problem solving process by dynamic construction of task models in multiple tasks environment.
Ohigashi Y; Omori T
Neural Netw; 2006 Oct; 19(8):1169-80. PubMed ID: 16989982
9. MOSAIC for multiple-reward environments.
Sugimoto N; Haruno M; Doya K; Kawato M
Neural Comput; 2012 Mar; 24(3):577-606. PubMed ID: 22168558
10. Reinforcement learning in supply chains.
Valluri A; North MJ; Macal CM
Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962
11. Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to Atari Breakout game.
Patel D; Hazan H; Saunders DJ; Siegelmann HT; Kozma R
Neural Netw; 2019 Dec; 120():108-115. PubMed ID: 31500931
12. A theoretical analysis of temporal difference learning in the iterated prisoner's dilemma game.
Masuda N; Ohtsuki H
Bull Math Biol; 2009 Nov; 71(8):1818-50. PubMed ID: 19479310
13. Reinforcement learning algorithms for robotic navigation in dynamic environments.
Yen GG; Hickey TW
ISA Trans; 2004 Apr; 43(2):217-30. PubMed ID: 15098582
14. Robust reinforcement learning.
Morimoto J; Doya K
Neural Comput; 2005 Feb; 17(2):335-59. PubMed ID: 15720771
15. Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators.
Baddeley B
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):950-6. PubMed ID: 18632383
16. Multiagent reinforcement learning: spiking and nonspiking agents in the iterated Prisoner's Dilemma.
Vassiliades V; Cleanthous A; Christodoulou C
IEEE Trans Neural Netw; 2011 Apr; 22(4):639-53. PubMed ID: 21421435
17. Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.
Ezaki T; Horita Y; Takezawa M; Masuda N
PLoS Comput Biol; 2016 Jul; 12(7):e1005034. PubMed ID: 27438888
18. A spiking neural network model of an actor-critic learning agent.
Potjans W; Morrison A; Diesmann M
Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231
19. Human-level performance in 3D multiplayer games with population-based reinforcement learning.
Jaderberg M; Czarnecki WM; Dunning I; Marris L; Lever G; Castañeda AG; Beattie C; Rabinowitz NC; Morcos AS; Ruderman A; Sonnerat N; Green T; Deason L; Leibo JZ; Silver D; Hassabis D; Kavukcuoglu K; Graepel T
Science; 2019 May; 364(6443):859-865. PubMed ID: 31147514
20. Reinforcement learning in multidimensional environments relies on attention mechanisms.
Niv Y; Daniel R; Geana A; Gershman SJ; Leong YC; Radulescu A; Wilson RC
J Neurosci; 2015 May; 35(21):8145-57. PubMed ID: 26019331