289 related articles for article (PubMed ID: 20116208)
1. Online learning of shaping rewards in reinforcement learning.
Grześ M; Kudenko D
Neural Netw; 2010 May; 23(4):541-50. PubMed ID: 20116208
[TBL] [Abstract][Full Text] [Related]
2. Optimal control in microgrid using multi-agent reinforcement learning.
Li FD; Wu M; He Y; Chen X
ISA Trans; 2012 Nov; 51(6):743-51. PubMed ID: 22824135
[TBL] [Abstract][Full Text] [Related]
3. Reinforcement learning in supply chains.
Valluri A; North MJ; Macal CM
Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962
[TBL] [Abstract][Full Text] [Related]
4. Posterior weighted reinforcement learning with state uncertainty.
Larsen T; Leslie DS; Collins EJ; Bogacz R
Neural Comput; 2010 May; 22(5):1149-79. PubMed ID: 20100078
[TBL] [Abstract][Full Text] [Related]
5. Efficient model learning methods for actor-critic control.
Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
[TBL] [Abstract][Full Text] [Related]
6. Decentralized learning in Markov games.
Vrancx P; Verbeeck K; Nowé A
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387
[TBL] [Abstract][Full Text] [Related]
7. Kernel-based least squares policy iteration for reinforcement learning.
Xu X; Hu D; Lu X
IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
[TBL] [Abstract][Full Text] [Related]
8. Model-based reinforcement learning under concurrent schedules of reinforcement in rodents.
Huh N; Jo S; Kim H; Sul JH; Jung MW
Learn Mem; 2009 May; 16(5):315-23. PubMed ID: 19403794
[TBL] [Abstract][Full Text] [Related]
9. Artificial intelligence framework for simulating clinical decision-making: a Markov decision process approach.
Bennett CC; Hauser K
Artif Intell Med; 2013 Jan; 57(1):9-19. PubMed ID: 23287490
[TBL] [Abstract][Full Text] [Related]
10. Parameter-exploring policy gradients.
Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
[TBL] [Abstract][Full Text] [Related]
11. MOSAIC for multiple-reward environments.
Sugimoto N; Haruno M; Doya K; Kawato M
Neural Comput; 2012 Mar; 24(3):577-606. PubMed ID: 22168558
[TBL] [Abstract][Full Text] [Related]
12. Autonomous reinforcement learning with experience replay.
Wawrzyński P; Tanwani AK
Neural Netw; 2013 May; 41():156-67. PubMed ID: 23237972
[TBL] [Abstract][Full Text] [Related]
13. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
[TBL] [Abstract][Full Text] [Related]
14. A spiking neural network model of an actor-critic learning agent.
Potjans W; Morrison A; Diesmann M
Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231
[TBL] [Abstract][Full Text] [Related]
15. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Hachiya H; Akiyama T; Sugiayma M; Peters J
Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
[TBL] [Abstract][Full Text] [Related]
16. Context transfer in reinforcement learning using action-value functions.
Mousavi A; Nadjar Araabi B; Nili Ahmadabadi M
Comput Intell Neurosci; 2014; 2014():428567. PubMed ID: 25610457
[TBL] [Abstract][Full Text] [Related]
17. A spiking neural model for stable reinforcement of synapses based on multiple distal rewards.
O'Brien MJ; Srinivasa N
Neural Comput; 2013 Jan; 25(1):123-56. PubMed ID: 23020112
[TBL] [Abstract][Full Text] [Related]
18. Integrating temporal difference methods and self-organizing neural networks for reinforcement learning with delayed evaluative feedback.
Tan AH; Lu N; Xiao D
IEEE Trans Neural Netw; 2008 Feb; 19(2):230-44. PubMed ID: 18269955
[TBL] [Abstract][Full Text] [Related]
19. Online learning control using adaptive critic designs with sparse kernel machines.
Xu X; Hou Z; Lian C; He H
IEEE Trans Neural Netw Learn Syst; 2013 May; 24(5):762-75. PubMed ID: 24808426
[TBL] [Abstract][Full Text] [Related]
20. Theory meets pigeons: the influence of reward-magnitude on discrimination-learning.
Rose J; Schmidt R; Grabemann M; Güntürkün O
Behav Brain Res; 2009 Mar; 198(1):125-9. PubMed ID: 19041347
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]