210 related articles for article (PubMed ID: 19885962)
1. Reinforcement learning in supply chains.
Valluri A; North MJ; Macal CM
Int J Neural Syst; 2009 Oct; 19(5):331-44. PubMed ID: 19885962
[TBL] [Abstract][Full Text] [Related]
2. Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks.
Anderson CW; Young PM; Buehner MR; Knight JN; Bush KA; Hittle DC
IEEE Trans Neural Netw; 2007 Jul; 18(4):993-1002. PubMed ID: 17668656
[TBL] [Abstract][Full Text] [Related]
3. Online learning of shaping rewards in reinforcement learning.
Grześ M; Kudenko D
Neural Netw; 2010 May; 23(4):541-50. PubMed ID: 20116208
[TBL] [Abstract][Full Text] [Related]
4. Decentralized learning in Markov games.
Vrancx P; Verbeeck K; Nowé A
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387
[TBL] [Abstract][Full Text] [Related]
5. Autonomous reinforcement learning with experience replay.
Wawrzyński P; Tanwani AK
Neural Netw; 2013 May; 41():156-67. PubMed ID: 23237972
[TBL] [Abstract][Full Text] [Related]
6. Individualization of pharmacological anemia management using reinforcement learning.
Gaweda AE; Muezzinoglu MK; Aronoff GR; Jacobs AA; Zurada JM; Brier ME
Neural Netw; 2005; 18(5-6):826-34. PubMed ID: 16109475
[TBL] [Abstract][Full Text] [Related]
7. Parameter-exploring policy gradients.
Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
[TBL] [Abstract][Full Text] [Related]
8. Human-level control through deep reinforcement learning.
Mnih V; Kavukcuoglu K; Silver D; Rusu AA; Veness J; Bellemare MG; Graves A; Riedmiller M; Fidjeland AK; Ostrovski G; Petersen S; Beattie C; Sadik A; Antonoglou I; King H; Kumaran D; Wierstra D; Legg S; Hassabis D
Nature; 2015 Feb; 518(7540):529-33. PubMed ID: 25719670
[TBL] [Abstract][Full Text] [Related]
9. An information-theoretic analysis of return maximization in reinforcement learning.
Iwata K
Neural Netw; 2011 Dec; 24(10):1074-81. PubMed ID: 21665429
[TBL] [Abstract][Full Text] [Related]
10. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Hachiya H; Akiyama T; Sugiayma M; Peters J
Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
[TBL] [Abstract][Full Text] [Related]
11. A graph-based evolutionary algorithm: Genetic Network Programming (GNP) and its extension using reinforcement learning.
Mabu S; Hirasawa K; Hu J
Evol Comput; 2007; 15(3):369-98. PubMed ID: 17705783
[TBL] [Abstract][Full Text] [Related]
12. A computational neural model of goal-directed utterance selection.
Klein M; Kamp H; Palm G; Doya K
Neural Netw; 2010 Jun; 23(5):592-606. PubMed ID: 20116973
[TBL] [Abstract][Full Text] [Related]
13. Efficient model learning methods for actor-critic control.
Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
[TBL] [Abstract][Full Text] [Related]
14. Self-organizing neural networks integrating domain knowledge and reinforcement learning.
Teng TH; Tan AH; Zurada JM
IEEE Trans Neural Netw Learn Syst; 2015 May; 26(5):889-902. PubMed ID: 25881365
[TBL] [Abstract][Full Text] [Related]
15. Interacting with an artificial partner: modeling the role of emotional aspects.
Cattinelli I; Goldwurm M; Borghese NA
Biol Cybern; 2008 Dec; 99(6):473-89. PubMed ID: 18813942
[TBL] [Abstract][Full Text] [Related]
16. Emergent Solutions to High-Dimensional Multitask Reinforcement Learning.
Kelly S; Heywood MI
Evol Comput; 2018; 26(3):347-380. PubMed ID: 29932363
[TBL] [Abstract][Full Text] [Related]
17. A new criterion using information gain for action selection strategy in reinforcement learning.
Iwata K; Ikeda K; Sakai H
IEEE Trans Neural Netw; 2004 Jul; 15(4):792-9. PubMed ID: 15461073
[TBL] [Abstract][Full Text] [Related]
18. Optimization of anemia treatment in hemodialysis patients via reinforcement learning.
Escandell-Montero P; Chermisi M; Martínez-Martínez JM; Gómez-Sanchis J; Barbieri C; Soria-Olivas E; Mari F; Vila-Francés J; Stopper A; Gatti E; Martín-Guerrero JD
Artif Intell Med; 2014 Sep; 62(1):47-60. PubMed ID: 25091172
[TBL] [Abstract][Full Text] [Related]
19. Auditory learning: a developmental method.
Zhang Y; Weng J; Hwang WS
IEEE Trans Neural Netw; 2005 May; 16(3):601-16. PubMed ID: 15940990
[TBL] [Abstract][Full Text] [Related]
20. Value-directed human behavior analysis from video using partially observable Markov decision processes.
Hoey J; Little JJ
IEEE Trans Pattern Anal Mach Intell; 2007 Jul; 29(7):1118-32. PubMed ID: 17496372
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]