404 related articles for article (PubMed ID: 23237972)
1. Autonomous reinforcement learning with experience replay.
Wawrzyński P; Tanwani AK
Neural Netw; 2013 May; 41():156-67. PubMed ID: 23237972
[TBL] [Abstract][Full Text] [Related]
2. Real-time reinforcement learning by sequential Actor-Critics and experience replay.
Wawrzyński P
Neural Netw; 2009 Dec; 22(10):1484-97. PubMed ID: 19523786
[TBL] [Abstract][Full Text] [Related]
3. Reinforcement learning of motor skills with policy gradients.
Peters J; Schaal S
Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830
[TBL] [Abstract][Full Text] [Related]
4. Parameter-exploring policy gradients.
Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
[TBL] [Abstract][Full Text] [Related]
5. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes.
Murakoshi K; Mizuno J
Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950
[TBL] [Abstract][Full Text] [Related]
6. Robust reinforcement learning control using integral quadratic constraints for recurrent neural networks.
Anderson CW; Young PM; Buehner MR; Knight JN; Bush KA; Hittle DC
IEEE Trans Neural Netw; 2007 Jul; 18(4):993-1002. PubMed ID: 17668656
[TBL] [Abstract][Full Text] [Related]
7. Impedance learning for robotic contact tasks using natural actor-critic algorithm.
Kim B; Park J; Park S; Kang S
IEEE Trans Syst Man Cybern B Cybern; 2010 Apr; 40(2):433-43. PubMed ID: 19696001
[TBL] [Abstract][Full Text] [Related]
8. Efficient model learning methods for actor-critic control.
Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
[TBL] [Abstract][Full Text] [Related]
9. Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.
Modares H; Lewis FL; Naghibi-Sistani MB
IEEE Trans Neural Netw Learn Syst; 2013 Oct; 24(10):1513-25. PubMed ID: 24808590
[TBL] [Abstract][Full Text] [Related]
10. Acceleration of reinforcement learning by policy evaluation using nonstationary iterative method.
Senda K; Hattori S; Hishinuma T; Kohda T
IEEE Trans Cybern; 2014 Dec; 44(12):2696-705. PubMed ID: 24733037
[TBL] [Abstract][Full Text] [Related]
11. Reinforcement learning for a biped robot based on a CPG-actor-critic method.
Nakamura Y; Mori T; Sato MA; Ishii S
Neural Netw; 2007 Aug; 20(6):723-35. PubMed ID: 17412559
[TBL] [Abstract][Full Text] [Related]
12. Detecting and preventing error propagation via competitive learning.
Silva TC; Zhao L
Neural Netw; 2013 May; 41():70-84. PubMed ID: 23200192
[TBL] [Abstract][Full Text] [Related]
13. Decentralized learning in Markov games.
Vrancx P; Verbeeck K; Nowé A
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387
[TBL] [Abstract][Full Text] [Related]
14. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Hachiya H; Akiyama T; Sugiayma M; Peters J
Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
[TBL] [Abstract][Full Text] [Related]
15. SOVEREIGN: An autonomous neural system for incrementally learning planned action sequences to navigate towards a rewarded goal.
Gnadt W; Grossberg S
Neural Netw; 2008 Jun; 21(5):699-758. PubMed ID: 17996419
[TBL] [Abstract][Full Text] [Related]
16. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
[TBL] [Abstract][Full Text] [Related]
17. Ensemble algorithms in reinforcement learning.
Wiering MA; van Hasselt H
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380
[TBL] [Abstract][Full Text] [Related]
18. Kernel-based least squares policy iteration for reinforcement learning.
Xu X; Hu D; Lu X
IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
[TBL] [Abstract][Full Text] [Related]
19. Neuromuscular control of the point to point and oscillatory movements of a sagittal arm with the actor-critic reinforcement learning method.
Golkhou V; Parnianpour M; Lucas C
Comput Methods Biomech Biomed Engin; 2005 Apr; 8(2):103-13. PubMed ID: 16154874
[TBL] [Abstract][Full Text] [Related]
20. An information-theoretic analysis of return maximization in reinforcement learning.
Iwata K
Neural Netw; 2011 Dec; 24(10):1074-81. PubMed ID: 21665429
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]