435 related articles for article (PubMed ID: 18632380)
1. Ensemble algorithms in reinforcement learning.
Wiering MA; van Hasselt H
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380
[TBL] [Abstract][Full Text] [Related]
2. Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators.
Baddeley B
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):950-6. PubMed ID: 18632383
[TBL] [Abstract][Full Text] [Related]
3. Improved Adaptive-Reinforcement Learning Control for morphing unmanned air vehicles.
Valasek J; Doebbler J; Tandale MD; Meade AJ
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):1014-20. PubMed ID: 18632393
[TBL] [Abstract][Full Text] [Related]
4. An evolutionary approach toward dynamic self-generated fuzzy inference systems.
Zhou Y; Er MJ
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):963-9. PubMed ID: 18632385
[TBL] [Abstract][Full Text] [Related]
5. Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.
Yang Q; Vance JB; Jagannathan S
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):994-1001. PubMed ID: 18632390
[TBL] [Abstract][Full Text] [Related]
6. Incoherent control of quantum systems with wavefunction-controllable subspaces via quantum reinforcement learning.
Dong D; Chen C; Tarn TJ; Pechen A; Rabitz H
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):957-62. PubMed ID: 18632384
[TBL] [Abstract][Full Text] [Related]
7. A spiking neural network model of an actor-critic learning agent.
Potjans W; Morrison A; Diesmann M
Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231
[TBL] [Abstract][Full Text] [Related]
8. Adaptive feedback control by constrained approximate dynamic programming.
Ferrari S; Steck JE; Chandramohan R
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):982-7. PubMed ID: 18632388
[TBL] [Abstract][Full Text] [Related]
9. Issues on stability of ADP feedback controllers for dynamical systems.
Balakrishnan SN; Ding J; Lewis FL
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):913-7. PubMed ID: 18632377
[TBL] [Abstract][Full Text] [Related]
10. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes.
Murakoshi K; Mizuno J
Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950
[TBL] [Abstract][Full Text] [Related]
11. Robust reinforcement learning.
Morimoto J; Doya K
Neural Comput; 2005 Feb; 17(2):335-59. PubMed ID: 15720771
[TBL] [Abstract][Full Text] [Related]
12. Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof.
Al-Tamimi A; Lewis FL; Abu-Khalaf M
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):943-9. PubMed ID: 18632382
[TBL] [Abstract][Full Text] [Related]
13. Direct heuristic dynamic programming for damping oscillations in a large power system.
Lu C; Si J; Xie X
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):1008-13. PubMed ID: 18632392
[TBL] [Abstract][Full Text] [Related]
14. Modeling of autonomous problem solving process by dynamic construction of task models in multiple tasks environment.
Ohigashi Y; Omori T
Neural Netw; 2006 Oct; 19(8):1169-80. PubMed ID: 16989982
[TBL] [Abstract][Full Text] [Related]
15. Reinforcement learning of motor skills with policy gradients.
Peters J; Schaal S
Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830
[TBL] [Abstract][Full Text] [Related]
16. Reliability of internal prediction/estimation and its application. I. Adaptive action selection reflecting reliability of value function.
Sakaguchi Y; Takano M
Neural Netw; 2004 Sep; 17(7):935-52. PubMed ID: 15312837
[TBL] [Abstract][Full Text] [Related]
17. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data.
Lewis FL; Vamvoudakis KG
IEEE Trans Syst Man Cybern B Cybern; 2011 Feb; 41(1):14-25. PubMed ID: 20350860
[TBL] [Abstract][Full Text] [Related]
18. Adaptive critic learning techniques for engine torque and air-fuel ratio control.
Liu D; Javaherian H; Kovalenko O; Huang T
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):988-93. PubMed ID: 18632389
[TBL] [Abstract][Full Text] [Related]
19. Elman backpropagation as reinforcement for simple recurrent networks.
GrĂ¼ning A
Neural Comput; 2007 Nov; 19(11):3108-31. PubMed ID: 17883351
[TBL] [Abstract][Full Text] [Related]
20. A graph-based evolutionary algorithm: Genetic Network Programming (GNP) and its extension using reinforcement learning.
Mabu S; Hirasawa K; Hu J
Evol Comput; 2007; 15(3):369-98. PubMed ID: 17705783
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]