These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
6. Reinforcement learning of motor skills with policy gradients. Peters J; Schaal S Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830 [TBL] [Abstract][Full Text] [Related]
7. Node perturbation learning without noiseless baseline. Cho T; Katahira K; Okanoya K; Okada M Neural Netw; 2011 Apr; 24(3):267-72. PubMed ID: 21193286 [TBL] [Abstract][Full Text] [Related]
8. Reward-weighted regression with sample reuse for direct policy search in reinforcement learning. Hachiya H; Peters J; Sugiyama M Neural Comput; 2011 Nov; 23(11):2798-832. PubMed ID: 21851281 [TBL] [Abstract][Full Text] [Related]
9. Incremental state aggregation for value function estimation in reinforcement learning. Mori T; Ishii S IEEE Trans Syst Man Cybern B Cybern; 2011 Oct; 41(5):1407-16. PubMed ID: 21632307 [TBL] [Abstract][Full Text] [Related]
10. Elman backpropagation as reinforcement for simple recurrent networks. Grüning A Neural Comput; 2007 Nov; 19(11):3108-31. PubMed ID: 17883351 [TBL] [Abstract][Full Text] [Related]
11. Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation. Tangkaratt V; Mori S; Zhao T; Morimoto J; Sugiyama M Neural Netw; 2014 Sep; 57():128-40. PubMed ID: 24995917 [TBL] [Abstract][Full Text] [Related]
12. Intelligent quotient estimation of mental retarded people from different psychometric instruments using artificial neural networks. Di Nuovo AG; Di Nuovo S; Buono S Artif Intell Med; 2012 Feb; 54(2):135-45. PubMed ID: 22153923 [TBL] [Abstract][Full Text] [Related]
13. Clipping in neurocontrol by adaptive dynamic programming. Fairbank M; Prokhorov D; Alonso E IEEE Trans Neural Netw Learn Syst; 2014 Oct; 25(10):1909-20. PubMed ID: 25291742 [TBL] [Abstract][Full Text] [Related]
15. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes. Murakoshi K; Mizuno J Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950 [TBL] [Abstract][Full Text] [Related]
16. Partially observable Markov decision processes and performance sensitivity analysis. Li Y; Yin B; Xi H IEEE Trans Syst Man Cybern B Cybern; 2008 Dec; 38(6):1645-51. PubMed ID: 19022734 [TBL] [Abstract][Full Text] [Related]
17. Convergence analysis of three classes of split-complex gradient algorithms for complex-valued recurrent neural networks. Xu D; Zhang H; Liu L Neural Comput; 2010 Oct; 22(10):2655-77. PubMed ID: 20608871 [TBL] [Abstract][Full Text] [Related]
18. Global exponential stability of generalized recurrent neural networks with discrete and distributed delays. Liu Y; Wang Z; Liu X Neural Netw; 2006 Jun; 19(5):667-75. PubMed ID: 16046098 [TBL] [Abstract][Full Text] [Related]
19. Principal whitened gradient for information geometry. Yang Z; Laaksonen J Neural Netw; 2008; 21(2-3):232-40. PubMed ID: 18255260 [TBL] [Abstract][Full Text] [Related]
20. Asymptotic stability for neural networks with mixed time-delays: the discrete-time case. Liu Y; Wang Z; Liu X Neural Netw; 2009 Jan; 22(1):67-74. PubMed ID: 19028076 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]