These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

69 related articles for article (PubMed ID: 22019189)

  • 1. Analysis and improvement of policy gradient estimation.
    Zhao T; Hachiya H; Niu G; Sugiyama M
    Neural Netw; 2012 Feb; 26():118-29. PubMed ID: 22019189
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Parameter-exploring policy gradients.
    Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
    Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Efficient sample reuse in policy gradients with parameter-based exploration.
    Zhao T; Hachiya H; Tangkaratt V; Morimoto J; Sugiyama M
    Neural Comput; 2013 Jun; 25(6):1512-47. PubMed ID: 23517103
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
    Hachiya H; Akiyama T; Sugiayma M; Peters J
    Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
    Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
    Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Reinforcement learning of motor skills with policy gradients.
    Peters J; Schaal S
    Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Node perturbation learning without noiseless baseline.
    Cho T; Katahira K; Okanoya K; Okada M
    Neural Netw; 2011 Apr; 24(3):267-72. PubMed ID: 21193286
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Reward-weighted regression with sample reuse for direct policy search in reinforcement learning.
    Hachiya H; Peters J; Sugiyama M
    Neural Comput; 2011 Nov; 23(11):2798-832. PubMed ID: 21851281
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Incremental state aggregation for value function estimation in reinforcement learning.
    Mori T; Ishii S
    IEEE Trans Syst Man Cybern B Cybern; 2011 Oct; 41(5):1407-16. PubMed ID: 21632307
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Elman backpropagation as reinforcement for simple recurrent networks.
    Grüning A
    Neural Comput; 2007 Nov; 19(11):3108-31. PubMed ID: 17883351
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.
    Tangkaratt V; Mori S; Zhao T; Morimoto J; Sugiyama M
    Neural Netw; 2014 Sep; 57():128-40. PubMed ID: 24995917
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Intelligent quotient estimation of mental retarded people from different psychometric instruments using artificial neural networks.
    Di Nuovo AG; Di Nuovo S; Buono S
    Artif Intell Med; 2012 Feb; 54(2):135-45. PubMed ID: 22153923
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Clipping in neurocontrol by adaptive dynamic programming.
    Fairbank M; Prokhorov D; Alonso E
    IEEE Trans Neural Netw Learn Syst; 2014 Oct; 25(10):1909-20. PubMed ID: 25291742
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Reinforcement learning state estimator.
    Morimoto J; Doya K
    Neural Comput; 2007 Mar; 19(3):730-56. PubMed ID: 17298231
    [TBL] [Abstract][Full Text] [Related]  

  • 15. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes.
    Murakoshi K; Mizuno J
    Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Partially observable Markov decision processes and performance sensitivity analysis.
    Li Y; Yin B; Xi H
    IEEE Trans Syst Man Cybern B Cybern; 2008 Dec; 38(6):1645-51. PubMed ID: 19022734
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Convergence analysis of three classes of split-complex gradient algorithms for complex-valued recurrent neural networks.
    Xu D; Zhang H; Liu L
    Neural Comput; 2010 Oct; 22(10):2655-77. PubMed ID: 20608871
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Global exponential stability of generalized recurrent neural networks with discrete and distributed delays.
    Liu Y; Wang Z; Liu X
    Neural Netw; 2006 Jun; 19(5):667-75. PubMed ID: 16046098
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Principal whitened gradient for information geometry.
    Yang Z; Laaksonen J
    Neural Netw; 2008; 21(2-3):232-40. PubMed ID: 18255260
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Asymptotic stability for neural networks with mixed time-delays: the discrete-time case.
    Liu Y; Wang Z; Liu X
    Neural Netw; 2009 Jan; 22(1):67-74. PubMed ID: 19028076
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 4.