Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

69 related articles for article (PubMed ID: 22019189)

1. Analysis and improvement of policy gradient estimation.
Zhao T; Hachiya H; Niu G; Sugiyama M
Neural Netw; 2012 Feb; 26():118-29. PubMed ID: 22019189
[TBL] [Abstract][Full Text] [Related]

2. Parameter-exploring policy gradients.
Sehnke F; Osendorfer C; Rückstiess T; Graves A; Peters J; Schmidhuber J
Neural Netw; 2010 May; 23(4):551-9. PubMed ID: 20061118
[TBL] [Abstract][Full Text] [Related]

3. Efficient sample reuse in policy gradients with parameter-based exploration.
Zhao T; Hachiya H; Tangkaratt V; Morimoto J; Sugiyama M
Neural Comput; 2013 Jun; 25(6):1512-47. PubMed ID: 23517103
[TBL] [Abstract][Full Text] [Related]

4. Adaptive importance sampling for value function approximation in off-policy reinforcement learning.
Hachiya H; Akiyama T; Sugiayma M; Peters J
Neural Netw; 2009 Dec; 22(10):1399-410. PubMed ID: 19216050
[TBL] [Abstract][Full Text] [Related]

5. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
[TBL] [Abstract][Full Text] [Related]

6. Reinforcement learning of motor skills with policy gradients.
Peters J; Schaal S
Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830
[TBL] [Abstract][Full Text] [Related]

7. Node perturbation learning without noiseless baseline.
Cho T; Katahira K; Okanoya K; Okada M
Neural Netw; 2011 Apr; 24(3):267-72. PubMed ID: 21193286
[TBL] [Abstract][Full Text] [Related]

8. Reward-weighted regression with sample reuse for direct policy search in reinforcement learning.
Hachiya H; Peters J; Sugiyama M
Neural Comput; 2011 Nov; 23(11):2798-832. PubMed ID: 21851281
[TBL] [Abstract][Full Text] [Related]

9. Incremental state aggregation for value function estimation in reinforcement learning.
Mori T; Ishii S
IEEE Trans Syst Man Cybern B Cybern; 2011 Oct; 41(5):1407-16. PubMed ID: 21632307
[TBL] [Abstract][Full Text] [Related]

10. Elman backpropagation as reinforcement for simple recurrent networks.
Grüning A
Neural Comput; 2007 Nov; 19(11):3108-31. PubMed ID: 17883351
[TBL] [Abstract][Full Text] [Related]

11. Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation.
Tangkaratt V; Mori S; Zhao T; Morimoto J; Sugiyama M
Neural Netw; 2014 Sep; 57():128-40. PubMed ID: 24995917
[TBL] [Abstract][Full Text] [Related]

12. Intelligent quotient estimation of mental retarded people from different psychometric instruments using artificial neural networks.
Di Nuovo AG; Di Nuovo S; Buono S
Artif Intell Med; 2012 Feb; 54(2):135-45. PubMed ID: 22153923
[TBL] [Abstract][Full Text] [Related]

13. Clipping in neurocontrol by adaptive dynamic programming.
Fairbank M; Prokhorov D; Alonso E
IEEE Trans Neural Netw Learn Syst; 2014 Oct; 25(10):1909-20. PubMed ID: 25291742
[TBL] [Abstract][Full Text] [Related]

14. Reinforcement learning state estimator.
Morimoto J; Doya K
Neural Comput; 2007 Mar; 19(3):730-56. PubMed ID: 17298231
[TBL] [Abstract][Full Text] [Related]

15. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes.
Murakoshi K; Mizuno J
Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950
[TBL] [Abstract][Full Text] [Related]

16. Partially observable Markov decision processes and performance sensitivity analysis.
Li Y; Yin B; Xi H
IEEE Trans Syst Man Cybern B Cybern; 2008 Dec; 38(6):1645-51. PubMed ID: 19022734
[TBL] [Abstract][Full Text] [Related]

17. Convergence analysis of three classes of split-complex gradient algorithms for complex-valued recurrent neural networks.
Xu D; Zhang H; Liu L
Neural Comput; 2010 Oct; 22(10):2655-77. PubMed ID: 20608871
[TBL] [Abstract][Full Text] [Related]

18. Global exponential stability of generalized recurrent neural networks with discrete and distributed delays.
Liu Y; Wang Z; Liu X
Neural Netw; 2006 Jun; 19(5):667-75. PubMed ID: 16046098
[TBL] [Abstract][Full Text] [Related]

19. Principal whitened gradient for information geometry.
Yang Z; Laaksonen J
Neural Netw; 2008; 21(2-3):232-40. PubMed ID: 18255260
[TBL] [Abstract][Full Text] [Related]

20. Asymptotic stability for neural networks with mixed time-delays: the discrete-time case.
Liu Y; Wang Z; Liu X
Neural Netw; 2009 Jan; 22(1):67-74. PubMed ID: 19028076
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]