These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
2. Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation. Tangkaratt V; Mori S; Zhao T; Morimoto J; Sugiyama M Neural Netw; 2014 Sep; 57():128-40. PubMed ID: 24995917 [TBL] [Abstract][Full Text] [Related]
3. Reward-weighted regression with sample reuse for direct policy search in reinforcement learning. Hachiya H; Peters J; Sugiyama M Neural Comput; 2011 Nov; 23(11):2798-832. PubMed ID: 21851281 [TBL] [Abstract][Full Text] [Related]
4. Efficient exploration through active learning for value function approximation in reinforcement learning. Akiyama T; Hachiya H; Sugiyama M Neural Netw; 2010 Jun; 23(5):639-48. PubMed ID: 20080026 [TBL] [Abstract][Full Text] [Related]
5. A reinforcement learning algorithm acquires demonstration from the training agent by dividing the task space. Zu L; He X; Yang J; Liu L; Wang W Neural Netw; 2023 Jul; 164():419-427. PubMed ID: 37187108 [TBL] [Abstract][Full Text] [Related]
6. Conditional density estimation with dimensionality reduction via squared-loss conditional entropy minimization. Tangkaratt V; Xie N; Sugiyama M Neural Comput; 2015 Jan; 27(1):228-54. PubMed ID: 25380340 [TBL] [Abstract][Full Text] [Related]
7. Kernel-based least squares policy iteration for reinforcement learning. Xu X; Hu D; Lu X IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655 [TBL] [Abstract][Full Text] [Related]
8. MOSAIC for multiple-reward environments. Sugimoto N; Haruno M; Doya K; Kawato M Neural Comput; 2012 Mar; 24(3):577-606. PubMed ID: 22168558 [TBL] [Abstract][Full Text] [Related]
9. State representation learning for control: An overview. Lesort T; Díaz-Rodríguez N; Goudou JI; Filliat D Neural Netw; 2018 Dec; 108():379-392. PubMed ID: 30268059 [TBL] [Abstract][Full Text] [Related]