These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

159 related articles for article (PubMed ID: 30408692)

  • 1. Implicit incremental natural actor critic algorithm.
    Iwaki R; Asada M
    Neural Netw; 2019 Jan; 109():103-112. PubMed ID: 30408692
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Efficient model learning methods for actor-critic control.
    Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
    IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Reinforcement learning in continuous time and space.
    Doya K
    Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
    Shi D; Guo X; Liu Y; Fan W
    Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Continuous-time adaptive critics.
    Hanselmann T; Noakes L; Zaknich A
    IEEE Trans Neural Netw; 2007 May; 18(3):631-47. PubMed ID: 17526332
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.
    Modares H; Lewis FL; Naghibi-Sistani MB
    IEEE Trans Neural Netw Learn Syst; 2013 Oct; 24(10):1513-25. PubMed ID: 24808590
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms.
    Zhang H; Xu J; Zhang J; Liu Q
    Comput Intell Neurosci; 2022; 2022():1117781. PubMed ID: 36438689
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
    Zheng J; Kurt MN; Wang X
    IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
    [TBL] [Abstract][Full Text] [Related]  

  • 12. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.
    Labao AB; Martija MAM; Naval PC
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1162-1176. PubMed ID: 32287019
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
    Zhong S; Liu Q; Fu Q
    Comput Intell Neurosci; 2016; 2016():4824072. PubMed ID: 27795704
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
    Li L; Zhu Y
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Robust Actor-Critic With Relative Entropy Regulating Actor.
    Cheng Y; Huang L; Chen CLP; Wang X
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268
    [TBL] [Abstract][Full Text] [Related]  

  • 16. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
    Modares H; Naghibi Sistani MB; Lewis FL
    ISA Trans; 2013 Sep; 52(5):611-21. PubMed ID: 23706414
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control.
    Wu HN; Luo B
    IEEE Trans Neural Netw Learn Syst; 2012 Dec; 23(12):1884-95. PubMed ID: 24808144
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.
    Chen Y; Zhang F; Liu Z
    Neural Netw; 2024 Jan; 169():764-777. PubMed ID: 37981458
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Impedance learning for robotic contact tasks using natural actor-critic algorithm.
    Kim B; Park J; Park S; Kang S
    IEEE Trans Syst Man Cybern B Cybern; 2010 Apr; 40(2):433-43. PubMed ID: 19696001
    [TBL] [Abstract][Full Text] [Related]  

  • 20. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives.
    Li C; Lowe R; Ziemke T
    Front Neurorobot; 2014; 8():23. PubMed ID: 25324773
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 8.