BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

343 related articles for article (PubMed ID: 32324571)

  • 1. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
    Shang Z; Li R; Zheng C; Li H; Cui Y
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
    Zheng J; Kurt MN; Wang X
    IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Reinforcement learning solution for HJB equation arising in constrained optimal control problem.
    Luo B; Wu HN; Huang T; Liu D
    Neural Netw; 2015 Nov; 71():150-8. PubMed ID: 26356598
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
    Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Reinforcement learning in continuous time and space.
    Doya K
    Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
    Zhong S; Liu Q; Fu Q
    Comput Intell Neurosci; 2016; 2016():4824072. PubMed ID: 27795704
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Kernel-based least squares policy iteration for reinforcement learning.
    Xu X; Hu D; Lu X
    IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Efficient model learning methods for actor-critic control.
    Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
    IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998
    [TBL] [Abstract][Full Text] [Related]  

  • 12. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.
    Zhao C; Deng N
    Math Biosci Eng; 2024 Jan; 21(1):1445-1471. PubMed ID: 38303472
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Mild Policy Evaluation for Offline Actor-Critic.
    Huang L; Dong B; Lu J; Zhang W
    IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
    Banerjee C; Chen Z; Noman N
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Implicit incremental natural actor critic algorithm.
    Iwaki R; Asada M
    Neural Netw; 2019 Jan; 109():103-112. PubMed ID: 30408692
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Deep Deterministic Policy Gradient With Compatible Critic Network.
    Wang D; Hu M
    IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4332-4344. PubMed ID: 34653007
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Optimized Backstepping Tracking Control Using Reinforcement Learning for a Class of Stochastic Nonlinear Strict-Feedback Systems.
    Wen G; Xu L; Li B
    IEEE Trans Neural Netw Learn Syst; 2023 Mar; 34(3):1291-1303. PubMed ID: 34437076
    [TBL] [Abstract][Full Text] [Related]  

  • 18. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
    Modares H; Naghibi Sistani MB; Lewis FL
    ISA Trans; 2013 Sep; 52(5):611-21. PubMed ID: 23706414
    [TBL] [Abstract][Full Text] [Related]  

  • 19. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.
    Labao AB; Martija MAM; Naval PC
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1162-1176. PubMed ID: 32287019
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
    Shi D; Guo X; Liu Y; Fan W
    Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 18.