These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

131 related articles for article (PubMed ID: 38619961)

  • 1. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
    Li L; Zhu Y
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.
    Wu Y; Liao S; Liu X; Li Z; Lu R
    IEEE Trans Neural Netw Learn Syst; 2023 Jul; 34(7):3680-3690. PubMed ID: 34669579
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
    Zheng J; Kurt MN; Wang X
    IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Reinforcement learning for robust stabilization of nonlinear systems with asymmetric saturating actuators.
    Yang X; Zhou Y; Gao Z
    Neural Netw; 2023 Jan; 158():132-141. PubMed ID: 36455428
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Deep Deterministic Policy Gradient With Compatible Critic Network.
    Wang D; Hu M
    IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4332-4344. PubMed ID: 34653007
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
    Shang Z; Li R; Zheng C; Li H; Cui Y
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Behavior fusion for deep reinforcement learning.
    Shi H; Xu M; Hwang KS; Cai BY
    ISA Trans; 2020 Mar; 98():434-444. PubMed ID: 31543262
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
    Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Reinforcement learning in continuous time and space.
    Doya K
    Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Asynchronous learning for actor-critic neural networks and synchronous triggering for multiplayer system.
    Wang K; Mu C
    ISA Trans; 2022 Oct; 129(Pt B):295-308. PubMed ID: 35216805
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Deep Multi-Critic Network for accelerating Policy Learning in multi-agent environments.
    Hook J; Silva V; Kondoz A
    Neural Netw; 2020 Aug; 128():97-106. PubMed ID: 32446194
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Realistic Actor-Critic: A framework for balance between value overestimation and underestimation.
    Li S; Tang Q; Pang Y; Ma X; Wang G
    Front Neurorobot; 2022; 16():1081242. PubMed ID: 36699950
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
    Shi D; Guo X; Liu Y; Fan W
    Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
    [TBL] [Abstract][Full Text] [Related]  

  • 17. The Actor-Dueling-Critic Method for Reinforcement Learning.
    Wu M; Gao Y; Jung A; Zhang Q; Du S
    Sensors (Basel); 2019 Mar; 19(7):. PubMed ID: 30935035
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms.
    Zhang H; Xu J; Zhang J; Liu Q
    Comput Intell Neurosci; 2022; 2022():1117781. PubMed ID: 36438689
    [TBL] [Abstract][Full Text] [Related]  

  • 19. An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems.
    Zhao C; Deng N
    Math Biosci Eng; 2024 Jan; 21(1):1445-1471. PubMed ID: 38303472
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Intelligent control of self-driving vehicles based on adaptive sampling supervised actor-critic and human driving experience.
    Zhang J; Ma N; Wu Z; Wang C; Yao Y
    Math Biosci Eng; 2024 May; 21(5):6077-6096. PubMed ID: 38872570
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 7.