These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

107 related articles for article (PubMed ID: 38700970)

  • 1. Actor-Critic With Synthesis Loss for Solving Approximation Biases.
    Guo BW; Chao F; Chang X; Shang C; Shen Q
    IEEE Trans Cybern; 2024 Sep; 54(9):5323-5336. PubMed ID: 38700970
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
    Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Reducing Estimation Bias via Triplet-Average Deep Deterministic Policy Gradient.
    Wu D; Dong X; Shen J; Hoi SCH
    IEEE Trans Neural Netw Learn Syst; 2020 Nov; 31(11):4933-4945. PubMed ID: 31940565
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Realistic Actor-Critic: A framework for balance between value overestimation and underestimation.
    Li S; Tang Q; Pang Y; Ma X; Wang G
    Front Neurorobot; 2022; 16():1081242. PubMed ID: 36699950
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Ensemble algorithms in reinforcement learning.
    Wiering MA; van Hasselt H
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
    Shang Z; Li R; Zheng C; Li H; Cui Y
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Mild Policy Evaluation for Offline Actor-Critic.
    Huang L; Dong B; Lu J; Zhang W
    IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Offline Reinforcement Learning With Behavior Value Regularization.
    Huang L; Dong B; Xie W; Zhang W
    IEEE Trans Cybern; 2024 Jun; 54(6):3692-3704. PubMed ID: 38669164
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.
    Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z
    IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Action Candidate Driven Clipped Double Q-Learning for Discrete and Continuous Action Tasks.
    Jiang H; Li G; Xie J; Yang J
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; 35(4):5269-5279. PubMed ID: 36166566
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
    Zheng J; Kurt MN; Wang X
    IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Reinforcement learning in continuous time and space.
    Doya K
    Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.
    Chen Y; Zhang F; Liu Z
    Neural Netw; 2024 Jan; 169():764-777. PubMed ID: 37981458
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
    Banerjee C; Chen Z; Noman N
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Continuous action deep reinforcement learning for propofol dosing during general anesthesia.
    Schamberg G; Badgeley M; Meschede-Krasa B; Kwon O; Brown EN
    Artif Intell Med; 2022 Jan; 123():102227. PubMed ID: 34998516
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
    Zhong S; Liu Q; Fu Q
    Comput Intell Neurosci; 2016; 2016():4824072. PubMed ID: 27795704
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
    Yu C; Ren G; Dong Y
    BMC Med Inform Decis Mak; 2020 Jul; 20(Suppl 3):124. PubMed ID: 32646412
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model.
    Hu Y; Wang W; Liu H; Liu L
    IEEE Trans Neural Netw Learn Syst; 2020 Sep; 31(9):3570-3578. PubMed ID: 31689218
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 6.