Biomarkers Search

107 related articles for article (PubMed ID: 38700970)

1. Actor-Critic With Synthesis Loss for Solving Approximation Biases.
Guo BW; Chao F; Chang X; Shang C; Shen Q
IEEE Trans Cybern; 2024 Sep; 54(9):5323-5336. PubMed ID: 38700970

2. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599

3. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571

4. Reducing Estimation Bias via Triplet-Average Deep Deterministic Policy Gradient.
Wu D; Dong X; Shen J; Hoi SCH
IEEE Trans Neural Netw Learn Syst; 2020 Nov; 31(11):4933-4945. PubMed ID: 31940565

5. Realistic Actor-Critic: A framework for balance between value overestimation and underestimation.
Li S; Tang Q; Pang Y; Ma X; Wang G
Front Neurorobot; 2022; 16:1081242. PubMed ID: 36699950

6. Ensemble algorithms in reinforcement learning.
Wiering MA; van Hasselt H
IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380

7. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
Shang Z; Li R; Zheng C; Li H; Cui Y
IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648

8. Mild Policy Evaluation for Offline Actor-Critic.
Huang L; Dong B; Lu J; Zhang W
IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802

9. Offline Reinforcement Learning With Behavior Value Regularization.
Huang L; Dong B; Xie W; Zhang W
IEEE Trans Cybern; 2024 Jun; 54(6):3692-3704. PubMed ID: 38669164

10. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.
Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z
IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289

11. Action Candidate Driven Clipped Double Q-Learning for Discrete and Continuous Action Tasks.
Jiang H; Li G; Xie J; Yang J
IEEE Trans Neural Netw Learn Syst; 2024 Apr; 35(4):5269-5279. PubMed ID: 36166566

12. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J; Kurt MN; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721

13. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163:86-96. PubMed ID: 37030278

14. Reinforcement learning in continuous time and space.
Doya K
Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940

15. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.
Chen Y; Zhang F; Liu Z
Neural Netw; 2024 Jan; 169:764-777. PubMed ID: 37981458

16. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
Banerjee C; Chen Z; Noman N
IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412

17. Continuous action deep reinforcement learning for propofol dosing during general anesthesia.
Schamberg G; Badgeley M; Meschede-Krasa B; Kwon O; Brown EN
Artif Intell Med; 2022 Jan; 123:102227. PubMed ID: 34998516

18. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
Zhong S; Liu Q; Fu Q
Comput Intell Neurosci; 2016; 2016:4824072. PubMed ID: 27795704

19. Supervised-actor-critic reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
Yu C; Ren G; Dong Y
BMC Med Inform Decis Mak; 2020 Jul; 20(Suppl 3):124. PubMed ID: 32646412

20. Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model.
Hu Y; Wang W; Liu H; Liu L
IEEE Trans Neural Netw Learn Syst; 2020 Sep; 31(9):3570-3578. PubMed ID: 31689218
