Biomarkers Search

Molecular Biopsy of Human Tumors: a resource for Precision Medicine

112 related articles for article (PubMed ID: 36166566)

1. Action Candidate Driven Clipped Double Q-Learning for Discrete and Continuous Action Tasks.
Jiang H; Li G; Xie J; Yang J
IEEE Trans Neural Netw Learn Syst; 2024 Apr; 35(4):5269-5279. PubMed ID: 36166566

2. Integrated Double Estimator Architecture for Reinforcement Learning.
Lv P; Wang X; Cheng Y; Duan Z; Chen CLP
IEEE Trans Cybern; 2022 May; 52(5):3111-3122. PubMed ID: 33027028

3. Actor-Critic With Synthesis Loss for Solving Approximation Biases.
Guo BW; Chao F; Chang X; Shang C; Shen Q
IEEE Trans Cybern; 2024 Sep; 54(9):5323-5336. PubMed ID: 38700970

4. Collaborative double robust targeted maximum likelihood estimation.
van der Laan MJ; Gruber S
Int J Biostat; 2010 May; 6(1):Article 17. PubMed ID: 20628637

5. On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm.
Hwang U; Hong S
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619960

6. Reducing Estimation Bias via Triplet-Average Deep Deterministic Policy Gradient.
Wu D; Dong X; Shen J; Hoi SCH
IEEE Trans Neural Netw Learn Syst; 2020 Nov; 31(11):4933-4945. PubMed ID: 31940565

7. Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.
Crider K; Williams J; Qi YP; Gutman J; Yeung L; Mai C; Finkelstain J; Mehta S; Pons-Duran C; Menéndez C; Moraleda C; Rogers L; Daniels K; Green P
Cochrane Database Syst Rev; 2022 Feb; 2(2022):. PubMed ID: 36321557

8. Learning-Based DoS Attack Power Allocation in Multiprocess Systems.
Huang M; Ding K; Dey S; Li Y; Shi L
IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):8017-8030. PubMed ID: 35167483

9. On the Q statistic with constant weights for standardized mean difference.
Bakbergenuly I; Hoaglin DC; Kulinskaya E
Br J Math Stat Psychol; 2022 Nov; 75(3):444-465. PubMed ID: 35094381

10. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.
Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z
IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289

11. Reduction of bias in estimating the frequency of recessive genes.
Huether CA; Murphy EA
Am J Hum Genet; 1980 Mar; 32(2):212-22. PubMed ID: 7386457

12. Scaling Up Q-Learning via Exploiting State-Action Equivalence.
Lyu Y; Côme A; Zhang Y; Talebi MS
Entropy (Basel); 2023 Mar; 25(4):. PubMed ID: 37190372

13. Value Iteration Networks with Double Estimator for Planetary Rover Path Planning.
Jin X; Lan W; Wang T; Yu P
Sensors (Basel); 2021 Dec; 21(24):. PubMed ID: 34960508

14. Human-in-the-Loop Reinforcement Learning in Continuous-Action Space.
Luo B; Wu Z; Zhou F; Wang BC
IEEE Trans Neural Netw Learn Syst; 2024 Nov; 35(11):15735-15744. PubMed ID: 37418406

15. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning.
Yang Z; Qu H; Fu M; Hu W; Zhao Y
IEEE Trans Cybern; 2023 Mar; 53(3):1499-1510. PubMed ID: 34478393

16. Realistic Actor-Critic: A framework for balance between value overestimation and underestimation.
Li S; Tang Q; Pang Y; Ma X; Wang G
Front Neurorobot; 2022; 16():1081242. PubMed ID: 36699950

17. Parameterized MDPs and Reinforcement Learning Problems-A Maximum Entropy Principle-Based Framework.
Srivastava A; Salapaka SM
IEEE Trans Cybern; 2022 Sep; 52(9):9339-9351. PubMed ID: 34406959

18. Approximate Policy-Based Accelerated Deep Reinforcement Learning.
Wang X; Gu Y; Cheng Y; Liu A; Chen CLP
IEEE Trans Neural Netw Learn Syst; 2020 Jun; 31(6):1820-1830. PubMed ID: 31398131

19. Q-ADER: An Effective Q-Learning for Recommendation With Diminishing Action Space.
Li F; Qu H; Zhang L; Fu M; Chen W; Yi Z
IEEE Trans Neural Netw Learn Syst; 2024 Jul; PP():. PubMed ID: 39012739

20. Context transfer in reinforcement learning using action-value functions.
Mousavi A; Nadjar Araabi B; Nili Ahmadabadi M
Comput Intell Neurosci; 2014; 2014():428567. PubMed ID: 25610457
