These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

107 related articles for article (PubMed ID: 38507376)

  • 1. MEOL: A Maximum-Entropy Framework for Options Learning.
    Zhang P; Dong W; Cai M; Jia S; Wang ZP
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; PP():. PubMed ID: 38507376
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Empowering the Diversity and Individuality of Option: Residual Soft Option Critic Framework.
    Zhu A; Chen F; Xu H; Ouyang D; Shao J
    IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4816-4825. PubMed ID: 34851834
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
    Shang Z; Li R; Zheng C; Li H; Cui Y
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
    [TBL] [Abstract][Full Text] [Related]  

  • 4. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning.
    Yang Z; Qu H; Fu M; Hu W; Zhao Y
    IEEE Trans Cybern; 2023 Mar; 53(3):1499-1510. PubMed ID: 34478393
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
    Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
    Banerjee C; Chen Z; Noman N
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Exploration With Task Information for Meta Reinforcement Learning.
    Jiang P; Song S; Huang G
    IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4033-4046. PubMed ID: 34739382
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.
    Chen W; Wong KKL; Long S; Sun Z
    Entropy (Basel); 2022 Mar; 24(4):. PubMed ID: 35455103
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.
    Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z
    IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289
    [TBL] [Abstract][Full Text] [Related]  

  • 10. Hierarchical Adversarial Inverse Reinforcement Learning.
    Chen J; Lan T; Aggarwal V
    IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37703157
    [TBL] [Abstract][Full Text] [Related]  

  • 11. An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.
    Guo D; Tang L; Zhang X; Liang YC
    Neural Netw; 2024 Feb; 170():610-621. PubMed ID: 38056408
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Temporal and state abstractions for efficient learning, transfer, and composition in humans.
    Xia L; Collins AGE
    Psychol Rev; 2021 Jul; 128(4):643-666. PubMed ID: 34014709
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.
    Wu Y; Liao S; Liu X; Li Z; Lu R
    IEEE Trans Neural Netw Learn Syst; 2023 Jul; 34(7):3680-3690. PubMed ID: 34669579
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Constructing Temporally Extended Actions through Incremental Community Detection.
    Xu X; Yang M; Li G
    Comput Intell Neurosci; 2018; 2018():2085721. PubMed ID: 29849543
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning.
    Sun H; Fu T; Ling Y; He C
    Sensors (Basel); 2021 Sep; 21(17):. PubMed ID: 34502796
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Erratum: Eyestalk Ablation to Increase Ovarian Maturation in Mud Crabs.
    J Vis Exp; 2023 May; (195):. PubMed ID: 37235796
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Space-Air-Ground Integrated Mobile Crowdsensing for Partially Observable Data Collection by Multi-Scale Convolutional Graph Reinforcement Learning.
    Ren Y; Ye Z; Song G; Jiang X
    Entropy (Basel); 2022 May; 24(5):. PubMed ID: 35626523
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Robust Actor-Critic With Relative Entropy Regulating Actor.
    Cheng Y; Huang L; Chen CLP; Wang X
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 6.