Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

107 related articles for article (PubMed ID: 38507376)

1. MEOL: A Maximum-Entropy Framework for Options Learning.
Zhang P; Dong W; Cai M; Jia S; Wang ZP
IEEE Trans Neural Netw Learn Syst; 2024 Mar; PP():. PubMed ID: 38507376
[TBL] [Abstract][Full Text] [Related]

2. Empowering the Diversity and Individuality of Option: Residual Soft Option Critic Framework.
Zhu A; Chen F; Xu H; Ouyang D; Shao J
IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4816-4825. PubMed ID: 34851834
[TBL] [Abstract][Full Text] [Related]

3. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
Shang Z; Li R; Zheng C; Li H; Cui Y
IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
[TBL] [Abstract][Full Text] [Related]

4. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning.
Yang Z; Qu H; Fu M; Hu W; Zhao Y
IEEE Trans Cybern; 2023 Mar; 53(3):1499-1510. PubMed ID: 34478393
[TBL] [Abstract][Full Text] [Related]

5. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
[TBL] [Abstract][Full Text] [Related]

6. Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences.
Banerjee C; Chen Z; Noman N
IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):3121-3129. PubMed ID: 35588412
[TBL] [Abstract][Full Text] [Related]

7. Exploration With Task Information for Meta Reinforcement Learning.
Jiang P; Song S; Huang G
IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4033-4046. PubMed ID: 34739382
[TBL] [Abstract][Full Text] [Related]

8. Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.
Chen W; Wong KKL; Long S; Sun Z
Entropy (Basel); 2022 Mar; 24(4):. PubMed ID: 35455103
[TBL] [Abstract][Full Text] [Related]

9. Improving Exploration in Actor-Critic With Weakly Pessimistic Value Estimation and Optimistic Policy Optimization.
Li F; Fu M; Chen W; Zhang F; Zhang H; Qu H; Yi Z
IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8783-8796. PubMed ID: 36306289
[TBL] [Abstract][Full Text] [Related]

10. Hierarchical Adversarial Inverse Reinforcement Learning.
Chen J; Lan T; Aggarwal V
IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37703157
[TBL] [Abstract][Full Text] [Related]

11. An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.
Guo D; Tang L; Zhang X; Liang YC
Neural Netw; 2024 Feb; 170():610-621. PubMed ID: 38056408
[TBL] [Abstract][Full Text] [Related]

12. Temporal and state abstractions for efficient learning, transfer, and composition in humans.
Xia L; Collins AGE
Psychol Rev; 2021 Jul; 128(4):643-666. PubMed ID: 34014709
[TBL] [Abstract][Full Text] [Related]

13. Actor-Critic Learning Control Based on -Regularized Temporal-Difference Prediction With Gradient Correction.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664
[TBL] [Abstract][Full Text] [Related]

14. Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network.
Wu Y; Liao S; Liu X; Li Z; Lu R
IEEE Trans Neural Netw Learn Syst; 2023 Jul; 34(7):3680-3690. PubMed ID: 34669579
[TBL] [Abstract][Full Text] [Related]

15. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
[TBL] [Abstract][Full Text] [Related]

16. Constructing Temporally Extended Actions through Incremental Community Detection.
Xu X; Yang M; Li G
Comput Intell Neurosci; 2018; 2018():2085721. PubMed ID: 29849543
[TBL] [Abstract][Full Text] [Related]

17. Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning.
Sun H; Fu T; Ling Y; He C
Sensors (Basel); 2021 Sep; 21(17):. PubMed ID: 34502796
[TBL] [Abstract][Full Text] [Related]

18. Erratum: Eyestalk Ablation to Increase Ovarian Maturation in Mud Crabs.
J Vis Exp; 2023 May; (195):. PubMed ID: 37235796
[TBL] [Abstract][Full Text] [Related]

19. Space-Air-Ground Integrated Mobile Crowdsensing for Partially Observable Data Collection by Multi-Scale Convolutional Graph Reinforcement Learning.
Ren Y; Ye Z; Song G; Jiang X
Entropy (Basel); 2022 May; 24(5):. PubMed ID: 35626523
[TBL] [Abstract][Full Text] [Related]

20. Robust Actor-Critic With Relative Entropy Regulating Actor.
Cheng Y; Huang L; Chen CLP; Wang X
IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]