These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

115 related articles for article (PubMed ID: 38669170)

  • 1. Distributional Policy Gradient With Distributional Value Function.
    Liu Q; Li Y; Shi X; Lin K; Liu Y; Lou Y
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38669170
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
    Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Distributional generative adversarial imitation learning with reproducing kernel generalization.
    Zhou Y; Lu M; Liu X; Che Z; Xu Z; Tang J; Zhang Y; Peng Y; Peng Y
    Neural Netw; 2023 Aug; 165():43-59. PubMed ID: 37276810
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning.
    Bai C; Xiao T; Zhu Z; Wang L; Zhou F; Garg A; He B; Liu P; Wang Z
    IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8954-8968. PubMed ID: 36331649
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning.
    Li T; Yang G; Chu J
    IEEE Trans Cybern; 2024 May; 54(5):3051-3064. PubMed ID: 37030741
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Inference-Based Posteriori Parameter Distribution Optimization.
    Wang X; Li T; Cheng Y; Chen CLP
    IEEE Trans Cybern; 2022 May; 52(5):3006-3017. PubMed ID: 33027029
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Dual Parallel Policy Iteration With Coupled Policy Improvement.
    Cheng Y; Huang L; Chen CLP; Wang X
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):4286-4298. PubMed ID: 36094996
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Exploration With Task Information for Meta Reinforcement Learning.
    Jiang P; Song S; Huang G
    IEEE Trans Neural Netw Learn Syst; 2023 Aug; 34(8):4033-4046. PubMed ID: 34739382
    [TBL] [Abstract][Full Text] [Related]  

  • 9. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning.
    Zhang Q; Leng S; Ma X; Liu Q; Wang X; Liang B; Liu Y; Yang J
    IEEE Trans Neural Netw Learn Syst; 2024 Feb; PP():. PubMed ID: 38393836
    [TBL] [Abstract][Full Text] [Related]  

  • 10. A Distributional Perspective on Multiagent Cooperation With Deep Reinforcement Learning.
    Huang L; Fu M; Rao A; Irissappane AA; Zhang J; Xu C
    IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):4246-4259. PubMed ID: 36121959
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Asymmetric and adaptive reward coding via normalized reinforcement learning.
    Louie K
    PLoS Comput Biol; 2022 Jul; 18(7):e1010350. PubMed ID: 35862443
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Batch Reinforcement Learning With a Nonparametric Off-Policy Policy Gradient.
    Tosatto S; Carvalho J; Peters J
    IEEE Trans Pattern Anal Mach Intell; 2022 Oct; 44(10):5996-6010. PubMed ID: 34106848
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Diversity Evolutionary Policy Deep Reinforcement Learning.
    Liu J; Feng L
    Comput Intell Neurosci; 2021; 2021():5300189. PubMed ID: 34394336
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Relative Entropy Regularized Sample-Efficient Reinforcement Learning With Continuous Actions.
    Shang Z; Li R; Zheng C; Li H; Cui Y
    IEEE Trans Neural Netw Learn Syst; 2023 Nov; PP():. PubMed ID: 37943648
    [TBL] [Abstract][Full Text] [Related]  

  • 15. An opponent striatal circuit for distributional reinforcement learning.
    Lowet AS; Zheng Q; Meng M; Matias S; Drugowitsch J; Uchida N
    bioRxiv; 2024 Jan; ():. PubMed ID: 38260354
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Guided Cooperation in Hierarchical Reinforcement Learning via Model-Based Rollout.
    Wang H; Tang Z; Sun Y; Wang F; Zhang S; Chen Y
    IEEE Trans Neural Netw Learn Syst; 2024 Aug; PP():. PubMed ID: 39133586
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Improving Offline Reinforcement Learning With In-Sample Advantage Regularization for Robot Manipulation.
    Ma C; Yang D; Wu T; Liu Z; Yang H; Chen X; Lan X; Zheng N
    IEEE Trans Neural Netw Learn Syst; 2024 Sep; PP():. PubMed ID: 39302799
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning.
    Morimura T; Uchibe E; Yoshimoto J; Peters J; Doya K
    Neural Comput; 2010 Feb; 22(2):342-76. PubMed ID: 19842990
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Mild Policy Evaluation for Offline Actor-Critic.
    Huang L; Dong B; Lu J; Zhang W
    IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37676802
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model.
    Hu Y; Wang W; Liu H; Liu L
    IEEE Trans Neural Netw Learn Syst; 2020 Sep; 31(9):3570-3578. PubMed ID: 31689218
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 6.