These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
Pubmed for Handhelds
PUBMED FOR HANDHELDS
Journal Abstract Search
113 related items for PubMed ID: 33481718
1. An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning. Meng W, Zheng Q, Shi Y, Pan G. IEEE Trans Neural Netw Learn Syst; 2022 May; 33(5):2223-2235. PubMed ID: 33481718 [Abstract] [Full Text] [Related]
3. Authentic Boundary Proximal Policy Optimization. Cheng Y, Huang L, Wang X. IEEE Trans Cybern; 2022 Sep; 52(9):9428-9438. PubMed ID: 33705327 [Abstract] [Full Text] [Related]
4. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning. Zhang Q, Leng S, Ma X, Liu Q, Wang X, Liang B, Liu Y, Yang J. IEEE Trans Neural Netw Learn Syst; 2024 Feb 23; PP():. PubMed ID: 38393836 [Abstract] [Full Text] [Related]
10. Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning. Li T, Yang G, Chu J. IEEE Trans Cybern; 2024 May 29; 54(5):3051-3064. PubMed ID: 37030741 [Abstract] [Full Text] [Related]
11. Relative sparsity for medical decision problems. Weisenthal SJ, Thurston SW, Ertefaie A. Stat Med; 2023 Aug 15; 42(18):3067-3092. PubMed ID: 37315949 [Abstract] [Full Text] [Related]
13. Graph-Attention-Based Casual Discovery With Trust Region-Navigated Clipping Policy Optimization. Liu S, Feng Y, Wu K, Cheng G, Huang J, Liu Z. IEEE Trans Cybern; 2023 Apr 15; 53(4):2311-2324. PubMed ID: 34665751 [Abstract] [Full Text] [Related]
14. Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network. Meng W, Zheng Q, Yang L, Li P, Pan G. IEEE Trans Neural Netw Learn Syst; 2020 Oct 15; 31(10):4374-4380. PubMed ID: 31765320 [Abstract] [Full Text] [Related]
16. Intelligent Navigation of a Magnetic Microrobot with Model-Free Deep Reinforcement Learning in a Real-World Environment. Salehi A, Hosseinpour S, Tabatabaei N, Soltani Firouz M, Yu T. Micromachines (Basel); 2024 Jan 09; 15(1):. PubMed ID: 38258231 [Abstract] [Full Text] [Related]
17. Anti-Martingale Proximal Policy Optimization. Gu Y, Cheng Y, Yu K, Wang X. IEEE Trans Cybern; 2023 Oct 09; 53(10):6421-6432. PubMed ID: 35560090 [Abstract] [Full Text] [Related]
19. Reinforcement Learning for Improving Agent Design. Ha D. Artif Life; 2019 Oct 09; 25(4):352-365. PubMed ID: 31697584 [Abstract] [Full Text] [Related]
20. Extreme Trust Region Policy Optimization for Active Object Recognition. Liu H, Wu Y, Sun F, Huaping Liu, Yupei Wu, Fuchun Sun, Sun F, Liu H, Wu Y. IEEE Trans Neural Netw Learn Syst; 2018 Jun 09; 29(6):2253-2258. PubMed ID: 29771676 [Abstract] [Full Text] [Related] Page: [Next] [New Search]