MEDLINE/PubMed Journal Browser Search

PubMed for Handhelds

Journal Abstract Search

113 related items for PubMed ID: 33481718

1. An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning.
Meng W, Zheng Q, Shi Y, Pan G.
IEEE Trans Neural Netw Learn Syst; 2022 May; 33(5):2223-2235. PubMed ID: 33481718

3. Authentic Boundary Proximal Policy Optimization.
Cheng Y, Huang L, Wang X.
IEEE Trans Cybern; 2022 Sep; 52(9):9428-9438. PubMed ID: 33705327

4. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning.
Zhang Q, Leng S, Ma X, Liu Q, Wang X, Liang B, Liu Y, Yang J.
IEEE Trans Neural Netw Learn Syst; 2024 Feb 23; PP():. PubMed ID: 38393836

6. Multiagent Trust Region Policy Optimization.
Li H, He H.
IEEE Trans Neural Netw Learn Syst; 2024 Sep 23; 35(9):12873-12887. PubMed ID: 37053062

9. Quantum architecture search via truly proximal policy optimization.
Zhu X, Hou X.
Sci Rep; 2023 Mar 29; 13(1):5157. PubMed ID: 36991061

10. Implicit Posteriori Parameter Distribution Optimization in Reinforcement Learning.
Li T, Yang G, Chu J.
IEEE Trans Cybern; 2024 May 29; 54(5):3051-3064. PubMed ID: 37030741

11. Relative sparsity for medical decision problems.
Weisenthal SJ, Thurston SW, Ertefaie A.
Stat Med; 2023 Aug 15; 42(18):3067-3092. PubMed ID: 37315949

13. Graph-Attention-Based Casual Discovery With Trust Region-Navigated Clipping Policy Optimization.
Liu S, Feng Y, Wu K, Cheng G, Huang J, Liu Z.
IEEE Trans Cybern; 2023 Apr 15; 53(4):2311-2324. PubMed ID: 34665751

14. Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network.
Meng W, Zheng Q, Yang L, Li P, Pan G.
IEEE Trans Neural Netw Learn Syst; 2020 Oct 15; 31(10):4374-4380. PubMed ID: 31765320

16. Intelligent Navigation of a Magnetic Microrobot with Model-Free Deep Reinforcement Learning in a Real-World Environment.
Salehi A, Hosseinpour S, Tabatabaei N, Soltani Firouz M, Yu T.
Micromachines (Basel); 2024 Jan 09; 15(1):. PubMed ID: 38258231

17. Anti-Martingale Proximal Policy Optimization.
Gu Y, Cheng Y, Yu K, Wang X.
IEEE Trans Cybern; 2023 Oct 09; 53(10):6421-6432. PubMed ID: 35560090

19. Reinforcement Learning for Improving Agent Design.
Ha D.
Artif Life; 2019 Oct 09; 25(4):352-365. PubMed ID: 31697584

20. Extreme Trust Region Policy Optimization for Active Object Recognition.
Liu H, Wu Y, Sun F.
IEEE Trans Neural Netw Learn Syst; 2018 Jun 09; 29(6):2253-2258. PubMed ID: 29771676