127 related articles for article (PubMed ID: 38393836)
1. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning. Zhang Q; Leng S; Ma X; Liu Q; Wang X; Liang B; Liu Y; Yang J. IEEE Trans Neural Netw Learn Syst; 2024 Feb; PP():. PubMed ID: 38393836
2. Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning. Bai C; Xiao T; Zhu Z; Wang L; Zhou F; Garg A; He B; Liu P; Wang Z. IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8954-8968. PubMed ID: 36331649
3. Learn Zero-Constraint-Violation Safe Policy in Model-Free Constrained Reinforcement Learning. Ma H; Liu C; Li SE; Zheng S; Sun W; Chen J. IEEE Trans Neural Netw Learn Syst; 2024 Jan; PP():. PubMed ID: 38231811
4. Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning. Zhao R; Chen Z; Fan Y; Li Y; Gao F. Sensors (Basel); 2024 Jun; 24(13):. PubMed ID: 39000919
5. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning. Zhang L; Peng Y; Yang W; Zhang Z. IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315
6. Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning. Zhang L; Xie L; Jiang Y; Li Z; Liu X; Su H. IEEE Trans Neural Netw Learn Syst; 2023 Oct; PP():. PubMed ID: 37906491
7. When CVaR Meets With Bluetooth PAN: A Physical Distancing System for COVID-19 Proactive Safety. Munir MS; Kim DH; Bairagi AK; Hong CS. IEEE Sens J; 2021 Jun; 21(12):13858-13869. PubMed ID: 35790090
8. Safe Reinforcement Learning With Dual Robustness. Li Z; Hu C; Wang Y; Yang Y; Li SE. IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):10876-10890. PubMed ID: 39146157
9. Model-Based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian. Peng B; Duan J; Chen J; Li SE; Xie G; Zhang C; Guan Y; Mu Y; Sun E. IEEE Trans Neural Netw Learn Syst; 2022 May; PP():. PubMed ID: 35635820
10. An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning. Meng W; Zheng Q; Shi Y; Pan G. IEEE Trans Neural Netw Learn Syst; 2022 May; 33(5):2223-2235. PubMed ID: 33481718
11. Adaptive Cruise Control Based on Safe Deep Reinforcement Learning. Zhao R; Wang K; Che W; Li Y; Fan Y; Gao F. Sensors (Basel); 2024 Apr; 24(8):. PubMed ID: 38676274
12. Distributional Policy Gradient With Distributional Value Function. Liu Q; Li Y; Shi X; Lin K; Liu Y; Lou Y. IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38669170
13. Minimum capital requirement and portfolio allocation for non-life insurance: a semiparametric model with Conditional Value-at-Risk (CVaR) constraint. Staino A; Russo E; Costabile M; Leccadito A. Comput Manag Sci; 2023; 20(1):12. PubMed ID: 37520270
14. Adaptive Safe Reinforcement Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles. Zhang Y; Liang X; Li D; Ge SS; Gao B; Chen H; Lee TH. IEEE Trans Cybern; 2024 Mar; 54(3):1907-1920. PubMed ID: 37363853
15. Adaptive pessimism via target Q-value for offline reinforcement learning. Liu J; Zhang Y; Li C; Yang Y; Liu Y; Ouyang W. Neural Netw; 2024 Dec; 180():106588. PubMed ID: 39180907
16. On Robustness of Individualized Decision Rules. Qi Z; Pang JS; Liu Y. J Am Stat Assoc; 2023; 118(543):2143-2157. PubMed ID: 38143785
17. Multitrend Conditional Value at Risk for Portfolio Optimization. Lai ZR; Li C; Wu X; Guan Q; Fang L. IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):1545-1558. PubMed ID: 35737603
18. Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles. Zhang L; Zhang R; Wu T; Weng R; Han M; Zhao Y. IEEE Trans Neural Netw Learn Syst; 2021 Dec; 32(12):5435-5444. PubMed ID: 34242172
20. A UoI-Optimal Policy for Timely Status Updates with Resource Constraint. Wang L; Sun J; Sun Y; Zhou S; Niu Z. Entropy (Basel); 2021 Aug; 23(8):. PubMed ID: 34441224