Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

127 related articles for article (PubMed ID: 38393836)

1. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning.
Zhang Q; Leng S; Ma X; Liu Q; Wang X; Liang B; Liu Y; Yang J
IEEE Trans Neural Netw Learn Syst; 2024 Feb; PP():. PubMed ID: 38393836
[TBL] [Abstract][Full Text] [Related]

2. Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning.
Bai C; Xiao T; Zhu Z; Wang L; Zhou F; Garg A; He B; Liu P; Wang Z
IEEE Trans Neural Netw Learn Syst; 2024 Jul; 35(7):8954-8968. PubMed ID: 36331649
[TBL] [Abstract][Full Text] [Related]

3. Learn Zero-Constraint-Violation Safe Policy in Model-Free Constrained Reinforcement Learning.
Ma H; Liu C; Li SE; Zheng S; Sun W; Chen J
IEEE Trans Neural Netw Learn Syst; 2024 Jan; PP():. PubMed ID: 38231811
[TBL] [Abstract][Full Text] [Related]

4. Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning.
Zhao R; Chen Z; Fan Y; Li Y; Gao F
Sensors (Basel); 2024 Jun; 24(13):. PubMed ID: 39000919
[TBL] [Abstract][Full Text] [Related]

5. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning.
Zhang L; Peng Y; Yang W; Zhang Z
IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315
[TBL] [Abstract][Full Text] [Related]

6. Optimal Control for Constrained Discrete-Time Nonlinear Systems Based on Safe Reinforcement Learning.
Zhang L; Xie L; Jiang Y; Li Z; Liu X; Su H
IEEE Trans Neural Netw Learn Syst; 2023 Oct; PP():. PubMed ID: 37906491
[TBL] [Abstract][Full Text] [Related]

7. When CVaR Meets With Bluetooth PAN: A Physical Distancing System for COVID-19 Proactive Safety.
Munir MS; Kim DH; Bairagi AK; Hong CS
IEEE Sens J; 2021 Jun; 21(12):13858-13869. PubMed ID: 35790090
[TBL] [Abstract][Full Text] [Related]

8. Safe Reinforcement Learning With Dual Robustness.
Li Z; Hu C; Wang Y; Yang Y; Li SE
IEEE Trans Pattern Anal Mach Intell; 2024 Dec; 46(12):10876-10890. PubMed ID: 39146157
[TBL] [Abstract][Full Text] [Related]

9. Model-Based Chance-Constrained Reinforcement Learning via Separated Proportional-Integral Lagrangian.
Peng B; Duan J; Chen J; Li SE; Xie G; Zhang C; Guan Y; Mu Y; Sun E
IEEE Trans Neural Netw Learn Syst; 2022 May; PP():. PubMed ID: 35635820
[TBL] [Abstract][Full Text] [Related]

10. An Off-Policy Trust Region Policy Optimization Method With Monotonic Improvement Guarantee for Deep Reinforcement Learning.
Meng W; Zheng Q; Shi Y; Pan G
IEEE Trans Neural Netw Learn Syst; 2022 May; 33(5):2223-2235. PubMed ID: 33481718
[TBL] [Abstract][Full Text] [Related]

11. Adaptive Cruise Control Based on Safe Deep Reinforcement Learning.
Zhao R; Wang K; Che W; Li Y; Fan Y; Gao F
Sensors (Basel); 2024 Apr; 24(8):. PubMed ID: 38676274
[TBL] [Abstract][Full Text] [Related]

12. Distributional Policy Gradient With Distributional Value Function.
Liu Q; Li Y; Shi X; Lin K; Liu Y; Lou Y
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38669170
[TBL] [Abstract][Full Text] [Related]

13. Minimum capital requirement and portfolio allocation for non-life insurance: a semiparametric model with Conditional Value-at-Risk (CVaR) constraint.
Staino A; Russo E; Costabile M; Leccadito A
Comput Manag Sci; 2023; 20(1):12. PubMed ID: 37520270
[TBL] [Abstract][Full Text] [Related]

14. Adaptive Safe Reinforcement Learning With Full-State Constraints and Constrained Adaptation for Autonomous Vehicles.
Zhang Y; Liang X; Li D; Ge SS; Gao B; Chen H; Lee TH
IEEE Trans Cybern; 2024 Mar; 54(3):1907-1920. PubMed ID: 37363853
[TBL] [Abstract][Full Text] [Related]

15. Adaptive pessimism via target Q-value for offline reinforcement learning.
Liu J; Zhang Y; Li C; Yang Y; Liu Y; Ouyang W
Neural Netw; 2024 Dec; 180():106588. PubMed ID: 39180907
[TBL] [Abstract][Full Text] [Related]

16. On Robustness of Individualized Decision Rules.
Qi Z; Pang JS; Liu Y
J Am Stat Assoc; 2023; 118(543):2143-2157. PubMed ID: 38143785
[TBL] [Abstract][Full Text] [Related]

17. Multitrend Conditional Value at Risk for Portfolio Optimization.
Lai ZR; Li C; Wu X; Guan Q; Fang L
IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):1545-1558. PubMed ID: 35737603
[TBL] [Abstract][Full Text] [Related]

18. Safe Reinforcement Learning With Stability Guarantee for Motion Planning of Autonomous Vehicles.
Zhang L; Zhang R; Wu T; Weng R; Han M; Zhao Y
IEEE Trans Neural Netw Learn Syst; 2021 Dec; 32(12):5435-5444. PubMed ID: 34242172
[TBL] [Abstract][Full Text] [Related]

19. Quantile Markov Decision Processes.
Li X; Zhong H; Brandeau ML
Oper Res; 2022; 70(3):1428-1447. PubMed ID: 36034163
[TBL] [Abstract][Full Text] [Related]

20. A UoI-Optimal Policy for Timely Status Updates with Resource Constraint.
Wang L; Sun J; Sun Y; Zhou S; Niu Z
Entropy (Basel); 2021 Aug; 23(8):. PubMed ID: 34441224
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]