These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
2. CVaR-Constrained Policy Optimization for Safe Reinforcement Learning. Zhang Q; Leng S; Ma X; Liu Q; Wang X; Liang B; Liu Y; Yang J IEEE Trans Neural Netw Learn Syst; 2024 Feb; PP():. PubMed ID: 38393836 [TBL] [Abstract][Full Text] [Related]
3. Markov decision processes: a tool for sequential decision making under uncertainty. Alagoz O; Hsu H; Schaefer AJ; Roberts MS Med Decis Making; 2010; 30(4):474-83. PubMed ID: 20044582 [TBL] [Abstract][Full Text] [Related]
4. BATCH POLICY LEARNING IN AVERAGE REWARD MARKOV DECISION PROCESSES. Liao P; Qi Z; Wan R; Klasnja P; Murphy SA Ann Stat; 2022 Dec; 50(6):3364-3387. PubMed ID: 37022318 [TBL] [Abstract][Full Text] [Related]
5. An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions. Ma Y; Zhao T; Hatano K; Sugiyama M Neural Comput; 2016 Mar; 28(3):563-93. PubMed ID: 26735742 [TBL] [Abstract][Full Text] [Related]
6. Learning to maximize reward rate: a model based on semi-Markov decision processes. Khodadadi A; Fakhari P; Busemeyer JR Front Neurosci; 2014; 8():101. PubMed ID: 24904252 [TBL] [Abstract][Full Text] [Related]
7. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning. Zhang L; Peng Y; Yang W; Zhang Z IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315 [TBL] [Abstract][Full Text] [Related]
8. An Algorithm of Nonparametric Quantile Regression. Huang ML; Han Y; Marshall W J Stat Theory Pract; 2023; 17(2):32. PubMed ID: 37013135 [TBL] [Abstract][Full Text] [Related]
9. Dynamic programming for solving a simulated clinical scenario of sepsis resuscitation. Zhang Z; Zhang X; Gu S; Xu X; Jiang W; Lv C; Zheng S Ann Palliat Med; 2021 Apr; 10(4):3715-3725. PubMed ID: 33691453 [TBL] [Abstract][Full Text] [Related]
10. A Gradient-Aware Search Algorithm for Constrained Markov Decision Processes. Khairy S; Balaprakash P; Cai LX IEEE Trans Neural Netw Learn Syst; 2023 Sep; PP():. PubMed ID: 37773894 [TBL] [Abstract][Full Text] [Related]
11. A simulation-based neighbourhood search algorithm to schedule multi-category patients at a multi-facility health care diagnostic centre. Jain V; Mohan U Health Syst (Basingstoke); 2018; 7(3):212-229. PubMed ID: 31214349 [TBL] [Abstract][Full Text] [Related]
12. Simulation-based approximate policy iteration for dynamic patient scheduling for radiation therapy. Gocgun Y Health Care Manag Sci; 2018 Sep; 21(3):317-325. PubMed ID: 27766509 [TBL] [Abstract][Full Text] [Related]
13. Experimental optimization of a real time fed-batch fermentation process using Markov decision process. Saucedo VM; Karim MN Biotechnol Bioeng; 1997 Jul; 55(2):317-27. PubMed ID: 18636490 [TBL] [Abstract][Full Text] [Related]
14. The Effect of Budgetary Restrictions on Breast Cancer Diagnostic Decisions. Ayvaci MU; Alagoz O; Burnside ES Manuf Serv Oper Manag; 2012 Apr; 14(4):600-617. PubMed ID: 24027436 [TBL] [Abstract][Full Text] [Related]
15. Application of Constrained Optimization Methods in Health Services Research: Report 2 of the ISPOR Optimization Methods Emerging Good Practices Task Force. Crown W; Buyukkaramikli N; Sir MY; Thokala P; Morton A; Marshall DA; Tosh JC; Ijzerman MJ; Padula WV; Pasupathy KS Value Health; 2018 Sep; 21(9):1019-1028. PubMed ID: 30224103 [TBL] [Abstract][Full Text] [Related]
16. Optimizing patient treatment decisions in an era of rapid technological advances: the case of hepatitis C treatment. Liu S; Brandeau ML; Goldhaber-Fiebert JD Health Care Manag Sci; 2017 Mar; 20(1):16-32. PubMed ID: 26188961 [TBL] [Abstract][Full Text] [Related]
17. Parameterized MDPs and Reinforcement Learning Problems-A Maximum Entropy Principle-Based Framework. Srivastava A; Salapaka SM IEEE Trans Cybern; 2022 Sep; 52(9):9339-9351. PubMed ID: 34406959 [TBL] [Abstract][Full Text] [Related]
18. A markov decision process model for the optimal dispatch of military medical evacuation assets. Keneally SK; Robbins MJ; Lunday BJ Health Care Manag Sci; 2016 Jun; 19(2):111-29. PubMed ID: 25223847 [TBL] [Abstract][Full Text] [Related]
19. Markov models for clinical decision-making in radiation oncology: A systematic review. McCullum LB; Karagoz A; Dede C; Garcia R; Nosrat F; Hemmati M; Hosseinian S; Schaefer AJ; Fuller CD; ; J Med Imaging Radiat Oncol; 2024 Aug; 68(5):610-623. PubMed ID: 38766899 [TBL] [Abstract][Full Text] [Related]
20. Approximate robust policy iteration using multilayer perceptron neural networks for discounted infinite-horizon Markov decision processes with uncertain correlated transition matrices. Li B; Si J IEEE Trans Neural Netw; 2010 Aug; 21(8):1270-80. PubMed ID: 20601311 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]