115 related articles for article (PubMed ID: 34055465)
1. The Unreasonable Effectiveness of Inverse Reinforcement Learning in Advancing Cancer Research.
Kalantari J; Nelson H; Chia N
Proc AAAI Conf Artif Intell; 2020 Apr; 34(1):437-445. PubMed ID: 34055465
[TBL] [Abstract][Full Text] [Related]
2. Bridging the Gap Between Imitation Learning and Inverse Reinforcement Learning.
Piot B; Geist M; Pietquin O
IEEE Trans Neural Netw Learn Syst; 2017 Aug; 28(8):1814-1826. PubMed ID: 27164607
[TBL] [Abstract][Full Text] [Related]
3. Inverse reinforcement learning for intelligent mechanical ventilation and sedative dosing in intensive care units.
Yu C; Liu J; Zhao H
BMC Med Inform Decis Mak; 2019 Apr; 19(Suppl 2):57. PubMed ID: 30961594
[TBL] [Abstract][Full Text] [Related]
4. Hierarchical Bayesian inverse reinforcement learning.
Choi J; Kim KE
IEEE Trans Cybern; 2015 Apr; 45(4):793-805. PubMed ID: 25291805
[TBL] [Abstract][Full Text] [Related]
5. Energy-efficient and damage-recovery slithering gait design for a snake-like robot based on reinforcement learning and inverse reinforcement learning.
Bing Z; Lemke C; Cheng L; Huang K; Knoll A
Neural Netw; 2020 Sep; 129():323-333. PubMed ID: 32593929
[TBL] [Abstract][Full Text] [Related]
6. Generating Reward Functions Using IRL Towards Individualized Cancer Screening.
Petousis P; Han SX; Hsu W; Bui AAT
Artif Intell Health (2018); 2019; 11326():213-227. PubMed ID: 31363717
[TBL] [Abstract][Full Text] [Related]
7. Efficient Reinforcement Learning from Demonstration via Bayesian Network-Based Knowledge Extraction.
Zhang Y; Lan Y; Fang Q; Xu X; Li J; Zeng Y
Comput Intell Neurosci; 2021; 2021():7588221. PubMed ID: 34603434
[TBL] [Abstract][Full Text] [Related]
8. Modeling pedestrian behavior in pedestrian-vehicle near misses: A continuous Gaussian Process Inverse Reinforcement Learning (GP-IRL) approach.
Nasernejad P; Sayed T; Alsaleh R
Accid Anal Prev; 2021 Oct; 161():106355. PubMed ID: 34461394
[TBL] [Abstract][Full Text] [Related]
9. Reinforcement Learning-Based Multi-AUV Adaptive Trajectory Planning for Under-Ice Field Estimation.
Wang C; Wei L; Wang Z; Song M; Mahmoudian N
Sensors (Basel); 2018 Nov; 18(11):. PubMed ID: 30424017
[TBL] [Abstract][Full Text] [Related]
10. Scalable Inverse Reinforcement Learning Through Multifidelity Bayesian Optimization.
Imani M; Ghoreishi SF
IEEE Trans Neural Netw Learn Syst; 2022 Aug; 33(8):4125-4132. PubMed ID: 33481721
[TBL] [Abstract][Full Text] [Related]
11. Benchmarking for Bayesian Reinforcement Learning.
Castronovo M; Ernst D; Couëtoux A; Fonteneau R
PLoS One; 2016; 11(6):e0157088. PubMed ID: 27304891
[TBL] [Abstract][Full Text] [Related]
12. On Practical Robust Reinforcement Learning: Adjacent Uncertainty Set and Double-Agent Algorithm.
Hwang U; Hong S
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619960
[TBL] [Abstract][Full Text] [Related]
13. Informative Trajectory Planning Using Reinforcement Learning for Minimum-Time Exploration of Spatiotemporal Fields.
Li Z; You K; Sun J; Wang G
IEEE Trans Neural Netw Learn Syst; 2023 Aug; PP():. PubMed ID: 37581975
[TBL] [Abstract][Full Text] [Related]
14. A unified analysis of value-function-based reinforcement- learning algorithms.
Szepesvári C; Littman ML
Neural Comput; 1999 Nov; 11(8):2017-59. PubMed ID: 10578043
[TBL] [Abstract][Full Text] [Related]
15. Kernel-based least squares policy iteration for reinforcement learning.
Xu X; Hu D; Lu X
IEEE Trans Neural Netw; 2007 Jul; 18(4):973-92. PubMed ID: 17668655
[TBL] [Abstract][Full Text] [Related]
16. The future of Cochrane Neonatal.
Soll RF; Ovelman C; McGuire W
Early Hum Dev; 2020 Nov; 150():105191. PubMed ID: 33036834
[TBL] [Abstract][Full Text] [Related]
17. First-Person Activity Forecasting from Video with Online Inverse Reinforcement Learning.
Rhinehart N; Kitani KM
IEEE Trans Pattern Anal Mach Intell; 2020 Feb; 42(2):304-317. PubMed ID: 30295615
[TBL] [Abstract][Full Text] [Related]
18. Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems.
Xu Z; Shen T; Cheng D
IEEE Trans Neural Netw Learn Syst; 2022 Apr; 33(4):1520-1534. PubMed ID: 33347416
[TBL] [Abstract][Full Text] [Related]
19. Medical Image Segmentation Algorithm for Three-Dimensional Multimodal Using Deep Reinforcement Learning and Big Data Analytics.
Gao W; Li X; Wang Y; Cai Y
Front Public Health; 2022; 10():879639. PubMed ID: 35462800
[TBL] [Abstract][Full Text] [Related]
20. A real-world application of Markov chain Monte Carlo method for Bayesian trajectory control of a robotic manipulator.
Tavakol Aghaei V; Ağababaoğlu A; Yıldırım S; Onat A
ISA Trans; 2022 Jun; 125():580-590. PubMed ID: 34148651
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]