

147 related articles for article (PubMed ID: 30356836)

  • 1. Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play.
    Abdelfattah S; Kasmarik K; Hu J
    Front Neurorobot; 2018; 12():65. PubMed ID: 30356836

  • 2. Robust Multiobjective Reinforcement Learning Considering Environmental Uncertainties.
    He X; Hao J; Chen X; Wang J; Ji X; Lv C
    IEEE Trans Neural Netw Learn Syst; 2024 May; PP():. PubMed ID: 38781066

  • 3. MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning.
    Hu T; Luo B; Yang C; Huang T
    IEEE Trans Pattern Anal Mach Intell; 2023 Oct; 45(10):12098-12112. PubMed ID: 37285257

  • 4. An Improved Approach towards Multi-Agent Pursuit-Evasion Game Decision-Making Using Deep Reinforcement Learning.
    Wan K; Wu D; Zhai Y; Li B; Gao X; Hu Z
    Entropy (Basel); 2021 Oct; 23(11):. PubMed ID: 34828131

  • 5. Multi-Objective Markov Decision Processes for Data-Driven Decision Support.
    Lizotte DJ; Laber EB
    J Mach Learn Res; 2016; 17():. PubMed ID: 28018133

  • 6. Adversarial Decision-Making for Moving Target Defense: A Multi-Agent Markov Game and Reinforcement Learning Approach.
    Yao Q; Wang Y; Xiong X; Wang P; Li Y
    Entropy (Basel); 2023 Apr; 25(4):. PubMed ID: 37190393

  • 7. A robust bi-objective multi-trip periodic capacitated arc routing problem for urban waste collection using a multi-objective invasive weed optimization.
    Babaee Tirkolaee E; Goli A; Pahlevan M; Malekalipour Kordestanizadeh R
    Waste Manag Res; 2019 Nov; 37(11):1089-1101. PubMed ID: 31416408

  • 8. Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems.
    Delaney J; Dowey S; Cheng CT
    Sensors (Basel); 2023 May; 23(10):. PubMed ID: 37430736

  • 9. Optimal Robust Output Containment of Unknown Heterogeneous Multiagent System Using Off-Policy Reinforcement Learning.
    Zuo S; Song Y; Lewis FL; Davoudi A
    IEEE Trans Cybern; 2018 Nov; 48(11):3197-3207. PubMed ID: 29989978

  • 10. Reinforcement Learning-Aided Channel Estimator in Time-Varying MIMO Systems.
    Kim TK; Min M
    Sensors (Basel); 2023 Jun; 23(12):. PubMed ID: 37420854

  • 11. Semi-Infinitely Constrained Markov Decision Processes and Provably Efficient Reinforcement Learning.
    Zhang L; Peng Y; Yang W; Zhang Z
    IEEE Trans Pattern Anal Mach Intell; 2024 May; 46(5):3722-3735. PubMed ID: 38163315

  • 12. Solving multi-objective optimization problems in conservation with the reference point method.
    Dujardin Y; Chadès I
    PLoS One; 2018; 13(1):e0190748. PubMed ID: 29293650

  • 13. Composition of web services using Markov decision processes and dynamic programming.
    Uc-Cetina V; Moo-Mena F; Hernandez-Ucan R
    ScientificWorldJournal; 2015; 2015():545308. PubMed ID: 25874247

  • 14. Guided Policy Exploration for Markov Decision Processes Using an Uncertainty-Based Value-of-Information Criterion.
    Sledge IJ; Emigh MS; Principe JC
    IEEE Trans Neural Netw Learn Syst; 2018 Jun; 29(6):2080-2098. PubMed ID: 29771664

  • 15. Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots.
    Ngo H; Luciw M; Förster A; Schmidhuber J
    Front Psychol; 2013; 4():833. PubMed ID: 24324448

  • 16. Benchmarking for Bayesian Reinforcement Learning.
    Castronovo M; Ernst D; Couëtoux A; Fonteneau R
    PLoS One; 2016; 11(6):e0157088. PubMed ID: 27304891

  • 17. Reinforcement Learning-Based Multihop Relaying: A Decentralized Q-Learning Approach.
    Wang X; Wang X
    Entropy (Basel); 2021 Oct; 23(10):. PubMed ID: 34682034

  • 18. The Convergence of a Cooperation Markov Decision Process System.
    Mo X; Xu D; Fu Z
    Entropy (Basel); 2020 Aug; 22(9):. PubMed ID: 33286724

  • 19. Learning Dynamics and Control of a Stochastic System under Limited Sensing Capabilities.
    Zadenoori MA; Vicario E
    Sensors (Basel); 2022 Jun; 22(12):. PubMed ID: 35746272

  • 20. An Off-Policy Reinforcement Learning-Based Adaptive Optimization Method for Dynamic Resource Allocation Problem.
    He B; Meng Y; Tang L
    IEEE Trans Neural Netw Learn Syst; 2023 Dec; PP():. PubMed ID: 38090867
