These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

275 related articles for article (PubMed ID: 35741495)

  • 1. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
    Shi D; Guo X; Liu Y; Fan W
    Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Asynchronous learning for actor-critic neural networks and synchronous triggering for multiplayer system.
    Wang K; Mu C
    ISA Trans; 2022 Oct; 129(Pt B):295-308. PubMed ID: 35216805
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Superhuman AI for multiplayer poker.
    Brown N; Sandholm T
    Science; 2019 Aug; 365(6456):885-890. PubMed ID: 31296650
    [TBL] [Abstract][Full Text] [Related]  

  • 4. Meta attention for Off-Policy Actor-Critic.
    Huang J; Huang W; Lan L; Wu D
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms.
    Zhang H; Jiang H; Luo C; Xiao G
    IEEE Trans Cybern; 2017 Oct; 47(10):3331-3340. PubMed ID: 28113535
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Decentralized multi-agent reinforcement learning based on best-response policies.
    Gabler V; Wollherr D
    Front Robot AI; 2024; 11():1229026. PubMed ID: 38690119
    [No Abstract]   [Full Text] [Related]  

  • 7. Deep Multi-Critic Network for accelerating Policy Learning in multi-agent environments.
    Hook J; Silva V; Kondoz A
    Neural Netw; 2020 Aug; 128():97-106. PubMed ID: 32446194
    [TBL] [Abstract][Full Text] [Related]  

  • 8. DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.
    Moravčík M; Schmid M; Burch N; Lisý V; Morrill D; Bard N; Davis T; Waugh K; Johanson M; Bowling M
    Science; 2017 May; 356(6337):508-513. PubMed ID: 28254783
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning.
    Song R; Yang G; Lewis FL
    IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):2793-2804. PubMed ID: 35877793
    [TBL] [Abstract][Full Text] [Related]  

  • 10. A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems.
    Sun Y; Yang B
    PeerJ Comput Sci; 2024; 10():e2161. PubMed ID: 38983226
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Behavior fusion for deep reinforcement learning.
    Shi H; Xu M; Hwang KS; Cai BY
    ISA Trans; 2020 Mar; 98():434-444. PubMed ID: 31543262
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Student of Games: A unified learning algorithm for both perfect and imperfect information games.
    Schmid M; Moravčík M; Burch N; Kadlec R; Davidson J; Waugh K; Bard N; Timbers F; Lanctot M; Holland GZ; Davoodi E; Christianson A; Bowling M
    Sci Adv; 2023 Nov; 9(46):eadg3256. PubMed ID: 37967182
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Adaptive Optimal Control for Stochastic Multiplayer Differential Games Using On-Policy and Off-Policy Reinforcement Learning.
    Liu M; Wan Y; Lewis FL; Lopez VG
    IEEE Trans Neural Netw Learn Syst; 2020 Dec; 31(12):5522-5533. PubMed ID: 32142455
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games.
    Song R; Lewis FL; Wei Q
    IEEE Trans Neural Netw Learn Syst; 2017 Mar; 28(3):704-713. PubMed ID: 27448374
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Learning Macromanagement in Starcraft by Deep Reinforcement Learning.
    Huang W; Yin Q; Zhang J; Huang K
    Sensors (Basel); 2021 May; 21(10):. PubMed ID: 34065012
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Federated Reinforcement Learning for Training Control Policies on Multiple IoT Devices.
    Lim HK; Kim JB; Heo JS; Han YH
    Sensors (Basel); 2020 Mar; 20(5):. PubMed ID: 32121671
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Attention-Shared Multi-Agent Actor-Critic-Based Deep Reinforcement Learning Approach for Mobile Charging Dynamic Scheduling in Wireless Rechargeable Sensor Networks.
    Jiang C; Wang Z; Chen S; Li J; Wang H; Xiang J; Xiao W
    Entropy (Basel); 2022 Jul; 24(7):. PubMed ID: 35885188
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.
    Yang X; Zhang H; Wang Z
    IEEE Trans Neural Netw Learn Syst; 2022 Aug; 33(8):3872-3883. PubMed ID: 33587707
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
    Li L; Li D; Song T; Xu X
    IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571
    [TBL] [Abstract][Full Text] [Related]  

  • 20. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
    Li L; Zhu Y
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 14.