Biomarkers Search

275 articles related to the article with PubMed ID 35741495

1. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
Shi D; Guo X; Liu Y; Fan W
Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495

2. Asynchronous learning for actor-critic neural networks and synchronous triggering for multiplayer system.
Wang K; Mu C
ISA Trans; 2022 Oct; 129(Pt B):295-308. PubMed ID: 35216805

3. Superhuman AI for multiplayer poker.
Brown N; Sandholm T
Science; 2019 Aug; 365(6456):885-890. PubMed ID: 31296650

4. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278

5. Discrete-Time Nonzero-Sum Games for Multiplayer Using Policy-Iteration-Based Adaptive Dynamic Programming Algorithms.
Zhang H; Jiang H; Luo C; Xiao G
IEEE Trans Cybern; 2017 Oct; 47(10):3331-3340. PubMed ID: 28113535

6. Decentralized multi-agent reinforcement learning based on best-response policies.
Gabler V; Wollherr D
Front Robot AI; 2024; 11():1229026. PubMed ID: 38690119

7. Deep Multi-Critic Network for accelerating Policy Learning in multi-agent environments.
Hook J; Silva V; Kondoz A
Neural Netw; 2020 Aug; 128():97-106. PubMed ID: 32446194

8. DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.
Moravčík M; Schmid M; Burch N; Lisý V; Morrill D; Bard N; Davis T; Waugh K; Johanson M; Bowling M
Science; 2017 May; 356(6337):508-513. PubMed ID: 28254783

9. Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning.
Song R; Yang G; Lewis FL
IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):2793-2804. PubMed ID: 35877793

10. A priority experience replay actor-critic algorithm using self-attention mechanism for strategy optimization of discrete problems.
Sun Y; Yang B
PeerJ Comput Sci; 2024; 10():e2161. PubMed ID: 38983226

11. Behavior fusion for deep reinforcement learning.
Shi H; Xu M; Hwang KS; Cai BY
ISA Trans; 2020 Mar; 98():434-444. PubMed ID: 31543262

12. Student of Games: A unified learning algorithm for both perfect and imperfect information games.
Schmid M; Moravčík M; Burch N; Kadlec R; Davidson J; Waugh K; Bard N; Timbers F; Lanctot M; Holland GZ; Davoodi E; Christianson A; Bowling M
Sci Adv; 2023 Nov; 9(46):eadg3256. PubMed ID: 37967182

13. Adaptive Optimal Control for Stochastic Multiplayer Differential Games Using On-Policy and Off-Policy Reinforcement Learning.
Liu M; Wan Y; Lewis FL; Lopez VG
IEEE Trans Neural Netw Learn Syst; 2020 Dec; 31(12):5522-5533. PubMed ID: 32142455

14. Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games.
Song R; Lewis FL; Wei Q
IEEE Trans Neural Netw Learn Syst; 2017 Mar; 28(3):704-713. PubMed ID: 27448374

15. Learning Macromanagement in Starcraft by Deep Reinforcement Learning.
Huang W; Yin Q; Zhang J; Huang K
Sensors (Basel); 2021 May; 21(10):. PubMed ID: 34065012

16. Federated Reinforcement Learning for Training Control Policies on Multiple IoT Devices.
Lim HK; Kim JB; Heo JS; Han YH
Sensors (Basel); 2020 Mar; 20(5):. PubMed ID: 32121671

17. Attention-Shared Multi-Agent Actor-Critic-Based Deep Reinforcement Learning Approach for Mobile Charging Dynamic Scheduling in Wireless Rechargeable Sensor Networks.
Jiang C; Wang Z; Chen S; Li J; Wang H; Xiang J; Xiao W
Entropy (Basel); 2022 Jul; 24(7):. PubMed ID: 35885188

18. Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.
Yang X; Zhang H; Wang Z
IEEE Trans Neural Netw Learn Syst; 2022 Aug; 33(8):3872-3883. PubMed ID: 33587707

19. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571

20. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
Li L; Zhu Y
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961
