BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine

126 related articles for article (PubMed ID: 35853061)

21. Optimal Synchronization Control of Multiagent Systems With Input Saturation via Off-Policy Reinforcement Learning.
Qin J; Li M; Shi Y; Ma Q; Zheng WX
IEEE Trans Neural Netw Learn Syst; 2019 Jan; 30(1):85-96. PubMed ID: 29993726

22. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278

23. Integral Reinforcement-Learning-Based Optimal Containment Control for Partially Unknown Nonlinear Multiagent Systems.
Wu Q; Wu Y; Wang Y
Entropy (Basel); 2023 Jan; 25(2):. PubMed ID: 36832588

24. Distributed Actor-Critic Algorithms for Multiagent Reinforcement Learning Over Directed Graphs.
Dai P; Yu W; Wang H; Baldi S
IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):7210-7221. PubMed ID: 35015654

25. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
Shi D; Guo X; Liu Y; Fan W
Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495

26. Actor-critic learning based coordinated control for a dual-arm robot with prescribed performance and unknown backlash-like hysteresis.
Ouyang Y; Sun C; Dong L
ISA Trans; 2022 Jul; 126():1-13. PubMed ID: 34446282

27. Reinforcement learning for a biped robot based on a CPG-actor-critic method.
Nakamura Y; Mori T; Sato MA; Ishii S
Neural Netw; 2007 Aug; 20(6):723-35. PubMed ID: 17412559

28. Near-Optimal Controller for Nonlinear Continuous-Time Systems With Unknown Dynamics Using Policy Iteration.
Dutta S; Patchaikani PK; Behera L
IEEE Trans Neural Netw Learn Syst; 2016 Jul; 27(7):1537-49. PubMed ID: 26259150

29. Data-Based Optimal Synchronization of Heterogeneous Multiagent Systems in Graphical Games via Reinforcement Learning.
Xiong C; Ma Q; Guo J; Lewis FL
IEEE Trans Neural Netw Learn Syst; 2023 Jul; PP():. PubMed ID: 37463077

30. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
Modares H; Naghibi Sistani MB; Lewis FL
ISA Trans; 2013 Sep; 52(5):611-21. PubMed ID: 23706414

31. A Tandem Robotic Arm Inverse Kinematic Solution Based on an Improved Particle Swarm Algorithm.
Zhao G; Jiang D; Liu X; Tong X; Sun Y; Tao B; Kong J; Yun J; Liu Y; Fang Z
Front Bioeng Biotechnol; 2022; 10():832829. PubMed ID: 35662837

32. Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning.
Song R; Yang G; Lewis FL
IEEE Trans Neural Netw Learn Syst; 2024 Feb; 35(2):2793-2804. PubMed ID: 35877793

33. Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems.
Xu Z; Shen T; Cheng D
IEEE Trans Neural Netw Learn Syst; 2022 Apr; 33(4):1520-1534. PubMed ID: 33347416

34. Observer-Based Adaptive Synchronization Control of Unknown Discrete-Time Nonlinear Heterogeneous Systems.
Fu H; Chen X; Wang W; Wu M
IEEE Trans Neural Netw Learn Syst; 2022 Feb; 33(2):681-693. PubMed ID: 33079683

35. Off-Policy Interleaved Q-Learning: Optimal Control for Affine Nonlinear Discrete-Time Systems.
Li J; Chai T; Lewis FL; Ding Z; Jiang Y
IEEE Trans Neural Netw Learn Syst; 2019 May; 30(5):1308-1320. PubMed ID: 30273155

36. Reinforcement Learning With Vision-Proprioception Model for Robot Planar Pushing.
Cong L; Liang H; Ruppel P; Shi Y; Görner M; Hendrich N; Zhang J
Front Neurorobot; 2022; 16():829437. PubMed ID: 35308311

37. A Parallel Framework of Adaptive Dynamic Programming Algorithm With Off-Policy Learning.
Sun C; Li X; Sun Y
IEEE Trans Neural Netw Learn Syst; 2021 Aug; 32(8):3578-3587. PubMed ID: 32833647

38. Finite-Horizon Optimal Consensus Control for Unknown Multiagent State-Delay Systems.
Zhang H; Park JH; Yue D; Xie X
IEEE Trans Cybern; 2020 Feb; 50(2):402-413. PubMed ID: 30207970

39. A Distributed Anti-Jamming Algorithm Based on Actor-Critic Countering Intelligent Malicious Jamming for WSN.
Chen Y; Niu Y; Chen C; Zhou Q; Xiang P
Sensors (Basel); 2022 Oct; 22(21):. PubMed ID: 36365857

40. A Local-and-Global Attention Reinforcement Learning Algorithm for Multiagent Cooperative Navigation.
Song C; He Z; Dong L
IEEE Trans Neural Netw Learn Syst; 2024 Jun; 35(6):7767-7777. PubMed ID: 36383584
