Biomarkers Search

Molecular Biopsy of Human Tumors: a resource for Precision Medicine

159 articles related to PubMed ID 30408692

1. Implicit incremental natural actor critic algorithm.
Iwaki R; Asada M
Neural Netw; 2019 Jan; 109():103-112. PubMed ID: 30408692

2. Actor-Critic Learning Control With Regularization and Feature Selection in Policy Gradient Estimation.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1217-1227. PubMed ID: 32324571

3. Efficient model learning methods for actor-critic control.
Grondman I; Vaandrager M; Buşoniu L; Babuska R; Schuitema E
IEEE Trans Syst Man Cybern B Cybern; 2012 Jun; 42(3):591-602. PubMed ID: 22156998

4. Meta attention for Off-Policy Actor-Critic.
Huang J; Huang W; Lan L; Wu D
Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278

5. Reinforcement learning in continuous time and space.
Doya K
Neural Comput; 2000 Jan; 12(1):219-45. PubMed ID: 10636940

6. Optimal Policy of Multiplayer Poker via Actor-Critic Reinforcement Learning.
Shi D; Guo X; Liu Y; Fan W
Entropy (Basel); 2022 May; 24(6):. PubMed ID: 35741495

7. Continuous-time adaptive critics.
Hanselmann T; Noakes L; Zaknich A
IEEE Trans Neural Netw; 2007 May; 18(3):631-47. PubMed ID: 17526332

8. Actor-Critic Learning Control Based on ℓ2-Regularized Temporal-Difference Prediction With Gradient Correction.
Li L; Li D; Song T; Xu X
IEEE Trans Neural Netw Learn Syst; 2018 Dec; 29(12):5899-5909. PubMed ID: 29993664

9. Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks.
Modares H; Lewis FL; Naghibi-Sistani MB
IEEE Trans Neural Netw Learn Syst; 2013 Oct; 24(10):1513-25. PubMed ID: 24808590

10. Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms.
Zhang H; Xu J; Zhang J; Liu Q
Comput Intell Neurosci; 2022; 2022():1117781. PubMed ID: 36438689

11. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J; Kurt MN; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721

12. A3C-GS: Adaptive Moment Gradient Sharing With Locks for Asynchronous Actor-Critic Agents.
Labao AB; Martija MAM; Naval PC
IEEE Trans Neural Netw Learn Syst; 2021 Mar; 32(3):1162-1176. PubMed ID: 32287019

13. Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning.
Zhong S; Liu Q; Fu Q
Comput Intell Neurosci; 2016; 2016():4824072. PubMed ID: 27795704

14. Boosting On-Policy Actor-Critic With Shallow Updates in Critic.
Li L; Zhu Y
IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38619961

15. Robust Actor-Critic With Relative Entropy Regulating Actor.
Cheng Y; Huang L; Chen CLP; Wang X
IEEE Trans Neural Netw Learn Syst; 2023 Nov; 34(11):9054-9063. PubMed ID: 35286268

16. A policy iteration approach to online optimal control of continuous-time constrained-input systems.
Modares H; Naghibi Sistani MB; Lewis FL
ISA Trans; 2013 Sep; 52(5):611-21. PubMed ID: 23706414

17. Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control.
Wu HN; Luo B
IEEE Trans Neural Netw Learn Syst; 2012 Dec; 23(12):1884-95. PubMed ID: 24808144

18. Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms.
Chen Y; Zhang F; Liu Z
Neural Netw; 2024 Jan; 169():764-777. PubMed ID: 37981458

19. Impedance learning for robotic contact tasks using natural actor-critic algorithm.
Kim B; Park J; Park S; Kang S
IEEE Trans Syst Man Cybern B Cybern; 2010 Apr; 40(2):433-43. PubMed ID: 19696001

20. A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives.
Li C; Lowe R; Ziemke T
Front Neurorobot; 2014; 8():23. PubMed ID: 25324773
