Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

182 related articles for article (PubMed ID: 34394336)

1. Diversity Evolutionary Policy Deep Reinforcement Learning.
Liu J; Feng L
Comput Intell Neurosci; 2021; 2021():5300189. PubMed ID: 34394336
[TBL] [Abstract][Full Text] [Related]

2. An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.
Guo D; Tang L; Zhang X; Liang YC
Neural Netw; 2024 Feb; 170():610-621. PubMed ID: 38056408
[TBL] [Abstract][Full Text] [Related]

3. Dual Parallel Policy Iteration With Coupled Policy Improvement.
Cheng Y; Huang L; Chen CLP; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 Mar; 35(3):4286-4298. PubMed ID: 36094996
[TBL] [Abstract][Full Text] [Related]

4. Stochastic Integrated Actor-Critic for Deep Reinforcement Learning.
Zheng J; Kurt MN; Wang X
IEEE Trans Neural Netw Learn Syst; 2024 May; 35(5):6654-6666. PubMed ID: 36256721
[TBL] [Abstract][Full Text] [Related]

5. Reinforcement learning based temperature control of a fermentation bioreactor for ethanol production.
Rajasekhar N; Radhakrishnan TK; Mohamed SN
Biotechnol Bioeng; 2024 Oct; 121(10):3114-3127. PubMed ID: 38938008
[TBL] [Abstract][Full Text] [Related]

6. Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms.
Zhang H; Xu J; Zhang J; Liu Q
Comput Intell Neurosci; 2022; 2022():1117781. PubMed ID: 36438689
[TBL] [Abstract][Full Text] [Related]

7. Approximate Policy-Based Accelerated Deep Reinforcement Learning.
Wang X; Gu Y; Cheng Y; Liu A; Chen CLP
IEEE Trans Neural Netw Learn Syst; 2020 Jun; 31(6):1820-1830. PubMed ID: 31398131
[TBL] [Abstract][Full Text] [Related]

8. Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment.
Chen W; Wong KKL; Long S; Sun Z
Entropy (Basel); 2022 Mar; 24(4):. PubMed ID: 35455103
[TBL] [Abstract][Full Text] [Related]

9. Adaptive control for circulating cooling water system using deep reinforcement learning.
Xu J; Li H; Zhang Q
PLoS One; 2024; 19(7):e0307767. PubMed ID: 39047030
[TBL] [Abstract][Full Text] [Related]

10. Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces.
Shi J; Li K; Piao C; Gao J; Chen L
Sensors (Basel); 2023 Aug; 23(16):. PubMed ID: 37631658
[TBL] [Abstract][Full Text] [Related]

11. Distributional generative adversarial imitation learning with reproducing kernel generalization.
Zhou Y; Lu M; Liu X; Che Z; Xu Z; Tang J; Zhang Y; Peng Y; Peng Y
Neural Netw; 2023 Aug; 165():43-59. PubMed ID: 37276810
[TBL] [Abstract][Full Text] [Related]

12. Learning Intention-Aware Policies in Deep Reinforcement Learning.
Zhao T; Wu S; Li G; Chen Y; Niu G; Sugiyama M
Neural Comput; 2023 Sep; 35(10):1657-1677. PubMed ID: 37523456
[TBL] [Abstract][Full Text] [Related]

13. Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors.
Duan J; Guan Y; Li SE; Ren Y; Sun Q; Cheng B
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6584-6598. PubMed ID: 34101599
[TBL] [Abstract][Full Text] [Related]

14. Human-in-the-Loop Reinforcement Learning in Continuous-Action Space.
Luo B; Wu Z; Zhou F; Wang BC
IEEE Trans Neural Netw Learn Syst; 2024 Nov; 35(11):15735-15744. PubMed ID: 37418406
[TBL] [Abstract][Full Text] [Related]

15. Path planning of mobile robot based on improved TD3 algorithm in dynamic environment.
Li P; Chen D; Wang Y; Zhang L; Zhao S
Heliyon; 2024 Jun; 10(11):e32167. PubMed ID: 38912483
[TBL] [Abstract][Full Text] [Related]

16. Systematic Performance Evaluation of Reinforcement Learning Algorithms Applied to Wastewater Treatment Control Optimization.
Croll HC; Ikuma K; Ong SK; Sarkar S
Environ Sci Technol; 2023 Nov; 57(46):18382-18390. PubMed ID: 37405782
[TBL] [Abstract][Full Text] [Related]

17. Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning.
Jang S; Kim HI
Sensors (Basel); 2022 Aug; 22(15):. PubMed ID: 35957399
[TBL] [Abstract][Full Text] [Related]

18. An immediate-return reinforcement learning for the atypical Markov decision processes.
Pan Z; Wen G; Tan Z; Yin S; Hu X
Front Neurorobot; 2022; 16():1012427. PubMed ID: 36582302
[TBL] [Abstract][Full Text] [Related]

19. Asynchronous Episodic Deep Deterministic Policy Gradient: Toward Continuous Control in Computationally Complex Environments.
Zhang Z; Chen J; Chen Z; Li W
IEEE Trans Cybern; 2021 Feb; 51(2):604-613. PubMed ID: 31902788
[TBL] [Abstract][Full Text] [Related]

20. A Maximum Divergence Approach to Optimal Policy in Deep Reinforcement Learning.
Yang Z; Qu H; Fu M; Hu W; Zhao Y
IEEE Trans Cybern; 2023 Mar; 53(3):1499-1510. PubMed ID: 34478393
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]