These tools will no longer be maintained as of December 31, 2024. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

52 related articles for article (PubMed ID: 36409817)

  • 1. Reward-Adaptive Reinforcement Learning: Dynamic Policy Gradient Optimization for Bipedal Locomotion.
    Huang C; Wang G; Zhou Z; Zhang R; Lin L
    IEEE Trans Pattern Anal Mach Intell; 2023 Jun; 45(6):7686-7695. PubMed ID: 36409817

  • 2. A Multi-Agent Reinforcement Learning Method for Omnidirectional Walking of Bipedal Robots.
    Mou H; Xue J; Liu J; Feng Z; Li Q; Zhang J
    Biomimetics (Basel); 2023 Dec; 8(8):. PubMed ID: 38132555

  • 3. Adaptive Gait Acquisition through Learning Dynamic Stimulus Instinct of Bipedal Robot.
    Zhang Y; Chen X; Meng F; Yu Z; Du Y; Zhou Z; Gao J
    Biomimetics (Basel); 2024 May; 9(6):. PubMed ID: 38921190

  • 4. Biped Robots Control in Gusty Environments with Adaptive Exploration Based DDPG.
    Zhang Y; Sun H; Sun H; Huang Y; Hashimoto K
    Biomimetics (Basel); 2024 Jun; 9(6):. PubMed ID: 38921226

  • 5. Intelligent control of self-driving vehicles based on adaptive sampling supervised actor-critic and human driving experience.
    Zhang J; Ma N; Wu Z; Wang C; Yao Y
    Math Biosci Eng; 2024 May; 21(5):6077-6096. PubMed ID: 38872570

  • 6. Autonomous Driving of Mobile Robots in Dynamic Environments Based on Deep Deterministic Policy Gradient: Reward Shaping and Hindsight Experience Replay.
    Park M; Park C; Kwon NK
    Biomimetics (Basel); 2024 Jan; 9(1):. PubMed ID: 38248625

  • 7. An off-policy multi-agent stochastic policy gradient algorithm for cooperative continuous control.
    Guo D; Tang L; Zhang X; Liang YC
    Neural Netw; 2024 Feb; 170():610-621. PubMed ID: 38056408

  • 8. Multi-agent reinforcement learning with approximate model learning for competitive games.
    Park YJ; Cho YS; Kim SB
    PLoS One; 2019; 14(9):e0222215. PubMed ID: 31509568

  • 9. Deep reinforcement learning for the direct optimization of gradient separations in liquid chromatography.
    Kensert A; Libin P; Desmet G; Cabooter D
    J Chromatogr A; 2024 Apr; 1720():464768. PubMed ID: 38442496

  • 10. An Off-Policy Reinforcement Learning-Based Adaptive Optimization Method for Dynamic Resource Allocation Problem.
    He B; Meng Y; Tang L
    IEEE Trans Neural Netw Learn Syst; 2023 Dec; PP():. PubMed ID: 38090867

  • 11. Dynamic sparse coding-based value estimation network for deep reinforcement learning.
    Zhao H; Li Z; Su W; Xie S
    Neural Netw; 2023 Nov; 168():180-193. PubMed ID: 37757726

  • 12. Dynamic Fall Recovery Control for Legged Robots via Reinforcement Learning.
    Li S; Pang Y; Bai P; Hu S; Wang L; Wang G
    Biomimetics (Basel); 2024 Mar; 9(4):. PubMed ID: 38667204

  • 13. Feudal Latent Space Exploration for Coordinated Multi-Agent Reinforcement Learning.
    Liu X; Tan Y
    IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):7775-7783. PubMed ID: 35167482

  • 14. Deep Reinforcement Learning for Nash Equilibrium of Differential Games.
    Li Z; Luo Y
    IEEE Trans Neural Netw Learn Syst; 2024 Jan; PP():. PubMed ID: 38261501

  • 15. Cross-domain policy adaptation with dynamics alignment.
    Gui H; Pang S; Yu S; Qiao S; Qi Y; He X; Wang M; Zhai X
    Neural Netw; 2023 Oct; 167():104-117. PubMed ID: 37647740

  • 16. Boosting Reinforcement Learning via Hierarchical Game Playing With State Relay.
    Liu C; Cong J; Liu G; Jiang G; Xu X; Zhu E
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38648134

  • 17. Self-Punishment and Reward Backfill for Deep Q-Learning.
    Bonyadi MR; Wang R; Ziaei M
    IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):8086-8093. PubMed ID: 35041613

  • 18. Nonlinear dynamic modeling and model-based AI-driven control of a magnetoactive soft continuum robot in a fluidic environment.
    Moezi SA; Sedaghati R; Rakheja S
    ISA Trans; 2024 Jan; 144():245-259. PubMed ID: 37932207

  • 19. Distributional Policy Gradient With Distributional Value Function.
    Liu Q; Li Y; Shi X; Lin K; Liu Y; Lou Y
    IEEE Trans Neural Netw Learn Syst; 2024 Apr; PP():. PubMed ID: 38669170

  • 20. DTC: Deep Tracking Control.
    Jenelten F; He J; Farshidian F; Hutter M
    Sci Robot; 2024 Jan; 9(86):eadh5401. PubMed ID: 38232148
