BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

435 related articles for article (PubMed ID: 18632380)

  • 1. Ensemble algorithms in reinforcement learning.
    Wiering MA; van Hasselt H
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):930-6. PubMed ID: 18632380
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators.
    Baddeley B
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):950-6. PubMed ID: 18632383
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Improved Adaptive-Reinforcement Learning Control for morphing unmanned air vehicles.
    Valasek J; Doebbler J; Tandale MD; Meade AJ
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):1014-20. PubMed ID: 18632393
    [TBL] [Abstract][Full Text] [Related]  

  • 4. An evolutionary approach toward dynamic self-generated fuzzy inference systems.
    Zhou Y; Er MJ
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):963-9. PubMed ID: 18632385
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks.
    Yang Q; Vance JB; Jagannathan S
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):994-1001. PubMed ID: 18632390
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Incoherent control of quantum systems with wavefunction-controllable subspaces via quantum reinforcement learning.
    Dong D; Chen C; Tarn TJ; Pechen A; Rabitz H
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):957-62. PubMed ID: 18632384
    [TBL] [Abstract][Full Text] [Related]  

  • 7. A spiking neural network model of an actor-critic learning agent.
    Potjans W; Morrison A; Diesmann M
    Neural Comput; 2009 Feb; 21(2):301-39. PubMed ID: 19196231
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Adaptive feedback control by constrained approximate dynamic programming.
    Ferrari S; Steck JE; Chandramohan R
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):982-7. PubMed ID: 18632388
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Issues on stability of ADP feedback controllers for dynamical systems.
    Balakrishnan SN; Ding J; Lewis FL
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):913-7. PubMed ID: 18632377
    [TBL] [Abstract][Full Text] [Related]  

  • 10. A parameter control method in reinforcement learning to rapidly follow unexpected environmental changes.
    Murakoshi K; Mizuno J
    Biosystems; 2004 Nov; 77(1-3):109-17. PubMed ID: 15527950
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Robust reinforcement learning.
    Morimoto J; Doya K
    Neural Comput; 2005 Feb; 17(2):335-59. PubMed ID: 15720771
    [TBL] [Abstract][Full Text] [Related]  

  • 12. Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof.
    Al-Tamimi A; Lewis FL; Abu-Khalaf M
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):943-9. PubMed ID: 18632382
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Direct heuristic dynamic programming for damping oscillations in a large power system.
    Lu C; Si J; Xie X
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):1008-13. PubMed ID: 18632392
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Modeling of autonomous problem solving process by dynamic construction of task models in multiple tasks environment.
    Ohigashi Y; Omori T
    Neural Netw; 2006 Oct; 19(8):1169-80. PubMed ID: 16989982
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Reinforcement learning of motor skills with policy gradients.
    Peters J; Schaal S
    Neural Netw; 2008 May; 21(4):682-97. PubMed ID: 18482830
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Reliability of internal prediction/estimation and its application. I. Adaptive action selection reflecting reliability of value function.
    Sakaguchi Y; Takano M
    Neural Netw; 2004 Sep; 17(7):935-52. PubMed ID: 15312837
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data.
    Lewis FL; Vamvoudakis KG
    IEEE Trans Syst Man Cybern B Cybern; 2011 Feb; 41(1):14-25. PubMed ID: 20350860
    [TBL] [Abstract][Full Text] [Related]  

  • 18. Adaptive critic learning techniques for engine torque and air-fuel ratio control.
    Liu D; Javaherian H; Kovalenko O; Huang T
    IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):988-93. PubMed ID: 18632389
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Elman backpropagation as reinforcement for simple recurrent networks.
    GrĂ¼ning A
    Neural Comput; 2007 Nov; 19(11):3108-31. PubMed ID: 17883351
    [TBL] [Abstract][Full Text] [Related]  

  • 20. A graph-based evolutionary algorithm: Genetic Network Programming (GNP) and its extension using reinforcement learning.
    Mabu S; Hirasawa K; Hu J
    Evol Comput; 2007; 15(3):369-98. PubMed ID: 17705783
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 22.