These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.
4. Varieties of learning automata: an overview. Thathachar ML; Sastry PS IEEE Trans Syst Man Cybern B Cybern; 2002; 32(6):711-22. PubMed ID: 18244878 [TBL] [Abstract][Full Text] [Related]
5. New learning automata based algorithms for adaptation of backpropagation algorithm parameters. Meybodi MR; Beigy H Int J Neural Syst; 2002 Feb; 12(1):45-67. PubMed ID: 11852444 [TBL] [Abstract][Full Text] [Related]
6. Finite time analysis of the pursuit algorithm for learning automata. Rajaraman K; Sastry PS IEEE Trans Syst Man Cybern B Cybern; 1996; 26(4):590-8. PubMed ID: 18263056 [TBL] [Abstract][Full Text] [Related]
7. Parallel algorithms for modules of learning automata. Thathachar ML; Arvind MT IEEE Trans Syst Man Cybern B Cybern; 1998; 28(1):24-33. PubMed ID: 18255919 [TBL] [Abstract][Full Text] [Related]
8. A team of continuous-action learning automata for noise-tolerant learning of half-spaces. Sastry PS; Nagendra GD; Manwani N IEEE Trans Syst Man Cybern B Cybern; 2010 Feb; 40(1):19-28. PubMed ID: 19884058 [TBL] [Abstract][Full Text] [Related]
9. A Collaborative Multiagent Reinforcement Learning Method Based on Policy Gradient Potential. Zhang Z; Ong YS; Wang D; Xue B IEEE Trans Cybern; 2021 Feb; 51(2):1015-1027. PubMed ID: 31443061 [TBL] [Abstract][Full Text] [Related]
10. Decentralized learning in Markov games. Vrancx P; Verbeeck K; Nowé A IEEE Trans Syst Man Cybern B Cybern; 2008 Aug; 38(4):976-81. PubMed ID: 18632387 [TBL] [Abstract][Full Text] [Related]
11. Discrete-Time Deterministic $Q$ -Learning: A Novel Convergence Analysis. Wei Q; Lewis FL; Sun Q; Yan P; Song R IEEE Trans Cybern; 2017 May; 47(5):1224-1237. PubMed ID: 27093714 [TBL] [Abstract][Full Text] [Related]
12. Learning Automata-Based Multiagent Reinforcement Learning for Optimization of Cooperative Tasks. Zhang Z; Wang D; Gao J IEEE Trans Neural Netw Learn Syst; 2021 Oct; 32(10):4639-4652. PubMed ID: 33027003 [TBL] [Abstract][Full Text] [Related]
13. Learning in multilevel games with incomplete information. II. Zhou J; Billard E; Lakshmivarahan S IEEE Trans Syst Man Cybern B Cybern; 1999; 29(3):340-9. PubMed ID: 18252309 [TBL] [Abstract][Full Text] [Related]
14. Magnified gradient function with deterministic weight modification in adaptive learning. Ng SC; Cheung CC; Leung SH IEEE Trans Neural Netw; 2004 Nov; 15(6):1411-23. PubMed ID: 15565769 [TBL] [Abstract][Full Text] [Related]
15. An efficient approximation algorithm for finding a maximum clique using Hopfield network learning. Wang RL; Tang Z; Cao QP Neural Comput; 2003 Jul; 15(7):1605-19. PubMed ID: 12816568 [TBL] [Abstract][Full Text] [Related]
16. Last-position elimination-based learning automata. Zhang J; Wang C; Zhou M IEEE Trans Cybern; 2014 Dec; 44(12):2484-92. PubMed ID: 24710837 [TBL] [Abstract][Full Text] [Related]
17. Cellular learning automata with multiple learning automata in each cell and its applications. Beigy H; Meybodi MR IEEE Trans Syst Man Cybern B Cybern; 2010 Feb; 40(1):54-65. PubMed ID: 19884061 [TBL] [Abstract][Full Text] [Related]
18. Reinforcement Learning for Constrained Energy Trading Games With Incomplete Information. Wang H; Huang T; Liao X; Abu-Rub H; Chen G IEEE Trans Cybern; 2017 Oct; 47(10):3404-3416. PubMed ID: 28885145 [TBL] [Abstract][Full Text] [Related]
19. A new class of epsilon-optimal learning automata. Papadimitriou GI; Sklira M; Pomportsis AS IEEE Trans Syst Man Cybern B Cybern; 2004 Feb; 34(1):246-54. PubMed ID: 15369067 [TBL] [Abstract][Full Text] [Related]
20. Reinforcement learning for a stochastic automaton modelling predation in stationary model-mimic environments. Tsoularis A; Wallace J Math Biosci; 2005 May; 195(1):76-91. PubMed ID: 15893338 [TBL] [Abstract][Full Text] [Related] [Next] [New Search]