227 related articles for article (PubMed ID: 21151963)
1. Structure learning in human sequential decision-making.
Acuña DE; Schrater P
PLoS Comput Biol; 2010 Dec; 6(12):e1001003. PubMed ID: 21151963
[TBL] [Abstract][Full Text] [Related]
2. Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making.
Schönberg T; Daw ND; Joel D; O'Doherty JP
J Neurosci; 2007 Nov; 27(47):12860-7. PubMed ID: 18032658
[TBL] [Abstract][Full Text] [Related]
3. Mouse tracking reveals structure knowledge in the absence of model-based choice.
Konovalov A; Krajbich I
Nat Commun; 2020 Apr; 11(1):1893. PubMed ID: 32312966
[TBL] [Abstract][Full Text] [Related]
4. Overtaking method based on sand-sifter mechanism: Why do optimistic value functions find optimal solutions in multi-armed bandit problems?
Ochi K; Kamiura M
Biosystems; 2015 Sep; 135():55-65. PubMed ID: 26166266
[TBL] [Abstract][Full Text] [Related]
5. One-shot learning and behavioral eligibility traces in sequential decision making.
Lehmann MP; Xu HA; Liakoni V; Herzog MH; Gerstner W; Preuschoff K
Elife; 2019 Nov; 8():. PubMed ID: 31709980
[TBL] [Abstract][Full Text] [Related]
6. Similarities and differences in spatial and non-spatial cognitive maps.
Wu CM; Schulz E; Garvert MM; Meder B; Schuck NW
PLoS Comput Biol; 2020 Sep; 16(9):e1008149. PubMed ID: 32903264
[TBL] [Abstract][Full Text] [Related]
7. [Mathematical models of decision making and learning].
Ito M; Doya K
Brain Nerve; 2008 Jul; 60(7):791-8. PubMed ID: 18646619
[TBL] [Abstract][Full Text] [Related]
8. Credit assignment in movement-dependent reinforcement learning.
McDougle SD; Boggess MJ; Crossley MJ; Parvin D; Ivry RB; Taylor JA
Proc Natl Acad Sci U S A; 2016 Jun; 113(24):6797-802. PubMed ID: 27247404
[TBL] [Abstract][Full Text] [Related]
9. The actor-critic learning is behind the matching law: matching versus optimal behaviors.
Sakai Y; Fukai T
Neural Comput; 2008 Jan; 20(1):227-51. PubMed ID: 18045007
[TBL] [Abstract][Full Text] [Related]
10. Finding structure in multi-armed bandits.
Schulz E; Franklin NT; Gershman SJ
Cogn Psychol; 2020 Jun; 119():101261. PubMed ID: 32059133
[TBL] [Abstract][Full Text] [Related]
11. Policy adjustment in a dynamic economic game.
Li J; McClure SM; King-Casas B; Montague PR
PLoS One; 2006 Dec; 1(1):e103. PubMed ID: 17183636
[TBL] [Abstract][Full Text] [Related]
12. Benchmarking for Bayesian Reinforcement Learning.
Castronovo M; Ernst D; Couëtoux A; Fonteneau R
PLoS One; 2016; 11(6):e0157088. PubMed ID: 27304891
[TBL] [Abstract][Full Text] [Related]
13. Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making.
Liakoni V; Lehmann MP; Modirshanechi A; Brea J; Lutti A; Gerstner W; Preuschoff K
Neuroimage; 2022 Feb; 246():118780. PubMed ID: 34875383
[TBL] [Abstract][Full Text] [Related]
14. Adaptive learning via selectionism and Bayesianism, Part II: the sequential case.
Zhang J
Neural Netw; 2009 Apr; 22(3):229-36. PubMed ID: 19395235
[TBL] [Abstract][Full Text] [Related]
15. Modulation of value-based decision making behavior by subregions of the rat prefrontal cortex.
Verharen JPH; den Ouden HEM; Adan RAH; Vanderschuren LJMJ
Psychopharmacology (Berl); 2020 May; 237(5):1267-1280. PubMed ID: 32025777
[TBL] [Abstract][Full Text] [Related]
16. Model-based reinforcement learning under concurrent schedules of reinforcement in rodents.
Huh N; Jo S; Kim H; Sul JH; Jung MW
Learn Mem; 2009 May; 16(5):315-23. PubMed ID: 19403794
[TBL] [Abstract][Full Text] [Related]
17. Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.
Solway A; Botvinick MM
Psychol Rev; 2012 Jan; 119(1):120-54. PubMed ID: 22229491
[TBL] [Abstract][Full Text] [Related]
18. A frontal dopamine system for reflective exploratory behavior.
Blanco NJ; Love BC; Cooper JA; McGeary JE; Knopik VS; Maddox WT
Neurobiol Learn Mem; 2015 Sep; 123():84-91. PubMed ID: 26004676
[TBL] [Abstract][Full Text] [Related]
19. Decision-making without a brain: how an amoeboid organism solves the two-armed bandit.
Reid CR; MacDonald H; Mann RP; Marshall JA; Latty T; Garnier S
J R Soc Interface; 2016 Jun; 13(119):. PubMed ID: 27278359
[TBL] [Abstract][Full Text] [Related]
20. How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.
Krigolson OE; Hassall CD; Handy TC
J Cogn Neurosci; 2014 Mar; 26(3):635-44. PubMed ID: 24168216
[TBL] [Abstract][Full Text] [Related]
[Next] [New Search]