These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


PUBMED FOR HANDHELDS

Journal Abstract Search


247 related items for PubMed ID: 33186101

  • 1. Self-Supervised Discovering of Interpretable Features for Reinforcement Learning.
    Shi W, Huang G, Song S, Wang Z, Lin T, Wu C.
    IEEE Trans Pattern Anal Mach Intell; 2022 May; 44(5):2712-2724. PubMed ID: 33186101
    [Abstract] [Full Text] [Related]

  • 2.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 3. STACoRe: Spatio-temporal and action-based contrastive representations for reinforcement learning in Atari.
    Lee YJ, Kim J, Kwak M, Park YJ, Kim SB.
    Neural Netw; 2023 Mar; 160():1-11. PubMed ID: 36587439
    [Abstract] [Full Text] [Related]

  • 4.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 5.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 6.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 7.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 8.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 9.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 10. Reinforcement Learning for Improving Agent Design.
    Ha D.
    Artif Life; 2019 Mar; 25(4):352-365. PubMed ID: 31697584
    [Abstract] [Full Text] [Related]

  • 11. Reinforcement learning and its connections with neuroscience and psychology.
    Subramanian A, Chitlangia S, Baths V.
    Neural Netw; 2022 Jan; 145():271-287. PubMed ID: 34781215
    [Abstract] [Full Text] [Related]

  • 12.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 13.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 14. Meta attention for Off-Policy Actor-Critic.
    Huang J, Huang W, Lan L, Wu D.
    Neural Netw; 2023 Jun; 163():86-96. PubMed ID: 37030278
    [Abstract] [Full Text] [Related]

  • 15.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 16. Differentiable self-supervised clustering with intrinsic interpretability.
    Yan X, Jin Z, Mao Y, Ye Y, Yu H.
    Neural Netw; 2024 Nov; 179():106542. PubMed ID: 39053302
    [Abstract] [Full Text] [Related]

  • 17. Weak Human Preference Supervision for Deep Reinforcement Learning.
    Cao Z, Wong K, Lin CT.
    IEEE Trans Neural Netw Learn Syst; 2021 Dec; 32(12):5369-5378. PubMed ID: 34101604
    [Abstract] [Full Text] [Related]

  • 18.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]

  • 19. GenPADS: Reinforcing politeness in an end-to-end dialogue system.
    Mishra K, Firdaus M, Ekbal A.
    PLoS One; 2023 Dec; 18(1):e0278323. PubMed ID: 36607963
    [Abstract] [Full Text] [Related]

  • 20.
    ; . PubMed ID:
    [No Abstract] [Full Text] [Related]


    Page: [Next] [New Search]
    of 13.