These tools will no longer be maintained as of December 31, 2024. Archived website can be found here. PubMed4Hh GitHub repository can be found here. Contact NLM Customer Service if you have questions.


BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

136 related articles for article (PubMed ID: 33619091)

  • 1. The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima.
    Feng Y; Tu Y
    Proc Natl Acad Sci U S A; 2021 Mar; 118(9):. PubMed ID: 33619091
    [TBL] [Abstract][Full Text] [Related]  

  • 2. Stochastic Gradient Descent Introduces an Effective Landscape-Dependent Regularization Favoring Flat Solutions.
    Yang N; Tang C; Tu Y
    Phys Rev Lett; 2023 Jun; 130(23):237101. PubMed ID: 37354404
    [TBL] [Abstract][Full Text] [Related]  

  • 3. Anomalous diffusion dynamics of learning in deep neural networks.
    Chen G; Qu CK; Gong P
    Neural Netw; 2022 May; 149():18-28. PubMed ID: 35182851
    [TBL] [Abstract][Full Text] [Related]  

  • 4. A mean field view of the landscape of two-layer neural networks.
    Mei S; Montanari A; Nguyen PM
    Proc Natl Acad Sci U S A; 2018 Aug; 115(33):E7665-E7671. PubMed ID: 30054315
    [TBL] [Abstract][Full Text] [Related]  

  • 5. Shaping the learning landscape in neural networks around wide flat minima.
    Baldassi C; Pittorino F; Zecchina R
    Proc Natl Acad Sci U S A; 2020 Jan; 117(1):161-170. PubMed ID: 31871189
    [TBL] [Abstract][Full Text] [Related]  

  • 6. Accelerating Minibatch Stochastic Gradient Descent Using Typicality Sampling.
    Peng X; Li L; Wang FY
    IEEE Trans Neural Netw Learn Syst; 2020 Nov; 31(11):4649-4659. PubMed ID: 31899442
    [TBL] [Abstract][Full Text] [Related]  

  • 7. Understanding Short-Range Memory Effects in Deep Neural Networks.
    Tan C; Zhang J; Liu J
    IEEE Trans Neural Netw Learn Syst; 2023 Feb; PP():. PubMed ID: 37027555
    [TBL] [Abstract][Full Text] [Related]  

  • 8. Unveiling the Structure of Wide Flat Minima in Neural Networks.
    Baldassi C; Lauditi C; Malatesta EM; Perugini G; Zecchina R
    Phys Rev Lett; 2021 Dec; 127(27):278301. PubMed ID: 35061428
    [TBL] [Abstract][Full Text] [Related]  

  • 9. Towards Better Generalization of Deep Neural Networks via Non-Typicality Sampling Scheme.
    Peng X; Wang FY; Li L
    IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):7910-7920. PubMed ID: 35157598
    [TBL] [Abstract][Full Text] [Related]  

  • 10. The Limiting Dynamics of SGD: Modified Loss, Phase-Space Oscillations, and Anomalous Diffusion.
    Kunin D; Sagastuy-Brena J; Gillespie L; Margalit E; Tanaka H; Ganguli S; Yamins DLK
    Neural Comput; 2023 Dec; 36(1):151-174. PubMed ID: 38052080
    [TBL] [Abstract][Full Text] [Related]  

  • 11. Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems.
    Angelini MC; Cavaliere AG; Marino R; Ricci-Tersenghi F
    Sci Rep; 2024 May; 14(1):11638. PubMed ID: 38773255
    [TBL] [Abstract][Full Text] [Related]  

  • 12. A Geometric Interpretation of Stochastic Gradient Descent Using Diffusion Metrics.
    Fioresi R; Chaudhari P; Soatto S
    Entropy (Basel); 2020 Jan; 22(1):. PubMed ID: 33285876
    [TBL] [Abstract][Full Text] [Related]  

  • 13. Stochastic Mirror Descent on Overparameterized Nonlinear Models.
    Azizan N; Lale S; Hassibi B
    IEEE Trans Neural Netw Learn Syst; 2022 Dec; 33(12):7717-7727. PubMed ID: 34270431
    [TBL] [Abstract][Full Text] [Related]  

  • 14. Learning Rates for Stochastic Gradient Descent With Nonconvex Objectives.
    Lei Y; Tang K
    IEEE Trans Pattern Anal Mach Intell; 2021 Dec; 43(12):4505-4511. PubMed ID: 33755555
    [TBL] [Abstract][Full Text] [Related]  

  • 15. Preconditioned Stochastic Gradient Descent.
    Li XL
    IEEE Trans Neural Netw Learn Syst; 2018 May; 29(5):1454-1466. PubMed ID: 28362591
    [TBL] [Abstract][Full Text] [Related]  

  • 16. Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup.
    Goldt S; Advani MS; Saxe AM; Krzakala F; Zdeborová L
    J Stat Mech; 2020 Dec; 2020(12):124010. PubMed ID: 34262607
    [TBL] [Abstract][Full Text] [Related]  

  • 17. Weighted SGD for ℓ
    Yang J; Chow YL; Ré C; Mahoney MW
    Proc Annu ACM SIAM Symp Discret Algorithms; 2016 Jan; 2016():558-569. PubMed ID: 29782626
    [TBL] [Abstract][Full Text] [Related]  

  • 18. A(DP)
    Xu J; Zhang W; Wang F
    IEEE Trans Pattern Anal Mach Intell; 2022 Nov; 44(11):8036-8047. PubMed ID: 34449356
    [TBL] [Abstract][Full Text] [Related]  

  • 19. Drill the Cork of Information Bottleneck by Inputting the Most Important Data.
    Peng X; Zhang J; Wang FY; Li L
    IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6360-6372. PubMed ID: 34029196
    [TBL] [Abstract][Full Text] [Related]  

  • 20. On the different regimes of stochastic gradient descent.
    Sclocchi A; Wyart M
    Proc Natl Acad Sci U S A; 2024 Feb; 121(9):e2316301121. PubMed ID: 38377198
    [TBL] [Abstract][Full Text] [Related]  

    [Next]    [New Search]
    of 7.