239 related articles for article (PubMed ID: 30054315)
1. A mean field view of the landscape of two-layer neural networks. Mei S; Montanari A; Nguyen PM. Proc Natl Acad Sci U S A; 2018 Aug; 115(33):E7665-E7671. PubMed ID: 30054315.
2. Shaping the learning landscape in neural networks around wide flat minima. Baldassi C; Pittorino F; Zecchina R. Proc Natl Acad Sci U S A; 2020 Jan; 117(1):161-170. PubMed ID: 31871189.
3. Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup. Goldt S; Advani MS; Saxe AM; Krzakala F; Zdeborová L. J Stat Mech; 2020 Dec; 2020(12):124010. PubMed ID: 34262607.
4. Anomalous diffusion dynamics of learning in deep neural networks. Chen G; Qu CK; Gong P. Neural Netw; 2022 May; 149():18-28. PubMed ID: 35182851.
5. Stochastic Gradient Descent Introduces an Effective Landscape-Dependent Regularization Favoring Flat Solutions. Yang N; Tang C; Tu Y. Phys Rev Lett; 2023 Jun; 130(23):237101. PubMed ID: 37354404.
6. The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima. Feng Y; Tu Y. Proc Natl Acad Sci U S A; 2021 Mar; 118(9). PubMed ID: 33619091.
7. Stochastic Gradient Descent for Nonconvex Learning Without Bounded Gradient Assumptions. Lei Y; Hu T; Li G; Tang K. IEEE Trans Neural Netw Learn Syst; 2020 Oct; 31(10):4394-4400. PubMed ID: 31831449.
8. A Geometric Interpretation of Stochastic Gradient Descent Using Diffusion Metrics. Fioresi R; Chaudhari P; Soatto S. Entropy (Basel); 2020 Jan; 22(1). PubMed ID: 33285876.
13. Learning Rates for Stochastic Gradient Descent With Nonconvex Objectives. Lei Y; Tang K. IEEE Trans Pattern Anal Mach Intell; 2021 Dec; 43(12):4505-4511. PubMed ID: 33755555.
14. Accelerating deep neural network training with inconsistent stochastic gradient descent. Wang L; Yang Y; Min R; Chakradhar S. Neural Netw; 2017 Sep; 93():219-229. PubMed ID: 28668660.
15. Towards Better Generalization of Deep Neural Networks via Non-Typicality Sampling Scheme. Peng X; Wang FY; Li L. IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):7910-7920. PubMed ID: 35157598.
16. Geometry of Energy Landscapes and the Optimizability of Deep Neural Networks. Becker S; Zhang Y; Lee AA. Phys Rev Lett; 2020 Mar; 124(10):108301. PubMed ID: 32216422.
17. Accelerating DNN Training Through Selective Localized Learning. Krithivasan S; Sen S; Venkataramani S; Raghunathan A. Front Neurosci; 2021; 15():759807. PubMed ID: 35087370.
18. Primal Averaging: A New Gradient Evaluation Step to Attain the Optimal Individual Convergence. Tao W; Pan Z; Wu G; Tao Q. IEEE Trans Cybern; 2020 Feb; 50(2):835-845. PubMed ID: 30346303.
19. Archetypal landscapes for deep neural networks. Verpoort PC; Lee AA; Wales DJ. Proc Natl Acad Sci U S A; 2020 Sep; 117(36):21857-21864. PubMed ID: 32843349.