Biomarkers Search

BIOMARKERS

Molecular Biopsy of Human Tumors

- a resource for Precision Medicine *

136 related articles for article (PubMed ID: 33619091)

1. The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima.
Feng Y; Tu Y
Proc Natl Acad Sci U S A; 2021 Mar; 118(9):. PubMed ID: 33619091
[TBL] [Abstract][Full Text] [Related]

2. Stochastic Gradient Descent Introduces an Effective Landscape-Dependent Regularization Favoring Flat Solutions.
Yang N; Tang C; Tu Y
Phys Rev Lett; 2023 Jun; 130(23):237101. PubMed ID: 37354404
[TBL] [Abstract][Full Text] [Related]

3. Anomalous diffusion dynamics of learning in deep neural networks.
Chen G; Qu CK; Gong P
Neural Netw; 2022 May; 149():18-28. PubMed ID: 35182851
[TBL] [Abstract][Full Text] [Related]

4. A mean field view of the landscape of two-layer neural networks.
Mei S; Montanari A; Nguyen PM
Proc Natl Acad Sci U S A; 2018 Aug; 115(33):E7665-E7671. PubMed ID: 30054315
[TBL] [Abstract][Full Text] [Related]

5. Shaping the learning landscape in neural networks around wide flat minima.
Baldassi C; Pittorino F; Zecchina R
Proc Natl Acad Sci U S A; 2020 Jan; 117(1):161-170. PubMed ID: 31871189
[TBL] [Abstract][Full Text] [Related]

6. Accelerating Minibatch Stochastic Gradient Descent Using Typicality Sampling.
Peng X; Li L; Wang FY
IEEE Trans Neural Netw Learn Syst; 2020 Nov; 31(11):4649-4659. PubMed ID: 31899442
[TBL] [Abstract][Full Text] [Related]

7. Understanding Short-Range Memory Effects in Deep Neural Networks.
Tan C; Zhang J; Liu J
IEEE Trans Neural Netw Learn Syst; 2023 Feb; PP():. PubMed ID: 37027555
[TBL] [Abstract][Full Text] [Related]

8. Unveiling the Structure of Wide Flat Minima in Neural Networks.
Baldassi C; Lauditi C; Malatesta EM; Perugini G; Zecchina R
Phys Rev Lett; 2021 Dec; 127(27):278301. PubMed ID: 35061428
[TBL] [Abstract][Full Text] [Related]

9. Towards Better Generalization of Deep Neural Networks via Non-Typicality Sampling Scheme.
Peng X; Wang FY; Li L
IEEE Trans Neural Netw Learn Syst; 2023 Oct; 34(10):7910-7920. PubMed ID: 35157598
[TBL] [Abstract][Full Text] [Related]

10. The Limiting Dynamics of SGD: Modified Loss, Phase-Space Oscillations, and Anomalous Diffusion.
Kunin D; Sagastuy-Brena J; Gillespie L; Margalit E; Tanaka H; Ganguli S; Yamins DLK
Neural Comput; 2023 Dec; 36(1):151-174. PubMed ID: 38052080
[TBL] [Abstract][Full Text] [Related]

11. Stochastic Gradient Descent-like relaxation is equivalent to Metropolis dynamics in discrete optimization and inference problems.
Angelini MC; Cavaliere AG; Marino R; Ricci-Tersenghi F
Sci Rep; 2024 May; 14(1):11638. PubMed ID: 38773255
[TBL] [Abstract][Full Text] [Related]

12. A Geometric Interpretation of Stochastic Gradient Descent Using Diffusion Metrics.
Fioresi R; Chaudhari P; Soatto S
Entropy (Basel); 2020 Jan; 22(1):. PubMed ID: 33285876
[TBL] [Abstract][Full Text] [Related]

13. Stochastic Mirror Descent on Overparameterized Nonlinear Models.
Azizan N; Lale S; Hassibi B
IEEE Trans Neural Netw Learn Syst; 2022 Dec; 33(12):7717-7727. PubMed ID: 34270431
[TBL] [Abstract][Full Text] [Related]

14. Learning Rates for Stochastic Gradient Descent With Nonconvex Objectives.
Lei Y; Tang K
IEEE Trans Pattern Anal Mach Intell; 2021 Dec; 43(12):4505-4511. PubMed ID: 33755555
[TBL] [Abstract][Full Text] [Related]

15. Preconditioned Stochastic Gradient Descent.
Li XL
IEEE Trans Neural Netw Learn Syst; 2018 May; 29(5):1454-1466. PubMed ID: 28362591
[TBL] [Abstract][Full Text] [Related]

16. Dynamics of stochastic gradient descent for two-layer neural networks in the teacher-student setup.
Goldt S; Advani MS; Saxe AM; Krzakala F; Zdeborová L
J Stat Mech; 2020 Dec; 2020(12):124010. PubMed ID: 34262607
[TBL] [Abstract][Full Text] [Related]

17. Weighted SGD for ℓ
Yang J; Chow YL; Ré C; Mahoney MW
Proc Annu ACM SIAM Symp Discret Algorithms; 2016 Jan; 2016():558-569. PubMed ID: 29782626
[TBL] [Abstract][Full Text] [Related]

18. A(DP)
Xu J; Zhang W; Wang F
IEEE Trans Pattern Anal Mach Intell; 2022 Nov; 44(11):8036-8047. PubMed ID: 34449356
[TBL] [Abstract][Full Text] [Related]

19. Drill the Cork of Information Bottleneck by Inputting the Most Important Data.
Peng X; Zhang J; Wang FY; Li L
IEEE Trans Neural Netw Learn Syst; 2022 Nov; 33(11):6360-6372. PubMed ID: 34029196
[TBL] [Abstract][Full Text] [Related]

20. On the different regimes of stochastic gradient descent.
Sclocchi A; Wyart M
Proc Natl Acad Sci U S A; 2024 Feb; 121(9):e2316301121. PubMed ID: 38377198
[TBL] [Abstract][Full Text] [Related]

[Next] [New Search]