Welcome to Jikai Jin's website
Welcome to Jikai Jin's website
Home
News
Publications
Experience
Contact
Light
Dark
Automatic
Deep Learning Theory
Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking
We investigate the “grokking” phenomenon in deep learning on some simple setups, and show that it is caused by a dichotomy of the implicit biases between the early phase and late phase during training.
Kaifeng Lyu
,
Jikai Jin
,
Zhiyuan Li
,
Simon S. Du
,
Jason D. Lee
,
Wei Hu
Cite
ArXiv
Understanding Incremental Learning of Gradient Descent -- A Fine-grained Analysis of Matrix Sensing
We prove that GD applied to the matrix sensing problem has intriguing properties – with small initialization and early stopping, it follows an incremental/greedy low-rank learning procedure. This form of simplicity bias allows GD to recover the ground-truth, despite over-parameterization and non-convexity.
Jikai Jin
,
Zhiyuan Li
,
Kaifeng Lyu
,
Simon S. Du
,
Jason D. Lee
PDF
Cite
ArXiv
Poster
Slides
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power
We provide theoretical evidence that the hardness of robust generalization may stem from the expressive power of deep neural networks. Even when standard generalization is easy, robust generalization provably requires the size of DNNs to be exponentially large.
Binghui Li
,
Jikai Jin
,
Han Zhong
,
John E. Hopcroft
,
Liwei Wang
PDF
Cite
ArXiv
Cite
×