Seguir
Mor Shpigel Nacson
Mor Shpigel Nacson
PhD Student, Technion
Dirección de correo verificada de campus.technion.ac.il
Título
Citado por
Citado por
Año
The implicit bias of gradient descent on separable data
D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro
Journal of Machine Learning Research 19 (70), 1-57, 2018
8952018
Convergence of gradient descent on separable data
MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
1522019
Stochastic gradient descent on separable data: Exact convergence with a fixed learning rate
MS Nacson, N Srebro, D Soudry
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
862019
On the implicit bias of initialization shape: Beyond infinitesimal mirror descent
S Azulay, E Moroshko, MS Nacson, BE Woodworth, N Srebro, ...
International Conference on Machine Learning, 468-477, 2021
642021
Lexicographic and depth-sensitive margins in homogeneous and non-homogeneous deep models
MS Nacson, S Gunasekar, J Lee, N Srebro, D Soudry
International Conference on Machine Learning, 4683-4692, 2019
622019
Implicit bias of the step size in linear diagonal neural networks
MS Nacson, K Ravichandran, N Srebro, D Soudry
International Conference on Machine Learning, 16270-16295, 2022
382022
TAEN: temporal aware embedding network for few-shot action recognition
R Ben-Ari, MS Nacson, O Azulai, U Barzelay, D Rotman
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
242021
At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks?
N Giladi, MS Nacson, E Hoffer, D Soudry
arXiv preprint arXiv:1909.12340, 2019
182019
Gradient descent monotonically decreases the sharpness of gradient flow solutions in scalar networks and beyond
I Kreisler, MS Nacson, D Soudry, Y Carmon
International Conference on Machine Learning, 17684-17744, 2023
62023
The implicit bias of minima stability in multivariate shallow relu networks
MS Nacson, R Mulayoff, G Ongie, T Michaeli, D Soudry
arXiv preprint arXiv:2306.17499, 2023
42023
Action recognition using limited data
R Ben-Ari, O Azulai, U Barzelay, MS Nacson
US Patent App. 17/219,322, 2022
12022
How Uniform Random Weights Induce Non-uniform Bias: Typical Interpolating Neural Networks Generalize with Narrow Teachers
G Buzaglo, I Harel, MS Nacson, A Brutzkus, N Srebro, D Soudry
arXiv preprint arXiv:2402.06323, 2024
2024
How Learning Rate and Delay Affect Minima Selection in Asynchronous Training of Neural Networks
N Giladi, MS Nacson, E Hoffer, D Soudry
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–13