Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

Evolution and Recent trends for the SGD algorithm: key takeaways from an Educational Short Course

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

1 Cita (Scopus)

Resumen

One of the key challenges when designing a ten (10) hours educational short course, entitled “A Hands-on Approach for Implementing Stochastic Optimization Algorithms from Scratch”, which was accepted for inclusion at ICASSP’23, was related to addressing “how to introduce the Stochastic Gradient Descent (SGD) algorithm and variants in a consistent, accessible fashion”? From a simplistic perspective, the SGD algorithm is nothing else than the classical gradient descent (GD) algorithm along with a (very) noisy gradient. Nonetheless, arguably, SGD’s most influential variants, e.g. AdaGrad, RMSprop and Adam, nor more recent ones (LookAhead, ϵAdam, MadGrad, among several others) may not be explained in such superficial terms. Moreover, such variants are usually given as as black-boxes by most deep-learning (DL) libraries (e.g. TensorFlow, PyTorch, etc.). In this article, based on the experience of the aforementioned short-course, I propose to link the SGD algorithm and variants via an “evolutionary path”, in which each SGD variant may be understood as a set of add-on features over the vanilla SGD, resulting in a generalized algorithm along with a “family tree” graph which are both intuitive and useful when implementing a given SGD variant.

Idioma originalInglés
Título de la publicación alojada32nd European Signal Processing Conference, EUSIPCO 2024 - Proceedings
EditorialEuropean Signal Processing Conference, EUSIPCO
Páginas1761-1765
Número de páginas5
ISBN (versión digital)9789464593617
DOI
EstadoPublicada - 2024
Evento32nd European Signal Processing Conference, EUSIPCO 2024 - Lyon, Francia
Duración: 26 ago. 202430 ago. 2024

Serie de la publicación

NombreEuropean Signal Processing Conference
ISSN (versión impresa)2219-5491

Conferencia

Conferencia32nd European Signal Processing Conference, EUSIPCO 2024
País/TerritorioFrancia
CiudadLyon
Período26/08/2430/08/24

Huella

Profundice en los temas de investigación de 'Evolution and Recent trends for the SGD algorithm: key takeaways from an Educational Short Course'. En conjunto forman una huella única.

Citar esto