Diagonal Barzilai-Borwein Rules in Stochastic Gradient-Like Methods

Franchini, G.; Porta, F.; Ruggiero, V.; Trombini, I.; Zanni, L.

doi:10.1007/978-3-031-34020-8_2

Minimization problems whose objective function is a finite sum often arise in machine learning applications. The number of terms in the sum is typically very large, which makes computing the full gradient infeasible. For this reason, stochastic gradient methods are commonly used. The performance of these methods depends strongly on the choice of both the learning rate and the mini-batch size used to compute the stochastic direction. In this paper we combine a recent idea for selecting the learning rate as a diagonal matrix, based on stochastic Barzilai-Borwein rules, with an adaptive subsampling technique for setting the mini-batch size. Convergence results for the resulting stochastic gradient algorithm are shown for both convex and non-convex objective functions. Several numerical experiments on binary classification problems compare the proposed method with other state-of-the-art schemes.
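
The idea of a diagonal learning rate can be illustrated with a small sketch. The code below is not the rule from the paper: it assumes a simplified componentwise secant ratio s_i / y_i, clipped into [alpha_min, alpha_max], applied to a toy finite-sum least-squares problem, and it keeps the mini-batch size fixed instead of adapting it. All function names, bounds, and the fallback value are hypothetical choices made for illustration.

```python
import random


def diagonal_bb_step(s, y, alpha_min=1e-4, alpha_max=1.0, fallback=1e-2):
    """Simplified diagonal BB-like rule (an assumption, not the paper's rule).

    s is the difference of two iterates, y the difference of the matching
    gradients. Each coordinate gets the ratio s_i / y_i, safeguarded by a
    fallback value and clipped into [alpha_min, alpha_max].
    """
    steps = []
    for s_i, y_i in zip(s, y):
        ratio = s_i / y_i if abs(y_i) > 1e-12 else fallback
        if ratio <= 0:  # non-positive curvature estimate: use the fallback
            ratio = fallback
        steps.append(min(max(ratio, alpha_min), alpha_max))
    return steps


def minibatch_grad(w, data, idx):
    """Gradient of the mean of 0.5 * (a.w - b)^2 over the sampled components."""
    g = [0.0] * len(w)
    for i in idx:
        a, b = data[i]
        residual = sum(a_j * w_j for a_j, w_j in zip(a, w)) - b
        for j, a_j in enumerate(a):
            g[j] += residual * a_j / len(idx)
    return g


def sgd_diagonal_bb(data, dim, batch_size=8, iters=100, seed=0):
    """Mini-batch SGD where the learning rate is a vector (diagonal matrix)."""
    rng = random.Random(seed)
    w = [0.0] * dim
    d = [1e-2] * dim  # initial diagonal learning rate
    for _ in range(iters):
        idx = rng.sample(range(len(data)), batch_size)
        g = minibatch_grad(w, data, idx)
        w_new = [w_j - d_j * g_j for w_j, d_j, g_j in zip(w, d, g)]
        # Both gradients use the same mini-batch, so y reflects the change
        # in the iterate and not a change in the sample.
        g_new = minibatch_grad(w_new, data, idx)
        s = [a - b for a, b in zip(w_new, w)]
        y = [a - b for a, b in zip(g_new, g)]
        d = diagonal_bb_step(s, y)
        w = w_new
    return w
```

Here each entry of `data` is a pair `(a, b)` with a feature list `a` and a scalar target `b`. Evaluating both gradients on the same mini-batch is one common way to keep sampling noise out of `y`; an adaptive scheme would additionally change `batch_size` between iterations.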


Publication year: 2023

ISBN: 978-3-031-34019-2; 978-3-031-34020-8

Keywords: Stochastic gradient methods; Diagonal Barzilai-Borwein rules; Variance reduced methods

Publication type: 04.2 Contributions in conference proceedings (in volume)


Use this identifier to cite or link to this document: https://hdl.handle.net/11392/2517990


SFERA, Research Products Archive of the University of Ferrara
