Projects per year
Abstract
Various tasks in data science are modeled utilizing the variational regularization approach, where manually selecting regularization parameters presents a challenge. The difficulty gets exacerbated when employing regularizers involving a large number of hyperparameters. To overcome this challenge, bilevel learning can be employed to learn such parameters from data. However, neither exact function values nor exact gradients with respect to the hyperparameters are attainable, necessitating methods that only rely on inexact evaluation of such quantities. State-of-the-art inexact gradient-based methods a priori select a sequence of the required accuracies and cannot identify an appropriate step size since the Lipschitz constant of the hypergradient is unknown. In this work, we propose an algorithm with backtracking line search that only relies on inexact function evaluations and hypergradients and show convergence to a stationary point. Furthermore, the proposed algorithm determines the required accuracy dynamically rather than manually selected before running it. Our numerical experiments demonstrate the efficiency and feasibility of our approach for hyperparameter estimation on a range of relevant problems in imaging and data science such as total variation and field of experts denoising and multinomial logistic regression. Particularly, the results show that the algorithm is robust to its own hyperparameters such as the initial accuracies and step size.
| Original language | English |
|---|---|
| Pages (from-to) | 906-936 |
| Number of pages | 31 |
| Journal | SIAM Journal on Mathematics of Data Science |
| Volume | 7 |
| Issue number | 3 |
| Early online date | 3 Jul 2025 |
| DOIs | |
| Publication status | Published - 30 Sept 2025 |
Acknowledgements
This research made use of Hex, the GPU Cloud in the Department of Computer Science at the University of Bath.Keywords
- backtracking line search
- bilevel learning
- bilevel optimization
- hyperparameter optimization
- machine learning
ASJC Scopus subject areas
- Applied Mathematics
- Computational Mathematics
- Statistics and Probability
Fingerprint
Dive into the research topics of 'An Adaptively Inexact First-Order Method for Bilevel Optimization with Application to Hyperparameter Learning'. Together they form a unique fingerprint.Projects
- 1 Active
-
Programme Grant: Mathematics of Deep Learning
Budd, C. (PI) & Ehrhardt, M. (CoI)
Engineering and Physical Sciences Research Council
31/01/22 → 30/07/27
Project: Research council