Thesis etd-04122023-113901

Thesis type

Master's thesis

Author

ZHANG, CHENXIANG

URN

etd-04122023-113901

Title

Training, Architecture, and Prior for Deterministic Uncertainty Methods

Department

COMPUTER SCIENCE

Degree programme

COMPUTER SCIENCE

Supervisors

supervisor Prof. Micheli, Alessio

Keywords

deep learning
machine learning
uncertainty estimation

Graduation session start date

26/05/2023

Availability

Full

Abstract

Accurate and efficient uncertainty estimation is crucial for building reliable Machine Learning (ML) models capable of providing calibrated uncertainty estimates, generalizing, and detecting Out-Of-Distribution (OOD) datasets. To this end, Deterministic Uncertainty Methods (DUMs) are a promising model family capable of performing uncertainty estimation in a single forward pass. This work investigates important design choices in DUMs: (1) we show that training schemes that decouple the core architecture from the uncertainty head can significantly improve uncertainty performance; (2) we demonstrate that the expressiveness of the core architecture is crucial for uncertainty performance, and that additional architectural constraints intended to avoid feature collapse can worsen the trade-off between OOD generalization and OOD detection; (3) contrary to other Bayesian models, we show that the prior defined by DUMs does not have a strong effect on the final performance.

File

File name	Size
thesis.pdf	10.84 MB

ETD

Digital archive of theses defended at the Università di Pisa
