Thesis etd-06212024-114703

Thesis type

PhD thesis

Author

CONTI, FRANCESCO

URN

etd-06212024-114703

Title

A bridge between persistent homology and group equivariant non-expansive operators: theory and applications

Scientific disciplinary sector

MAT/03 - GEOMETRY

Degree programme

MATHEMATICS

Supervisors

tutor Dr. Moroni, Davide
co-supervisor Frosini, Patrizio
supervisor Dr. Pascali, Maria Antonietta

Keywords

geneo
persistent homology
topological data analysis

Defence session start date

04/07/2024

Availability

Full

Abstract

Topological Data Analysis (TDA) is proving to be an excellent tool for the shape analysis of digital data. Its recently found synergy with artificial intelligence gave rise to Topological Machine Learning (TML), which aims to combine the expressive power of computational topology with the accuracy of machine learning to provide a comprehensive and automatic framework for data classification. The aim of this thesis is twofold: to develop current applications of TML in practical scenarios, with emphasis on the most overlooked aspects of its pipeline, and to connect the theory of TDA with a broader class of maps, the Group Equivariant Non-Expansive Operators (GENEOs). In the first part of this dissertation, we develop a pipeline to study digital data by means of TML in order to validate the practical aspects of our theory. We apply this pipeline to benchmark and experimental datasets, achieving state-of-the-art accuracies in biomedical scenarios. Moreover, we perform an empirical but extensive study of the stability of the features arising from the various homological dimensions with respect to noise and to the distribution of points in the persistence diagram. Such a comparison is novel in the TML literature, and our findings show that concatenating the features from all available homological dimensions is the best approach in the vectorisation step. We then expand on the main concept of TDA, proving that the functor that computes persistence diagrams can be seen as a particular instance of a GENEO. The GENEO framework allows us to inject arbitrary equivariances into a machine learning setting and represents a possible new approach to neural network architecture. Next, we present the theory of GENEOs in full, together with their properties, such as convexity and concavity, under suitable assumptions.
This thesis expands the GENEO theory with two new tools for defining such operators: the use of symmetric functions and a characterization theorem for linear GENEOs between arbitrary functional spaces. Finally, we develop a new neural network architecture that uses GENEOs instead of neurons and show its potential in a couple of applications.

Files

File name	Size
Sintesi_tesi.pdf	23.74 Kb
Tesi_dottorato.pdf	13.52 Mb

ETD

Digital archive of the theses defended at the Università di Pisa
