logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-06212024-114703


Tipo di tesi
Tesi di dottorato di ricerca
Autore
CONTI, FRANCESCO
URN
etd-06212024-114703
Titolo
A bridge between persistent homology and group equivariant non-expansive operators: theory and applications
Settore scientifico disciplinare
MAT/03 - GEOMETRIA
Corso di studi
MATEMATICA
Relatori
tutor Dott. Moroni, Davide
correlatore Frosini, Patrizio
supervisore Dott. Pascali, Maria Antonietta
Parole chiave
  • geneo
  • persistent homology
  • topological data analysis
Data inizio appello
04/07/2024
Consultabilità
Completa
Riassunto
Topological Data Analysis (TDA) is proving to be an excellent tool for shape analysis of digital data. The recently found synergy with artificial intelligence gave rise to Topological Machine Learning (TML), which aims to combine the expressive power of computational topology with the accuracy of machine learning to provide a comprehensive and automatic framework for data classification. The aim of this thesis is twofold: to develop current applications of TML in practical scenarios, with emphasis on the most overlooked aspects of its pipeline, and to connect the theory of TDA with a broader class of maps, the Group Equivariant Non-Expansive Operators (GENEOs). In the first part of this dissertation, we develop a pipeline to study digital data by means of TML in order to validate the practical aspects of our theory. We apply this pipeline to benchmark and experimental datasets, achieving state-of-the-art accuracies in biomedical scenarios. Moreover, we perform an empirical but extensive study of the stability of features arising from the various homological dimensions with respect to noise and points distribution in the persistence diagram. Such a comparison is novel in the TML literature and our findings show that results coming from the concatenation of each homological dimension available are the best approach in the vectorisation step. We later expand on the main concept of TDA, proving that the functor that computes persistence diagrams can be seen as a particular instance of GENEOs. The GENEO framework allows us to inject arbitrary equivariances in a machine learning setting and represents a new possible approach to neural network architecture. Next, we fully present the theory of GENEOs and their properties, such as convexity and concavity, under suitable assumptions. This thesis expand the GENEO theory with two new tools to define such operators, namely using symmetric functions and a characterization theorem of linear GENEOs between arbitrary functional spaces. Finally, we develop a new neural network architecture with GENEOs instead of neurons and show its potential in a couple of applications.
File