logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-09232024-223942


Tipo di tesi
Tesi di laurea magistrale
Autore
VOLPI, FEDERICO
URN
etd-09232024-223942
Titolo
Balancing Fairness and Interpretability in Clustering: Introducing FairParTree
Dipartimento
INFORMATICA
Corso di studi
DATA SCIENCE AND BUSINESS INFORMATICS
Relatori
relatore Prof. Guidotti, Riccardo
correlatore Landi, Cristiano
correlatore Marchiori Manerba, Marta
Parole chiave
  • clustering
  • decision tree
  • explainability
  • fairness
  • fairpartree
  • partree
Data inizio appello
11/10/2024
Consultabilità
Non consultabile
Data di rilascio
11/10/2027
Riassunto
The revolution involving Machine Learning and Artificial Intelligence has transformed data analytics, making algorithms play important roles in decision-making processes across various domains, even in sensitive scenarios. Indeed, traditional clustering algorithms often lack interpretability and exhibit biases, leading to discriminatory practices and opaque decision-making. We introduce FairParTree, a novel fair and interpretable clustering algorithm built upon the ParTree clustering algorithm. FairParTree integrates fairness constraints directly into the clustering process, making sure that the resulting clusters do not disproportionately disadvantage any particular group. The algorithm employs three fairness definitions: demographic fairness, individual fairness, and group fairness. By leveraging the structure of decision trees, FairParTree also enhances the interpretability of clustering results, providing clear and understandable explanations for cluster assignments. We evaluate FairParTree's performance against state-of-the-art competitors. Through extensive experiments, we show that it maintains strong performances in w.r.t. fairness, explainability, and clustering quality across different dataset sizes, asserting its value as a fair, explainable, and efficient clustering algorithm.
File