logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-09212022-084503


Tipo di tesi
Tesi di laurea magistrale
Autore
VOLPI, GIULIA
URN
etd-09212022-084503
Titolo
Generative Model for Decision Trees
Dipartimento
INFORMATICA
Corso di studi
INFORMATICA
Relatori
relatore Guidotti, Riccardo
relatore Monreale, Anna
Parole chiave
  • classification
  • variational autoencoder
  • generative models
  • evolutionary algorithms
  • decision tree induction
Data inizio appello
07/10/2022
Consultabilità
Non consultabile
Data di rilascio
07/10/2025
Riassunto
Decision trees are among the most popular classification algorithms due to their knowledge representation that resembles human reasoning and is easy to understand. Commonly used decision tree induction methods, such as the CART algorithm, are based on a greedy top-down strategy. Although this approach is known to be an efficient heuristic, the resulting trees are only locally optimal and tend to have overly complex structures. An alternative way to search the parameter space of trees is to use global optimization methods such as evolutionary algorithms. In this thesis, I propose the GEN-Tree hybrid algorithm for the induction of decision trees. The first stage of the method consists of training a VAE model using various decision tree classifiers decoded in the form of linear chromosomes. Next, a genetic algorithm is used to explore the latent space of the trained VAE, with the aim of finding a decision tree with good predictive performance and a small size. I compared GEN-Tree with greedy and ensemble techniques using nine publicly available datasets. The results show that GEN-Tree is able to induce accurate decision trees with very low complexity, making them very interpretable.
File