Tesi etd-07062022-160622

Tipo di tesi

Tesi di laurea magistrale

Autore

LANDI, CRISTIANO

Indirizzo email

c.landi7@studenti.unipi.it, cri98li@gmail.com

URN

etd-07062022-160622

Titolo

Interpretable Machine Learning Models for Trajectory Classification

Dipartimento

INFORMATICA

Corso di studi

DATA SCIENCE AND BUSINESS INFORMATICS

Relatori

relatore Prof.ssa Monreale, Anna
relatore Prof. Guidotti, Riccardo
relatore Dott. Spinnato, Francesco

Parole chiave

classification
interpretable
ml
trajectory
xai

Data inizio appello

22/07/2022

Consultabilità

Non consultabile

Data di rilascio

22/07/2092

Riassunto

With the increase in storage device capacities and data transmission, it becomes easier to collect data. Among the different data types collected, a large space is reserved for mobility data as it is used in various applications ranging from location-based to route planning systems.
The most accurate and effective algorithms these systems use rely on complex statistical models humans cannot understand.
Researchers have recently proposed various approaches to mitigate the interpretability problem in recent years.
Indeed, these approaches formed a new branch of Artificial Intelligence called eXplainable AI (XAI).

Unfortunately, although researchers, business companies, and governments exploit mobility data to make decisions that impact lives in many ways, mobility data analysis is an under-experienced topic. There are few interpretable models and even fewer XAI approaches for black-box models at the current stage.
In this thesis, we propose three machine learning algorithms for GPS trajectory classification.
Our idea is to make some effective and intuitive transformations to the input data to use a simple interpretable model without losing performance.
Interpretability in our methods comes from the ability to highlight the parts of a trajectory that made the greatest contribution.
All proposed models attempt to extract discriminating subtrajectories using techniques from pattern mining, clustering and time series fields and apply transformations to the mobility data similar to those used for time series classification.
We also evaluate our proposals using real-world datasets to assess how well the different ideas perform with this particular data type.
The results show that one of our algorithm is an order of magnitude faster than the other proposals while maintaining comparable if not better accuracy than non-interpretable state-of-the-art classifiers.

File

Nome file	Dimensione
Tesi non consultabile. Contatta l’autore

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-07062022-160622