logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04132022-234029


Tipo di tesi
Tesi di laurea magistrale
Autore
VERNA, FEDERICA
URN
etd-04132022-234029
Titolo
Data lineage service development and analysis via process mining
Dipartimento
INGEGNERIA DELL'INFORMAZIONE
Corso di studi
ARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Relatori
relatore Cimino, Mario Giovanni Cosimo Antonio
relatore Vaglini, Gigliola
Parole chiave
  • processmining
  • datalineage
  • spark
  • spline
  • conformancechecking
Data inizio appello
29/04/2022
Consultabilità
Non consultabile
Data di rilascio
29/04/2092
Riassunto
Nowadays it is not enough to have data to generate business value, it
is also necessary to organize it in order to make the most of it and having
good quality data allows to trust the data being used ensuring better
business decisions. Therefore this thesis work will focus firstly on the
development of a data lineage service in the context of a data catalog, i.e
an organized inventory of data assets, identifying data movement across
the enterprise with the purpose of providing support to data governance in
terms of data validation, data usage control, regulatory compliance, and
data privacy. Finally, process mining techniques will be applied to verify
that the processes to which the data are exposed are compliant and with internal regulations and observe the expected sequence of operations. This will be done by using conformance checking techiniques, which allow a process model to be compared with event logs of the same process.
File