logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-02012023-112945


Tipo di tesi
Tesi di laurea magistrale
Autore
MACRI', ARMANDO
URN
etd-02012023-112945
Titolo
Using deep learning for structure recognition in vehicle registration documents
Dipartimento
INGEGNERIA DELL'INFORMAZIONE
Corso di studi
ARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Relatori
relatore Cimino, Mario Giovanni Cosimo Antonio
correlatore Alfeo, Antonio Luca
tutor Xhani, Orges
Parole chiave
  • deep learning
  • detr
  • document layout analysis
  • faster r-cnn
  • object detection
  • transfer learning
  • yolo
Data inizio appello
17/02/2023
Consultabilità
Non consultabile
Data di rilascio
17/02/2093
Riassunto
The research on object detection has come a long way and has in recent years become so efficient and reliable that we see many practical applications in simple detection of objects in pictures to advance segmentation in video recordings. State-of-the-art deep learning techniques that are designed to work on images, are adapted to the domain of digital documents. In this thesis work, an automatic method to extract meaningful information from documents is proposed. The study is based on the most recent object detection algorithms trained to detect various fields present in an Italian vehicle certificates, a document type that has not been considered in the literature. A custom dataset of 453 certificates images has been annotated. Three different models (YOLO, DETR, Faster RCNN) have been compared and evaluated on the mean Average Precision (mAP), common objects in context (COCO) evaluation metrics. Several experiments have been carried out, exploiting different combinations of data augmentation techniques. The obtained results are encouraging, leading to the conclusion that object detection is a viable method for information extraction.
File