logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-11032021-222833


Tipo di tesi
Tesi di laurea magistrale
Autore
NANNINI, ALICE
URN
etd-11032021-222833
Titolo
Using Deep Learning-based Object Detection to extract context-specific information from digitized documents
Dipartimento
INGEGNERIA DELL'INFORMAZIONE
Corso di studi
ARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Relatori
relatore Cimino, Mario Giovanni Cosimo Antonio
relatore Vaglini, Gigliola
relatore Galatolo, Federico Andrea
relatore Bracaloni, Simone
Parole chiave
  • google cloud platform
  • yolo
  • faster r-cnn
  • theater scripts
  • transfer learning
  • information extraction
  • document layout analysis
  • region proposal
  • object detection
  • computer vision
  • neural network
  • artificial intelligence
  • deep learning
Data inizio appello
19/11/2021
Consultabilità
Non consultabile
Data di rilascio
19/11/2091
Riassunto
The computer vision and object detection techniques developed in recent years are dominating the state of the art, and are increasingly applied to document layout analysis resolutions. This paper wants to offer a method to process digitized documents for the purpose of extracting meaningful information. By fine-tuning object detectors such as Faster R-CNN and YOLO, we attempt to identify text sections of interest with bounding boxes and classify them into a specific category depending on the context in which the document is placed. The deep learning model is implemented using the Python programming language, and is eventually integrated into the back-end of a web application hosted on the Google Cloud Platform infrastructure.
File