
ETD

Digital archive of theses discussed at the University of Pisa

 

Thesis etd-04182020-105146


Thesis type
Master's thesis
Author
TONO, ILARIA
URN
etd-04182020-105146
Thesis title
A Comparative Study of Semantic Segmentation Methods over Context-specific Datasets for Virtual Reality Applications
Department
INGEGNERIA DELL'INFORMAZIONE
Course of study
EMBEDDED COMPUTING SYSTEMS
Supervisors
supervisor Prof. Tecchia, Franco
supervisor Prof. Slater, Mel
tutor Dr. Gallego, Jaume
Keywords
  • artificial neural networks
  • computer vision
  • deep learning
  • semantic segmentation
  • virtual reality
Graduation session start date
05/05/2020
Availability
Withheld
Release date
05/05/2090
Summary
Improving the semantic understanding of images and videos has recently become a key goal in computer vision. Artificial neural networks have played an important role in defining new ways to extract valuable information from large and diverse datasets. This capability makes them suitable for a wide range of applications, including Virtual Reality: Object Detection, Semantic Segmentation and Human Pose Estimation can be vital in reconstructing Virtual Environments from monocular images and videos. In this work we focus on Semantic Segmentation, the task of classifying an image at the pixel level, and we compare some of the state-of-the-art methods that address it. The evolution of such networks is strongly tied to both the quantity and the quality of the data used for learning. We study this problem by testing the models on two brand-new datasets covering music-related environments. The goal is to obtain an efficient network able to segment live concert videos, from which we want to build the Virtual Environment.
The models are evaluated in terms of the mean Intersection over Union (mIoU) achieved on the evaluation sets, and in terms of efficiency, by analyzing resource utilization and processing time during training and inference. We also discuss the finding that new data produced by Generative Adversarial Networks cannot be used as an aid to enrich a small dataset.
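As an illustration of the mIoU metric mentioned above, the following is a minimal sketch in plain Python, not the evaluation code used in the thesis. The function name `mean_iou`, the flat list representation of the label maps, and the choice to skip classes that appear in neither the prediction nor the ground truth are all assumptions made for this example.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean Intersection over Union for flattened per-pixel class labels.

    Hypothetical helper: classes absent from both pred and target are
    skipped rather than counted as IoU = 0 (an assumption of this sketch).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")

    intersection = [0] * num_classes  # pixels where pred == target == c
    pred_count = [0] * num_classes    # pixels predicted as class c
    target_count = [0] * num_classes  # pixels labeled as class c

    for p, t in zip(pred, target):
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[p] += 1

    ious = []
    for c in range(num_classes):
        # |A union B| = |A| + |B| - |A intersect B|
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:
            ious.append(intersection[c] / union)

    return sum(ious) / len(ious) if ious else float("nan")


# Toy example: six pixels, three classes.
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
print(mean_iou(pred, target, num_classes=3))
```

In the toy example the per-class IoU values are 1/3, 2/3 and 1/2, so the mean is 0.5 up to floating-point rounding. Over a whole evaluation set, the counts are commonly accumulated across all images before dividing, rather than averaging per-image scores; which variant the thesis uses is not stated in the summary.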