ETD

Digital archive of theses defended at the Università di Pisa

Thesis etd-03282025-172243


Thesis type
Master's thesis
Author
D'ORSI, DOMENICO
URN
etd-03282025-172243
Title
Design and implementation of a 3D-native pipeline for open-vocabulary 3D scene understanding
Department
INGEGNERIA DELL'INFORMAZIONE
Degree programme
ARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Supervisors
supervisor Prof. Tonellotto, Nicola
co-supervisor Ing. Falchi, Fabrizio
co-supervisor Ing. Carrara, Fabio
Keywords
  • 3d scene understanding
  • computer vision
Exam session start date
14/04/2025
Availability
Not available for consultation
Release date
14/04/2028
Abstract
The goal of this thesis was to design a framework for open-vocabulary 3D scene understanding that departs from the existing state-of-the-art solutions. Those solutions rely not only on three-dimensional data describing the input environments, but also on a wide range of RGB-D images of the same scenes taken from different angles, which enable more comprehensive and accurate semantic identification and understanding of the segmented objects.
By contrast, the framework we propose uses only three-dimensional point clouds of the environments, with no other data types to support the task. To achieve this, two distinct components are combined into a single data flow that starts from the real environment and ends with its semantic understanding.
File