Tesi etd-10292024-145541 |
Link copiato negli appunti
Tipo di tesi
Tesi di laurea magistrale
Autore
FASANO, PAOLO
URN
etd-10292024-145541
Titolo
Semantic Querying of Reconstructed 3D Environments Using Pre-trained 2D Foundation Models
Dipartimento
INFORMATICA
Corso di studi
INFORMATICA
Relatori
relatore Prof.ssa Giorgi, Daniela
relatore Prof. Carrara, Fabio
relatore Prof. Palma, Gianpaolo
relatore Prof. Carrara, Fabio
relatore Prof. Palma, Gianpaolo
Parole chiave
- 2D Foundation Models
- 3D reconstruction
- Hololens 2
- openClip
- Segment Anything
- semantic querying
Data inizio appello
29/11/2024
Consultabilità
Completa
Riassunto
Understanding and interacting with complex 3D environments is increasingly important in robotics, virtual reality, and autonomous systems. To address this need, this thesis project aims to develop a system that reconstructs a 3D model of the environment enhanced with semantic features. We project in the 3D model the semantic features extracted using pre-trained 2D foundational models, such as Segment Anything and OpenCLIP. This allows open-vocabulary queries about the environment in natural language that are not restricted by predefined categories or information and without requiring additional training.
The system is tested on real RGB-D data captured from a HoloLens 2 device.
The system is tested on real RGB-D data captured from a HoloLens 2 device.
File
Nome file | Dimensione |
---|---|
Tesi_Mag...asano.pdf | 39.43 Mb |
Contatta l’autore |