logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-02042025-213315


Tipo di tesi
Tesi di laurea magistrale
URN
etd-02042025-213315
Titolo
A Scalable Multi-Modal Perception System for Cognitive Architectures in Humanoid Robotics
Dipartimento
INGEGNERIA DELL'INFORMAZIONE
Corso di studi
ARTIFICIAL INTELLIGENCE AND DATA ENGINEERING
Parole chiave
  • artificial intelligence
  • cognitive architectures
  • deep learning
  • human robot interaction
  • micro services
  • robotics
Data inizio appello
21/02/2025
Consultabilità
Non consultabile
Data di rilascio
21/02/2028
Riassunto (Inglese)
Riassunto (Italiano)
This thesis presents a perceptual system for Abel, a social robot, to construct a "meta-scene"—a coherent representation of its environment. We built a Go-based middleware that coordinates real-time data flow between neural network-powered modules.

Key modules include a Voice Activity Detector, Speech-to-Text converter, Object and Subject Detectors, Depth Estimator, and Saliency Estimator. These operate asynchronously, contributing to a shared representation maintained by the server. A Python-based client library enhances flexibility and extensibility.

Compared to existing middleware, our system offers superior modularity, low-latency processing, and adaptability. Benchmarks confirm its efficiency in handling sensory data and enabling multi-modal perception.
File