Tesi etd-03102025-122617

Tipo di tesi

Tesi di laurea magistrale

Autore

PETIX, MARCO

URN

etd-03102025-122617

Titolo

Behaviour Characterization via Policy Supervectors in Multi-Task Reinforcement Learning

Dipartimento

INFORMATICA

Corso di studi

INFORMATICA

Relatori

relatore Prof. Bacciu, Davide
relatore Dott. Piccoli Elia

Parole chiave

behaviour characterization
gaussian mixture models
multi-task reinforcement learning
reinforcement learning
transfer learning

Data inizio appello

11/04/2025

Consultabilità

Completa

Riassunto

This thesis proposes a novel Multi-Task Reinforcement Learning (MTRL) framework that facilitates inter-environment policy transfer by leveraging state-based behavioural characterizations. Our method encodes policies through supervector representations derived from Gaussian Mixture Models (GMMs) trained on visited states. By embedding policies into a shared latent space, we enable similarity-based policy selection, guiding learning in new tasks based on previously acquired knowledge. This approach is evaluated across environments from the Highway-Env suite, aiming to demonstrate its effectiveness in accelerating adaptation, improving generalization, and identifying transferable skills between source and target tasks.

File

Nome file	Dimensione
petix_thesis.pdf	11.46 Mb
Contatta l’autore

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-03102025-122617