Tesi etd-04162021-135903

Tipo di tesi

Tesi di laurea magistrale

Autore

BERTI, STEFANO

URN

etd-04162021-135903

Titolo

A new MusAE: Adversarial Transformer Autoencoder for Music

Dipartimento

INFORMATICA

Corso di studi

INFORMATICA

Relatori

relatore Prof. Bacciu, Davide
correlatore Dott. Valenti, Andrea

Parole chiave

adversarial autoencoder
autoencoder
gan
midi
music
transformer
wgan-gp

Data inizio appello

07/05/2021

Consultabilità

Completa

Riassunto

Automatic polyphonic music generation is a challenging temporal task. Asidefrom the temporal dimension, a model dealing with polyphonic music shouldunderstand which combination of notes are consonant and can be groupedtogether into chords. The ability of deep generative models to learn from bigcollection of data makes it a suitable approach to model complex data distri-butions such as the one underlying music. Among them, the original MusAEmodel leveraged an adversarial autoencoder to generate smooth interpola-tions between two polyphonic musical sequences made by monophonic se-quences. A problem presented by the original MusAE is that the generationof musical sequences by sampling from the latent space lead to inconsistentsamples. This thesis extends the original MusAE integrating a state-of-the-artmodel for sequence to sequence tasks: the Transformer. Moreover, the eventdata representation is replaced with a new type of representation that, to-gether with the relative positional encoding, allows to deal with polyphonicinstruments. The experiments show that the new MusAE is able to deal withpolyphonic instruments and to reconstruct songs with a similar style to thosein input. Furthermore it allows for smooth interpolations between two mu-sical sequences and it is able to generate new musical sequences with appre-ciable and coherent dynamics and structure.

File

Nome file	Dimensione
BertiSte...hesis.pdf	1.79 Mb
Contatta l’autore

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04162021-135903