logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04292024-125827


Tipo di tesi
Tesi di laurea magistrale
Autore
LEOCATA, MARTINA
URN
etd-04292024-125827
Titolo
Authorship Verification for Cultural Heritage: The Case of the Aprocryphal Quixote
Dipartimento
FILOLOGIA, LETTERATURA E LINGUISTICA
Corso di studi
INFORMATICA UMANISTICA
Relatori
relatore Prof. Sebastiani, Fabrizio
relatore Prof. Moreo Fernández, Alejandro
Parole chiave
  • authorship attribution
  • authorship verification
  • machine learning
  • Miguel de Cervantes
  • text classification
Data inizio appello
28/05/2024
Consultabilità
Non consultabile
Data di rilascio
28/05/2094
Riassunto
El Segundo tomo del ingenioso hidalgo Don Quijote de la Mancha is an apocryphal work published under the pseudonym of Alonso Fernández de Avellaneda, and intended as a continuation of the first part of Miguel de Cervantes' immortal novel. Over the centuries, various hypotheses have been put forward as to its authorship. However, there is still no agreement among scholars on who the author of this ``aprocryphal Quixote'' might be.

In this dissertation, we propose the application of computational authorship identification methods based on machine learning, to the problem of identifying the author of the aprocryphal Quixote.
These methods are quantitative, and based on the automatic extraction and analysis of the stylistic (in the sense of stylometry) elements of a text. In particular, our research focuses on the development of authorship \textit{verification} models, i.e., models (implemented as automatic binary classifiers) whose task is to confirm or reject the hypothesis that a text has been written by a given a candidate, and authorship \textit{attribution} models, i.e., models (implemented as automatic single-label multiclass classifiers) whose task is to pick the most likely author of a text from a closed set of candidates.

Contrary to the hypothesis recently formulated by several scholars, the models we have developed seem to exclude that Jerónimo de Pasamonte (a former comrade-in-arms of Cervantes) might be the author of the apocryphal Quixote. Indeed, our models seem to exclude that any of the 13 Spanish authors we have considered, all of them indicated as possible candidates by Cervantes scholars, is Alonso Fernández de Avellaneda.

An interesting aspect that emerges from our analysis is the outcome of the authorship attribution model, which indicates Miguel de Cervantes himself as the most likely author among the 13 candidates considered. This result suggests that the apocryphal Quixote imitates Cervantes' style very effectively, since it misleads an automatic attribution model that, as we show in the dissertation, proved 100\% accurate in previous demanding experiments. However, as argued above, Avellaneda's style is not similar enough to Cervantes to deceive our equally accurate (as we also show) verification model.
File