| Tesi etd-04292024-125827 | 
    Link copiato negli appunti
  
    Tipo di tesi
  
  
    Tesi di laurea magistrale
  
    Autore
  
  
    LEOCATA, MARTINA  
  
    URN
  
  
    etd-04292024-125827
  
    Titolo
  
  
    Authorship Verification for Cultural Heritage: The Case of the Aprocryphal Quixote
  
    Dipartimento
  
  
    FILOLOGIA, LETTERATURA E LINGUISTICA
  
    Corso di studi
  
  
    INFORMATICA UMANISTICA
  
    Relatori
  
  
    relatore Prof. Sebastiani, Fabrizio
relatore Prof. Moreo Fernández, Alejandro
  
relatore Prof. Moreo Fernández, Alejandro
    Parole chiave
  
  - authorship attribution
- authorship verification
- machine learning
- Miguel de Cervantes
- text classification
    Data inizio appello
  
  
    28/05/2024
  
    Consultabilità
  
  
    Non consultabile
  
    Data di rilascio
  
  
    28/05/2094
  
    Riassunto
  
  El Segundo tomo del ingenioso hidalgo Don Quijote de la Mancha is an apocryphal work published under the pseudonym of Alonso Fernández de Avellaneda, and intended as a continuation of the first part of Miguel de Cervantes' immortal novel. Over the centuries, various hypotheses have been put forward as to its authorship. However, there is still no agreement among scholars on who the author of this ``aprocryphal Quixote'' might be.
In this dissertation, we propose the application of computational authorship identification methods based on machine learning, to the problem of identifying the author of the aprocryphal Quixote.
These methods are quantitative, and based on the automatic extraction and analysis of the stylistic (in the sense of stylometry) elements of a text. In particular, our research focuses on the development of authorship \textit{verification} models, i.e., models (implemented as automatic binary classifiers) whose task is to confirm or reject the hypothesis that a text has been written by a given a candidate, and authorship \textit{attribution} models, i.e., models (implemented as automatic single-label multiclass classifiers) whose task is to pick the most likely author of a text from a closed set of candidates.
Contrary to the hypothesis recently formulated by several scholars, the models we have developed seem to exclude that Jerónimo de Pasamonte (a former comrade-in-arms of Cervantes) might be the author of the apocryphal Quixote. Indeed, our models seem to exclude that any of the 13 Spanish authors we have considered, all of them indicated as possible candidates by Cervantes scholars, is Alonso Fernández de Avellaneda.
An interesting aspect that emerges from our analysis is the outcome of the authorship attribution model, which indicates Miguel de Cervantes himself as the most likely author among the 13 candidates considered. This result suggests that the apocryphal Quixote imitates Cervantes' style very effectively, since it misleads an automatic attribution model that, as we show in the dissertation, proved 100\% accurate in previous demanding experiments. However, as argued above, Avellaneda's style is not similar enough to Cervantes to deceive our equally accurate (as we also show) verification model.
In this dissertation, we propose the application of computational authorship identification methods based on machine learning, to the problem of identifying the author of the aprocryphal Quixote.
These methods are quantitative, and based on the automatic extraction and analysis of the stylistic (in the sense of stylometry) elements of a text. In particular, our research focuses on the development of authorship \textit{verification} models, i.e., models (implemented as automatic binary classifiers) whose task is to confirm or reject the hypothesis that a text has been written by a given a candidate, and authorship \textit{attribution} models, i.e., models (implemented as automatic single-label multiclass classifiers) whose task is to pick the most likely author of a text from a closed set of candidates.
Contrary to the hypothesis recently formulated by several scholars, the models we have developed seem to exclude that Jerónimo de Pasamonte (a former comrade-in-arms of Cervantes) might be the author of the apocryphal Quixote. Indeed, our models seem to exclude that any of the 13 Spanish authors we have considered, all of them indicated as possible candidates by Cervantes scholars, is Alonso Fernández de Avellaneda.
An interesting aspect that emerges from our analysis is the outcome of the authorship attribution model, which indicates Miguel de Cervantes himself as the most likely author among the 13 candidates considered. This result suggests that the apocryphal Quixote imitates Cervantes' style very effectively, since it misleads an automatic attribution model that, as we show in the dissertation, proved 100\% accurate in previous demanding experiments. However, as argued above, Avellaneda's style is not similar enough to Cervantes to deceive our equally accurate (as we also show) verification model.
    File
  
  | Nome file | Dimensione | 
|---|---|
| Tesi non consultabile. | |
 
		