ETD

Archivio digitale delle tesi discusse presso l'Università di Pisa

Tesi etd-03122021-000321


Tipo di tesi
Tesi di laurea magistrale
Autore
ROCCHIETTI, GUIDO
URN
etd-03122021-000321
Titolo
Common-Sense and Common-Knowledge. How much do Neural Language Models know about the world?
Dipartimento
FILOLOGIA, LETTERATURA E LINGUISTICA
Corso di studi
INFORMATICA UMANISTICA
Relatori
relatore Prof. Lenci, Alessandro
Parole chiave
  • common knowledge
  • common sense
  • diagnostic dataset
  • probing task
  • natural language inference
  • nlp
  • nli
  • natural language processing
Data inizio appello
26/04/2021
Consultabilità
Tesi non consultabile
Riassunto
Nowadays it has become more and more important to understand how much the neural models applied on Natural Language Processing can understand about language features. The standard method to address this kind of problem is to create some probing task in order to investigate the knowledge learned by pre-trained models for the Natural Language Inference task. The purpose of this thesis is to determine to which extent the models are able to address linguistic features such as the common sense and the common knowledge.​
We created a new data-set with 1000 couples of sentences regarding these kind of phenomena, defining a set of fine-grained categories to better describe the linguistic features. We tagged every pair of sentences with the expected label and we used the data-set to test different neural networks in order to analyze which kind of sub-phenomena they are able to understand.
File