logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-03262024-145031


Tipo di tesi
Tesi di laurea magistrale
Autore
LLESHI, BLERTA
URN
etd-03262024-145031
Titolo
Combining Natural Language Processing and Deep Learning for Automated Code Documentation Generation and Retrieval
Dipartimento
INFORMATICA
Corso di studi
DATA SCIENCE AND BUSINESS INFORMATICS
Relatori
relatore Prof. Bondielli, Alessandro
relatore Merangolo, Francesco
Parole chiave
  • embeddings
  • large language models
  • natural language processing
  • retrieval
  • text generation
Data inizio appello
12/04/2024
Consultabilità
Non consultabile
Data di rilascio
12/04/2064
Riassunto
In contemporary society, Natural Language Processing (NLP) and Large Language Models (LLMs) have become integral components of numerous applications and industries, profoundly impacting various aspects of daily life. The fusion of advanced algorithms, vast datasets, and scalable computing infrastructure has propelled NLP and LLMs to the forefront of technological innovation. These transformative technologies are becoming indispensable in different sectors ranging from communication and commerce to healthcare and entertainment. The thesis explores the evolution of LLMs, tracing their development from early language models to contemporary transformer-based architectures, for text generation. It presents a case study on the use of generative Large Language Models for the automatic creation of documentation starting from code, and for the automatic recovery of documents using word/document embeddings techniques. The case study was addressed and developed within a corporate reality.
File