logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04042019-003101


Tipo di tesi
Tesi di laurea magistrale
Autore
AGOSTI, DANIELE
URN
etd-04042019-003101
Titolo
Exploitation of web and social network data for event detection and opinion mining: an enterprise application use case
Dipartimento
INGEGNERIA DELL'INFORMAZIONE
Corso di studi
COMPUTER ENGINEERING
Relatori
relatore Dott. Bechini, Alessio
correlatore Prof. Marcelloni, Francesco
correlatore Prof.ssa Lazzerini, Beatrice
correlatore Renda, Alessandro
Parole chiave
  • text mining
  • sentiment analysis
  • opinion mining
  • machine learning
  • information retrieval
Data inizio appello
03/05/2019
Consultabilità
Non consultabile
Data di rilascio
03/05/2089
Riassunto
Nowadays, a huge amount of information is available from many sources and in particular from Social Media. Integrating this information into business decision making is certainly a great strategic advantage and is becoming definitely necessary. Nonetheless, the information is described in natural language, so data extrapolation is the main challenge. This work studies the main techniques of text mining and opinion mining and applies them to an enterprise application use case. The project proposes an easy-to-use system to detect events with information about place, date and kind of event gathered by websites and to extract opinions on monitored topics of the Twitter domain.
The system offers an interface to define the resource and the keyword to analyze and show the results of the extracted information.
The proposed solution makes use of text analysis, named entity recognition, topic clustering, sentiment analysis and text classification to identify the significant events that are available on the web and most influential topics that are retrievable on the social network.
The project is implemented in Python and exploits popular library such as Flask and spaCy. Under ad-hoc testing cases, the developed system has shown to be effective in the identification of events, as well as in extracting proper information from Twitter posts.
File