logo SBA

ETD

Archivio digitale delle tesi discusse presso l’Università di Pisa

Tesi etd-04092019-141124


Tipo di tesi
Tesi di laurea magistrale
Autore
CERVELLI, ELENA
URN
etd-04092019-141124
Titolo
Design and development of a machine learning process
Dipartimento
INGEGNERIA DELL'ENERGIA, DEI SISTEMI, DEL TERRITORIO E DELLE COSTRUZIONI
Corso di studi
INGEGNERIA GESTIONALE
Relatori
relatore Prof. Fantoni, Gualtiero
relatore Dott. Chiarello, Filippo
Parole chiave
  • classification
  • decision tree
  • machine learning
  • NER
  • neural network
  • NLP
  • patent
  • text mining
  • user
Data inizio appello
02/05/2019
Consultabilità
Non consultabile
Data di rilascio
02/05/2089
Riassunto
The present work proposes a method for the automatic extraction of textual elements within documents, in the specific instance of users within patents. The extraction is made on a massive quantity of patents (almost 40.000) and it is automatic thanks to the integration of text mining and machine learning techniques. The techniques are combined to pursue two conflicting objectives: getting an effective classification tool and, at the same time, obtaining a fast and low-memory algorithm and its classification logics understanding. The management approach to the problem has allowed to keep the machine learning process under control, thanks to the use of the design tool “HP tree”, a hypothesis tree.
File