ETD

Archivio digitale delle tesi discusse presso l'Università di Pisa

Tesi etd-05212020-091549


Tipo di tesi
Tesi di laurea magistrale
Autore
MATTOLINI, FRANCESCA
URN
etd-05212020-091549
Titolo
The evolution of jobs: automatic job titles extraction based on a text mining approach
Dipartimento
INGEGNERIA DELL'ENERGIA, DEI SISTEMI, DEL TERRITORIO E DELLE COSTRUZIONI
Corso di studi
INGEGNERIA GESTIONALE
Relatori
relatore Prof. Fantoni, Gualtiero
correlatore Dott. Manfredi, Pietro
Parole chiave
  • job title
  • occupations
  • work
  • automatic extraction
  • text mining
  • database
  • update
  • green economy
  • job visualizer
Data inizio appello
18/06/2020
Consultabilità
Non consultabile
Data di rilascio
18/06/2090
Riassunto
Nowadays, the job market is continuously evolving, and elementary professional figures are progressively diversifying into more specialised profiles in numerous application fields. To categorize every occupation and its related skills and technologies, worldwide job databases have been developed. These frameworks share common difficulties concerning data completeness and upgrades, including manual and sporadic updates. By exploiting text mining techniques, this thesis focused on the development of an automatic method for the identification and extraction of job titles from text sources. Firstly, a set of clues was defined and collected to uniquely allow the identification of job titles within any text files. The collection of these markers was performed by analysing scientific papers, occupational databases, online dictionaries, and multimedia vocabularies. In a second step, the identified set of clues was utilized to automatically extract compound job titles. This extraction was achieved by generating a list of rules able to recognize any kind of complement that specifies the application area of each base job title.
To demonstrate its broad effectiveness, the implemented algorithm was tested in the meaningful topic of the green economy. Lastly, a demo-web application was created and published online to allow the user to assess the algorithm functionalities.
Overall, we believe this work could constitute a useful approach for worldwide database authorities, international recruitment companies, and institutions involved in professional formation and training courses.
File