ETD

Archivio digitale delle tesi discusse presso l'Università di Pisa

Tesi etd-11242009-124600


Tipo di tesi
Tesi di dottorato di ricerca
Autore
ROMEI, ANDREA
URN
etd-11242009-124600
Titolo
XQuake: an XML-based Knowledge Discovery Environment
Settore scientifico disciplinare
INF/01
Corso di studi
INFORMATICA
Relatori
tutor Prof. Turini, Franco
Parole chiave
  • query language
  • knowledge discovery
  • inductive database
  • XML
  • data mining
  • XQuery
Data inizio appello
10/12/2009
Consultabilità
Non consultabile
Data di rilascio
10/12/2049
Riassunto
Data mining is the analysis of large volumes of data to find unsuspected relationships and to summarize the data in novel ways, that are both understandable and useful to the data owner. Nowadays, the rapid growth of semi-structured sources raises the need of designing and implementing environments for data mining out of XML data.
On the basis of the principles of the inductive database theory, this dissertation presents a flexible data mining system with capabilities of obtaining, maintaining, representing and querying induced, deduced and prior knowledge, stored inside native XML databases. In particular, it summarizes our three-years experience in the design and development of XQuake, a query language that extends XQuery to support mining primitives. Features of the language are an intuitive syntax, a good expressiveness, and the capability of dealing uniformly with data mining entities. A detail of its implementation and the evaluation of its performance are also given.
File