About CS&CE
Prospective Students
Research
Staff
Dept Comp Sci & Comp Eng
Contact Details
La Trobe University
Victoria 3086
AUSTRALIA
Tel: +61 3 9479 1107
Fax: +61 3 9479 3060
Email: info
@cs.latrobe.edu.au
|
 |
Research - Current Postgraduates - Details
Department of Computer Science & Computer Engineering
| Nguyen, Quang Hong |
|
Course:
|
PhD |
|
Research Title/Topic:
|
Mining XML Documents based on Object-Oriented Semantics |
|
Supervisor:
|
Assoc. Prof. Wenny Rahayu, Dr. Kinh Nguyen, and Dr. David Taniar |
Description:
Due to the ubiquitous dissemination of semi structured data in XML format, much research effort has been devoted to integrate large collections of such documents from different data sources. Commonalities and differences between XML documents can be uncovered so that data integration systems could allow users to effectively query and retrieve information from a multitude of data sources. In this context, an effective and efficient extraction of objects and their relationships out of documents is a decisive success factor of a data integration system. This research proposes a novel approach for discovering interesting real-world objects and relationships from XML documents based on their semantics from object-oriented perspective. It also introduces a set of trimming rules to prune schema trees for better performance and high-quality result. From the discovered objects and relationships, our approach produces a set of schema representatives which can be effectively used for a wide range of applications such as data integration, data management and data mining tasks. Experiments on both synthetic and real datasets will be extensively performed to evaluate our work. |
|
|