Research - Current Postgraduates - Details

Department of Computer Science & Computer Engineering

Rusu, Laura
Course: PhD
Research Title/Topic: XML Warehousing and Mining
Supervisor: Assoc. Prof. Wenny Rahayu & Dr. David Taniar (Monash)
Description:
This project creates an integrated, non-redundant database of DNA sequences by defining a new data warehouse repository that filters and transfers data from existing DNA sequence data sources. The repository will be built on an XML database model. The need for it arises from the requirement for a non-redundant database stored in a structure that can capture all the data needed by the different applications to be built on top of it. Implementing the repository in an XML database will also ease integration with, and transformation to, other applications. One advantage is that both structured and semi-structured data can be stored in an XML database. DNA sequences have a regular structure, but that structure varies enough that mapping it to a relational database results in either a large number of columns with null values (which wastes space) or a large number of tables (which is inefficient). Another reason for using an XML database model for the repository is retrieval speed. Querying and data interchange are also done in XML-based languages such as XQuery, whose queries are concise, easily understood, and flexible enough to cover a broad spectrum of XML information sources, including both databases and documents.
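The idea in the description above can be illustrated with a minimal sketch, which is not the project's implementation. It assumes two hypothetical sources (`SOURCE_A`, `SOURCE_B`) with made-up field names, merges them into one XML tree, and treats records with the same sequence (ignoring case) as redundant. Optional fields simply appear as child elements when present, so no null columns are needed. Python's standard `xml.etree.ElementTree` supports only a subset of XPath, used here as a stand-in for XQuery.

```python
import xml.etree.ElementTree as ET

# Hypothetical records from two source databases; all field names are illustrative.
SOURCE_A = [
    {"id": "A1", "organism": "E. coli", "sequence": "ATGCGT"},
    {"id": "A2", "sequence": "GGCTAA", "gene": "lacZ"},
]
SOURCE_B = [
    {"id": "B7", "organism": "E. coli", "sequence": "atgcgt"},  # same sequence as A1
    {"id": "B8", "sequence": "TTGACA", "function": "promoter"},
]


def build_repository(*sources):
    """Merge source records into one XML tree, keeping each sequence once."""
    root = ET.Element("repository")
    seen = {}  # normalised sequence -> <entry> element already stored
    for source in sources:
        for record in source:
            key = record["sequence"].upper()
            if key in seen:
                # Redundant sequence: keep only a cross-reference to its id.
                ET.SubElement(seen[key], "xref").text = record["id"]
                continue
            entry = ET.SubElement(root, "entry", id=record["id"])
            # Optional fields become child elements only when they exist.
            for field, value in record.items():
                if field not in ("id", "sequence"):
                    ET.SubElement(entry, field).text = value
            ET.SubElement(entry, "sequence").text = key
            seen[key] = entry
    return root


repo = build_repository(SOURCE_A, SOURCE_B)

# XPath-style query: ids of entries that carry an <organism> element.
with_organism = [e.get("id") for e in repo.findall("entry[organism]")]

if __name__ == "__main__":
    print(ET.tostring(repo, encoding="unicode"))
    print(with_organism)
```

Using the exact sequence string as the redundancy key is a simplifying assumption; a real repository would likely need a more careful notion of equivalence between records.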
Last Updated: 14 October, 2009