BIG DATA MANAGEMENT ON THE CLOUD

CSE5BDC

2020

Credit points: 15

Subject outline

In this subject we will study the current state-of-the-art technologies for analysing huge amounts of data and responding to millions of user requests within one second. Currently the most cost efficient way of achieving the above aim is to use large-scale cloud-based services offered by vendors such as Amazon, Google, IBM and Microsoft. We will study how to use the cloud services provided by these vendors to meet the big data needs of businesses. This subject includes the following topics: cloud architectures, parallel database systems, map and reduce, key value stores, transaction support in the cloud, virtualization, and multi-tenant database systems.

School: Engineering and Mathematical Sciences (Pre 2022)

Credit points: 15

Subject Co-ordinator: Zhen He

Available to Study Abroad/Exchange Students: Yes

Subject year level: Year Level 5 - Masters

Available as Elective: No

Learning Activities: N/A

Capstone subject: No

Subject particulars

Subject rules

Prerequisites: CSE4OOF AND CSE4DBF

Co-requisites: N/A

Incompatible subjects: CSE4BDC OR CSE3BDC

Equivalent subjects: N/A

Quota Management Strategy: N/A

Quota-conditions or rules: N/A

Special conditions: N/A

Minimum credit point requirement: N/A

Assumed knowledge: N/A

Learning resources

Hadoop in Action

Resource Type: Other resource

Resource Requirement: Recommended

Author: Chuck Lam

Year: 2010

Edition/Volume: N/A

Publisher: Manning

ISBN: N/A

Chapter/article title: N/A

Chapter/issue: N/A

URL: N/A

Other description: N/A

Source location: N/A

Hadoop The Definitive Guide

Resource Type: Other resource

Resource Requirement: Recommended

Author: Tom White

Year: 2015

Edition/Volume: N/A

Publisher: O'Reilly Media

ISBN: N/A

Chapter/article title: N/A

Chapter/issue: N/A

URL: N/A

Other description: N/A

Source location: N/A

Career Ready

Career-focused: No

Work-based learning: No

Self sourced or Uni sourced: N/A

Entire subject or partial subject: N/A

Total hours/days required: N/A

Location of WBL activity (region): N/A

WBL addtional requirements: N/A

Graduate capabilities & intended learning outcomes

Graduate Capabilities

Intended Learning Outcomes

01. Explain the benefits of cloud computing over traditional methods for managing big data.
02. Identify the best type of cloud-based service to use for range of application scenarios.
03. Write efficient map and reduce programs to analyse large data sets.
04. Write efficient programs that query cloud-hosted database systems.
05. Setup cloud-hosted database systems.
06. Identify advantages and disadvantages of state-of-the-art technologies developed by research projects in the area of big data analytics using cloud-based services.
07. Analyse complex cloud computing architectures and propose remedies to any identified flaws.

Bendigo, 2020, Semester 1, Day

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Subject Instance Co-ordinator: Zhen He

Class requirements

Computer LaboratoryWeek: 10 - 22
One 2.00 hours computer laboratory per week on weekdays during the day from week 10 to week 22 and delivered via face-to-face.

LectureWeek: 10 - 22
One 2.00 hours lecture per week on weekdays during the day from week 10 to week 22 and delivered via face-to-face.

Assessments

Assessment elementCommentsCategoryContributionHurdle%ILO*

10 Laboratory Reports (equivalent to 1000 words)Each lab report is equivalent to a 100 word essay amount of work. The lab reports will be marked and returned to the students before the start of the following lab.

N/AN/AN/ANo10SILO3, SILO4, SILO5

One 3-hour examination (equivalent to 3000 words)

N/AN/AN/ANo60SILO1, SILO2, SILO3, SILO4, SILO6, SILO7

Programming Assignment (equivalent to 2500 words)The students are required to achieve greater than 50% for the non-exam components (including lab reports) as a hurdle

N/AN/AN/AYes30SILO3

Melbourne (Bundoora), 2020, Semester 1, Day

Overview

Online enrolment: Yes

Maximum enrolment size: N/A

Subject Instance Co-ordinator: Zhen He

Class requirements

Computer LaboratoryWeek: 11 - 22
One 2.00 hours computer laboratory per week on weekdays during the day from week 11 to week 22 and delivered via face-to-face.

LectureWeek: 10 - 22
One 2.00 hours lecture per week on weekdays during the day from week 10 to week 22 and delivered via face-to-face.

Assessments

Assessment elementCommentsCategoryContributionHurdle%ILO*

10 Laboratory Reports (equivalent to 1000 words)Each lab report is equivalent to a 100 word essay amount of work. The lab reports will be marked and returned to the students before the start of the following lab.

N/AN/AN/ANo10SILO3, SILO4, SILO5

One 3-hour examination (equivalent to 3000 words)

N/AN/AN/ANo60SILO1, SILO2, SILO3, SILO4, SILO6, SILO7

Programming Assignment (equivalent to 2500 words)The students are required to achieve greater than 50% for the non-exam components (including lab reports) as a hurdle

N/AN/AN/AYes30SILO3