Advanced Data Management and Analysis using SAS
Subject code: MA5831:03
This subject will provide students with cutting-edge tools and techniques for high-performance and large-scale computing, with focus on computer models and software designed to handle Big Data sets in a distributed and/or parallel fashion. Particular focus will be given to distributed and parallel computing using Map-Reduce/Hadoop and similar models for processing Big Data sets.
Software platform: SAS exclusively and Hadoop
Learning outcomes
-
List the different systems and approaches for high-performance and large-scale computing, as well as explain their differences
-
Conceptually describe and apply models for distributed and parallel computing of Big Data sets, such as MapReduce and Spark
-
Choose and apply different techniques and software for distributed and cloud computing of Big Data, such as Hadoop
Assessment
Assessment for this course will occur at various times across the seven-week study period. Tasks may include online quizzes, discussion board activity, portfolio development, case studies, reflection, literature reviews presentations and reports.Feedback will be provided to you throughout the study period as well as a final grade at the conclusion of the study period. Click here to find out more about this subject's assessments.
This is one of the interdisciplinary subjects studied in the online Master of Data Science.
Please note, course structure and content are subject to change. For information on all course subjects download the course guide.