MSc Data Analytics

Start date: September 2019
Duration: 12 months full-time
Programme code: COMT130

Apply now

The content of our courses is reviewed annually to make sure it's up-to-date and relevant. Individual modules are occasionally updated or withdrawn. This is in response to discoveries through our world-leading research; funding changes; professional accreditation requirements; student or employer feedback; outcomes of reviews; and variations in staff or student numbers.In the event of any change we'll consult and inform students in good time and take reasonable steps to minimise disruption


Overview
  • Be in demand - our course has been developed to meet skills gaps identified by industry
  • Gain the specific skills increasingly valued by employers
  • Teaching informed by researchers working in relevant areas such as Machine Learning and Natural Language Processing
  • The Department of Computer Science is 5th in the UK for Research Excellence (REF 2014)

The MSc in Data Analytics is designed for students with a numerate background (for example a first degree in Mathematics, Economics, Accounting, Psychology, Physics or Chemistry) as well as graduates already working in industry. The programme will enable you to utilise and apply your previous academic experience to gain the skills required to work with the large quantities of data that need to be analysed in the modern world.

What is Data Analytics?

Around 200 million tweets are sent per day. Google receives over 2.4 million search requests per minute. The UK’s Department of Health plan to sequence 100,000 genomes, each of which generates 200 GB for data. Walmart’s database contains over 2.5 petabytes of data from the retailer’s 1 million customer transactions per hour. Who will analyse all this data?

Being able to quickly and efficiently analyse large amounts of electronic data is becoming increasingly important for a wide range of org anisations. Huge amounts of data are currently available and the volume being produced is growing rapidly.

Data is generated from a wide range of sources including medicine, use of social media, scientific experiments and sensor networks. This data exists in a variety of formats ranging from structured (e.g. spreadsheets and sensor data) to unstructured (e.g. text, images, video and speech). Deriving information from this data has become one of the key challenges within Computer Science.

Data Analytics focuses on managing vast amounts of information and transforming it into actionable knowledge. The programme teaches the key skills that are required to carry out practical analysis of the types of data sets that need to be interpreted in the modern world. The types of data sets encountered include large data sets as well as structured and unstructured data. The programme makes use of techniques developed within a range of disciplines, including computer science, artificial intelligence, mathematics and statistics.

Content

Course outline

The course covers key techniques for analysing and interpreting data. It is taught collaboratively by two departments - Computer Science, and Mathematics and Statistics.

The Department of Computer Science manages the course and teaches a range of topics, including the Python programming language. Modules in Machine Learning show how information can be derived from data using statistical learning and how these approaches can be applied on large scale using open source technologies. Modules in Natural Language Processing introduce techniques for analysing unstructured data. A module in mathematics introduces key statistical concepts and the R programming language, while another covers topics about the handling and governance of digital information. A team project provides the opportunity to apply techniques learned in other modules to an industrially relevant problem.

There are also options to study modules on parallel computing (including GPUs and CUDA) and computer security. You will have the opportunity to put these techniques into practise with the data analytics project in which you can explore a problem of your own choosing in depth. Projects are carried out in collaboration with providers of data (either internal or external) and completed over the summer.

The MSc Data Analytics consists of taught and research components. The taught component consists of two 15 week semesters from late September until the following June. A research project is then carried out over the summer until mid-September.

Course content

Core modules

Text Processing

This module introduces fundamental concepts and ideas in natural language text processing, covers techniques for handling text corpora, and examines representative systems that require the automated processing of large volumes of text. The course focusses on modern quantitative techniques for text analysis and explores important models for representing and acquiring information from texts.

Machine Learning and Adaptive Intelligence

The module is about core technologies underpinning modern artificial intelligence. The module will introduce statistical machine learning and probabilistic modelling and their application to describing real world phenomena. The module will give students a grounding in modern state of the art algorithms that allow modern computer systems to learn from data.

Statistical Data Science in R

This module starts in Intro Week with a speedy review of basic background mathematics and statistics, and an introduction to R and LaTeX. The module will then introduce students to a range of statistical and programming techniques and give practice in their implementation and interpretation using the software R. It aims to help students develop the knowledge and experience to select and use appropriate techniques for a variety of problems. The emphasis will be on practical application of techniques and knowledge of their scope rather than development of theoretical underpinnings. Areas to be covered include: exploratory data analysis, simple checks on data, density estimation, simulation, programming and optimization.

Industrial Team Project

The Industrial Team project provides the opportunity for students to engage in industry inspired research work. It is undertaken in groups. Projects are suggested by industrial problems and supervised by Department of Computer Science staff or industry experts. Students form groups and choose a project which interests them, then refine the scope of the research by conducting a thorough analysis of the topic area and formulating a solution also with the help of their supervisor. The project is developed under strong supervision. The students present their project overview within the first few weeks of the semester and submit a report on the work at the end of semester.

Natural Language Processing

This module provides an introduction to the field of computer processing of written natural language, known as Natural Language Processing (NLP). We will cover standard theories, models and algorithms, discuss competing solutions to problems, describe example systems and applications, and highlight areas of open research.

Scalable Machine Learning

This module will focus on technologies and algorithms that can be applied to data at a very large scale (e.g. population level). From a theoretical perspective it will focus on parallelization of algorithms and algorithmic approaches such as stochastic gradient descent. There will also be a significant practical element to the module that will focus on approaches to deploying scalable ML in practice such as SPARK, FLINK, programming languages such as Scala and deployment on elastic computing structures, cloud computing and/or GPUs.

Professional Issues

This module aims to promote an awareness of the wider social, legal and ethical issues of computing. It describes the relationship between technological change, society and the law, emphasising the powerful role that computers and computer professionals play in a technological society. It also introduces the legal areas which are specific and relevant to the discipline of computing (eg intellectual property, liability for defective software, computer misuse, etc) and aims to provide an understanding of ethical concepts that are important to computer professionals, and experience of considering ethical dilemmas.

Optional modules

Computer Security and Forensics

This module provides, in general, an introduction into computer security and forensics. In particular, this module focuses on approaches and techniques for building secure systems and for the secure operation of systems. The module requires a solid understanding of mathematical concepts (e.g., modulo-arithmetic, complex numbers, group theory) and logic (set theory, predicate logic, natural deduction). Moreover, the module requires a solid understanding of a programming language (e.g., Java, Ruby or C), basic software engineering knowledge and an understanding of database and Web systems. The lab sessions require a basic command of Linux in general and the command line (shell) in particular.

Students should be aware that there are limited places available on this course.

Parallel Computing with Graphic Processing Units (GPUs)

Accelerator architectures are discrete processing units which supplement a base processor with the objective of providing advanced performance at lower energy cost. Performance is gained by a design which favours a high number of parallel compute cores at the expense of imposing significant software challenges. This module looks at accelerated computing from multi-core CPUs to GPU accelerators with many TFlops of theoretical performance. The module will give insight into how to write high performance code with specific emphasis on GPU programming with NVIDIA CUDA GPUs. A key aspect of the module will be understanding what the implications of program code are on the underlying hardware so that it can be optimised.

Students should be aware that there are limited places available on this course.

Dissertation project

Individual dissertation project

For the individual project, students can choose from a wide range of possibilities in many different environments both within and outside the University. The project is completed during the summer, and each student will have a personal academic supervisor to guide them during this period. The individual project is examined by a dissertation based on the project work and an oral examination.

Careers

Be in demand

McKinsey & Company have projected a global demand for 1.5 million new data scientists. Data from IT Jobs Watch shows a strong demand within the UK, with average salary of £37,500.

The MSc in Data Analytics will equip you with the key skills valued by employers, enabling you to progress rapidly within your chosen profession.
Graduates are in demand across the public and private sector, with potential employment routes including:

  • Data ScientistsData Analytics careers
  • Scientific research
  • Data Science/Analytics consultancy
  • Further study, including PhD

Recent graduates have gone on to work in a range of organisations including large corporations, government departments and SMEs. Examples include Barclays, Waicai, and Defence Science and Technology Laboratory (Dstl).

What employers are saying

"There is an increasing demand for graduates with the skill sets that the MSc in Data Analytics will deploy not just for Amazon but across the new digital landscape and industry." - Ralf Herbrich, Director of Machine Learning (Amazon)

"The rapid explosion of E-commerce means everyone now has data, so everyone has to manage it and wants to get value from it. The demand for data scientists outstrips the current supply. There is a need for a new breed of Data Scientists- people who have a broader range of Data Analytics experience and skills, with maths one part, and computer science much more important." - Adrian Lingard, CEO (Jaywing PLC)

"There is a severe shortage of graduates with the requisite computational and statistical skills. The MSc in Data Analytics provides the necessary statistical and computational background to ensure that graduates are 'data ready'." - Alfredo Kalaitzis, Data Scientist (Microsoft)