Academic Handbook Course Descriptors and Programme Specifications

LDSCI7229 Advanced Data Engineering Course Descriptor

Course code LDSCI7229 Discipline Data Science
UK Credit 30 US Credit NA
FHEQ level 7 Date approved June 2023
Core attributes N/A
Pre-requisites None
Co-requisites None

Course Overview

In this course, you will learn more about the principles and concepts of data engineering and big data architecture. Then you will learn how data engineers work with big data. You will also develop a better understanding of how to build, schedule and monitor data pipelines. This understanding will advance your knowledge of data storage architectures, including non-relational databases, and will comprehensively allow you to use statistical and mathematical foundations and advanced practical knowledge of AI and machine learning methodologies applied to complex datasets to meet business objectives. There is a particular focus on sustainable development.

Course Aims

The aims of the course are to: 

  • Demonstrate an understanding of data engineering principles and concepts, and big data architecture.
  • Develop a critical awareness of current issues and developments in data engineering.
  • Demonstrate the ability to apply tools and techniques used to develop data solutions in business.

Learning Outcomes

On successful completion of the course, students will be able to:

Knowledge and Understanding

K1d Comprehensively understand and have advanced knowledge of high-performance data storage architectures, programming languages and data engineering principles to deliver robust and scalable solutions to address business needs.
K2d Identify and use appropriate programming languages and software to develop robust, scalable models for data extraction, analysis, transformation and visualisation.
K3d Identify and creatively apply appropriate data engineering tools and techniques on-premise and using cloud platform technology to achieve organisational goals.
K4d Have an accurate, impartial, scientific, rigorous, hypothesis-driven approach to work including critical awareness of the capabilities and limitations of proposed techniques and solutions.

Subject Specific Skills

S1d Critically evaluate and apply advanced artificial intelligence methods to complex datasets to address business needs.
S3d Select, evaluate, and apply an appropriate range of advanced computational and scientific methods to solve artificial intelligence and data engineering problems of varying levels of complexity for business.

Transferable and Professional Skills

T3d(i) Keep up to date with current thinking and ideas at the forefront of discipline in artificial intelligence and data engineering.
T3d(ii) Identify, critique and synthesise complex information from a range of sources.

Teaching and Learning

This course has a dedicated Virtual Learning Environment (VLE) page with a syllabus and range of additional resources (e.g. readings, question prompts, tasks, assignment briefs, discussion boards) to orientate and engage you in your studies.

The scheduled teaching and learning activities for this course are:

Lectures/labs: Contact hours are typically a mix of weekly lectures and lab sessions, totalling up to 50 scheduled hours:

  • Version 1: All sessions in the same sized group, or
  • Version 2: most of the sessions in larger groups; some of the sessions in smaller groups

Faculty hold regular ‘office hours’, which are opportunities for students to drop in or sign up to explore ideas, raise questions, or seek targeted guidance or feedback, individually or in small groups.

Students are to attend and participate in all the scheduled teaching and learning activities for this course and to manage their directed learning and independent study.

Indicative total learning hours for this course: 300

Assessment

Both formative and summative assessment are used as part of this course, with formative opportunities typically embedded within interactive teaching activities delivered via the VLE.

Summative

AE: Assessment Activity Weighting (%) Duration Length
1 Set Exercises 40% N/A Code and 3,000-word explanation
2 Practical Skills Assessment 60% 2 Hours N/A

Further information about the assessments can be found in the Course Syllabus.

Feedback

Students will receive formative and summative feedback in a variety of ways, written (e.g. marked up on assignments or via the VLE) or oral (e.g. as part of interactive teaching sessions or in office hours).

Indicative Reading

Note: Comprehensive and current reading lists for courses are produced annually in the Course Syllabus or other documentation provided to students; the indicative reading list provided below is used as part of the approval/modification process only.

Books

  • Data Engineering with Python Work with Massive Datasets to Design Data Models and Automate Data Pipelines Using Python.
  • Cassandra: The Definitive Guide, (Revised) Third Edition, 3rd Edition. (2022). Cassandra. O’Reilly Media, Inc.
  • White, T. (2015). Hadoop : the definitive guide. 4th ed. ed. Sebastopol, California: O’Reilly.
  • Practical Apache Spark : using the Scala API
  • Apache Spark for data science cookbook : over 90 insightful recipes to get lightning-fast analytics with Apache Spark
  • Apache Hive essentials : essential techniques to help you process, and get unique insights from, big data
  • Data pipelines with Apache Airflow.

Indicative Topics

Note: Comprehensive and current topics for courses are produced annually in the Course Syllabus or other documentation provided to students. The indicative topics provided below are used as a general guide and part of the approval/modification process only.

  • Data engineering and data pipelines
  • Machine learning models development and operations (MLOps)
  • Big data software products
  • Data engineering software products
  • Non-relational databases

Version History

Title: LDSCI7229 Advanced Data Engineering

Approved by: Academic Board

Location: Academic Handbook/Programme specifications and Handbooks/ Postgraduate Programme Specifications/

Version number Date approved Date published Owner Proposed next review date Modification (As per AQF4) & category number
1.0 June 2023 June 2023 Alexandros Koliousis April 2028  
Print/Save PDF