Data Engineering on AWS

Data Engineering on AWS is an intermediate course designed for professionals who want a deep dive into data engineering practices and solutions on AWS. The course covers the design, implementation, optimisation, and security of modern data solutions using AWS services.

This instructor-led training spans the full data engineering lifecycle, from foundational concepts to hands-on implementation of data lakes, data warehouses, and batch and streaming data pipelines. Through a mix of theory, practical labs, and group activities, participants gain the skills needed to architect and operate scalable, secure, and production-ready data engineering solutions on AWS.

Course objectives

In this course, you will learn to:

  • Understand core data engineering roles, personas, and key concepts
  • Identify AWS services used for orchestration, security, monitoring, CI/CD, infrastructure as code, networking, and cost optimisation
  • Design and implement a data lake solution on AWS, including ingestion, transformation, and data serving
  • Optimise and secure data lakes using open table formats, governance, and troubleshooting techniques
  • Design and implement data warehouse solutions using Amazon Redshift Serverless
  • Apply performance optimisation techniques for data warehouses, including monitoring and query optimisation
  • Manage security, access control, auditing, and compliance for data warehouses
  • Design and implement batch data pipelines using appropriate AWS services
  • Optimise, orchestrate, and secure batch data pipelines
  • Architect streaming data pipelines and implement ingestion, processing, and analysis using AWS services
  • Optimise and secure streaming data solutions with appropriate access control and compliance considerations

Prerequisites

We recommend that attendees of this course have:

  • Familiarity with basic machine learning concepts
  • Working knowledge of Python and common data science libraries such as NumPy, Pandas, and Scikit-learn
  • Basic understanding of cloud computing concepts and familiarity with AWS
  • Familiarity with SQL and relational databases, which is recommended but not mandatory
  • Experience with version control systems such as Git, which is beneficial but not required

Target audience

This course is designed for professionals who are interested in designing, building, optimising, and securing data engineering solutions using AWS services.

Day 1 – Data engineering foundations and data lakes

The first day introduces data engineering roles and key concepts, AWS data engineering services, and core tooling for orchestration, security, monitoring, CI/CD, and infrastructure as code. You design and implement data lakes on AWS, covering ingestion, cataloguing, transformation, and data serving, followed by optimisation and security using AWS Lake Formation and automation with blueprints.

Day 2 – Data warehouses and batch pipelines

Day two focuses on data warehouse architecture and implementation using Amazon Redshift Serverless. You work with performance optimisation, monitoring, query tuning, and access control. The day continues with designing and implementing batch data pipelines, including processing, integration, cataloguing, and serving data for consumption through hands-on labs.

Day 3 – Advanced batch and streaming pipelines

The final day covers optimisation, orchestration, and security of batch data pipelines using AWS services such as Step Functions and Spark. You also design, optimise, and secure streaming data pipelines, working with ingestion, processing, analysis, and access control using managed streaming services on AWS.

Practical information

Duration: 3 days
Price: 27 900 NOK
Course level: Intermediate

FAQ

Er dette et sertifiseringskurs?
Nei, dette er et opplæringskurs og gir ingen formell sertifisering.

Er kurset praktisk rettet?
Ja, kurset inkluderer presentasjoner, demonstrasjoner, hands-on labs og gruppeøvelser.

Hvilke AWS-tjenester jobber man mest med i kurset?
Kurset dekker blant annet AWS Lake Formation, Amazon Redshift Serverless, batch- og streamingtjenester, samt orkestrering og sikkerhetstjenester.

Passer kurset for deltakere uten tidligere data engineering-erfaring?
Kurset forutsetter teknisk bakgrunn og noe erfaring med data og programmering, men gir en strukturert progresjon fra grunnleggende til avanserte konsepter.

Dekker kurset både batch og streaming data pipelines?
Ja, kurset dekker både batch- og streamingarkitekturer, inkludert design, optimalisering, sikkerhet og drift.

Other relevant courses

17. March
1 days
Classroom Virtual
18. March
3 days
Classroom Virtual
25. March
3 days
Classroom Virtual
8. April
3 days
Classroom Virtual