Building Data Lakes on AWS

In this course, you will learn how to build an operational data lake on AWS that supports analysis of both structured and unstructured data. The course introduces the services and components used to create a data lake and shows how they fit into modern data architectures.

The training focuses on designing, building, securing, and operating data lakes using AWS services. You work with AWS Lake Formation to build and govern the data lake, AWS Glue to create and manage the data catalog, and Amazon Athena to analyse data. Through lectures and hands-on labs, the course explores common data lake architectures and how data lakes support analytics and data-driven decision making.

Course objectives

In this course, you will learn to:

  • Apply data lake methodologies when planning and designing a data lake
  • Plan and design a data lake using established data lake methodologies
  • Describe the components and services required to build a data lake on AWS
  • Explain how to secure a data lake using appropriate permissions
  • Compare ingestion, storage, and transformation approaches in an AWS data lake
  • Analyse and visualise data stored in a data lake on AWS
  • Build and automate deployment of a data lake on AWS
  • Describe the role of a data lake within a modern data architecture

Prerequisites

We recommend that attendees of this course have:

  • Completed the AWS Technical Essentials classroom course
  • One year of experience building data analytics pipelines or completion of the Data Analytics Fundamentals digital course

Target audience

This course is intended for:

  • Data platform engineers
  • Solutions architects
  • IT professionals

Introduction to data lakes

The course begins with the value of data lakes, how they differ from data warehouses, key components, and common data lake architectures.

Data ingestion, cataloging, and preparation

You learn how data is ingested and stored in a data lake, how AWS Glue crawlers are used to build a data catalog, and how formatting, partitioning, and compression impact performance.

Building a data lake with AWS Lake Formation

This section covers data processing in a data lake, using AWS Glue for processing, and Amazon Athena for querying and analysis. You complete a hands-on lab building a data lake with AWS Lake Formation.

Data processing and analysis

You explore the features and benefits of AWS Lake Formation, its security model, and work through a lab that builds and secures a data lake.

Additional Lake Formation configurations

The course covers built-in blueprints, advanced permissions, fine-grained access control, and tag-based access control in Lake Formation.

Modern data architecture

You learn how data lakes fit into modern data architectures, including scalable analytics, unified governance, data movement patterns, and data mesh concepts, followed by a lab on building and publishing a data product.

Course wrap-up

The course concludes with a knowledge check, architecture review, and course review.

Practical information

Duration: 1 day
Price: 9 900 NOK
Course level: Intermediate

FAQ

Er dette et sertifiseringskurs?
Nei, dette er et opplæringskurs og gir ingen formell sertifisering.

Er kurset praktisk rettet?
Ja, kurset inkluderer presentasjoner, forelesning, hands-on labs og gruppeøvelser.

Hvilke AWS-tjenester brukes i kurset?
Kurset dekker blant annet AWS Lake Formation, AWS Glue og Amazon Athena.

Passer kurset for deltakere uten erfaring med data lakes?
Ja, kurset gir en strukturert introduksjon til data lakes, men forutsetter grunnleggende AWS- og dataanalyseerfaring.

Handler kurset om moderne dataarkitektur?
Ja, kurset viser hvordan data lakes inngår i moderne dataarkitekturer, inkludert data mesh-tilnærminger.

Other relevant courses

17. March
1 days
Classroom Virtual
18. March
3 days
Classroom Virtual
25. March
3 days
Classroom Virtual
8. April
3 days
Classroom Virtual