DP-200: Implementing an Azure Data Solution

Important notice!

This course (and DP-201) has been replaced by the new course DP-203 (to be relased from Microsoft in the end of April 2021):

DP-203: Data Engineering on Microsoft Azure

 

 

 

 

 

 

 

 

In this course, the students will implement various data platform technologies into solutions that are in line with business and technical requirements including on-premises, cloud, and hybrid data scenarios incorporating both relational and No-SQL data. They will also learn how to process data using a range of technologies and languages for both streaming and batch data.

The students will also explore how to implement data security including authentication, authorization, data policies and standards. They will also define and implement data solution monitoring for both the data storage and data processing activities. Finally, they will manage and troubleshoot Azure data solutions which includes the optimization and disaster recovery of big data, batch processing and streaming data solutions.

Audience

The primary audience for this course is data professionals, data architects, and business intelligence professionals who want to learn about the data platform technologies that exist on Microsoft Azure.

The secondary audience for this course is individuals who develop applications that deliver content from the data platform technologies that exist on Microsoft Azure.

Job role: Data Engineer

Prerequisites

Successful students start this course with knowledge of cloud computing and core data concepts and professional experience with data solutions.
Specifically completing:

Course content

This course contains these themes and modules:

Module 1: Azure for the Data Engineer

  • Explain the evolving world of data
  • Survey the services in the Azure Data Platform
  • Identify the tasks that are performed by a Data Engineer
  • Describe the use cases for the cloud in a Case Study

Module 2: Working with Data Storage

  • Choose a data storage approach in Azure
  • Create an Azure Storage Account
  • Explain Azure Data Lake storage
  • Upload data into Azure Data Lake

Module 3: Enabling Team Based Data Science with Azure Databricks

  • Explain Azure Databricks and Machine Learning Platforms
  • Describe the Team Data Science Process
  • Provision Azure Databricks and workspaces
  • Perform data preparation tasks

Module 4: Building Globally Distributed Databases with Cosmos DB

  • Create an Azure Cosmos DB database built to scale
  • Insert and query data in your Azure Cosmos DB database
  • Provision a .NET Core app for Cosmos DB in Visual Studio Code
  • Distribute your data globally with Azure Cosmos DB

Module 5: Working with Relational Data Stores in the Cloud

  • SQL Database and SQL Data Warehouse
  • Provision an Azure SQL database to store data
  • Provision and load data into Azure SQL Data Warehouse

Module 6: Performing Real-Time Analytics with Stream Analytics

  • Explain data streams and event processing
  • Querying streaming data using Stream Analytics
  • How to process data with Azure Blob and Stream Analytics
  • How to process data with Event Hubs and Stream Analytics

Module 7: Orchestrating Data Movement with Azure Data Factory

  • Explain how Azure Data Factory works
  • Create Linked Services and datasets
  • Create pipelines and activities
  • Azure Data Factory pipeline execution and triggers

Module 8: Securing Azure Data Platforms

  • Configuring Network Security
  • Configuring Authentication
  • Configuring Authorization
  • Auditing Security

Module 9: Monitoring and Troubleshooting Data Storage and Processing

  • Data Engineering troubleshooting approach
  • Azure Monitoring Capabilities
  • Troubleshoot common data issues
  • Troubleshoot common data processing issues

Certification

This course is recommended as preparation for exam DP-200.

Important notice!

Exam DP-200 (and exam DP-201) will retire 30 June 2021.
The new exam DP-203  (available from 23 February 2021), will replace the DP-200 and DP-201 exams.

To achieve the Azure Data Engineer Associate certification, you need to fulfil one of these two options:

  • Pass exam DP-201 and exam DP-200 before 30 June 2021 or
  • Pass exam DP-203