Diagrams

Uploaded by

ramdotind
Copyright © All Rights Reserved

Design and implement data storage – Basics

We need different types of storage services


Design and implement data storage - Azure Synapse Analytics
What have we seen so far
A data warehouse
Welcome to Azure Synapse Analytics
Lab - Let's create an Azure Synapse workspace
About the serverless SQL pool

Quick note on Microsoft Entra ID and permissions


Lab - Using External tables – CSV
Using the dedicated SQL pool
Lab - SQL Pool - External Tables
Lab - Loading data into a SQL pool using PolyBase
Designing a data warehouse
Fact and Dimension Tables
Understanding Azure Synapse Architecture
Understanding table types
Lab - Surrogate keys for dimension tables
Slowly Changing dimensions
Which Load Method to use

Design and Develop Data Processing - Azure Data Factory
Extract, Transform and Load
What is Azure Data Factory
Self-Hosted Integration Runtime
Azure Data Factory and Git
Design and Develop Data Processing - Azure Event Hubs, Stream Analytics
Batch and Real-Time Processing
What are Azure Event Hubs
What is Azure Stream Analytics

About Windowing functions


Design and Develop Data Processing - Scala, Notebooks and Spark

Why Spark
Spark Architecture
Installing Spark
Design and Develop Data Processing - Azure Synapse and Azure Databricks
Azure Synapse - Spark Pool – Concepts
What is Azure Databricks
Concepts with Azure Databricks
Lab - Creating a cluster
Design and Implement Data Security
Authorization for Azure Data Lake Gen2

Microsoft Entra ID
Lab - Using Microsoft Entra ID - Using RBAC - Storage Blob Data Reader
Lab - Using Access Control Lists
Azure Synapse - Dedicated Pool Encryption
Azure Synapse Workspace Encryption
Azure Synapse - Microsoft Entra ID
Azure Storage Accounts - Network and Firewall
About Managed Identities
Azure Synapse - Managed Identity connectivity
Azure Data Factory – Encryption

Monitor and optimize data storage and data processing


Microsoft Purview
Best practices for data storage - Azure Data Lake
Azure Data Lake Gen2 - Access tiers
Azure Data Factory Logging
Azure Synapse - Result set caching
Azure Stream Analytics – Optimization
Azure Stream Analytics - The importance of time
