ETL-Developer-Training
ETL-Developer-Training
Thank you for contacting our SQL School. I am Mr. Sai Phanindra, trainer for this ETL
Developer Course.
With 19+ Years of technical expertise exclusively on Database and Azure, BI Technologies, I
assure you 100% Practical, Step by Step Classes for this in-depth ETL Developer course. My
Profile @ linkedin.com/in/saiphanindra/
Detailed Curriculum
Module 1: SQL Server TSQL Queries (MSSQL)
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
INSERT & INSERT INTO UNION ALL For Sorting Data Primary Key & Foreign Key
SELECT & SELECT INTO Creating, Using Schemas RDBMS and ER Models
Basic Operators in SQL Real-world Banking Database Identity Property, Default
Special Operators in MSSQL Table Migrations @ Schemas Clustered Index, Primary Key
ALTER, ADD, TRUNCATE, DROP 2 Part, 3 Part & 4 Part Naming Non Clustered Index, Unique
Ch 7: Joins & Views Basics Ch 8: Functions(UDF), Data Types Ch 9: Stored Procedures,Models
JOINS: Purpose. Inner Joins Using Functions in MSSQL Stored Procedures & Usage
Left / Right / Full Outer Joins Scalar Value Functions Creating, Testing Procedures
Cross Joins, Query Tuning Inline & Multiline Functions Encryption, Deferred Names
Creating & Using Views Date & Time Functions SPs for Validations, Analysis
DML, SELECT with Views String, Aggregate Functions System SPs, Recompilation
RLS : WITH CHECK OPTION Data Types : Integer, Char, Bit Normal Forms & Types
System Views & Metadata SQL Variant, Timestamp, Date Data Models, Self-References
Ch 10: Triggers, Temp Tables Ch 11: DB Architecture, Locks Ch 12: Cursors & CTEs, Links
Need for Triggers Planning VLDBs : Files, Sizing Cursors : Realtime Use
DDL & DML Triggers Filegroups, Extents & Types Fetch & Access Cursor Rows
Using Memory Tables Log Files : VLF, Mini LSN CTEs for SELECT, DML
Data Replication, Automation Table Location, Performance CTEs: Scenarios & Tuning
Local & Global Temp Tables Schemas, Transfer, Synonyms Linked Servers, Remote Joins
Testing & Using Temp Tables Transactions Types, Lock Hint Linked Servers: MSDTC, RPC
SELECT .. INTO & Bulk Loads Query Blocking Scenarios Tuning Remote Queries
Ch 13: Merge, Upsert & Rank Ch 14: Grouping & Cube Ch 15: Self Joins, Excel Analysis
Need for Merge in ETL Group By & HAVING Self Joins & Self References
Incremental Loads with SQL Cube, Rollup & Grouping UNION, UNION ALL
MERGE and RANK Functions Joins with Group By Sub Queries with Joins
Window Functions, Partition 3 Table, 4 Table Joins IIF, CASE, EXISTS Statements
Identify, Remove Duplicates Query Execution Order Excel Analytics, Pivot Reports
Realtime Case Study : Health-Care Domain
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
Ch 4: Azure Data Factory (ADF) Ch 5: ADF Incremental Loads - 1 Ch 6: ADF Incremental Loads - 2
Need for ADF & Pipelines File Incremental Loads Implement SCD with ADF
Linked Services & IRs Storage Account, Data Lake Self Hosted IR: Realtime Use
Datasets, Pipelines, Triggers Binary Copy, Schema Drift On-premise Data: Incr Loads
Copy Data Activity & CDT Staging Concept in ADF Copy Method : Upsert, Keys
Data Loads Pipelines, DTUs DOCP, Logging & Consistency Staging & ADF Optimizations
Pipeline Monitoring, Edits Polybase Concept & Tuning Pipeline Runs, Activity IDs
Ch 7: ADF Data Flow - 1 Ch 8: ADF Data Flow - 2 Ch 9: ADF Data Flow - 3
Data Flow Transformations Working with Multiple Tables Pivot Transformation
Spark Clusters for Debugging Join Transform, Broadcast Group By & Pivot Keys
Optimized Clusters, Preview Row Filters, Column Filters Column Pattern, Deduplicate
Conditional Split, SELECT Surrogate Keys, Derived Cols Lookup, Cached Lookup
Sort, Union Transformations ETL Loads Dates, Sink Options Tuning Transformations
Pipelines with Data Flow Aggregated Data Loads Tuning Data Flow, Spark
Ch 10: Synapse Analytics - 1 Ch 11: Synapse Analytics - 2 Ch 12: Synapse Analytics - 3
Azure Synapse Analytics Serverless Pools in Synapse Apache Spark Pool & Usage
Dedicated SQL Pools TSQL Scripts with Serverless Synapse Analytics with Pools
TSQL: Stored Procedures ADLS Data Imports & ELT PySpark Staging,Aggregations
Synapse Pipelines, Tuning Synapse Aggregation, Analytics Spark Queries & Python ETL
SP Activity in Pipelines, Jobs Synapse Optimizations Python Notebooks, Pipelines
Comparing ADF & Synapse Synapse Security & Logins Integrating Python with DWH
Ch 13: Parameters, SCD & ETL Ch 14: CDC @ ETL, ELT & Tuning
ADF Templates in Realtime Using CDC in ADF A Realtime Case Study
Table Incremental Loads Control Tables (CT): Upserts
Control Tables, Watermarks Handling Inserts, Updates
Pipeline Parameters, SPs SCD Type 1 & Type 2
Dynamic Data Sets, SCD ADF, Synapse : Limitations
Ch 15: Azure Intro & Storage Ch 16: Azure Storage Operations Ch 17: Azure Storage Security
Storage, ETL, IoT Resources BLOB Storage: Containers Realtime use of Keys
Azure Storage Components Storage Browser, Explorer Access Keys & Admin Access
Azure Storage Account, HNS File & Folder Uploads, Edits SAS Keys Generation, Ips
Azure Data Lake Storage Azure Tables: Row Key Creating, Using Entra Users
Azure Storage Explorer Tool Partition Key, Timestamp Azure AD Users, Groups
Storage Explorer Config Use Cases of BLOB Storage IAM & RBAC with Entra Users
Storage Account Properties Use Cases of Azure Tables ACLs and ADLS Security
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
Ch 18: Azure SQL DB Migrations Ch 19: Azure Stream Analytics Ch 20: Azure Stream Analytics
On-Premise SQL DB bacpac Azure IoT Hubs & Devices Azure IoT Hubs & Devices
Azure SQL Deployment APIs with Connection Strings APIs with Connection Strings
Azure Storage from SSMS Azure Steam Analytic Jobs Azure Steam Analytic Jobs
Azure SQL DB Migration Inputs, Outputs, SAQL Query Inputs, Outputs, SAQL Query
Migration Verifications LIVE Feed: JSON, AVRO Files LIVE Feed: JSON, AVRO Files
Testing Migrations in SQL Watermark & LIVE Stats Watermark & LIVE Stats
Ch 21: Azure Key Vaults, Alerts Ch 22: Azure Storage Optimization Ch23: Azure Pricing, Functions
Azure Encryptions @ REST BLOB Types & Content Types Azure Logic Apps : Usage
Azure Key Vaults & Keys Hot, Cool, Cold, Archive Types Log Apps Usage in ETL
SMK & CMK Encryptions Creating, Using Access Policies Snapshots, Azure Functions
Azure Metrics: Ingress Immutable Storage, Rotation Azure Functions Realtime Use
Egress, E2E Latency Issues Containerization, Indexing ETL & DWH with Functions
Performance Tuning Options Replication: LRS, ZRS, RA-GRS Azure Resource Pricing
Ch 24: Azure Big Data & Spark Ch 25: Spark Cluster Operations Ch 26: Python & PySpark, ETL
Azure Big Data & Spark DBFS : Flat File Imports Python Fundamentals
Azure ETL & DWH Databases Table Conversions using GUI Python Dataframes : ETL
Azure Spark, HIVE Metastore Spark Clusters: Table Creations Python For Big Data, Pandas
Azure Databricks Service Basic Transformations in Spark Python Notebooks, Views
Spark Cluster (Personal) SQL Notebooks: Creation Aggregated Loads to Spark
Unity Catalog & Azure VM Default DB Queries, Cloning Spark DB Creations, Tables
Ch 27: PySpark & ADLS, Widgets Ch 28: ADB Jobs, Delta Tables Ch 29: Scala Notebooks & ETL
Creating Spark Databases Azure Databrick Jobs Scala Notebooks : Purpose
Spark Tables, Catalog Info Azure Workflows & Tasks Aggregated Data Loads
PySpark with ADLS Storage Notebook Schedule Options Incremental Data Loads
Using Widgets for ADLS Keys Continuous Jobs, Notifications Widgets & Jobs with Scala
PySpark Variables & Widgets Delta Tables & Data Cleansing Python Versus Scala
Using Variables in Functions SCD (Merge Into), Contact, etc Converting Python to Scala
Spark SQL with Control Text Creating, Using Dataframes JVM Benefits, SQL DB Conn”
Using Variables in Spark SQL Multi Dataframe Joins SQL DB Loads with Scala
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
Ch 30: Databricks Architecture Ch 31: Medallion Architecture Ch 32: Delta LIVE Tables (DLT)
Azure Databricks Services Medallion Architecture in ETL Creating Delta LIVE Tables
Cluster Components & DBFS DWH Data Loads & Incr Loads DLT Pipelines in ETL, DWH
RDD, DAG, Photon, SpotLight Bronze, Silver & Gold Data Automated Incr Loads
Spark Partitioned Tables Processing Raw Data Files Control Tables, Timestamp
Cluster Manager: Spark Jobs Data Cleansing, Formatting SCD Type 1 with DLT
Databricks Runtime (DBR) Aggregation Advantages SCD Type 2 with DLT
Databricks Security DBES & Node Architecture Automated Merge Into Stmt
Workspace Security Unity Catalog Concept Delta Tables Vs DLT
Notebook & Job Security LUNs and Unity Catalog Merge Into Vs DLT Pipeline
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
Ch 7: Python Functions Ch 8: Python Classes & Arrays Ch 9: Python Modules
Python Functions & Usage Python Classes & Objects import Python Modules
Function Parameters __init__() Function Variables in Modules
Arguments, **kwargs __str__() Function Built In Modules & dir
Default & List Parameters Self Parameters & Objects datetime module in Python
Python Lambda Functions Python Inheritance & Classes Date Objections Creation
Anonymous Functions Parent & Child Classes strftime Method & Usage
Recursive Functions, Usage __init__() & super() Function imports & datetime.now()
Return & Print @ Lamdba Polymorphism in Python Using Python Constructors
Ch 10: Python JSON & RegEx Ch 11: Python User Inputs & TRY Ch 12: Python File Handling
JSON Concepts, Usage Try Except, Exception Handling File Handling, Activities
Dictionary & import json NameError Resolution r, a, w, x modes
Python Objects into JSON Python Finally Block, Usage t, b Operations
Formatting & Ordering Raise an exception method Read Only Parts
json.dumps, print options TypeError, Scripting in Python Loop, Write, Close Files
Python Regular Expressions Python User Inputs Appending, Overwriting
RegEx Module & Functions Python Index Numbers import os, path.exists
search() & span() , Strings Named Indexes, Usage f.open, f.write
Using RegEx with JSON input() & raw_input() f.read, f.close
Ch 13: Data Analytics - Pandas Ch 14: Data Analytics - DataFrames Ch 15: Data Analytics - Pandas
Python Modules & Pandas Indexes & Named Options Pandas - Cleaning Data
Pandas Codebase & Usage Locate Row and Load Rows Replace, Transform Columns
Installation of Pandas Row Index & Index Lists Data Discovery & Column Fill
import pandas.DataFrame Load Files Into a DataFrame Identify & Remove Duplicates
Checking Pandas Version pd.read_csv() Function dropna(), fillna() Functions
Pandas Series, arrays pd.options.display.max_rows Pandas - Data Correlations
Labels : Creation, Use df.to_string() Function Good & Bad Correlation
series(), print() tail() & null() Function Data Plotting & matlib Lib
Ch 16: SQL Server & Python - 1 Ch 17: SQL Server & Python - 2 Ch 18: Power BI with Python
SQL Server DB Engine pandas.Series with SQL Server Using Python Script Visual
Azure Data Studio Tool Indexing Methods in Realtime PyScript Options & Tuning
sp_execute_external_script Convert series to data frame Settings, Labelling Options
Input Data & Result Sets Output values into data.frame Running and Testing Scripts
DDL & DML with Python pymssql package in SQL Server Data Validations in Power BI
SQL_out, SQL_in pip list & Package Manager Power BI: ipynb Scripts
Variables & Parameters Python runtime, Py Package Interactive Reports
Versions, Package List pymssql.connect & Usage Data Formatting with Python
WITH RESULT SETS Options Cursor Variables & Usage End to End Realtime Projects
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801
ETL Developer : Resume, Mock Interview + Project FAQs and Solutions
www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801