ETL-Developer-Training

The document outlines an ETL Developer course led by Mr. Sai Phanindra, featuring a comprehensive training plan over 15 weeks divided into three modules: SQL Server TSQL Queries, Azure Data Engineering, and Python ETL. Each module covers various topics and skills essential for ETL development, including database management, Azure services, and Python programming. The course promises practical, step-by-step classes leveraging the trainer's extensive experience in database and BI technologies.

Uploaded by

Malkhan

ETL Developer

Thank you for contacting SQL School. I am Mr. Sai Phanindra, the trainer for this ETL
Developer course.

With 19+ years of technical expertise focused exclusively on database, Azure, and BI
technologies, I assure you 100% practical, step-by-step classes for this in-depth ETL
Developer course. My profile: linkedin.com/in/saiphanindra/

ETL Developer Training Plan


Module 1    SQL Server TSQL Queries (MSSQL)    4 weeks
Module 2    Azure Data Engineering             7 weeks
Module 3    Python ETL                         4 weeks
Total Duration                                 15 weeks

Detailed Curriculum
Module 1: SQL Server TSQL Queries (MSSQL)

Ch 1: Introduction
• Database Introduction
• Types of Databases
• Need for ETL & DWH
• BI Implementations
• SQL Server Advantages
• Versions, Editions of MSSQL
• Engineering, Analytics Roles

Ch 2: Installations
• SQL Server 2019, 2017
• SSMS Tools Installation
• Database Engine (OLTP)
• SCM, Configuration Tools
• Instance Types, Uses
• Authentication Modes
• Collation, File Stream

Ch 3: SQL Basics - 1
• Need for Databases, Tables
• Need for SQL Commands
• DDL, DML & DQL Statements
• Database Creation @ GUI
• Data Operations @ GUI
• Session ID, SQL Context
• DB, Tables, Data @ SQL

Ch 4: SQL Basics - 2
• DDL Variants in MSSQL
• DML Variants in MSSQL
• INSERT & INSERT INTO
• SELECT & SELECT INTO
• Basic Operators in SQL
• Special Operators in MSSQL
• ALTER, ADD, TRUNCATE, DROP

Ch 5: Data Imports, Schemas
• Data Imports with Excel
• ORDER BY & UNION
• UNION ALL For Sorting Data
• Creating, Using Schemas
• Real-world Banking Database
• Table Migrations @ Schemas
• 2 Part, 3 Part & 4 Part Naming

Ch 6: Constraints, Index Basics
• Need for Constraints, Keys
• NULL, NOT NULL, UNIQUE
• Primary Key & Foreign Key
• RDBMS and ER Models
• Identity Property, Default
• Clustered Index, Primary Key
• Non Clustered Index, Unique

www.sqlschool.com For Free Demo: +91 99514 40801, +91 966644 0801

Ch 7: Joins & Views Basics
• JOINS: Purpose, Inner Joins
• Left / Right / Full Outer Joins
• Cross Joins, Query Tuning
• Creating & Using Views
• DML, SELECT with Views
• RLS: WITH CHECK OPTION
• System Views & Metadata

Ch 8: Functions (UDF), Data Types
• Using Functions in MSSQL
• Scalar Value Functions
• Inline & Multiline Functions
• Date & Time Functions
• String, Aggregate Functions
• Data Types: Integer, Char, Bit
• SQL Variant, Timestamp, Date

Ch 9: Stored Procedures, Models
• Stored Procedures & Usage
• Creating, Testing Procedures
• Encryption, Deferred Names
• SPs for Validations, Analysis
• System SPs, Recompilation
• Normal Forms & Types
• Data Models, Self-References

Ch 10: Triggers, Temp Tables
• Need for Triggers
• DDL & DML Triggers
• Using Memory Tables
• Data Replication, Automation
• Local & Global Temp Tables
• Testing & Using Temp Tables
• SELECT .. INTO & Bulk Loads

Ch 11: DB Architecture, Locks
• Planning VLDBs: Files, Sizing
• Filegroups, Extents & Types
• Log Files: VLF, Mini LSN
• Table Location, Performance
• Schemas, Transfer, Synonyms
• Transaction Types, Lock Hints
• Query Blocking Scenarios

Ch 12: Cursors & CTEs, Links
• Cursors: Realtime Use
• Fetch & Access Cursor Rows
• CTEs for SELECT, DML
• CTEs: Scenarios & Tuning
• Linked Servers, Remote Joins
• Linked Servers: MSDTC, RPC
• Tuning Remote Queries

Ch 13: Merge, Upsert & Rank
• Need for Merge in ETL
• Incremental Loads with SQL
• MERGE and RANK Functions
• Window Functions, Partition
• Identify, Remove Duplicates

Ch 14: Grouping & Cube
• Group By & HAVING
• Cube, Rollup & Grouping
• Joins with Group By
• 3 Table, 4 Table Joins
• Query Execution Order

Ch 15: Self Joins, Excel Analysis
• Self Joins & Self References
• UNION, UNION ALL
• Sub Queries with Joins
• IIF, CASE, EXISTS Statements
• Excel Analytics, Pivot Reports

Realtime Case Study : Health-Care Domain
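
As a rough illustration of the Ch 13 topics on upserts and incremental loads, the sketch below uses Python's built-in sqlite3 module as a stand-in for SQL Server. It relies on SQLite's INSERT ... ON CONFLICT clause rather than the T-SQL MERGE statement the course covers, and the patients table, its columns, and the sample rows are hypothetical.

```python
import sqlite3


def upsert_rows(conn, rows):
    """Insert new patients and update existing ones, keyed on patient_id.

    Assumption: SQLite 3.24 or later, which supports ON CONFLICT ... DO UPDATE.
    """
    conn.executemany(
        """
        INSERT INTO patients (patient_id, name, city)
        VALUES (?, ?, ?)
        ON CONFLICT(patient_id) DO UPDATE SET
            name = excluded.name,
            city = excluded.city
        """,
        rows,
    )
    conn.commit()


# Hypothetical target table for the example.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE patients (patient_id INTEGER PRIMARY KEY, name TEXT, city TEXT)"
)

# Initial load, then an incremental load with one changed and one new row.
upsert_rows(conn, [(1, "Asha", "Hyderabad"), (2, "Ravi", "Pune")])
upsert_rows(conn, [(2, "Ravi", "Chennai"), (3, "Meena", "Delhi")])

result = conn.execute(
    "SELECT patient_id, name, city FROM patients ORDER BY patient_id"
).fetchall()
print(result)
```

The second call updates patient 2 in place and adds patient 3, which is the same insert-or-update behaviour a MERGE statement would express in T-SQL.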

Module 2: Azure Data Engineering


Ch 1: ETL, DWH Introduction
• Database Introduction
• Data Warehouse (DWH)
• Data Engineering Work Flow
• Cloud Concepts: IaaS, PaaS
• SaaS & Azure Cloud Concepts
• Azure Resources & Groups
• Storage, ETL, IoT Resources

Ch 2: Azure Intro, Azure SQL
• Azure SQL Server, SQL DB
• Azure SQL Database (OLTP)
• Azure SQL Pool (DWH)
• Connections from SSMS Tool
• Connections from ADS Tool
• Pause / Resume SQL Pool
• Source Data Configurations

Ch 3: Azure Synapse (DWH)
• Synapse Pool Architecture
• Control Node, Compute Node
• DMS & Partitioned Tables
• Creating Tables with TSQL
• Distributions: RR, Hash, Repl
• Big Data Loads with TSQL
• Important DMFs & DMVs

Ch 4: Azure Data Factory (ADF)
• Need for ADF & Pipelines
• Linked Services & IRs
• Datasets, Pipelines, Triggers
• Copy Data Activity & CDT
• Data Loads Pipelines, DTUs
• Pipeline Monitoring, Edits

Ch 5: ADF Incremental Loads - 1
• File Incremental Loads
• Storage Account, Data Lake
• Binary Copy, Schema Drift
• Staging Concept in ADF
• DOCP, Logging & Consistency
• Polybase Concept & Tuning

Ch 6: ADF Incremental Loads - 2
• Implement SCD with ADF
• Self Hosted IR: Realtime Use
• On-premise Data: Incr Loads
• Copy Method: Upsert, Keys
• Staging & ADF Optimizations
• Pipeline Runs, Activity IDs

Ch 7: ADF Data Flow - 1
• Data Flow Transformations
• Spark Clusters for Debugging
• Optimized Clusters, Preview
• Conditional Split, SELECT
• Sort, Union Transformations
• Pipelines with Data Flow

Ch 8: ADF Data Flow - 2
• Working with Multiple Tables
• Join Transform, Broadcast
• Row Filters, Column Filters
• Surrogate Keys, Derived Cols
• ETL Load Dates, Sink Options
• Aggregated Data Loads

Ch 9: ADF Data Flow - 3
• Pivot Transformation
• Group By & Pivot Keys
• Column Pattern, Deduplicate
• Lookup, Cached Lookup
• Tuning Transformations
• Tuning Data Flow, Spark

Ch 10: Synapse Analytics - 1
• Azure Synapse Analytics
• Dedicated SQL Pools
• TSQL: Stored Procedures
• Synapse Pipelines, Tuning
• SP Activity in Pipelines, Jobs
• Comparing ADF & Synapse

Ch 11: Synapse Analytics - 2
• Serverless Pools in Synapse
• TSQL Scripts with Serverless
• ADLS Data Imports & ELT
• Synapse Aggregation, Analytics
• Synapse Optimizations
• Synapse Security & Logins

Ch 12: Synapse Analytics - 3
• Apache Spark Pool & Usage
• Synapse Analytics with Pools
• PySpark Staging, Aggregations
• Spark Queries & Python ETL
• Python Notebooks, Pipelines
• Integrating Python with DWH

Ch 13: Parameters, SCD & ETL
• ADF Templates in Realtime
• Table Incremental Loads
• Control Tables, Watermarks
• Pipeline Parameters, SPs
• Dynamic Data Sets, SCD

Ch 14: CDC @ ETL, ELT & Tuning
• Using CDC in ADF
• Control Tables (CT): Upserts
• Handling Inserts, Updates
• SCD Type 1 & Type 2
• ADF, Synapse: Limitations

A Realtime Case Study
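
To make the "Control Tables, Watermarks" topic in Ch 13 a little more concrete, here is a minimal sketch of a watermark-driven incremental load in plain Python. It is not ADF pipeline code: the control table is modelled as a dictionary, and the incremental_load function, the orders table name, and the modified_at column are all assumptions made for illustration.

```python
from datetime import datetime


def incremental_load(source_rows, target, control, table_name):
    """Copy only rows modified after the stored watermark, then advance it.

    `control` plays the role of a control table: table name -> last watermark.
    `target` plays the role of the destination table, keyed on row id.
    """
    last_watermark = control.get(table_name, datetime.min)
    changed = [row for row in source_rows if row["modified_at"] > last_watermark]

    for row in changed:
        target[row["id"]] = dict(row)  # upsert: insert new ids, overwrite existing

    if changed:
        control[table_name] = max(row["modified_at"] for row in changed)
    return len(changed)


# Hypothetical source data.
source = [
    {"id": 1, "amount": 100, "modified_at": datetime(2024, 1, 1)},
    {"id": 2, "amount": 250, "modified_at": datetime(2024, 1, 2)},
]
target, control = {}, {}

first_run = incremental_load(source, target, control, "orders")   # loads both rows
second_run = incremental_load(source, target, control, "orders")  # nothing new

# A later update to row 1 is picked up by the next run only.
source[0] = {"id": 1, "amount": 120, "modified_at": datetime(2024, 1, 3)}
third_run = incremental_load(source, target, control, "orders")

print(first_run, second_run, third_run, control["orders"])
```

Storing the watermark only after the rows are written keeps a failed run repeatable, since the next attempt starts from the same point.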

Ch 15: Azure Intro & Storage
• Storage, ETL, IoT Resources
• Azure Storage Components
• Azure Storage Account, HNS
• Azure Data Lake Storage
• Azure Storage Explorer Tool
• Storage Explorer Config
• Storage Account Properties

Ch 16: Azure Storage Operations
• BLOB Storage: Containers
• Storage Browser, Explorer
• File & Folder Uploads, Edits
• Azure Tables: Row Key
• Partition Key, Timestamp
• Use Cases of BLOB Storage
• Use Cases of Azure Tables

Ch 17: Azure Storage Security
• Realtime Use of Keys
• Access Keys & Admin Access
• SAS Keys Generation, IPs
• Creating, Using Entra Users
• Azure AD Users, Groups
• IAM & RBAC with Entra Users
• ACLs and ADLS Security

Ch 18: Azure SQL DB Migrations
• On-Premise SQL DB bacpac
• Azure SQL Deployment
• Azure Storage from SSMS
• Azure SQL DB Migration
• Migration Verifications
• Testing Migrations in SQL

Ch 19: Azure Stream Analytics
• Azure IoT Hubs & Devices
• APIs with Connection Strings
• Azure Stream Analytics Jobs
• Inputs, Outputs, SAQL Query
• LIVE Feed: JSON, AVRO Files
• Watermark & LIVE Stats

Ch 20: Azure Stream Analytics
• Azure IoT Hubs & Devices
• APIs with Connection Strings
• Azure Stream Analytics Jobs
• Inputs, Outputs, SAQL Query
• LIVE Feed: JSON, AVRO Files
• Watermark & LIVE Stats

Ch 21: Azure Key Vaults, Alerts
• Azure Encryption at Rest
• Azure Key Vaults & Keys
• SMK & CMK Encryptions
• Azure Metrics: Ingress
• Egress, E2E Latency Issues
• Performance Tuning Options

Ch 22: Azure Storage Optimization
• BLOB Types & Content Types
• Hot, Cool, Cold, Archive Types
• Creating, Using Access Policies
• Immutable Storage, Rotation
• Containerization, Indexing
• Replication: LRS, ZRS, RA-GRS

Ch 23: Azure Pricing, Functions
• Azure Logic Apps: Usage
• Logic Apps Usage in ETL
• Snapshots, Azure Functions
• Azure Functions Realtime Use
• ETL & DWH with Functions
• Azure Resource Pricing

Ch 24: Azure Big Data & Spark
• Azure Big Data & Spark
• Azure ETL & DWH Databases
• Azure Spark, HIVE Metastore
• Azure Databricks Service
• Spark Cluster (Personal)
• Unity Catalog & Azure VM

Ch 25: Spark Cluster Operations
• DBFS: Flat File Imports
• Table Conversions using GUI
• Spark Clusters: Table Creations
• Basic Transformations in Spark
• SQL Notebooks: Creation
• Default DB Queries, Cloning

Ch 26: Python & PySpark, ETL
• Python Fundamentals
• Python Dataframes: ETL
• Python For Big Data, Pandas
• Python Notebooks, Views
• Aggregated Loads to Spark
• Spark DB Creations, Tables

Ch 27: PySpark & ADLS, Widgets
• Creating Spark Databases
• Spark Tables, Catalog Info
• PySpark with ADLS Storage
• Using Widgets for ADLS Keys
• PySpark Variables & Widgets
• Using Variables in Functions
• Spark SQL with Control Text
• Using Variables in Spark SQL

Ch 28: ADB Jobs, Delta Tables
• Azure Databricks Jobs
• Azure Workflows & Tasks
• Notebook Schedule Options
• Continuous Jobs, Notifications
• Delta Tables & Data Cleansing
• SCD (Merge Into), Contact, etc
• Creating, Using Dataframes
• Multi Dataframe Joins

Ch 29: Scala Notebooks & ETL
• Scala Notebooks: Purpose
• Aggregated Data Loads
• Incremental Data Loads
• Widgets & Jobs with Scala
• Python Versus Scala
• Converting Python to Scala
• JVM Benefits, SQL DB Conn
• SQL DB Loads with Scala

Ch 30: Databricks Architecture
• Azure Databricks Services
• Cluster Components & DBFS
• RDD, DAG, Photon, SpotLight
• Spark Partitioned Tables
• Cluster Manager: Spark Jobs
• Databricks Runtime (DBR)
• Databricks Security
• Workspace Security
• Notebook & Job Security

Ch 31: Medallion Architecture
• Medallion Architecture in ETL
• DWH Data Loads & Incr Loads
• Bronze, Silver & Gold Data
• Processing Raw Data Files
• Data Cleansing, Formatting
• Aggregation Advantages
• DBES & Node Architecture
• Unity Catalog Concept
• LUNs and Unity Catalog

Ch 32: Delta LIVE Tables (DLT)
• Creating Delta LIVE Tables
• DLT Pipelines in ETL, DWH
• Automated Incr Loads
• Control Tables, Timestamp
• SCD Type 1 with DLT
• SCD Type 2 with DLT
• Automated Merge Into Stmt
• Delta Tables Vs DLT
• Merge Into Vs DLT Pipeline
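
For readers new to the SCD Type 2 topic listed in Ch 32, the following is a small sketch of the idea in plain Python. It does not use Delta Live Tables or MERGE INTO; the apply_scd_type2 function and the customer_id, city, valid_from, valid_to, and is_current column names are hypothetical and chosen only to show how history rows are closed and new versions appended.

```python
from datetime import date


def apply_scd_type2(dimension, incoming, load_date):
    """Apply SCD Type 2 changes to a list of dimension rows (dicts).

    `incoming` maps customer_id -> latest city. Changed customers get their
    current row closed and a new current row appended; unchanged ones are skipped.
    """
    current = {row["customer_id"]: row for row in dimension if row["is_current"]}

    for customer_id, city in incoming.items():
        existing = current.get(customer_id)
        if existing is not None and existing["city"] == city:
            continue  # no change, keep the current row as-is
        if existing is not None:
            existing["valid_to"] = load_date   # close the old version
            existing["is_current"] = False
        dimension.append(
            {
                "customer_id": customer_id,
                "city": city,
                "valid_from": load_date,
                "valid_to": None,
                "is_current": True,
            }
        )
    return dimension


# Hypothetical loads: customer 101 moves city between the two runs.
dim = []
apply_scd_type2(dim, {101: "Hyderabad", 102: "Pune"}, date(2024, 1, 1))
apply_scd_type2(dim, {101: "Chennai", 102: "Pune"}, date(2024, 2, 1))

history = [row for row in dim if row["customer_id"] == 101]
for row in history:
    print(row)
```

An SCD Type 1 variant would simply overwrite the city on the existing row, keeping no history.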

Real-time Project (Azure Data Engineer) + Resume Guidance

Module 3: Python ETL

Ch 1: Python Introduction
• Need for Data Analytics
• Python in Data Analysis
• History of Python
• Python Versions
• Python Implementations
• Python Installations
• Python IDE & Usage
• Jupyter Notebooks

Ch 2: Python Basics, Architecture
• Python Scripting Options
• Basic Operations in Python
• Python Scripts, print()
• Single, Multiline Statements
• Adding Cells, Saving Notebook
• Single, Multi Line Comments
• Python: Internal Architecture
• Compiler Versus Interpreter

Ch 3: Data Types & Variables
• Integer / Int Data Types
• Float & String Data Types
• Boolean, Binary Types
• Sequence Types: List, Tuple
• Range, Complex & memoryview
• Retrieving Data Type: type()
• Multi Assignments & Casting
• Unpack Collection, Outputs

Ch 4: Python Operators
• Arithmetic, Assignment Ops
• Comparison Operators
• Logical, Identity Operators
• Membership, Bitwise Operators
• Operator Precedence
• If ... Else Statement, Pass
• Short Hand If, OR, AND
• ELIF and ELSE IF Statements
• Expressions, Ternary Ops

Ch 5: Python Loops, Iterations
• Python Loop & Realtime Use
• Python While Loop Statement
• Break and Continue Statement
• Using Print with While()
• Iterations & Conditions
• Exit Conditions & For Loops
• Break, Continue & Range
• __iter__() and __next__()
• iter() and Looping Options

Ch 6: Python Collections
• Python Collections (Arrays)
• list() Constructor, print()
• Python Tuples, Tuple Items
• tuple() Constructor, Usage
• Python Sets: Syntax Rules
• Duplicates, Types, Ordered
• Python Dictionaries: Usage
• Changeable, Ordered Data
• Dictionary Construct, type()
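
The __iter__() and __next__() methods listed under Ch 5 can be illustrated with a small custom iterator. The sketch below is one possible example, not course material: the BatchIterator class is hypothetical and yields rows in fixed-size batches, a pattern that sometimes appears in ETL loading loops.

```python
class BatchIterator:
    """Iterate over rows in batches of `batch_size` using the iterator protocol."""

    def __init__(self, rows, batch_size):
        self.rows = list(rows)
        self.batch_size = batch_size
        self.position = 0

    def __iter__(self):
        # An iterator returns itself from __iter__().
        return self

    def __next__(self):
        if self.position >= len(self.rows):
            raise StopIteration  # signals the for loop to stop
        batch = self.rows[self.position:self.position + self.batch_size]
        self.position += self.batch_size
        return batch


# A for loop (or list()) calls iter() once and then next() repeatedly.
batches = list(BatchIterator(range(1, 8), 3))
print(batches)
```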

Ch 7: Python Functions
• Python Functions & Usage
• Function Parameters
• Arguments, **kwargs
• Default & List Parameters
• Python Lambda Functions
• Anonymous Functions
• Recursive Functions, Usage
• Return & Print @ Lambda

Ch 8: Python Classes & Arrays
• Python Classes & Objects
• __init__() Function
• __str__() Function
• Self Parameters & Objects
• Python Inheritance & Classes
• Parent & Child Classes
• __init__() & super() Function
• Polymorphism in Python

Ch 9: Python Modules
• import Python Modules
• Variables in Modules
• Built In Modules & dir
• datetime Module in Python
• Date Objects Creation
• strftime Method & Usage
• imports & datetime.now()
• Using Python Constructors

Ch 10: Python JSON & RegEx
• JSON Concepts, Usage
• Dictionary & import json
• Python Objects into JSON
• Formatting & Ordering
• json.dumps, print options
• Python Regular Expressions
• RegEx Module & Functions
• search() & span(), Strings
• Using RegEx with JSON

Ch 11: Python User Inputs & TRY
• Try Except, Exception Handling
• NameError Resolution
• Python Finally Block, Usage
• Raise an Exception Method
• TypeError, Scripting in Python
• Python User Inputs
• Python Index Numbers
• Named Indexes, Usage
• input() & raw_input()

Ch 12: Python File Handling
• File Handling, Activities
• r, a, w, x modes
• t, b Operations
• Read Only Parts
• Loop, Write, Close Files
• Appending, Overwriting
• import os, path.exists
• open(), f.write()
• f.read(), f.close()
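
The Ch 10 and Ch 11 topics (json, search() and span(), try/except) fit together naturally when validating incoming records. The sketch below shows one way that might look; the parse_record function, the note field, and the date pattern are assumptions made for the example.

```python
import json
import re

# Matches dates such as 2024-01-15 (format only, not calendar validity).
DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def parse_record(line):
    """Parse one JSON line and record where a date appears in its note."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError:
        return None  # skip lines that are not valid JSON

    match = DATE_PATTERN.search(record.get("note", ""))
    # span() gives the (start, end) character positions of the match.
    record["date_span"] = match.span() if match else None
    return record


good = parse_record('{"note": "loaded on 2024-01-15", "id": 7}')
bad = parse_record("not valid json")

# sort_keys and indent control ordering and formatting of the JSON output.
print(json.dumps(good, indent=2, sort_keys=True))
print(bad)
```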

Ch 13: Data Analytics - Pandas
• Python Modules & Pandas
• Pandas Codebase & Usage
• Installation of Pandas
• import pandas.DataFrame
• Checking Pandas Version
• Pandas Series, arrays
• Labels: Creation, Use
• series(), print()

Ch 14: Data Analytics - DataFrames
• Indexes & Named Options
• Locate Row and Load Rows
• Row Index & Index Lists
• Load Files Into a DataFrame
• pd.read_csv() Function
• pd.options.display.max_rows
• df.to_string() Function
• tail() & null() Function

Ch 15: Data Analytics - Pandas
• Pandas - Cleaning Data
• Replace, Transform Columns
• Data Discovery & Column Fill
• Identify & Remove Duplicates
• dropna(), fillna() Functions
• Pandas - Data Correlations
• Good & Bad Correlation
• Data Plotting & matplotlib Library

Ch 16: SQL Server & Python - 1
• SQL Server DB Engine
• Azure Data Studio Tool
• sp_execute_external_script
• Input Data & Result Sets
• DDL & DML with Python
• SQL_out, SQL_in
• Variables & Parameters
• Versions, Package List
• WITH RESULT SETS Options

Ch 17: SQL Server & Python - 2
• pandas.Series with SQL Server
• Indexing Methods in Realtime
• Convert series to data frame
• Output values into data.frame
• pymssql package in SQL Server
• pip list & Package Manager
• Python runtime, Py Package
• pymssql.connect & Usage
• Cursor Variables & Usage

Ch 18: Power BI with Python
• Using Python Script Visual
• PyScript Options & Tuning
• Settings, Labelling Options
• Running and Testing Scripts
• Data Validations in Power BI
• Power BI: ipynb Scripts
• Interactive Reports
• Data Formatting with Python
• End to End Realtime Projects

ETL Developer : Resume, Mock Interview + Project FAQs and Solutions

👉🏻 Please visit www.sqlschool.com

👉🏻 Reach us on +919666440801, +91 9666 640801 (SQL School Team)
