New Microsoft Word Document

Uploaded by

lan300995

1. Modernizing Data Lakes and Data Warehouses with Google Cloud

1.1. Introduction to Data Engineering

1.1.1. The role of a data engineer

One example of a data lake is a Cloud Storage bucket.
One example of a data warehouse is BigQuery.

Batch data
Dataproc manages Hadoop and Spark services.
Hadoop = processing is distributed across several servers instead of
running on a single machine.
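
The split-process-combine idea behind that note can be illustrated with a minimal sketch. This is not Hadoop or Dataproc code: it uses threads on one machine as stand-ins for servers, and the function names and sample lines are made up for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def count_words(chunk):
    """Map step: count the words in one chunk of lines (one stand-in 'server')."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def distributed_word_count(lines, workers=3):
    """Split the input, count each part in parallel, then merge the partial counts."""
    # Deal the lines out round-robin so each worker gets its own chunk.
    chunks = [lines[i::workers] for i in range(workers)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(count_words, chunks))
    total = Counter()
    for partial in partials:  # Reduce step: combine the partial results.
        total.update(partial)
    return total


lines = ["batch data", "streaming data", "batch jobs"]
result = distributed_word_count(lines)
```

Each worker only ever sees its own chunk, which is what lets the same job scale out when the chunks live on different machines.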

Streaming data

1.1.2. Data Engineering Challenges

1.1.3. Introduction to BigQuery

1.1.4. Data Lakes and Data Warehouses

1.1.5. Transactional Databases vs Data warehouses
A database stores the current data required to power an
application. A data warehouse stores current and
historical data from one or more systems in a predefined
and fixed schema, which allows business analysts and
data scientists to easily analyze the data.
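
A small sketch of that difference, using an in-memory SQLite database as a stand-in for both systems. The table names, columns, and the `set_status` helper are hypothetical; a real warehouse such as BigQuery would be loaded by a pipeline rather than by the application itself.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Application database: one row per order, holding only the current state.
conn.execute("CREATE TABLE orders (order_id INTEGER PRIMARY KEY, status TEXT)")
# Warehouse-style table: every state an order has been in, kept for analysis.
conn.execute("CREATE TABLE orders_history (order_id INTEGER, status TEXT)")


def set_status(order_id, status):
    """Overwrite the current state and append the same change to the history."""
    conn.execute(
        "INSERT OR REPLACE INTO orders (order_id, status) VALUES (?, ?)",
        (order_id, status),
    )
    conn.execute(
        "INSERT INTO orders_history (order_id, status) VALUES (?, ?)",
        (order_id, status),
    )


for status in ["placed", "paid", "shipped"]:
    set_status(1, status)

current = conn.execute(
    "SELECT status FROM orders WHERE order_id = 1"
).fetchone()[0]
history = [
    row[0]
    for row in conn.execute(
        "SELECT status FROM orders_history WHERE order_id = 1 ORDER BY rowid"
    )
]
```

The application table can answer "what is the order's status now?", while only the history table can answer "how long do orders usually wait between paid and shipped?".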
1.1.6. Partner effectively with other data teams
1.1.7. Manage data access and governance

1.1.8. Build production-ready pipelines

1.1.9. Customer case study

1.1.10. Recap
1.2. Building Data lakes

1.2.1. Introduction to Data lakes

Data lakes -> data pipelines -> data warehouses

Orchestration workflows = kick off the data pipeline when
new raw data becomes available.
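
A minimal sketch of that trigger logic, assuming the orchestrator hands the handler an event with `bucket` and `name` fields for each new object. The `raw/` prefix, the bucket name, and both functions are hypothetical, and `run_pipeline` only returns a description instead of starting a real job.

```python
def run_pipeline(bucket, name):
    """Placeholder pipeline step: describes the job it would start."""
    return f"transform gs://{bucket}/{name}"


def on_new_object(event, raw_prefix="raw/"):
    """Start the pipeline only when the new object is raw data."""
    name = event["name"]
    if not name.startswith(raw_prefix):
        return None  # Not raw input (for example, pipeline output), so do nothing.
    return run_pipeline(event["bucket"], name)


started = on_new_object({"bucket": "example-lake", "name": "raw/sales.csv"})
ignored = on_new_object({"bucket": "example-lake", "name": "curated/sales.parquet"})
```

Filtering on the prefix keeps the pipeline from retriggering itself when it writes its own output back to the same bucket.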
1.2.2. Data Storage and ETL Options on Google cloud

Federated queries let you send a query statement to
AlloyDB, Spanner, or Cloud SQL databases and get the
result back as a temporary table.
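
In BigQuery, a federated query is written with the `EXTERNAL_QUERY` function, which takes a connection ID and the SQL to run on the external database. The sketch below only builds the statement text; the helper name, the connection ID, and the `orders` table are hypothetical, and actually running the statement would need a configured connection and a BigQuery client.

```python
def build_federated_query(connection_id, external_sql):
    """Wrap SQL for an external database in a BigQuery EXTERNAL_QUERY call."""
    # Reject input that would break out of the quoting used below.
    if "'''" in external_sql or '"' in connection_id:
        raise ValueError("unsupported quote characters in input")
    template = 'SELECT * FROM EXTERNAL_QUERY("{conn}", {q}{sql}{q})'
    return template.format(conn=connection_id, q="'''", sql=external_sql)


# Hypothetical connection ID and table name, for illustration only.
sql = build_federated_query("my-project.us.orders-conn", "SELECT id FROM orders")
```

The inner statement is executed by the external database in its own SQL dialect; BigQuery then treats the returned rows like any other table in the outer query.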

1.2.3. Build a data lake using cloud storage

1.2.4. Secure cloud storage
1.2.5. Store all sorts of data types
1.2.6. Cloud SQL as your OLTP system

1.3. Building Data Warehouse
