Data Warehouse Architecture

Data warehouse architecture involves organizing data from multiple sources into a central repository for analysis. There are three main architectures: single-tier, two-tier, and three-tier. The two-tier architecture stages data between source systems and the data warehouse for cleansing. The three-tier architecture adds a reconciled layer between sources and the warehouse. Data warehouse construction also uses top-down or bottom-up approaches - top-down builds data marts from a central warehouse while bottom-up constructs individual marts integrated later into a warehouse.

Uploaded by

Binay Yadav

Data Warehouse Architecture

Data warehouse architecture is complex because a data warehouse is an information system that
holds historical and cumulative data from multiple sources.
Data warehouse architecture defines the overall arrangement of data communication, processing,
and presentation that supports end-client computing within the enterprise. Each data warehouse
is different, but all share a set of standard, vital components.

Single-tier Data Warehouse Architecture

Single-tier architectures are not implemented in real-time systems. They are used mainly for
batch processing: the data is first transferred to the single tier, where it is converted into a
format that is suitable for processing.
Single-tier architecture is rarely used in practice. Its purpose is to minimize the amount of data
stored; to reach this goal, it removes data redundancies.
The single-tier architecture has three layers:
• A source layer
• A data warehouse layer
• An analysis (presentation) layer
In the single-tier architecture, only the source layer is physical. The data warehouse layer is
virtual and provides data in a multidimensional view, created by an intermediate processing
layer.
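The idea of a virtual warehouse layer can be illustrated with a database view. The following is a minimal sketch, not a production design: the `orders` table, its columns, and the view name are hypothetical, and an in-memory SQLite database stands in for the physical source layer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Source layer: the only physical storage (hypothetical operational table).
conn.execute("CREATE TABLE orders (region TEXT, product TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [("North", "Widget", 10.0), ("North", "Gadget", 5.0), ("South", "Widget", 7.5)],
)

# Virtual warehouse layer: a view over the source, so no data is stored twice.
conn.execute(
    "CREATE VIEW sales_by_region_product AS "
    "SELECT region, product, SUM(amount) AS total "
    "FROM orders GROUP BY region, product"
)

# Analysis layer: queries the view as if it were a warehouse table.
rows = conn.execute(
    "SELECT region, product, total FROM sales_by_region_product "
    "ORDER BY region, product"
).fetchall()
print(rows)
```

Because the view is computed from the source table on every query, analytical queries run against the operational data itself, which is one reason this layout is rarely chosen.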

Er. Binay Yadav Page 1

Two-tier Data Warehouse Architecture

Two-tier architecture includes a staging area for all data sources, before the data warehouse
layer. By adding a staging area between the sources and the storage repository, you ensure all
data loaded into the warehouse is cleansed and in the appropriate format.

Most businesses that use a data mart server adopt the two-tier data warehouse architecture,
which is made up of two tiers:
1. The Data Tier
This is the layer where the actual data is stored after various ETL processes have loaded it
into the data warehouse.
The data tier is made up of three layers:
• A source layer
• A data staging layer
• A data warehouse layer
Source layer: A data warehouse system uses heterogeneous sources of data. That data
is initially stored in corporate relational databases or legacy databases, or it may come
from an information system outside the corporate walls.
Data Staging: The data stored in the sources should be extracted, cleansed to remove
inconsistencies and fill gaps, and integrated to merge heterogeneous sources into one
standard schema. Extraction, Transformation, and Loading (ETL) tools can
combine heterogeneous schemata and extract, transform, cleanse, validate, filter, and load
source data into a data warehouse.
Data Warehouse layer: Information is saved to one logically centralized
repository: a data warehouse. The data warehouse can be accessed directly, but it can
also be used as a source for creating data marts, which partially replicate data warehouse
contents and are designed for specific enterprise departments. Metadata repositories
store information on sources, access procedures, data staging, users, data mart schemata,
and so on.
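The staging step described above can be sketched in a few lines. This is only an illustration under stated assumptions: the two sources (`crm_rows`, `legacy_rows`), their field names, and the cleansing rules are hypothetical, and an in-memory SQLite table stands in for the warehouse.

```python
import sqlite3

# Extract: two hypothetical sources with different schemata.
crm_rows = [
    {"cust": " Alice ", "country": "us", "revenue": "120.50"},
    {"cust": "Bob", "country": None, "revenue": "80"},
]
legacy_rows = [("CAROL", "DE", 200), ("alice", "US", 30)]


def from_crm(row):
    """Map a CRM record onto the standard schema."""
    return {"customer": row["cust"], "country": row["country"], "revenue": row["revenue"]}


def from_legacy(row):
    """Map a legacy tuple onto the standard schema."""
    name, country, revenue = row
    return {"customer": name, "country": country, "revenue": revenue}


def cleanse(rec):
    """Remove inconsistencies (case, whitespace, types) and fill gaps."""
    return (
        rec["customer"].strip().title(),
        (rec["country"] or "UNKNOWN").upper(),
        float(rec["revenue"]),
    )


# Transform: integrate both sources into one standard schema.
staged = [cleanse(from_crm(r)) for r in crm_rows]
staged += [cleanse(from_legacy(r)) for r in legacy_rows]

# Load: write the staged rows into the warehouse table.
warehouse = sqlite3.connect(":memory:")
warehouse.execute("CREATE TABLE sales (customer TEXT, country TEXT, revenue REAL)")
warehouse.executemany("INSERT INTO sales VALUES (?, ?, ?)", staged)

totals = dict(
    warehouse.execute("SELECT customer, SUM(revenue) FROM sales GROUP BY customer")
)
print(totals)
```

Only after cleansing do the two spellings of the same customer resolve to one name, which is what lets the warehouse aggregate across sources.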

2. The Client Tier

This layer is where clients use the data stored in the data warehouse to generate insights for
making informed, data-driven decisions. You can modify or transform this layer based on the data
trends that you discover from your analysis reports.
It is made up of a single layer:
• An analysis layer

Analysis Layer (Presentation Layer): In this layer, integrated data is accessed efficiently and
flexibly to issue reports, dynamically analyze information, and simulate hypothetical business
scenarios. It should feature aggregate information navigators, complex query optimizers, and
user-friendly GUIs.

Three-tier Data Warehouse Architecture

The three-tier approach is the most widely used architecture for data warehouse systems, and it
is what most organizations choose when building one. It solves the connectivity problems that
the two-tier architecture commonly faces.
The three-tier architecture is made up of:
• A source layer (containing multiple source systems)
• A reconciled layer
• A data warehouse layer (containing both data warehouses and data marts)
The reconciled layer sits between the source data and the data warehouse. The three-tier
architecture is useful for extensive, enterprise-wide systems.

The main advantage of the reconciled layer is that it creates a standard reference data model
for a whole enterprise. At the same time, it separates the problems of source data extraction and
integration from those of data warehouse population.
Viewed from the user's side, the three-tier architecture also has three tiers:
1. The bottom tier is the database of the warehouse, where the cleansed and transformed
data is loaded.
2. The middle tier is the application layer giving an abstracted view of the database. It
arranges the data to make it more suitable for analysis.
3. The top tier is where the user accesses and interacts with the data. It represents the
front-end client layer. You can use reporting, query, analysis, or data mining
tools here.
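The division of responsibilities between the three tiers can be sketched as follows. This is a simplified illustration, not a reference design: the `fact_sales` table and the `total_by` and `report` functions are hypothetical names, with SQLite standing in for the warehouse database.

```python
import sqlite3

# Bottom tier: the warehouse database holding cleansed, transformed data.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE fact_sales (year INTEGER, region TEXT, amount REAL)")
db.executemany(
    "INSERT INTO fact_sales VALUES (?, ?, ?)",
    [(2023, "North", 100.0), (2023, "South", 50.0), (2024, "North", 70.0)],
)

# Middle tier: an application layer that hides the SQL and exposes an
# abstracted, analysis-friendly view of the data.
ALLOWED_DIMENSIONS = {"year", "region"}


def total_by(dimension):
    if dimension not in ALLOWED_DIMENSIONS:
        raise ValueError(f"unknown dimension: {dimension}")
    query = (
        f"SELECT {dimension}, SUM(amount) FROM fact_sales "
        f"GROUP BY {dimension} ORDER BY {dimension}"
    )
    return dict(db.execute(query))


# Top tier: a front-end client that only talks to the middle tier.
def report(dimension):
    return [f"{key}: {value:.2f}" for key, value in total_by(dimension).items()]


print(report("region"))
```

The client never issues SQL itself, so the storage layout in the bottom tier can change without touching the reporting code.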

Data Warehouse Architecture

A data warehouse is a heterogeneous collection of different data sources organized under a
unified schema. There are two approaches to constructing a data warehouse, the top-down approach
and the bottom-up approach, explained below.

1. Top-down approach:

The essential components are discussed below:

1. External Sources –
An external source is any source from which data is collected, irrespective of the type of data.
The data can be structured, semi-structured, or unstructured.
2. Stage Area –
Since the data extracted from the external sources does not follow a particular format,
it needs to be validated before it is loaded into the data warehouse. For this purpose, an
ETL tool is recommended.
• E (Extract): Data is extracted from the external data source.
• T (Transform): Data is transformed into the standard format.
• L (Load): Data is loaded into the data warehouse after it has been transformed into the standard format.

3. Data-warehouse –
After cleansing, the data is stored in the data warehouse as the central repository. The
warehouse holds both the metadata and the detailed data; the data marts hold subsets of it.
Note that in this top-down approach the data warehouse stores the data in its purest form.

4. Data Marts –
A data mart is also part of the storage component. It stores the information of a particular
function of an organization that is handled by a single authority. An organization can have as
many data marts as it has functions. We can also say that a data mart
contains a subset of the data stored in the data warehouse.

5. Data Mining –
Data mining is the practice of analyzing the large volumes of data present in the data
warehouse. It is used to find hidden patterns in the database or data warehouse with the help
of data mining algorithms.
Inmon defines this approach as follows: the data warehouse is a central repository for the
complete organization, and data marts are created from it after the complete data warehouse
has been created.
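In the top-down approach, each data mart is derived from the central warehouse. The sketch below illustrates this under simple assumptions: the `warehouse_facts` table, the department names, and the `build_mart` helper are hypothetical, and SQLite stands in for the warehouse.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Central warehouse: one repository for the whole organization.
conn.execute("CREATE TABLE warehouse_facts (department TEXT, item TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO warehouse_facts VALUES (?, ?, ?)",
    [("sales", "Widget", 10.0), ("sales", "Gadget", 4.0), ("hr", "Training", 3.0)],
)

DEPARTMENTS = ("sales", "hr")


def build_mart(department):
    """Create a data mart holding only one department's subset of the warehouse."""
    if department not in DEPARTMENTS:
        raise ValueError(f"unknown department: {department}")
    table = f"mart_{department}"
    conn.execute(f"CREATE TABLE {table} (item TEXT, amount REAL)")
    conn.execute(
        f"INSERT INTO {table} "
        "SELECT item, amount FROM warehouse_facts WHERE department = ?",
        (department,),
    )
    return table


sales_mart = build_mart("sales")
sales_rows = conn.execute(
    f"SELECT item, amount FROM {sales_mart} ORDER BY item"
).fetchall()
print(sales_rows)
```

Since every mart is filled from the same warehouse table, all marts share the same definitions of the underlying data.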

Advantages of Top-Down Approach –

1. Since the data marts are created from the data warehouse, this approach provides a consistent
dimensional view across data marts.

2. This model is also considered the most robust against business changes, which is why big
organizations prefer to follow this approach.

3. Creating a data mart from the data warehouse is easy.

Disadvantages of Top-Down Approach –

1. The cost and time required for design and maintenance are very high.

2. Bottom-up approach:

1. First, the data is extracted from external sources (the same as in the top-down approach).

2. Then, the data goes through the staging area (as explained above) and is loaded into data marts
instead of the data warehouse. The data marts are created first and provide reporting capability.
Each data mart addresses a single business area.

3. These data marts are then integrated into the data warehouse.

4. Kimball defines this approach as follows: data marts are created first and provide a thin view
for analyses, and the data warehouse is created after the complete data marts have been created.
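The integration step of the bottom-up approach can be sketched as merging independently built marts on a shared key. This is an illustrative assumption, not a prescribed method: the mart contents, the `customer_id` key, and the `integrate` function are hypothetical, and the sketch assumes one row per customer in each mart.

```python
# Data marts are built first, each covering a single business area.
sales_mart = [
    {"customer_id": 1, "amount": 10.0},
    {"customer_id": 2, "amount": 5.0},
]
support_mart = [{"customer_id": 1, "tickets": 3}]


def integrate(marts):
    """Merge marts into one warehouse keyed by the shared customer_id."""
    warehouse = {}
    for mart_name, rows in marts.items():
        for row in rows:
            key = row["customer_id"]
            record = warehouse.setdefault(key, {"customer_id": key})
            for column, value in row.items():
                if column != "customer_id":
                    # Prefix with the mart name so columns from different
                    # marts cannot collide.
                    record[f"{mart_name}.{column}"] = value
    return warehouse


warehouse = integrate({"sales": sales_mart, "support": support_mart})
print(warehouse)
```

The merge only works because both marts identify customers the same way; when marts are built without such a shared key, integration becomes much harder.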

Advantages of Bottom-Up Approach –

1. Because the data marts are created first, reports can be generated quickly.

2. More data marts can be added over time, and in this way the data warehouse can
be extended.

3. The cost and time required to design this model are comparatively low.

Disadvantage of Bottom-Up Approach –

1. This model is not as strong as the top-down approach, because the dimensional view of the
data marts is not as consistent as it is in that approach.
