SCD Types
Uploaded by
Kiran Kulkarni
Slowly Changing Dimension: Categories
Prof. Sunita Sahu, Assistant Prof., VESIT, Mumbai

• Slowly changing dimensions are dimensions that change slowly over time, rather than on a regular, time-based schedule.
• In a data warehouse there is a need to track changes in dimension attributes in order to report historical data.
• The usual changes to dimension tables are classified into three types:
  • Type 1
  • Type 2
  • Type 3

[Figure: star schema with an Order fact table (Product Key, Time Key, Customer Key, Salesperson Key, Order Dollars, Cost Dollars, Margin Dollars, Sale Units) and four dimension tables: Customer (Customer Key, Customer Name, Customer Code, Marital Status, Address, State, Zip), Product (Product Key, Product Name, Product Code, Product Line, Brand), Time (Time Key, Date, Month, Year) and Salesperson (Salesperson Key, Salesperson Name, Territory, Region).]

Type 1 Changes: Error Correction
• Type 1 changes usually relate to corrections of errors in the source system.
• For example, in the customer dimension: a change in a name because of a spelling mistake.

Type 1 Changes, cont.
General principles for Type 1 changes:
• Usually, the changes relate to correction of errors in the source system.
• Sometimes the change in the source system has no significance.
• The old value in the source system needs to be discarded.
• The change in the source system need not be preserved in the DWH.

Applying Type 1 Changes
• Overwrite the attribute value in the dimension table row with the new value.
• The old value of the attribute is not preserved.
• No other changes are made in the dimension table row.
• The key of this dimension table row and any other key values are not affected.
• Easiest to implement.

Before the change:
Customer_ID  Customer_Name  Customer_Type
1            Cust_1         Corporate

After the change:
Customer_ID  Customer_Name  Customer_Type
1            Cust_1         Retail

Type 2 Changes
• Let's look at the marital status of a customer.
• One of the DWH's requirements is to track orders by marital status.
• All orders before 11/10/2004 fall under Marital Status = Single, and all orders after that date fall under Marital Status = Married.
• We need to aggregate the orders before and after the marriage separately.

Type 2 Changes, cont.
General principles for Type 2 changes:
• They usually relate to true changes in source systems.
• There is a need to preserve history in the DWH. This type of change partitions the history in the DWH.
• Every change for the same attribute must be preserved.

Type 2 Implementation
The steps:
• Add a new dimension table row with the new value of the changed attribute.
• An effective date is included in the dimension table.
• There are no changes to the original row in the dimension table.
• The key of the original row is not affected.
• The new row is inserted with a new surrogate key.

Type 2 Example
Before the change:
Customer_ID  Customer_Name  Customer_Type  Start_Date  End_Date
1            Cust_1         Corporate      22-07-2010  31-12-9999

After the change:
Customer_ID  Customer_Name  Customer_Type  Start_Date  End_Date
1            Cust_1         Corporate      22-07-2010  31-12-9999
2            Cust_1         Retail         22-07-2010  31-12-9999

Type 3 Changes
• In a Type 3 slowly changing dimension, there are two columns for the particular attribute of interest: one indicating the original value, and one indicating the current value.
• There is also a column that indicates when the current value became active.
• Not common at all
  • Time-consuming
• We want to track history without taking on a heavy burden.
• There are many soft changes and we don't care about the "far" history.

Type 3 Changes
General principles:
• They usually relate to "soft" or tentative changes in the source systems.
• There is a need to keep track of history with the old and new values of the changed attribute.
• They are used to compare performance across the transition.
• They provide the ability to track forward and backward.

Type 3
• No new dimension row is needed.
• The existing queries will seamlessly switch to the current value.
• Any queries that need to use the old value must be revised accordingly.
• The technique works best for one soft change at a time.
• If there is a succession of changes, more sophisticated techniques must be devised.

Type 3
Customer Key  Name      State
1001          Williams  New York

• After Williams moved from New York to Los Angeles, the original information gets updated, and we have the following table (assuming the effective date of change is February 20, 2010):

Customer Key  Name      Original State  Current State  Effective Date
1001          Williams  New York        Los Angeles    20-FEB-2010

Type 3
Advantages
• This does not increase the size of the table, since the existing row is updated with the new information.
• This allows us to keep some part of history.
Disadvantages
• Type 3 cannot keep all history when an attribute is changed more than once. For example, if Williams later moves to Texas, the Los Angeles information will be lost.

Large Dimension Table
• A dimension table can be large in two ways:
  • Very deep: the dimension has a very large number of rows.
  • Very wide: the dimension has a large number of attributes or columns.
• In a data warehouse, typically the customer and product dimensions are likely to be large.
• Such customer dimension tables may have as many as 100 million rows.
• The product dimension of large retailers is also very large.

Junk Dimension
• The junk dimension is simply a structure that provides a convenient place to store the junk attributes. It is just a collection of random transactional codes, flags and/or text attributes that are unrelated to any particular dimension.
• OLTP tables are often full of flag fields and yes/no attributes, many of which are used for operational support and have no documentation except for the column names and the memory banks of the person who created them.
  Not only do those types of attributes not integrate easily into conventional dimensions such as Customer, Vendor, Time, Location, and Product, but you also don't want to carry bad design into the data warehouse. However, some of the miscellaneous attributes will contain data that has significant business value, so you have to do something with them.

Junk Dimension
Advantages of a junk dimension:
• It provides a recognizable location for related codes, indicators and their descriptors in a dimensional framework.
• It avoids the creation of multiple dimension tables.
• It provides a smaller, quicker point of entry for queries, compared to the performance when these attributes sit directly in the fact table.
• An interesting use for a junk dimension is to capture the context of a specific transaction. While our common, conformed dimensions contain the key dimensional attributes of interest, there are likely attributes about the transaction that are not known until the transaction is processed.

[Figure: example junk dimension table whose rows are combinations of Y/N flag values.]

Rapidly Changing Dimensions
• A dimension is rapidly changing if one or more of its attributes changes frequently.
• When you deal with a Type 2 change, you create an additional dimension table row with the new value of the changed attribute. By doing so, you are able to preserve the history.
• Consider the customer dimension. Here the number of rows tends to be large, sometimes in the range of a million or more rows. But significant attributes in a customer dimension may change many times in a year. Rapidly changing large dimensions can be too problematic for the Type 2 approach.

Rapidly Changing Dimensions
• One effective approach is to break the large dimension table into one or more simpler dimension tables. How can you accomplish this?
• You need to break off the rapidly changing attributes into another dimension table, leaving the slowly changing attributes behind in the original table.

Solution to rapidly changing dimension
• Large dimensions call for special considerations.
• Because of the sheer size, many data warehouse functions involving large dimensions may be slow and inefficient.
• You need to address the following issues by using effective design methods, by choosing proper indexes, and by applying other optimizing techniques:
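
As a recap of the Type 1, Type 2 and Type 3 techniques described earlier, the following is a minimal Python sketch that applies each one to an in-memory dimension row, using the Cust_1 and Williams examples from the slides. It is an illustration under stated assumptions, not a warehouse implementation: the function names (`apply_type1`, `apply_type2`, `apply_type3`), the underscore column names for Type 3, and the Type 2 change date are all hypothetical and do not come from the slides.

```python
from datetime import date

OPEN_END = date(9999, 12, 31)  # the slides' "31-12-9999" open-ended End_Date


def apply_type1(rows, customer_id, attr, new_value):
    """Type 1: overwrite the attribute in place; the old value is not kept."""
    for row in rows:
        if row["Customer_ID"] == customer_id:
            row[attr] = new_value  # key and all other columns stay as they are
    return rows


def apply_type2(rows, customer_name, attr, new_value, effective):
    """Type 2: add a new row with a new surrogate key and an effective date.

    Following the steps listed in the slides, the original row is left unchanged.
    """
    latest = [r for r in rows if r["Customer_Name"] == customer_name][-1]
    new_row = dict(latest)  # copy, so the original row is not touched
    new_row["Customer_ID"] = max(r["Customer_ID"] for r in rows) + 1
    new_row[attr] = new_value
    new_row["Start_Date"] = effective
    new_row["End_Date"] = OPEN_END
    rows.append(new_row)
    return rows


def apply_type3(row, attr, new_value, effective):
    """Type 3: keep the original and current value in two columns of one row."""
    original_col, current_col = f"Original_{attr}", f"Current_{attr}"
    if original_col not in row:
        row[original_col] = row.pop(attr)  # first change: old value becomes the original
    row[current_col] = new_value  # any earlier current value is overwritten
    row["Effective_Date"] = effective
    return row


# Type 1: Cust_1 is corrected from Corporate to Retail.
type1_rows = apply_type1(
    [{"Customer_ID": 1, "Customer_Name": "Cust_1", "Customer_Type": "Corporate"}],
    customer_id=1, attr="Customer_Type", new_value="Retail",
)

# Type 2: the same change, but history is preserved as a second row.
# The change date below is hypothetical; it is not taken from the slides.
type2_rows = apply_type2(
    [{"Customer_ID": 1, "Customer_Name": "Cust_1", "Customer_Type": "Corporate",
      "Start_Date": date(2010, 7, 22), "End_Date": OPEN_END}],
    customer_name="Cust_1", attr="Customer_Type", new_value="Retail",
    effective=date(2012, 1, 1),
)

# Type 3: Williams moves from New York to Los Angeles on 20-FEB-2010.
type3_row = apply_type3(
    {"Customer_Key": 1001, "Name": "Williams", "State": "New York"},
    attr="State", new_value="Los Angeles", effective=date(2010, 2, 20),
)

for result in (type1_rows, type2_rows, type3_row):
    print(result)
```

Calling `apply_type3` a second time on the same row would overwrite the current value while the original value stays New York, which is the loss of intermediate history listed under the Type 3 disadvantages. Many Type 2 implementations also set the End_Date of the superseded row; this sketch does not, in order to stay with the steps as the slides list them.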