OLAP, and OLTP
Introduction
Data, data, dataeverywhere! Informationthats another story! Especially, the right information @ the right time! Data ware housing's goal is to make the right information available @ the right time Data warehousing is a data store (eg., a database of some sort) and a process for bringing together disparate data from throughout an organization for decision-support purposes
2
Different Goal
Aggregation, summarization and exploration Of historical data To help management make informed decisions
Product Coke (0.5 gallon) Pepsi (0.5 gallon) Coke (1 gallon) Altoids Branch Convoy Street UTC UTC Costa Verde Time 2006-03-01 [Link] 2006-03-01 [Link] 2006-03-01 [Link] 2006-03-01 [Link] Price $1.00 $1.03 $1.50 $0.30
...
Find the total sales for each product and month Find the percentage change in the total monthly sales for each product
3
OLAP and OLTP
OLTP-Online Transaction processing system (relies on solely on relational databases) record at time OLAP-Online analytical processing system (class of technologies that are designed for adhoc data access and analysis) deals with summarized data
Different Requirements
OLTP On-Line Transaction Processing OLAP On-Line Analytical Processing
OLTP Tasks Size of database Time span Size of working set Day to day operation Gigabytes Recent, up-to-date OLAP High level decision support Terabytes Spanning over months / years
Tens of records, accessed Consolidated data from through primary keys multiple databases Structured / repetitive Transaction throughput Ad-hoc, exploratory queries Query latency
Workload Performance
Data Warehouse
Enterprise Database
Customers
Orders
Transactions
Vendors Etc Data Miners: Farmers they know Explorers - unpredictable
Etc
Copied, organized summarized
Data Warehouse
Data Mining
General Architecture for Data Warehousing
Source systems Extraction, (Clean), Transformation, & Load (ETL) Central repository Metadata repository Data marts Operational feedback End users (business)
8
Where does OLAP fit in?
OLAP Overview
Interactive, exploratory analysis of multidimensional data to discover patterns
gender
age
ac
s nt e id c
10
OLAP Architecture
11
Server Options
Single processor Symmetric multiprocessor (SMP) Massively parallel processor (MPP)
12
OLAP Server Options
Multi-dimensional OLAP (MOLAP)
A k-dimensional matrix based on a non relational storage structure. [Agrawal et al]
Relational OLAP (ROLAP)
A relational back-end wherein operations of the data are translated to relational queries. [Agrawal et al]
Hybrid OLAP (HOLAP)
Integration of MOLAP with ROLAP.
Desktop OLAP (DOLAP)
Simplified versions of MOLAP or ROLAP.
ZOLAP
Speak with your chemist (normally only prescribed for death march victims)
13
OLAP Online Analytical Processing
A definition:
Data representation is in the form of a CUBE OLAP goes beyond SQL with its analysis capabilities Key feature of OLAP: Relevant multi-dimensional views such as products, time, geography
14
OLAP Cube - 1
15
OLAP Cube - 2
16
OLAP Cube - 3
Star Structure (quite common)
Product Model Type Color Facts Product Region Time Channel Revenue Expenses Units Region Nation District Dealer
Channel
Time Week Year
17
A Sample Data Cube
Pr od uc t
TV PC VCR sum 1Qtr 2Qtr
Date
3Qtr
4Qtr
sum
Total annual sales of TV in U.S.A. U.S.A Canada Mexico sum
Country
18
OLAP Cube - 5
ThreeDimensional Cube Display
Pag e R e g io n : N orth Red b lo b Row s Y ear 1996 1997 T otal B lu e b lo b C o lu m n s S a le s
T ota l
19
OLAP Cube - 6
Dimension Brand Store Customer segment Product group Period Variable Example Mt. Airy Atlanta Business Desks January Units sold
SixDimensional Cube
20
Rotation (Pivot Table)
21
Drill Down
22
OLAP Examples
[Link]
Excel Pivot Table example (similar to OLAP cube)
23
Sample of OLAP products
Just a snippet from [Link] ; not an endorsement
24
Data Mining versus OLAP
25
Data Mining versus OLAP
OLAP - Online Analytical Processing
Provides you with a very good view of what is happening, but can not predict what will happen in the future or why it is happening
26
Results of Data Mining Include:
Forecasting what may happen in the future Classifying people or things into groups by recognizing patterns Clustering people or things into groups based on their attributes Associating what events are likely to occur together Sequencing what events are likely to lead to later events
27
Thanks for listening.
28