Data Mining and Warehousing

This document outlines the examination paper for the Sixth Semester B. Tech. in Computer Science and Engineering/Artificial Intelligence and Machine Learning, focusing on Data Mining and Warehousing. It includes various questions related to OLAP queries, ETL processes, schema design, SQL commands, data normalization, decision trees, and clustering algorithms. The exam is structured to assess students' understanding of key concepts in data mining and warehousing within a 3-hour timeframe.

Uploaded by

aayush.dharpure02


Course Code : CAT 307 MPNO/MS – 24 / 1747

Sixth Semester B. Tech. (Computer Science and Engineering / Artificial Intelligence and Machine Learning) Examination

DATA MINING AND WAREHOUSING

Time : 3 Hours ] [ Max. Marks : 60

Instructions to Candidates :—
(1) Assume suitable data wherever necessary.
(2) All questions carry marks as indicated.

1. (a) What is a CUBE ? If we create a CUBE for a sales application with three
dimensions (time, location and item), illustrate with an example how the
sub-cubes in the lattice can be created. 4(CO1)
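As a rough illustration of the lattice in 1(a), the sketch below enumerates the cuboids of a three-dimensional cube and, for each one, the sub-cubes reached by aggregating away a single dimension. The names `DIMENSIONS` and `lattice_edges` are made up for this example and are not part of the paper.

```python
from itertools import combinations

# The three dimensions named in the question.
DIMENSIONS = ("time", "location", "item")

def lattice_edges(dims):
    """Map each cuboid to the cuboids obtained by dropping one of its dimensions."""
    edges = {}
    for k in range(len(dims), -1, -1):
        for cuboid in combinations(dims, k):
            edges[cuboid] = [
                tuple(d for d in cuboid if d != dropped) for dropped in cuboid
            ]
    return edges

edges = lattice_edges(DIMENSIONS)
for cuboid, children in edges.items():
    name = ", ".join(cuboid) if cuboid else "all (apex)"
    child_names = [", ".join(c) if c else "all (apex)" for c in children]
    print(f"({name}) -> {child_names}")
```

A cube with n dimensions has 2^n cuboids, so three dimensions give eight: the base cuboid, three 2-D cuboids, three 1-D cuboids and the apex.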
(b) Consider the star schema of an automobile data warehouse :
Autos (ModelId, modelname, serialNo, color)
Dealers (DealerId, name, city, state, phone)
Time (TimeId, day, week, month, year)
Sales (ModelId, DealerId, TimeId, QtySold, CountSold)
where the attribute QtySold is intended to be the total price of all automobiles
for the given model, color, date and dealer, while CountSold is the total number
of automobiles in that category. Answer the following OLAP queries :
(i) Find the total sales generated for model name (Maruti, Honda) and dealer
state (Maharashtra, Gujarat) in September 2017 and October 2017
using ROLLUP across three dimensions : ModelId, DealerId and
TimeId.
(ii) Find the total sales generated for model name (Maruti, Honda) and dealer
state (Maharashtra, Gujarat) in September 2017 and October 2017
using CUBE across the dimensions ModelId, DealerId and TimeId.
(iii) Comment on the difference in output between the ROLLUP and CUBE
aggregation clauses. 3(CO1)
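For part (iii), one way to see the difference is to list the grouping sets each clause generates. The sketch below assumes the standard SQL semantics, where ROLLUP over n columns yields the n + 1 prefixes of the column list and CUBE yields all 2^n subsets. The helper names are hypothetical.

```python
from itertools import combinations

def rollup_grouping_sets(cols):
    """ROLLUP(c1, ..., cn): the n + 1 prefixes, from the full list down to the grand total."""
    return [tuple(cols[:k]) for k in range(len(cols), -1, -1)]

def cube_grouping_sets(cols):
    """CUBE(c1, ..., cn): every subset of the column list, 2**n in total."""
    return [combo for k in range(len(cols), -1, -1) for combo in combinations(cols, k)]

cols = ["ModelId", "DealerId", "TimeId"]
rollup_sets = rollup_grouping_sets(cols)
cube_sets = cube_grouping_sets(cols)

# Subtotals that CUBE reports but ROLLUP does not.
only_in_cube = [s for s in cube_sets if s not in rollup_sets]

print("ROLLUP:", rollup_sets)
print("CUBE:  ", cube_sets)
print("Extra in CUBE:", only_in_cube)
```

Because ROLLUP depends on column order, changing the order of ModelId, DealerId and TimeId changes which subtotals appear, whereas CUBE always produces every combination.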
(c) What is meant by the ETL process ? What is the purpose of 'refresh'
in the ETL process ? 3(CO1)


2. (a) Suppose two stocks, Infosys and TCS, have the following values in one
week : (3, 6), (4, 9), (6, 11), (5, 12), (7, 15). If the stocks are affected
by the same industry trends, will their prices rise or fall together ? 4(CO3)
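A question of this kind is usually approached through the covariance of the two series: a positive value suggests the prices move in the same direction. The sketch below assumes each pair is (Infosys, TCS) and uses the population form of covariance (dividing by n). The sample form (dividing by n - 1) gives a different magnitude but the same sign.

```python
from statistics import mean

# Assumption: each pair in the question is (Infosys price, TCS price).
infosys = [3, 4, 6, 5, 7]
tcs = [6, 9, 11, 12, 15]

def population_covariance(xs, ys):
    """Cov(X, Y) = sum((x - mean_x) * (y - mean_y)) / n."""
    mean_x, mean_y = mean(xs), mean(ys)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / len(xs)

cov = population_covariance(infosys, tcs)
trend = "rise or fall together" if cov > 0 else "do not move together"
print(f"covariance = {cov:.2f}: the prices {trend}")
```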

(b) The Restaurants 'SR' wholesale restaurant company supplies equipment to
55 different restaurants in Mumbai, such as tables, chairs, table cloths, napkin
holders, cutlery and so on, as well as kitchen equipment such as saucepans,
knives and chef clothing. They wish to analyze their daily sales in terms of
revenue, unit sales, costs and profit for each product and customer. They
would also like to know this information by product line and product group.

• Design a STAR schema according to the given scenario.

• Convert the STAR schema into a Snowflake schema.

• Bring out the difference between the STAR and Snowflake schemas. 6(CO3)

3. (a) Consider the following snapshot of SALES table :—

Explain how the query "Select the rows from the Sales table where product
is 'Washer' and color is 'Almond' and division is 'East' or 'South'" will be
executed if bitmap indexes are created on the Product, Color and Region columns.
Show the intermediate steps. 5(CO1)
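The mechanics asked about in 3(a) can be sketched with integers used as bit vectors. The rows below are hypothetical and are NOT the SALES snapshot from the paper, which is not reproduced here. They only show the intermediate steps: build one bitmap per value, OR the two region bitmaps, then AND the results.

```python
# Hypothetical rows for illustration only: (product, color, region).
rows = [
    ("Washer", "Almond", "East"),
    ("Dryer", "White", "South"),
    ("Washer", "Almond", "North"),
    ("Washer", "White", "East"),
    ("Washer", "Almond", "South"),
]

def bitmap(column_index, value):
    """Bit i is set when row i holds `value` in the given column."""
    bits = 0
    for i, row in enumerate(rows):
        if row[column_index] == value:
            bits |= 1 << i
    return bits

washer = bitmap(0, "Washer")
almond = bitmap(1, "Almond")
east_or_south = bitmap(2, "East") | bitmap(2, "South")   # OR within the region predicate
result = washer & almond & east_or_south                  # AND across the predicates

# Row positions whose bit survives in the final bitmap.
matching_rows = [i for i in range(len(rows)) if (result >> i) & 1]
print(matching_rows)
```

Only the rows identified by the final bitmap need to be fetched from the table, which is why the approach suits low-cardinality columns combined with AND/OR predicates.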


(b) Write an SQL command to create an Index Organized Table (IOT) Employee with the
attributes empno, empname and salary in tablespace tsa, as directed :

(1) Empno is the primary key for the table.

(2) PCTTHRESHOLD is 20.

(3) Specify the OVERFLOW and INCLUDING clauses. Assume empname is to be
included in the INCLUDING clause.

(4) Give the meaning of the PCTTHRESHOLD, INCLUDING and OVERFLOW clauses.

Mention the advantages of an IOT over B-tree indexes. 5(CO2)

4. (a) Given below is the age data collected in a survey of a particular region :
15, 17, 18, 18, 21, 22, 22, 23, 24, 24, 27, 27, 27, 27, 32, 35,
35, 37, 37, 37, 37, 38, 42, 47, 48, 54, 72. Apply the following methods
and show the results :

(i) Use smoothing by bin means with a depth of 3.

(ii) Use min-max normalization to transform the value 36 into the
range 0.0 to 1.0.

(iii) Use z-score normalization to transform the value.

(iv) Use normalization by decimal scaling to transform the value 36.

(v) Plot an equi-width histogram of width 10.

Sketch examples of different sampling techniques using a sample of size 5 and
the strata low, medium and high. 5(CO2)
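Parts (i) to (v) can be checked with a short script. This is a sketch under stated assumptions: part (iii) does not name a value, so 36 is assumed to match parts (ii) and (iv), the z-score uses the population standard deviation, and histogram buckets are assumed to start at multiples of 10.

```python
from collections import Counter
from statistics import mean, pstdev

ages = [15, 17, 18, 18, 21, 22, 22, 23, 24, 24, 27, 27, 27, 27, 32, 35,
        35, 37, 37, 37, 37, 38, 42, 47, 48, 54, 72]
value = 36  # assumption for part (iii); given explicitly in parts (ii) and (iv)

# (i) Equal-depth bins of 3 values (data already sorted), each replaced by its bin mean.
bins = [ages[i:i + 3] for i in range(0, len(ages), 3)]
bin_means = [mean(b) for b in bins]

# (ii) Min-max normalization into [0.0, 1.0].
min_max = (value - min(ages)) / (max(ages) - min(ages))

# (iii) z-score, using the population standard deviation (an assumption).
z_score = (value - mean(ages)) / pstdev(ages)

# (iv) Decimal scaling: divide by 10**j, with j the smallest integer so that max|v| / 10**j < 1.
j = len(str(max(abs(a) for a in ages)))
decimal_scaled = value / 10 ** j

# (v) Equi-width histogram, width 10: count values per bucket [10, 20), [20, 30), ...
histogram = dict(sorted(Counter((a // 10) * 10 for a in ages).items()))

print("bin means:", [round(m, 2) for m in bin_means])
print("min-max:", round(min_max, 3))
print("z-score:", round(z_score, 3))
print("decimal scaling:", decimal_scaled)
print("histogram:", histogram)
```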

(b) What is a bitmap join index ? List the advantages of creating a bitmap
join index over a normal index. Write a query that illustrates the bitmap join
index. 5(CO3)


5. (a) Construct a decision tree for the following data set using Gini Index.

5(CO3)
(b) Generate the frequent itemsets using the Apriori algorithm for the transaction
database shown below and a minimum support s_min = 3 and minimum
confidence = 60%.

5(CO1)

6. (a) Use the DBSCAN algorithm to cluster the following examples with Euclidean
distance as the distance measure.
How many cluster(s) will the algorithm form with Epsilon = 3 and MinPoints = 3 ?
Draw the 11 by 11 space on graph paper and illustrate the discovered clusters.
A1 = (3, 11), A2 = (3, 6), A3 = (9, 5), A4 = (6, 9), A5 = (8, 6), A6 = (7, 5),
A7 = (2, 3), A8 = (5, 11). 5(CO4)
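A minimal DBSCAN sketch for these eight points follows. It assumes the common convention that a point counts as its own neighbour when comparing against MinPoints. Under the stricter convention that excludes the point itself, no point here would qualify as a core point. The function names are illustrative only.

```python
from math import dist

points = {
    "A1": (3, 11), "A2": (3, 6), "A3": (9, 5), "A4": (6, 9),
    "A5": (8, 6), "A6": (7, 5), "A7": (2, 3), "A8": (5, 11),
}
EPS, MIN_PTS = 3, 3

def neighbours(name):
    """Names of all points within EPS of `name`, including the point itself (assumption)."""
    return [other for other, p in points.items() if dist(points[name], p) <= EPS]

def dbscan():
    labels = {}       # point name -> cluster id, or None for noise
    cluster_id = 0
    for name in points:
        if name in labels:
            continue
        seeds = neighbours(name)
        if len(seeds) < MIN_PTS:
            labels[name] = None          # noise for now; may become a border point later
            continue
        cluster_id += 1
        labels[name] = cluster_id
        queue = [s for s in seeds if s != name]
        while queue:
            current = queue.pop()
            if labels.get(current) is not None:
                continue                 # already assigned to a cluster
            unvisited = current not in labels
            labels[current] = cluster_id
            if unvisited:
                reachable = neighbours(current)
                if len(reachable) >= MIN_PTS:   # only core points expand the cluster
                    queue.extend(reachable)
    return labels

labels = dbscan()
clusters = {}
for name, cid in labels.items():
    clusters.setdefault(cid, []).append(name)
print(clusters)
```

With these settings the sketch finds two clusters, {A3, A5, A6} and {A1, A4, A8}, and marks A2 and A7 as noise. A1 and A4 are border points attached through the core point A8.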


(b) The distances between pairs of five cases are given below :
Cluster the five cases using the procedures below and draw the dendrogram structure.
(i) Single linkage hierarchical procedure.
(ii) Complete linkage hierarchical procedure. 5(CO4)
