0% found this document useful (0 votes)

319 views6 pages

(It-704c) Data Warehousing and Data Mining (2013-14)

This document provides information about a course on data warehousing and data mining including sample exam questions. The exam covers topics such as: 1. Characteristics of a data warehouse like containing historical and subject-oriented data. 2. Differences between ROLAP and MOLAP. 3. Concepts in data mining including association rules, frequent itemsets, and decision trees. 4. Data mining algorithms like FP-Growth and k-means clustering. The exam consists of multiple choice, short answer, and long answer questions testing understanding of key data warehousing and mining concepts and the ability to apply techniques like drawing schemas and illustrating algorithm workings.

Uploaded by

Suraj Dasgupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

319 views6 pages

(It-704c) Data Warehousing and Data Mining (2013-14)

Uploaded by

Suraj Dasgupta

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

CS/B.

TECH/IT (New)/SEM-7/IT-704C/2013-14

2013

DATA WAREHOUSING AND DATA MINING

Time Allotted : 3 Hours Full Marks : 70

The figures in the margin indicate full marks.

Candidates are required to give their answers in their own words

as far as practicable.

GROUP – A

(Multiple Choice Type Question)

1. Choose the correct alternatives for the following: 10 x 1 = 10

i) A data warehouse is said to contain a ‘time-varying’

collection of data because

a) its contents vary automatically with time

b) its life-span is very limited

c) it contains historical data

d) its content has explicit time-stamp.

ii) A data warehouse is said to contain a ‘subject-oriented’

collection of data because

a) its contents have a common theme

b) it is built for a specific application

c) it cannot support multiple subjects

d) it is generalization of ‘object-oriented’.
iii) A data warehouse is built as a separate repository of data,
different from the operational data of an enterprise because

a) it is necessary to keep the operational data free of any

warehouse operation

b) a data warehouse cannot afford to allow corrupted

data within it

c) a data warehouse contains summarized data whereas

the operational database contains transactional data

d) it is just needed.

iv) ROLAP is preferred over MOLAP when

a) a data warehouse and relational database are

inseparable

b) the data warehouse is in relational tables, but no

slice and dice operations are required

c) the multidimensional model does not support query

optimization

d) A data warehouse contains many fact tables and

many dimension tables.

v) The ‘Slice operation’ deals with

a) selecting all but one dimension of the data cube

b) merging the cells along one dimension

c) merging cells of all but one dimension

d) selecting the cells of any one dimension of the data

cube.

vi) Which of the following indexing techniques is appropriate

for data warehousing?

a) Hashing on primary key

b) Indexing on foreign keys of fact table

c) Bit-map indexing

d) Join indexing.
vii) What is ‘MOLAP’?

a) MOLAP is an OLAP engine for (i) relational models

and (ii) multidimensional OLAP operations

b) MOLAP is an OLAP engine for (i) multidimensional

models and (ii) SQL based OLAP operations

c) MOLAP is an OLAP engine for (i) multidimensional

models and (ii) supports multidimensional OLAP
operations.

d) MOLAP is a ROLAP with a supporting

multidimensional model.

viii) The advantage of FP-tree Growth Algorithm is

a) it counts the support values of the item sets in the

dashed structure as it moves along from one step
point to another.

b) it avoids the generation of large numbers of candidate

sets.

c) to update the association rules when the database

discover the set of frequent item sets

d) to prune the item sets which are not frequent.

ix) The ID3 generates a

a) binary decision tree

b) a decision tree with as many branches as there are

distinct values of the attribute

c) a tree with a variable number of branches, not related

to the domain of the attributes

d) a tree with an exponential number of branches.

x) An oblique tree is relevant when

a) the attributes are correlated

b) the attributes are independent

c) there are only two attributes

d) all attributes are categorical.

GROUP – B

(Short Answer Type Questions)

Answer any three of the following. 3 x 5 = 15

2. Differentiate among Enterprise Warehouse, Data mart and Virtual

warehouse.

3. Distinguish between OLTP and OLAP systems.

4. Explain support, confidence, frequent itemset and give a formal

definition of association rule.

5. Compare between HOLAP, ROLAP and MOLAP.

6. Describe the basic algorithm for decision tree induction.

GROUP – C

(Long Answer Type Questions)

Answer any three of the following. 3 x 15 = 45

7. a) How is data warehouse different from a database?

b) What is the significance of a multi-dimensional data model

in data-warehousing? Briefly compare the snowflake
schema and fact constellation concepts with a suitable
example.

c) Suppose that a data warehouse consists of the three

dimensions time, doctor and patient and two measures
count and charge, where charge is the fee that a doctor
charges a patient for a visit.

i) Draw a star schema for the above warehouse.

ii) Starting with the base cuboid (month, doctor,

patient), what specific OLAP operations should be
performed in order to list the total fee collected by
each doctor in 2012? 3+6+6

8. a) What is FP-tree?

b) Discuss the different phases of FP-tree growth algorithm.

c) Consider the following transaction database T, which
contains 15 records:

A1 A2 A3 A4 A5 A6 A7 A8 A9
1 0 0 0 1 1 0 1 0
0 1 0 1 0 0 0 1 0
0 0 0 1 1 0 1 0 0
0 1 1 0 0 0 0 0 0
0 0 0 0 1 1 1 0 0
0 1 1 1 0 0 0 0 0
0 1 0 0 0 1 1 0 1
0 0 0 0 1 0 0 0 0
0 0 0 0 0 0 0 1 0
0 0 1 0 1 0 1 0 0
0 0 1 0 1 0 1 0 0
0 0 0 0 1 1 0 1 0
0 1 0 1 0 1 1 0 0
1 0 1 0 1 0 1 0 0
0 1 1 0 0 0 0 0 1

The set of items, A = {A1, A2, A3, A4, A5, A6, A7, A8, A9}.

Assume ߪ = 20%.

Illustrate the working of a FP-tree growth algorithm for the above

database. 2+4+9

9. a) Define with suitable examples of each of the following data

mining functionalities: data characterization, data
association and data discrimination.

b) What is the conceptual hierarchy? How many cuboids are

there in n-dimensional data cube considering the
hierarchies in each dimension?

c) In real world data, tuples with missing values for some

attributes are a common occurrence. Suggest two different
approaches for handling such event. 5+5+5

10. a) What is clustering? What are the features of good cluster?

b) What do you mean by hierarchical clustering technique?

c) Suppose that the data mining task is to divide the following
eight points representing locations into 3 clusters: A1(2,10),
A2(2, 5), A3(8, 4), B1(5, 8), B2(7, 5), B3(6, 4), C1(1, 2),
C2(4,9). The distance function is Euclidian distance.
Initially, we assign A1, B1 and C1 as the center of each
cluster. Use k-means algorithm to determine the 3
clusters. 3+4+8

11. a) What is tree pruning? What are the different tree pruning
techniques?

b) Describe PAM algorithm in brief.

c) Evaluate Information Gain and Gain Ratio with suitable

example. 5+5+5

==========

CS 8031 Data Mining and Data Warehousing Tutorial
No ratings yet
CS 8031 Data Mining and Data Warehousing Tutorial
9 pages
DWDM QB
No ratings yet
DWDM QB
12 pages
Assign em NT
No ratings yet
Assign em NT
2 pages
Question Bank For DMDW
100% (1)
Question Bank For DMDW
10 pages
MSC CS Mqp0708
No ratings yet
MSC CS Mqp0708
12 pages
Data Warehouse and Data Mining Question Bank R13 PDF
No ratings yet
Data Warehouse and Data Mining Question Bank R13 PDF
12 pages
Data Mining-1
No ratings yet
Data Mining-1
15 pages
BTech Data Mining Exam Prep
No ratings yet
BTech Data Mining Exam Prep
8 pages
Questions and Answers
No ratings yet
Questions and Answers
19 pages
Question Bank: Q1) What Is Data Warehouse?
No ratings yet
Question Bank: Q1) What Is Data Warehouse?
17 pages
Data Warehousing MCQ
No ratings yet
Data Warehousing MCQ
71 pages
MCQ-Part1-2025-Question Bank (PEC-CSBS601D)
No ratings yet
MCQ-Part1-2025-Question Bank (PEC-CSBS601D)
7 pages
Data Mining & Warehousing Exam 2016
No ratings yet
Data Mining & Warehousing Exam 2016
3 pages
Data Warehouse and Data Mining Unit 1
No ratings yet
Data Warehouse and Data Mining Unit 1
2 pages
BE Information Technology 0
No ratings yet
BE Information Technology 0
655 pages
Subject Code: 80359 Subject Name: Data Warehousing and Data Mining Common Subject Code (If Any)
No ratings yet
Subject Code: 80359 Subject Name: Data Warehousing and Data Mining Common Subject Code (If Any)
9 pages
126VW122019
No ratings yet
126VW122019
2 pages
CXCXX C C
No ratings yet
CXCXX C C
14 pages
Vivaquestions
No ratings yet
Vivaquestions
14 pages
1 - Page
No ratings yet
1 - Page
11 pages
Question With Answer
No ratings yet
Question With Answer
22 pages
Consolidated Cse Question Bank1
No ratings yet
Consolidated Cse Question Bank1
170 pages
CS614 Data Warehousing MCQs Solved
100% (2)
CS614 Data Warehousing MCQs Solved
30 pages
Cs614-Mid Term Solved MCQs With References by Moaaz PDF
No ratings yet
Cs614-Mid Term Solved MCQs With References by Moaaz PDF
30 pages
M.Tech Exam: Data Warehousing & Mining
No ratings yet
M.Tech Exam: Data Warehousing & Mining
5 pages
Vi Sem Bca Qbank - Wcms - Fds
50% (2)
Vi Sem Bca Qbank - Wcms - Fds
11 pages
DM QB
No ratings yet
DM QB
25 pages
Data Warehousing and Data Mining MCQ'S: Unit - I
No ratings yet
Data Warehousing and Data Mining MCQ'S: Unit - I
29 pages
Cis 417.Ccs 415. CCT 416 Cat
No ratings yet
Cis 417.Ccs 415. CCT 416 Cat
4 pages
DMDW Question Bank
No ratings yet
DMDW Question Bank
17 pages
Data Warehousing & Mining Guide
No ratings yet
Data Warehousing & Mining Guide
4 pages
Data Warehouse & Mining Exam
No ratings yet
Data Warehouse & Mining Exam
3 pages
Question Bank: Data Warehousing and Data Mining Semester: VII
No ratings yet
Question Bank: Data Warehousing and Data Mining Semester: VII
4 pages
Ps Assignment - Solution
No ratings yet
Ps Assignment - Solution
7 pages
Data Mining Assignment Guide
No ratings yet
Data Mining Assignment Guide
2 pages
MCQ & Answers: DWM EXAM 2020-21
No ratings yet
MCQ & Answers: DWM EXAM 2020-21
17 pages
Ban Quiz Answer
No ratings yet
Ban Quiz Answer
12 pages
Data Mining MCQ Multiple Choice Questions With Answers: Eguardian
No ratings yet
Data Mining MCQ Multiple Choice Questions With Answers: Eguardian
15 pages
CEUC502 - DMBI - Question - Bank
No ratings yet
CEUC502 - DMBI - Question - Bank
12 pages
Anna University Data Warehousing and Data Mining November December 2011 Question Paper
No ratings yet
Anna University Data Warehousing and Data Mining November December 2011 Question Paper
3 pages
DWM 700
No ratings yet
DWM 700
16 pages
DM MCQS Unit-1
No ratings yet
DM MCQS Unit-1
4 pages
Data Mining Assignment
0% (1)
Data Mining Assignment
11 pages
Sri Vidya College of Engineering & Technology - Dept of CSE
No ratings yet
Sri Vidya College of Engineering & Technology - Dept of CSE
4 pages
Data Warehousing and Data Mining-1
No ratings yet
Data Warehousing and Data Mining-1
17 pages
Pyqp - Cs402-Qp-Jun21
No ratings yet
Pyqp - Cs402-Qp-Jun21
3 pages
Jntuqp DWDM
No ratings yet
Jntuqp DWDM
8 pages
Data Mining Model 2024
No ratings yet
Data Mining Model 2024
1 page
Data Warehousing Exam Prep
No ratings yet
Data Warehousing Exam Prep
59 pages
DM
No ratings yet
DM
7 pages
Unit 1
No ratings yet
Unit 1
6 pages
Data Warehousing & Mining Guide
No ratings yet
Data Warehousing & Mining Guide
3 pages
CS402 Data Mining and Warehousing Question Bank
No ratings yet
CS402 Data Mining and Warehousing Question Bank
6 pages
Data Mining Question Bank
No ratings yet
Data Mining Question Bank
4 pages
Lecture 1428550844
No ratings yet
Lecture 1428550844
11 pages
CAT 2023 Simulated Slot 1 Full
No ratings yet
CAT 2023 Simulated Slot 1 Full
4 pages
Dc-102-Sci Mumbai-5610034767
No ratings yet
Dc-102-Sci Mumbai-5610034767
1 page
Portfolio Career
No ratings yet
Portfolio Career
32 pages
Soft Skill & Interpersonal Communication Organizer PDF
0% (1)
Soft Skill & Interpersonal Communication Organizer PDF
80 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
11 pages
Holiday List 2024 PDF
No ratings yet
Holiday List 2024 PDF
1 page
Software Testing and Quality Factors
No ratings yet
Software Testing and Quality Factors
12 pages
Disk Management MCQs & Scheduling
No ratings yet
Disk Management MCQs & Scheduling
5 pages
SWE2
No ratings yet
SWE2
25 pages
Adobe Scan 19 Oct 2021
No ratings yet
Adobe Scan 19 Oct 2021
25 pages
Cloud Computing
No ratings yet
Cloud Computing
10 pages
O o o o O: Services
No ratings yet
O o o o O: Services
10 pages
SAP ERP Upgrade Learnings
No ratings yet
SAP ERP Upgrade Learnings
7 pages
Microsoft Access - Exercise: Houston Public Library
No ratings yet
Microsoft Access - Exercise: Houston Public Library
3 pages
Instructions NP AC19 1a
No ratings yet
Instructions NP AC19 1a
3 pages
Database PPQ
No ratings yet
Database PPQ
5 pages
Oracle Cloud Maintenance and Update Schedules
No ratings yet
Oracle Cloud Maintenance and Update Schedules
2 pages
QUESTION
No ratings yet
QUESTION
13 pages
Combine - Asynchronous Programming With Swift
No ratings yet
Combine - Asynchronous Programming With Swift
7 pages
Ceph Reference Architecture
100% (1)
Ceph Reference Architecture
12 pages
NetSuite Customization Services
No ratings yet
NetSuite Customization Services
12 pages
TDD With Django
No ratings yet
TDD With Django
27 pages
DocDxfAspEng ASPAN
No ratings yet
DocDxfAspEng ASPAN
1 page
Default - Parallel: You Can Set The Number of Reducers For A Map Job by Passing Any Whole Number As A
No ratings yet
Default - Parallel: You Can Set The Number of Reducers For A Map Job by Passing Any Whole Number As A
6 pages
AWS Database Migration Service Best Practices
100% (1)
AWS Database Migration Service Best Practices
17 pages
Lab Planning - OOPJ - 2022-23
No ratings yet
Lab Planning - OOPJ - 2022-23
3 pages
UTA003
No ratings yet
UTA003
2 pages
SQL Beginners' Lab Guide
100% (1)
SQL Beginners' Lab Guide
11 pages
Django Campus Event Management System
No ratings yet
Django Campus Event Management System
2 pages
50 Practice Queries
No ratings yet
50 Practice Queries
2 pages
Programming Language Evolution
100% (1)
Programming Language Evolution
29 pages
AP AR Netting Oracle
No ratings yet
AP AR Netting Oracle
3 pages
Java Platform Standard Edition 8 Documentation: What's New
No ratings yet
Java Platform Standard Edition 8 Documentation: What's New
2 pages
Java Basic Syntax PDF
100% (1)
Java Basic Syntax PDF
42 pages
1 - SPS Mapping Unidrive M DE PDF
No ratings yet
1 - SPS Mapping Unidrive M DE PDF
12 pages
MAS TCRD 2025-03 Circular On Two Factor Authentication
100% (1)
MAS TCRD 2025-03 Circular On Two Factor Authentication
2 pages
IT Security Guide for NOVA Staff
No ratings yet
IT Security Guide for NOVA Staff
39 pages
SunFishERP-Brochure (Web)
No ratings yet
SunFishERP-Brochure (Web)
24 pages
Lab 05 Analyzing Types of Attacks and Mitigation Techniques
No ratings yet
Lab 05 Analyzing Types of Attacks and Mitigation Techniques
8 pages
TCL 3 2015.00 SG
No ratings yet
TCL 3 2015.00 SG
158 pages

(It-704c) Data Warehousing and Data Mining (2013-14)

Uploaded by

(It-704c) Data Warehousing and Data Mining (2013-14)

Uploaded by

CS/B.

DATA WAREHOUSING AND DATA MINING

Time Allotted : 3 Hours Full Marks : 70

Candidates are required to give their answers in their own words

(Multiple Choice Type Question)

i) A data warehouse is said to contain a ‘time-varying’

a) its contents vary automatically with time

b) its life-span is very limited

c) it contains historical data

d) its content has explicit time-stamp.

ii) A data warehouse is said to contain a ‘subject-oriented’

a) its contents have a common theme

b) it is built for a specific application

c) it cannot support multiple subjects

a) it is necessary to keep the operational data free of any

b) a data warehouse cannot afford to allow corrupted

c) a data warehouse contains summarized data whereas

iv) ROLAP is preferred over MOLAP when

a) a data warehouse and relational database are

b) the data warehouse is in relational tables, but no

c) the multidimensional model does not support query

d) A data warehouse contains many fact tables and

v) The ‘Slice operation’ deals with

a) selecting all but one dimension of the data cube

b) merging the cells along one dimension

c) merging cells of all but one dimension

d) selecting the cells of any one dimension of the data

vi) Which of the following indexing techniques is appropriate

a) Hashing on primary key

b) Indexing on foreign keys of fact table

a) MOLAP is an OLAP engine for (i) relational models

b) MOLAP is an OLAP engine for (i) multidimensional

c) MOLAP is an OLAP engine for (i) multidimensional

d) MOLAP is a ROLAP with a supporting

viii) The advantage of FP-tree Growth Algorithm is

a) it counts the support values of the item sets in the

b) it avoids the generation of large numbers of candidate

c) to update the association rules when the database

d) to prune the item sets which are not frequent.

ix) The ID3 generates a

a) binary decision tree

b) a decision tree with as many branches as there are

c) a tree with a variable number of branches, not related

d) a tree with an exponential number of branches.

x) An oblique tree is relevant when

a) the attributes are correlated

b) the attributes are independent

c) there are only two attributes

d) all attributes are categorical.

(Short Answer Type Questions)

2. Differentiate among Enterprise Warehouse, Data mart and Virtual

3. Distinguish between OLTP and OLAP systems.

4. Explain support, confidence, frequent itemset and give a formal

5. Compare between HOLAP, ROLAP and MOLAP.

6. Describe the basic algorithm for decision tree induction.

(Long Answer Type Questions)

7. a) How is data warehouse different from a database?

b) What is the significance of a multi-dimensional data model

c) Suppose that a data warehouse consists of the three

i) Draw a star schema for the above warehouse.

ii) Starting with the base cuboid (month, doctor,

b) Discuss the different phases of FP-tree growth algorithm.

Illustrate the working of a FP-tree growth algorithm for the above

9. a) Define with suitable examples of each of the following data

b) What is the conceptual hierarchy? How many cuboids are

c) In real world data, tuples with missing values for some

10. a) What is clustering? What are the features of good cluster?

b) What do you mean by hierarchical clustering technique?

b) Describe PAM algorithm in brief.

c) Evaluate Information Gain and Gain Ratio with suitable

You might also like