0% found this document useful (0 votes)
375 views2 pages

BCS058

This document outlines the structure and content of a BTECH theory examination for Data Warehousing & Data Mining, scheduled for 2024-25. It includes various sections with questions covering key concepts such as data warehouse components, schema design, data pre-processing, and clustering algorithms. The exam consists of multiple-choice and descriptive questions, requiring students to demonstrate their understanding of the subject matter.

Uploaded by

vctmexamination
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
375 views2 pages

BCS058

This document outlines the structure and content of a BTECH theory examination for Data Warehousing & Data Mining, scheduled for 2024-25. It includes various sections with questions covering key concepts such as data warehouse components, schema design, data pre-processing, and clustering algorithms. The exam consists of multiple-choice and descriptive questions, requiring students to demonstrate their understanding of the subject matter.

Uploaded by

vctmexamination
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Printed Page: 1 of 2

Subject Code: BCS058/ BCAI053


0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0

BTECH
(SEM V) THEORY EXAMINATION 2024-25
DATA WAREHOUSING & DATA MINING
TIME: 3 HRS M.MARKS: 70

Note: Attempt all Sections. In case of any missing data; choose suitably.
SECTION A

1. Attempt all questions in brief. 2 x 07 = 14


Q no. Question CO Level
a. List the main components of a data warehouse. 1 K1
b. What is the difference between a star schema and a snowflake schema? 1 K1
c. What is the difference between a centralized data warehouse and a 2 K1
distributed data warehouse?
d. List the primary functionalities of data pre-processing. 2 K2
e. Name two statistical measures commonly used in large databases. 3 K2
f. How do statistical-based algorithms differ from distance-based 4 K1
algorithms in data mining?
g. What are the main types of OLAP servers? 5 K1
SECTION B

46
2. Attempt any three of the following: 40 07 x 3 = 07

5.
_3

16
Q no. Question CO Level
a. Given a dataset, determine whether a star schema or a snowflake schema 1 K2
P1

5.
would be more appropriate and justify your choice.

25
5D

b. Develop a schema design for a warehouse that stores e-commerce 2 K3

3.
transaction data.
P2

|4
c. Describe the process of binning for reducing noisy data. 3 K2
d. Compare and contrast the strengths and weaknesses of DBSCAN and 4 K3
Q

AM
OPTICS in density-based clustering.
e. Design a backup and recovery strategy for a data warehouse containing 5 K3
sensitive customer data.
57
9:

SECTION C
:5
10

3. Attempt any one part of the following: 07 x 1 = 07


Q no. Question CO Level
2 5

a. Examine the relationship between data warehouse components and the 1 K2


20

overall ETL (Extract, Transform, Load) process.


b. Break down the components of a warehouse database and explain their 1 K2
b-

interconnections.
Fe
5-

4. Attempt any one part of the following: 07 x 1 = 07


|0

Q no. Question CO Level


a. Explain the role of parallel processors and cluster systems in a data 2 K3
warehouse environment.
b. Apply the client/server computing model to optimize query processing in 2 K3
a large data warehouse.

1|Page
QP25DP1_340 | 05-Feb-2025 10:59:57 AM | 43.255.165.46
Printed Page: 2 of 2
Subject Code: BCS058/ BCAI053
0Roll No: 0 0 0 0 0 0 0 0 0 0 0 0 0

BTECH
(SEM V) THEORY EXAMINATION 2024-25
DATA WAREHOUSING & DATA MINING
TIME: 3 HRS M.MARKS: 70

5. Attempt any one part of the following: 07 x 1 = 07


Q no. Question CO Level
a. Use dimensionality reduction techniques to simplify a dataset with 3 K3
hundreds of features.
b. Given a dataset with missing values, demonstrate how you would clean 3 K3
it using imputation.

6. Attempt any one part of the following: 07 x 1 = 07


Q no. Question CO Level
a. Use DBSCAN to identify clusters in a spatial dataset and explain how it 4 K4
classifies noise points.
b. Apply the Apriori algorithm to generate association rules for a market 4 K4
basket dataset.

46
7. Attempt any one part of the following: 40 07 x 1 = 07

5.
Q no. Question CO Level
_3

16
a. Compare the functionalities of ROLAP, MOLAP, and HOLAP servers. 5 K2
P1

5.
b. Examine the differences between web mining, spatial mining, and 5 K2

25
5D

temporal mining in terms of their applications.

3.
P2

|4
Q

AM
57
9:
:5
10
2 5
20
b-
Fe
5-
|0

2|Page
QP25DP1_340 | 05-Feb-2025 10:59:57 AM | 43.255.165.46

You might also like