Data-mining online test
1. _______________ is a process of taking operational data from one or more sources and
mapping it, field by field, onto a new data structure in the data warehouse
Transformation
Cleansing.
Integration.
Scrubbing
2. A decision tree is a tree in which every node is either a ________________ or a
decision node
leaf node
Root node
Both a and b
Sub node
3. Height and width come under which type of data?
Finite
Discrete
Continuous
None of the above
4. What is data mining?
Used to find patterns by comparing large amounts of data.
Used to find patterns by comparing large amounts of data mainly for
statistically inclined users
Used for extracting and storing data, which allows easier reporting
Both a and b
5. ________________ stores data in a summarized version
Cube
Roll up
Both a and b
A mine
7. Summarization is a simple addition of values along one or more data dimensions
True
False
8. What is the nature of data quality?
The data should be accurate
The data could be stored according to data type.
The data should be timely.
Both a and c
9. Which of the following fields typically make use of data mining techniques?
Marketing
Government intelligence
Advertising
All of the above
11. Which stage of data mining involves preparation and collection of data?
Validation
Exploration
Both a and b
Collection
12. Which of the following can be closely associated with the Time Series algorithm?
Time
Speed
Both a and b
Forecasting
13. What is meant by discrete data?
One that allows only a finite set of values
One that allows real numbers only
Both a and b
One that allows float values only.
14. What is the Naive Bayes Algorithm used for?
Generate mining models
Estimating the probability of a class value during classification and prediction
Both a and b
To make decisions for reporting.
15. Which algorithm is used to find correlations among different attributes in a data
set?
Associative algorithm
Association algorithm
Time Series algorithm
Series algorithm
--------------------------------------------------
1. Why is an aggregate used in the dimensional model of a data warehouse?
To take the dimension and change its granularity
To retrieve data from an operational system
To store data in one operational system
All of the above
2. The active data warehouse architecture includes which of the following?
At least one data mart
Data that can be extracted from numerous internal and external sources
Near real-time updates
All of the above
3. What is data scrubbing?
A process to reject data from the data warehouse and to create the necessary indexes
A process to load the data in the data warehouse and to create the necessary indexes
A process to upgrade the quality of data after it is moved into a data warehouse
A process to upgrade the quality of data before it is moved into a data warehouse
Data-Warehousing online test, online practice test, exam, quiz
4. Which of the following statements is/are true about Data Warehouse?
Can be updated by end users
Contains numerous naming conventions and formats
Organized around important subject areas
Contains only current data
5. What is a data mart?
It is the collection of data in a data warehouse
It is the access layer of the data warehouse environment that is used to get
data out to the users
It is the process of extracting patterns from large data sets
It is the process of extracting data
Online Practice Test > Data-Warehousing (Total 10 questions) (Time spent 5:43)
6. A star schema has what type of relationship between a dimension table and a fact table?
Many-to-many
One-to-one
One-to-many
All of the above
7. An attempt to find a function that models the data with the least error is known as
Clustering
Regression
Association rule
Clustering
8. What is reconciled data?
Data stored in the various operational systems throughout the organization.
Current data intended to be the single source for all decision support systems.
Data stored in one operational system in the organization.
Data that has been selected and formatted for end-user support applications.
9. Fact tables are
Completely denormalized
Partially denormalized
Completely normalized
Partially normalized
10. In which approach to data warehousing is the transaction data partitioned into facts?
Dimensional approach
Normalized approach
Operational approach
None of the above