Online_MBA_AI(OMBAIML 301)
Extract insights from data
Data Analysis
Learning from labeled data
Decision Trees
When the model performs well on training data but poorly on test data
c(1, 2, 3)
[Link]()
[Link](column1, column2)
[Link]("package_name")
11
To clean the data
Inferential statistics
Values that are not recorded
Mean
selection
conversion
duplicates
True
False
True
2. What challenges might the company face when handling text data?
✅ Ambiguity in language
✅ High dimensionality of text data
3. Which preprocessing methods help improve the accuracy of sentiment analysis?
✅ Lemmatization
✅ Stop word removal
4. What types of insights can be gained from sentiment analysis of customer feedback?
✅ Identification of customer preferences
✅ Understanding product issues
5. Which of the following visualizations can be effective for displaying sentiment analysis results?
✅ Bar charts
✅ Line graphs
6. What is the primary goal of topic modeling in text analysis?
✅ To identify common themes in feedback
✅ To classify text into information
Online_MBA_ML(OMBAIML 302)_
1. Which of the following machine learning algorithm can be used for imputing missing values of
both categorical and continuous variables?
✅ KNN
2. Which of the following distance measure do we use in case of categorical variables in KNN?
✅ Hamming distance
3. Machine Learning is used to find what?
✅ Hidden patterns in dataset
4. What is Standard Deviation?
✅ A quantity expressing by how much the members of a group differ from the mean value for the
group
5. Which algorithm is commonly used for classification tasks in machine learning?
✅ Decision trees
6. What is Regression?
✅ A technique to determine the statistical relationship between two or more variables
7. What is the full form of SVR?
✅ Support vector regression
8. Clustering is a technique of which Machine Learning?
✅ Unsupervised
9. If there are datapoints that belong to more than one cluster, then which clustering technique
is/are used?
✅ Fuzzy
10. In which type of clustering, each data point either belongs to a cluster completely or not?
✅ Hard clustering
11. What do you mean by support(A)?
✅ Number of transactions containing A / Total number of transactions
12. What is the full form of MDP?
✅ Markov Decision Process
13. CNN carries out which operation, which is an element-wise product and sum between two
matrices?
✅ Convolution
14. Which layer simply removes all the negative values from the convolution?
✅ ReLU
15. Mobile phone face ID is an example of Deep Learning
✅ True
16. PCA is dimensionality reduction technique
✅ True
17. Random Forest is an example of bagging
✅ True
18. The fundamental unit of Neural Network is ___________
✅ Neuron
19. ______ are best recognized for their ability to find patterns present in images
✅ CNN
20. _______ is applied by the agent to decide the next action based on the current state.
✅ Policy/Strategy
2. Choose applications that utilize machine learning:
✅ Weather forecasting
✅ Language translation
(These are explicitly mentioned in the paragraph as ML applications. Simple calculator apps and
alarm clocks are not.)
3. Identify the fields that are used in machine learning:
✅ Autonomous vehicles
✅ Music composition
(The paragraph discusses autonomous vehicles. Music composition can involve ML in generative
models, although not explicitly mentioned in the paragraph, it's still a valid application compared to
the others.)
4. Choose the correct applications of machine learning that are used in healthcare:
✅ Medical image analysis
✅ Disease prediction
(Both are explicitly listed in the healthcare section of the paragraph.)
5. Identify the tasks that can be performed using natural language processing (NLP):
✅ Sentiment analysis
✅ Speech recognition
(Sentiment analysis is a direct NLP task. Speech recognition is mentioned in ML use, and often
overlaps with NLP in speech-to-text systems.)
6. Select all the applications of machine learning in recommendation systems:
✅ Music recommendation
✅ Movie recommendation
Online_MBA_PAP(OMBAIML 303)
1. Identify the correct way to define a set in Python:
✅ set = {1, 2, 3}
2. Identify the correct way to access a value in a dictionary by its key in Python:
✅ dictionary[key]
3. Determine what statistics provided by [Link]() function in Pandas:
✅ Descriptive
4. Visual data exploration and exploratory statistical analysis are part of which step?
✅ Data Analysis
5. Which regression model is used when the dependent variable is categorical?
✅ Logistic Regression
6. Which of the following is not a method to prevent overfitting in decision trees?
✅ Regularization
7. What task is SVM algorithm used for?
✅ Classification
8. What does Ensemble methods do?
✅ combine multiple classifier to improve the performance of a model.
9. Major voting process is considered to be from the following:
✅ Bagging
10. For multiple trees to be effective, how should the trees be?
✅ Independent
11. What does k-NN stand for?
✅ k-Nearest Neighbor
12. Association Rules are used to determine:
✅ Discovering interesting relationships between variables
13. The support of an itemset in Association Rules relates to:
✅ The proportion of transactions containing the itemset out of all transactions
14. What major challenges do interconnected databases and regular data updates pose?
✅ Negative impact on data quality
15. Decision Trees can handle both numerical and categorical data.
✅ True
16. A social network learner usually consists of a local model using only node-specific
characteristics.
✅ True
17. Conventional approach to analytics in social network learning doesn't consider network
connections.
✅ True
18. Growing a decision tree involves recursively splitting data based on:
✅ Information Gain
19. A decision tree model with too many levels or branches suffers from:
✅ Overfitting
20. In SVM, the points closest to the hyperplane are called:
✅ Support Vectors
2. Determine the libraries or frameworks in Python that facilitate event-driven programming
✅ Tkinter
✅ asyncio
(❌ Requests and NumPy are not event-driven libraries)
3. Dissect the advantages of event-driven programming in Python?
✅ Responsive and interactive applications
✅ Modular and reusable code
(❌ Sequential execution and database management are not core advantages mentioned)
4. Determine which type of applications can benefit from event-driven programming in Python?
✅ Networking servers
✅ GUI interfaces (implied from the paragraph; choose Networking servers + GUI interfaces if asked)
(Note: Command-line utilities and simulations are not event-driven in nature)
5. Determine which features are associated with event-driven programming in Python?
✅ Asynchronous execution
✅ Callback functions
(❌ Object-oriented programming and exception handling are general concepts, not specific to
event-driven)
6. Evaluate how are events typically triggered in event-driven programming?
✅ Through user actions
✅ Automatically by the operating system
Online_MBA_SQT(OMBAIML 304)
1. Unit of measurement for variance of heights in cm
✅ cm²
2. Type of data for heights of the trees
✅ Continuous
3. Purpose of multiple regression
✅ To predict scores on a dependent variable from scores on multiple independent variables
4. Attribute with two classes is called
✅ Dichotomy
5. Data required for calculation of measures of association
✅ Attributes
6. Type of data for gender of students
✅ Attribute
7. Probability of selecting a king from a deck of 52 cards
✅ 0.076 (4 kings / 52 cards = 0.0769)
8. Why saying 10°C is twice as hot as 5°C is incorrect
✅ For interval data, differences make sense, but ratios do not
9. Maximax criterion decision-making environment
✅ Conditions of uncertainty, the decision maker being perfectly optimistic
10. Probability of getting a heart from 52 cards
✅ 0.25 (13 hearts / 52 cards = 0.25)
11. Mean and median for: 1, 7, 3, 3, 6, 4
✅ 4 and 3.5
(Sorted: 1, 3, 3, 4, 6, 7 → mean = 24/6 = 4; median = (3+4)/2 = 3.5)
12. Measure that is most frequent number in dataset
✅ Mode
13. Mean of 6 numbers is 17 → Sum = ?
✅ 102 (Mean × count = 17 × 6 = 102)
14. Arrival follows
✅ Poisson distribution
15. Programming that allows more than one objective
✅ Goal
16. Transportation problem requires equal
✅ Units supplied; units demanded
17. Standard normal distribution has
✅ Mean equal 0 and variance equal 1
18. Median divides into four equal parts
✅ False
(Median divides into two equal parts; quartiles divide into four)
19. Hungarian Method is faster and efficient
✅ True
20. Pure strategy in two-person, zero-sum game
✅ True
1- A&B
2- B&D
3- B&C
4- A&B
5- C&D
Online_MBA_BI(OMBAIML 305)_
1. Where is data stored, retrieved & updated in a data warehouse?
✅ OLAP
2. Which metadata maps core warehouse metadata to business concepts?
✅ Application-level metadata
3. Expansion of OLAP:
✅ Online Analytical Processing
4. What does data mining use to convert large collections of data into useful output?
✅ Algorithms and various techniques
5. Data mining tool based on GUI and uses R stats:
✅ Rattle
6. Primary means of creating a dimension:
✅ Dimension Wizard
7. Processing of data about customers and their relationship with the enterprise:
✅ CRM analytics
8. Performance management tool:
✅ Balanced Scorecard
9. How to remove noise and inconsistency in data:
✅ Data cleaning
10. Style of network communication where client initiates request:
✅ Pull coding
11. Device used to transfer an image in visualization:
✅ Scanner
12. Visualization technique showing customer perception of brands:
✅ Perceptual mapping
13. How to design effective tables and charts for data visualization:
✅ Data-ink ratio
14. Use of BI service:
✅ Flexible self-service analysis
15. What processes data through use of nodes:
✅ Neural networks
16. A calculation in ________ is a bunch of heuristics and estimations:
✅ Data mining
17. The class under study in Data Characterization is known as:
✅ Target
18. To harness a successful BI project, companies must plan:
✅ True
19. A data source may not be the initial location where data is born:
✅ True
20. Successful information mining supports business strategy and operations:
✅ True
2. What is the requirement to use BI softwares?
✅ for transparent analytics
✅ reporting process
3. Which are data management solutions?
✅ Business intelligence
✅ Business Analytics
These are explicitly mentioned as data management solutions used to collect and analyze data.
4. Which kind of data could be used for analysis?
✅ organized
✅ unstructured
The paragraph says: "The data used for analysis can be organised or unstructured."
5. What is the use of statistics in BI?
✅ to analyze raw information
✅ to deliver insights for future
It says BI uses statistics "to analyze raw information, and deliver insights for making better future
decisions."
6. In which format business intelligence present the data?
✅ charts
✅ dashboard
Online_MBA_R programming(OMBAIML 306)
1. What does R stand for in the context of the R programming language?
✅ NA (R does not officially stand for anything; it was partly named after the first letters of its
authors Robert and Ross)
2. Which of the following is a primary use of R?
✅ Statistical analysis
3. What is the command to install a package in R?
✅ [Link]("package_name")
4. Which of the following data structures is NOT available in R?
✅ tree
5. How do you create a vector in R?
✅ c(1,2,3)
6. What function is used to view the structure of a dataset in R?
✅ str()
7. Which of the following is a way to read a CSV file into R?
✅ [Link]("[Link]")
8. In R, what does the function summary() do?
✅ It provides a summary of statistical measures for an object.
9. What is the primary function of the ggplot2 package in R?
✅ data visualization
10. Which of the following is a logical operator in R?
✅ &&
11. What is a matrix in R?
✅ A collection of elements of the same type arranged in a rectangular format
12. How do you create a matrix in R?
✅ matrix(c(1, 2, 3, 4), nrow=2, ncol=2)
13. What function is used to access a specific element in a matrix in R?
✅ [row, column] notation
14. Which function is used to transpose a matrix in R?
✅ t(matrix)
15. To create a sequence of numbers from 1 to 10 in R, you would use the function __________.
✅ seq(1,10)
16. The function __________ is used to read a CSV file into R.
✅ [Link]()
17. In R, the __________ function is used to calculate the mean of a numeric vector.
✅ mean()
18. R packages are collections of R functions, data, and documentation bundled together
✅ True
19. The function [Link]("package_name") is used to load an already installed package
into the R environment.
✅ False
*(Correct function: library(package_name))
20. R scripts are plain text files that contain R code and can be executed in the R environment.
✅ True
2. What is the purpose of the JOIN clause in SQL?
✅ To combine rows from two or more tables based on a related column
✅ (None of the other listed options are correct; only one correct option applies here)
3. Which of the following SQL statements correctly groups data to calculate the total quantity sold
for each product?
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders GROUP BY Product;
✅ SELECT Product, SUM(Quantity) AS TotalQuantity FROM Orders ORDER BY TotalQuantity;
(Though missing GROUP BY, this one could still be a possible second choice if GROUP BY is implied.
However, the first is the most accurate.)
4. Which R function is used to read CSV data into a data frame?
✅ [Link]()
✅ read.csv2() (Another variant that uses semicolon separators common in European data formats)
5. What is the output of the following SQL query?
✅ Total revenue generated per city
✅ (Only one correct answer; others do not match the query's intent.)
6. Which dplyr functions did Mr. Sinha use in R to summarize data?
✅ group_by()
✅ summarise()