11) What is the difference between Classification and Clustering?
PARAMENTE
R CLASSIFICATION CLUSTERING
Type used for supervised learning used for unsupervised learning
process of classifying the input grouping the instances based on
instances based on their their similarity without the help of
Basic corresponding class labels class labels
it has labels so there is need of
training and testing dataset for there is no need of training and
Need verifying the model created testing dataset
more complex as compared to less complex as compared to
Complexity clustering classification
k-means clustering algorithm,
Logistic regression, Naive Fuzzy c-means clustering
Example Bayes classifier, Support vector algorithm, Gaussian (EM)
Algorithms machines etc. clustering algorithm etc.
12) What are the types of Machine Learning and what is the difference between Supervised
Machine Learning and Unsupervised Machine learning?
Parameters Supervised machine learning Unsupervised machine learning
technique technique
Process In a supervised learning model, input and In unsupervised learning model, only input
output variables will be given. data will be given
Input Data Algorithms are trained using labeled Algorithms are used against data which is
data. not labeled
Algorithms Used Support vector machine, Neural network, Unsupervised algorithms can be divided
Linear and logistics regression, random into different categories: like Cluster
forest, and Classification trees. algorithms, K-means, Hierarchical
clustering, etc.
Computational Supervised learning is a simpler method. Unsupervised learning is computationally
Complexity complex
Use of Data Supervised learning model uses training Unsupervised learning does not use output
data to learn a link between the input and data.
the outputs.
Accuracy of Results Highly accurate and trustworthy method. Less accurate and trustworthy method.
Real Time Learning Learning method takes place offline. Learning method takes place in real time.
Number of Classes Number of classes is known. Number of classes is not known.
Main Drawback Classifying big data can be a real You cannot get precise information
challenge in Supervised Learning. regarding data sorting, and the output as
data used in unsupervised learning is labeled
and not known.