Questions to study
See the MATLAB web documentation.
https://www.mathworks.com/help/stats/parallelcoords.html
A parallel coordinates plot visualizes high-dimensional data: each observation is
represented by the sequence of its coordinate values plotted against their coordinate indices.
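To make the definition concrete, here is a small Python sketch (not MATLAB's parallelcoords) that turns each observation into the polyline such a plot would draw. The function name to_polylines and the data values are made up for illustration.

```python
def to_polylines(observations):
    """Map each observation to a list of (coordinate_index, value) points.

    Indices start at 1 so they line up with the axis positions 1..d.
    """
    return [[(i, v) for i, v in enumerate(row, start=1)] for row in observations]


# Three hypothetical observations with four coordinates each.
data = [
    [5.1, 3.5, 1.4, 0.2],
    [7.0, 3.2, 4.7, 1.4],
    [6.3, 3.3, 6.0, 2.5],
]

polylines = to_polylines(data)
for line in polylines:
    print(line)  # one polyline per observation
```

Drawing every polyline on shared axes gives the plot: observations that behave alike trace similar paths, and a coordinate on which the groups overlap completely adds little separation.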
https://www.mathworks.com/help/phased/ug/receiver-operating-characteristic-roc-curves.html
Receiver Operating Characteristic (ROC) curves present graphical summaries of a detector's
performance. You can generate ROC curves using the rocpfa and rocsnr functions.
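As a rough illustration of what an ROC curve summarizes, the Python sketch below builds an empirical curve from detector scores and ground-truth labels. It does not reproduce rocpfa or rocsnr; the function name roc_points and the sample scores are assumptions for this note.

```python
def roc_points(scores, labels):
    """Return (false_positive_rate, true_positive_rate) pairs, one per threshold.

    labels: 1 = target present, 0 = target absent.
    A sample counts as a detection when its score >= threshold.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    points = [(0.0, 0.0)]  # threshold above every score: nothing is detected
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


# Hypothetical detector outputs and the true state for each sample.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4]
labels = [1, 1, 0, 1, 0, 0]

curve = roc_points(scores, labels)
for fpr, tpr in curve:
    print(f"false alarm rate {fpr:.2f}, detection rate {tpr:.2f}")
```

Lowering the threshold can only add detections and false alarms, so the points move from (0, 0) toward (1, 1). A better detector bends the curve toward the top-left corner.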
Binary or binomial classification is the task of classifying the elements of a given set into two
groups (predicting which group each one belongs to) on the basis of a classification rule.
It arises in contexts requiring a decision as to whether or not an item has some qualitative
property or some specified characteristic, for example a diagnostic test for the presence or
absence of a disease.
https://www.mathworks.com/matlabcentral/answers/333135-what-is-a-confusion-matrix
A confusion matrix, specifically a 2x2 one, summarizes the accuracy of your classifier. It
lists correct classifications as ‘true positives’ or ‘true negatives’, and incorrect classifications as
‘false positives’ or ‘false negatives’. From these four counts you can also derive the sensitivity,
specificity, positive predictive value, and other measures. This is particularly important in
diagnostic tests for the presence or absence of a particular condition or disease based on a single test.
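The measures named above follow directly from the four counts. A minimal Python sketch, with made-up labels (1 = condition present, 0 = absent) and a hypothetical helper confusion_counts:

```python
def confusion_counts(actual, predicted):
    """Count the four cells of a 2x2 confusion matrix."""
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if p == 1 and a == 1:
            tp += 1  # true positive: condition present and detected
        elif p == 1 and a == 0:
            fp += 1  # false positive: detected but not present
        elif p == 0 and a == 0:
            tn += 1  # true negative: absent and not detected
        else:
            fn += 1  # false negative: present but missed
    return tp, fp, tn, fn


# Hypothetical test results for ten patients.
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

tp, fp, tn, fn = confusion_counts(actual, predicted)
accuracy = (tp + tn) / (tp + fp + tn + fn)
sensitivity = tp / (tp + fn)  # share of actual positives that were found
specificity = tn / (tn + fp)  # share of actual negatives that were cleared
ppv = tp / (tp + fp)          # positive predictive value

print(tp, fp, tn, fn)
print(accuracy, sensitivity, specificity, ppv)
```

Accuracy alone can mislead when one class is rare, which is why the per-class rates are usually reported alongside it.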
Compare and Improve Classification Models
Click models in the history list to explore the results in the plots. Compare model performance
by inspecting results in the scatter plot and confusion matrix. Examine the percentage accuracy
reported in the history list for each model. See Assess Classifier Performance in Classification
Learner.
Select the best model in the history list and then try including and excluding different features
in the model. Click Feature Selection.
Try the parallel coordinates plot to help you identify features to remove. See if you can
improve the model by removing features with low predictive power. Specify predictors to include
in the model, and train new models using the new options. Compare results among the models
in the history list.
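Outside the app, the same idea (retrain with different predictor subsets and compare validated accuracy) can be sketched in Python. Everything here is an assumption for illustration: the nearest-centroid classifier, the leave-one-out scheme, the helper names, and the data, in which feature 0 separates the classes and feature 1 is large-scale noise.

```python
def fit_centroids(rows, labels):
    """Return {label: centroid} computed from the training rows."""
    groups = {}
    for row, y in zip(rows, labels):
        groups.setdefault(y, []).append(row)
    return {y: [sum(col) / len(members) for col in zip(*members)]
            for y, members in groups.items()}


def predict(model, row):
    """Pick the label whose centroid is closest (squared Euclidean distance)."""
    return min(model, key=lambda y: sum((a - b) ** 2 for a, b in zip(row, model[y])))


def loo_accuracy(rows, labels, features):
    """Leave-one-out accuracy using only the given feature indices."""
    correct = 0
    for i in range(len(rows)):
        train_rows = [[r[f] for f in features] for j, r in enumerate(rows) if j != i]
        train_labels = [y for j, y in enumerate(labels) if j != i]
        model = fit_centroids(train_rows, train_labels)
        held_out = [rows[i][f] for f in features]
        if predict(model, held_out) == labels[i]:
            correct += 1
    return correct / len(rows)


# Hypothetical data: [informative feature, noisy feature].
rows = [[1.0, 10.0], [1.2, -8.0], [0.8, 7.0], [1.1, -9.0],
        [3.0, -10.0], [3.2, 9.0], [2.9, -7.0], [3.1, 8.0]]
labels = [0, 0, 0, 0, 1, 1, 1, 1]

acc_both = loo_accuracy(rows, labels, features=[0, 1])
acc_informative = loo_accuracy(rows, labels, features=[0])
print("both features:", acc_both)
print("feature 0 only:", acc_informative)
```

Comparing the two printed accuracies plays the role of comparing entries in the history list.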
You can also try transforming features with PCA to reduce dimensionality.
See Feature Selection and Feature Transformation Using Classification Learner App.
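For intuition about what the PCA transform does, here is a bare-bones Python sketch that finds the first principal component by power iteration on the covariance matrix and projects the data onto it. It is not the app's implementation. The function name, the iteration count, and the toy data are assumptions, and the all-ones starting vector must not be orthogonal to the dominant direction.

```python
import math


def pca_first_component(rows, iterations=100):
    """Return (unit direction, projected scores) for the first principal component."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Sample covariance matrix of the centered data.
    cov = [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
           for a in range(d)]
    v = [1.0] * d  # assumed starting vector for power iteration
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores


# Toy data lying exactly on the line y = 2x, so one component captures everything.
points = [[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [4.0, 8.0]]
direction, scores = pca_first_component(points)
print(direction)
print(scores)
```

Here two correlated features collapse to a single score per observation, which is the dimensionality reduction the note refers to.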
To improve the model further, you can try changing classifier parameter settings in the
Advanced dialog box, and then train using the new options. To learn how to control model
flexibility, see Choose Classifier Options.
If feature selection, PCA, or new parameter settings improve your model, try training All model
types with the new settings. See if another model type does better with the new settings.
Training a model in Classification Learner consists of two parts:
Validated Model: Train a model with a validation scheme. By default, the app protects against
overfitting by applying cross-validation. Alternatively, you can choose holdout validation. The
validated model is visible in the app.
Full Model: Train a model on full data without validation. The app trains this model
simultaneously with the validated model. However, the model trained on full data is not visible
in the app. When you choose a classifier to export to the workspace, Classification Learner
exports the full model.
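The two-part scheme can be mimicked in a few lines of Python: cross-validation produces only an accuracy estimate, and a separate model fitted on all the data is the one you would keep. The one-feature threshold classifier, the fold assignment, and the data are illustrative assumptions, not what Classification Learner does internally.

```python
def train(xs, ys):
    """Fit a threshold classifier: the midpoint between the two class means.

    Assumes class 1 has the larger mean and both classes appear in ys.
    """
    mean0 = sum(x for x, y in zip(xs, ys) if y == 0) / ys.count(0)
    mean1 = sum(x for x, y in zip(xs, ys) if y == 1) / ys.count(1)
    return (mean0 + mean1) / 2


def predict(threshold, x):
    return 1 if x >= threshold else 0


def cross_validated_accuracy(xs, ys, k):
    """Estimate accuracy with k folds; fold f holds out samples f, f+k, f+2k, ..."""
    correct = 0
    for fold in range(k):
        train_x = [x for i, x in enumerate(xs) if i % k != fold]
        train_y = [y for i, y in enumerate(ys) if i % k != fold]
        model = train(train_x, train_y)  # never sees the held-out samples
        correct += sum(1 for i, (x, y) in enumerate(zip(xs, ys))
                       if i % k == fold and predict(model, x) == y)
    return correct / len(xs)


# Hypothetical one-feature data set.
xs = [1.0, 3.0, 1.2, 3.2, 0.8, 2.9, 1.1, 3.1]
ys = [0, 1, 0, 1, 0, 1, 0, 1]

validated_accuracy = cross_validated_accuracy(xs, ys, k=4)  # what you inspect
full_model = train(xs, ys)                                  # what you export
print(validated_accuracy, full_model)
```

The validated accuracy describes how well this kind of model generalizes. The full model is never scored on held-out data itself, so the estimate is borrowed from the fold models.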