Linear Classifiers Adapted to Missing Values

This repository compares linear classifiers adapted to handle missing values: the Perceptron, Logistic Regression, and Linear Discriminant Analysis (LDA). Two main approaches to adapting them to missing data are considered: imputation methods and pattern-by-pattern methods.

Approaches to Handling Missing Values

1. Imputation Methods

- Constant imputation: replace each missing value with a fixed value (e.g., 0).
- Iterative imputation: estimate missing values with IterativeImputer from scikit-learn.
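
The two imputation strategies above can be sketched as scikit-learn pipelines. This is a minimal illustration, not the notebook's code: the toy data, the 20% missingness rate, and the choice of LogisticRegression as the downstream classifier are assumptions made for the example.

```python
import numpy as np
from sklearn.experimental import enable_iterative_imputer  # noqa: F401 (needed to expose IterativeImputer)
from sklearn.impute import IterativeImputer, SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

rng = np.random.default_rng(0)

# Hypothetical toy data: 200 samples, 3 Gaussian features, linear labels.
X = rng.normal(size=(200, 3))
y = (X @ np.array([1.0, -1.0, 0.5]) > 0).astype(int)

# Remove about 20% of the entries completely at random (an assumption).
mask = rng.random(X.shape) < 0.2
X_na = X.copy()
X_na[mask] = np.nan

# Constant imputation: every missing entry is replaced by 0.
constant_clf = make_pipeline(
    SimpleImputer(strategy="constant", fill_value=0.0),
    LogisticRegression(),
)

# Iterative imputation: each feature is modelled from the other features.
iterative_clf = make_pipeline(
    IterativeImputer(random_state=0),
    LogisticRegression(),
)

constant_clf.fit(X_na, y)
iterative_clf.fit(X_na, y)

# Training accuracy only; a real comparison would use held-out data.
constant_acc = constant_clf.score(X_na, y)
iterative_acc = iterative_clf.score(X_na, y)
```

Wrapping the imputer and the classifier in one pipeline keeps the imputation fitted on the training data only, which matters once a train/test split is introduced.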

2. Pattern-by-Pattern Methods

These methods decompose the Bayes classifier by missingness pattern and train a separate classifier for each observed pattern.
In the LDA setting, this decomposition makes it possible to leverage the distributional assumptions:
- When missingness is not informative, the estimates can average over all observed data rather than one pattern at a time.
- When missingness is informative, estimation can be restricted to the most frequently observed patterns.
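
As a rough illustration of the generic pattern-by-pattern idea, the sketch below fits one logistic regression per missingness pattern, using only the coordinates observed in that pattern. The class name PatternByPatternClassifier, the majority-class fallback, and the toy data are hypothetical choices for this example; it does not use the LDA-specific assumptions described above and is not taken from the notebook.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


class PatternByPatternClassifier:
    """One logistic regression per missingness pattern (illustrative sketch)."""

    def fit(self, X, y):
        # Fallback for patterns never seen during training: majority class.
        self.default_ = int(np.bincount(y).argmax())
        self.models_ = {}
        patterns = np.isnan(X)
        for pattern in np.unique(patterns, axis=0):
            rows = (patterns == pattern).all(axis=1)
            observed = ~pattern
            y_p = y[rows]
            if observed.any() and np.unique(y_p).size == 2:
                model = LogisticRegression().fit(X[rows][:, observed], y_p)
            else:
                # No observed feature, or a single class: constant prediction.
                model = int(np.bincount(y_p).argmax())
            self.models_[pattern.tobytes()] = model
        return self

    def predict(self, X):
        patterns = np.isnan(X)
        y_pred = np.full(X.shape[0], self.default_)
        for pattern in np.unique(patterns, axis=0):
            rows = (patterns == pattern).all(axis=1)
            model = self.models_.get(pattern.tobytes(), self.default_)
            if isinstance(model, int):
                y_pred[rows] = model
            else:
                y_pred[rows] = model.predict(X[rows][:, ~pattern])
        return y_pred


# Hypothetical toy data with 20% of entries missing completely at random.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 3))
y = (X @ np.array([1.0, -1.0, 0.5]) > 0).astype(int)
X_na = np.where(rng.random(X.shape) < 0.2, np.nan, X)

clf = PatternByPatternClassifier().fit(X_na, y)
pred = clf.predict(X_na)
```

With d features there can be up to 2^d patterns, so the number of models grows quickly with dimension; this is one reason to restrict estimation to frequent patterns or to share parameters across patterns.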

Experimental Analysis

Several experiments analyze:

- Convergence rates and robustness to misspecification of the joint distribution and of the missingness model.
- The impact of correlation between input covariates.
- The effects of signal-to-noise ratio, missingness probability, and dimensionality.
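
To make the experimental factors concrete, here is one possible way to simulate data where they can be varied. Everything in it is an assumption for illustration: the function name simulate, the two-class Gaussian model with equicorrelated covariance, the use of the class-mean shift as a stand-in for signal-to-noise ratio, and missingness drawn completely at random. The actual experimental setup is described in the paper and the notebook.

```python
import numpy as np


def simulate(n, d, rho, p_miss, mean_shift, seed=0):
    """Two-class Gaussian data with missing entries (hypothetical generator).

    n: sample size, d: dimension, rho: pairwise feature correlation,
    p_miss: per-entry missingness probability, mean_shift: class separation.
    """
    rng = np.random.default_rng(seed)
    # Equicorrelated covariance: 1 on the diagonal, rho elsewhere.
    cov = np.full((d, d), rho) + (1.0 - rho) * np.eye(d)
    y = rng.integers(0, 2, size=n)
    # Class means at +mean_shift (class 1) or -mean_shift (class 0) in every coordinate.
    means = np.where(y[:, None] == 1, mean_shift, -mean_shift) * np.ones((n, d))
    X = means + rng.multivariate_normal(np.zeros(d), cov, size=n)
    # Missing completely at random: each entry is dropped independently.
    mask = rng.random((n, d)) < p_miss
    X_na = np.where(mask, np.nan, X)
    return X_na, y, mask


X_na, y, mask = simulate(n=500, d=4, rho=0.5, p_miss=0.3, mean_shift=1.0)
```

Sweeping n, d, rho, p_miss, or mean_shift one at a time while holding the others fixed gives a simple grid for comparing classifiers.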

Reference

For more details, refer to the paper:
https://arxiv.org/abs/2405.09196

Repository: AngelReyero/classification_NA
