Python Application Development Using Imbalanced-Learn
Python Application Development Using Imbalanced-Learn
Bily809
3248 views
bily809
Bioinformatics
Medical imaging: diseases versus healthy
Social sciences: prediction of academic dropout
Web services: Service Level Agreement violation prediction
Security services: fraud detection
Most classification algorithms will only perform optimally when the number of
samples of each class is roughly the same. Highly skewed datasets, where the
minority is heavily outnumbered by one or more classes, have proven to be a
challenge while at the same time becoming more and more common. One way of
addressing this issue is by re-sampling the dataset as to offset this imbalance
with the hope of arriving at a more robust and fair decision boundary than you
would otherwise.
(i) under-sampling,
Boostlog
(ii) isover-sampling,
an online community for developers
Sign in with GitHub.
who want to share ideas and grow each other.
(iii) combination of over- and under-sampling, and
Boostlog Sign in
(iv) ensemble learning methods.
Under-sampling
Over-sampling
Ensemble sampling
xix. EasyEnsemble
xx. BalanceCascade
Installation
imbalanced-learn is tested to work under Python 2.7, Python 3.5 and 3.6. The
dependency requirements are based on the last scikit-learn release:
scipy (>=0.13.3)
numpy (>=1.8.2)
scikit-learn (>=0.19.0)
Example
AUTHOR
Bily809
bily809
READ NEXT
Jan 25 2018
Jan 25 2018
Plink in Python
python
Margot Swift
margot_swift19 0