Abstract
Class imbalance is a common issue in medical diagnosis. Although standard radial basis function neural network (RBF-NN) has achieved remarkably high performance on balanced data, its ability to classify imbalanced data is still limited. So far as we know, cost-sensitive learning is an advanced imbalanced data processing method. However, few studies have focused on the combination of RBF-NN and cost sensitivity. From our knowledge, only one paper has proposed a cost-sensitive RBF-NN for software defect prediction. However, the authors implemented a fixed RBF-NN structure. In this paper, a novel cost-sensitive RBF-NN that optimizes structure and parameters simultaneously is proposed to handle medical imbalanced data. Genetic algorithm (GA) and improved particle swarm optimization (IPSO) are used to optimize the structure and parameters of cost-sensitive RBF-NN respectively, and the optimization of cost-sensitive RBF-NN based on dynamic structure is realized. A cost-sensitive function determined adaptively by the sample distribution as the objective function of RBF-NN, so that it can adapt to datasets with different sample distributions. Experimental results show that the proposed cost-sensitive RBF-NN outperforms other state-of-the-art representative algorithms for five imbalanced medical diagnostic datasets in term of accuracy and area under curve (AUC). It can improve the accuracy of medical diagnosis and reduce the error rate of medical decisions.
Article PDF
Avoid common mistakes on your manuscript.
References
J.S.M. Alneamy, et al., Utilizing hybrid functional fuzzy wavelet neural networks with a teaching learning-based optimization algorithm for medical disease diagnosis, Comput. Biol. Med. 112 (2019), 103348.
S. Shilaskar, A. Ghatol, P. Chatur, Medical decision support system for extremely imbalanced datasets, Inf. Sci. 384 (2017), 205–219.
X. Tao, et al., Affinity and class probability-based fuzzy support vector machine for imbalanced data sets, Neural Netw. 122 (2020), 289–307.
D. Gan, et al., Integrating TANBN with cost sensitive classification algorithm for imbalanced data in medical diagnosis, Comput. Ind. Eng. 140 (2020), 106266.
S. García, et al., Dynamic ensemble selection for multi-class imbalanced datasets, Inf. Sci. 445 (2018), 22–37.
S. Pouyanfar, S.-C. Chen, Automatic video event detection for imbalance data using enhanced ensemble deep learning, Int. J. Semant. Comput. 11 (2017), 85–109.
R.J. Kuo, et al., Integrating cluster analysis with granular computing for imbalanced data classification problem - a case study on prostate cancer prognosis, Comput. Ind. Eng. 125 (2018), 319–332.
Z.-L. Zhang, et al., Cost-Sensitive back-propagation neural networks with binarization techniques in addressing multi-class problems and non-competent classifiers, Appl. Soft Comput. 56 (2017), 357–367.
P. Domingos, MetaCost: a general method for making classifiers cost-sensitive, in Conference on Knowledge Discovery and Data Mining, San Diego, CA, USA, 1999.
C.-h. Tsai, L.-c. Chang, H.-c. Chiang, Forecasting of ozone episode days by cost-sensitive neural network methods, Sci. Total Environ. 407 (2009), 2124–2135.
S. Zhang, Cost-sensitive KNN classification, Neurocomput. 391 (2020), 234–242.
A. Ghodsi, D. Schuurmans, Automatic basis selection techniques for RBF networks, Neural Netw. 16 (2003), 809–816.
S. Yu, K. Wang, Y.-M. Wei, A hybrid self-adaptive particle swarm optimization–genetic algorithm–radial basis function model for annual electricity demand prediction, Energy Convers. Manag. 91 (2015), 176–185.
V. Fathi, G.A. Montazer, An improvement in RBF learning algorithm based on PSO for real time applications, Neurocomput. 111 (2013), 169–176.
S. Amini, M. Taki, A. Rohani, Applied improved RBF neural network model for predicting the broiler output energies, Appl. Soft Comput. 87 (2020), 106006.
P. Kumudha, R. Venkatesan, Cost-sensitive radial basis function neural network classifier for software defect prediction, Sci. World J. 2016 (2016), 1–20.
R. Mohammadi, S.M.T. Fatemi Ghomi, F. Zeinali, A new hybrid evolutionary based RBF networks method for forecasting time series: a case study of forecasting emergency supply demand time series, Eng. Appl. Artif. Intell. 36 (2014), 204–214.
Z.-Q. Li, et al., A proposed self-organizing radial basis function network for aero-engine thrust estimation, Aerospace Sci. Technol. 87 (2019), 167–177.
W. Jia, D. Zhao, L. Ding, An optimized RBF neural network algorithm based on partial least squares and genetic algorithm for classification of small sample, Appl. Soft Comput. 48 (2016), 373–384.
G.A. Montazer, D. Giveki, An improved radial basis function neural network for object image retrieval, Neurocomputing. 168 (2015), 221–233.
S. Yu, Y.-M. Wei, K. Wang, Provincial allocation of carbon emission reduction targets in China: an approach based on improved fuzzy cluster and Shapley value decomposition, Energy Policy. 66 (2014), 630–644.
C.-F. Tsai, et al., Under-sampling class imbalanced datasets by combining clustering analysis and instance selection, Inf. Sci. 477 (2019), 47–54.
N. Ofek, et al., Fast-CBUS: a fast clustering-based undersampling method for addressing the class imbalance problem, Neurocomputing. 243 (2017), 88–102.
W.-C. Lin, et al., Clustering-based undersampling in classimbalanced data, Inf. Sci. 409 (2017), 17–26.
B. Krawczyk, et al., Evolutionary undersampling boosting for imbalanced classification of breast cancer malignancy, Appl. Soft Comput. 38 (2016), 714–726.
M. Galar, et al., EUSBoost: enhancing ensembles for highly imbalanced data-sets by evolutionary undersampling, Pattern Recognit. 46 (2013), 3460–3471.
S. Cateni, V. Colla, M. Vannucci, A method for resampling imbalanced datasets in binary classification tasks for real-world problems, Neurocomputing. 135 (2014), 32–41.
N.V. Chawla, et al., SMOTE: synthetic minority over-sampling technique, J. Artif. Intell. Res. 16 (2002), 321–357.
J.A. Sáez, et al., SMOTE–IPF: addressing the noisy and borderline examples problem in imbalanced classification by a re-sampling method with filtering, Inf. Sci. 291 (2015), 184–203.
G. Douzas, F. Bacao, F. Last, Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE, Inf. Sci. 465 (2018), 1–20.
G. Douzas, F. Bacao, Geometric SMOTE a geometrically enhanced drop-in replacement for SMOTE, Inf. Sci. 501 (2019), 118–135.
X.W. Liang, et al., LR-SMOTE — an improved unbalanced data set oversampling based on K-means and SVM, Knowl. Based Syst. 196 (2020), 105845.
G.E. Batista, R.C. Prati, M.C. Monard, A study of the behavior of several methods for balancing machine learning training data, ACM SIGKDD Explor. Newsl. 6 (2004), 20–29.
V. López, et al., An insight into classification with imbalanced data: empirical results and current trends on using data intrinsic characteristics, Inf. Sci. 250 (2013), 113–141.
G. Haixiang, et al., Learning from class-imbalanced data: review of methods and applications, Expert Syst. Appl. 73 (2017), 220–239.
Y. Freund, R.E. Schapire, Experiments with a new boosting algorithm, in International Conference on Machine Learning, Citeseer, Bari, Italy. 1996, p. 148–156.
J.H. Friedman, Greedy function approximation: a gradient boosting machine, Ann. Stat. 29 (2001), 1189–1232.
J. Lin, et al., Accurate prediction of potential druggable proteins based on genetic algorithm and Bagging-SVM ensemble classifier, Artif. Intell. Med. 98 (2019), 35–47.
M. Hao, Y. Wang, S.H. Bryant, An efficient algorithm coupled with synthetic minority over-sampling technique to classify imbalanced PubChem BioAssay data, Anal. Chim. Acta. 806 (2014), 117–127.
V. Bolón-Canedo, A. Alonso-Betanzos, Ensembles for feature selection: a review and future trends, Inf. Fusion. 52 (2019), 1–12.
D. Fuqua, T. Razzaghi, A cost-sensitive convolution neural network learning for control chart pattern recognition, Expert Syst. Appl. 150 (2020), 113275.
H. Li, et al., Cost-sensitive sequential three-way decision modeling using a deep neural network, Int. J. Approx. Reason. 85 (2017), 68–78.
Ö.F. Arar, K. Ayan, Software defect prediction using cost-sensitive neural network, Appl. Soft Comput. 33 (2015), 263–277.
J. Park, I.W. Sandberg, Universal approximation using radialbasis-function networks, Neural Comput. 3 (1991), 246–257.
J.H. Holland, Genetic algorithms, Sci. Am. 267 (1992), 66–73.
V. López, et al., Cost-sensitive linguistic fuzzy rule based classification systems under the MapReduce framework for imbalanced big data, Fuzzy Sets Syst. 258 (2015), 5–38.
C.L. Castro, A.P. Braga, Novel cost-sensitive approach to improve the multilayer perceptron performance on imbalanced data, IEEE Trans. Neural Netw. Learn. Syst. 24 (2013), 888–899.
M. Hall, et al., The WEKA data mining software: an update, SIGKDD Explor. Newsl. 11 (2009), 10–18.
D. Veganzones, E. Severin, An investigation of bankruptcy prediction in imbalanced datasets, Decis. Support Syst. 112 (2018), 111–124.
Y. Feng, et al., Multi-strategy monarch butterfly optimization algorithm for discounted {0–1} knapsack problem, Neural Comput. Appl. 30 (2017), 3019–3036.
G.G. Wang, S. Deb, L.D.S. Coelho, Earthworm optimisation algorithm: a bio-inspired metaheuristic algorithm for global optimisation problems, Int. J. Bio-Inspired Comput. 12 (2015), 1–22.
G.G. Wang, et al., A new metaheuristic optimisation algorithm motivated by elephant herding behaviour, Int. J. Bio Inspired Comput. 8 (2017), 394–409.
G.G. Wang, Moth search algorithm: a bio-inspired metaheuristic algorithm for global optimization problems, Memet. Comput. 10 (2016), 151–164.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
This is an open access article distributed under the CC BY-NC 4.0 license (https://doi.org/creativecommons.org/licenses/by-nc/4.0/).
About this article
Cite this article
Wu, JC., Shen, J., Xu, M. et al. An Evolutionary Self-organizing Cost-Sensitive Radial Basis Function Neural Network to Deal with Imbalanced Data in Medical Diagnosis. Int J Comput Intell Syst 13, 1608–1618 (2020). https://doi.org/10.2991/ijcis.d.201012.005
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2991/ijcis.d.201012.005