An Ensemble of Statistical Metadata and CNN Classification of Class Imbalanced Skin Lesion Data

Sachin Nayak, Shweta Vincent, Sumathi K, Om Prakash Kumar, Sameena Pathan


Skin Cancer is one of the most widely present forms of cancer. The correct classification of skin lesions as malignant or benign is a complex process that has to be undertaken by experienced specialists. Another major issue of the class imbalance of data causes a bias in the results of classification. This article presents a novel approach to the usage of metadata of skin lesions' images to classify them. The usage of techniques addresses the problem of class imbalance to nullify the imbalances. Further, the use of a convolutional neural network (CNN) is proposed to fine-tune the skin lesion data classification. Ultimately, it is proven that an ensemble of statistical metadata analysis and CNN usage would result in the highest accuracy of skin color classification instead of using the two techniques separately.

Full Text:



Siegel, R. L.; Miller, K. D.; Fedewa, S. A.; Ahnen, D. J; Meester, R. G. S.; Barzi, A.; Jemal, A., "Cancer statistics" CA: A cancer journal for Clinicians, 67(1), 7–30, 2017.

Apalla, Z.; Nashan, D.; Weller, R. B.; Castellsague´, X. , “Skin Cancer: Epidemiology, Disease Burden, Pathophysiology, Diagnosis, and Therapeutic Approaches", Dermatology and Therapy, 7(1), 5–19, 2017.

Pai, K.; Giridharan, A., "Convolutional Neural Networks for classifying skin lesions", Proceedings of TENCON 2019 IEEE conference, 1794-1796, 2019.

Mikołajczyk, A.; Grochowski, M., "Data augmentation for improving deep learning in image classification problem", Proceedings of 2018 International Interdisciplinary PhD Workshop (IIPhDW), 117-122, 2018.

Harangi, B., "Skin lesion classification with ensembles of deep convolutional neural networks", Journal of Biomedical Informatics, 86, 25-32, 2018.

Ayan, E.; Ünver, H. U., "Data Augmentation Importance for Classification of Skin Lesions via Deep Learning", Proceedings of 2018 Electric Electronics, Computer Science, Biomedical Engineering Meeting(EBBT), 10-15, 2018.

Gessert, N.; Sentker, T.; Madesta, F.; Schmitz, R.; Kniep, H.; Baltruschat, I.; Werner, R.; Schlaefer, A., "Skin Lesion Classification Using CNNs with Patch-Based Attention and Diagnosis-Guided Loss Weighting", IEEE Transactions on Biomedical Engineering , 1-1, 99, 2019.

Hosny, K. M.; Kassen, M. A.; Foaud, M. M., "Classification of skin lesions using transfer learning and augmentation with Alex-net", PLOS One, 17-20, 2019.

Hameed, N.; Shabut, A. M.; Hossain, M. A., "Multi-Class Skin Diseases Classification Using Deep Convolutional Neural Network and Support Vector Machine", Proceedings of 12th International Conference on Software, Knowledge, Information Management and Applications (SKIMA), 23-30, 2019

Albahar, M. A. , "Skin Lesion Classification Using Convolutional Neural Network With Novel Regularizer", IEEE Access, 7, 2019.

Hekler, A.; Utikal, J. S.; Enk, A. H., "Superior skin cancer classification by the combination of human and artificial intelligence", European Journal of Cancer, 120, 114-121, 2019.

Kittler, H.; Pehamberger, H.; Wolff, K.; Binder, M., "Diagnostic accuracy of Dermatoscopy", Lancet Oncology, 3(3), 159-165, 2002.

Maragoudakis, M.; Maglogiannis, I., "Skin lesion diagnosis from images using novel ensemble classification techniques," in Information Technology and Applications in Biomedicine (ITAB), 2010 10th IEEE International Conference , 1–5, 2010.

Kawahara, J.; Hamarneh, G., "Multi-resolution-tract CNN with hybrid pretrained and skin-lesion trained layers," in International Workshop on Machine Learning in Medical Imaging. Springer, 164–171, 2016.

Tschandl, P.; Rosendahl, C.; Kittler, H., "The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions," Scientific Data, 5, 180161, 2018.

Shapiro, S. S.; Wilk, M. B., "An analysis of variance test for normality (complete samples). Biometrika, 52(3–4), 591–611, 1965.

Ralph, B.; Cureton, D. E. E., "Test of Normality Against Skewed Alternatives", Psychological Bulletin, 78, 262-265, 1972.

Stephens, M. A., "EDF Statistics for Goodness of Fit and Some Comparisons". Journal of the American Statistical Association. , 69: 730–737, 1974.

Kruskal, W. H.; Wallis, W. A., "Use of ranks in one-criterion variance analysis". Journal of American Statistics Association, 583–621, 1952.

Nachar, N., "The Mann-Whitney U: A test for assessing whether two independent samples come from the same distribution", Tutorials in Quantitative Methods for Psychology, 4(1), 13-20, 2008.

Scheff, S. W., "Chapter 8- Nano-parametric Statistics", Fundamental Statistical Principles for the Neurobiologist-A Survival Guide, 157-182, 2016.

Lan, K. K. G.; Soo, Y.; Siu, C.; Wang, M., "The use of Weighted Z-tests in Medical Research", Journal of Biopharmaceutical Statistics, 15(4), 625-639, 2005.

Yusop, H.; Yeng, F. F.; Jumadi, A.; Mahadi, A.; Ali, M. N.; Johari, N., "The Effectiveness of Excellence Camp: A study on Paired Sample", Procedia Economics and Finance, 31, 453-461, 2015.

Abdi, H.; Williams, L. J., "Tukey’s Honestly Significant Difference (HSD) Test”, Encyclopedia of Research Design., 2010.

Fraiman, D.; Fraiman, R., “An ANOVA approach for statistical comparisons of brain networks”, Scientific Repository, 8, 4746, 2018.


  • There are currently no refbacks.

International Journal of Electronics and Telecommunications
is a periodical of Electronics and Telecommunications Committee
of Polish Academy of Sciences

eISSN: 2300-1933