Deep learning with SMOTE techniques for improved skin lesion classification on unbalanced data

Mustafa Al-Asadi, Adem Alpaslan Altun


Skin cancer has become a major public health concern around the world, with an increasing incidence in recent decades. The morphological characteristics of skin lesions are thought to be an important component of skin cancer diagnosis and early detection. Thus, with rapid advances in image classification, more emphasis has been placed on computer-aided diagnosis (CAD) of skin lesions according to their morphological features. However, small datasets or an imbalance of skin cancer datasets are the two most important issues that can hinder the success of skin cancer detection. This paper introduces a method for dealing with class imbalance and data scarcity that is based on the Synthetic Minority Oversampling Technique (SMOTE). The improved images were then used to train the Deep Learning Convolutional Neural Network (DLCNN) model. The proposed data augmentation technique is used to generate a new skin dataset for the HAM10000 dataset using dermoscopic images of seven skin lesion classes. According to the empirical results, the improved strategy proposed in this study has statistically significant effects on improving performance with respect to accuracy (85.99%), precision (90%), recall (88%), and F1-score (88%). Moreover, the proposed classification strategy It outperforms some of the techniques used to balance melanoma detection data.


Skin lesions, Dermatoscopy, Deep convolutional network, Skin Cancer, MNIST: HAM10000, Data Augmentation, Python

Full Text:



Kittler, H., et al., Diagnostic accuracy of dermoscopy. The lancet oncology, 2002. 3(3): p. 159-165.

Saladi, R.N. and A.N. Persaud, The causes of skin cancer: a comprehensive review. Drugs of Today, 2005. 41(1): p. 37-54.

Kuschal, C., et al., Skin cancer in organ transplant recipients: effects of immunosuppressive medications on DNA repair. Experimental dermatology, 2012. 21(1): p. 2-6.

Saladi, R., T. Nektalova, and J. Fox, Induction of skin carcinogenicity by alcohol and ultraviolet light. Clinical and Experimental Dermatology: Clinical dermatology, 2010. 35(1): p. 7-11.

Rosendahl, C., et al., Diagnostic accuracy of dermatoscopy for melanocytic and nonmelanocytic pigmented lesions. Journal of the American academy of dermatology, 2011. 64(6): p. 1068-1073.

Al-Asadi, M.A. and S. Tasdemİr, Medical Image Analysis Using Machine Learning Techniques: A Systematic Review. Machine Learning and Deep Learning in Efficacy Improvement of Healthcare Systems, 2022: p. 137-153.

Maiti, A. and B. Chatterjee, Improving detection of Melanoma and Naevus with deep neural networks. Multimedia Tools and Applications, 2020. 79(21): p. 15635-15654.

Li, K.M. and E.C. Li, Skin lesion analysis towards melanoma detection via end-to-end deep learning of convolutional neural networks. arXiv preprint arXiv:1807.08332, 2018.

Tschandl, P., C. Sinz, and H. Kittler, Domain-specific classification-pretrained fully convolutional network encoders for skin lesion segmentation. Computers in biology and medicine, 2019. 104: p. 111-116.

Rezvantalab, A., H. Safigholi, and S. Karimijeshni, Dermatologist level dermoscopy skin cancer classification using different deep learning convolutional neural networks algorithms. arXiv preprint arXiv:1810.10348, 2018.

Brinker, T.J., et al., Deep neural networks are superior to dermatologists in melanoma image classification. European Journal of Cancer, 2019. 119: p. 11-17.

Zhang, J., et al., Attention residual learning for skin lesion classification. IEEE transactions on medical imaging, 2019. 38(9): p. 2092-2103.

Pacheco, A.G. and R.A. Krohling, The impact of patient clinical information on automated skin cancer detection. Computers in biology and medicine, 2020. 116: p. 103545.

Hosny, K.M., M.A. Kassem, and M.M. Foaud, Classification of skin lesions using transfer learning and augmentation with Alex-net. PloS one, 2019. 14(5): p. e0217293.

Ratul, M.A.R., et al., Skin lesions classification using deep learning based on dilated convolution. BioRxiv, 2020: p. 860700.

Al-Asadi, M.A. and S. Tasdemír, Empirical comparisons for combining balancing and feature selection strategies for characterizing football players using FIFA video game system. IEEE Access, 2021. 9: p. 149266-149286.

Kassani, S.H. and P.H. Kassani, A comparative study of deep learning architectures on melanoma detection. Tissue and Cell, 2019. 58: p. 76-83.

Jackson, P.T., et al. Style augmentation: data augmentation via style randomization. in CVPR workshops. 2019.

Shorten, C. and T.M. Khoshgoftaar, A survey on image data augmentation for deep learning. Journal of big data, 2019. 6(1): p. 1-48.

Abayomi-Alli, O.O., et al., Malignant skin melanoma detection using image augmentation by oversamplingin nonlinear lower-dimensional embedding manifold. Turkish Journal of Electrical Engineering and Computer Sciences, 2021. 29(8): p. 2600-2614.

Ali, A.-R., et al., A machine learning approach to automatic detection of irregularity in skin lesion border using dermoscopic images. PeerJ Computer Science, 2020. 6: p. e268.

Chen, J., et al., Feature selection for text classification with Naïve Bayes. Expert Systems with Applications, 2009. 36(3): p. 5432-5435.

Shen, S., et al., A Low-Cost High-Performance Data Augmentation for Deep Learning-Based Skin Lesion Classification. BME Frontiers, 2022. 2022.

Le, D.N., et al., Transfer learning with class-weighted and focal loss function for automatic skin cancer classification. arXiv preprint arXiv:2009.05977, 2020.

Pham, T.-C., et al., Improving skin-disease classification based on customized loss function combined with balanced mini-batch logic and real-time image augmentation. IEEE Access, 2020. 8: p. 150725-150737.

Collaboration, T.I.S.I. Skin Cancer MNIST: HAM10000 dataset. 2016 11/02/2022]; Available from:

Masood, A., A. Al-Jumaily, and K. Anam. Self-supervised learning model for skin cancer diagnosis. in 2015 7th international IEEE/EMBS conference on neural engineering (NER). 2015. IEEE.

Mishra, S., H. Imaizumi, and T. Yamasaki. Interpreting fine-grained dermatological classification by deep learning. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 2019.

Esteva, A., et al., Dermatologist-level classification of skin cancer with deep neural networks. nature, 2017. 542(7639): p. 115-118.

Nasr-Esfahani, E., et al. Melanoma detection by analysis of clinical images using convolutional neural network. in 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 2016. IEEE.

Lopez, A.R., et al. Skin lesion classification from dermoscopic images using deep learning techniques. in 2017 13th IASTED international conference on biomedical engineering (BioMed). 2017. IEEE.

Perez, F., S. Avila, and E. Valle. Solo or ensemble? choosing a cnn architecture for melanoma classification. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 2019.

Kawahara, J., et al., Seven-point checklist and skin lesion classification using multitask multimodal neural nets. IEEE journal of biomedical and health informatics, 2018. 23(2): p. 538-546.

Polat, K. and K.O. Koc, Detection of skin diseases from dermoscopy image using the combination of convolutional neural network and one-versus-all. Journal of Artificial Intelligence and Systems, 2020. 2(1): p. 80-97.

Rahman, Z., et al., An approach for multiclass skin lesion classification based on ensemble learning. Informatics in Medicine Unlocked, 2021. 25: p. 100659.

Wang, Q. and R.K. Ward, Fast image/video contrast enhancement based on weighted thresholded histogram equalization. IEEE transactions on Consumer Electronics, 2007. 53(2): p. 757-764.

Ahmad, B., et al., Improving Skin Cancer Classification Using Heavy-Tailed Student T-Distribution in Generative Adversarial Networks (TED-GAN). Diagnostics, 2021. 11(11): p. 2147.

Abdelhalim, I.S.A., M.F. Mohamed, and Y.B. Mahdy, Data augmentation for skin lesion using self-attention based progressive generative adversarial network. Expert Systems with Applications, 2021. 165: p. 113922.

Tschandl, P., C. Rosendahl, and H. Kittler, The HAM10000 dataset, a large collection of multi-source dermatoscopic images of common pigmented skin lesions. Scientific data, 2018. 5: p. 180161.

Almaraz-Damian, J.-A., et al., Melanoma and nevus skin lesion classification using handcraft and deep learning feature fusion via mutual information measures. Entropy, 2020. 22(4): p. 484.

Huang, L., et al. Decorrelated batch normalization. in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2018.

Al-Asadi, M.A. and S. Tasdemır, Predict the value of football players using FIFA video game data and machine learning techniques. IEEE Access, 2022. 10: p. 22631-22645.

Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Selcuk University Journal of Engineering Sciences (SUJES) ISSN:2757-8828

Abstracting and indexing

Index Copernicus International


Selcuk university journal of engineering sciences (Online)

ICI World of Journals


Eurasian Scientific Journal Index