Classification of Meningioma, Glioma, and Pituitary Tumors Using a Hybrid Algorithm Based On the Decomposition Method in MRI Brain Images
Shirzad HR, Kazemi A, Saberi F, Sepahdoost M and Zeinalnezhad M
Published on: 2022-12-29
Abstract
Background: Brain tumor image classification is an important part of clinical diagnosis and effective treatment. Radiologists may also misclassify brain tumor types when handling large amounts of data with multiple classes. Since image classification techniques can effectively improve the process of tumor diagnosis and treatment.
Objective: This study introduces a new hybrid method based on machine learning and deep learning to classify brain tumors in MRI brain images.
Methods: In this article, the T1-weighted contrast-enhanced images of 233 patients with three kinds of brain tumor: meningioma (300 slices), glioma (300 slices), and pituitary tumor (300 slices) have been used. First, the histogram equalization method was used for contrast adjustment in the image pre-processing stage. In the processing step, many features were extracted using the Gray Level Co-occurrence Matrix (GLCM) method and Convolutional Neural Network (CNN) models such as AlexNet and GoogleNet. Also, Principal Component Analysis (PCA) technique was used to reduce the dimensions of the features. Finally, 210 main features (including 100 features from AlexNet, 100 features from GoogleNet, and 10 features from the co-occurrence matrix) were obtained using the PCA technique for the classification stage. In this step, new OVO and supervised machine learning algorithms were introduced to classify the images. The use of decomposition methods, especially One-Versus-One (OVO) methods, can increase classification accuracy by separating the data into binary subsets during classification. This factor inspired the development of a new algorithm in this article in order to characterize the tumor in MRI images. Finally, we use a k-fold cross-validation (with k=10) method to separate the training and test data to evaluate the capabilities of the algorithm.
Results: The obtained results show that the accuracy of the designed algorithm using the fit ensemble classifier with the best performance in OVO and multiclass classification methods over other classifiers in brain tumor images was 99.11% and 97.44%, with a sensitivity of 99.89% and 97.52%, a specificity of 98.33% and 98.74%, a precision of 98.41% and 97.44%, a F1_score of
99.13% and 97.44%, and an AUC of 99.67% and 99.0%, respectively.
Conclusion: The achieved results of brain tumor image classification using the hybrid techniques indicates a significant potential for the medical image of the brain and enhance the classification performance, especially on multiclass datasets.
Keywords
Brain tumor classification; MRI; Convolutional neural network; Gray level co-occurrence matrix; Principal component analysis method; OVO algorithmIntroduction
The brain with 100 billion nerve cells is one of the complex and vital organs [1]. Any irregularity within this organ can lead to threatening health problems. A brain tumor is an abnormal form of brain cells, which affects brain tissue and vary in size and type [2]. This is an imbalanced proliferation of brain cells, which can be divided into major or minor. Major cases arise from within cells in the brain, whereas minor cases arise from cells other than brain cells [3]. The three types of brain tumors are meningioma, glioma, and pituitary, with glioma being the most common. There are different ways to treat brain tumors depending on the tumor type, size, and location. Presently, the most common treatment for brain tumors is surgery as it has no side effects on the brain [4].
Different types of medical imaging modalities such as magnetic resonance imaging (MRI), positron emission tomography (PET), and computed tomography (CT) are used to observe the internal parts of the human body conditions. Among all these modalities, MRI is the most popular imaging modality that captures and preserves the highest quality brain images with rich information providing anatomical structure and internal brain contents about brain tumor position, shape, size, and type. Recently MRI images and computers use as a solution for diagnosis tasks and classification [5].
However, it is difficult for radiologists to identify, segment, and detection of infected areas in MRI images of brain tumors using clinical expertise and manual imaging methods. Also, due to the high complexity of brain tissues, it can be hectic, time-consuming, frustrating, and even prone to error due to the influx of patients [6,7]. On the other hand, it is very hard to gain insight into the abnormal structure of the human brain using simple imaging techniques. This makes the conventional and old methods ineffective in the absence of experts, and consequently, after their death, their abilities also disappear. For this reason, the traditional diagnosis of brain tissues and tumors is very time-consuming and depends on the conditions of the operator. In addition, the need for specialists to examine the images is essential to diagnose this problem
It is therefore necessary to create an automatic system that can estimate abnormal brain images with high accuracy. Based on this criterion, several automatic systems were developed to classify brain images. Automatic classification is a diagnostic technique that can act as a tool for helping doctors and radiologists with a computer-aided diagnosis (CAD) system development at brain MRI images. One of the important applications of classification is to identify the type of tumor in abnormal brain MRI images. Also, it can extract patterns for grouping different images into different categories using similar features. Common tumor detection techniques are based on a physician's observation with a high error rate. Therefore, the use of automatic methods will be very useful for the accurate examination of tumors.
This study presents a novel package for classifying brain tumors on MRI images. This package includes four main steps data preprocessing, feature extraction, dimension reduction, and classification. The histogram equalization method is used for data preprocessing. The Gray Level Co-occurrence Matrix (GLCM) method and networks of GoogleNet and AlexNet are considered for feature extraction, while Principal Component Analysis (PCA) method is used to reduce the dimension in extracted features. Finally, supervised machine learning classifiers have been used to classify brain tumors of meningioma, glioma, and pituitary tumors. The main sections of the article are arranged: in section 2, the authors reviewed the previous studies. Section 3 introduces the proposed methods. Evaluation and experimental results, along with a discussion, are shown in section 4, and eventually, the conclusion is expressed in section 5.
Literature Review
Image classification algorithms based on machine learning (ML) include supervised and unsupervised methods. Some of the supervised methods include classifiers of Decision Tree )DT), Linear discriminant analysis (LDA), Logistic Regression (LR), Naïve Bayes (NB), Support vector machine (SVM), k-nearest neighbor (K-NN), Fit Ensemble, Support vector machine -Radial Basis Function (SVM-RBF) etc. [8]. These methods have been used for the classification and identification of medical fields, as well as new software analysis tools. We also see the widespread use of artificial neural networks (ANNs) as supervised methods for various classification cases in numerous articles. Some of these methods include Radial Basis Functions (RBF), Probabilistic Neural Network (PNN), Back Propagation Neural Network (BPNN), and General regression neural network (GRNN) [8]. Various techniques have been proposed for the automatic classification of brain MRI images based on traditional ML and ANN methods as shown in Table 1. Table 1 shows a summary of these algorithms in brain tumor classification in MRI images. In this table, a summary of different feature extraction methods and classification algorithms is displayed.
Table 1: Work Associated with Classification of Brain Tumors.
|
Ref. |
Year |
Author |
Feature Extraction Method |
Classification Method |
Objective |
|
[9] |
2017 |
M. Arunachalam and S. Royappan Savarimuthu |
Gabor, GLCM, and discrete wavelet transform (DWT) |
Feed-forward back propagation neural network |
Classification of brain MRI into normal and abnormal |
|
[10] |
2017 |
Paul et al. |
CNN |
Fully connected and CNN |
Brain tumor classification |
|
[11] |
2018 |
N. Varuna Shree and T. Kumar |
Gray level co- occurrence matrix |
PNN |
Classification of brain MRI into normal and abnormal |
|
[12] |
2018 |
B. Ural |
k-mean with fuzzy c- mean (KMFCM) |
PNN |
Brain tumor detection |
|
[13] |
2019 |
Rajan, PG Sundar, C |
Adaptive Gray-Level Co-Occurrence Matrix (AGLCM) |
SVM |
Detection and Segmentation of Brain Tumor |
|
[14] |
2019 |
S. Preethi and P. Aishwarya |
Wavelet + GLCM |
Deep neural network (DNN) |
Classification of tumor and non- tumor image |
|
[15] |
2019 |
Deepak and Ameer |
GoogleNet |
Deep transfer learning |
Classification of glioma, meningioma, and pituitary tumors |
|
[16] |
2019 |
Saxena et al. |
CNN |
CNN networks with transfer learning |
Binary classification of brain tumors into normal and abnormal |
|
[17] |
2019 |
Hemanth et al. |
CNN |
CNN |
MR brain image classification into normal and abnormal (Binary) |
|
[18] |
2019 |
Das et al. |
CNN |
CNN |
Classification (Glioma, Meningioma, and Pituitary) |
|
[19] |
2020 |
Ullah et al. |
DWT |
Feed-forward neural network |
Classification of brain MRI into normal and abnormal |
|
[20] |
2021 |
Francisco et al. |
Convolutional Neural Network (CNN) |
Multi-pathway CNN |
Brain tumor classification |
|
[21] |
2021 |
J. Kang et al. |
Pre-trained CNN networks |
ML classifiers |
Multi-classification |
In order to differentiate between normal and abnormal brain tumors in brain MR images, Arunachalam and Savarimuthu [9] suggested a model. They included feature extraction, augmentation, transformation, and classification in their suggested model. First, they used shift- invariant shearlet transform to improve the brain MR image (SIST). The features were then retrieved using the Gabor filter, the discrete wavelet transform (DWT), and the gray level co- occurrence matrix (GLCM). Finally, feed-forward backpropagation neural network was used to acquire a high accuracy rate (99.8%) using these extracted features. Deep learning techniques have been extensively used for brain MRI categorization over the past ten years. The feature extraction and classification stages are integrated into self-learning in the deep learning approach, therefore they do not need to be carefully extracted features manually [10].
Normal and abnormal brain MRI images were categorized into two classes by Shree and Kumar. They applied a probabilistic neural network (PNN) classifier and used GLCM for feature extraction that achieved 95% accuracy [11]. Berkan Ural [12] first applied various image processing methods to improve brain MRI. Additionally, various segmentation techniques have been combined to improve the performance of the system. The PNN approach is additionally used to identify and localize the tumor region in the brain. Their suggested method has a respectable accuracy rate (90%) and a relatively short processing time. A hybrid energy-efficient technique for automatic tumor segmentation and detection was proposed by Rajan and Sundar. Their suggested approach involves seven lengthy phases and a purported 98% accuracy. Their suggested model workflow is its lengthy computation time caused by the employment of various approaches [13].
Preethi and Aishwarya [14] created a strategy to categorize different stages of brain tumors. To create the feature matrix, they combined the GLCM and the wavelet-based methods. With the help of the oppositional flower pollination algorithm (OFPA), the extracted features were further decreased. In order to classify the MR brain images using the chosen features, a deep neural network is used in the end and achieved 92% accuracy. Deepak and Ameer [15] classified three different types of brain tumors with 98% accuracy by using a pre-trained GoogleNet to extract features from brain MR images. To classify data on brain tumor images, Saxena [16] combined transfer learning techniques with Inception V3, ResNet-50, and VGG-16 models. The ResNet-50 model had a 95% accuracy rate, which was the highest.
Several academics have recently confirmed their proposed methods using brain tumor classification datasets and CNNs for brain MRI classification [17,18]. In 2019, Thejaswiniet [22] proposed a model for classifying and detecting brain tumors that combines the Regularized Kernel-based Fuzzy C-Means Clustering (ARKFCM) segmentation method with SVM for feature extraction. Ullah [19] used DWT to extract the level-3 decomposition's approximation and detail coefficients, lowered the coefficient using color moments (CM), and then used a feed- forward ANN to identify between abnormal and normal brain MR images. A multi-pathway CNN design was presented by Francisco [20] for the automatic segmentation of brain tumors like pituitary, meningiomas, and gliomas tumors. Using a T1-weighted, contrast-enhanced MRI dataset that was made accessible to the public, they tested their suggested model and achieved 97.3% accuracy. Their training process is, nevertheless, rather expensive.
Using multiclass datasets, Kang [21] created a CNN-based model for classifying brain cancers. The accuracy is poor (93.72%), though, and the computation time is excessive. For accurate brain tumor MRI classification, Mirza Mumtaz Zahoor and Saddam Hussain Khan [23] suggested a deep residual and regional-based Res-BRNet CNN. Within the redesigned spatial and residual blocks, the developed Res-BRNet applied regional and boundary-based operations in a systematic order. With the help of patterns and edge-related properties, spatial blocks extract the boundary, heterogeneity, and homogeneity of the brain tumor. Furthermore, the remaining blocks effectively represent regional and global changes in brain tumor texture.
Hafiza Akter Munira and Saiful Islam [24] introduced hybrid deep learning methods for CNN and ML classifiers to categorize brain cancers. For the purpose of extracting deep brain features from MRI data, a new 23-layer CNN architecture is created. Then, the CNN model's flattened layer's extracted in-depth features are assessed using SVM and RF classifiers. On multi-class brain MRI datasets, this study applies tuned Inception V3, CNN-SVM, CNN-RF, and CNN deep learning models. For the classification of brain tumors, Majdi Alnowami [25] proposed utilizing densely connected convolutional networks (DenseNets) in 2022. The model is made up of four dense blocks and contains 58 layers. The study has demonstrated that: 1) implementing an intensity normalization preprocessing step significantly improves the performance of the classification model, and 2) the normalization method's effect on classification accuracy.
One of the problems in using the mentioned algorithms is that some of these algorithms are not used directly to classify several classes. Because many real-world applications rely on a large number of parameters and classes, multiclass and multivariate classification issues are crucial to recent advances in machine learning. There are a lot of unresolved difficulties as well, including problems with optimization, overfitting, and high dimensionality [26]. One of the solutions proposed in recent years to solve this problem is the use of group methods with binary techniques such as the decomposition method. One of the analysis methods is the One-Versus-One (OVO) method, in which the data with several classes are divided into the maximum binary subset of classes, and the classification of the subsets is done by binary algorithms, and finally, the output of the classifications of the final class is combined. The use of binary subsets of classes can improve classification in two ways. First, it reduces complexity in classification. Because in multi- class classification, there is more complexity due to the boundaries between classes, and in the second case, it increases the accuracy of the classification in many cases. A classification algorithm provides the most accuracy in the classification, but this does not mean that it can provide the highest accuracy in every subset of classes related to the problem. The combination of pre- processing methods, feature extraction, dimensionality reduction, and classifications has led to the emergence of various algorithms in the field of classification
The Proposed Method
The real data in the field of classification can be objectively used as a suitable tool for checking the efficiency of different algorithms. In this study, in order to create a reliable database, a number of MRI images are collected from the website of Guangzhou Southern Medical University, China. This dataset contains 3064, T1-weighted images collected from 233 patients. Three types of tumors including meningioma, glioma, and pituitary tumor have been identified in these images and thereby three classes have been determined for the images. Due to the limitation of available computer hardware including CPU, GPU, and Ram in using the classification algorithms, 300 images have been selected from each tumor class in MRI images. Also, the size of all images is 227×227 [27]. Therefore, this used dataset includes 900 brain MRI images. The method proposed in this study leads to the diagnosis of brain tumors in MRI images. Based on the steps shown in Figure 1, the suggested algorithm includes four stages pre-processing, feature extraction, dimension reduction, and classification.

Figure 1: The proposed model.
In this research, the histogram equalization method for pre-processing stage, convolution neural network (AlexNet and GoogleNet models) and GLCM for the feature extraction stage, and PCA method for dimensional reduction are used. Finally, a combination of the OVO method and supervised machine learning algorithms was used to classify brain MRI images. At this stage, the classes are separated in binary and therefore classified. Also, in order to divide the images into training and finally test data, the k-fold cross-validation method has been used, in which k is considered equal to 10. The reason for choosing this value is that k=10 in cross-validation has less bias and variance, which is usually recommended [28]. Based on this method, the data are divided into ten separate subsets, in each iteration, one subset of data is considered to test data and nine subsets are considered training data. Finally, the average error of all 10 folds is considered the final classification error. Each step of the proposed method is explained here:
Histogram Equalization
Many imaging applications, including medical image analysis, remote sensing, professional photography, and display technologies, depend on contrast enhancement. Contrast enhancement has been achieved using a variety of image processing methods. Due to its efficiency and simplicity, histogram equalization (HE) is a well-known technique for image improvement [29]. In order to flatten and extend the dynamic range of the histogram, HE first determines the frequency of each intensity value in an image using the probability density function. Then, this method re-distributes the intensity values of pixels in the image. The output image that is produced has a more even distribution of intensities, greatly enhancing the image contrast [30].
Therefore, by equalizing the histogram, the brightness of the background and textures are adjusted and the processing is done correctly. This algorithm makes it possible that if the devices are different and the images are produced with different brightness levels, this will not affect the efficiency of the algorithm and the images will be delivered to the processor algorithm in the same way. Histogram equalization is calculated for each pixel by equation (1).
![]()
where h(v) is the value of the histogram, cdf(v) is the value of the cumulative distribution function associated with pixel v, cdfmin is the minimum value of the cumulative distribution function, w is the image width, h is the image height, and L is the value of the gray level used and, in most cases, equals 256.
Feature Extraction
The process of feature extraction starts with feature selection. The complexity and effectiveness of the analysis and pattern categorization procedure will largely depend on the features that are chosen. The features are initially chosen based on the specifications of the application and the developer's expertise [31]. In order to extract features, a combination of a traditional technique GLCM and two pre-trained ANN models named GoogleNet and AlexNet have been used.
Gray Level Co-occurrence Matrix (GLCM)
GLCM is a matrix that contains information pertinent to the relationship between the values of adjacent pixels in an image and the number of rows and columns which is equal to the number of gray levels in the image. This means that if the number of gray levels of an image is G, then the dimensions of the desired GLCM matrix are equal to a G×G matrix [32]. Features such as energy, correlation, contrast, etc. are extracted from this matrix, which is found to have a significant impact on image classification and recognition. For instance, cancerous glands have a contrasting color compared to their surroundings. This feature is a good choice in describing the variation of the gray level around the desired pixel. At this stage, applying the co-occurrence matrix technique, 22 features are extracted from brain MRI images, and those features along with the relevant formula are shown in Table 2.
Table 2: Descriptions related to extracted features from the GLCM.

p(i, j): (i,j)th entry in a normalized gray-tones spatial-dependence matrix, px(i): ith entry in a marginal-probability matrix obtained by summing the rows of
means of px, uy: means of
standard deviations of
standard deviations of py ng: number of distinct gray levels in the quantized image, c(i, j): the co-occurrence probability between grey levels i and j j, G: GLCM matrix ![]()
Finally, the output of features extracted using the GLCM method according to the 900 studied images, includes a matrix with 900 rows and 22 columns.
Convolutional Neural Networks (CNNs)
Deep learning is a set of techniques based on neural networks that allow us to learn features automatically from input data [33]. In deep learning, CNNs are considered a class of ANNs that are mostly used for image analysis and to extract features and better accuracy in the classification. The purpose of designing CNNs is to accurately model the function of the human visual system and its connection with the visual part of the brain, which is responsible for automatically extracting key features, learning images, and removing possible additional features. A CNN consists of numerous convolutional and pooling layers. The input layer of a CNN is usually an image with arbitrary dimensions and its output layer is a feature vector with high resolution and corresponds to different classes. The hidden layers in this network are composed of the convolution layer, pooling layer, and full connection layer [34]. The structure of CNN is shown in Figure 2.

Figure 2: Simple structure of CNN.
It should be noted that convolutional layers perform very well in finding features in images. If some of these layers are placed one after the other, they learn a hierarchy of non-linear features. That is, in the initial layers of the network, parts of the image such as corners, lines, and edges are learned, and in the later stages, the features of higher levels (Figure 3) are learned. For instance, if the input image is an image of a person's face; features such as eyes, nose, cheeks, and face are learned in the higher layers. Finally, the final layers of the generated features are also used for classification.

Figure 3: Learning stages and feature extraction in CNN [35].
Two models of CNN models, which are used in the proposed algorithm in this study, are the AlexNet and GoogleNet models. The AlexNet model was introduced in 2012 to train and classify ImageNet data in an article entitled "ImageNet classification using deep CNNs". ImageNet contains 1.2 million 256x256 images divided into 1000 subsets. According to Figure 4, There are eight learnable layers in the AlexNet. The Rectified Linear Unit (ReLU) activation is used in each of the five levels of the model, with the exception of the output layer, which uses max pooling followed by three fully connected layers. They discovered that the training process was nearly six times faster when the ReLU was used as an activation function. Additionally, they made use of dropout layers, which stopped their model from overfitting. The database used by the AlexNet network includes 15 million photos in 22 thousand groups [36].

Figure 4: AlexNet CNN [36].
In 2014, Szegedy et al. introduced GoogleNet in an article entitled "Going deeper with convolutions", which is deeper and more complex than the AlexNet model, and in fact, it is the deepest network among common models. The network consists of 22 layers and the error rate of this network is 6.7%. For the first time, the structure of this network deviated from the common structures of CNNs that predicted convolution and pooling layers on top of each other as a sequential structure. In this model, the authors introduced a new concept called inception. Each inception consists of six convolution layers and one pooling layer. According to Figure 5, it can be seen that the GoogleNet model includes two convolution layers, three pooling layers, and nine inception layers. In comparison to shallower and less wide networks, this method's key benefit is a large quality gain with a relatively small increase in processing needs. A further indication of the power of the Inception architecture is the fact that detection work was competitive despite not using context nor performing bounding box regression [37].

Figure 5: GoogleNet Convolution Neural Network [37].
Ensemble learning combines several deep learning methods. This is carried out to improve predictions, classifications, or other deep learning model functions. The combined functionality of various deep learning models could be used in ensemble learning to construct new models. Compared to training a new model from scratch, it has advantages including saving time, using less computing power, having a model with higher accuracy, and more capabilities [37]. Because of the several benefits of ensemble learning, in this research, GoogleNet and AlexNet networks have been used to extract image features using pre-trained CNN methods. Through the AlexNet network, we will have 4096 features, the output of which will be a matrix with 900 rows and 4096 columns. Furthermore, 1000 features are extracted through the GoogleNet network, the output of which will be a matrix with 900 rows and 100 columns. Ultimately, from the feature extraction techniques introduced in advance, we have three feature matrices including the GLCM feature matrix with 900 rows and 22 columns, the AlexNet feature matrix with 900 rows and 4096 columns, and last the GoogleNet feature matrix with 900 rows and 1000 columns.
Dimension Reduction
It is quite obvious that the vectors extracted from the AlexNet and GoogleNet methods as features are quite large and thereby increase the amount of complexity in the calculations and accordingly increase the calculation time in the classification, and to sum up, generally, all these features are not used in the classification. In essence, there is no need to use all these features. As an example, to better understand this issue, imagine if we are looking to find the face of a certain person among several images with a blue background, the features of the face are considered, not the background of the image. The use of dimension reduction methods makes it possible to select the optimal and effective features that have achieved the highest percentage of correct detection in the samples (correct classification) from the obtained features [38]. Among these methods, the PCA method is used in this study because of the small amount of data displayed. In the mathematical definition, PCA is an orthogonal linear transformation that places the data into a new coordinate system. So that the largest data variance is placed on the first coordinate axis, the second largest data variance is placed on the second coordinate axis, and the same is done for the rest of the data. PCA retains the components of the dataset that contribute the most variance. One of the advantages of the principal components analysis method compared with the components analysis method is to reduce the computational complexity and find special vectors with more influence [39]. By using the PCA method to reduce the number of features, in order to reduce the complexity in calculations, decrease the calculation time and optimal use of the effective features, from the number of 22 features obtained by the GLCM technique, 4096 features obtained using AlexNet and 1000 features obtained from GoogleNet, the number of 10, 100, and 100 features are obtained, respectively. That is, reducing the dimensions using the PCA method which leads to a reduction in the number of image features from 5118 to 210 new features. Therefore, by reducing the features of the previous matrices and aggregating the remaining features, we come by a feature matrix with 900 rows and 210 columns.
Classification
OVO Algorithm
Recently, many classification methods have been developed for MRI images and consequently, several major shortcomings can be identified in these methods, based on the revision of these algorithms. In the first case, it can be pointed out that these algorithms are not used in other fields. This is because classification algorithms may perform well in one data series and not in other data. In the next case, we can mention the lack of classification algorithms to classify several classes. Many classification algorithms can only be used for binary classification, which is the major weakness. The use of binary subsets of classes from two aspects can lead to improving classifications. First, it reduces the complexity of the classification. Because, the multi-class classification has more complexity due to the boundaries between classes, and in the second case, usually increases the accuracy of classification. In order to reduce these drawbacks, a new combination of the OVO method is introduced in this article. This algorithm is summarized in four main steps, in the first step, the classes are binary-separated and classified using a classification algorithm. Then the prediction operation on the classes is performed for each pair of classes by a classifier in this step. For example, for 3 classes “a”, “b”, and “c”, “a” decision is made once between “a” and “b”, once between “b” and “c”, and finally once between a and c, and the group that gets the most votes is selected [40].
Machine Learning Classifiers
Finally, for the training of the prediction model, the classes of brain tumors are fed into a variety of supervised classifiers, such as decision tree, discriminant analysis, logistic regression, naïve Bayesian, support vector machine (SVM), k-nearest neighbor (KNN), fit ensemble, and neural network [8]. These supervised methods are frequently used to classify classes in pattern recognition. Other evaluation criteria for classification algorithms include holdout and random sampling techniques, cross-validation, bootstrapping techniques, receiver operating characteristic (ROC), or statistical significance tests for the accurate evaluation of classifiers using random sample data. In order to evaluate the effectiveness of the used classifiers, we used the k-fold cross- validation method and the confusion matrix to calculate accuracy, true positive rate (sensitivity), true negative rate (specificity), precision, F1_score, and area under the curve (AUC) in Receiver operating characteristic (ROC). The testing and training phases of the k-fold cross-validation procedure are repeated k times. Finally, the mean of all accuracy estimates is taken into account as final accuracy after the K times of testing and training. Due to the very low bias and variance, we also employed 10-fold cross-validation (k=10) to measure accuracy. A helpful visual tool for contrasting classifiers is the ROC curve. The accuracy of the classifier is sometimes given as the area under the ROC curve [31]. Considering that 3 categories of 300 brain MRI images (meningioma, glioma, and pituitary tumor) were used in this research, therefore, decision-making for each image is divided into three subsets as follows:
- Decide between groups 1 and 2
- Decide between groups 1 and 3
- Decide between groups 2 and 3
The group that gets at least two votes is given as the output of the algorithm and the label and class of the image. It is worth mentioning that the separation of training and test data is done using the k-fold cross-validation method. The classes are separated in binary form and for each binary class, the training process is performed using machine learning classification algorithms. The number of binary classes is obtained using the formula
where n is the number of classes. Accordingly, the number of times the training process is performed is the same. For example, if a data has 3 classes, the training process is performed 3 times in total, which includes binary classes (1,2), (1,3), and (2,3).
In this method, the images are divided into three groups of 300 images. Each group is used twice for training and once for testing. In this way, every time 600 images are used as training and 300 images as testing. In step 1, images 1 to 600 are used for training and 601 to 900 for testing. In the next step, images 1 to 300 and 601 to 900 are used for training and 301 to 600 for testing, and in the last step, images 301 to 900 are used for training and 1 to 300 for testing. In order to divide the frequency of the images of different tumors, it is equally divided into groups. That is, each group of 300 includes 100 of each tumor.
Results And Discussion
In this section, we evaluated the accuracy of tumor classification using the existing algorithms of the Matlab software library. The classifiers that we employ to deal with the three defined classes also enter data using the vector of extracted features. To evaluate the effectiveness of classifiers, the extracted feature vectors were trained using the OVO method. To achieve the best results, The parameters of machine learning classification algorithms are set optimally for hyperparameters. This algorithm used three phases to classify MRI images: in the first phase, 22 features were extracted using the GLCM technique, and then were extracted 4096 and 1000 features using CNN models including AlexNet and GoogleNet, respectively. In the second phase, due to the creation of many features by feature extraction methods, the PCA technique was used to reduce the dimensions of the features, which ultimately resulted in a total of 210 main features being obtained (100 features from AlexNet, 100 features from Google Net and 10 features from the GLCM), which created a matrix with 210 features and 900 rows. In the third phase, new OVO method and machine learning algorithms were introduced for the classification task. The OVO algorithm consists of two phases. In the first phase, the data class was divided into binary subsets, and in the second phase, machine learning classification algorithms were used to classify each binary subset.
In this study, as a means to divide the data into training and test data, the K-fold cross-validation method was used. The results were calculated for classifying brain tumor images using pattern recognition algorithms and confusion matrices pertinent to the 10-fold cross-validation approach is shown in Table 3. The obtained results show that the accuracy of the designed algorithm using the fit ensemble classifier with the best performance in OVO and multi-class methods over other classifiers in brain tumor images was 99.11% and 97.44%, a sensitivity of 99.89% and 97.52%, a specificity of 98.33% and 98.74%, a precision of 98.41% and 97.44%, a F1_score of 99.13% and 97.44%, and an AUC of 99.67% and 99.0%, respectively. According to the obtained results, it can be stated that the performance of the OVO method has performed better than multi-class classification. Also, the results show that the features extracted from the images of the pituitary tumor class can well provide the characteristics of this tumor in the brain tumor images, which in many classifiers such as decision tree, naïve Bayesian, SVM, K-NN, and fit ensemble with 100% accuracy can differentiate from the other two classes. Table 4 also evaluates the outcomes of the related works by the evaluation metrics in detail. Some values are placed blank due to the authors did not calculate them. Also, in Table 4 given that the used database by the authors is similar to our work, the results are comparable and considerable.
Thejaswiniet [22] proposed a model for brain tumor classification using the ARKFCM segmentation method with SVM and the accuracy obtained was equal to 91.4%. Mirza Mumtaz Zahoor and Saddam Hussain Khan [23] suggested a Res-BRNet CNN based on the redesigned spatial and residual blocks that applied regional and boundary-based operations in a systematic order. With the help of patterns and edge-related properties, spatial blocks extract the boundary, heterogeneity, and homogeneity of the brain tumor. The accuracy obtained in this method was equal to 98.22%. Hafiza Akter Munira and Saiful Islam [24] introduced hybrid deep learning methods for CNN and a random forest classifier to classify brain cancers in MRI images. The accuracy obtained in this work was equal to 96.52%. Majdi Alnowami [25] proposed utilizing DenseNets for the classification of brain tumors. The model is made up of four dense blocks and contains 58 layers. The study has demonstrated that: 1) implementing an intensity normalization preprocessing step, and 2) the normalization method's effect on classification accuracy. The evaluation of this proposed model was achieved using the brain MRI image dataset with an accuracy of 96.52%. Our proposed algorithm was evaluated with OVO and multi-class methods, and the best results were obtained using OVO method with 99.11% accuracy. This algorithm increased the accuracy and reduced the complexity of the classification. Also, the new classification algorithm can be used in other data that have several classes and obtain high classification accuracy.
Table 3: Accuracy of Designed Classification Algorithms with 10-Fold Cross-Validation to the Combined Algorithm Data + Principal Component Analysis in MATLAB software.
| Classification Algorithm | Evaluation Criteria (%) a | |||||||||
| Class | Accuracy | Sensitivity | Specificity | Precision | F1_score | AUC | ||||
| meningioma | 96.5 | 99.33 | 93.67 | 94.01 | 96.6 | 95 | ||||
| vs. glioma | ||||||||||
| Decision Tree | meningioma | 100 | 100 | 100 | 100 | 100 | 100 | |||
| vs. pituitary | ||||||||||
| glioma vs. | 100 | 100 | 100 | 100 | 100 | 100 | ||||
| pituitary | ||||||||||
| Multiclass | 97.22 | 97.32 | 98.64 | 97.22 | 97.22 | 97 | ||||
| meningioma | 89.67 | 92.33 | 87 | 87.66 | 89.94 | 94 | ||||
| vs. glioma | ||||||||||
| Discriminant Analysis | meningioma | 96.67 | 98.67 | 94.67 | 94.87 | 96.73 | 99 | |||
| vs. pituitary | ||||||||||
| glioma vs. | 90.67 | 91.67 | 89.67 | 89.87 | 90.76 | 96 | ||||
| pituitary | ||||||||||
| Multiclass | 85.11 | 85.42 | 92.56 | 85.11 | 85.17 | 96 | ||||
| meningioma | 86 | 85.33 | 86.67 | 86.49 | 85.91 | 90 | ||||
| Logistic Regression | vs. glioma | |||||||||
| meningioma | 94.67 | 96 | 93.33 | 93.51 | 94.74 | 98 | ||||
| vs. pituitary | ||||||||||
| glioma vs. | 84.83 | 84.33 | 85.33 | 85.19 | 84.76 | 90 | ||||
| pituitary | ||||||||||
| Multiclass | --- | --- | --- | --- | --- | --- | ||||
| meningioma | 92.2 | 93.67 | 90.67 | 90.94 | 92.28 | 97 | ||||
| vs. glioma | ||||||||||
| Naïve Bayesian | meningioma vs. pituitary | 100 | 100 | 100 | 100 | 100 | 100 | |||
| glioma vs. | 98.67 | 98 | 99.33 | 99.32 | 98.66 | 100 | ||||
| pituitary | ||||||||||
| Multiclass | 94.89 | 94.88 | 97.45 | 94.89 | 94.88 | 98 | ||||
| meningioma | 97 | 97.67 | 96.33 | 96.38 | 97.02 | 98 | ||||
| vs. glioma | ||||||||||
| SVM | meningioma | 96.33 | 96.33 | 96.33 | 96.33 | 96.33 | 99 | |||
| vs. pituitary | ||||||||||
| glioma vs. pituitary | 100 | 100 | 100 | 100 | 100 | 100 | ||||
| Multiclass | 97.89 | 97.89 | 98.95 | 97.89 | 97.89 | 99 | ||||
| K-NN (K=6, | meningioma | 96.33 | 97.33 | 95.33 | 95.42 | 96.37 | 98 | |||
| Distance = Minkowski (cubic)) | vs. glioma | |||||||||
| meningioma | 100 | 100 | 100 | 100 | 100 | 100 | ||||
| vs. pituitary | ||||||||||
| glioma vs. | 100 | 100 | 100 | 100 | 100 | 100 | ||||
| pituitary | ||||||||||
| Multiclass | 95.11 | 95.13 | 97.56 | 95.11 | 95.12 | 94 | ||||
| meningioma vs. glioma | 97.33 | 99.67 | 95 | 95.22 | 97.39 | 99 | ||||
| meningioma vs. pituitary | 100 | 100 | 100 | 100 | 100 | 100 | ||||
| Fit Ensemble | glioma vs. pituitary | 100 | 100 | 100 | 100 | 100 | 100 | |||
| Multiclass | 97.44 | 97.52 | 98.74 | 97.44 | 97.44 | 99 | ||||
| meningioma | 95.67 | 97.67 | 93.67 | 93.91 | 95.75 | 99 | ||||
| vs. glioma | ||||||||||
| Neural Network | meningioma | 96.5 | 97.67 | 95.33 | 95.44 | 96.54 | 99 | |||
| vs. pituitary | ||||||||||
| glioma vs. | 97.5 | 95.33 | 99.67 | 99.65 | 97.44 | 100 | ||||
| pituitary | ||||||||||
| Multiclass | 90.67 | 90.62 | 95.4 | 90.67 | 90.59 | 99 | ||||
| a. Accuracy = (TP+TN)/(P+N), Sensitivity = TP/P, Specificity = TN/N, Precision = TP/(TP+FP), F1_score = 2TP/(2TP+FP+FN), and AUC = Area Under the ROC Curve that indicates a trade-off between True Positive (TPR = TP / P) and False Positive (FPR = FP / N) that TP=True Positive Samples, TN=True Negative Samples, FP=False Positive Samples, FN= False Negative Samples, P= Positive Samples, and N= Negative Samples. | ||||||||||
Table 4: Comparison of our proposed work and related studies in terms of performance metrics.
|
Author and year |
The Feature Extraction method and Classification Algorithm |
Evaluation metrics (%) |
|||||||
|
Accuracy |
Sensitivity |
Specificity |
Precision |
F1_score |
AUC |
||||
|
P. Thejaswiniet et al., 2019 [22] |
The Regularized Kernel-based Fuzzy C- Means + SVM |
91.4 |
98 |
78 |
--- |
--- |
--- |
||
|
Mirza Mumtaz Zahoor and Saddam Hussain Khan, 2022 [23] |
Res-BRNet |
98.22 |
98.11 |
--- |
98.22 |
96.41 |
--- |
||
|
Hafiza Akter Munira and Saiful Islam, 2022 [24] |
Random Forest + CNN |
96.52 |
--- |
--- |
--- |
--- |
--- |
||
|
Majdi Alnowami et al., 2022 [25] |
DenseNets |
96.52 |
98.5 |
82.1 |
--- |
--- |
--- |
||
|
Our Proposed Method |
GLCM + GoogleNet + AlexNet |
Fit Ensemble |
OVO |
99.11 |
99.89 |
98.33 |
98.41 |
99.13 |
99.67 |
|
Multi-class |
97.44 |
97.52 |
98.74 |
97.44 |
97.44 |
99.0 |
|||
Conclusion
Early brain tumor diagnosis is essential to cure the patient. Therefore, in this study, we use a hybrid algorithm based on the decomposition method in brain MRI scans for meningioma, glioma, and pituitary tumor classification. In image processing methods to detect a specific subject in the image, whether it is a disease, a specific object, or a specific face, the most important issue is to find effective features and appropriate classification. Our model uses the combination of the features extracted using AlexNet and GoogleNet networks and GLCM in the feature extraction stage and finally the OVO method and machine learning classifiers for brain tumor classification. Also, the PCA dimension reduction method is used to select the more effective features in order to reduce the processing time and use the rich information in decision-making. Using the OVO method (with an accuracy of 99.11%) strengthens the classification method and its resistance to noise in the information because each classification is conducted in three stages and is based on the majority vote. Finally, the effectiveness of the suggested algorithm indicates a strong potential for disease evaluations based on medical images
References
- Louis D, Perry A, Reifenberger G, Von DA, Figarella-Branger D, Cavenee WK, et al. The 2016 World Health Organization classification of tumors of the central nervous system: a summary. Acta Neuropathol. 2016; 131: 803-820.
- Rouse C, Gittleman H, Ostrom QT, Kruchko C and Barnholtz-Sloan JS. Years of potential life lost for brain and CNS tumors relative to other cancers in adults in the United States, 2010. Neuro-oncology. 2015; 18: 70-77.
- Gopal ST, Biswas M, Omprakash GK, Tiwari A, Harman SS, Monica, et al. A review on a deep learning perspective in brain cancer classification. Cancers. 2019; 11: 111.
- Mehrotra R, Ansari M, Agrawal R and Anand R. A transfer learning approach for AI- based classification of brain tumors. Machine Learning with Applications. 2020; 2: 100003.
- Goyal M, Knackstedt T, Yan S and Hassanpour S. Artificial intelligence-based image classification methods for diagnosis of skin cancer: Challenges and opportunities. Computers in Biology and Medicine.2020; 127: 104065.
- Pereira S, Pinto A, Alves V and Silva CA. Brain tumor segmentation using convolutional neural networks in MRI images. IEEE transactions on medical imaging. 2016; 35: 1240-1251.
- Popuri K, Cobzas D, Murtha A and Jägersand M. 3D variational brain tumor segmentation using Dirichlet priors on a clustered feature set. International journal of computer assisted radiology and surgery. 2012; 7: 493-506.
- Haykin S.Neural networks and learning machines, 3/E. Pearson Education 2009.
- Arunachalam M and Royappan SS. An efficient and automatic glioblastoma brain tumor detection using shift?invariant shearlet transform and neural networks. International Journal of Imaging Systems and Technology. 2017; 27: 216-226.
- Paul JS, Plassard AJ, Landman BA and Fabbri D. Deep learning for brain tumor classification, in Medical Imaging, Biomedical Applications in Molecular. Structural and Functional Imaging. 2017; 10137: 253-268.
- Varuna SN and Kumar T. Identification and classification of brain tumor MRI images with feature extraction using DWT and probabilistic neural network, Brain informatics. 2018; 5: 23-30.
- Ural B. A computer-based brain tumor detection approach with advanced image processing and probabilistic neural network methods. Journal of Medical and Biological Engineering. 2018; 38: 867-879.
- Rajan P and Sundar C. Brain tumor detection and segmentation by intensity adjustment, Journal of medical systems. 2019; 43: 1-13.
- Preethi S and Aishwarya P. Combining wavelet texture features and deep neural network for tumor detection and segmentation over MRI. Journal of Intelligent Systems. 2019; 28: 571-588.
- Deepak S and Ameer P. Brain tumor classification using deep CNN features via transfer learning. Computers in biology and medicine. 2019; 111: 103345.
- Saxena P, Maheshwari A and Maheshwari S. Predictive modeling of brain tumor: a deep learning approach, in Innovations in computational intelligence and computer vision. Springer, 2021: 275-285.
- Hemanth DJ, Anitha J, Naaji A, Geman O, Popescu DE and Son LH. A modified deep convolutional neural network for abnormal brain image classification. IEEE Access. 2018; 7: 4275-4283.
- Das S, Aranya ORR and Labiba NN. Brain tumor classification using convolutional neural network, in 2019 1st International Conference on Advances in Science. Engineering and Robotics Technology (ICASERT), IEEE. 2019.
- Ullah Z, Farooq MU, Lee SH and An D. A hybrid image enhancement based brain MRI images classification technique. Medical hypotheses. 2020; 143: 109922.
- Díaz-Pernas FJ, Martínez-Zarzuela M, Antón-Rodríguez M and González-Ortega D. A deep learning approach for brain tumor classification and segmentation using a multiscale convolutional neural network, in Healthcare. 2021; 9: 153.
- Kang J, Ullah Z and Gwak J. MRI-Based Brain Tumor Classification Using Ensemble of Deep Features and Machine Learning Sensors. 2021; 21: 2222.
- Thejaswini P, Bhat MB and Prakash MK. Detection and classification of tumour in brain Int J Eng Manufact (IJEM). 2019; 9: 11-20.
- Mumtaz ZM and Hussain Brain Tumor MRI Classification using a Novel Deep Residual and Regional CNN. 2211.16571, 2022.
- Munira HA, Islam MS, Boulila W, Ammar A, Samma H, Yafooz WMS, et al. Hybrid Deep Learning Models for Multi-classification of Tumour from Brain 2022; 24: 799.
- Alnowami M, Taha E, Alsebaeai S, Anwar SM and Alhawsawi A. MR image normalization dilemma and the accuracy of brain tumor classification model. J Radiation Research and Applied Sci. 2022; 15: 33-39.
- Cruz EAS and Franco CHE. Challenges of multivariable and multiclass classification problems.
- Cheng. Brain tumor dataset, figshare, Dataset.2017.
- Han J, Pei J and Tong H. Data mining: concepts and techniques. Morgan kaufmann.
- Lu L, Zhou Y, Panetta K and Agaian S. Comparative study of histogram equalization algorithms for image enhancement, Mobile Multimedia/Image Processing, Security, and Applications. 2010; 7708: 337-347.
- Chen SD and Ramli AR. Minimum mean brightness error bi-histogram equalization in contrast enhancement. IEEE transactions on Consumer Electronics. 2003; 4: 1310-1319.
- Umbaugh SE. Digital image processing and analysis: human and computer vision applications with CVIPtools. CRC press. 2010.
- Sebastian VB, Unnikrishnan A and K. Balakrishnan. Gray level co-occurrence matrices: generalisation and some new features. arXiv preprint arXiv: 1205.4831. 2012.
- Mallick PK, Ryu SH, Satapathy SK, Mishra S, Nguyen GN and Tiwari P. Brain MRI image classification for cancer detection using deep wavelet autoencoder-based deep neural network, IEEE Access. 2019; 7: 46278-46287.
- O'Shea K and Nash R. An introduction to convolutional neural networks, arXiv preprint arXiv:1511.08458v2. 2015.
- Baradaran M and Bergevin R. A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods, arXiv preprint arXiv. 2111.01604. 2021.
- Krizhevsky A, Sutskever I and Hinton GE. ImageNet classification with deep convolutional neural networks, Communications of the ACM. 2017; 60: 84-90.
- Szegedy C, Liu W, Jia Y, Sermanet Y, Reed S, Anguelov D, et al. Going deeper with convolutions, in Proceedings of the IEEE conference on computer vision and pattern recognition. 2015: 1-9.
- Dash M and Liu H. Feature selection for classification, Intelligent Data Analysis.1997; 1: 131-156.
- Ma?kiewicz A and Ratajczak W. Principal components analysis (PCA). Computers & Geosciences. 1993 303-342.
- Kharis S, Hadi I and Hasanah K. Multiclass Classification of Brain Cancer with Multiple Multiclass Artificial Bee Colony Feature Selection and Support Vector Machine, in Journal of Physics: Conference Series.1417 012015. 2019.