International Journal of Computational Intelligence Systems

Volume 11, Issue 1, 2018, Pages 1322 - 1337

Automatic clustering based on Crow Search Algorithm-Kmeans (CSA-Kmeans) and Data Envelopment Analysis (DEA)

Authors
Alireza Balavand1, a.balavand@srbiau.ac.ir, Ali Husseinzadeh Kashan*, 2, a.kashan@modares.ac.ir, Abbas Saghaei3, a.saghaei@srbiau.ac.ir
1Department of Industrial Engineering, Science and Research branch, Islamic Azad University, Tehran, Iran.
2Faculty of Industrial and Systems Engineering, Tarbiat Modares University, Tehran, Iran
3Department of Industrial Engineering, Science and Research branch, Islamic Azad University, Tehran, Iran.
*Corresponding author
Corresponding Author
Ali Husseinzadeh Kashana.kashan@modares.ac.ir
Available Online 1 January 2018.
DOI
10.2991/ijcis.11.1.98How to use a DOI?
Keywords
Automatic Clustering; K-Means; Crow Search Optimization Algorithm; Cluster Validity Indices; Data Envelopment Analysis
Abstract

Cluster Validity Indices (CVI) evaluate the efficiency of a clustering algorithm and Data Envelopment Analysis (DEA) evaluate the efficiency of Decision-Making Units (DMUs) using a number of inputs data and outputs data. Combination of the CVI and DEA inspired the development of a new automatic clustering algorithm called Automatic Clustering Based on Data Envelopment Analysis (ACDEA). ACDEA is able to determine the optimal number of clusters in four main steps. In the first step, a new clustering algorithm called CSA-Kmeans is introduced. In this algorithm, clustering is performed by the Crow Search Algorithm (CSA), in which the K-means algorithm generates the initial centers of the clusters. In the second step, the clustering of data is performed from kmin cluster to kmax cluster, using CSA-Kmeans. At each iteration of clustering, using correct data labels, Within-Group Scatter (WGS) index, Between-Group Scatter (BGS) index, Dunn Index (DI), the Calinski-Harabasz (CH) index, and the Silhouette index (SI) are extracted and stored, which ultimately these indices make a matrix that the columns of this matrix indicate the values of validity indices and the rows or DMUs represent the number of clustering times from kmin cluster to kmax cluster. In the third step, the efficiency of the DMUs is calculated using the DEA method based on the second stage matrix, and given that the DI, CH, and SI estimate the relationship within group scatter and between group scatter, WGS and BGS are used as input variables and the indices of DI, CH and SI are used as output variables to DEA. Finally, in step four, AP method is used to calculate the efficiency of DMUs, so that an efficiency value is obtained for each DMU that maximum efficiency represents the optimal number of clusters. In this study, three categories of data are used to measure the efficiency of the ACDEA algorithm. Also, the efficiency of ACDEA is compared with the DCPSO, GCUK and ACDE algorithms. According to the results, there is a positive significant relationship between input CVI and output CVI in data envelopment analysis, and the optimal number of clusters is achieved for many cases.

Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article under the CC BY-NC license (http://creativecommons.org/licences/by-nc/4.0/).

Download article (PDF)
View full text (HTML)

Journal
International Journal of Computational Intelligence Systems
Volume-Issue
11 - 1
Pages
1322 - 1337
Publication Date
2018/01/01
ISSN (Online)
1875-6883
ISSN (Print)
1875-6891
DOI
10.2991/ijcis.11.1.98How to use a DOI?
Copyright
© 2018, the Authors. Published by Atlantis Press.
Open Access
This is an open access article under the CC BY-NC license (http://creativecommons.org/licences/by-nc/4.0/).

Cite this article

TY  - JOUR
AU  - Alireza Balavand
AU  - Ali Husseinzadeh Kashan
AU  - Abbas Saghaei
PY  - 2018
DA  - 2018/01/01
TI  - Automatic clustering based on Crow Search Algorithm-Kmeans (CSA-Kmeans) and Data Envelopment Analysis (DEA)
JO  - International Journal of Computational Intelligence Systems
SP  - 1322
EP  - 1337
VL  - 11
IS  - 1
SN  - 1875-6883
UR  - https://doi.org/10.2991/ijcis.11.1.98
DO  - 10.2991/ijcis.11.1.98
ID  - Balavand2018
ER  -