An Efficient Binary Sand Cat Swarm Optimization for Feature Selection in High-Dimensional Biomedical Data

dc.contributor.authorPashaei, Elnaz
dc.contributor.departmentMedical and Molecular Genetics, School of Medicine
dc.date.accessioned2024-03-22T12:12:41Z
dc.date.available2024-03-22T12:12:41Z
dc.date.issued2023-09-25
dc.description.abstractRecent breakthroughs are making a significant contribution to big data in biomedicine which are anticipated to assist in disease diagnosis and patient care management. To obtain relevant information from this data, effective administration and analysis are required. One of the major challenges associated with biomedical data analysis is the so-called “curse of dimensionality”. For this issue, a new version of Binary Sand Cat Swarm Optimization (called PILC-BSCSO), incorporating a pinhole-imaging-based learning strategy and crossover operator, is presented for selecting the most informative features. First, the crossover operator is used to strengthen the search capability of BSCSO. Second, the pinhole-imaging learning strategy is utilized to effectively increase exploration capacity while avoiding premature convergence. The Support Vector Machine (SVM) classifier with a linear kernel is used to assess classification accuracy. The experimental results show that the PILC-BSCSO algorithm beats 11 cutting-edge techniques in terms of classification accuracy and the number of selected features using three public medical datasets. Moreover, PILC-BSCSO achieves a classification accuracy of 100% for colon cancer, which is difficult to classify accurately, based on just 10 genes. A real Liver Hepatocellular Carcinoma (TCGA-HCC) data set was also used to further evaluate the effectiveness of the PILC-BSCSO approach. PILC-BSCSO identifies a subset of five marker genes, including prognostic biomarkers HMMR, CHST4, and COL15A1, that have excellent predictive potential for liver cancer using TCGA data.
dc.eprint.versionFinal published version
dc.identifier.citationPashaei E. An Efficient Binary Sand Cat Swarm Optimization for Feature Selection in High-Dimensional Biomedical Data. Bioengineering (Basel). 2023;10(10):1123. Published 2023 Sep 25. doi:10.3390/bioengineering10101123
dc.identifier.urihttps://hdl.handle.net/1805/39423
dc.language.isoen_US
dc.publisherMDPI
dc.relation.isversionof10.3390/bioengineering10101123
dc.relation.journalBioengineering
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.sourcePMC
dc.subjectSand cat swarm optimization
dc.subjectPinhole-imaging-based learning
dc.subjectFeature selection
dc.subjectBiomedical data
dc.subjectCancer prediction
dc.titleAn Efficient Binary Sand Cat Swarm Optimization for Feature Selection in High-Dimensional Biomedical Data
dc.typeArticle
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bioengineering-10-01123.pdf
Size:
3.25 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: