Abstract
Background: Using knowledge-based interpretation to analyze omics data can not only obtain essential information regarding various biological processes, but also reflect the current physiological status of cells and tissue. The major challenge to analyze gene expression data, with a large number of genes and small samples, is to extract disease-related information from a massive amount of redundant data and noise. Gene selection, eliminating redundant and irrelevant genes, has been a key step to address this problem. Results: The modified method was tested on four benchmark datasets with either two-class phenotypes or multiclass phenotypes, outperforming previous methods, with relatively higher accuracy, true positive rate, false positive rate and reduced runtime. Conclusions: This paper proposes an effective feature selection method, combining double RBF-kernels with weighted analysis, to extract feature genes from gene expression data, by exploring its nonlinear mapping ability.
Original language | English |
---|---|
Article number | 396 |
Number of pages | 14 |
Journal | BMC Bioinformatics |
Volume | 19 |
Issue number | 1 |
DOIs | |
Publication status | Published - 29 Oct 2018 |
Keywords
- Cancer classification
- Clustering
- Data mining
- Feature selection
- Gene expression
- Gene Expression Profiling/methods
- Humans
- Gene Expression Regulation, Neoplastic
- Computational Biology/methods
- Phenotype
- Algorithms
- Neoplasms/classification
- Neoplasm Proteins/genetics