TY - JOUR
T1 - Tumor classification by combining PNN classifier ensemble with neighborhood rough set based gene reduction
AU - Wang, Shu Lin
AU - Li, Xueling
AU - Zhang, Shanwen
AU - Gui, Jie
AU - Huang, De Shuang
N1 - Funding Information:
This work was supported by the National Science Foundation of China, (Grant nos. 60973153 and 30700161), the Guide Project of Innovative Base of Chinese Academy of Sciences (Grant no. KSCX1-YW-R-30), the Knowledge Innovation Program of the Chinese Academy of Sciences (0823A16121), and the China Postdoctoral Science Foundation (Grant no. 20090450707).
PY - 2010/2
Y1 - 2010/2
N2 - Since Golub applied gene expression profiles (GEP) to the molecular classification of tumor subtypes for more accurately and reliably clinical diagnosis, a number of studies on GEP-based tumor classification have been done. However, the challenges from high dimension and small sample size of tumor dataset still exist. This paper presents a new tumor classification approach based on an ensemble of probabilistic neural network (PNN) and neighborhood rough set model based gene reduction. Informative genes were initially selected by gene ranking based on an iterative search margin algorithm and then were further refined by gene reduction to select many minimum gene subsets. Finally, the candidate base PNN classifiers trained by each of the selected gene subsets were integrated by majority voting strategy to construct an ensemble classifier. Experiments on tumor datasets showed that this approach can obtain both high and stable classification performance, which is not too sensitive to the number of initially selected genes and competitive to most existing methods. Additionally, the classification results can be cross-verified in a single biomedical experiment by the selected gene subsets, and biologically experimental results also proved that the genes included in the selected gene subsets are functionally related to carcinogenesis, indicating that the performance obtained by the proposed method is convincing.
AB - Since Golub applied gene expression profiles (GEP) to the molecular classification of tumor subtypes for more accurately and reliably clinical diagnosis, a number of studies on GEP-based tumor classification have been done. However, the challenges from high dimension and small sample size of tumor dataset still exist. This paper presents a new tumor classification approach based on an ensemble of probabilistic neural network (PNN) and neighborhood rough set model based gene reduction. Informative genes were initially selected by gene ranking based on an iterative search margin algorithm and then were further refined by gene reduction to select many minimum gene subsets. Finally, the candidate base PNN classifiers trained by each of the selected gene subsets were integrated by majority voting strategy to construct an ensemble classifier. Experiments on tumor datasets showed that this approach can obtain both high and stable classification performance, which is not too sensitive to the number of initially selected genes and competitive to most existing methods. Additionally, the classification results can be cross-verified in a single biomedical experiment by the selected gene subsets, and biologically experimental results also proved that the genes included in the selected gene subsets are functionally related to carcinogenesis, indicating that the performance obtained by the proposed method is convincing.
KW - Biological data mining
KW - Gene expression profiles
KW - Gene selection
KW - Neighborhood rough set model
KW - Probabilistic neural network ensemble
KW - Tumor classification
UR - http://www.scopus.com/inward/record.url?scp=77649237177&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77649237177&partnerID=8YFLogxK
U2 - 10.1016/j.compbiomed.2009.11.014
DO - 10.1016/j.compbiomed.2009.11.014
M3 - Article
C2 - 20044083
AN - SCOPUS:77649237177
SN - 0010-4825
VL - 40
SP - 179
EP - 189
JO - Computers in Biology and Medicine
JF - Computers in Biology and Medicine
IS - 2
ER -