Neighborhood rough set model based gene selection for multi-subtype tumor classification

Shulin Wang, Xueling Li, Shanwen Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Scopus citations

Abstract

Multi-subtype tumor diagnosis based on gene expression profiles is promising in clinical medicine application. Therefore, a great deal of research on tumor classification based on gene expression profiles has been developed, where various machine learning approaches were applied to constructing the best tumor classification model to improve the classification performance as much as possible. To achieve this goal, extracting features or finding informative genes that have good classification ability is crucial. We propose a novel gene selection approach, which adopts Kruskal-Wallis rank sum test to rank all genes and then apply an algorithm based on neighborhood rough set model to gene reduction to obtain gene subsets with fewer genes and more classification ability. Experiments on a small round blue cell tumor (SRBCT) dataset show that our approach can achieve very high classification accuracy with only three or four genes as evaluated by three classifiers: support vector machines, K-nearest neighbor and neighborhood classifier, respectively.

Original languageEnglish (US)
Title of host publicationAdvanced Intelligent Computing Theories and Applications
Subtitle of host publicationWith Aspects of Theoretical and Methodological Issues - 4th International Conference on Intelligent Computing, ICIC 2008, Proceedings
Pages146-158
Number of pages13
DOIs
StatePublished - 2008
Externally publishedYes
Event4th International Conference on Intelligent Computing, ICIC 2008 - Shanghai, China
Duration: Sep 15 2008Sep 18 2008

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume5226 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other4th International Conference on Intelligent Computing, ICIC 2008
Country/TerritoryChina
CityShanghai
Period9/15/089/18/08

Keywords

  • Gene expression profiles
  • K-nearest neighbor
  • Neighborhood classifier
  • Support vector machines
  • Tumor classification

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Neighborhood rough set model based gene selection for multi-subtype tumor classification'. Together they form a unique fingerprint.

Cite this