Feature Selection with Non-Linear Dependence Based on Multi-objective Strategy

Chun Liang Lu, Wei Chun Tang, Yu Shuen Tsai, Nikhil R. Pal, I. Fang Chung*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

It is an interesting and important issue to identify a small set of useful features from a high dimensional data that can be used to design a classification mechanism. Usually, researchers prefer to find the features that have high relevance, in the sense that the correlation of each of those features with class labels is high or the mutual information between each of the features and class labels is high. Such approaches usually end up finding features that may be linearly dependent with each other. For some biological studies, it may be interesting to find a set of genes (features), which have high relevance with the class labels and also the genes are nonlinearly dependent-we explicitly want to exclude relevant genes that are linearly correlated among them. Although, our primary focus in this study is to find such genes from microarray data sets, such features may also be important in other studies. In this study, the Combinations of Relevantly Non-linear Dependency Subsets (CoRNDS) is proposed to tackle such the multi-objective problem. It opens up a good to simultaneously control selection of number of useful features, optimize the relevance between the selected features with class labels, and the non-linear dependency between the selected features. Using innovative ways we design three new objectives and optimize them by using the well-known multi-objective evolutionary algorithm based on decomposition (MOEA/D) method. To the best of our knowledge, this is the first attempt to feature (gene) selection along with identification of non-linear dependency between features via a multi-objective strategy. Experimental results show that the feasibility and effective performance on microarray cancer dataset. As to these selected gene subsets, investigate their auxiliary role of co-regulation in the biological pathways, and the occurrence in the pathogenesis of cancer are interesting future works.

Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 16th International Conference on Bioinformatics and Bioengineering, BIBE 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages346-349
Number of pages4
ISBN (Electronic)9781509038336
DOIs
StatePublished - 16 Dec 2016
Event16th IEEE International Conference on Bioinformatics and Bioengineering, BIBE 2016 - Taichung, Taiwan
Duration: 31 Oct 20162 Nov 2016

Publication series

NameProceedings - 2016 IEEE 16th International Conference on Bioinformatics and Bioengineering, BIBE 2016

Conference

Conference16th IEEE International Conference on Bioinformatics and Bioengineering, BIBE 2016
Country/TerritoryTaiwan
CityTaichung
Period31/10/162/11/16

Keywords

  • Feature selection
  • Multi-Objectives
  • Non-linear dependency

Fingerprint

Dive into the research topics of 'Feature Selection with Non-Linear Dependence Based on Multi-objective Strategy'. Together they form a unique fingerprint.

Cite this