SciELO - Scientific Electronic Library Online

 
vol.18 issue2Two-Degrees-of-Freedom Robust PID Controllers Tuning Via a Multiobjective Genetic AlgorithmAttribute and Case Selection for NN Classifier through Rough Sets and Naturally Inspired Algorithms author indexsubject indexsearch form
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • Have no similar articlesSimilars in SciELO

Share


Computación y Sistemas

On-line version ISSN 2007-9737Print version ISSN 1405-5546

Abstract

GONZALEZ-NAVARRO, Félix Fernando  and  BELANCHE-MUNOZ, Lluís A.. Feature Selection for Microarray Gene Expression Data Using Simulated Annealing Guided by the Multivariate Joint Entropy. Comp. y Sist. [online]. 2014, vol.18, n.2, pp.275-293. ISSN 2007-9737.  https://doi.org/10.13053/CyS-18-2-2014-032.

Microarray classification poses many challenges for data analysis, given that a gene expression data set may consist of dozens of observations with thousands or even tens of thousands of genes. In this context, feature subset selection techniques can be very useful to reduce the representation space to one that is manageable by classification techniques. In this work we use the discretized multivariate joint entropy as the basis for a fast evaluation of gene relevance in a Microarray Gene Expression context. The proposed algorithm combines a simulated annealing schedule specially designed for feature subset selection with the incrementally computed joint entropy, reusing previous values to compute current feature subset relevance. This combination turns out to be a powerful tool when applied to the maximization of gene subset relevance. Our method delivers highly interpretable solutions that are more accurate than competing methods. The algorithm is fast, effective and has no critical parameters. The experimental results in several public-domain microarray data sets show a notoriously high classification performance and low size subsets, formed mostly by biologically meaningful genes. The technique is general and could be used in other similar scenarios.

Keywords : Feature selection; microarray gene expression data; multivariate joint entropy; simulated annealing.

        · abstract in Spanish     · text in English

 

Creative Commons License All the contents of this journal, except where otherwise noted, is licensed under a Creative Commons Attribution License