Data reduction by genetic algorithms and non-algebraic feature construction: a case study

Leila S. Shafti, Eduardo Pérez

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

5 Citations (Scopus)

Abstract

Real-world data are often prepared for purposes other than data mining and machine learning and, therefore, are represented by primitive attributes. When data representation is primitive, preprocessing data before looking for patterns becomes necessary. If lack of domain experts prevents the use of highly informative attributes, patterns are hard to uncover due to complex attribute interactions. This article suggests a new use of MFE3/GA to restructure the primitive data representation by means of capturing and compacting hidden information into new features in order to highlight them to the learner. Empirical results on Poker Hand data set show that the new use successfully improves learning this concept by means of data reduction, generation of a smaller decision tree classifier, and accuracy improvement.
Original languageEnglish
Title of host publication2008 Eighth International Conference on Hybrid Intelligent Systems
EditorsFatos Xhafa, Francisco Herrera, Ajith Abraham, Mario Köppen, Jose Manuel Bénitez
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages573-578
Number of pages6
ISBN (Print)9780769533261
DOIs
Publication statusPublished - 19 Sept 2008
EventEighth International Conference on Hybrid Intelligent Systems - Barcelona, Spain
Duration: 10 Sept 200812 Sept 2008

Publication series

NameInternational Conference on Hybrid Intelligent Systems (HIS)

Conference

ConferenceEighth International Conference on Hybrid Intelligent Systems
Abbreviated titleHIS 2008
Country/TerritorySpain
CityBarcelona
Period10/09/0812/09/08

Funding

This work has been partially supported by the Spanish Ministry of Science and Technology, under Grant number TSI2005-08225-C07-06.

Keywords

  • genetic algorithms
  • data reduction
  • MFE3/GA

Fingerprint

Dive into the research topics of 'Data reduction by genetic algorithms and non-algebraic feature construction: a case study'. Together they form a unique fingerprint.

Cite this