Abstract
Primitive data representation of real-world data facilitates attribute interactions, which make information opaque to most learners. Feature Construction (FC) aims to abstract and encapsulate interactions into new features and outline them to the learner. When a GA is applied to perform FC, the goal is to generate features that facilitate more accurate learning. Then the GA's fitness function should estimate the quality of the constructed features. We propose a new fitness function based on Minimum Description Length (MDL). This fitness is incorporated in MFE2/GA to improve its accuracy. The new system is compared with other systems based on Entropy or error-rate fitness.
Original language | English |
---|---|
Title of host publication | Proceedings of GECCO 2007 |
Subtitle of host publication | Genetic and Evolutionary Computation Conference |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1875 |
Number of pages | 1 |
ISBN (Print) | 1595936971, 9781595936974 |
DOIs | |
Publication status | Published - 7 Jul 2007 |
Event | 9th Annual Genetic and Evolutionary Computation Conference, GECCO 2007 - London, United Kingdom Duration: 7 Jul 2007 → 11 Jul 2007 |
Conference
Conference | 9th Annual Genetic and Evolutionary Computation Conference, GECCO 2007 |
---|---|
Country/Territory | United Kingdom |
City | London |
Period | 7/07/07 → 11/07/07 |
Funding
This work is supported by the Spanish Ministry of Science and Technology, Grant number TSI2005-08225-C07-06.
Keywords
- Attribute interaction
- Entropy
- Feature construction
- Fitness function
- Machine learning
- MDL