Top Banner
Noname manuscript No. (will be inserted by the editor) Symbiotic Coevolutionary Genetic Programming: A Benchmarking Study under Large Attribute Spaces John A. Doucette · Andrew R. McIntyre · Peter Lichodzijewski · Malcolm I. Heywood Received: date / Accepted: date Abstract Classification under large attribute spaces represents a dual learning problem in which attribute subspaces need to be identified at the same time as the classifier design is established. Embedded as opposed to filter or wrapper method- ologies address both tasks simultaneously. The motivation for this work stems from the observation that team based approaches to Genetic Programming (GP) have the potential to design multiple classifiers per class – each with a potentially unique attribute subspace – without recourse to filter or wrapper style preprocess- ing steps. Specifically, competitive coevolution provides the basis for scaling the algorithm to data sets with large instance counts; whereas cooperative coevolu- tion provides a framework for problem decomposition under a bid-based model for establishing program context. Symbiosis is used to separate the tasks of team / en- semble composition from the design of specific team members. Team composition is specified in terms of a combinatorial search performed by a Genetic Algorithm (GA); whereas the properties of individual team members and therefore subspace identification is established under an independent GP population. Teaming implies that the members of the resulting ensemble of classifiers should have explicitly non- overlapping behaviour. Performance evaluation is conducted over data sets taken from the UCI repository with 649 to 102,660 attributes and 2 to 10 classes. The resulting teams identify attribute spaces 1 to 4 orders of magnitude smaller than under the original data set. Moreover, team members generally consist of less than 10 instructions; thus, small attribute subspaces are not being traded for opaque models. Keywords Feature Subspace Selection, Problem Decomposition, Symbiosis, Coevolution, Model Complexity, Classification, Genetic Programming J. A. Doucette David R. Cheriton School of Computer Science, University of Waterloo, ON. Canada E-mail: [email protected] A. R. McIntyre, P. Lichodzijewski and M. I. Heywood Faculty of Computer Science, Dalhousie University, NS. Canada E-mail: {armcnty, piotr, mheywood}@cs.dal.ca
30

Symbiotic Coevolutionary Genetic Programming: A Benchmarking Study under Large Attribute Spaces

Apr 25, 2023

Download

Documents

Sehrish Rafiq
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.