UCT CS Research Document Archive

Selecting relevant features for classifier optimization

Mgala, Mvurya and Audrey Mbogho (2014) Selecting relevant features for classifier optimization. In Proceedings AMLTA 2014, pages 211-222, Cairo, Egypt.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.


Feature selection is an important data pre-processing step that comes before applying a machine learning algorithm. It removes irrelevant and redundant attributes from the dataset with an aim of improving the algorithm performance. There exist feature selection methods which focus on discovering features that are most suitable. These methods include wrappers, a subroutine of the learning algorithm itself, and filters, which discover features according to heuristics, based on the data characteristics and not tied to a specific algorithm. This paper improves the filter approach by enabling it to select strongly relevant and weakly relevant features and gives room to the researcher to decide which of the weakly relevant features to include. This new approach brings clarity and understandability to the feature selection preprocessing step.

EPrint Type:Conference Paper
Keywords:feature selection, information gain, wrappers, filters, descriptive statistics
Subjects:I Computing Methodologies: I.2 ARTIFICIAL INTELLIGENCE
ID Code:1054
Deposited By:Mgala, Mvurya
Deposited On:27 January 2016