Selecting relevant features for classifier optimization
Mgala, Mvurya and Audrey Mbogho (2014) Selecting relevant features for classifier optimization. In Proceedings AMLTA 2014, pages 211-222, Cairo, Egypt.
Feature selection is an important data pre-processing step that comes before applying a machine learning algorithm. It removes irrelevant and redundant attributes from the dataset with an aim of improving the algorithm performance. There exist feature selection methods which focus on discovering features that are most suitable. These methods include wrappers, a subroutine of the learning algorithm itself, and filters, which discover features according to heuristics, based on the data characteristics and not tied to a specific algorithm. This paper improves the filter approach by enabling it to select strongly relevant and weakly relevant features and gives room to the researcher to decide which of the weakly relevant features to include. This new approach brings clarity and understandability to the feature selection preprocessing step.
|EPrint Type:||Conference Paper|
|Keywords:||feature selection, information gain, wrappers, filters, descriptive statistics|
|Subjects:||I Computing Methodologies: I.2 ARTIFICIAL INTELLIGENCE|
|Deposited By:||Mgala, Mvurya|
|Deposited On:||27 January 2016|