UCT CS Research Document Archive

The Data Mining OPtimization Ontology

Keet, C. Maria, Agnieszka Lawrynowicz, Claudia d'Amato, Alexandros Kalousis, Phong Nguyen, Raul Palma and Robert Stevens (2015) The Data Mining OPtimization Ontology. Web Semantics: Science, Services and Agents on the World Wide Web 32:43-53.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

The Data Mining OPtimization Ontology (DMOP) has been developed to support informed decision-making at various choice points of the data mining process. The ontology can be used by data miners and deployed in ontology-driven information systems. The primary purpose for which DMOP has been developed is the automation of algorithm and model selection through semantic meta-mining that makes use of an ontology-based meta-analysis of complete data mining processes in view of extracting patterns associated with mining performance.
To this end, DMOP contains detailed descriptions of data mining tasks (e.g., learning, feature selection), data, algorithms, hypotheses such as mined models or patterns, and workflows. A development methodology was used for DMOP, including items such as competency questions and foundational ontology reuse. Several non-trivial modeling problems were encountered and due to the complexity of the data mining details, the ontology requires the use of the OWL 2 DL profile.
DMOP was successfully evaluated for semantic meta-mining and used in constructing the Intelligent Discovery Assistant, deployed at the popular data mining environment RapidMiner.

EPrint Type:Journal (Paginated)
Keywords:Data mining, ontology development, semantic web
Subjects:I Computing Methodologies: I.2 ARTIFICIAL INTELLIGENCE
I Computing Methodologies: I.6 SIMULATION AND MODELING
ID Code:1020
Deposited By:Keet, C. Maria
Deposited On:08 July 2015