UCT CS Research Document Archive

A hybrid POMDP-BDI agent architecture with online stochastic planning and plan caching

Rens, G and D Moodley (2017) A hybrid POMDP-BDI agent architecture with online stochastic planning and plan caching. Cognitive Systems Research 43:1-20.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

This article presents an agent architecture for controlling an autonomous agent in stochastic, noisy environments. The architecture combines the partially observable Markov decision process (POMDP) model with the belief-desire-intention (BDI) framework. The Hybrid POMDP-BDI agent architecture takes the best features from the two approaches, that is, the online generation of reward-maximizing courses of action from POMDP theory, and sophisticated multiple goal management from BDI theory. We introduce the advances made since the introduction of the basic architecture, including (i) the ability to pursue and manage multiple goals simultaneously and (ii) a plan library for storing pre-written plans and for storing recently generated plans for future reuse. A version of the architecture is implemented and is evaluated in a simulated environment. The results of the experiments show that the improved hybrid architecture outperforms the standard POMDP architecture and the previous basic hybrid architecture for both processing speed and effectiveness of the agent in reaching its goals.

EPrint Type:Journal (Paginated)
Subjects:I Computing Methodologies: I.2 ARTIFICIAL INTELLIGENCE
ID Code:1218
Deposited By:Moodley, Deshen
Deposited On:23 November 2017
Alternative Locations:http://www.sciencedirect.com/science/article/pii/S1389041716300870?via%3Dihub