UCT CS Research Document Archive

Very Large Scale Digital Library Building in Greenstone using Parallel Processing

Thompson, John, David Bainbridge and Hussein Suleman (2011) Very Large Scale Digital Library Building in Greenstone using Parallel Processing. In Xing, Chunxiao, Fabio Crestani and Andreas Rauber, Eds. Proceedings 13th International Conference on Asia-Pacific Digital Libraries, pages 331-340, Beijing, China.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

As very large digital library collections become more commonplace, software tools must adapt appropriately. This paper reports on an evolution of the Greenstone Digital Library software to support parallel processing during the collection building phase. A series of experiments were conducted to first establish a basic speed-up factor, and then deconstruct the parallelisation process to understand the execution profile of the application. Several bottlenecks were identified and resolved to further improve the performance. The adaptation of Greenstone confirms that the build phase is indeed a suitable candidate for parallelisation; and suggests that parallelisation of processing is a new avenue for exploration in emerging digital library architectures.

EPrint Type:Conference Paper
Keywords:Greenstone, VLDL, Parallel Processing, Open MPI
Subjects:H Information Systems: H.4 INFORMATION SYSTEMS APPLICATIONS
C Computer Systems Organization: C.4 PERFORMANCE OF SYSTEMS
ID Code:742
Deposited By:Suleman, Hussein
Deposited On:09 December 2011
Alternative Locations:http://www.springerlink.com/content/06872m3x53328vr5/