UCT CS Research Document Archive

Large Scale Metadata Harvesting Over Low Bandwidth Connections

Mulder, Rickert (2010) Large Scale Metadata Harvesting Over Low Bandwidth Connections. Technical Report CS10-05-00, Department of Computer Science, University of Cape Town.

Full text available as:
PDF - Requires Adobe Acrobat Reader or other PDF viewer.

Abstract

There seems to be a widespread perception that large scale metadata harvesting requires a large amount of bandwidth. In this study a simple Python-based metadata harvester was created and run over a residential broadband connection. Results show that it is possible to build a metadata collection in the order of millions of records in just a few days over such connections.

EPrint Type:Departmental Technical Report
Keywords:OAI-PMH, Low Bandwidth, Harvesting
Subjects:H Information Systems: H.3 INFORMATION STORAGE AND RETRIEVAL
ID Code:602
Deposited By:Mulder, Rickert
Deposited On:11 March 2010