Using A Hidden Markov Model to Transcribe Handwritten Bushman Texts

Williams, Kyle and Suleman, Hussein (2011) Using A Hidden Markov Model to Transcribe Handwritten Bushman Texts, Proceedings of the 11th Annual ACM/IEEE Joint Conference on Digital Libraries, 13-17 June 2011, Ottawa, Canada, 445-446, ACM/IEEE.

Abstract

The Bushman texts in the Bleek and Lloyd Collection contain complex diacritics that make automatic transcription difficult. Transcriptions of these texts would allow enhanced digital library services to be built for interacting with the collection. This study investigated automatic transcription of the Bushman texts using the popular method of Hidden Markov Model based text line recognition. The results show that while this technique may be well suited to well-constrained, well-understood scripts, applying it to more complex scripts introduces a number of difficulties that need to be overcome.

Item Type: Conference poster
Uncontrolled Keywords: OCR, handwriting recognition, Hidden Markov Model, digital libraries
Subjects: Applied computing > Document management and text processing; Information systems > Information retrieval
Date Deposited: 22 Jun 2011
Last Modified: 10 Oct 2019 15:33
URI: http://pubs.cs.uct.ac.za/id/eprint/696
