Williams, Kyle and Suleman, Hussein (2011) Using A Hidden Markov Model to Transcribe Handwritten Bushman Texts, Proceedings of the 11th Annual ACM/IEEE Joint Conference on Digital Libraries, 13-17 June 2011, Ottawa, Canada, 445-446, ACM/IEEE.
Abstract
The Bushman texts in the Bleek and Lloyd Collection contain complex diacritics that make automatic transcription difficult. Transcriptions of these texts would enable enhanced digital library services for interacting with the collection. This study investigated automatic transcription of the Bushman texts using a Hidden Markov Model for text line recognition, a popular method. The results show that while this technique may be well suited to well-constrained and well-understood scripts, applying it to more complex scripts introduces a number of difficulties that need to be overcome.
Item Type: | Conference poster |
---|---|
Uncontrolled Keywords: | OCR, handwriting recognition, Hidden Markov Model, digital libraries |
Subjects: | Applied computing > Document management and text processing; Information systems > Information retrieval |
Date Deposited: | 22 Jun 2011 |
Last Modified: | 10 Oct 2019 15:33 |
URI: | http://pubs.cs.uct.ac.za/id/eprint/696 |