Improving the Transcription of Academic Lectures for Information Retrieval

Mbogho, Audrey and Marquard, Stephen (2013) Improving the Transcription of Academic Lectures for Information Retrieval, The 12th International Conference on Machine Learning and Applications (ICMLA'13), 4-7 December 2013, Miami, FL, USA, IEEE.

[img] PDF
searchability.pdf

Download (205kB)

Abstract

Recording university lectures through lecture capture systems is increasingly common, generating large amounts of audio and video data. Transcribing recordings greatly enhances their usefulness by making them easy to search. However, the number of recordings accumulates rapidly, rendering manual transcription impractical. Automatic transcription, on the other hand, suffers from low levels of accuracy, partly due to the special language of academic disciplines, which standard language models do not cover. This paper looks into the use of Wikipedia to dynamically adapt language models for scholarly speech. We propose Ranked Word Correct Rate as a new metric better aligned with the goals of improving transcript searchability and specialist word recognition. The study shows that, while overall transcription accuracy may remain low, targeted language modelling can substantially improve searchability, an important goal in its own right.

Item Type: Conference proceedings
Uncontrolled Keywords: language model, speech recognition, lecture recording
Subjects: Social and professional topics > Professional topics > Computing education
Applied computing > Document management and text processing
Date Deposited: 28 Oct 2013
Last Modified: 10 Oct 2019 15:32
URI: http://pubs.cs.uct.ac.za/id/eprint/905

Actions (login required)

View Item View Item