Semi-automatic linguistic annotation for lexicography with machine learning

Gubb, Benjamin and Marquard, Cael (2025) Semi-automatic linguistic annotation for lexicography with machine learning, Proceedings of African Association for Lexicography - 29th International Conference, 2-5 July 2025, Cape Town, South Africa, 33-34, African Association for Lexicography.

Semi-automatic linguistic annotation for lexicography with machine learning (Gubb & Marquard).docx - Accepted Version
Available under License Creative Commons Attribution.

Abstract

Many terminology lists for South African languages provide only translations of each term, without grammatical information such as part of speech and noun class. This makes it difficult to know how to use the terms correctly in context. Annotating these lists manually requires linguistic expertise and can be costly, time-intensive, and error-prone. Instead, we propose annotating them with a machine-learning classifier, after which an expert reviews the generated output to ensure accuracy. We will apply this approach to isiXhosa and integrate the results into the IsiXhosa.click online dictionary (Marquard 2024). This advances the annotation of lexicographic works by making it easier to add linguistic information to dictionaries.
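
The classify-then-review workflow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the choice of a naive Bayes classifier over word prefixes, the tiny hand-picked training list of isiXhosa nouns with their noun classes, the confidence threshold, and the names `PrefixNaiveBayes`, `prefix_features`, and `annotate` are all hypothetical. The abstract does not say which classifier or features the authors use.

```python
import math
from collections import Counter, defaultdict

# Toy training data (assumption, illustration only): isiXhosa nouns
# paired with their noun class number. A real system would need far more.
TRAIN = [
    ("umntu", "1"), ("abantu", "2"),
    ("umthi", "3"), ("imithi", "4"),
    ("ilizwe", "5"), ("amazwe", "6"),
    ("isitya", "7"), ("izitya", "8"),
    ("inja", "9"), ("izinja", "10"),
    ("ubuntu", "14"), ("ukutya", "15"),
]


def prefix_features(word, max_len=4):
    """Return the word's prefixes of length 1..max_len as features."""
    word = word.lower()
    return [word[:n] for n in range(1, min(max_len, len(word)) + 1)]


class PrefixNaiveBayes:
    """Multinomial naive Bayes over prefix features, with add-one smoothing."""

    def fit(self, examples):
        self.class_counts = Counter()
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for word, label in examples:
            self.class_counts[label] += 1
            for feat in prefix_features(word):
                self.feature_counts[label][feat] += 1
                self.vocab.add(feat)
        return self

    def predict_proba(self, word):
        total = sum(self.class_counts.values())
        log_scores = {}
        for label, count in self.class_counts.items():
            score = math.log(count / total)  # class prior
            denom = sum(self.feature_counts[label].values()) + len(self.vocab)
            for feat in prefix_features(word):
                score += math.log((self.feature_counts[label][feat] + 1) / denom)
            log_scores[label] = score
        # Normalise log scores into probabilities in a numerically stable way.
        top = max(log_scores.values())
        exp_scores = {k: math.exp(v - top) for k, v in log_scores.items()}
        z = sum(exp_scores.values())
        return {k: v / z for k, v in exp_scores.items()}


def annotate(words, model, threshold=0.6):
    """Suggest a noun class per term and flag low-confidence rows for an expert."""
    rows = []
    for word in words:
        probs = model.predict_proba(word)
        label = max(probs, key=probs.get)
        rows.append({
            "term": word,
            "noun_class": label,
            "confidence": probs[label],
            "needs_review": probs[label] < threshold,
        })
    return rows


model = PrefixNaiveBayes().fit(TRAIN)
for row in annotate(["abafundi", "umfundi"], model):
    print(row)
```

With so little training data, both example terms fall below the threshold and are flagged for review. The term "umfundi" also shows why expert review matters: the prefix "um-" occurs in both class 1 and class 3 nouns in the toy data, so a prefix-only model cannot separate the two classes.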

Item Type: Conference paper
Uncontrolled Keywords: IsiXhosa, Lexicography, e-Lexicography, Dictionary, Online Dictionary, Machine Learning, Natural Language Processing, NLP
Subjects: Information systems > Information systems applications > Collaborative and social computing systems and tools > Open source software
Computing methodologies > Artificial intelligence > Natural language processing
Information systems > Information retrieval > Document representation > Dictionaries
Alternate Locations: https://www.afrilex.co.za/_files/ugd/bd0c6e_26ce7c948ecf4c7283d0edc600ec69fa.pdf
Date Deposited: 13 Oct 2025 12:26
Last Modified: 13 Oct 2025 12:26
URI: https://pubs.cs.uct.ac.za/id/eprint/1751
