Automatic Detection and Correction of Spelling Errors in Agglutinative Languages (A Case Study of the Uzbek Language)

Authors

  • Nazira Sobirova G‘anijon qizi Basic Doctoral Student, Tashkent State University of Uzbek Language and Literature

Keywords:

Spell checking, NLP, Uzbek language, Levenshtein distance, N-gram, language model

Abstract

One of the key directions in the field of Natural Language Processing (NLP) is the automatic detection and correction of spelling errors in texts. This task becomes particularly challenging in agglutinative languages such as Uzbek, where words are formed through the addition of numerous affixes. This paper analyzes algorithms for detecting and correcting spelling errors in the Uzbek language, their operational principles, and modern machine learning approaches. The study examines dictionary-based methods, the Levenshtein distance algorithm, N-gram models, and context-aware approaches based on neural networks. The findings demonstrate that a hybrid algorithm, combining multiple techniques, provides the most effective solution for Uzbek spell-checking.

References

L. Bobojonova, A. Akhundjanova, P. Ostheimer, and S. Fellenz, “BERT-based part-of-speech tagging for Uzbek language,” arXiv, 2025.

F. J. Damerau, “A technique for computer detection and correction of spelling errors,” Communications of the ACM, vol. 7, no. 3, pp. 171–176, 1964.

D. Jurafsky and J. H. Martin, Speech and Language Processing, 3rd ed., 2023.

C. D. Manning and H. Schütze, Foundations of Statistical Natural Language Processing. Cambridge, MA, USA: MIT Press, 1999.

K. Kukich, “Techniques for automatically correcting words in text,” ACM Computing Surveys, vol. 24, no. 4, pp. 377–439, 1992.

V. I. Levenshtein, “Binary codes capable of correcting deletions, insertions, and reversals,” Soviet Physics Doklady, vol. 10, no. 8, pp. 707–710, 1966.

M. M. Ochilov, O. O. Narzullayev, and O. A. Xolmatov, “Mashinali o‘qitish algoritmlari asosida o‘zbek tili matnlaridagi imlo xatolarini aniqlash va tuzatish,” 2025.

B. Elov and M. Ahmedova, “Development of a spell correction system based on N-grams,” 2025.

U. Salaev, “UzMorphAnalyser: Morphological analysis model for Uzbek language,” arXiv, 2024.

M. Minin, “Norvig and SymSpell spelling correction algorithms,” 2024.

R. S. Madatovich and S. D. Maxamadiyevna, “The Role Of The System Of Education And Family Education In Forming Youth's World View,” European Journal of Humanities and Educational Advancements, vol. 4, no. 4, pp. 128–130.

R. Madatovich, “The role of civic responsibility in educating youth in a healthy spiritual environment in an information society,” Pubmedia Social Sciences and Humanities, vol. 3, no. 1, pp. 6, 2025.

R. S. Madatovich, “The role of preschool education and family education in the raising of a healthy balanced generation,” For Teachers, vol. 57, no. 4, pp. 520–523, 2024.

R. O‘. SirojmuRODOV, “Yoshlarda sog‘lom turmush tarzi rivojlanishida milliy va dunyoviy qadriyatlarni uyg‘unlashtirishning ijtimoiy-falsafiy tahlil,” ACTA NUUz, vol. 1, no. 1.10.1, pp. 184–186, 2024.

S. Ruzimurodov and Sh. Artikov, “Anakharsis–velikiy filosof iz Centralnoy Azii,” Innovatsii v tekhnologiyakh i obrazovanii, pp. 340–342, 2016.

Downloads

Published

2026-05-16

How to Cite

G‘anijon qizi, N. S. (2026). Automatic Detection and Correction of Spelling Errors in Agglutinative Languages (A Case Study of the Uzbek Language). Central Asian Journal of Literature, Philosophy and Culture, 7(3), 91–97. Retrieved from https://cajlpc.casjournal.org/index.php/CAJLPC/article/view/1545

Issue

Section

Articles