Statistical Analysis of the Vocabulary of “Shajarayi tarokima” by Parts of Speech

  • Begmatova Marjona Jomgirovna Tashkent State named after Alisher Navoi, University of Uzbek language and literature 2 Foundation doctoral Student
Keywords: Lexeme, parts of speech, words denoting a person and an object, words denoting an action, words denoting a quality, words denoting number and quantity, lexemes denoting reference

Abstract

This study investigates the vocabulary of the work Shajara-i Tarākima through a statistical lens, focusing on parts of speech. The analysis is anchored in the importance of language as a communication tool that reflects the social, economic, and cultural status of its time. With the advancement of computational linguistics, this research employs statistical methods to analyze and compare the frequency of lexical categories, aiding in the understanding of language structure and style. Despite the relevance of historical works in shaping modern language, there is a lack of comprehensive statistical analyses of classic texts like Shajara-i Tarākima, particularly in terms of parts of speech and their semantic classifications. The lexicon of Shajara-i Tarākima was digitized and analyzed using Excel, categorizing 14,093 lexemes into various parts of speech, including nouns, verbs, adjectives, and other lexical categories. The frequency of each category was calculated, and a semantic classification was conducted to provide deeper insight into the vocabulary. The analysis revealed that nouns, particularly proper nouns such as anthroponyms, toponyms, and ethnonyms, formed the largest portion of the lexicon. Verbs and adjectives followed, with a smaller proportion of auxiliary words and conjunctions. Notably, the work exhibits historical and archaic lexical forms, which distinguish it from modern Uzbek. The study concludes that the vocabulary of Shajara-i Tarākima is rich in Turkic onomastic units, showing minimal grammatical deviation from the modern Uzbek language but containing archaic lexemes. This research contributes to understanding the evolution of the Uzbek language, offering a statistical framework for further linguistic studies and comparisons across historical texts. It also highlights the relevance of linguistic tools in studying classical works for cultural, historical, and literary analysis.

References

H. Dadaboyev, O‘zbek terminologiyasi, Toshkent: Nodirabegim, 2020.

Explanatory Dictionary of the Uzbek Language, vols. I–V, Tashkent: Uzbekistan National Encyclopedia, 2006–2008.

Old Turkic Dictionary (DTS), Leningrad, 1969, p. 516.

M. Xolmurodova, Qutadghu Bilig leksikasi, Ph.D. diss., Tashkent, 2018.

B. Abdushukurov, Turkiy manbalar leksikasi, Tashkent: BOOKMANY PRINT, 2022, pp. 141-142.

B. Abdushukurov, Qissasi Rabg‘uziy leksikasi, Ph.D. diss., Tashkent, 2017.

Alisher Navoi, Mahbub ul-qulub, Complete Works, vol. 14, Tashkent: Fan, 1998.

A. Khojiyev, O‘zbek tili sinonimlarining izohli lug‘ati, Tashkent: O‘qituvchi, 1974, p. 307.

A. Khojiyev, Tilshunoslik terminlarining izohli lug‘ati, Tashkent: Uzbekistan National Encyclopedia, 2002.

Sh. Rahmatullaev, O‘zbek tilining etimologik lug‘ati, vol. II (Arabic words and their derivatives), Tashkent: Universitet, 2003.

I. Ismoilov, Turkiy tillarda qavm-qarindoshlik terminlari, Tashkent: FAN, 1966, p. 150.

A. Sodiqov, A. Abduazizov, and M. Irsqulov, Tilshunoslikka kirish, Tashkent: O‘qituvchi, 1981, p. 95.

Sh. Shoabdurahmonov, O‘zbek adabiy tili va o‘zbek xalq shevalari, Tashkent, 1962, p. 10.

B. Abulgazi, Shajarayi Tarokima, Tashkent, 1995.

S. T. Durbin, "The Application of Linguostatistics in Turkic Language Studies," Journal of Turkic Linguistics, vol. 22, no. 1, pp. 45-58, 2021.

J. M. Stone, Statistical Approaches to Linguistic Analysis of Classical Texts, Cambridge: Cambridge University Press, 2019.

Z. M. Akhmedov, "Lexical Analysis of Old Turkic Texts Using Modern Computational Tools," Asian Journal of Linguistics, vol. 15, no. 3, pp. 120-133, 2020.

M. R. Jafarov, "The Role of Lexical Categories in Historical Texts: A Case Study of Shajara-i Tarākima," Linguistic Research Review, vol. 5, no. 2, pp. 88-102, 2022.

Published
2025-10-01
How to Cite
Jomgirovna, B. M. (2025). Statistical Analysis of the Vocabulary of “Shajarayi tarokima” by Parts of Speech. Central Asian Journal of Literature, Philosophy and Culture, 6(4), 880-886. https://doi.org/10.51699/cajlpc.v6i4.1351
Section
Articles