Sorting and Storing Search Results in The Uzbek Language Corpus. Lexical Search Parameters

Yuldashev Aziz Uyg’un O’g’li, Teacher at TashDO’TAU
Keywords: corpus of the Uzbek language, search engine, lemma, morpheme, word group, affix, synonym, collocation, semantic field

Abstract

The advancement of corpus linguistics in Uzbekistan requires efficient digital tools for linguistic data retrieval and analysis. Although several electronic corpora have been created, comprehensive studies of search mechanisms in the Uzbek language corpus, specifically sorting, storing, and lexical filtering, remain limited. Current corpus systems lack a detailed methodological framework for organizing search results, handling lexical parameters such as lemmas and morphemes, and providing user-friendly data export and management. This study analyzes and systematizes search mechanisms for the Uzbek language corpus, focusing on sorting, storing, and lexical search parameters, and adapts international corpus practices to the morphological complexity of Uzbek. The findings show that corpus data become more meaningful and discoverable when alphabetical, frequency-based, and metadata-based sorting is combined with lemma- and morpheme-based search. Export functions (CSV, XML, JSON) and saved search history support long-term use of the tool in research. The study presents a general model that combines computational and linguistic principles to make corpus search more efficient, together with an adaptive model for handling agglutinative structures. The proposed system strengthens the methodological foundation of Uzbek corpus linguistics, facilitates corpus-based research and teaching, and supports the development of computational linguistics in Uzbekistan by transforming the Uzbek corpus into an interactive, analytical, and educational digital resource.
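
The combination of sorting modes and export formats described above can be illustrated with a minimal sketch. It is not the system proposed in the article: the `Hit` record, its fields, the `sort_hits` modes, and the sample data are all hypothetical and chosen only for illustration. Alphabetical sorting here uses plain lowercase string order, which is a simplification of Uzbek alphabetical order, and XML export is left out for brevity.

```python
import csv
import io
import json
from collections import Counter
from dataclasses import dataclass, asdict


@dataclass
class Hit:
    """Hypothetical record for one search hit (field names are assumptions)."""
    token: str  # word form as it occurs in the text
    lemma: str  # dictionary form of the token
    year: int   # example metadata field


def sort_hits(hits, mode):
    """Return a new list sorted alphabetically, by lemma frequency, or by metadata."""
    if mode == "alphabetical":
        # Simplification: plain string order, not Uzbek collation.
        return sorted(hits, key=lambda h: h.token.lower())
    if mode == "frequency":
        freq = Counter(h.lemma for h in hits)
        # Most frequent lemma first; ties broken by token.
        return sorted(hits, key=lambda h: (-freq[h.lemma], h.token.lower()))
    if mode == "metadata":
        return sorted(hits, key=lambda h: h.year)
    raise ValueError(f"unknown sort mode: {mode}")


def to_json(hits):
    """Serialize hits as a JSON array, keeping non-ASCII characters readable."""
    return json.dumps([asdict(h) for h in hits], ensure_ascii=False)


def to_csv(hits):
    """Serialize hits as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=["token", "lemma", "year"])
    writer.writeheader()
    writer.writerows(asdict(h) for h in hits)
    return buf.getvalue()


# Illustrative sample data, not taken from any corpus.
hits = [
    Hit("kitoblar", "kitob", 2015),
    Hit("uyda", "uy", 2001),
    Hit("kitobni", "kitob", 2020),
]

by_year = sort_hits(hits, "metadata")
print(to_csv(by_year))
print(to_json(sort_hits(hits, "frequency")))
```

Storing the lemma alongside each token is what lets the frequency mode group inflected forms such as "kitoblar" and "kitobni" under one entry, which matters for an agglutinative language where a single lemma surfaces in many word forms.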

Published: 2025-11-06

How to Cite: Uyg’un O’g’li, Y. A. (2025). Sorting and Storing Search Results in The Uzbek Language Corpus. Lexical Search Parameters. Central Asian Journal of Literature, Philosophy and Culture, 7(1), 32-38. https://doi.org/10.51699/cajlpc.v7i1.1392