TOWARD A CORPUS OF JADID HERITAGE: ELBEK–FITRAT LEXICOGRAPHY, CRITERIA OF HOMONYMY, AND A MODEL FOR SEMANTIC TAGGING OF DRAMATIC LEXIS
Keywords:
Jadids, lexicography, Elbek, Abdurauf Fitrat, EOL, ROL, homonym, 1929 Latin script, vowel harmony, etymology, corpus linguistics, semantic tagging, lemmatization, Jadid drama, archaism/historicism, explanatory-collocational dictionary.
Abstract
The article reinterprets the heritage of the Jadids through lexicographic and corpus-linguistic approaches. Based on the works of Elbek, Abdurauf Fitrat, and other enlighteners (EOL, Fitrat’s etymological explanations, etc.), it describes the phonetic-graphic, lexical, and grammatical features of the language of the period. Drawing on Sh. Bobojonova’s analysis, it argues that the criteria for identifying homonyms (identical pronunciation and spelling, different meaning) need to be reconsidered, given the discrepancy between EOL (416 units) and ROL (497 units) and the influence of the 1929 Latinization and of orthographic norms based on vowel harmony. Fitrat’s etymological interpretations of Turkic lexemes (e.g., yitika, mung‘ilamak, qub) demonstrate a cross-source, integrative approach in Jadid lexicography.
Using the dramas of Behbudiy, Avloniy, Qodiriy, and Cho‘lpon, the study proposes a thematic classification of lexical units (socio-political, religious-ethical, everyday, educational, journalistic, measurement-related, etc.), the use of archaic and historical labels, and a model for semantic tagging of “Jadid drama terminology.” For the authorial corpus, it develops tools such as lemmatization, token–lemma–collocation–concordance search, operator and constant tags, synonym–antonym layers, and a tagging model. The findings outline directions for creating a set of explanatory-collocational and thematic-etymological dictionaries based on Jadid drama texts.
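The token–lemma lookup, thematic tags, archaic/historical labels, and concordance search named above can be illustrated with a minimal Python sketch. It is not the study's actual tagging model: the `Entry` record, the tag strings, the toy `LEXICON`, the tags assigned to each word, and the sample sentence are all hypothetical and chosen only for illustration. Lemmatization is reduced here to a hand-made lookup table.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Thematic classes follow the list in the abstract; the tag strings are assumptions.
THEMES = {"socio-political", "religious-ethical", "everyday",
          "educational", "journalistic", "measurement"}
STATUS_LABELS = {"archaic", "historical"}


@dataclass(frozen=True)
class Entry:
    """One lexicon record: lemma, thematic tag, optional archaic/historical label."""
    lemma: str
    theme: str
    status: Optional[str] = None


# Hypothetical toy lexicon mapping word forms to entries (illustrative tags only).
LEXICON = {
    "maktab": Entry("maktab", "educational"),
    "maktabga": Entry("maktab", "educational"),
    "qozi": Entry("qozi", "socio-political", "historical"),
    "gazeta": Entry("gazeta", "journalistic"),
}

Tagged = Tuple[str, str, Optional[str], Optional[str]]


def tag(tokens: List[str]) -> List[Tagged]:
    """Return (token, lemma, theme, status); unknown tokens keep None tags."""
    result = []
    for tok in tokens:
        entry = LEXICON.get(tok.lower())
        if entry is None:
            result.append((tok, tok.lower(), None, None))
        else:
            result.append((tok, entry.lemma, entry.theme, entry.status))
    return result


def concordance(tokens: List[str], lemma: str, width: int = 2):
    """Keyword-in-context search by lemma: (left context, hit, right context)."""
    hits = []
    for i, (_, lem, _, _) in enumerate(tag(tokens)):
        if lem == lemma:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tokens[i], right))
    return hits


# Made-up sample sentence, not taken from the dramas.
sample = ["Bola", "maktabga", "bordi"]
tagged_sample = tag(sample)
kwic_sample = concordance(sample, "maktab", width=1)
```

Searching by lemma rather than by surface form is what lets an inflected form such as `maktabga` surface under the headword `maktab`; a real corpus would replace the lookup table with a morphological analyzer.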
References
1. Nuritdinov A. S. Jadid davri adabiy muhitiga doir asarlardan korpusda foydalanish // Kompyuter lingvistikasi: muammolar, yechim, istiqbollar. — 2024. — Vol. 1, № 1.
2. Xamroyeva Sh. M. Oʻzbek tili mualliflik korpusini tuzishning lingvistik asoslari: filol. fan. boʻyicha falsafa doktori (PhD) dis. — Buxoro, 2018. — 250 b.
3. Axmedova D. B. Atov birliklarini oʻzbek tili korpuslari uchun leksik-semantik teglashning lingvistik asos va modellari: filol. fan. boʻyicha falsafa doktori (PhD) dis. avtoref. — Buxoro, 2020. — 54 b.
4. Islomov I. Oʻzbek tilining geografik terminologiyasi: tizimi, genezisi, semantik strukturasi va leksikografik talqini: filol. fan. dok. dis. avtoref. — Qarshi, 2021.