Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Advancing Karakalpak Linguistics with Dictionary-Based Morphological Analysis: Implications for Text Correction Systems. / Mengliev, Davlatyor B.; Barakhnin, Vladimir B.; Boltayev, Nodirbek R. et al.
International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM. IEEE Computer Society, 2024. p. 2380-2383 (International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM).
TY - GEN
T1 - Advancing Karakalpak Linguistics with Dictionary-Based Morphological Analysis: Implications for Text Correction Systems
AU - Mengliev, Davlatyor B.
AU - Barakhnin, Vladimir B.
AU - Boltayev, Nodirbek R.
AU - Polatova, Sevinch A.
AU - Eshkulov, Mukhriddin O.
AU - Ibragimov, Bahodir B.
N1 - Conference code: 25
PY - 2024
Y1 - 2024
AB - The article presents an original dictionary-based method for morphological analysis of the Karakalpak language, focusing on its application in text correction systems. The algorithm analyzes words, identifying their roots and affixes using an extensive dictionary of roots (more than ten thousand word forms) and affixes, as well as a dictionary of exceptions for words that do not follow general grammatical rules. The proposed approach allows for high accuracy in determining the morphological structure of words and offers the user correction options for potentially misspelled words. The work contributes to the development of linguistic tools for the Karakalpak language and highlights the importance of developing technologies that support linguistic diversity and digital inclusion. In addition, the authors reviewed a number of existing studies closely related to the topic in order to develop the most relevant and effective solution for automatic correction of texts in the Karakalpak language.
KW - NLP
KW - agglutinative languages
KW - analysis algorithms
KW - dictionary method
KW - digital inclusion
KW - Karakalpak language
KW - linguistic diversity
KW - morphological analysis
KW - quality of information processes
KW - text correction
UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85201965579&origin=inward&txGid=ad16dae9881da1f208f84c52e3961d08
UR - https://www.mendeley.com/catalogue/8ca3170d-9ce2-3ffc-b40e-0baf6481b626/
U2 - 10.1109/EDM61683.2024.10615182
DO - 10.1109/EDM61683.2024.10615182
M3 - Conference contribution
SN - 9798350389234
T3 - International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM
SP - 2380
EP - 2383
BT - International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM
PB - IEEE Computer Society
T2 - 25th IEEE International Conference of Young Professionals in Electron Devices and Materials
Y2 - 28 June 2024 through 2 July 2024
ER -
ID: 60548870
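
The abstract describes the approach only at a high level: look a word up in an exceptions dictionary, otherwise split it into a known root plus known affixes, and offer correction options when no split is found. The sketch below is one possible reading of that description, not the authors' implementation. The function names `analyze` and `suggest`, the toy roots and suffixes, the right-to-left suffix stripping, and the use of `difflib` for fuzzy matching are all assumptions made for illustration; none of the data comes from the paper's dictionaries.

```python
from difflib import get_close_matches

# Toy data for illustration only; NOT taken from the paper's dictionaries.
ROOTS = {"kitap", "bala", "mektep"}
AFFIXES = ["lar", "ler", "da", "de", "dan", "den"]
EXCEPTIONS = {}  # surface form -> (root, [affixes]) for irregular words


def analyze(word, roots=ROOTS, affixes=AFFIXES, exceptions=EXCEPTIONS):
    """Return (root, [affixes]) or None if no segmentation is found."""
    if word in exceptions:
        return exceptions[word]
    if word in roots:
        return word, []
    # Strip one known suffix from the right, then recurse on the remainder.
    for affix in sorted(affixes, key=len, reverse=True):
        if word.endswith(affix) and len(word) > len(affix):
            result = analyze(word[: -len(affix)], roots, affixes, exceptions)
            if result is not None:
                root, found = result
                return root, found + [affix]
    return None


def suggest(word, roots=ROOTS, affixes=AFFIXES, exceptions=EXCEPTIONS, n=3):
    """Offer correction candidates for a word that cannot be analyzed."""
    if analyze(word, roots, affixes, exceptions) is not None:
        return []  # the word already has a valid analysis
    candidates = []
    stem, tail = word, []
    while True:
        # Fuzzy-match the current stem against the root dictionary.
        for root in get_close_matches(stem, roots, n=n, cutoff=0.7):
            candidate = root + "".join(tail)
            if candidate not in candidates:
                candidates.append(candidate)
        # Peel one more known suffix off the stem, if there is one.
        for affix in sorted(affixes, key=len, reverse=True):
            if stem.endswith(affix) and len(stem) > len(affix):
                stem, tail = stem[: -len(affix)], [affix] + tail
                break
        else:
            break  # no suffix left to strip
    return candidates[:n]
```

With the toy data, `analyze("balalarda")` returns `("bala", ["lar", "da"])`, and `suggest("kitablar")` returns `["kitaplar"]`. A real system would also need rules the abstract does not detail, such as affix ordering and vowel harmony, which this sketch ignores.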