Combined approach to problem of part-of-speech homonymy resolution in Russian texts

Standard

Combined approach to problem of part-of-speech homonymy resolution in Russian texts. / Batura, Tatiana; Bruches, Elena.

2018 International Russian Automation Conference, RusAutoCon 2018. Institute of Electrical and Electronics Engineers Inc., 2018. 8501718 (2018 International Russian Automation Conference, RusAutoCon 2018).

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование

Harvard

Batura, T & Bruches, E 2018, Combined approach to problem of part-of-speech homonymy resolution in Russian texts. в 2018 International Russian Automation Conference, RusAutoCon 2018., 8501718, 2018 International Russian Automation Conference, RusAutoCon 2018, Institute of Electrical and Electronics Engineers Inc., 2018 International Russian Automation Conference, Sochi, Российская Федерация, 09.09.2018. https://doi.org/10.1109/RUSAUTOCON.2018.8501718

APA

Batura, T., & Bruches, E. (2018). Combined approach to problem of part-of-speech homonymy resolution in Russian texts. в 2018 International Russian Automation Conference, RusAutoCon 2018 [8501718] (2018 International Russian Automation Conference, RusAutoCon 2018). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/RUSAUTOCON.2018.8501718

Vancouver

Batura T, Bruches E. Combined approach to problem of part-of-speech homonymy resolution in Russian texts. в 2018 International Russian Automation Conference, RusAutoCon 2018. Institute of Electrical and Electronics Engineers Inc. 2018. 8501718. (2018 International Russian Automation Conference, RusAutoCon 2018). doi: 10.1109/RUSAUTOCON.2018.8501718

Author

Batura, Tatiana ; Bruches, Elena. / Combined approach to problem of part-of-speech homonymy resolution in Russian texts. 2018 International Russian Automation Conference, RusAutoCon 2018. Institute of Electrical and Electronics Engineers Inc., 2018. (2018 International Russian Automation Conference, RusAutoCon 2018).

BibTeX

@inproceedings{aefe34f594474b46a94d027e677b6c29,

title = "Combined approach to problem of part-of-speech homonymy resolution in Russian texts",

abstract = "The Russian language has an inflective structure and does not have a strict word order. This causes processing difficulties, such as part-of-speech homonymy. This article is devoted to the mentioned issue. The existing approaches to resolving the morphological homonymy problem can be divided into the following groups: rule-based approaches, statistical approaches, machine learning approaches, and combined methods. In the paper, we showed that each approach has its advantages and disadvantages; however, combining several approaches can significantly increase the precision of the algorithm. Moreover, the article provides the analysis of the influence of certain features on the morphological homonymy resolution. The precision of the proposed algorithm is sufficient for its use in the tasks of intellectual text processing texts, for example, in machine translation and summarization systems. The proposed method is successfully used in the geographic location system. The main problem is the distinction between function words (conjunctions, particles, prepositions, interjections). Solving this problem is one of the priorities for the further work. We also plan to implement a system without a dictionary, in order to determine better morphological features for unknown words.",

keywords = "Combined approach, Homonymy resolution, Machine learning, Part-of-speech homonymy, Text processing",

author = "Tatiana Batura and Elena Bruches",

year = "2018",

month = oct,

day = "19",

doi = "10.1109/RUSAUTOCON.2018.8501718",

language = "English",

isbn = "9781538649381",

series = "2018 International Russian Automation Conference, RusAutoCon 2018",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2018 International Russian Automation Conference, RusAutoCon 2018",

address = "United States",

note = "2018 International Russian Automation Conference, RusAutoCon 2018 ; Conference date: 09-09-2018 Through 16-09-2018",

}

RIS

TY - GEN

T1 - Combined approach to problem of part-of-speech homonymy resolution in Russian texts

AU - Batura, Tatiana

AU - Bruches, Elena

PY - 2018/10/19

Y1 - 2018/10/19

N2 - The Russian language has an inflective structure and does not have a strict word order. This causes processing difficulties, such as part-of-speech homonymy. This article is devoted to the mentioned issue. The existing approaches to resolving the morphological homonymy problem can be divided into the following groups: rule-based approaches, statistical approaches, machine learning approaches, and combined methods. In the paper, we showed that each approach has its advantages and disadvantages; however, combining several approaches can significantly increase the precision of the algorithm. Moreover, the article provides the analysis of the influence of certain features on the morphological homonymy resolution. The precision of the proposed algorithm is sufficient for its use in the tasks of intellectual text processing texts, for example, in machine translation and summarization systems. The proposed method is successfully used in the geographic location system. The main problem is the distinction between function words (conjunctions, particles, prepositions, interjections). Solving this problem is one of the priorities for the further work. We also plan to implement a system without a dictionary, in order to determine better morphological features for unknown words.

AB - The Russian language has an inflective structure and does not have a strict word order. This causes processing difficulties, such as part-of-speech homonymy. This article is devoted to the mentioned issue. The existing approaches to resolving the morphological homonymy problem can be divided into the following groups: rule-based approaches, statistical approaches, machine learning approaches, and combined methods. In the paper, we showed that each approach has its advantages and disadvantages; however, combining several approaches can significantly increase the precision of the algorithm. Moreover, the article provides the analysis of the influence of certain features on the morphological homonymy resolution. The precision of the proposed algorithm is sufficient for its use in the tasks of intellectual text processing texts, for example, in machine translation and summarization systems. The proposed method is successfully used in the geographic location system. The main problem is the distinction between function words (conjunctions, particles, prepositions, interjections). Solving this problem is one of the priorities for the further work. We also plan to implement a system without a dictionary, in order to determine better morphological features for unknown words.

KW - Combined approach

KW - Homonymy resolution

KW - Machine learning

KW - Part-of-speech homonymy

KW - Text processing

UR - http://www.scopus.com/inward/record.url?scp=85057062280&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/155b44c2-6de6-32a1-a707-ae7a87eec81d/

U2 - 10.1109/RUSAUTOCON.2018.8501718

DO - 10.1109/RUSAUTOCON.2018.8501718

M3 - Conference contribution

AN - SCOPUS:85057062280

SN - 9781538649381

T3 - 2018 International Russian Automation Conference, RusAutoCon 2018

BT - 2018 International Russian Automation Conference, RusAutoCon 2018

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2018 International Russian Automation Conference

Y2 - 9 September 2018 through 16 September 2018

ER -

ID: 17554148