Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems

Standard

Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. / Yakovenko, Olga; Bondarenko, Ivan; Borovikova, Mariya et al.

Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. ed. / A Karpov; O Jokisch; R Potapova. Springer, 2018. p. 768-777 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11096 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review

Harvard

Yakovenko, O, Bondarenko, I, Borovikova, M & Vodolazsky, D 2018, Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. in A Karpov, O Jokisch & R Potapova (eds), Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11096 LNAI, Springer, pp. 768-777, 20th International Conference on Speech and Computer, Leipzig, Germany, 18.09.2018. https://doi.org/10.1007/978-3-319-99579-3_78

APA

Yakovenko, O., Bondarenko, I., Borovikova, M., & Vodolazsky, D. (2018). Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. In A. Karpov, O. Jokisch, & R. Potapova (Eds.), Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings (pp. 768-777). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11096 LNAI). Springer. https://doi.org/10.1007/978-3-319-99579-3_78

Vancouver

Yakovenko O, Bondarenko I, Borovikova M, Vodolazsky D. Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. In Karpov A, Jokisch O, Potapova R, editors, Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. Springer. 2018. p. 768-777. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-319-99579-3_78

Author

Yakovenko, Olga ; Bondarenko, Ivan ; Borovikova, Mariya et al. / Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems. Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. editor / A Karpov ; O Jokisch ; R Potapova. Springer, 2018. pp. 768-777 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

BibTeX

@inproceedings{7cc12a81e7f04b81ab41f125c16e6e78,

title = "Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems",

abstract = "This paper presents an overview of rule-based system for automatic accentuation and phonemic transcription of Russian texts for speech connected tasks, such as Automatic Speech Recognition (ASR). Two parts of the developed system, accentuation and transcription, use different approaches to achieve correct phonemic representations of input phrases. Accentuation is based on “Grammatical dictionary of the Russian language” of A.A. Zaliznyak and wiktionary corpus. To distinguish homographs, the accentuation system also utilises morphological information of the sentences based on Recurrent Neural Networks (RNN). Transcription algorithms apply the rules presented in the monograph of B.M. Lobanov and L.I. Tsirulnik “Computer Synthesis and Voice Cloning”. The rules described in the present paper are implemented in an open-source module, which can be of use to any scientific study connected to ASR or Speech To Text (STT) tasks. Automatically marked up text annotations of the Russian Voxforge database were used as training data for an acoustic model in CMU Sphinx. The resulting acoustic model was evaluated on cross-validation, mean Word Accuracy being 71.2%. The developed toolkit is written in the Python language and is accessible on GitHub for any researcher interested.",

keywords = "Accentuation, Automatic speech recognition, Corpora, Rule-based phonemic transcription",

author = "Olga Yakovenko and Ivan Bondarenko and Mariya Borovikova and Daniil Vodolazsky",

year = "2018",

month = jan,

day = "1",

doi = "10.1007/978-3-319-99579-3_78",

language = "English",

isbn = "9783319995786",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "768--777",

editor = "A Karpov and O Jokisch and R Potapova",

booktitle = "Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings",

address = "United States",

note = "20th International Conference on Speech and Computer, SPECOM 2018 ; Conference date: 18-09-2018 Through 22-09-2018",

}

RIS

TY - GEN

T1 - Algorithms for Automatic Accentuation and Transcription of Russian Texts in Speech Recognition Systems

AU - Yakovenko, Olga

AU - Bondarenko, Ivan

AU - Borovikova, Mariya

AU - Vodolazsky, Daniil

PY - 2018/1/1

Y1 - 2018/1/1

N2 - This paper presents an overview of rule-based system for automatic accentuation and phonemic transcription of Russian texts for speech connected tasks, such as Automatic Speech Recognition (ASR). Two parts of the developed system, accentuation and transcription, use different approaches to achieve correct phonemic representations of input phrases. Accentuation is based on “Grammatical dictionary of the Russian language” of A.A. Zaliznyak and wiktionary corpus. To distinguish homographs, the accentuation system also utilises morphological information of the sentences based on Recurrent Neural Networks (RNN). Transcription algorithms apply the rules presented in the monograph of B.M. Lobanov and L.I. Tsirulnik “Computer Synthesis and Voice Cloning”. The rules described in the present paper are implemented in an open-source module, which can be of use to any scientific study connected to ASR or Speech To Text (STT) tasks. Automatically marked up text annotations of the Russian Voxforge database were used as training data for an acoustic model in CMU Sphinx. The resulting acoustic model was evaluated on cross-validation, mean Word Accuracy being 71.2%. The developed toolkit is written in the Python language and is accessible on GitHub for any researcher interested.

AB - This paper presents an overview of rule-based system for automatic accentuation and phonemic transcription of Russian texts for speech connected tasks, such as Automatic Speech Recognition (ASR). Two parts of the developed system, accentuation and transcription, use different approaches to achieve correct phonemic representations of input phrases. Accentuation is based on “Grammatical dictionary of the Russian language” of A.A. Zaliznyak and wiktionary corpus. To distinguish homographs, the accentuation system also utilises morphological information of the sentences based on Recurrent Neural Networks (RNN). Transcription algorithms apply the rules presented in the monograph of B.M. Lobanov and L.I. Tsirulnik “Computer Synthesis and Voice Cloning”. The rules described in the present paper are implemented in an open-source module, which can be of use to any scientific study connected to ASR or Speech To Text (STT) tasks. Automatically marked up text annotations of the Russian Voxforge database were used as training data for an acoustic model in CMU Sphinx. The resulting acoustic model was evaluated on cross-validation, mean Word Accuracy being 71.2%. The developed toolkit is written in the Python language and is accessible on GitHub for any researcher interested.

KW - Accentuation

KW - Automatic speech recognition

KW - Corpora

KW - Rule-based phonemic transcription

UR - http://www.scopus.com/inward/record.url?scp=85053780912&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/57b5407d-85a1-3fc5-a87a-424e1c999609/

U2 - 10.1007/978-3-319-99579-3_78

DO - 10.1007/978-3-319-99579-3_78

M3 - Conference contribution

AN - SCOPUS:85053780912

SN - 9783319995786

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 768

EP - 777

BT - Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings

A2 - Karpov, A

A2 - Jokisch, O

A2 - Potapova, R

PB - Springer

T2 - 20th International Conference on Speech and Computer

Y2 - 18 September 2018 through 22 September 2018

ER -

ID: 16703931