Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases

Standard

Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases. / Glazkova, Anna; Morozov, Dmitry; Garipov, Timur.

Lecture Notes in Computer Science. Springer, 2025. p. 107-119 (Lecture Notes in Computer Science; Vol. 15419 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review

Harvard

Glazkova, A, Morozov, D & Garipov, T 2025, Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases. in Lecture Notes in Computer Science. Lecture Notes in Computer Science, vol. 15419 LNCS, Springer, pp. 107-119, 12th International Conference Analysis of Images, Social Networks and Texts 2024, Bishkek, Kyrgyzstan, 17.10.2024. https://doi.org/10.1007/978-3-031-88036-0_5

APA

Glazkova, A., Morozov, D., & Garipov, T. (2025). Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases. In Lecture Notes in Computer Science (pp. 107-119). (Lecture Notes in Computer Science; Vol. 15419 LNCS). Springer. https://doi.org/10.1007/978-3-031-88036-0_5

Vancouver

Glazkova A, Morozov D, Garipov T. Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases. In Lecture Notes in Computer Science. Springer. 2025. p. 107-119. (Lecture Notes in Computer Science). doi: 10.1007/978-3-031-88036-0_5

Author

Glazkova, Anna ; Morozov, Dmitry ; Garipov, Timur. / Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases. Lecture Notes in Computer Science. Springer, 2025. pp. 107-119 (Lecture Notes in Computer Science).

BibTeX

@inproceedings{e86574ab3a0d4fdab17b72ec371e9ba3,

title = "Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases",

abstract = "Keyphrase selection is a challenging task in natural language processing that has a wide range of applications. Adapting existing supervised and unsupervised solutions for the Russian language faces several limitations due to the rich morphology of Russian and the limited number of training datasets available. Recent studies conducted on English texts show that large language models (LLMs) successfully address the task of generating keyphrases. LLMs allow achieving impressive results without task-specific fine-tuning, using text prompts instead. In this work, we access the performance of prompt-based methods for generating keyphrases for Russian scientific abstracts. First, we compare the performance of zero-shot and few-shot prompt-based methods, fine-tuned models, and unsupervised methods. Then we assess strategies for selecting keyphrase examples in a few-shot setting. We present the outcomes of human evaluation of the generated keyphrases and analyze the strengths and weaknesses of the models through expert assessment. Our results suggest that prompt-based methods can outperform common baselines even using simple text prompts.",

keywords = "Keyphrase selection, Large language model, Prompt-based learning, Scientific texts",

author = "Anna Glazkova and Dmitry Morozov and Timur Garipov",

note = "Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases / A. Glazkova, D. Morozov, T. Garipov // Analysis of Images, Social Networks and Texts. AIST 2024. Lecture Notes in Computer Science. / A. Panchenko [et al] // Springer, Cham.: 2025. - vol 15419. DOI: https://doi.org/10.1007/978-3-031-88036-0_5; 12th International Conference Analysis of Images, Social Networks and Texts 2024, AIST - 2024 ; Conference date: 17-10-2024 Through 19-10-2024",

year = "2025",

month = apr,

day = "15",

doi = "10.1007/978-3-031-88036-0_5",

language = "English",

isbn = "9783031880353",

series = "Lecture Notes in Computer Science",

publisher = "Springer",

pages = "107--119",

booktitle = "Lecture Notes in Computer Science",

address = "United States",

}

RIS

TY - GEN

T1 - Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases

AU - Glazkova, Anna

AU - Morozov, Dmitry

AU - Garipov, Timur

N1 - Conference code: 12

PY - 2025/4/15

Y1 - 2025/4/15

N2 - Keyphrase selection is a challenging task in natural language processing that has a wide range of applications. Adapting existing supervised and unsupervised solutions for the Russian language faces several limitations due to the rich morphology of Russian and the limited number of training datasets available. Recent studies conducted on English texts show that large language models (LLMs) successfully address the task of generating keyphrases. LLMs allow achieving impressive results without task-specific fine-tuning, using text prompts instead. In this work, we access the performance of prompt-based methods for generating keyphrases for Russian scientific abstracts. First, we compare the performance of zero-shot and few-shot prompt-based methods, fine-tuned models, and unsupervised methods. Then we assess strategies for selecting keyphrase examples in a few-shot setting. We present the outcomes of human evaluation of the generated keyphrases and analyze the strengths and weaknesses of the models through expert assessment. Our results suggest that prompt-based methods can outperform common baselines even using simple text prompts.

AB - Keyphrase selection is a challenging task in natural language processing that has a wide range of applications. Adapting existing supervised and unsupervised solutions for the Russian language faces several limitations due to the rich morphology of Russian and the limited number of training datasets available. Recent studies conducted on English texts show that large language models (LLMs) successfully address the task of generating keyphrases. LLMs allow achieving impressive results without task-specific fine-tuning, using text prompts instead. In this work, we access the performance of prompt-based methods for generating keyphrases for Russian scientific abstracts. First, we compare the performance of zero-shot and few-shot prompt-based methods, fine-tuned models, and unsupervised methods. Then we assess strategies for selecting keyphrase examples in a few-shot setting. We present the outcomes of human evaluation of the generated keyphrases and analyze the strengths and weaknesses of the models through expert assessment. Our results suggest that prompt-based methods can outperform common baselines even using simple text prompts.

KW - Keyphrase selection

KW - Large language model

KW - Prompt-based learning

KW - Scientific texts

UR - https://www.mendeley.com/catalogue/150db0f7-cc37-30f1-9ae7-6b0af5cf02db/

UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-105003859167&origin=inward&txGid=e6a897f76c6f2da113c1d1a0081684cf

U2 - 10.1007/978-3-031-88036-0_5

DO - 10.1007/978-3-031-88036-0_5

M3 - Conference contribution

SN - 9783031880353

T3 - Lecture Notes in Computer Science

SP - 107

EP - 119

BT - Lecture Notes in Computer Science

PB - Springer

T2 - 12th International Conference Analysis of Images, Social Networks and Texts 2024

Y2 - 17 October 2024 through 19 October 2024

ER -

ID: 67075920