RuNNE-2022 Shared Task: Recognizing Nested Named Entities

Standard

RuNNE-2022 Shared Task: Recognizing Nested Named Entities. / Артемова, Е. Л.; Змеев, М. В.; Лукашевич, Н. В. et al.

2022. Paper presented at International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022", Москва, Russian Federation.

Research output: Contribution to conference › Paper › peer-review

Harvard

Артемова, ЕЛ, Змеев, МВ, Лукашевич, НВ, Рожков, ИC, Батура, ТВ, Иванов, ВВ & Тутубалина, ЕВ 2022, 'RuNNE-2022 Shared Task: Recognizing Nested Named Entities', Paper presented at International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022", Москва, Russian Federation, 15.07.2022 - 18.07.2022. https://doi.org/10.28995/2075-7182-2022-21-33-41

APA

Артемова, Е. Л., Змеев, М. В., Лукашевич, Н. В., Рожков, И. C., Батура, Т. В., Иванов, В. В., & Тутубалина, Е. В. (2022). RuNNE-2022 Shared Task: Recognizing Nested Named Entities. Paper presented at International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022", Москва, Russian Federation. https://doi.org/10.28995/2075-7182-2022-21-33-41

Vancouver

Артемова ЕЛ, Змеев МВ, Лукашевич НВ, Рожков ИC, Батура ТВ, Иванов ВВ et al. RuNNE-2022 Shared Task: Recognizing Nested Named Entities. 2022. Paper presented at International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022", Москва, Russian Federation. doi: 10.28995/2075-7182-2022-21-33-41

Author

Артемова, Е. Л. ; Змеев, М. В. ; Лукашевич, Н. В. et al. / RuNNE-2022 Shared Task: Recognizing Nested Named Entities. Paper presented at International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022", Москва, Russian Federation. 9 p.

BibTeX

@conference{df1825c3ce5047c094b8dce8f40a6861,

title = "RuNNE-2022 Shared Task: Recognizing Nested Named Entities",

abstract = "The RuNNE Shared Task approaches the problem of nested named entity recognition. The annotation schema is designed in such a way that an entity may partially overlap with or even be nested in another entity. This way, the named entity “The Yermolova Theatre” of type ORGANIZATION houses another entity “Yermolova” of type PERSON. We adopt the Russian NEREL dataset (Loukachevitch et al., 2021) for the RuNNE Shared Task. NEREL comprises news texts written in the Russian language and collected from the Wikinews portal. The annotation schema includes 29 entity types. The nestedness of named entities in NEREL reaches up to six levels. The RuNNE Shared Task explores two setups. (i) In the general setup all entities occur more or less with the same frequency. (ii) In the few-shot setup the majority of entity types occur often in the training set. However, some of the entity types have lower frequency, being thus challenging to recognize. In the test set the frequency of all entity types is even. This paper reports on the results of the RuNNE Shared Task. Overall the shared task has received 156 submissions from nine teams. Half of the submissions outperform a straightforward BERT-based baseline in both setups. This paper overviews the shared task setup and discusses the submitted systems, discovering meaningful insights for the problem of nested NER. The links to the evaluation platform and the data from the shared task are available in our GitHub repository.",

author = "Артемова, {Е. Л.} and Змеев, {М. В.} and Лукашевич, {Н. В.} and Рожков, {И. C.} and Батура, {Т. В.} and Иванов, {В. В.} and Тутубалина, {Е. В.}",

note = "Acknowledgments: The project is supported by the Russian Science Foundation, grant # 20-11-20166. The experiments were partially carried out on computational resources of HPC facilities at HSE University (Kostenetskiy et al., 2021) and the shared research facilities of HPC computing resources at Lomonosov Moscow State University. Ekaterina Artemova was supported by the framework of the HSE University Basic Research Program.; International conference on Computational Linguistics and Intellectual Technologies {"}Dialogue 2022{"}, Dialogue 2022 ; Conference date: 15-07-2022 Through 18-07-2022",

year = "2022",

month = jun,

day = "18",

doi = "10.28995/2075-7182-2022-21-33-41",

language = "English",

url = "https://www.dialog-21.ru/",

}

RIS

TY - CONF

T1 - RuNNE-2022 Shared Task: Recognizing Nested Named Entities

AU - Артемова, Е. Л.

AU - Змеев, М. В.

AU - Лукашевич, Н. В.

AU - Рожков, И. C.

AU - Батура, Т. В.

AU - Иванов, В. В.

AU - Тутубалина, Е. В.

N1 - Acknowledgments: The project is supported by the Russian Science Foundation, grant # 20-11-20166. The experiments were partially carried out on computational resources of HPC facilities at HSE University (Kostenetskiy et al., 2021) and the shared research facilities of HPC computing resources at Lomonosov Moscow State University. Ekaterina Artemova was supported by the framework of the HSE University Basic Research Program.

PY - 2022/6/18

Y1 - 2022/6/18

N2 - The RuNNE Shared Task approaches the problem of nested named entity recognition. The annotation schema is designed in such a way that an entity may partially overlap with or even be nested in another entity. This way, the named entity “The Yermolova Theatre” of type ORGANIZATION houses another entity “Yermolova” of type PERSON. We adopt the Russian NEREL dataset (Loukachevitch et al., 2021) for the RuNNE Shared Task. NEREL comprises news texts written in the Russian language and collected from the Wikinews portal. The annotation schema includes 29 entity types. The nestedness of named entities in NEREL reaches up to six levels. The RuNNE Shared Task explores two setups. (i) In the general setup all entities occur more or less with the same frequency. (ii) In the few-shot setup the majority of entity types occur often in the training set. However, some of the entity types have lower frequency, being thus challenging to recognize. In the test set the frequency of all entity types is even. This paper reports on the results of the RuNNE Shared Task. Overall the shared task has received 156 submissions from nine teams. Half of the submissions outperform a straightforward BERT-based baseline in both setups. This paper overviews the shared task setup and discusses the submitted systems, discovering meaningful insights for the problem of nested NER. The links to the evaluation platform and the data from the shared task are available in our GitHub repository.

AB - The RuNNE Shared Task approaches the problem of nested named entity recognition. The annotation schema is designed in such a way that an entity may partially overlap with or even be nested in another entity. This way, the named entity “The Yermolova Theatre” of type ORGANIZATION houses another entity “Yermolova” of type PERSON. We adopt the Russian NEREL dataset (Loukachevitch et al., 2021) for the RuNNE Shared Task. NEREL comprises news texts written in the Russian language and collected from the Wikinews portal. The annotation schema includes 29 entity types. The nestedness of named entities in NEREL reaches up to six levels. The RuNNE Shared Task explores two setups. (i) In the general setup all entities occur more or less with the same frequency. (ii) In the few-shot setup the majority of entity types occur often in the training set. However, some of the entity types have lower frequency, being thus challenging to recognize. In the test set the frequency of all entity types is even. This paper reports on the results of the RuNNE Shared Task. Overall the shared task has received 156 submissions from nine teams. Half of the submissions outperform a straightforward BERT-based baseline in both setups. This paper overviews the shared task setup and discusses the submitted systems, discovering meaningful insights for the problem of nested NER. The links to the evaluation platform and the data from the shared task are available in our GitHub repository.

UR - https://www.scopus.com/inward/record.url?eid=2-s2.0-85140870759&partnerID=40&md5=007f3fb45603091dcc6dc2772aefacf5

UR - https://www.mendeley.com/catalogue/a44f7b5d-6ba5-360d-9062-547a2d895685/

U2 - 10.28995/2075-7182-2022-21-33-41

DO - 10.28995/2075-7182-2022-21-33-41

M3 - Paper

T2 - International conference on Computational Linguistics and Intellectual Technologies "Dialogue 2022"

Y2 - 15 July 2022 through 18 July 2022

ER -

ID: 45017157