Standard

Matching of authors and publications in multilingual bibliographic knowledge bases. / Apanovich, Zinaida.

In: CEUR Workshop Proceedings, Vol. 2543, 01.01.2020, p. 26-37.

Research output: Contribution to journalConference articlepeer-review

Harvard

APA

Vancouver

Author

BibTeX

@article{84d8ba8725e34ef59cea7beddf7d20bd,
title = "Matching of authors and publications in multilingual bibliographic knowledge bases",
abstract = "The cross-lingual matching of authors and publications is a special case of the task of assigning a unique identifier to the same real-world entity in multilingual data sources. This paper presents the results of experiments with the several versions of a cross-lingual system designed to match, basing on a Russian-language data source, the authors and English-language publications. Since different heuristics have been tested in these versions of the system, we consider here only those that have given the best results. An important element of the system is its interactive visualization tool, which gives information on the distribution of publications by authors, as well as providing the ability to edit the results of the analysis. The visualization system is supplemented with methods for similarity matrices ordering. Experiments have shown that the main source of improving the quality of the matching and clustering algorithm is extending the set of confirmed publications. The approaches used in this system are applicable to solving the problem of linking named entities in various multilingual data sources.",
keywords = "Clustering, Cross-Lingual Matching of Authors and Publications, Entity Resolution, Interactive Visualization, Multilingual Knowledge Bases",
author = "Zinaida Apanovich",
year = "2020",
month = jan,
day = "1",
language = "English",
volume = "2543",
pages = "26--37",
journal = "CEUR Workshop Proceedings",
issn = "1613-0073",
publisher = "CEUR-WS",
note = "21st Conference on Scientific Services and Internet, SSI 2019 ; Conference date: 23-09-2019 Through 28-09-2019",

}

RIS

TY - JOUR

T1 - Matching of authors and publications in multilingual bibliographic knowledge bases

AU - Apanovich, Zinaida

PY - 2020/1/1

Y1 - 2020/1/1

N2 - The cross-lingual matching of authors and publications is a special case of the task of assigning a unique identifier to the same real-world entity in multilingual data sources. This paper presents the results of experiments with the several versions of a cross-lingual system designed to match, basing on a Russian-language data source, the authors and English-language publications. Since different heuristics have been tested in these versions of the system, we consider here only those that have given the best results. An important element of the system is its interactive visualization tool, which gives information on the distribution of publications by authors, as well as providing the ability to edit the results of the analysis. The visualization system is supplemented with methods for similarity matrices ordering. Experiments have shown that the main source of improving the quality of the matching and clustering algorithm is extending the set of confirmed publications. The approaches used in this system are applicable to solving the problem of linking named entities in various multilingual data sources.

AB - The cross-lingual matching of authors and publications is a special case of the task of assigning a unique identifier to the same real-world entity in multilingual data sources. This paper presents the results of experiments with the several versions of a cross-lingual system designed to match, basing on a Russian-language data source, the authors and English-language publications. Since different heuristics have been tested in these versions of the system, we consider here only those that have given the best results. An important element of the system is its interactive visualization tool, which gives information on the distribution of publications by authors, as well as providing the ability to edit the results of the analysis. The visualization system is supplemented with methods for similarity matrices ordering. Experiments have shown that the main source of improving the quality of the matching and clustering algorithm is extending the set of confirmed publications. The approaches used in this system are applicable to solving the problem of linking named entities in various multilingual data sources.

KW - Clustering

KW - Cross-Lingual Matching of Authors and Publications

KW - Entity Resolution

KW - Interactive Visualization

KW - Multilingual Knowledge Bases

UR - http://www.scopus.com/inward/record.url?scp=85078483379&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:85078483379

VL - 2543

SP - 26

EP - 37

JO - CEUR Workshop Proceedings

JF - CEUR Workshop Proceedings

SN - 1613-0073

T2 - 21st Conference on Scientific Services and Internet, SSI 2019

Y2 - 23 September 2019 through 28 September 2019

ER -

ID: 23261648