Research output: Contribution to journal › Conference article › peer-review
Development of lexico-syntactic ontology design patterns for information extraction of scientific data. / Ovchinnikova, Kristina; Kononenko, Irina; Sidorova, Elena.
In: CEUR Workshop Proceedings, Vol. 3036, 2021, p. 349-361.Research output: Contribution to journal › Conference article › peer-review
}
TY - JOUR
T1 - Development of lexico-syntactic ontology design patterns for information extraction of scientific data
AU - Ovchinnikova, Kristina
AU - Kononenko, Irina
AU - Sidorova, Elena
N1 - Publisher Copyright: © 2021 CEUR-WS. All rights reserved.
PY - 2021
Y1 - 2021
N2 - The work considers an approach to information extraction based on lexico-syntactic patterns (LSPs). LSPs are built on the basis of knowledge about the scientific subject domain presented in the ontology and the corpus of scientific publications in different areas of knowledge. Two key tasks must be solved with the help of the LSPs: extracting object names and constructing objects in accordance with the structure of the ontology classes. In line with these tasks, terminological and informational LSPs are differentiated. Terminological patterns ensure the extraction of object names and properties based on indicators - marker words and phrases. Information patterns provide identification of ontology objects based on key attributes, description of actant structure for predicates expressing attributive relations and relations between ontology objects, as well as matching language constructions to values of attributes of ontology objects and their relations. Research is conducted on the basis of a corpus of scientific publications, which includes 100 articles from various fields of knowledge. The ways of expressing information about research method as the central concept of the ontology of scientific activity are investigated.
AB - The work considers an approach to information extraction based on lexico-syntactic patterns (LSPs). LSPs are built on the basis of knowledge about the scientific subject domain presented in the ontology and the corpus of scientific publications in different areas of knowledge. Two key tasks must be solved with the help of the LSPs: extracting object names and constructing objects in accordance with the structure of the ontology classes. In line with these tasks, terminological and informational LSPs are differentiated. Terminological patterns ensure the extraction of object names and properties based on indicators - marker words and phrases. Information patterns provide identification of ontology objects based on key attributes, description of actant structure for predicates expressing attributive relations and relations between ontology objects, as well as matching language constructions to values of attributes of ontology objects and their relations. Research is conducted on the basis of a corpus of scientific publications, which includes 100 articles from various fields of knowledge. The ways of expressing information about research method as the central concept of the ontology of scientific activity are investigated.
KW - Lexico-syntactic patterns
KW - Ontology design patterns
KW - Ontology of the scientific activity
KW - Ontology population
KW - Subject dictionary
UR - http://www.scopus.com/inward/record.url?scp=85121220874&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85121220874
VL - 3036
SP - 349
EP - 361
JO - CEUR Workshop Proceedings
JF - CEUR Workshop Proceedings
SN - 1613-0073
T2 - Supplementary 23rd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2021
Y2 - 26 October 2021 through 29 October 2021
ER -
ID: 35028873