Standard

Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms. / Mansurova, Madina; Barakhnin, Vladimir; Khibatkhanuly, Yerzhan et al.

Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings. ed. / Ngoc Thanh Nguyen; Richard Chbeir; Ernesto Exposito; Philippe Aniorté; Bogdan Trawinski; Ngoc Thanh Nguyen. Springer-Verlag GmbH and Co. KG, 2019. p. 58-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11684 LNAI).

Research output: Chapter in Book/Report/Conference proceedingConference contributionResearchpeer-review

Harvard

Mansurova, M, Barakhnin, V, Khibatkhanuly, Y & Pastushkov, I 2019, Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms. in NT Nguyen, R Chbeir, E Exposito, P Aniorté, B Trawinski & NT Nguyen (eds), Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11684 LNAI, Springer-Verlag GmbH and Co. KG, pp. 58-69, 11th International Conference on Computational Collective Intelligence, ICCCI 2019, Hendaye, France, 04.09.2019. https://doi.org/10.1007/978-3-030-28374-2_6

APA

Mansurova, M., Barakhnin, V., Khibatkhanuly, Y., & Pastushkov, I. (2019). Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms. In N. T. Nguyen, R. Chbeir, E. Exposito, P. Aniorté, B. Trawinski, & N. T. Nguyen (Eds.), Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings (pp. 58-69). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11684 LNAI). Springer-Verlag GmbH and Co. KG. https://doi.org/10.1007/978-3-030-28374-2_6

Vancouver

Mansurova M, Barakhnin V, Khibatkhanuly Y, Pastushkov I. Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms. In Nguyen NT, Chbeir R, Exposito E, Aniorté P, Trawinski B, Nguyen NT, editors, Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings. Springer-Verlag GmbH and Co. KG. 2019. p. 58-69. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-030-28374-2_6

Author

Mansurova, Madina ; Barakhnin, Vladimir ; Khibatkhanuly, Yerzhan et al. / Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms. Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings. editor / Ngoc Thanh Nguyen ; Richard Chbeir ; Ernesto Exposito ; Philippe Aniorté ; Bogdan Trawinski ; Ngoc Thanh Nguyen. Springer-Verlag GmbH and Co. KG, 2019. pp. 58-69 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

BibTeX

@inproceedings{1214d1492729402a9a2f30dd6cc86ca9,
title = "Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms",
abstract = "The modern society have been witnessed that intensive development of Internet technologies had followed to information explosion during last decades. This explosion had been expressing by an exponential growth of data volume among the low-quality information. This paper is designed to provide detailed information about some intellectual tools which are support decision taking by automatic knowledge extraction. In the first part of paper, we considered a preprocessing contains morphological analysis of texts. Then we had considered the model of text documents in the form of a hypergraph and implementation of the random walk method to extract semantically close word{\textquoteright}s pairs, in other words, pairs that often appears together. Result of calculations is matrix with word affinity coefficients corresponding to each other component of vocabulary vector. In the second part we describe training of neural network for linguistic constructions extraction. These ones include possible values of text named entities descriptors. The neural network enables to retrieve information on one preselected descriptor, for example, location, in the form of the final result of the name of geographical objects. In a general case, the neural network can retrieve information on several descriptors simultaneously.",
keywords = "Entity extraction, Machine learning algorithms, Neural networks, Random walk method, Semi-structured data",
author = "Madina Mansurova and Vladimir Barakhnin and Yerzhan Khibatkhanuly and Ilya Pastushkov",
year = "2019",
month = jan,
day = "1",
doi = "10.1007/978-3-030-28374-2_6",
language = "English",
isbn = "9783030283735",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer-Verlag GmbH and Co. KG",
pages = "58--69",
editor = "Nguyen, {Ngoc Thanh} and Richard Chbeir and Ernesto Exposito and Philippe Aniort{\'e} and Bogdan Trawinski and Nguyen, {Ngoc Thanh}",
booktitle = "Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings",
address = "Germany",
note = "11th International Conference on Computational Collective Intelligence, ICCCI 2019 ; Conference date: 04-09-2019 Through 06-09-2019",

}

RIS

TY - GEN

T1 - Named Entity Extraction from Semi-structured Data Using Machine Learning Algorithms

AU - Mansurova, Madina

AU - Barakhnin, Vladimir

AU - Khibatkhanuly, Yerzhan

AU - Pastushkov, Ilya

PY - 2019/1/1

Y1 - 2019/1/1

N2 - The modern society have been witnessed that intensive development of Internet technologies had followed to information explosion during last decades. This explosion had been expressing by an exponential growth of data volume among the low-quality information. This paper is designed to provide detailed information about some intellectual tools which are support decision taking by automatic knowledge extraction. In the first part of paper, we considered a preprocessing contains morphological analysis of texts. Then we had considered the model of text documents in the form of a hypergraph and implementation of the random walk method to extract semantically close word’s pairs, in other words, pairs that often appears together. Result of calculations is matrix with word affinity coefficients corresponding to each other component of vocabulary vector. In the second part we describe training of neural network for linguistic constructions extraction. These ones include possible values of text named entities descriptors. The neural network enables to retrieve information on one preselected descriptor, for example, location, in the form of the final result of the name of geographical objects. In a general case, the neural network can retrieve information on several descriptors simultaneously.

AB - The modern society have been witnessed that intensive development of Internet technologies had followed to information explosion during last decades. This explosion had been expressing by an exponential growth of data volume among the low-quality information. This paper is designed to provide detailed information about some intellectual tools which are support decision taking by automatic knowledge extraction. In the first part of paper, we considered a preprocessing contains morphological analysis of texts. Then we had considered the model of text documents in the form of a hypergraph and implementation of the random walk method to extract semantically close word’s pairs, in other words, pairs that often appears together. Result of calculations is matrix with word affinity coefficients corresponding to each other component of vocabulary vector. In the second part we describe training of neural network for linguistic constructions extraction. These ones include possible values of text named entities descriptors. The neural network enables to retrieve information on one preselected descriptor, for example, location, in the form of the final result of the name of geographical objects. In a general case, the neural network can retrieve information on several descriptors simultaneously.

KW - Entity extraction

KW - Machine learning algorithms

KW - Neural networks

KW - Random walk method

KW - Semi-structured data

UR - http://www.scopus.com/inward/record.url?scp=85072857642&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-28374-2_6

DO - 10.1007/978-3-030-28374-2_6

M3 - Conference contribution

AN - SCOPUS:85072857642

SN - 9783030283735

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 58

EP - 69

BT - Computational Collective Intelligence - 11th International Conference, ICCCI 2019, Proceedings

A2 - Nguyen, Ngoc Thanh

A2 - Chbeir, Richard

A2 - Exposito, Ernesto

A2 - Aniorté, Philippe

A2 - Trawinski, Bogdan

A2 - Nguyen, Ngoc Thanh

PB - Springer-Verlag GmbH and Co. KG

T2 - 11th International Conference on Computational Collective Intelligence, ICCCI 2019

Y2 - 4 September 2019 through 6 September 2019

ER -

ID: 21792743