Standard

Application of elementary probability models for text homogeneity and segmentation: A case study of Bible. / Abebe, Berhane.

In: PLoS ONE, Vol. 19, No. 6, e0303432, 06.2024.

Research output: Contribution to journal › Article › peer-review

Vancouver

Abebe B. Application of elementary probability models for text homogeneity and segmentation: A case study of Bible. PLoS ONE. 2024 Jun;19(6):e0303432. doi: 10.1371/journal.pone.0303432

BibTeX

@article{9ceda83db4db4891b68fb280aad585fe,
title = "Application of elementary probability models for text homogeneity and segmentation: A case study of Bible",
abstract = "For the purpose of this study, a statistical test of Biblical books was conducted using the recently discovered probability models for text homogeneity and text change point detection. Accordingly, translations of Biblical books in Tigrigna and Amharic (major languages spoken in Eritrea and Ethiopia) and English were studied. A Zipf-Mandelbrot distribution with a parameter range of 0.55 to 0.88 was obtained in these three Bibles. According to the statistical analysis of the texts' homogeneity, the translation of the Bible in each of these three languages was a heterogeneous concatenation of different books or genres. Furthermore, an in-depth examination of the text segmentation of part of a single genre, the English Bible letters, revealed that the Pauline letters are heterogeneous concatenations of two homogeneous segments.",
author = "Berhane Abebe",
year = "2024",
month = jun,
doi = "10.1371/journal.pone.0303432",
language = "English",
volume = "19",
journal = "PLoS ONE",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "6",

}

RIS

TY - JOUR

T1 - Application of elementary probability models for text homogeneity and segmentation: A case study of Bible

AU - Abebe, Berhane

PY - 2024/6

Y1 - 2024/6

N2 - For the purpose of this study, a statistical test of Biblical books was conducted using the recently discovered probability models for text homogeneity and text change point detection. Accordingly, translations of Biblical books in Tigrigna and Amharic (major languages spoken in Eritrea and Ethiopia) and English were studied. A Zipf-Mandelbrot distribution with a parameter range of 0.55 to 0.88 was obtained in these three Bibles. According to the statistical analysis of the texts' homogeneity, the translation of the Bible in each of these three languages was a heterogeneous concatenation of different books or genres. Furthermore, an in-depth examination of the text segmentation of part of a single genre, the English Bible letters, revealed that the Pauline letters are heterogeneous concatenations of two homogeneous segments.

AB - For the purpose of this study, a statistical test of Biblical books was conducted using the recently discovered probability models for text homogeneity and text change point detection. Accordingly, translations of Biblical books in Tigrigna and Amharic (major languages spoken in Eritrea and Ethiopia) and English were studied. A Zipf-Mandelbrot distribution with a parameter range of 0.55 to 0.88 was obtained in these three Bibles. According to the statistical analysis of the texts' homogeneity, the translation of the Bible in each of these three languages was a heterogeneous concatenation of different books or genres. Furthermore, an in-depth examination of the text segmentation of part of a single genre, the English Bible letters, revealed that the Pauline letters are heterogeneous concatenations of two homogeneous segments.

UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85195533491&origin=inward&txGid=f63c21d528ea3fd0ffbc1ce66b978959

UR - https://www.mendeley.com/catalogue/bc254618-d12d-3a1a-a40c-28dbf0ebd886/

U2 - 10.1371/journal.pone.0303432

DO - 10.1371/journal.pone.0303432

M3 - Article

C2 - 38848327

VL - 19

JO - PLoS ONE

JF - PLoS ONE

SN - 1932-6203

IS - 6

M1 - e0303432

ER -
