Standard

Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms. / Efimov, V. M.; Efimov, K. V.; Kovaleva, V. Yu.

In: Molecular Biology, Vol. 57, No. 2, 04.2023, p. 176-181.

Research output: Contribution to journalArticlepeer-review

Harvard

APA

Vancouver

Efimov VM, Efimov KV, Kovaleva VY. Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms. Molecular Biology. 2023 Apr;57(2):176-181. doi: 10.1134/S002689332302005X

Author

Efimov, V. M. ; Efimov, K. V. ; Kovaleva, V. Yu. / Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms. In: Molecular Biology. 2023 ; Vol. 57, No. 2. pp. 176-181.

BibTeX

@article{09d5d555349d4c4b9f06ac5ef675a3d1,
title = "Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms",
abstract = "Currently, the search for manifestations of selection under the influence of the environment in molecular sequences is usually carried out within closely related species or at the intraspecific level. It is believed that at high taxonomic levels this is unpromising due to phylogenetic relationship. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Based on more than 200 thousand characters, the principal components were obtained. A well-known statistical method, which has not been previously used for such problems, was used, which makes it possible to orthogonally decompose multidimensional variability into intra- and intertaxon variability and analyze them separately. The subfamily level was selected. For the second principal component (17.05% of intertaxon variability), a correlation with latitude was found (r = 0.561; n = 67; p < E–5). The clear division into two groups, revealed by the first principal component (39.48% of intertaxon variability), which does not coincide with the taxonomic one, indicates a possible physicochemical underlying cause for the differences between them. This requires further research.",
keywords = "Euclidean distances, Rodentia, cluster analysis, cytochrome b, geographic coordinates, orthogonal decomposition",
author = "Efimov, {V. M.} and Efimov, {K. V.} and Kovaleva, {V. Yu}",
note = "This study was performed within the framework of the budgetary project of the Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences no. FWNR2022-0020 “Systems Biology and Bioinformatics: Reconstruction, Analysis, and Modeling of the Structural-Functional Organization and Evolution of Human, Animal, Plant, and Microorganism Gene Networks.” Публикация для корректировки.",
year = "2023",
month = apr,
doi = "10.1134/S002689332302005X",
language = "English",
volume = "57",
pages = "176--181",
journal = "Molecular Biology",
issn = "0026-8933",
publisher = "Maik Nauka-Interperiodica Publishing",
number = "2",

}

RIS

TY - JOUR

T1 - Geometric Approach to Phylogeographic Analysis Molecular Genetic Sequences: Principal Components and Dendrograms

AU - Efimov, V. M.

AU - Efimov, K. V.

AU - Kovaleva, V. Yu

N1 - This study was performed within the framework of the budgetary project of the Institute of Cytology and Genetics, Siberian Branch, Russian Academy of Sciences no. FWNR2022-0020 “Systems Biology and Bioinformatics: Reconstruction, Analysis, and Modeling of the Structural-Functional Organization and Evolution of Human, Animal, Plant, and Microorganism Gene Networks.” Публикация для корректировки.

PY - 2023/4

Y1 - 2023/4

N2 - Currently, the search for manifestations of selection under the influence of the environment in molecular sequences is usually carried out within closely related species or at the intraspecific level. It is believed that at high taxonomic levels this is unpromising due to phylogenetic relationship. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Based on more than 200 thousand characters, the principal components were obtained. A well-known statistical method, which has not been previously used for such problems, was used, which makes it possible to orthogonally decompose multidimensional variability into intra- and intertaxon variability and analyze them separately. The subfamily level was selected. For the second principal component (17.05% of intertaxon variability), a correlation with latitude was found (r = 0.561; n = 67; p < E–5). The clear division into two groups, revealed by the first principal component (39.48% of intertaxon variability), which does not coincide with the taxonomic one, indicates a possible physicochemical underlying cause for the differences between them. This requires further research.

AB - Currently, the search for manifestations of selection under the influence of the environment in molecular sequences is usually carried out within closely related species or at the intraspecific level. It is believed that at high taxonomic levels this is unpromising due to phylogenetic relationship. Cytochrome b amino acid sequences of 67 rodent and lagomorph species with known geographic coordinates were digitized using the AAindex database. Based on more than 200 thousand characters, the principal components were obtained. A well-known statistical method, which has not been previously used for such problems, was used, which makes it possible to orthogonally decompose multidimensional variability into intra- and intertaxon variability and analyze them separately. The subfamily level was selected. For the second principal component (17.05% of intertaxon variability), a correlation with latitude was found (r = 0.561; n = 67; p < E–5). The clear division into two groups, revealed by the first principal component (39.48% of intertaxon variability), which does not coincide with the taxonomic one, indicates a possible physicochemical underlying cause for the differences between them. This requires further research.

KW - Euclidean distances

KW - Rodentia

KW - cluster analysis

KW - cytochrome b

KW - geographic coordinates

KW - orthogonal decomposition

UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85153717584&origin=inward&txGid=814a75d63ac4b2939ecfcdb38ea1316a

UR - https://www.mendeley.com/catalogue/37473e57-5970-3b7d-b2db-10aeb55ca3f1/

U2 - 10.1134/S002689332302005X

DO - 10.1134/S002689332302005X

M3 - Article

VL - 57

SP - 176

EP - 181

JO - Molecular Biology

JF - Molecular Biology

SN - 0026-8933

IS - 2

ER -

ID: 59653475