Результаты исследований: Научные публикации в периодических изданиях › статья › Рецензирование
SpineScan: a deep learning model for lumbar spine MRI annotation and Pfirrmann grading assessment. / Minin, Aleksandr; Leonova, Olga; Krutko, Aleksandr и др.
в: European Spine Journal, 03.11.2025.Результаты исследований: Научные публикации в периодических изданиях › статья › Рецензирование
}
TY - JOUR
T1 - SpineScan: a deep learning model for lumbar spine MRI annotation and Pfirrmann grading assessment
AU - Minin, Aleksandr
AU - Leonova, Olga
AU - Krutko, Aleksandr
AU - Elgaeva, Elizaveta
AU - Antonets, Denis
AU - Shtokalo, Dmitriy
AU - Tsepilov, Yakov
N1 - The work of A.M., D.A., and D.S. was supported by the Research Program at the Moscow State University (MSU) Institute for Artificial Intelligence. The work of E.E. and Y.T. was supported by Budget Project #FWNR-2022-0020 from the Institute of Cytology and Genetics, SB RAS. We also thank the Federal Research Center for Information and Computational Technologies SB RAS (FRC ICT SB RAS) for providing computational resources. The authors thank Vladimir Filonenko for his help with the translation and correction of this manuscript.
PY - 2025/11/3
Y1 - 2025/11/3
N2 - PURPOSE: While recent advances in deep learning have enabled automated Pfirrmann grading systems for intervertebral disc degeneration (IDD), many models remain inaccessible due to proprietary restrictions. This study aimed to develop and validate a convolutional neural network (CNN) for automated Pfirrmann grading using a diverse clinical dataset, to compare our model's performance with previously published results, and to create an open-source web application with a graphical user interface capable of grading both DICOM studies and individual MRI slices provided as image files.METHODS: We trained a CNN-based model using the YOLOv8x architecture on two datasets: a well-curated Russian disc degeneration study (RuDDS) cohort and an open-access dataset, totaling 484 lumbar MRI scans. Ground truth grading was provided by expert radiologists. The model was designed to simultaneously detect intervertebral discs and classify degeneration grades from single MRI slices. Performance was evaluated using standard metrics, including precision, recall, and mean average precision (mAP) across Pfirrmann grades I to V.RESULTS: Our model achieved predictive accuracy between 0.78 and 0.82 depending on lumbar level. The highest performance was observed for Grade IV discs (mAP50 = 0.872), while performance for Grade V was lower (mAP50-95 = 0.525), likely due to poor contrast and indistinct boundaries in highly degenerated discs. Overall, the model demonstrated a precision of 0.75 and a recall of 0.808. Comparison with previous studies revealed that our results are consistent with expert-level performance. The developed model formed the basis of a specialized web application, SpineScan, implemented using the Streamlit framework.CONCLUSIONS: The developed model shows strong potential for automated grading of lumbar disc degeneration and performs comparably to expert radiologists in most cases. Our findings support the potential applicability of SpineScan for AI-assisted Pfirrmann grading.
AB - PURPOSE: While recent advances in deep learning have enabled automated Pfirrmann grading systems for intervertebral disc degeneration (IDD), many models remain inaccessible due to proprietary restrictions. This study aimed to develop and validate a convolutional neural network (CNN) for automated Pfirrmann grading using a diverse clinical dataset, to compare our model's performance with previously published results, and to create an open-source web application with a graphical user interface capable of grading both DICOM studies and individual MRI slices provided as image files.METHODS: We trained a CNN-based model using the YOLOv8x architecture on two datasets: a well-curated Russian disc degeneration study (RuDDS) cohort and an open-access dataset, totaling 484 lumbar MRI scans. Ground truth grading was provided by expert radiologists. The model was designed to simultaneously detect intervertebral discs and classify degeneration grades from single MRI slices. Performance was evaluated using standard metrics, including precision, recall, and mean average precision (mAP) across Pfirrmann grades I to V.RESULTS: Our model achieved predictive accuracy between 0.78 and 0.82 depending on lumbar level. The highest performance was observed for Grade IV discs (mAP50 = 0.872), while performance for Grade V was lower (mAP50-95 = 0.525), likely due to poor contrast and indistinct boundaries in highly degenerated discs. Overall, the model demonstrated a precision of 0.75 and a recall of 0.808. Comparison with previous studies revealed that our results are consistent with expert-level performance. The developed model formed the basis of a specialized web application, SpineScan, implemented using the Streamlit framework.CONCLUSIONS: The developed model shows strong potential for automated grading of lumbar disc degeneration and performs comparably to expert radiologists in most cases. Our findings support the potential applicability of SpineScan for AI-assisted Pfirrmann grading.
U2 - 10.1007/s00586-025-09537-x
DO - 10.1007/s00586-025-09537-x
M3 - Article
C2 - 41182393
JO - European Spine Journal
JF - European Spine Journal
SN - 0940-6719
ER -
ID: 72326768