Standard

Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge. / Pati, Sarthak; Linardos, Akis; Edwards, Brandon и др.

в: Nature Communications, Том 16, № 1, 6274, 08.07.2025.

Результаты исследований: Научные публикации в периодических изданияхстатьяРецензирование

Harvard

Pati, S, Linardos, A, Edwards, B, Sheller, M, Foley, P, Aristizabal, A, Zimmerer, D, Gruzdev, A, Martin, J, Shinohara, RT, Reinke, A, Isensee, F, Parampottupadam, S, Parekh, K, Floca, R, Kassem, H, Baheti, B, Thakur, S, Kushibar, K, Lekadir, K, Jiang, M, Yin, Y, Yang, H, Liu, Q, Chen, C, Dou, Q, Heng, PA, Zhang, X, Zhang, S, Khan, MI, Azeem, MA, Jafaritadi, M, Alhoniemi, E, Kontio, E, Khan, SA, Mächler, L, Ezhov, I, Kofler, F, Shit, S, Paetzold, JC, Loehr, T, Wiestler, B, Peiris, H, Pawar, K, Zhong, S, Chen, Z, Hayat, M, Egan, G, Harandi, M, Isik Polat, E, Polat, G, Kocyigit, A, Temizel, A, Tuladhar, A, Tyagi, L, Souza, R, Forkert, ND, Mouches, P, Wilms, M, Shambhat, V, Maurya, A, Danannavar, SS, Kalla, R, Anand, VK, Krishnamurthi, G, Nalawade, S, Ganesh, C, Wagner, B, Reddy, D, Das, Y, Yu, FF, Fei, B, Madhuranthakam, AJ, Maldjian, J, Singh, G, Ren, J, Zhang, W, An, N, Hu, Q, Zhang, Y, Zhou, Y, Siomos, V, Tarroni, G, Passerrat-Palmbach, J, Rawat, A, Zizzo, G, Kadhe, SR, Epperlein, JP, Braghin, S, Tuchinov, B, Maier-Hein, K (ред.) & Bakas, S (ред.) 2025, 'Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge', Nature Communications, Том. 16, № 1, 6274. https://doi.org/10.1038/s41467-025-60466-1

APA

Pati, S., Linardos, A., Edwards, B., Sheller, M., Foley, P., Aristizabal, A., Zimmerer, D., Gruzdev, A., Martin, J., Shinohara, R. T., Reinke, A., Isensee, F., Parampottupadam, S., Parekh, K., Floca, R., Kassem, H., Baheti, B., Thakur, S., Kushibar, K., ... Bakas, S. (Ред.) (2025). Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge. Nature Communications, 16(1), [6274]. https://doi.org/10.1038/s41467-025-60466-1

Vancouver

Pati S, Linardos A, Edwards B, Sheller M, Foley P, Aristizabal A и др. Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge. Nature Communications. 2025 июль 8;16(1):6274. doi: 10.1038/s41467-025-60466-1

Author

Pati, Sarthak ; Linardos, Akis ; Edwards, Brandon и др. / Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge. в: Nature Communications. 2025 ; Том 16, № 1.

BibTeX

@article{ecc685bacee544829b69d2fa77604a0d,
title = "Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge",
abstract = "Computational competitions are the standard for benchmarking medical image analysis algorithms, but they typically use small curated test datasets acquired at a few centers, leaving a gap to the reality of diverse multicentric patient data. To this end, the Federated Tumor Segmentation (FeTS) Challenge represents the paradigm for real-world algorithmic performance evaluation. The FeTS challenge is a competition to benchmark (i) federated learning aggregation algorithms and (ii) state-of-the-art segmentation algorithms, across multiple international sites. Weight aggregation and client selection techniques were compared using a multicentric brain tumor dataset in realistic federated learning simulations, yielding benefits for adaptive weight aggregation, and efficiency gains through client sampling. Quantitative performance evaluation of state-of-the-art segmentation algorithms on data distributed internationally across 32 institutions yielded good generalization on average, albeit the worst-case performance revealed data-specific modes of failure. Similar multi-site setups can help validate the real-world utility of healthcare AI algorithms in the future.",
author = "Sarthak Pati and Akis Linardos and Brandon Edwards and Micah Sheller and Patrick Foley and Alejandro Aristizabal and David Zimmerer and Alexey Gruzdev and Jason Martin and Shinohara, {Russell T.} and Annika Reinke and Fabian Isensee and Santhosh Parampottupadam and Kaushal Parekh and Ralf Floca and Hasan Kassem and Bhakti Baheti and Siddhesh Thakur and Kaisar Kushibar and Karim Lekadir and Meirui Jiang and Youtan Yin and Hongzheng Yang and Quande Liu and Cheng Chen and Qi Dou and Heng, {Pheng Ann} and Xiaofan Zhang and Shaoting Zhang and Khan, {Muhammad Irfan} and Azeem, {Mohammad Ayyaz} and Mojtaba Jafaritadi and Esa Alhoniemi and Elina Kontio and Khan, {Suleiman A.} and Leon M{\"a}chler and Ivan Ezhov and Florian Kofler and Suprosanna Shit and Paetzold, {Johannes C.} and Timo Loehr and Benedikt Wiestler and Himashi Peiris and Kamlesh Pawar and Shenjun Zhong and Zhaolin Chen and Munawar Hayat and Gary Egan and Mehrtash Harandi and {Isik Polat}, Ece and Gorkem Polat and Altan Kocyigit and Alptekin Temizel and Anup Tuladhar and Lakshay Tyagi and Raissa Souza and Forkert, {Nils D.} and Pauline Mouches and Matthias Wilms and Vishruth Shambhat and Akansh Maurya and Danannavar, {Shubham Subhas} and Rohit Kalla and Anand, {Vikas Kumar} and Ganapathy Krishnamurthi and Sahil Nalawade and Chandan Ganesh and Ben Wagner and Divya Reddy and Yudhajit Das and Yu, {Fang F.} and Baowei Fei and Madhuranthakam, {Ananth J.} and Joseph Maldjian and Gaurav Singh and Jianxun Ren and Wei Zhang and Ning An and Qingyu Hu and Youjia Zhang and Ying Zhou and Vasilis Siomos and Giacomo Tarroni and Jonathan Passerrat-Palmbach and Ambrish Rawat and Giulio Zizzo and Kadhe, {Swanand Ravindra} and Epperlein, {Jonathan P.} and Stefano Braghin and Bair Tuchinov and Klaus Maier-Hein and Spyridon Bakas",
note = "We would like to thank Manuel Wiesenfarth and Paul F. J{\"a}ger (DKFZ) for helpful discussions. Research reported in this publication was partly funded by the Helmholtz Association (HA) within the project “Trustworthy Federated Data Analytics” (TFDA) (funding number ZT-I-OO1 4), and partly by the National Institutes of Health (NIH), under award numbers NCI:U01CA242871 (PI: S.Bakas) and NCI:U24CA279629 (PI: S.Bakas). K. Kushibar holds the Juan de la Cierva fellowship with a reference number FJC2021-047659-I. This work was supported in part by Hong Kong Research Grants Council Project No. T45- 401/22-N. Team HT-TUAS was partly funded by Business Finland under Grant 33961/31/2020. They also acknowledges the CSC-Puhti super-computer for their support and computational resources during FeTS 2021 and 2022. N. D. Forkert was supported by the Canadian Institutes of Health Research (CIHR Project Grant 462169). Jakub Nalepa was supported by the Silesian University of Technology funds through the Excellence Initiative–Research University program (Grant 02/080/SDU/10-21-01), and by the Silesian University of Technology funds through the grant for maintaining and developing research potential. Research reported in this publication was partly funded by R21EB030209, NIH/NIBIB (PI: Y. Yuan), UL1TR001433, NIH/NCATS, a research grant from Varian Medical Systems (Palo Alto, CA, USA) (PI: Y. Yuan). Y. Yuan also acknowledges the generous support of Herbert and Florence Irving/the Irving Trust. Z. Jiang was supported by National Cancer Institute (UG3 CA236536). H. Mohy-ud-Din was supported by a grant from the Higher Education Commission of Pakistan as part of the National Center for Big Data and Cloud Computing and the Clinical and Translational Imaging Lab at LUMS. M. Kozubek was supported by the Ministry of Health of the Czech Republic (grant NU21-08-00359 and conceptual development of research organization FNBr-65269705) and Ministry of Education, Youth and Sports of the Czech Republic (Project LM2023050). V{\'a}clav Vyb{\'i}hal was supported by MH CZ - DRO (FNBr, 65269705). Y. Gusev was supported by CCSG Grant number: NCI P30 CA51008. P. Vollmuth was supported by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - Project-ID 404521405, SFB 1389 - UNITE Glioblastoma, Work Package C02, and Priority Programme 2177 “Radiomics: Next Generation of Biomedical Imaging” (KI 2410/1-1 ∣ MA 6340/18-1). B. Landman was supported by NSF 2040462. A. Rao was supported by the NIH (R37CA214955-01A1). A. Falc{\~a}o was supported by CNPq 304711/2023-3. P. Guevara was supported by the ANID-Basal proyects AFB240002 (AC3E) and FB210017 (CENIA). Research reported in this publication was partly funded by the NSF Convergence Accelerator - Track D: ImagiQ: Asynchronous and Decentralized Federated Learning for Medical Imaging, Grant Number: 2040532, and R21CA270742 (Period of Funding: 09/15/20 - 05/31/21). Martin Valli{\`e}res acknowledges funding from the Canada CIFAR AI Chairs Program. Stuart Currie receives salary support from a Leeds Hospitals Charity (9R01/1403) and Cancer Research UK (C19942/A28832) grants. Kavi Fatania is a 4ward North Clinical PhD fellow funded by Wellcome award (203914/Z/16/Z). Russell Frood is a Clinical Trials Fellow funded by CRUK (RCCCTF-Oct22/100002). This work was funded in part by National Institutes of Health R01CA233888 and the grant NCI:U24CA248265. The content of this publication is solely the responsibility of the authors and does not represent the official views of the HA, or the NIH. U.Baid, S.Pati, and S.Bakas conducted part of the work reported in this manuscript at their current affiliations, as well as while they were affiliated with the Center for Artificial Intelligence and Data Science for Integrated Diagnostics (AI2D) and the Center for Biomedical Image Computing and Analytics (CBICA) at the University of Pennsylvania, Philadelphia, PA, USA.",
year = "2025",
month = jul,
day = "8",
doi = "10.1038/s41467-025-60466-1",
language = "English",
volume = "16",
journal = "Nature Communications",
issn = "2041-1723",
publisher = "Nature Publishing Group",
number = "1",

}

RIS

TY - JOUR

T1 - Towards fair decentralized benchmarking of healthcare AI algorithms with the Federated Tumor Segmentation (FeTS) challenge

AU - Pati, Sarthak

AU - Linardos, Akis

AU - Edwards, Brandon

AU - Sheller, Micah

AU - Foley, Patrick

AU - Aristizabal, Alejandro

AU - Zimmerer, David

AU - Gruzdev, Alexey

AU - Martin, Jason

AU - Shinohara, Russell T.

AU - Reinke, Annika

AU - Isensee, Fabian

AU - Parampottupadam, Santhosh

AU - Parekh, Kaushal

AU - Floca, Ralf

AU - Kassem, Hasan

AU - Baheti, Bhakti

AU - Thakur, Siddhesh

AU - Kushibar, Kaisar

AU - Lekadir, Karim

AU - Jiang, Meirui

AU - Yin, Youtan

AU - Yang, Hongzheng

AU - Liu, Quande

AU - Chen, Cheng

AU - Dou, Qi

AU - Heng, Pheng Ann

AU - Zhang, Xiaofan

AU - Zhang, Shaoting

AU - Khan, Muhammad Irfan

AU - Azeem, Mohammad Ayyaz

AU - Jafaritadi, Mojtaba

AU - Alhoniemi, Esa

AU - Kontio, Elina

AU - Khan, Suleiman A.

AU - Mächler, Leon

AU - Ezhov, Ivan

AU - Kofler, Florian

AU - Shit, Suprosanna

AU - Paetzold, Johannes C.

AU - Loehr, Timo

AU - Wiestler, Benedikt

AU - Peiris, Himashi

AU - Pawar, Kamlesh

AU - Zhong, Shenjun

AU - Chen, Zhaolin

AU - Hayat, Munawar

AU - Egan, Gary

AU - Harandi, Mehrtash

AU - Isik Polat, Ece

AU - Polat, Gorkem

AU - Kocyigit, Altan

AU - Temizel, Alptekin

AU - Tuladhar, Anup

AU - Tyagi, Lakshay

AU - Souza, Raissa

AU - Forkert, Nils D.

AU - Mouches, Pauline

AU - Wilms, Matthias

AU - Shambhat, Vishruth

AU - Maurya, Akansh

AU - Danannavar, Shubham Subhas

AU - Kalla, Rohit

AU - Anand, Vikas Kumar

AU - Krishnamurthi, Ganapathy

AU - Nalawade, Sahil

AU - Ganesh, Chandan

AU - Wagner, Ben

AU - Reddy, Divya

AU - Das, Yudhajit

AU - Yu, Fang F.

AU - Fei, Baowei

AU - Madhuranthakam, Ananth J.

AU - Maldjian, Joseph

AU - Singh, Gaurav

AU - Ren, Jianxun

AU - Zhang, Wei

AU - An, Ning

AU - Hu, Qingyu

AU - Zhang, Youjia

AU - Zhou, Ying

AU - Siomos, Vasilis

AU - Tarroni, Giacomo

AU - Passerrat-Palmbach, Jonathan

AU - Rawat, Ambrish

AU - Zizzo, Giulio

AU - Kadhe, Swanand Ravindra

AU - Epperlein, Jonathan P.

AU - Braghin, Stefano

AU - Tuchinov, Bair

A2 - Maier-Hein, Klaus

A2 - Bakas, Spyridon

N1 - We would like to thank Manuel Wiesenfarth and Paul F. Jäger (DKFZ) for helpful discussions. Research reported in this publication was partly funded by the Helmholtz Association (HA) within the project “Trustworthy Federated Data Analytics” (TFDA) (funding number ZT-I-OO1 4), and partly by the National Institutes of Health (NIH), under award numbers NCI:U01CA242871 (PI: S.Bakas) and NCI:U24CA279629 (PI: S.Bakas). K. Kushibar holds the Juan de la Cierva fellowship with a reference number FJC2021-047659-I. This work was supported in part by Hong Kong Research Grants Council Project No. T45- 401/22-N. Team HT-TUAS was partly funded by Business Finland under Grant 33961/31/2020. They also acknowledges the CSC-Puhti super-computer for their support and computational resources during FeTS 2021 and 2022. N. D. Forkert was supported by the Canadian Institutes of Health Research (CIHR Project Grant 462169). Jakub Nalepa was supported by the Silesian University of Technology funds through the Excellence Initiative–Research University program (Grant 02/080/SDU/10-21-01), and by the Silesian University of Technology funds through the grant for maintaining and developing research potential. Research reported in this publication was partly funded by R21EB030209, NIH/NIBIB (PI: Y. Yuan), UL1TR001433, NIH/NCATS, a research grant from Varian Medical Systems (Palo Alto, CA, USA) (PI: Y. Yuan). Y. Yuan also acknowledges the generous support of Herbert and Florence Irving/the Irving Trust. Z. Jiang was supported by National Cancer Institute (UG3 CA236536). H. Mohy-ud-Din was supported by a grant from the Higher Education Commission of Pakistan as part of the National Center for Big Data and Cloud Computing and the Clinical and Translational Imaging Lab at LUMS. M. Kozubek was supported by the Ministry of Health of the Czech Republic (grant NU21-08-00359 and conceptual development of research organization FNBr-65269705) and Ministry of Education, Youth and Sports of the Czech Republic (Project LM2023050). Václav Vybíhal was supported by MH CZ - DRO (FNBr, 65269705). Y. Gusev was supported by CCSG Grant number: NCI P30 CA51008. P. Vollmuth was supported by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - Project-ID 404521405, SFB 1389 - UNITE Glioblastoma, Work Package C02, and Priority Programme 2177 “Radiomics: Next Generation of Biomedical Imaging” (KI 2410/1-1 ∣ MA 6340/18-1). B. Landman was supported by NSF 2040462. A. Rao was supported by the NIH (R37CA214955-01A1). A. Falcão was supported by CNPq 304711/2023-3. P. Guevara was supported by the ANID-Basal proyects AFB240002 (AC3E) and FB210017 (CENIA). Research reported in this publication was partly funded by the NSF Convergence Accelerator - Track D: ImagiQ: Asynchronous and Decentralized Federated Learning for Medical Imaging, Grant Number: 2040532, and R21CA270742 (Period of Funding: 09/15/20 - 05/31/21). Martin Vallières acknowledges funding from the Canada CIFAR AI Chairs Program. Stuart Currie receives salary support from a Leeds Hospitals Charity (9R01/1403) and Cancer Research UK (C19942/A28832) grants. Kavi Fatania is a 4ward North Clinical PhD fellow funded by Wellcome award (203914/Z/16/Z). Russell Frood is a Clinical Trials Fellow funded by CRUK (RCCCTF-Oct22/100002). This work was funded in part by National Institutes of Health R01CA233888 and the grant NCI:U24CA248265. The content of this publication is solely the responsibility of the authors and does not represent the official views of the HA, or the NIH. U.Baid, S.Pati, and S.Bakas conducted part of the work reported in this manuscript at their current affiliations, as well as while they were affiliated with the Center for Artificial Intelligence and Data Science for Integrated Diagnostics (AI2D) and the Center for Biomedical Image Computing and Analytics (CBICA) at the University of Pennsylvania, Philadelphia, PA, USA.

PY - 2025/7/8

Y1 - 2025/7/8

N2 - Computational competitions are the standard for benchmarking medical image analysis algorithms, but they typically use small curated test datasets acquired at a few centers, leaving a gap to the reality of diverse multicentric patient data. To this end, the Federated Tumor Segmentation (FeTS) Challenge represents the paradigm for real-world algorithmic performance evaluation. The FeTS challenge is a competition to benchmark (i) federated learning aggregation algorithms and (ii) state-of-the-art segmentation algorithms, across multiple international sites. Weight aggregation and client selection techniques were compared using a multicentric brain tumor dataset in realistic federated learning simulations, yielding benefits for adaptive weight aggregation, and efficiency gains through client sampling. Quantitative performance evaluation of state-of-the-art segmentation algorithms on data distributed internationally across 32 institutions yielded good generalization on average, albeit the worst-case performance revealed data-specific modes of failure. Similar multi-site setups can help validate the real-world utility of healthcare AI algorithms in the future.

AB - Computational competitions are the standard for benchmarking medical image analysis algorithms, but they typically use small curated test datasets acquired at a few centers, leaving a gap to the reality of diverse multicentric patient data. To this end, the Federated Tumor Segmentation (FeTS) Challenge represents the paradigm for real-world algorithmic performance evaluation. The FeTS challenge is a competition to benchmark (i) federated learning aggregation algorithms and (ii) state-of-the-art segmentation algorithms, across multiple international sites. Weight aggregation and client selection techniques were compared using a multicentric brain tumor dataset in realistic federated learning simulations, yielding benefits for adaptive weight aggregation, and efficiency gains through client sampling. Quantitative performance evaluation of state-of-the-art segmentation algorithms on data distributed internationally across 32 institutions yielded good generalization on average, albeit the worst-case performance revealed data-specific modes of failure. Similar multi-site setups can help validate the real-world utility of healthcare AI algorithms in the future.

UR - https://www.mendeley.com/catalogue/46270516-fb74-3b8b-a755-186bd89f038e/

UR - https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105010224653&origin=inward

U2 - 10.1038/s41467-025-60466-1

DO - 10.1038/s41467-025-60466-1

M3 - Article

VL - 16

JO - Nature Communications

JF - Nature Communications

SN - 2041-1723

IS - 1

M1 - 6274

ER -

ID: 68460939