Standard

Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM. / Luu, Minh Sao Khue; Banerjee, Santanu; Pavlovskiy, Evgeniy N. и др.

International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM. IEEE Computer Society, 2024. стр. 2320-2325 (International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM).

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференцийстатья в сборнике материалов конференциинаучнаяРецензирование

Harvard

Luu, MSK, Banerjee, S, Pavlovskiy, EN & Tuchinov, BN 2024, Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM. в International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM. International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM, IEEE Computer Society, стр. 2320-2325, 25th IEEE International Conference of Young Professionals in Electron Devices and Materials, Российская Федерация, 28.06.2024. https://doi.org/10.1109/EDM61683.2024.10615210

APA

Luu, M. S. K., Banerjee, S., Pavlovskiy, E. N., & Tuchinov, B. N. (2024). Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM. в International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM (стр. 2320-2325). (International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM). IEEE Computer Society. https://doi.org/10.1109/EDM61683.2024.10615210

Vancouver

Luu MSK, Banerjee S, Pavlovskiy EN, Tuchinov BN. Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM. в International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM. IEEE Computer Society. 2024. стр. 2320-2325. (International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM). doi: 10.1109/EDM61683.2024.10615210

Author

Luu, Minh Sao Khue ; Banerjee, Santanu ; Pavlovskiy, Evgeniy N. и др. / Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM. International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM. IEEE Computer Society, 2024. стр. 2320-2325 (International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM).

BibTeX

@inproceedings{8d5a2d403dde4e549c4fabac302f12dd,
title = "Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM",
abstract = "This study employs three advanced gradient boosting machine learning algorithms to assess potential disparities in healthcare delivery. We specifically investigate which factors contribute to a patient's timely diagnosis of metastatic breast cancer using a public healthcare dataset. Our approach involves training and testing three separate models, as well as an ensemble model with automatically optimized weights. The models try to predict whether patients received a diagnosis of metastatic breast cancer within 90 days. Each model has different preprocessing and feature selection steps. The hyperparameter optimization is performed using the Optuna library in Python. Models are evaluated on the Kaggle platform, with our metrics indicating strong predictive performance; Categorical Boosting achieved an Area Under the Receiver Operating Characteristic Curve score of 0.813, Extreme Gradient Boosting reached 0.808, Light Gradient Boosting Machine scored 0.805, and the ensemble model culminated at 0.808. Additionally, we analyze the set of features being used by all the best models and examine the impact of hyperparameters on the models' overall performance.",
keywords = "gradient boosting, health informatics, healthcare equity, machine learning, metastatic breast cancer",
author = "Luu, {Minh Sao Khue} and Santanu Banerjee and Pavlovskiy, {Evgeniy N.} and Tuchinov, {Bair N.}",
note = "This work was supported by a grant for research centers, provided by the Analytical Center for the Government of the Russian Federation in accordance with the subsidy agreement (agreement identifier 000000D730324P540002) and the agreement with the Novosibirsk State University dated December 27, 2023 No. 70-2023-001318.; 25th IEEE International Conference of Young Professionals in Electron Devices and Materials, EDM 2024 ; Conference date: 28-06-2024 Through 02-07-2024",
year = "2024",
doi = "10.1109/EDM61683.2024.10615210",
language = "English",
isbn = "9798350389234",
series = "International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM",
publisher = "IEEE Computer Society",
pages = "2320--2325",
booktitle = "International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM",
address = "United States",
url = "https://edm.ieeesiberia.org/",

}

RIS

TY - GEN

T1 - Harnessing Ensemble Machine Learning Models for Timely Diagnosis of Breast Cancer Metastasis: A Case Study on CatBoost, XGBoost, and LGBM

AU - Luu, Minh Sao Khue

AU - Banerjee, Santanu

AU - Pavlovskiy, Evgeniy N.

AU - Tuchinov, Bair N.

N1 - Conference code: 25

PY - 2024

Y1 - 2024

N2 - This study employs three advanced gradient boosting machine learning algorithms to assess potential disparities in healthcare delivery. We specifically investigate which factors contribute to a patient's timely diagnosis of metastatic breast cancer using a public healthcare dataset. Our approach involves training and testing three separate models, as well as an ensemble model with automatically optimized weights. The models try to predict whether patients received a diagnosis of metastatic breast cancer within 90 days. Each model has different preprocessing and feature selection steps. The hyperparameter optimization is performed using the Optuna library in Python. Models are evaluated on the Kaggle platform, with our metrics indicating strong predictive performance; Categorical Boosting achieved an Area Under the Receiver Operating Characteristic Curve score of 0.813, Extreme Gradient Boosting reached 0.808, Light Gradient Boosting Machine scored 0.805, and the ensemble model culminated at 0.808. Additionally, we analyze the set of features being used by all the best models and examine the impact of hyperparameters on the models' overall performance.

AB - This study employs three advanced gradient boosting machine learning algorithms to assess potential disparities in healthcare delivery. We specifically investigate which factors contribute to a patient's timely diagnosis of metastatic breast cancer using a public healthcare dataset. Our approach involves training and testing three separate models, as well as an ensemble model with automatically optimized weights. The models try to predict whether patients received a diagnosis of metastatic breast cancer within 90 days. Each model has different preprocessing and feature selection steps. The hyperparameter optimization is performed using the Optuna library in Python. Models are evaluated on the Kaggle platform, with our metrics indicating strong predictive performance; Categorical Boosting achieved an Area Under the Receiver Operating Characteristic Curve score of 0.813, Extreme Gradient Boosting reached 0.808, Light Gradient Boosting Machine scored 0.805, and the ensemble model culminated at 0.808. Additionally, we analyze the set of features being used by all the best models and examine the impact of hyperparameters on the models' overall performance.

KW - gradient boosting

KW - health informatics

KW - healthcare equity

KW - machine learning

KW - metastatic breast cancer

UR - https://www.scopus.com/record/display.uri?eid=2-s2.0-85201931798&origin=inward&txGid=8455f75d351cd53e743da6862370ab01

UR - https://www.mendeley.com/catalogue/0a3e884f-793a-34e4-ae20-216048a0db66/

U2 - 10.1109/EDM61683.2024.10615210

DO - 10.1109/EDM61683.2024.10615210

M3 - Conference contribution

SN - 9798350389234

T3 - International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM

SP - 2320

EP - 2325

BT - International Conference of Young Specialists on Micro/Nanotechnologies and Electron Devices, EDM

PB - IEEE Computer Society

T2 - 25th IEEE International Conference of Young Professionals in Electron Devices and Materials

Y2 - 28 June 2024 through 2 July 2024

ER -

ID: 60550286