DSpace Repository

Improving prediction of COVID-19 mortality using machine learning in the Spanish SEMI-COVID-19 registry

Show simple item record

APA

Casas Rojo, JM & Ventura, PS & Antón Santos, JM & de Latierro, AO & Arévalo-Lorido, JC & Mauri, M & Rubio-Rivas, M & González Vega, R & Giner Galvañ, V & López Escobar, Alejandro (2023 ) .Improving prediction of COVID-19 mortality using machine learning in the Spanish SEMI-COVID-19 registry.

ISO 690

Casas Rojo, JM & Ventura, PS & Antón Santos, JM & de Latierro, AO & Arévalo-Lorido, JC & Mauri, M & Rubio-Rivas, M & González Vega, R & Giner Galvañ, V & López Escobar, Alejandro. 2023 .Improving prediction of COVID-19 mortality using machine learning in the Spanish SEMI-COVID-19 registry.

https://hdl.handle.net/20.500.12080/44572
dc.contributor.author Casas Rojo, JM
dc.contributor.author Ventura, PS
dc.contributor.author Antón Santos, JM
dc.contributor.author de Latierro, AO
dc.contributor.author Arévalo-Lorido, JC
dc.contributor.author Mauri, M
dc.contributor.author Rubio-Rivas, M
dc.contributor.author González Vega, R
dc.contributor.author Giner Galvañ, V
dc.contributor.author López Escobar, Alejandro
dc.date.accessioned 2024-10-17T12:25:14Z
dc.date.available 2024-10-17T12:25:14Z
dc.date.created 2023
dc.date.issued 2023
dc.identifier.uri https://hdl.handle.net/20.500.12080/44572
dc.description.abstract COVID-19 is responsible for high mortality, but robust machine learning-based predictors of mortality are lacking. To generate a model for predicting mortality in patients hospitalized with COVID-19 using Gradient Boosting Decision Trees (GBDT). The Spanish SEMI-COVID-19 registry includes 24,514 pseudo-anonymized cases of patients hospitalized with COVID-19 from 1 February 2020 to 5 December 2021. This registry was used as a GBDT machine learning model, employing the CatBoost and BorutaShap classifier to select the most relevant indicators and generate a mortality prediction model by risk level, ranging from 0 to 1. The model was validated by separating patients according to admission date, using the period 1 February to 31 December 2020 (first and second waves, pre-vaccination period) for training, and 1 January to 30 November 2021 (vaccination period) for the test group. An ensemble of ten models with different random seeds was constructed, separating 80% of the patients for training and 20% from the end of the training period for cross-validation. The area under the receiver operating characteristics curve (AUC) was used as a performance metric. Clinical and laboratory data from 23,983 patients were analyzed. CatBoost mortality prediction models achieved an AUC performance of 84.76 (standard deviation 0.45) for patients in the test group (potentially vaccinated patients not included in model training) using 16 features. The performance of the 16-parameter GBDT model for predicting COVID-19 hospital mortality, although requiring a relatively large number of predictors, shows a high predictive capacity. es_ES
dc.format application/pdf es_ES
dc.language eng es_ES
dc.rights CC-BY es_ES
dc.rights.uri http://creativecommons.org/licenses/by/4.0/deed.es es_ES
dc.source Internal and emergency medicine es_ES
dc.title Improving prediction of COVID-19 mortality using machine learning in the Spanish SEMI-COVID-19 registry es_ES
dc.type info:eu-repo/semantics/article es_ES
dc.rights.accessrights info:eu-repo/semantics/closedAccess es_ES
dc.identifier.location N/A es_ES


Files in this item

This item appears in the following Collection(s)

Show simple item record

CC-BY Except where otherwise noted, this item's license is described as CC-BY

Search DSpace


Browse

My Account

Social Media