Developing a Machine Learning-Based Cardiometabolic Disease Model for Predicting Liver Disease

Miranda Moirangthem; Hillul Chutia; Nagamani Selvaraman; Romi Wahengbam

Developing a Machine Learning-Based Cardiometabolic Disease Model for Predicting Liver Disease

Authors

Miranda Moirangthem Biological Sciences and Technology Division, CSIR-North-East Institute of Science and Technology, Jorhat, Assam-785006, India Author
Hillul Chutia Advanced Computation and Data Sciences Division, CSIR-North-East Institute of Science and Technology, Jorhat, Assam-785006, India Author
Nagamani Selvaraman Faculty of Biological Sciences, Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh-201002, India Author
Romi Wahengbam Faculty of Biological Sciences, Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh-201002, India Author

Abstract

Cardiometabolic diseases, which are a leading cause of global mortality, are interconnected metabolic and cardiovascular disorders that include diabetes, MASLD and ischemic heart diseases. Predicting disease may help in its early diagnosis and treatment. Cohort studies are crucial in cardiometabolic disease research as it can give significant insight into disease demographics, prevalence and its prediction. Here, we utilise the data of a national longitudinal cohort study to investigate and predict liver disease. Clinical and anthropometric data of Phenome India Cohort $(n=207)$ were analysed and divided into subgroups based on the status of hepatic steatosis and fibrosis. Sixteen key metadata, including liver enzyme, renal, FibroScan and anthropometric parameters were used for initial model development, and eight parameters were identified using forward and recursive feature selection. Seven machine learning (ML) algorithms, namely Random Forest, XGBoost, CatBoost, SVM, Logistic Regression, Na"ive Bayes, and Neural Network, were trained on the new parameters, and data was split into training (75%) and testing (25%) sets. Models using all 16 features tended to overfit, achieving perfect performance on the training set but lower generalisation on the testing set. Feature reduction to eight resulted in a simpler model with similar performance. SVM provided the most desirable test performance among the seven algorithms achieving balance between sensitivity and specificity (accuracy 0.738, sensitivity 0.857, specificity 0.500, F1-score 0.814, ROC-AUC 0.724; 5-fold cross-validated accuracy 0.710 and ROC-AUC 0.741). Adjusting the decision threshold between 0.55 and 0.80 led to lower sensitivity at lower thresholds and high sensitivity at higher thresholds. The application of ML algorithms to clinical metadata can help in the prediction of liver disease.

Downloads

Published

2026-01-20

Issue

Vol. 8 No. 1 (2026): Abstracts of International Conference on Advances in Multidisciplinary Sciences and Engineering 2026

Section

Abstracts

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

Author(s) retains full copyright of their abstract and grants non-exclusive publishing right to AIJR Abstracts and its publisher "AIJR (India)". Author(s) can archive pre-print, post-print, and published version/PDF to any open access, institutional repository, social media, or personal website provided that the published source must be acknowledged with citation and link to the publisher version.
Click here for more information on Copyright policy
Click here for more information on Licensing policy

How to Cite

[1]

Miranda Moirangthem, Hillul Chutia, Nagamani Selvaraman, and Romi Wahengbam, “Developing a Machine Learning-Based Cardiometabolic Disease Model for Predicting Liver Disease”, AIJR Abs., vol. 8, no. 1, p. 71, Jan. 2026, Accessed: Jul. 23, 2026. [Online]. Available: https://abstracts.aijr.org/index.php/abs/article/view/210

Download Citation