Loading...
NutriMatch: harmonizing food composition databases with large language models for enhanced nutritional prediction
Jankelow, Adam ; Godneva, Anastasia ; Rein, Michal ; Samocha-Bonet, Dorit ; Weissglas-Volkov, Daphna ; Zohar, Shahar ; Shor, Tal ; Segal, Eran
Jankelow, Adam
Godneva, Anastasia
Rein, Michal
Samocha-Bonet, Dorit
Weissglas-Volkov, Daphna
Zohar, Shahar
Shor, Tal
Segal, Eran
Files
Loading...
s44482-025-00001-7.pdf
Adobe PDF, 2.08 MB
Supervisor
Department
Computational Biology
Embargo End Date
Type
Journal article
Date
License
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Missing and inconsistent nutrient values in food-composition databases hinder comparative nutrition research. We present NutriMatch, a scalable harmonization method that embeds food descriptions with a large-language model, aligns nutritionally equivalent items, and imputes missing nutrients to enrich FCDB coverage. Applied to the Israeli Human Phenotype Project, NutriMatch expanded logged nutrient profiles from 21 to 151 nutrients; in the Australian PREDICT cohort, coverage rose from 43 to the same 151 nutrients using a validated external FCDB. Enriched nutrient sets improved 2-year obesity prediction (AUC 0.63 → 0.67) and increased median R² for body-fat percentage, waist circumference, continuous glucose monitoring, and several blood biomarkers. Models trained in HPP transferred to PREDICT without retraining, achieving Pearson r = 0.32 for visceral fat mass versus 0.24 with baseline nutrients. NutriMatch thus delivers rapid, reproducible harmonization of FCDBs, phenotype-informative nutrient enrichment, and enables robust cross-cohort studies.
Citation
Jankelow, A., Godneva, A., Rein, M., Samocha-Bonet, D., Weissglas-Volkov, D., Zohar, S., Shor, T., Segal, E. (2026). NutriMatch: harmonizing food composition databases with large language models for enhanced nutritional prediction. npj Digital Public Health, 1(1), 1-1. https://doi.org/10.1038/s44482-025-00001-7
Source
npj Digital Public Health
Conference
Keywords
Biomedical and Clinical Sciences, Nutrition and Dietetics, Zero Hunger
Subjects
Source
Publisher
Springer Nature
