Item

Multilevel functional distributional models with applications to continuous glucose monitoring in diabetes clinical trials

Matabuena, Marcos
Crainiceanu, Ciprian M
Department
Epidemiology
Type
Journal article
Language
English
Abstract
Continuous glucose monitoring (CGM) is a minimally invasive technology that measures blood glucose every few minutes for weeks or months at a time. CGM data are often collected in the free-living environment and are strongly related to sleep, physical activity, and meal intake. As the timing of these activities varies substantially within and between individuals, it is difficult to model CGM trajectories as a function of time of day. Therefore, in practice, CGM trajectories are often reduced to one or two scalar summaries of the thousands of measurements collected for a study participant. To alleviate the potential loss of information, the cumulative distribution function (cdf) of the CGM time series was proposed as an alternative. Here we address the problem of conducting inference on cdfs in clinical trials with long follow-up and frequent measurements. Our approach provides three major innovations: (1) modeling the entire cdf and preserving its monotonicity, (2) accounting for the cdfs' correlation (because they are measured on the same individual), continuity (results are robust to the choice of the probability grid), and differential error (e.g., medians have lower variability than 0.99 quantiles), and (3) preserving the familywise error when the observed data are longitudinal samples of cdfs. We focus on modeling data collected by The Juvenile Diabetes Research Foundation Continuous Glucose Monitoring Group in a large clinical trial that collected CGM data every few minutes for 26 weeks. Our basic observation unit is the distribution of CGM observations in a four-week interval. The resulting data structure is multilevel (because each individual has multiple months of data) and distributional (because the data for each four-week interval are represented as a cdf).
The scientific goals are to: (1) identify and quantify the effects of factors that affect glycaemic control in patients with type 1 diabetes (T1D) and (2) identify and characterize the patients who respond to treatment.
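As a rough illustration of the multilevel distributional data structure described in the abstract, the sketch below reduces one participant's CGM readings to one empirical quantile curve per four-week interval, evaluated on a fixed probability grid. It is not the authors' model or code: the function name `quantile_curves`, the 5-minute sampling rate, the probability grid, and the synthetic glucose values are all assumptions made for the example.

```python
import numpy as np

def quantile_curves(times_days, glucose, window_days=28, probs=None):
    """Summarize CGM readings as one empirical quantile curve per window.

    times_days : measurement times in days since the start of follow-up
    glucose    : glucose readings (same length as times_days)
    Returns the probability grid and a dict {window index: quantile curve}.
    """
    if probs is None:
        # Hypothetical probability grid; any increasing grid in (0, 1) works.
        probs = np.linspace(0.01, 0.99, 99)
    times_days = np.asarray(times_days, dtype=float)
    glucose = np.asarray(glucose, dtype=float)

    # Assign every reading to its four-week interval (0, 1, 2, ...).
    window = np.floor(times_days / window_days).astype(int)

    curves = {}
    for w in np.unique(window):
        values = glucose[window == w]
        # Empirical quantiles are non-decreasing in the probability level.
        curves[int(w)] = np.quantile(values, probs)
    return probs, curves

# Synthetic data (an assumption): one reading every 5 minutes
# (288 per day) for two four-week intervals.
rng = np.random.default_rng(0)
times = np.arange(2 * 28 * 288) / 288.0
glucose = rng.normal(150.0, 40.0, size=times.size).clip(40.0, 400.0)

probs, curves = quantile_curves(times, glucose)
for w, curve in curves.items():
    median = curve[np.argmin(np.abs(probs - 0.5))]
    print(f"interval {w}: median glucose = {median:.1f}")
```

Each participant then contributes several curves, one per interval, which is the sense in which the data are both multilevel and distributional. The raw empirical curves are monotone by construction; the abstract's concern is keeping that property after modeling, which this sketch does not address.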
Citation
M. Matabuena, C.M. Crainiceanu, "Multilevel functional distributional models with applications to continuous glucose monitoring in diabetes clinical trials," The Annals of Applied Statistics, vol. 20, no. 1, pp. 476-495, 2026, https://doi.org/10.1214/26-aoas2139.
Source
The Annals of Applied Statistics
Keywords
49 Mathematical Sciences, 4905 Statistics
Publisher
Institute of Mathematical Statistics