Multidimensional scaling improves distance-based clustering for microbiome data
Chen, Guanhua ; Wang, Xinyue ; Sun, Qiang ; Tang, Zheng-Zheng
Department
Statistics and Data Science
Type
Journal article
Date
2025
Language
English
Abstract
Motivation: Clustering patients into subgroups based on their microbial compositions can greatly enhance our understanding of the role of microbes in human health and disease etiology. Distance-based clustering methods, such as partitioning around medoids (PAM), are popular due to their computational efficiency and absence of distributional assumptions. However, the performance of these methods can be suboptimal when true cluster memberships are driven by differences in the abundance of only a few microbes, a situation known as the sparse signal scenario.

Results: We demonstrate that classical multidimensional scaling (MDS), a widely used dimensionality reduction technique, effectively denoises microbiome data and enhances the clustering performance of distance-based methods. We propose a two-step procedure that first applies MDS to project high-dimensional microbiome data into a low-dimensional space, followed by distance-based clustering using the low-dimensional data. Our extensive simulations demonstrate that our procedure offers superior performance compared to directly conducting distance-based clustering under the sparse signal scenario. The advantage of our procedure is further showcased in several real data applications.

Availability and implementation: The R package MDSMClust is available at https://github.com/wxy929/MDS-project.
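The two-step procedure in the abstract (classical MDS, then distance-based clustering on the low-dimensional coordinates) can be illustrated with a minimal Python sketch. This is not the MDSMClust package or its API: the function names (`classical_mds`, `k_medoids`, `mds_then_cluster`), the parameters `n_dims` and `n_clusters`, the Euclidean distance in the reduced space, and the simplified alternating k-medoids routine with a deterministic farthest-first start (used here in place of a full PAM implementation) are all assumptions made for illustration.

```python
import numpy as np


def classical_mds(D, k):
    """Classical (Torgerson) MDS: embed an n x n distance matrix D in k dimensions."""
    n = D.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n        # centering matrix
    B = -0.5 * J @ (D ** 2) @ J                # double-centered Gram matrix
    vals, vecs = np.linalg.eigh(B)             # eigenvalues in ascending order
    idx = np.argsort(vals)[::-1][:k]           # indices of the k largest eigenvalues
    vals_k = np.clip(vals[idx], 0.0, None)     # non-Euclidean distances can yield negatives
    return vecs[:, idx] * np.sqrt(vals_k)


def pairwise_euclidean(X):
    """Euclidean distance matrix between the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def k_medoids(D, n_clusters, n_iter=100):
    """Alternating k-medoids on a distance matrix (a simplified stand-in for PAM)."""
    # Assumed initialization: most central point first, then farthest-first.
    medoids = [int(np.argmin(D.sum(axis=0)))]
    while len(medoids) < n_clusters:
        medoids.append(int(np.argmax(D[:, medoids].min(axis=1))))
    medoids = np.array(medoids)
    for _ in range(n_iter):
        labels = np.argmin(D[:, medoids], axis=1)          # assign to nearest medoid
        new_medoids = medoids.copy()
        for c in range(n_clusters):
            members = np.flatnonzero(labels == c)
            if members.size:
                within = D[np.ix_(members, members)].sum(axis=0)
                new_medoids[c] = members[np.argmin(within)]  # most central member
        if np.array_equal(new_medoids, medoids):
            break
        medoids = new_medoids
    labels = np.argmin(D[:, medoids], axis=1)
    return labels, medoids


def mds_then_cluster(D, n_dims, n_clusters):
    """Step 1: MDS on the original distances. Step 2: cluster the embedded points."""
    Y = classical_mds(D, n_dims)
    return k_medoids(pairwise_euclidean(Y), n_clusters)
```

Here `D` would be any sample-by-sample distance matrix computed from the microbiome profiles; which distance to use and how many MDS dimensions to keep are choices the sketch leaves to the caller.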
Citation
G. Chen, X. Wang, Q. Sun, and Z. Z. Tang, “Multidimensional scaling improves distance-based clustering for microbiome data,” Bioinformatics, vol. 41, no. 2, Feb. 2025, doi: 10.1093/BIOINFORMATICS/BTAF042
Source
Bioinformatics
Publisher
Oxford University Press
