Loading...
Thumbnail Image
Item

SGAC: a graph neural network framework for imbalanced and structure-aware AMP classification

Wang, Yingxu
Liang, Victor
Yin, Nan
Liu, Siwei
Segal, Eran
Supervisor
Department
Computational Biology
Embargo End Date
Type
Journal article
Date
License
http://creativecommons.org/licenses/by/4.0/
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Classifying antimicrobial peptides (AMPs) from the vast collection of peptides derived from metagenomic sequencing offers a promising avenue for combating antibiotic resistance. However, most existing AMP classification methods rely primarily on sequence-based representations and fail to capture the spatial structural information critical for accurate identification. Although recent graph-based approaches attempt to incorporate structural information, they typically construct residue- or atom-level graphs that introduce redundant atomic details and increase structural complexity. Furthermore, the class imbalance between the small number of known AMPs and the abundant non-AMPs significantly hinders predictive performance. To address these challenges, we employ lightweight OmegaFold to predict the 3D structures of peptides and construct peptide graphs using C$_\alpha $ atoms to capture their backbone geometry and spatial topology. Building on this representation, we propose the spatial graph neural network (GNN)-based AMP classifier (SGAC), a novel framework that leverages GNNs to extract structural features and generate discriminative graph representations. To handle class imbalance, SGAC incorporates weight-enhanced contrastive learning to cluster structurally similar peptides and separate dissimilar ones through adaptive weighting, and applies weight-enhanced pseudo-label distillation to generate high-confidence pseudo labels for unlabeled samples, achieving balanced and consistent representation learning. Experiments on publicly available AMP and non-AMP datasets demonstrate that SGAC significantly achieves state-of-the-art performance compared to baselines. The complete code and dataset are available at: https://github.com/wyxwyx46941930/SGAC.
Citation
Y. Wang, V. Liang, N. Yin, S. Liu, E. Segal, "SGAC: a graph neural network framework for imbalanced and structure-aware AMP classification," Briefings in Bioinformatics, vol. 27, no. 1, pp. bbag038-, 2026, https://doi.org/10.1093/bib/bbag038.
Source
Briefings in Bioinformatics
Conference
Keywords
31 Biological Sciences, 3101 Biochemistry and Cell Biology, 3102 Bioinformatics and Computational Biology, 3105 Genetics, Algorithms, Antimicrobial Peptides, Computational Biology, Graph Neural Networks, Neural Networks, Computer
Subjects
Source
Publisher
Oxford University Press
Full-text link