Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models
Yu, Qinze ; Liu, Hongbin ; Shi, Haimei ; Abdrakhmanov, Yerzhan ; Shen, Junbo ; Zhang, Chunhe ; Dong, Zhihang ; Zong, Licheng ; Si, Longlong ; Dai, Lei ... show 1 more
Yu, Qinze
Liu, Hongbin
Shi, Haimei
Abdrakhmanov, Yerzhan
Shen, Junbo
Zhang, Chunhe
Dong, Zhihang
Zong, Licheng
Si, Longlong
Dai, Lei
Supervisor
Department
Computational Biology
Embargo End Date
Type
Journal article
Date
License
Language
English
Collections
Research Projects
Organizational Units
Journal Issue
Abstract
Identifying evolutionarily remote antimicrobial peptides (AMPs) is crucial for discovering underexplored clinical candidates to combat antibiotic resistance. Existing experimental and computational methods are limited by their reliance on sequence identity to known AMPs, missing distant homologues. Here we introduce HMD-AMP, a protein language model-based approach for AMP discovery. HMD-AMP outperforms previous methods in identifying evolutionarily distant AMPs and enables the discovery of unknown and highly potent AMPs from metagenomic data. Applied to host and gut microorganism genomes of nine mammals, HMD-AMP revealed over 37 million predicted AMPs. Of 91 high-confidence sequences experimentally validated, 74 showed strong antibacterial activity and 48 were evolutionarily remote from known AMPs. Four of these AMPs exhibited broad-spectrum antibacterial activity at low effective concentrations and showed low toxicity, with the most potent peptide demonstrating therapeutic efficacy in a mouse model of peritoneal Escherichia coli infection. This study introduces an effective strategy to uncover AMPs.
Citation
Q. Yu, H. Liu, H. Shi, Y. Abdrakhmanov, J. Shen, C. Zhang , et al., "Uncovering evolutionarily remote and highly potent antimicrobial peptides with protein language models," Nature Biomedical Engineering, pp. 1-14, 2026, https://doi.org/10.1038/s41551-026-01630-w.
Source
Nature Biomedical Engineering
Conference
Keywords
40 Engineering, 4003 Biomedical Engineering
Subjects
Source
Publisher
Springer Nature
