Highly accurate carbohydrate-binding site prediction with DeepGlycanSite

Author(s)

X. He, L. Zhao, Y. Tian, R. Li, Q. Chu, Z. Gu, M. Zheng, Y. Wang, S. Li, H. Jiang, Y. Jiang, L. Wen, D. Wang & X. Cheng

Sources

Nature Communications | ( 2024) 15:5163 https://doi.org/10.1038/s41467-024-49516-2

Understanding how carbohydrates regulate proteins in physiological and pathological processes provides opportunities to address key biological problems and develop new therapeutics. The diversity and complexity of carbohydrates pose a challenge in experimentally identifying the sites where carbohydrates bind to and act on proteins. The authors present a deep learning model, DeepGlycanSite, that can accurately predict carbohydrate binding sites on a given protein structure. By incorporating geometric and evolutionary features of proteins into a deep equivariant graph neural network with the transformer architecture, DeepGlycanSite remarkably outperforms previous state-of-the-art methods and effectively predicts binding sites for diverse carbohydrates. When integrated with a mutagenesis study, DeepGlycanSite reveals an important G protein-coupled receptor’s guanosine 5′-diphosphate sugar recognition site. These results demonstrate that DeepGlycanSite is an invaluable tool for predicting carbohydrate binding sites and could provide insights into the molecular mechanisms underlying the carbohydrate regulation of therapeutically of therapeutically important proteins.

Representative carbohydrate-binding protein structures showing monosaccharide-, disaccharide-, oligosaccharide-, sugar nucleotide- and glycolipid-binding sites (PDB codes: 1E8U, 4FQZ, 6MGL, 6H21 and 2BV7). Carbohydrates are displayed as sticks. Proteins are shown in cartoons and surface depict. Carbohydrate-binding sites are colored green

(*) The official implementation of DeepGlycanSite, a state-of-the-art method for predicting carbohydrate binding sites, is available at https://github.com/xichengeva/DeepGlycanSite.This repository contains all the code, instructions and model weights needed to run the method or to retrain a model.

Latest news

 The carbohydrate 3D structure-prediction tools (builders) at GLYCAM-Web (glycam.org) are widely used for generating experimentally-consistent...

Glycosylation, the key reaction in synthetic glycochemistry, remains entirely unpredictable due to its complex mechanism...

The ‘taste of sweetness’ is one of the most iconic and universally recognised human sensory...

recognize sialic acid residues on cell surfaces. Pathogens and tumor cells exploit Siglecs to evade...