Search
Close this search box.

Highly accurate carbohydrate-binding site prediction with DeepGlycanSite

Author(s)

X. He, L. Zhao, Y. Tian, R. Li, Q. Chu, Z. Gu, M. Zheng, Y. Wang, S. Li, H. Jiang, Y. Jiang, L. Wen, D. Wang & X. Cheng

Sources

Nature Communications | ( 2024) 15:5163 https://doi.org/10.1038/s41467-024-49516-2

Understanding how carbohydrates regulate proteins in physiological and pathological processes provides opportunities to address key biological problems and develop new therapeutics. The diversity and complexity of carbohydrates pose a challenge in experimentally identifying the sites where carbohydrates bind to and act on proteins. The authors present a deep learning model, DeepGlycanSite, that can accurately predict carbohydrate binding sites on a given protein structure. By incorporating geometric and evolutionary features of proteins into a deep equivariant graph neural network with the transformer architecture, DeepGlycanSite remarkably outperforms previous state-of-the-art methods and effectively predicts binding sites for diverse carbohydrates. When integrated with a mutagenesis study, DeepGlycanSite reveals an important G protein-coupled receptor’s guanosine 5′-diphosphate sugar recognition site. These results demonstrate that DeepGlycanSite is an invaluable tool for predicting carbohydrate binding sites and could provide insights into the molecular mechanisms underlying the carbohydrate regulation of therapeutically of therapeutically important proteins.

Representative carbohydrate-binding protein structures showing monosaccharide-, disaccharide-, oligosaccharide-, sugar nucleotide- and glycolipid-binding sites (PDB codes: 1E8U, 4FQZ, 6MGL, 6H21 and 2BV7). Carbohydrates are displayed as sticks. Proteins are shown in cartoons and surface depict. Carbohydrate-binding sites are colored green

(*) The official implementation of DeepGlycanSite, a state-of-the-art method for predicting carbohydrate binding sites, is available at https://github.com/xichengeva/DeepGlycanSite.This repository contains all the code, instructions and model weights needed to run the method or to retrain a model.

Latest news

DIONYSUS is a database of protein-carbohydrate interfaces annotated according to proteins and carbohydrates’ structural, chemical...

Instruct-ERIC, ”the European Research Infrastructure Consortium for Structural biology research”, is a pan-European distributed research...

Computer-based tools for visualizing and manipulating molecular structures in real-time hold immense potential for accelerating...

Glycan-mediated interactions are crucial in biology and medicine, influencing signalling, immune responses, and disease pathogenesis....