Search
Close this search box.

Highly accurate carbohydrate-binding site prediction with DeepGlycanSite

Author(s)

X. He, L. Zhao, Y. Tian, R. Li, Q. Chu, Z. Gu, M. Zheng, Y. Wang, S. Li, H. Jiang, Y. Jiang, L. Wen, D. Wang & X. Cheng

Sources

Nature Communications | ( 2024) 15:5163 https://doi.org/10.1038/s41467-024-49516-2

Understanding how carbohydrates regulate proteins in physiological and pathological processes provides opportunities to address key biological problems and develop new therapeutics. The diversity and complexity of carbohydrates pose a challenge in experimentally identifying the sites where carbohydrates bind to and act on proteins. The authors present a deep learning model, DeepGlycanSite, that can accurately predict carbohydrate binding sites on a given protein structure. By incorporating geometric and evolutionary features of proteins into a deep equivariant graph neural network with the transformer architecture, DeepGlycanSite remarkably outperforms previous state-of-the-art methods and effectively predicts binding sites for diverse carbohydrates. When integrated with a mutagenesis study, DeepGlycanSite reveals an important G protein-coupled receptor’s guanosine 5′-diphosphate sugar recognition site. These results demonstrate that DeepGlycanSite is an invaluable tool for predicting carbohydrate binding sites and could provide insights into the molecular mechanisms underlying the carbohydrate regulation of therapeutically of therapeutically important proteins.

Representative carbohydrate-binding protein structures showing monosaccharide-, disaccharide-, oligosaccharide-, sugar nucleotide- and glycolipid-binding sites (PDB codes: 1E8U, 4FQZ, 6MGL, 6H21 and 2BV7). Carbohydrates are displayed as sticks. Proteins are shown in cartoons and surface depict. Carbohydrate-binding sites are colored green

(*) The official implementation of DeepGlycanSite, a state-of-the-art method for predicting carbohydrate binding sites, is available at https://github.com/xichengeva/DeepGlycanSite.This repository contains all the code, instructions and model weights needed to run the method or to retrain a model.

Latest news

Despite ground-breaking innovations in experimental structural biology and protein structure prediction techniques, capturing the structure...

Accurate computational simulations of protein−glycan dynamics are crucial, critically relying on the appropriate parameters, including...

The intricate nature of carbohydrate structures has led the scientific community to seek efficient protocols...

Instruct-ERIC, ”the European Research Infrastructure Consortium for Structural biology research”, is a pan-European distributed research...