A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities

Author(s)

D. Bojar, L. Meche, G. Meng, W. Eng, D.F. Smith, R.D. Cummings & L.K. Mahal

Sources

ACS Chemical Biology https://doi.org/10.1021/acschembio.1c00689

Tools for studying glycans are evolving rapidly; however, most of the present knowledge still depends heavily on binding by glycan-binding proteins (e.g., lectins). Lectin specificities have not always been well defined, which makes it difficult to exploit their full potential for glycan analysis. The authors combine machine learning algorithms with expert annotation to define the specificity of commercially available lectins, an important probe set. The investigation relies on comprehensive glycan microarray analysis of these lectins, obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5, made public in 2011).
The authors report the creation of this data set and its use in the large-scale evaluation of lectin-glycan binding behaviors. The motif analysis integrates 68 manually defined glycan features with a systematic computational search for rules that describe significant binding motifs, based on mono- and disaccharides and linkages. By combining machine learning with manual annotation, the authors provide a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc.
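The idea of scoring annotated glycan features against array binding signal can be illustrated with a minimal sketch. This is not the authors' pipeline: it replaces their machine learning and significance testing with a plain difference in mean signal, and the feature names, the glycan identifiers, the signal values, and the functions `motif_effect` and `rank_features` are all hypothetical and chosen only for illustration.

```python
from statistics import mean

# Hypothetical toy data: (glycan id, annotated features, binding signal).
# All values are invented for illustration and are NOT taken from the paper.
TOY_ARRAY = [
    ("glycan_1", {"terminal_mannose"}, 9200.0),
    ("glycan_2", {"terminal_mannose", "core_fucose"}, 8700.0),
    ("glycan_3", {"terminal_galactose"}, 300.0),
    ("glycan_4", {"sialic_acid"}, 150.0),
    ("glycan_5", {"terminal_galactose", "core_fucose"}, 420.0),
    ("glycan_6", {"terminal_mannose"}, 7900.0),
]


def motif_effect(array, feature):
    """Mean signal of glycans carrying `feature` minus mean signal of the rest.

    Returns None when either group is empty, because no contrast can be made.
    """
    with_feature = [sig for _, feats, sig in array if feature in feats]
    without_feature = [sig for _, feats, sig in array if feature not in feats]
    if not with_feature or not without_feature:
        return None
    return mean(with_feature) - mean(without_feature)


def rank_features(array):
    """Rank every annotated feature by its effect, strongest positive first."""
    all_features = set().union(*(feats for _, feats, _ in array))
    scored = [(f, motif_effect(array, f)) for f in all_features]
    scored = [(f, effect) for f, effect in scored if effect is not None]
    return sorted(scored, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for feature, effect in rank_features(TOY_ARRAY):
        print(f"{feature}: {effect:.1f}")
```

On this toy input the feature with the largest positive effect is the one shared by the high-signal glycans, which is the kind of candidate that an expert annotator would then inspect. A real analysis would also need a proper statistical test and correction for the many features tested.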
