New AI Model Predicts Protein-DNA Binding Specificity with High Accuracy

A new artificial intelligence model developed by USC researchers and published in Nature Methods can predict how different proteins may bind to DNA with accuracy across different types of protein, a technological advance that promises to reduce the time required to develop new drugs and other medical treatments.

The tool, called Deep Predictor of Binding Specificity (DeepPBS), is a geometric deep learning model designed to predict protein–DNA binding specificity from protein–DNA complex structures. DeepPBS allows scientists and researchers to input the data structure of a protein–DNA complex into an online computational tool.

Structures of protein–DNA complexes contain proteins that are usually bound to a single DNA sequence. For understanding gene regulation, it is important to have access to the binding specificity of a protein to any DNA sequence or region of the genome. DeepPBS is an AI tool that replaces the need for high-throughput sequencing or structural biology experiments to reveal protein–DNA binding specificity."

Remo Rohs, professor and founding chair in the Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences

AI Analyzes, Predicts Protein–DNA Structures

DeepPBS employs a geometric deep learning model, a type of machine-learning approach that analyzes data using geometric structures. The AI tool was designed to capture the chemical properties and geometric contexts of protein–DNA to predict binding specificity.

Using this data, DeepPBS produces spatial graphs that illustrate protein structure and the relationship between protein and DNA representations. DeepPBS can also predict binding specificity across various protein families, unlike many existing methods that are limited to one family of proteins.

"It is important for researchers to have a method available that works universally for all proteins and is not restricted to a well-studied protein family. This approach allows us also to design new proteins," Rohs said.

Major Advance in Protein-Structure Prediction

The field of protein-structure prediction has advanced rapidly since the advent of DeepMind's AlphaFold, which can predict protein structure from sequence. These tools have led to an increase in structural data available to scientists and researchers for analysis. DeepPBS works in conjunction with structure prediction methods for predicting specificity for proteins without available experimental structures.

Rohs said the applications of DeepPBS are numerous. This new research method may lead to accelerating the design of new drugs and treatments for specific mutations in cancer cells, as well as lead to new discoveries in synthetic biology and applications in RNA research.

About the study: In addition to Rohs, other study authors include Raktim Mitra of USC; Jinsen Li of USC; Jared Sagendorf of University of California, San Francisco; Yibei Jiang of USC; Ari Cohen of USC; and Tsu-Pei Chiu of USC; as well as Cameron Glasscock of the University of Washington.

This research was primarily supported by NIH grant R35GM130376.

Source:
Journal reference:

Mitra, R., et al. (2024). Geometric deep learning of protein–DNA binding specificity. Nature Methods. doi.org/10.1038/s41592-024-02372-w.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoLifeSciences.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Advancing Sequential DNA Computing for Programmable Devices