Deep learning can generate real insights into how proteins interact

Scientist are now combining recent advances in evolutionary analysis and deep learning to build three-dimensional models of how most proteins in eukaryotes interact. (Eukaryotes are organisms whose cells have a membrane-bound nucleus to hold genetic materials.)

The research effort has implications for understanding the biochemical processes that are common to all animals, plants, and fungi. The open-access work appears Nov. 11 in Science.

As part of a multi-institutional collaboration, the lab of David Baker at the UW Medicine Institute for Protein Design helped guide this new development.

To really understand the cellular conditions that give rise to health and disease, it's essential to know how different proteins in a cell work together. In this paper, we provide detailed information on protein interactions for nearly every core process in eukaryotic cells. This includes over a hundred interactions that have never been seen before."

David Baker, UW Medicine Institute

Proteins, the workhorses of all cells, rarely act alone. Different proteins often must fit together to form precise complexes that carry out specific tasks. These can include reading genes, digesting nutrients, and responding to signals from neighboring cells and the outside world. When protein complexes malfunction, disease can result.

"This work shows that deep learning can now generate real insights into decades-old questions in biology -; not just what a particular protein looks like, but also which proteins come together to interact," said senior author Qian Cong, assistant professor of biophysics at the University of Texas Southwestern Medical Center.

To exhaustively map the interactions that give rise to protein complexes, a team of structural biologists from UW Medicine, University of Texas Southwestern Medical Center, Harvard University, and several other institutions examined all known gene sequences in yeast. Using advanced statistical analyses, they identified pairs of genes that naturally acquire mutations in a linked fashion. They reasoned that such shared mutations are a sign that the proteins the genes encode must physically interact.

The researchers also used new deep-learning software to model the three-dimensional shapes of these interacting proteins. RoseTTAFold, invented at UW Medicine, and AlphaFold, invented by the Alphabet subsidiary DeepMind, were both tapped to generate hundreds of detailed pictures of protein complexes.

"As computer methods become more powerful, it is easier than ever to generate large amounts of scientific data, but to make sense of it still requires scientific experts," said Baker, who is a University of Washington School of Medicine professor of biochemistry. "So we recruited a village of expert biologists to interpret our 3D protein models. This is community science at its best."

The hundreds of newly identified protein complexes provide rich insights into how cells function. For example, one complex contains the protein RAD51, which is known to play a key role in DNA repair and cancer progression in humans. Another includes the poorly understood enzyme glycosylphosphatidylinositol transamidase, which has been implicated in neurodevelopmental disorders and cancer in humans. Understanding how these and other proteins interact may open the door to the development of new medications for a wide range of health disorders.

The protein structures generated in this work are available to download from the ModelArchive. The researchers thank and remember the late John Westbrook at the Protein Data Bank for his support in establishing formats and software code to allow efficient deposition of the models into the archive. The Science paper reporting the results was in preparation at the time of Westbrook's death.

The project on computed structures of core eukaryotic proteins complexes was led by Ian Humphrey, Aditya Krishnakumar, and Minkyung Baek, all at UW Medicine, as well as Jimin Pei at the University of Texas Southwestern Medical Center. Collaborating institutions include UW Medicine, UT Southwestern, Harvard University, Wayne State University, Cornell University, MRC Laboratory of Molecular Biology, Memorial Sloan Kettering Cancer Center, Gerstner Sloan Kettering Graduate School of Biomedical Sciences, Fred Hutchinson Cancer Research Center, Columbia University, University of Würzburg, St Jude Children's Research Hospital, FIRC Institute of Molecular Oncology, and Istituto di Genetica Molecolare, Consiglio Nazionale delle Ricerche.

Source:
Journal reference:

Humphreys, I.R., et al. (2021) Computed structures of core eukaryotic protein complexes. Science. doi.org/10.1126/science.abm4805.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of AZoLifeSciences.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
New Synthetic Biology Tool Improves Genetic Circuit Integration and Functionality