Nanotechnology Now

Our NanoNews Digest Sponsors
Heifer International



Home > Press > Machine learning reveals recipe for building artificial proteins

A ribbon model of a protein.
istockphoto.com
A ribbon model of a protein. istockphoto.com

Abstract:
Proteins are essential to the life of cells, carrying out complex tasks and catalyzing chemical reactions. Scientists and engineers have long sought to harness this power by designing artificial proteins that can perform new tasks, like treat disease, capture carbon, or harvest energy, but many of the processes designed to create such proteins are slow and complex, with a high failure rate.

Machine learning reveals recipe for building artificial proteins

Chicago, IL | Posted on July 24th, 2020

In a breakthrough that could have implications across the healthcare, agriculture, and energy sectors, a team lead by researchers in the Pritzker School of Molecular Engineering (PME) at the University of Chicago has developed an artificial intelligence-led process that uses big data to design new proteins.

By developing machine-learning models that can review protein information culled from genome databases, the researchers found relatively simple design rules for building artificial proteins. When the team constructed these artificial proteins in the lab, they found that they performed chemistries so well that they rivaled those found in nature.

"We have all wondered how a simple process like evolution can lead to such a high-performance material as a protein," said Rama Ranganathan, Joseph Regenstein Professor in the Department of Biochemistry and Molecular Biology, Pritzker Molecular Engineering, and the College. "We found that genome data contains enormous amounts of information about the basic rules of protein structure and function, and now we've been able to bottle nature's rules to create proteins ourselves."

The results were published July 24 in the journal Science.

Using artificial intelligence to learn design rules

Proteins are made up of hundreds or thousands of amino acids, and these amino acid sequences specify the protein's structure and function. But understanding just how to build these sequences to create novel proteins has been challenging. Past work has resulted in methods that can specify structure, but function has been more elusive.

What Ranganathan and his collaborators realized over the past 15 years is that genome databases--which are growing exponentially--contain enormous amounts of information about the basic rules of protein structure and function. His group developed mathematical models based on this data and then began using machine-learning methods to reveal new information about proteins' basic design rules.

For this research, they studied the chorismate mutase family of metabolic enzymes, a type of protein that is important for life in many bacteria, fungi, and plants. Using machine-learning models, the researchers were able to reveal the simple design rules behind these proteins.

The model shows that just conservation at amino acid positions and correlations in the evolution of pairs of amino acids are sufficient to predict new artificial sequences that would have the properties of the protein family.

"We generally assume that to build something, you have to first deeply understand how it works," Ranganathan said. "But if you have enough data examples, you can use deep learning methods to learn the rules of design, even as you are understanding how it works or why it's built that way."

He and his collaborators then created synthetic genes to encode for the proteins, cloned them into bacteria, and watched as the bacteria then made the synthetic proteins using their normal cellular machinery. They found that the artificial proteins had the same catalytic function as the natural chorismate mutase proteins.

A platform to understand other complex systems

Because the design rules are so relatively simple, the number of artificial proteins that researchers could potentially create with them is extremely large.

"The constraints are much smaller than we ever imagined they would be," Ranganathan said. "There is a simplicity in nature's design rules, and we believe similar approaches could help us search for models for design in other complex systems in biology, like ecosystems or the brain."

Though artificial intelligence revealed the design rules, Ranganathan and his collaborators still don't fully understand why the models work. Next they will work to understand just how the models came to this conclusion. "There is much more work to be done," he said.

In the meantime, they also hope to use this platform to develop proteins that can address pressing societal problems, like climate change. Ranganathan and Assoc. Prof. Andrew Ferguson have spun out a company called Evozyne that will commercialize this technology with applications in energy, environment, catalysis, and agriculture. Ranganathan has worked with UChicago's Polsky Center for Entrepreneurship and Innovation to file patents and license the IP to the company.

"This system gives us a platform for rationally engineering protein molecules in a way that we always dreamed we could," he said. "Not only can it teach us the physics of how proteins work and how they evolve, it can help us find solutions for issues like carbon capture and energy harvesting. Even more generally, the studies in proteins might even help teach us how the deep neural networks behind modern machine learning actually work."

###

Other authors on the paper include William P. Russ from University of Texas Southwestern Medical Center; Martin Weigt, Matteo Figliuzzi and Pierre Barrat-Charlaix from Sorbonne Universite?; Christian Stocker, Peter Kast, Donald Hilvert from ETH Zurich; Simona Cocco and Remi Monasson from Laboratoire de Physique de l'Ecole Normale Supe?rieure; and Michael Socolich from the University of Chicago.

Funding: National Institutes of Health, Robert A. Welch Foundation, University of Chicago Center for Data and Computing, Green Center for Systems Biology at University of Texas Southwestern Medical Center, EU H2020 Research and Innovation Programme, Agence Nationale de la Recherche, and the Swiss National Science Foundation.

####

For more information, please click here

Contacts:
Cynthia Medina


@UChicago

Copyright © University of Chicago

If you have a comment, please Contact us.

Issuers of news releases, not 7th Wave, Inc. or Nanotechnology Now, are solely responsible for the accuracy of the content.

Bookmark:
Delicious Digg Newsvine Google Yahoo Reddit Magnoliacom Furl Facebook

Related Links

Citation: "An evolution-based model for designing chorismate mutase enzymes," Russ et al. Science. DOI: 10.1126/science.aba3304:

Related News Press

News and information

Intelligent optical chip to improve telecommunications: An INRS team uses autonomous learning approaches for optical waveform generators to boost optical signal processing functionalities for current and future telecom applications October 15th, 2021

Using quantum Parrondo’s random walks for encryption: Asst Prof Kang Hao Cheong and his research team from SUTD have set out to apply concepts from quantum Parrondo’s paradox in search of a working protocol for semiclassical encryption October 15th, 2021

Cellular environments shape molecular architecture: Researchers glean a more complete picture of a structure called the nuclear pore complex by studying it directly inside cells October 15th, 2021

How to program DNA robots to poke and prod cell membranes: A discovery of how to build little blocks out of DNA and get them to stick to lipids has implications for biosensing and mRNA vaccines October 15th, 2021

Synthetic Biology

Bioinformatics tool accurately tracks synthetic: DNA Computer scientists show benefits of bioinformatics with PlasmidHawk February 26th, 2021

Synthetic biology reinvents development:The research team have used synthetic biology to develop a new type of genetic design that can reproduce some of the key processes that enable creating structures in natural systems, from termite nests to the development of embryos February 8th, 2021

Machine learning takes on synthetic biology: algorithms can bioengineer cells for you: Berkeley Lab scientists develop a tool that could drastically speed up the ability to design new biological systems September 25th, 2020

Advance in programmable synthetic materials: Reading sequence of metal atoms in MOFs allows encoding of multiple chemical functions August 11th, 2020

Discoveries

Intelligent optical chip to improve telecommunications: An INRS team uses autonomous learning approaches for optical waveform generators to boost optical signal processing functionalities for current and future telecom applications October 15th, 2021

Using quantum Parrondo’s random walks for encryption: Asst Prof Kang Hao Cheong and his research team from SUTD have set out to apply concepts from quantum Parrondo’s paradox in search of a working protocol for semiclassical encryption October 15th, 2021

Cellular environments shape molecular architecture: Researchers glean a more complete picture of a structure called the nuclear pore complex by studying it directly inside cells October 15th, 2021

How to program DNA robots to poke and prod cell membranes: A discovery of how to build little blocks out of DNA and get them to stick to lipids has implications for biosensing and mRNA vaccines October 15th, 2021

Announcements

Using quantum Parrondo’s random walks for encryption: Asst Prof Kang Hao Cheong and his research team from SUTD have set out to apply concepts from quantum Parrondo’s paradox in search of a working protocol for semiclassical encryption October 15th, 2021

Cellular environments shape molecular architecture: Researchers glean a more complete picture of a structure called the nuclear pore complex by studying it directly inside cells October 15th, 2021

How to program DNA robots to poke and prod cell membranes: A discovery of how to build little blocks out of DNA and get them to stick to lipids has implications for biosensing and mRNA vaccines October 15th, 2021

Molecular Sciences Software Institute receives $15 million grant from National Science Foundation October 15th, 2021

Interviews/Book Reviews/Essays/Reports/Podcasts/Journals/White papers/Posters

Intelligent optical chip to improve telecommunications: An INRS team uses autonomous learning approaches for optical waveform generators to boost optical signal processing functionalities for current and future telecom applications October 15th, 2021

Using quantum Parrondo’s random walks for encryption: Asst Prof Kang Hao Cheong and his research team from SUTD have set out to apply concepts from quantum Parrondo’s paradox in search of a working protocol for semiclassical encryption October 15th, 2021

Cellular environments shape molecular architecture: Researchers glean a more complete picture of a structure called the nuclear pore complex by studying it directly inside cells October 15th, 2021

How to program DNA robots to poke and prod cell membranes: A discovery of how to build little blocks out of DNA and get them to stick to lipids has implications for biosensing and mRNA vaccines October 15th, 2021

Artificial Intelligence

Development of dendritic-network-implementable artificial neurofiber transistors: Transistors with a fibrous architecture similar to those of neurons are capable of forming artificial neural networks. Fibrous networks can be used in smart wearable devices and robots September 24th, 2021

Argonne researchers use AI to optimize a popular material coating technique in real time June 25th, 2021

Graphene key for novel hardware security May 10th, 2021

With new optical device, engineers can fine tune the color of light April 23rd, 2021

NanoNews-Digest
The latest news from around the world, FREE




  Premium Products
NanoNews-Custom
Only the news you want to read!
 Learn More
NanoStrategies
Full-service, expert consulting
 Learn More











ASP
Nanotechnology Now Featured Books




NNN

The Hunger Project