Researchers have developed an AI-based tool for analyzing how proteins move and interact that is faster and more accurate than human analysts, according to a study published on 3 November 2020 in eLife.
The software, which is freely accessible, dramatically accelerates the study of protein dynamics and makes it available to research groups across the world, rather than restricting it to a few labs with specialist expertise.
Proteins are the workhorses of our cells, and their movement controls a huge array of biological processes. Studying the movement of proteins – how they move around and interact with each other – is an essential part of understanding fundamental biology.
One of the fundamental tools for studying protein motion is called single-molecule Förster Resonance Energy Transfer (smFRET).
This works by labelling at least two parts of the protein with different fluorescent tags; when the two tags are in close proximity, the change in fluorescence can be detected by a microscope. In this way, the movement of proteins can be visualized and measured down to the nanometre level.
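The sensitivity to distance comes from the standard Förster relation, E = 1 / (1 + (r / R0)^6), which links the energy-transfer efficiency E to the separation r between the two tags. The sketch below illustrates that relation only. The Förster radius of 5 nm is an assumed, typical value for illustration, and the function names are hypothetical rather than taken from the study or its software.

```python
def fret_efficiency(distance_nm: float, forster_radius_nm: float = 5.0) -> float:
    """Energy-transfer efficiency from the Förster relation E = 1 / (1 + (r / R0)^6).

    forster_radius_nm (R0) defaults to 5 nm, an assumed illustrative value;
    the real R0 depends on the specific pair of fluorescent tags.
    """
    if distance_nm < 0 or forster_radius_nm <= 0:
        raise ValueError("distance must be >= 0 and Förster radius must be > 0")
    return 1.0 / (1.0 + (distance_nm / forster_radius_nm) ** 6)


def distance_from_efficiency(efficiency: float, forster_radius_nm: float = 5.0) -> float:
    """Invert the Förster relation to estimate the tag separation in nm."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return forster_radius_nm * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)


if __name__ == "__main__":
    # Efficiency falls steeply as the tags move apart; at r == R0 it is exactly 0.5.
    for r in (3.0, 5.0, 7.0):
        print(f"r = {r:.1f} nm -> E = {fret_efficiency(r):.3f}")
```

Because the efficiency depends on the sixth power of the distance, a change of a nanometre or two around R0 shifts the signal substantially, which is why the technique can resolve motion at this scale.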
“Some of the challenges with smFRET include the very large amount of data that is produced, and the steps that researchers need to take to process the images before analysis,” explains lead author Johannes Thomsen, who carried out this study as a Research Assistant at the University of Copenhagen, Denmark, and has since graduated with a Ph.D. “Machine learning technologies, especially deep neural networks, have significantly improved our ability to understand large datasets without the need for human intervention. We wanted to see whether applying these technologies to smFRET data would allow automated, fast characterization of protein motions, independently of human experts.”
The team chose a deep learning approach based on deep neural networks (DNNs). Deep learning is a branch of machine learning that takes the data in its raw form and looks for patterns with no prior ‘knowledge’. It has the advantage of learning useful features from raw data without time-intensive pre-processing, and it offers a ‘less opinionated’ evaluation of the data than the more subjective analysis by humans. A DNN has the further advantage that it can learn to recognize important aspects of the data and then classify the data into groups. Although developing a DNN is a computationally intensive process that can take time, once trained the model can be used easily, and by non-experts, on any computer.
The tool, DeepFRET, imports raw microscope images, locates the two different fluorescence signals, corrects for background noise and, with limited human help, produces a chart showing the motion of the molecules within the sample. When tested with simulated and real data, it detected meaningful patterns in the data with more than 95 percent accuracy, outperforming human operators while needing only 1 percent of the time. The evaluation time for DeepFRET on a single piece of data (a trace) was around 50 milliseconds, whereas human reviewers spent an average of five seconds per trace.
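To make the background-correction step more concrete, here is a minimal sketch of how a per-frame FRET trace could be derived from two fluorescence signals. It is not DeepFRET's code: the function names are hypothetical, the intensity values are made-up toy numbers, and the commonly used apparent-efficiency formula E = I_A / (I_D + I_A) is applied without the further corrections (such as spectral crosstalk) that a real analysis would usually include.

```python
from statistics import median


def subtract_background(signal, background):
    """Subtract the median background level from every frame of a trace."""
    level = median(background)
    return [value - level for value in signal]


def apparent_fret(donor, acceptor):
    """Per-frame apparent FRET efficiency, E = I_A / (I_D + I_A).

    Frames with no positive total intensity yield NaN instead of dividing by zero.
    """
    trace = []
    for d, a in zip(donor, acceptor):
        total = d + a
        trace.append(a / total if total > 0 else float("nan"))
    return trace


# Toy intensities (arbitrary units), invented purely for illustration.
donor_raw = [900, 880, 420, 410, 890]
acceptor_raw = [300, 320, 780, 790, 310]
donor_bg = [100, 98, 102, 100, 101]
acceptor_bg = [99, 100, 101, 100, 100]

donor = subtract_background(donor_raw, donor_bg)
acceptor = subtract_background(acceptor_raw, acceptor_bg)
fret_trace = apparent_fret(donor, acceptor)

if __name__ == "__main__":
    print([round(e, 2) for e in fret_trace])
```

In this toy trace the efficiency jumps from about 0.2 to about 0.7 and back, the kind of switch between a far-apart and a close-together arrangement of the tags that a classifier would then need to judge as meaningful or not.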
“We have developed a machine learning method that can automatically, rapidly and reproducibly analyze recordings of the choreography of protein motions, with a simple user interface that works on different operating systems,” concludes senior author Nikos Hatzakis, Associate Professor at the University of Copenhagen, and Affiliate Associate Professor at the Novo Nordisk Foundation Center for Protein Research, University of Copenhagen. “The method works as well as or better than existing methods and requires only minimal contribution by humans. It therefore offers a tool for people with limited expertise, which we hope will contribute to the standardization and rapid expansion of this field of study.”