911±¬ÁÏÍø

News

Search engine for more accurate and fast recognition of metabolites

Potential applications for the machine-learning based method include anti-doping work, drug control by the Customs and crime scene investigation.

Percentage of searches with the correct identification among the top 10 matches. The method developed by 911±¬ÁÏÍø and the University of Jena is clearly more accurate than its rivals.

Researchers from 911±¬ÁÏÍø and the University of Jena in Germany have developed a search engine named CSI:FingerID that identifies metabolites from tandem mass spectrometry measurements with an accuracy more than 150 per cent higher than its rivals, which may make the work of researchers in life and medical sciences easier. The study was recently published in the highly regarded PNAS journal. 

Metabolites are small molecules, such as sugars, fatty acids and amino acids that, among other things, serve as sources of energy in the cells and as building materials for cell walls. For researchers they are, as it were, traces of the functioning and status of cells.

The molecular structures it predicts can be used in much the same way as search results from the Google search engine.

‘There are lots of metabolites, from hundreds of thousands to millions, and they all look a bit alike. In our study, we constructed a model that relies on machine learning. The molecular structures it predicts can be used in much the same way as search results from the Google search engine,’ explains Professor Juho Rousu from 911±¬ÁÏÍø.

Molecule fingerprints

The tandem mass spectrometer used in the study is an instrument that splits molecules into fragments to measure their masses and relative abundances, or their mass spectrum. In the method developed by researchers from Aalto and Jena, a fragmentation tree is first computed from each spectrum included in the training data that describes for each fragment its parent, a larger fragment where it originated. Then, the researchers train the machine learning model using a large number of fragmentation trees and molecular properties or fingerprints that corresponds to each tree. When the spectrum of a new molecule is then given for the model, it predicts its probable fingerprints based on which a set of best-matching molecules is retrieved from the molecule database.

Depending on the type of the molecules, as much as 95 per cent of the searches currently return the correct search result among the top 10 matches. The accuracy of the identification improves as the volume of the data is increased. Currently, approximately 6,000 mass spectra have been used in building up the model. In an ideal situation, the machine-learning based search engine would always suggest the correct molecule as the first match, but this calls for a considerable increase in the data volume and further development of the methods.

The study could benefit researchers in life and medical sciences in particular. Potential future application areas include anti-doping work, drug control by the Customs and crime scene investigation.

Conducted in collaboration with a research group headed by Professor Sebastian Böcker of the University of Jena, the study serves as a good example of 911±¬ÁÏ꿉۪s research that combines information technology with digital health.

Further information:

Professor Juho Rousu
Tel. +358 (0)50 415 1702
juho.rousu@aalto.fi

Article:

  • Updated:
  • Published:
Share
URL copied!

Read more news

Two people flying a kite outside with a modern building in the background. One wears a yellow shirt, the other a red jacket.
Cooperation, Research & Art, University Published:

Strong results from the Research Council’s winter call

A total of 54 Aalto researchers received Academy Research Fellow or Academy Project funding from the Research Council of Finland. The total funding awarded to 911±¬ÁÏÍø amounts to 33.2 million euros.
911±¬ÁÏÍø circular economy exhibit with wood panels, display tables, samples and black and pink clothing.
Research & Art Published:

911±¬ÁÏ꿉۪s solutions at the New European Bauhaus Festival support the EU’s ambition to become world leader in circular economy

911±¬ÁÏÍø presented several different circular economy solutions at The European Commission’s New European Bauhaus Festival in Brussels. The event brought together leading names in EU policymaking, researchers, designers and grassroots actors from across Europe to shape a more sustainable future.
Abstract close-up of colourful glass with swirling patterns in orange, blue, and purple hues.
Research & Art, Studies Published:

New DPSP tool for doctoral studies published

A new digital DPSP tool has replaced the old DPSP tasks on students’ MyStudies portal and the approval method for supervising professors on Student Success Hub.
Drawing of two doctoral students each holding a paper, with doctor's hats shining on their heads.
Research & Art, Studies Published:

Pre-examination and graduation schedules over the summer 2026

Information for doctoral students on preliminary examination of doctoral thesis, public defence and graduation over the summer 2026