911

News

Decoding the chemistry of space with machine learning

Astronomers can detect complex chemical fingerprints in stardust – but many of them remain unidentified. The SpaceML project combines machine learning and computational chemistry to simulate how molecules form and evolve in space, helping researchers decode these signals.
Left: person wearing a black jacket and pearl necklace. Right: molecular structure illustration against a cosmic background.
Left: Academy Research Fellow Rina Ibragimova. Right: Graphene in space (artist's concept). Photo: NASA.

Organic molecules made of carbon, hydrogen, and oxygen are widespread in the universe. Yet important questions remain about where these compounds come from and how they form. 

 models how organic molecules form and break down around stars, with the aim of explaining spectroscopic signals – signatures produced by light-matter interactions that astronomers can detect but often struggle to interpret. 

“Machine learning allows us to simulate processes that were completely out of scope five years ago – harsh astrophysical conditions, constant radiation, rare reaction events. In practice, we now have tools to simulate matter in space. This way we can study systems that were previously impossible to approach computationally,” says Academy Research Fellow Rina Ibragimova

Ibragimova is an expert in computational physics and the Principal Investigator of the SpaceML project. She is also a member of the Data-driven Atomistic Simulation (DAS) group led by professor Miguel Caro at Aalto. 

The idea to use machine learning in astrochemistry emerged from an unexpected interdisciplinary meeting. A few years ago, during a visit to the University of La Laguna in Spain, one of Ibragimova’s lectures was attended by an astronomer. 

“Before that meeting, I had not thought about astrochemistry at all. But we talked and found there was a lot of common ground and that we could work together. That’s how the collaboration emerged. We started planning potential projects, and later I wrote my own proposal to the Academy of Finland, and SpaceML was formed” Ibragimova says. 

Carbon-rich AGB star with molecule formation stages at 10^14 cm, 10^15 cm, and 10^16 cm radii, denoted in white on a starry background.
A conceptual depiction of the molecules in the circumstellar medium (CRM) – the gas, dust, and plasma surrounding a star. Credit: Martínez, L., Santoro, G., Merino, P. et al. / Nature Astronomy.*

The challenge of identifying molecules in space 

When organic molecules drift around stars, they interact with light in characteristic ways. Using infrared spectroscopy, researchers break that light into a spectrum – a pattern of peaks and features. Each molecular structure leaves its own ‘fingerprint’. 

For simple molecules, the signal can be straightforward to link to a specific bond or structure. Often, however, astronomers observe complex mixtures. Their spectral signatures overlap, producing features that may correspond to several possible structures – or to none that can be identified with confidence. As a result, many signals in astronomical data remain unexplained. 

In materials science, researchers solve this problem in a more controlled way. 

“We synthesize a specific compound in the lab, measure its infrared spectrum, and create a direct link between structure and signal. In astronomy, the challenge is much greater. Researchers often do not know what kinds of molecules are present to begin with. The number of possible molecular structures and combinations is vast, making it difficult even to decide what to test experimentally,” Ibragimova explains. 

Laboratories can recreate certain space-like conditions by placing a starting material – a precursor – in a chamber, exposing it to hydrogen atoms and ultraviolet radiation, and measuring how its spectrum evolves. But selecting the right precursor is a major challenge. 

“This is where computational simulations become essential. By screening large numbers of precursor structures with advanced modeling and machine learning, we can identify the ones most likely to explain specific infrared features observed in space”, Ibragimova says. 

Building reusable methods 

Beyond its astrochemical focus, the SpaceML project aims to develop computational methods that can be applied widely in chemistry and materials science. Ultimately, the goal is to make the simulations accessible – not something that takes an entire PhD to run, but something more efficient and widely usable. 

The second goal is to fill theoretical gaps and stitch everything together into a coherent methodology. Although many machine-learning approaches already exist, Ibragimova sees a need for a more integrated workflow. 

“Ideally, you would have an idea, run the calculation quickly, identify the structure and its spectrum, and then bring that information to collaborators and say, ‘We think these structures are relevant under these conditions – you could try to test them’”, she says. 

Such a framework could guide experimentalists more efficiently and help make telescope observations more targeted. 

“For example, with the James Webb Space Telescope, we could better predict what to look for and estimate the confidence of identifying certain structures.” 

The project builds on tools developed within the Data-driven Atomistic Simulation (DAS) group while creating new ones. One model predicts infrared spectra using machine learning. Another, developed in collaboration with professor Andrea Sand at Aalto, simulates the effects of ionizing UV radiation on materials. 

“It’s amazing that the same tools can simulate catalytic materials and materials millions of light-years away. What’s fascinating is the difference in time scales. Stellar evolution happens over thousands or millions of years, while atomistic simulations might cover only hundreds of picoseconds. And yet with careful considerations and thorough verification, we can capture meaningful patterns”, Ibragimova says.

* The source for the image depicting the circumstellar medium can be found . Rina Ibragimova is not a co-author in the article from which the image is derived from but José A. Martín-Gago and Gonzalo Santoro are the collaborators of her ARF.

Further information:

DAS research group photo

Data-driven Atomistic Simulation (DAS)

Research group led by Miguel Caro

Department of Chemistry and Materials Science
A 3D structure with green spheres interconnected by a grey mesh, set against a multicoloured background.

A paradigm shift: machine learning is transforming research at the atomic scale

Assistant professor Miguel Caro and his research group use and develop machine learning tools to accelerate discoveries from simulation to experiment

News
  • Updated:
  • Published:
Share
URL copied!

Read more news

Abstract close-up of colourful glass with swirling patterns in orange, blue, and purple hues.
Research & Art, Studies Published:

New DPSP tool for doctoral studies published

A new digital DPSP tool has replaced the old DPSP tasks on students’ MyStudies portal and the approval method for supervising professors on Student Success Hub.
Drawing of two doctoral students each holding a paper, with doctor's hats shining on their heads.
Research & Art, Studies Published:

Pre-examination and graduation schedules over the summer 2026

Information for doctoral students on preliminary examination of doctoral thesis, public defence and graduation over the summer 2026
Aerial view of a modern campus with red brick buildings, tram tracks and green trees on a sunny day
Research & Art Published:

A unique joint effort – 911 receives EUR 9 million in donations to accelerate the energy transition

Donations from ABB, Fortum, St1 and the Walter Ahlström Foundation will be used to establish new professorships. At the same time, the donations support the establishment of 911 House of Energy Transition.
Person in a white lab coat stands in a bright laboratory with equipment and benches around
Research & Art Published:

Keys to growth: How the energy transition is making oil obsolete

The transition to clean energy is a societal transformation on the same scale as digitalisation, and Finland has many strengths to support it, says professor Annukka Santasalo-Aarnio, who leads 911’s new competence centre, House of Energy Transition.