From Prediction to Production: MIT’s AI System Helps Synthesize New Materials
AI systems are highly effective in predicting new material structures. We have previously covered this at AIwire. However, what about actually creating the material in the lab? The synthesis process to create the materials is far more delicate – something a human is typically better at.
AI can make reliable predictions based on data. It can generate outputs on which crystal structures would be most stable and useful under certain conditions. However, when it comes to actually combining the right chemicals, at the right temperature, under a specific pressure, and letting the experiment run for a certain duration, that is far more complicated. That could change now.
Researchers from MIT have developed DiffSyn – a GenAI system that draws from historical materials synthesis data to predict plausible synthesis pathways. It goes beyond structural material properties. The researchers claim that it models the sequence of experimental steps required to produce a target material, essentially translating structure into process.
The model narrows the experimental search space by identifying combinations of variables that have historically led to successful material formation. Rather than replacing human scientists, it functions as an intelligent experimental planning assistant.
The researchers believe that DiffSyn could break the biggest bottleneck in the materials discovery process. “To use an analogy, we know what kind of cake we want to make, but right now we don’t know how to bake the cake,” says lead author Elton Pan, a PhD candidate in MIT’s Department of Materials Science and Engineering (DMSE). “Materials synthesis is currently done through domain expertise and trial and error.”
(HL Creations/Shutterstock)
Using the new model, the researchers were able to synthesize a new zeolite material that showed improved thermal stability. They purposely tested the model to create a zeolite because these materials are structurally complex, extremely sensitive to synthesis conditions, known to have many theoretical frameworks that have never been made in reality.
What makes zeolites particularly challenging is not just their structure but the number of interacting variables involved in their formation. Traditional lab methods tend to explore those variables one at a time, which limits how much of the full parameter space can realistically be tested.
“People rely on their chemical intuition to guide the process,” Pan says. “Humans are linear. If there are five parameters, we might keep four of them constant and vary one of them linearly. But machines are much better at reasoning in a high-dimensional space.”
“Diffusion models are basically a generative AI model like ChatGPT, but more like the DALL-E image generation model,” Pan says. “During inference, it converts noise into meaningful structure by subtracting a little bit of noise at each step. In this case, the ‘structure’ is the synthesis route for a desired material.”
The researchers trained their model on over 23,000 published synthesis recipes spanning about 50 years. They deliberately added random variations to those recipes and taught the system to reconstruct them step by step. In doing so, the model learned how to move from randomness to a workable synthesis plan. That approach is known as diffusion, and it forms the foundation of DiffSyn.
(Sergio Klambiatti/Shutterstock)
“It basically tells you how to bake your cake,” Pan says. “You have a cake in mind, you feed it into the model, the model spits out the synthesis recipes. The scientist can pick whichever synthesis path they want, and there are simple ways to quantify the most promising synthesis path from what we provide, which we show in our paper.”
While the model has only been tested in a limited scope, the researchers are confident that the approach could work to trail other models that guide the synthesis of other material out of zeolites.
“This approach could be extended to other materials,” Pan says. “Now, the bottleneck is finding high-quality data for different material classes. But zeolites are complicated, so I can imagine they are close to the upper-bound of difficulty. Eventually, the goal would be interfacing these intelligent systems with autonomous real-world experiments, and agentic reasoning on experimental feedback to dramatically accelerate the process of materials design.”
If systems like DiffSyn prove reliable, they could significantly shorten the gap between computational discovery and experimental validation. The researchers note that the system’s performance depends heavily on the quality and diversity of historical synthesis data. Materials with limited documentation may remain difficult to model. However, the successful zeolite demonstration suggests that AI can move beyond structural prediction and begin encoding procedural scientific knowledge.
This article first appeared on BigDATAwire.
Related

