Chapter 09

The Library No One Knew Existed

DeepMind's GNoME discovered 2.2 million new crystal structures — 800 years of materials science — and gave the database away for free.

✓ Verified Published in Nature (Merchant et al., Nov 2023) · 736 materials independently synthesized by external labs · 380,000 stable structures added to Materials Project
Listen to this story Audio Overview
0:00 / 0:00
Share X LinkedIn Reddit HN

01 — The Shelf48,000 Crystals in a Century

Materials science moves at the speed of synthesis. A researcher hypothesizes a crystal structure, mixes precursors, fires a furnace, waits, analyzes the result, and records whether the material is stable. One structure at a time. One experiment at a time. For over a century, this is how humanity has built its catalog of known inorganic crystals.

By 2023, that catalog — the Inorganic Crystal Structure Database, or ICSD — contained approximately 48,000 experimentally confirmed stable materials. These are the crystals underpinning modern technology: the silicon in processors, the lithium compounds in batteries, the gallium arsenide in LEDs, the perovskites in next-generation solar cells. Every material in every device you own traces back to someone, somewhere, synthesizing a crystal and writing down what happened.

~48K
known stable inorganic crystals before GNoME (ICSD)
~100 yr
of cumulative human effort to discover them
~480
new stable crystals discovered per year, on average

The constraint was never imagination. Materials scientists have long suspected that the space of possible stable crystals is vastly larger than what has been explored. The constraint was throughput. Testing a single candidate material — from computational prediction through synthesis and characterization — takes weeks to months. Searching exhaustively through the space of possible compositions and structures at that rate would take millennia.

The question was not whether undiscovered materials existed. It was whether anyone could search fast enough to find them.

The Lattice — Crystal nodes assemble in a warm grid, pulsing faintly. A search beam sweeps through them. Most remain dim. A few catch the light and glow with stability — rising from the grid into a growing constellation above.

03 — The Yield2.2 Million Structures

The final count was 2.2 million new crystal structures. Of these, 380,000 were predicted to be thermodynamically stable — meaning they sit at energy minima that should allow them to be synthesized and to persist under real-world conditions.

To put that number in context: 380,000 stable materials is roughly eight times the total number of stable crystals discovered by all of human materials science over the past century. DeepMind described the result as "the equivalent of about 800 years of knowledge."

Layered Compounds

52,000
new structures similar to graphene — potential superconductors and electronics

Lithium Ion Conductors

528
25x more candidates than all prior human research combined

Within the 380,000, specific categories stood out. GNoME identified 52,000 new layered compounds with structures analogous to graphene — the single-atom-thick carbon sheets that won the 2010 Nobel Prize in Physics and have been the subject of intense research for applications in electronics, energy storage, and quantum computing. It also identified 528 potential lithium ion conductors, a class of material critical to the development of solid-state batteries. That figure — 528 — represents 25 times more candidates than all prior studies had identified in total.

The numbers are large enough to be difficult to absorb. But they are not abstract. Each of those 380,000 stable structures is a specific arrangement of specific atoms with specific predicted properties. Each one is a candidate for a specific application. The question for each is no longer "does this material exist?" It is "is it worth synthesizing?"

04 — The ProofFrom Prediction to Powder

A prediction is not a material. A crystal that exists only in a neural network's output has no mass, no conductivity, no practical value. The question for GNoME was always going to be whether its predictions survived contact with reality.

By the time the paper was published, 736 of GNoME's predicted materials had been independently created in laboratories around the world. These were not DeepMind experiments. External researchers, working from the predictions, synthesized the materials and confirmed their stability. The predictions held.

"We are at the precipice of a new era of materials science." — Amil Merchant, Google DeepMind

The most striking validation came from a companion paper published in Nature on the same day. A team at Lawrence Berkeley National Laboratory, led by Gerbrand Ceder, had built an autonomous robotic laboratory called A-Lab. The facility could plan synthesis experiments, mix precursors, operate furnaces, and analyze results — all without human intervention. The team pointed A-Lab at a set of GNoME-predicted materials and let it run.

A-Lab — Autonomous Synthesis Log
TARGET ›58 GNoME-predicted compounds
→ Recipes generated from thermodynamic data
→ Precursors selected and dispensed by robotic arm
→ Furnace cycles: autonomous temperature programming
→ X-ray diffraction analysis: automated
RESULT ›41 of 58 successfully synthesized
→ Success rate: 71%
→ Total time: 17 days
→ Human intervention: none

A-Lab attempted 58 compounds and successfully synthesized 41 — a 71 percent success rate, achieved in 17 days with no human intervention. The full loop — AI predicts a material, a robot makes it, an instrument confirms it — ran without a person touching a beaker.

This is what it looks like when the bottleneck moves. The prediction was fast. The synthesis was autonomous. The validation was automated. The rate-limiting step was no longer any of those. It was deciding which of the 380,000 candidates to try first.

05 — The Gift380,000 Structures, Free to the World

DeepMind could have kept the database proprietary. A catalog of 380,000 novel stable materials with predicted properties is commercially valuable — to battery manufacturers, semiconductor companies, pharmaceutical firms, defense contractors. The research cost real money and years of compute. There were strong incentives to monetize it.

They gave it away.

Materials Project — Database+380,000 structures added
CompoundStabilitySourceStatus
Li₃ScCl₆StableGNoMENew
Na₂ZrCl₆StableGNoMENew
BaTiGe₃O₉StableGNoMENew
Ca₂MnReO₆StableGNoMENew
... 379,996 more entries

The full dataset of 380,000 predicted stable structures was released to the Materials Project — the same open-access database that GNoME was originally trained on. The code was made available on GitHub. Any researcher, at any institution, in any country, could now search the database for materials matching their application requirements.

The decision to open-source was not charity. It was strategy. Materials science is a field where progress depends on synthesis — and DeepMind does not operate synthesis labs. The predictions are valuable only if someone tests them. The more labs that have access, the more materials get synthesized, the more feedback flows back into the system, the better the next generation of models becomes. Open-sourcing was the fastest path to validation.

But the structural consequence is real regardless of the motive. Before November 29, 2023, the catalog of computationally predicted stable materials was a resource available primarily to labs with the expertise and compute to run their own DFT calculations. After November 29, 2023, a graduate student in Nairobi and a research team at MIT had access to the same 380,000 predicted structures. The information asymmetry in materials discovery did not narrow. It collapsed.

What If?

GNoME found 528 potential lithium ion conductors — 25 times more candidates than all prior human research combined. Lithium ion conductors are the rate-limiting material in solid-state batteries, which would double the energy density of every electric vehicle and grid storage installation on Earth. The bottleneck has never been engineering the battery. It has been finding the material. GNoME also found 52,000 new layered compounds similar to graphene — the class that could enable room-temperature superconductors. Hundreds of thousands of novel crystal structures in spaces no one has explored, because doing it by hand would have taken centuries. Every one is now in a public database, available to any lab with a synthesis rig. What happens when the constraint shifts from 'we haven't found the right material' to 'we found it three years ago and no one has synthesized it yet'? The 380,000 stable structures in the Materials Project are an inventory of futures — batteries, semiconductors, catalysts, superconductors — waiting for someone to pick one and build it. The materials exist. Whether the institutions that turn them into technologies can move at the speed of the database that found them is another matter.

How did this land?

Sources

← Previous Chapter 08 The Second Reader 8 min read Next → Chapter 10 The First Conversation 7 min read
New chapters · No spam
Get the next story in your inbox