Chapter 04

The Antibiotic That AI Found Hiding in Plain Sight

MIT trained a neural network on 2,335 molecules, then screened 100 million compounds in three days. It flagged a failed diabetes drug that killed drug-resistant bacteria in mice within 24 hours. The molecule had been sitting in a database since 2009. No one had thought to test it.

✓ Verified Published in Cell (Stokes et al., Feb 2020) · Follow-up compound abaucin published in Nature Chemical Biology (May 2023)
Listen to this story Audio Overview
0:00 / 0:00
Share X LinkedIn Reddit HN

01 — The VoidThe Empty Pipeline

Since 1987, no genuinely new class of antibiotic has been discovered. Every antibiotic brought to market in the past three decades is a variation of a drug class that already existed. The pipeline did not slow down. It stopped.

The bacteria did not stop. In 2019, drug-resistant infections directly killed 1.27 million people worldwide — more than HIV/AIDS, more than malaria. The WHO classifies antimicrobial resistance as one of the top ten global public health threats. A 2024 Lancet study projected that without intervention, resistant infections will kill more than 39 million people by 2050.

37 yr
since the last new antibiotic class was discovered
1.27M
annual deaths from drug-resistant infections (2019)
18 → 6
pharma companies with active antibiotic programs

The pharmaceutical industry, meanwhile, has been walking away. Over the past two decades, the number of multinational companies with active antibiotic research programs fell from eighteen to six. Antibiotics are expensive to develop — over a billion dollars and ten to fifteen years per drug — and cheap to sell. The economics do not reward the investment.

The traditional approach to finding new antibiotics — synthesize a molecule, test it against bacteria, iterate — works, but it is slow, expensive, and constrained by what chemists already know how to make. The problem was not that the molecules did not exist. It was that no one could search fast enough to find them.

The Sieve — Eighty molecules drift downward through the dark. A teal scan beam sweeps across them. Most pass through unchanged. One catches the light, glows, and rises — hiding in plain sight, now visible.

03 — The FindA Failed Diabetes Drug

The model flagged a compound called SU3327.

SU3327 was a c-Jun N-terminal kinase inhibitor, originally synthesized in 2009 at the Burnham Institute for Medical Research as a candidate treatment for type 2 diabetes. It had been tested, found lacking, and shelved. It had been sitting in a database for over a decade. The model predicted it would kill bacteria. No one had thought to ask.

The researchers named it halicin, after HAL 9000, the artificial intelligence in Stanley Kubrick's 2001: A Space Odyssey.

In lab tests, halicin killed many of the most dangerous drug-resistant bacterial strains on the planet: Clostridioides difficile, Mycobacterium tuberculosis, and Acinetobacter baumannii — a WHO critical-priority pathogen that haunts hospital wards and military field hospitals. The strain of A. baumannii used in testing was resistant to all known antibiotics.

In a mouse model, the results were blunt. Mice infected with pan-resistant A. baumannii were treated with a halicin-containing ointment. Within 24 hours, the infections were completely cleared.

Halicin
0
resistance mutations after 30 days of E. coli exposure
Ciprofloxacin
200×
resistance increase by day 30 — appeared within 1–3 days
Mechanism of Action

Halicin kills bacteria by disrupting the electrochemical gradient across the cell membrane — the proton motive force that drives ATP production. As Stokes explained, "a cell can't necessarily acquire a single mutation or a couple of mutations to change" the fundamental chemistry of its outer membrane. The target is not a single protein. It is the membrane itself.

Halicin was structurally unlike any existing antibiotic. That was the point. The model had found it precisely because it did not look like what humans expected an antibiotic to look like.

04 — The Scale107 Million Molecules in Three Days

Halicin was found in a library of 6,000 compounds. The researchers wanted to know what a larger search would yield.

They pointed the trained model at the ZINC15 database — a collection of more than 107 million chemical compounds. The screen took three days. It identified 23 candidates that were structurally distant from all known antibiotics. Eight showed antibacterial activity in laboratory tests. Two were described as particularly powerful.

Halicin (2020)

6,000 → 1
Drug Repurposing Hub screened. Halicin flagged and validated.

Abaucin (2023)

6,680 → 9
Compounds screened in under 2 hours. Nine antibiotics found.

Three years later, the methodology proved it was not a one-off. In May 2023, Stokes — now leading his own lab at McMaster University — published a follow-up discovery with Collins and Barzilay in Nature Chemical Biology. The team trained a new model on approximately 7,500 compounds tested against A. baumannii specifically. They screened 6,680 unseen compounds in under two hours and identified 240 candidates for lab testing, of which nine showed antibiotic properties.

The most promising compound was named abaucin. It kills A. baumannii by disrupting lipoprotein trafficking — a different mechanism from halicin. Abaucin is narrow-spectrum: it targets A. baumannii specifically, leaving beneficial gut bacteria unharmed. That selectivity is not just a feature. Narrow-spectrum antibiotics exert less evolutionary pressure across the bacterial population, which means resistance develops more slowly.

Halicin showed the approach could work. Abaucin showed it could work again, on demand, against a specific target.

05 — The GapFrom Molecule to Patient

Halicin has not entered human clinical trials. As of 2024, preclinical studies indicate that the molecule is poorly absorbed into the bloodstream and rapidly eliminated from the body — pharmacokinetic challenges that are solvable in principle but expensive in practice.

Finding the molecule was the fast part. A neural network screened a database in three days. The slow part is everything that comes after: reformulation to improve bioavailability, toxicology studies, Phase I/II/III trials, regulatory approval. That pipeline takes a decade and costs over a billion dollars. It is the same pipeline that emptied the antibiotic market in the first place.

The structural shift, though, is real. The methodology that found halicin and abaucin is reusable. The models can be retrained on new data, pointed at new chemical libraries, optimized for new pathogens. The discovery phase — the part that stalled for three decades — has been compressed from years to days. What remains is the translation phase. It is slower, more expensive, and institutionally harder. But the constraint has moved.

For thirty-seven years, the world had a discovery problem. Now it has a delivery problem. That is progress.

What If?

What if the same model that found halicin hiding in a diabetes drug database is pointed at every abandoned compound in pharmaceutical history — every molecule tested for one condition, found lacking, and shelved? The Drug Repurposing Hub holds 6,000. ZINC15 holds 107 million. The full universe of synthesized-but-untested compounds stretches into the billions. Halicin was one molecule, found in one screen, for one class of disease. The methodology is pathogen-agnostic. Retrain on antifungal data and screen the same library — now you are searching for the next antifungal while Candida auris closes hospital wings. Retrain on antiviral data — broad-spectrum antivirals before the next pandemic, not after. The constraint was never chemistry. It was attention. The AI does not invent molecules. It finds the ones we already made and never thought to ask the right question about. How many people died of drug-resistant infections while a compound that could have treated them sat in a freezer, catalogued under the wrong heading, because no one could search at the right scale? That number grows every year the delivery pipeline falls behind the discovery one.

How did this land?

Sources

← Previous Chapter 03 The Ruling That Made AI Liable 5 min read Next → Chapter 05 The Fifty-Year Problem 8 min read
New chapters · No spam
Get the next story in your inbox