The Second Reader — The AI Files

Key Facts

Google Health/DeepMind study showed 5.7% fewer false positives and 9.4% fewer false negatives, published in Nature January 2020.
Swedish MASAI trial (105,934 women) found AI screening detected 29% more cancers without increasing false positives.
Germany's PRAIM study (461,818 women) confirmed a 17.6% higher cancer detection rate in real-world deployment.
AI systems detected 20-40% of interval cancers that radiologists had missed.
The NHS launched the EDITH trial across 30 sites in 2025.

01 — The GapThe Missing Reader

Every mammogram in the United Kingdom is read twice. Two radiologists, working independently, examine the same image. If either flags an abnormality, the case goes to assessment. This double-reading system is one of the reasons the UK breast screening program, established in 1988, catches cancers early. It is also the reason the program is running out of time.

There are not enough radiologists. The Royal College of Radiologists reported in 2023 that the UK had a shortfall of approximately 29% in its clinical radiology workforce. Breast screening radiologists are retiring faster than they are being replaced. Training a new one takes a decade. The problem is not unique to Britain. In the United States, where single-reading is the norm, radiologist burnout and workload are well-documented constraints. In low- and middle-income countries, the math is worse: sub-Saharan Africa has fewer than ten radiologists per million people. Most of the world has never had a double-reading system because it has never had enough readers for a single one.

In 2022, breast cancer was diagnosed in 2.3 million women worldwide and killed 666,000. It remains the most frequently diagnosed cancer in women and the leading cause of cancer death in women globally. The Lancet Breast Cancer Commission projected in 2024 that annual incidence will exceed three million by 2040, with the steepest increases in low- and middle-income countries. Early detection through screening is the single largest factor in survival. Five-year survival for breast cancer caught at stage one exceeds 98%. At stage four, it drops below 30%.

The constraint was always human attention. A radiologist reads a mammogram in about sixty to ninety seconds, searching for abnormalities in dense tissue that can be as small as a few millimeters. Across a full shift, that focus degrades. Across a career, it burns out. And across a population, there are simply not enough trained eyes.

2.3M

new breast cancer diagnoses worldwide in 2022

666K

breast cancer deaths globally in 2022

29%

UK clinical radiology workforce shortfall (2023)

The Scan — A warm grid of faint tissue nodes sits quietly. A pink scan line sweeps across. Most nodes dim and pass. One catches the light and glows — a detection the eye alone would have missed. The line continues. Another sweep begins.

02 — The StudyTwenty-Nine Thousand Mammograms

In January 2020, a team from Google Health, DeepMind, Cancer Research UK Imperial Centre, Northwestern University, and the Royal Surrey County Hospital published a study in Nature. The lead author was Scott Mayer McKinney. The study evaluated an AI system trained to detect breast cancer from mammograms.

The evaluation dataset comprised 25,856 women from two screening sites in London, with 414 cancer diagnoses confirmed by biopsy within three years, and 3,097 women from an academic medical center in the United States, with 686 confirmed cancers. The AI was not given patient history, prior images, or clinical notes. It saw only the mammogram.

AI Screening Output — Study Results

False Positive Reduction (US)-5.7%

False Positive Reduction (UK)-1.2%

False Negative Reduction (US)-9.4%

False Negative Reduction (UK)-2.7%

AI vs. Six RadiologistsAI outperformed all six

RECALL WORKLOAD REDUCTION: 88% (simulated second-reader replacement)

On US data, the AI achieved an absolute reduction of 5.7% in false positives — cases where a mammogram is incorrectly flagged as abnormal, leading to unnecessary follow-up — and 9.4% in false negatives, cases where a cancer is missed entirely. On UK data, where the baseline double-reading system already performs well, the reductions were 1.2% and 2.7% respectively.

In a head-to-head comparison with six radiologists, the AI outperformed all of them. In a simulated experiment modeling the UK double-reading workflow, the AI serving as a second reader achieved non-inferior performance while eliminating 88% of the second-reader workload.

The system also demonstrated cross-population generalization. Trained exclusively on UK mammograms, it still exceeded human expert performance when tested on US data. The screening protocols, equipment, and patient demographics differed between the two countries. The AI adapted. It had learned what cancer looks like, not what a particular country's mammograms look like.

03 — The TrialsFrom Lab to Clinic

The Nature paper was a retrospective study — it tested AI on already-collected data. The question that followed was whether the results would hold in a prospective, real-world clinical setting. Three large-scale trials answered that question.

The MASAI Trial (Sweden, 2021–2022). The Mammography Screening with Artificial Intelligence trial was the first randomized controlled trial of AI in breast cancer screening. Conducted across four screening sites in Sweden, it enrolled 105,934 women. Half were assigned to AI-supported screening, where a single radiologist worked with Lunit INSIGHT MMG. The other half received standard double reading by two radiologists.

The AI group detected 29% more cancers — a cancer detection rate of 6.4 per 1,000 women versus 5.0 per 1,000 in the control group. It did so without increasing false positives. In the AI group, 81% of cancer cases were detected at screening, compared to 74% in the control group. Radiologist workload was reduced by nearly half.

The full results, published in The Lancet in 2025, showed that AI-supported screening also reduced interval cancers — cancers diagnosed between routine screenings — by 12%. AI-screened women had fewer aggressive and advanced-stage diagnoses at the next screening round.

AI + 1 Radiologist

6.4

cancers detected per 1,000 women

2 Radiologists (Standard)

5.0

cancers detected per 1,000 women

The ScreenTrustCAD Trial (Sweden, 2021–2022). Run at Capio Sankt Goran Hospital in Stockholm, this prospective study used the same Lunit AI system. Double reading by one radiologist plus AI detected 4% more cancers than double reading by two radiologists. At the one-year follow-up, AI-assisted screening reduced false positives from 89.6% to 78.0% and improved positive predictive value from 16.9% to 22.1%. Published in The Lancet Digital Health in October 2023.

The PRAIM Study (Germany, 2021–2023). The largest prospective study to date. Across 12 screening sites in Germany, 461,818 women were screened. Approximately half received AI-supported double reading. The AI group achieved a 17.6% higher cancer detection rate with a higher positive predictive value (17.9% versus 14.9%), without increasing recalls or false positives. Published in Nature Medicine in January 2025.

MASAI (Sweden)

105,934

women screened. 29% more cancers detected. Interval cancers reduced 12%.

PRAIM (Germany)

461,818

women screened. 17.6% more cancers detected. No increase in false positives.

Across all three trials, the pattern was consistent: AI found more cancer, generated fewer false alarms, and reduced the number of radiologists needed to operate a screening program. One radiologist plus AI matched or exceeded two radiologists working alone.

04 — The Invisible CancersWhat the Eye Missed

The most consequential finding may be what AI does with interval cancers — breast cancers diagnosed between scheduled screenings, typically because they were invisible or ambiguous on the prior mammogram. Interval cancers account for approximately 25–30% of all screen-detected breast cancers and tend to be more aggressive, because they grow fast enough to become symptomatic before the next scheduled screen.

A growing body of evidence shows that AI systems can retrospectively identify 20–40% of interval cancers on the prior mammograms where they were originally missed. In the Google Health study of NHS data, the AI detected 25% of interval cancers. An RSNA study found that an AI algorithm correctly localized 32.6% of previously undetected interval cancers. Another study found AI detected an additional 44.4% of interval cancers and 66.7% of cancers that had been explicitly missed by radiologists.

The Interval Cancer Problem

A woman receives a clear mammogram. The radiologist saw nothing. Two years later, she is diagnosed with advanced breast cancer that was already present — visible in retrospect — on the image that was read and cleared. AI catches one in three of these cases. The question is whether "one in three" is the ceiling, or the starting point.

These are not hypothetical improvements. Each interval cancer that AI identifies on a prior mammogram represents a real patient who could have been recalled two years earlier, when the tumor was smaller, the treatment less aggressive, and the survival odds substantially better. The MASAI trial's full results showed that AI-supported screening led to fewer aggressive and advanced-stage diagnoses at the subsequent screening round — direct evidence that earlier detection translates to better clinical outcomes.

Lunit's AI system has also demonstrated the ability to estimate future breast cancer risk up to four to six years before a cancer becomes detectable, based on subtle patterns in mammographic tissue that human readers cannot reliably perceive.

05 — The DeploymentFrom Trial to Standard

As of 2025, AI mammography systems are moving from clinical trials into routine deployment. The trajectory is clear. The remaining questions are regulatory and institutional, not scientific.

In the United Kingdom, NHS England launched EDITH — the largest AI breast cancer screening trial in the world — in April 2025. Five competing AI systems will be tested across 30 screening sites, covering approximately 700,000 women. The Kheiron Medical Technologies system, Mia, has already shown in a prospective NHS study that AI-augmented reading found 12% more cancers while reducing unnecessary recalls and modeling a 30% workload reduction.

In the United States, several AI mammography systems have received FDA clearance. DeepHealth's SmartMammo showed a 22.7% increase in cancer detection for women with dense breast tissue. Clairity Breast received the first De Novo FDA authorization for AI that predicts five-year breast cancer risk from a standard mammogram. Google's mammography AI system, evaluated across NHS sites, achieved a cancer detection rate of 9.33 per 1,000 women compared to 7.54 for human readers alone.

In Sweden, following the MASAI results, AI-supported screening is being integrated into regional screening programs. In Germany, the PRAIM study's results have informed discussions about national-level implementation.

AI-supported mammography screening showed consistently favourable outcomes compared with standard double reading, with a non-inferior interval cancer rate, higher sensitivity, and the same specificity, while also reducing screen reading workload. — MASAI Trial, The Lancet, 2025

The evidence base now includes a Nature publication, multiple Lancet publications, a Nature Medicine publication, and prospective data from hundreds of thousands of women across multiple countries. No other AI medical application has this depth of clinical validation. The technology is not experimental. It is proven. The remaining gap is deployment.

06 — SignalThe Second Reader That Scales

The structural insight of AI breast cancer screening is not that machines are better than doctors. In many individual comparisons, the best radiologists still match or exceed AI. The insight is that AI solves a problem that training more radiologists cannot: it scales.

One radiologist plus AI matches two radiologists. That means every screening program in the world that cannot recruit a second reader now has one. Every rural clinic with a single radiologist reading mammograms alone — the standard in the United States — now has a double-reading system. Every country that was never going to train enough radiologists to launch population-based screening now has a path to one.

The constraint on breast cancer survival has always been stage at detection. The constraint on stage at detection has always been screening coverage. The constraint on screening coverage has always been the number of trained readers. AI removes the last constraint. Not perfectly. Not yet universally. But the evidence is no longer in question.

In wealthy countries, AI makes good screening programs better — catching interval cancers that human readers miss, reducing false positives that cause unnecessary anxiety and procedures, and freeing radiologists to spend more time on complex cases. In countries without screening programs, AI makes screening programs possible.

The question is no longer whether AI can read a mammogram as well as a radiologist. The answer, across five years of evidence from three countries and nearly 600,000 women, is yes. The question is how long it takes the institutions that control screening policy to act on what the data already shows.

∞

What If?

In 2022, breast cancer killed 666,000 women worldwide. Most of those deaths occurred where mammography screening does not exist — not because the technology is unavailable, but because there are not enough radiologists. In sub-Saharan Africa, fewer than ten per million people. A screening program requires two readers per mammogram. The math has never worked. But AI requires a GPU, a trained model, and an internet connection — not a decade of medical training or a salary. MASAI proved one radiologist plus AI matches two radiologists. PRAIM proved it at national scale. ScreenTrustCAD proved it halves workload. A government deploys mobile mammography vans — the hardware exists — and uploads images to a cloud AI that returns reads in seconds. Ten radiologists review only flagged cases. Screening capacity that took wealthy nations sixty years to build is replicated in months. AI can already read a mammogram. How many women will die between now and the moment their government decides the evidence is sufficient — while tumors grow from stage one to stage four and a proven system sits on a shelf, waiting for someone to say yes?

How did this land?

Sources

← Previous Chapter 07 The Library That Came Back 8 min read Next → Chapter 09 The Library No One Knew Existed 7 min read