Step 1 of 4
The Problem
1.1 Industry: Healthcare — Medical Imaging and Radiology
Medical imaging is a $45 billion global market, with chest X-rays being the most commonly ordered diagnostic imaging procedure worldwide. Hospitals and radiology departments process millions of scans annually, and AI-assisted diagnosis has become a critical area of investment. Machine learning models trained to detect pathologies from X-rays can reduce diagnostic errors, accelerate reporting, and extend diagnostic capability to underserved regions.
1.2 Company Profile: RadianceAI
RadianceAI is a Series B medical AI startup headquartered in Boston, Massachusetts. The company develops AI-powered diagnostic tools for chest X-ray interpretation, deployed across 120 hospitals in the United States and Southeast Asia. RadianceAI's flagship product, RadianceScreen, detects 14 common chest pathologies (pneumonia, cardiomegaly, pleural effusion, etc.) with radiologist-level accuracy. The company employs 85 people, including a team of 22 machine learning engineers and 8 board-certified radiologists who serve as clinical advisors.
Key Metrics:
- 2.4 million X-rays processed in 2025
- 94.2% mean AUC across the 14 common pathologies
- FDA 510(k) clearance for the core product
- $38 million in annual recurring revenue
1.3 Business Challenge: The Rare Disease Gap
RadianceAI's model performs well on common pathologies but struggles with rare conditions. The company has identified 6 critical rare pathologies — including pulmonary alveolar proteinosis (PAP), lymphangioleiomyomatosis (LAM), and pulmonary Langerhans cell histiocytosis (PLCH) — that collectively affect fewer than 200,000 patients in the US annually.
The problem is acute:
-
Data scarcity: Across 2.4 million scans, fewer than 800 confirmed cases of these 6 rare conditions exist. Standard deep learning models require thousands to tens of thousands of examples per class.
-
Clinical impact: Misdiagnosis of rare pulmonary diseases leads to delayed treatment, with average diagnostic delays exceeding 3 years for some conditions. RadianceAI's clinical advisory board reports that 40% of PAP patients receive at least one incorrect diagnosis before the condition is identified.
-
Regulatory requirement: To pursue FDA clearance for rare disease detection, RadianceAI needs to demonstrate model performance on a statistically significant number of cases — far more than currently available.
-
Market opportunity: Hospitals are willing to pay a 35% premium for AI tools that can flag rare conditions. This represents a $15 million incremental ARR opportunity.
1.4 Stakes and Constraints
Stakes:
- Patient safety: missed rare diagnoses lead to disease progression and potentially irreversible lung damage
- Regulatory: FDA requires robust validation sets; fewer than 100 cases per condition is insufficient
- Commercial: capturing the rare disease module would differentiate RadianceAI from 4 direct competitors
Constraints:
- Patient privacy (HIPAA): real patient data cannot be shared or artificially duplicated
- Clinical validity: synthetic images must be radiologically plausible — reviewed by board-certified radiologists
- Compute budget: $50,000 allocated for training infrastructure (GPU compute on AWS)
- Timeline: 6 months to deliver a validated synthetic data pipeline
1.5 Proposed Solution
RadianceAI will use Denoising Diffusion Probabilistic Models (DDPMs) to generate synthetic chest X-rays for the 6 rare pathologies. The diffusion model will be trained on the existing (small) set of confirmed rare disease X-rays, augmented with class-conditional generation techniques. The synthetic images will then be used to augment the training set for the downstream diagnostic classifier.
The choice of DDPMs over alternatives:
- GANs: Mode collapse is unacceptable in medical imaging — the model must produce diverse pathological presentations, not variations of a single pattern. DDPMs are known for superior mode coverage.
- VAEs: VAE-generated images tend to be blurry, which would obscure fine-grained pathological features critical for diagnosis.
- DDPMs: Produce high-fidelity, diverse images without mode collapse. The iterative denoising process naturally captures multi-scale features — from large anatomical structures to subtle opacities.