Benchmarks — RSNA Benchmarks

Active

In Development

Active Development

CT Abdomen Benchmark

The inaugural RSNA CT Benchmark is a comprehensive evaluation framework for AI models interpreting emergency radiology CT abdomen cases. It focuses on acute diagnoses encountered in clinical practice, such as appendicitis, diverticulitis, and cholecystitis, and spans pathologies across liver, kidney, pancreas, bowel, and vascular structures. Ground truth is established by multi-reader consensus.

CT · Abdomen · Multi-diagnosis · VLM Evaluation · Multisite

Version: v0.1

Date: 2026

Sites: TBD


Planned

On the Roadmap

These benchmarks are in planning stages. Interested in contributing or leading one? Get in touch.

Planned

Chest X-Ray Benchmark

Multi-center evaluation of AI performance on frontal and lateral chest radiographs across a spectrum of thoracic pathology.

Radiograph · Chest · Planned

Upcoming

Brain MRI Benchmark

Structured assessment of AI interpretation across neuro MRI sequences for common and critical neurological diagnoses.

MRI · Neuro · Upcoming

Propose

Have an idea for a benchmark?

We welcome proposals for new benchmarks from the community. If you have a clinical domain, modality, or evaluation task in mind, we'd like to hear from you.

Benchmark proposal process coming soon.