Each benchmark is a structured evaluation covering specific clinical domains, modalities, and diagnostic tasks.
The inaugural RSNA CT Benchmark is a comprehensive evaluation framework for AI models interpreting abdominal CT cases in emergency radiology. It focuses on acute diagnoses encountered in clinical practice, such as appendicitis, diverticulitis, and cholecystitis, and spans pathologies of the liver, kidneys, pancreas, bowel, and vascular structures, with multi-reader consensus ground truth.
The benchmarks below are in the planning stages. Interested in contributing to or leading one? Get in touch.
Multi-center evaluation of AI performance on frontal and lateral chest radiographs across a spectrum of thoracic pathology.
Structured assessment of AI interpretation across neuro MRI sequences for common and critical neurological diagnoses.
We welcome proposals for new benchmarks from the community. If you have a clinical domain, modality, or evaluation task in mind, we'd like to hear from you.
Benchmark proposal process coming soon.
Contact us to discuss →