AI Clinical Decision Support Reaching Human Parity
#1AI systems have crossed documented performance parity with specialist clinicians in multiple diagnostic domains. GPT-4 achieved 86.7% on the USMLE in the Kung et al. (2023) study, above the passing threshold and comparable to third-year residents. Google's Med-PaLM 2 scored 85%+ and produced answers rated comparable to physician responses on clinical questions. In imaging, AI outperforms radiologists for mammography (McKinney, Nature 2020), dermatologists for melanoma detection (Esteva, Nature 2017), and ophthalmologists for diabetic retinopathy (Gulshan, JAMA 2016) — with the gap widening as models scale. Clinical decision support systems are now embedded in major EHRs and generate real-time differential diagnoses, risk scores, and protocol recommendations that parallel what practitioners generate manually.