CTG / FHR Interpretation AI at Clinical Parity
#1 A 2024 study published in BJOG reported that GPT-4o achieved a mean CTG interpretation score of 77.86/100 on a standardized clinical rubric, compared with 80.43 for senior doctors, a gap of 2.57 points that did not reach statistical significance (p > 0.05). The model was used as released, without any domain-specific fine-tuning. Concurrently, CE-marked and FDA-cleared CTG AI platforms (K2 Medical Systems, Monica Healthcare, PeriGen) are already deployed in labor wards across the UK, US, and Australia, providing continuous automated tracing analysis with NICE category classification and deterioration alerts.