LLM-Based and Algorithmic Test Scoring Already Deployed
#1Automated scoring of neuropsychological test batteries has been functionally complete for standardized measures since Q-global and PARiConnect achieved widespread adoption, but the 2025 demonstration of ChatGPT-4.5 scoring BICAMS batteries from raw inputs represents a qualitative shift: scoring can now occur outside proprietary platforms, using general-purpose AI, without licensing agreements. Rule-based scoring engines have existed for years; what is new is that LLMs can now score tests from unstructured inputs (handwritten response sheets photographed, verbal descriptions of performance) with accuracy rivaling trained technicians. This threatens even the manual scoring revenue retained in legacy workflows.