The fracture that got lost in translation: LLMs and the limits of classification from text

LLMs can apply OTA/AO classification to fracture radiology reports reliably at the broad level — but subgroup accuracy fails where it matters, and hallucinations occur in documented, specific ways. Here is what that means in practice.