The fracture that got lost in translation: LLMs and the limits of classification from text

LLMs can apply OTA/AO classification to fracture radiology reports reliably at the broad level — but subgroup accuracy fails where it matters, and hallucinations occur in documented, specific ways. Here is what that means in practice.

Confident and wrong: what a hand fracture study reveals about AI’s most dangerous failure mode

Evidence-reviewed. Citations throughout. The most dangerous thing about a wrong answer is not that it is wrong. It is that it sounds confident. In orthopaedic practice, this problem has a name everyone recognises: the missed scaphoid. A normal-looking X-ray. An unremarkable report. A patient who returns six weeks later with avascular necrosis and a question … Read more