Large language models, when used alongside human clinical judgement, “substantially” improved diagnostic accuracy among a ...