"The models particularly struggle with open-ended diagnostic reasoning.”
The models struggle with reasoning ... period.
Any apparent reasoning ability is really a statistical simulation thereof drawing on similarity with training data.
"The models particularly struggle with open-ended diagnostic reasoning.”
The models struggle with reasoning ... period.
Any apparent reasoning ability is really a statistical simulation thereof drawing on similarity with training data.