I don’t think they’re useless, and I was pleasantly surprised to see how effective training on expressions of uncertainty was (it got rid of 2/3 of confabulations and didn’t cause a big drop in answering when the model is correct), but empirically it doesn’t seem to be enough to fix things.
September 21, 2025 - 17:25 UTC