I don't think it says that. The paper argues very specifically that RL post-training is structured in a way that produces hallucinations, and it suggests ways to change it to minimize them.
That's cope. It's intrinsic to the way LLMs work.