I don't think it says that? The paper argues very specifically that RL post-training is structured in a way that produces hallucinations, and it suggests ways to change that training to minimize them.
September 21, 2025 - 17:38 UTC