Yes we knew that as soon as the Attention Is All You Need paper was published, it follows automatically from the fact that the output string of tokens is generated statistically from the input string & the meaning of the strings is not part of the model at all.
1
0
11
There is a joke about the content-free vapidity of text generated mindlessly by rules, in Tristram Shandy, written more than two hundred years ago.
1
0
9
another sign I should try to read that again
1
0
3