Why AI language models choke on too much text Compute costs scale with the square of the input size. That's not great. image