The Magic of Attention Mechanisms in Language Models | Rabbitt Learning