分享

Beyond position: how rotary embeddings shape representations and memory in autoregressive transfomers

热度