分享

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

热度