
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning
