
Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning
