分享

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

热度