分享

GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks

热度