分享

Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning

热度