分享

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

热度