分享

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

热度