Subtitle: SuperCLUE-Safety, the first Chinese large-model multi-round adversarial safety benchmark, is released.
Greetings from a world where…
for the rest of the college football season, this status update will be devoted to tracking the Iowa Hawkeye offense's march to mediocrity
…As always, the searchable archive of all past issues is here. Please please subscribe here to support ChinAI under a Guardian/Wikipedia-style tipping model (everyone gets the same content but those who can pay support access for all AND compensation for awesome ChinAI contributors).
Feature Translation: SuperCLUE-Safety
Context: Every two months or so, we’ve been checking in with the SuperCLUE rankings, which aim to benchmark large language models from Chinese and international labs along different dimensions. In the previous update to the SuperCLUE benchmark, we saw Baidu's ErnieBot soar up the rankings, on the strength of its performance with Chinese-language particularities (e.g. idioms). This past week, the SuperCLUE team released a safety benchmark (link to [...]
---
First published:
September 18th, 2023
Source:
https://chinai.substack.com/p/chinai-237-safety-benchmarks-for
---
Narrated by TYPE III AUDIO.