We built an open benchmark to test GPT-5 "safe completion"

2 points | by agairola 9 hours ago

1 comments