AI benchmarks are a bad joke – and LLM makers are the ones laughing

147 points | by pseudolus 4 hours ago

60 comments