JanitorBench: A new LLM benchmark for multi-turn chats

25 points | by shep101 9 hours ago

27 comments