Kimi K1.5: Scaling Reinforcement Learning with LLMs

200 points | by noch 6 days ago

34 comments