Does RL Incentivize Reasoning in LLMs Beyond the Base Model?

69 points | by leodriesch 11 hours ago

25 comments