Search-R1: Training LLMs to Reason and Leverage Search Engines with RL

96 points | by jonbaer a day ago

12 comments