Supervised fine tuning on curated data is reinforcement learning

71 points | by GabrielBianconi 3 days ago

19 comments