Hacker News
new
show
ask
jobs
Reinforcement Learning from Human Feedback
97 points
by
onurkanbkrc
10 hours ago
5
comments
story
https://arxiv.org/abs/2504.12501
loading...