Reinforcement Learning from Human Feedback
100 points - today at 12:53 PM
SourceComments
dang today at 6:16 PM
Related. Others?
RLHF Book - https://news.ycombinator.com/item?id=42902936 - Feb 2025 (37 comments)
verdverm today at 2:47 PM
Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials
klelatti today at 1:46 PM
Web version with links, etc:
iisweetheartii today at 2:01 PM
[dead]