surgehq.ai
↩
Explaining Reinforcement Learning with Human Feedback (RLHF)
2023-01-04 18:27:44 (Hacker News)
Source:
Hacker News
ChatGPT Crushes Google on Coding Queries, and Matches It at General Information
2022-12-20 16:59:30 (Hacker News)
Source:
Hacker News
HellaSwag: 36% of this popular large language model benchmark contains errors
2022-12-05 17:07:35 (Hacker News)
Source:
Hacker News
Twitter’s Egregious Content Moderation Failures
2022-11-09 17:00:46 (Hacker News)
Source:
Hacker News
↩