John Dang
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
Aligning large language models (LLMs) with human values and intents critically involves the use of human or AI feedback. While dense …
Hritik Bansal, John Dang, Aditya Grover