John Dang
Home
Papers
Group Preference Optimization: Few-Shot Alignment of Large Language Models
Many applications of large language models (LLMs), ranging from chatbots to creative writing, require nuanced subjective judgments that …
Siyan Zhao, John Dang, Aditya Grover
PDF
Code
Peering Through Preferences: Unraveling Feedback Acquisition for Aligning Large Language Models
Aligning large language models (LLMs) with human values and intents critically involves the use of human or AI feedback. While dense …
Hritik Bansal, John Dang, Aditya Grover
PDF
Code
Dataset