Publications

(2024). RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs . Preprint 2024.

PDF

(2024). Aya 23: Open Weight Releases to Further Multilingual Progress. Preprint 2024.

PDF

(2024). Group Preference Optimization: Few-Shot Alignment of Large Language Models. ICLR 2024.

PDF Code