Weiqiang
Working Papers
Yixin Liu, Argyris Oikonomou, Weiqiang Zheng, Yang Cai, Arman Cohan (2024). COMAL: A Convergent Meta Algorithm for Aligning LLMs with General Preferences. NeurIPS workshop on Fine-Tuning in Modern Machine Learning: Principles and Scalability (FITML). Selected for Oral Presentation.
Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng (2023). Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games. Working Paper.