manuscript

From Average-Iterate to Last-Iterate Convergence in Games: A Reduction and Its Applications
COMAL: A Convergent Meta Algorithm for Aligning LLMs with General Preferences