Weiqiang
Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games
We study policy optimization algorithms for computing correlated equilibria in multi-player general-sum Markov Games. Previous results …
Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
PDF
Cite
arXiv
Learning Thresholds with Latent Values and Censored Feedback
In this paper, we investigate a problem of actively learning threshold in latent space, where the unknown reward $g(\gamma, v)$ …
(*) Jiahao Zhang, Tao Lin, Weiqiang Zheng, Zhe Feng, Yifeng Teng, Xiaotie Deng
PDF
Cite
arXiv
Openreview
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback
We revisit the problem of learning in two-player zero-sum Markov games, focusing on developing an algorithm that is uncoupled, …
Yang Cai, Haipeng Luo, Chen-Yu Wei, Weiqiang Zheng
PDF
Cite
arXiv
Doubly Optimal No-Regret Learning in Monotone Games
We consider online learning in multi-player smooth monotone games. Existing algorithms have limitations such as (1) being only …
Yang Cai, Weiqiang Zheng
PDF
Cite
arXiv
PMLR
Accelerated Single-Call Methods for Constrained Min-Max Optimization
We study first-order methods for constrained min-max optimization. Existing methods either require two gradient calls or two …
Yang Cai, Weiqiang Zheng
PDF
Cite
arXiv
Openreview
Beyond the Worst Case: Semi-random Complexity Analysis of Winner Determination
The computational complexity of winner determination is a classical and important problem in computational social choice. Previous work …
Lirong Xia, Weiqiang Zheng
PDF
Cite
arXiv
DOI
Finite-Time Last-Iterate Convergence for Learning in Multi-Player Games
We study the question of last-iterate convergence rate of the extragradient algorithm by Korpelevich [1976] and the optimistic gradient …
Yang Cai, Argyris Oikonomou, Weiqiang Zheng
PDF
Cite
video
Openreview
Nash Convergence of Mean-Based Learning Algorithms in First Price Auctions
Understanding the convergence properties of learning dynamics in repeated auctions is a timely and important question in the area of …
Xiaotie Deng, Xinyan Hu, Tao Lin, Weiqiang Zheng
PDF
Cite
DOI
arXiv
video
Revenue and User Traffic Maximization in Mobile Short-Video Advertising
A new mobile attention economy has emerged with the explosive growth of short-video apps such as TikTok. In this internet market, three …
Dezhi Ran, Weiqiang Zheng, Yunqi Li, Kaigui Bian, Jie Zhang, Xiaotie Deng
PDF
Cite
DOI