Sep 2025: Paper "Parameter-free Algorithms for the Stochastically Extended Adversarial Model" accepted to NeurIPS 2025. Joint work with current PhD student Shuche Wang, former postdoc Adarsh Barik, and collaborator Peng Zhao. The paper presents new parameter-free algorithms for the Stochastically Extended Adversarial (SEA) model, eliminating the need for pre-determined parameters (such as the Lipschitz constant of the loss function or domain diameter) by leveraging the Optimistic Online Newton Step (OONS) algorithm.
Online Decision Making, Multi-Armed Bandits, Reinforcement Learning Information Theory with Applications to Machine Learning Statistical Signal Processing
There are also multiple positions for talented
postdoctoral scholars. Postdoctoral scholars with strong publication records
and showing interest in the above research topics are also encouraged to
contact me to check with me if there are available positions. Please see this advertisement as well as this.
Best Arm Identification with Possibly Biased Offline Data Le Yang, Vincent Y. F. Tan, and Wang Chi Cheung
Proc. of the 41st Conference on Uncertainty in Artificial Intelligence (UAI), Rio de Janeiro, Brazil, Jul 2025 (AR ≈ 30.7%)
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms [Poster] [Slides]
Yunlong Hou, Fengzhuo Zhang, Cunxiao Du, Xuan Zhang, Jiachun Pan, Tianyu Pang, Chao Du, Vincent Y. F. Tan, and Zhuoran Yang
Proc. of the 42nd International Conference on Machine Learning (ICML), Vancouver, Canada, Jul 2025 (AR ≈ 26.9%)