Talk

One-Pass Bandit Learning for RLHF and Function Approximation
2026.06.18, School of Information Science and Technology, ShanghaiTech University, Shanghai.
One-Pass Bandit Learning for RLHF and Function Approximation
2026.04.18, 第二届人工智能基础大会（FAIC 2026）, 上海.
One-Pass Bandit Learning for RLHF and Function Approximation
2026.03.24, The 2nd RIKEN AIP - IIT Hyderabad Joint Workshop, online.
One-Pass Bandit Learning for RLHF and Function Approximation
2025.11.23, 第二十届中国人工智能基础年会 (CFAI 2025)·强化学习论坛, 湖南长沙.
One-Pass Bandit Learning for RLHF and Function Approximation
2025.09.22, School of Data Science, CUHK (Shenzhen), Shenzhen.
One-Pass Bandit Learning: Nonlinear Reward and Heavy-tailed Noise
2025.07.22, 2025 INFORMS International Meeting, Singapore.
Online Ensemble: A Theoretical Framework for Non-stationary Online Learning
2025.05.31, 桂林电子科技大学·计算机与信息安全学院, 广西桂林.
Provable Efficiency in Online RL: Function Approximation and RLHF
2025.04.29, NUS ISEM Department Seminar, Singapore.
Gradient-Variation Online Learning: Theory and Applications
2024.06.07, LAMDA-RIKENAIP joint workshop on ML, Nanjing.
Universal Online Learning with Gradient-Variation Regret
2023.12.18, UC Santa Barbara, US.
Online Ensemble: A Theoretical Framework for Non-stationary Online Learning
2023.11.23, HKUST (Guangzhou) AI Seminar, Guangzhou.
在线集成：非稳态在线学习的理论框架
2023.11.04, 第二十一届机器学习及其应用研讨会 (MLA 2023), 江苏南京.
基于"在线集成"框架的非稳态在线学习.
2023.06.10, 第十二届江苏省计算机大会 (JSCC 2023)·人工智能前沿论坛, 江苏溧阳.
Non-stationary Online Learning: An Online Ensemble Framework
2022.11.03, 中国人民大学·高瓴人工智能学院, 北京.
Bandit Convex Optimization in Non-stationary Environments
2021.07.10, CSIAM-BDAI 2021·在线学习与优化论坛, 四川成都.

Talks