Follow
Yan DAI (戴言)
Title
Cited by
Cited by
Year
Refined Regret for Adversarial MDPs with Linear Function Approximation
Y Dai, H Luo, CY Wei, J Zimmert
Proceedings of the 40th International Conference on Machine Learning 202 …, 2023
172023
Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits
J Huang, Y Dai, L Huang
Proceedings of the 39th International Conference on Machine Learning 162 …, 2022
172022
Follow-the-perturbed-leader for adversarial markov decision processes with bandit feedback
Y Dai, H Luo, L Chen
Advances in Neural Information Processing Systems 35, 11437-11449, 2022
152022
The Crucial Role of Normalization in Sharpness-Aware Minimization
Y Dai, K Ahn, S Sra
Advances in Neural Information Processing Systems 36, 67741-67770, 2023
92023
Variance-Aware Sparse Linear Bandits
Y Dai, R Wang, SS Du
International Conference on Learning Representations, 2023
82023
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
J Huang, Y Dai, L Huang
Proceedings of the 40th International Conference on Machine Learning 202 …, 2023
6*2023
Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise
K Ahn, Z Zhang, Y Kook, Y Dai
arXiv preprint arXiv:2402.01567, 2024
42024
Refined Sample Complexity for Markov Games with Independent Linear Function Approximation
Y Dai, Q Cui, SS Du
Proceedings of Thirty Seventh Conference on Learning Theory 247, 1260-1261, 2024
12024
Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks
Y Dai, L Huang
arXiv preprint arXiv:2408.16215, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–9