Research

Research Interests

I am broadly interested in building intelligent systems that learn, decide, and optimize in complex, real-world environments. My current research spans deep learning for ranking and recommendation, multi-objective and constrained optimization, large language models for information retrieval and reasoning, uncertainty quantification, and online decision making.

More broadly, I work on stochastic optimization, optimal transport, robustness and reliability of deep learning, reinforcement learning and bandits, and neural architecture search.

I'm always open to collaboration and happy to chat — feel free to reach out via email.

Full list also on Google Scholar. ("*" indicates equal contribution)

2026

  • Beyond Test-Time Memory: State-Space Optimal Control for LLM Reasoning.
    Peihao Wang, Shan Yang, Xijun Wang, Tesi Xiao, Xin Liu, Changlong Yu, Yu Lou, Pan Li, Zhangyang Wang, Ming Lin. ICML 2026.
    [arXiv] [code]

2025

  • COS-DPO: Conditioned One-Shot Multi-Objective Fine-Tuning Framework.
    Yinuo Ren, Tesi Xiao, Michael Shavlovsky, Lexing Ying, Holakou Rahmanian. UAI 2025.
    [paper] [arXiv] [code]
    Early version: HyperDPO: Hypernetwork-based Multi-Objective Fine-Tuning Framework, NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning. [workshop]
  • Orbit: A Framework for Designing and Evaluating Multi-objective Rankers.
    Chenyang Yang, Tesi Xiao*, Michael Shavlovsky*, Christian Kästner, Tongshuang Wu. IUI 2025.
    [paper] [arXiv] [slides]
  • RewardRank: Optimizing True Learning-to-Rank Utility.
    Gaurav Bhatt, Kiran Koshy Thekumparampil, Tanmay Gangwani, Tesi Xiao, Leonid Sigal. Preprint, 2025.
    [arXiv]
  • An Efficient Algorithm for Entropic Optimal Transport under Martingale-type Constraints.
    Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Tesi Xiao, Lexing Ying. Preprint, 2025.
    [arXiv]

2024

  • Optimal Algorithms for Stochastic Bilevel Optimization under Relaxed Smoothness Conditions.
    Xuxing Chen*, Tesi Xiao*, Krishnakumar Balasubramanian. JMLR 2024.
    [paper]
  • Accelerating Sinkhorn Algorithm with Sparse Newton Iterations.
    Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Elisa Tardini, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying. ICLR 2024.
    [paper]
  • Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow.
    Yinuo Ren, Tesi Xiao, Tanmay Gangwani, Anshuka Rangi, Holakou Rahmanian, Lexing Ying, Subhajit Sanyal. AISTATS 2024.
    [paper] [code] [poster]
  • A Sinkhorn-type Algorithm for Constrained Optimal Transport.
    Xun Tang, Holakou Rahmanian, Michael Shavlovsky, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying. ArXiv preprint.
    [paper]

2023

  • A One-Sample Decentralized Proximal Algorithm for Non-Convex Stochastic Composite Optimization.
    Tesi Xiao*, Xuxing Chen*, Krishnakumar Balasubramanian, Saeed Ghadimi. UAI 2023.
    [paper] [poster]
  • Towards Sequential Counterfactual Learning to Rank.
    Tesi Xiao, Branislav Kveton, Sumeet Katariya, Tanmay Gangwani, Anshuka Rangi. SIGIR-AP 2023.
    [paper]

2022

  • A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization.
    Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi. NeurIPS 2022.
    [paper] [poster]
  • Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction.
    Tesi Xiao, Xia Xiao, Ming Chen, Youlong Cheng. CIKM DL4SR Workshop 2022.
    [paper]

2020 & Earlier

  • How Does Noise Help Robustness? Explanation and Exploration under the Neural SDE Framework.
    Xuanqing Liu, Tesi Xiao, Si Si, Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh. CVPR 2020.
    [paper]
    Early version: Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise, arXiv 2019. [arXiv]
  • Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions.
    Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi. Operations Research Letters.
    [paper]
  • Statistical Inference for Polyak-Ruppert Averaged Stochastic Zeroth-order Gradient Algorithm.
    Yanhao Jin*, Tesi Xiao*, Krishnakumar Balasubramanian. ArXiv preprint, 2021.
    [paper]

Optimization

  • COS-DPO: Conditioned One-Shot Multi-Objective Fine-Tuning Framework.
    Yinuo Ren, Tesi Xiao, Michael Shavlovsky, Lexing Ying, Holakou Rahmanian. UAI 2025.
    [paper] [arXiv] [code]
    Early version: HyperDPO: Hypernetwork-based Multi-Objective Fine-Tuning Framework, NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning. [workshop]
  • Optimal Algorithms for Stochastic Bilevel Optimization under Relaxed Smoothness Conditions.
    Xuxing Chen*, Tesi Xiao*, Krishnakumar Balasubramanian. JMLR 2024.
    [paper]
  • Accelerating Sinkhorn Algorithm with Sparse Newton Iterations.
    Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Elisa Tardini, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying. ICLR 2024.
    [paper]
  • Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow.
    Yinuo Ren, Tesi Xiao, Tanmay Gangwani, Anshuka Rangi, Holakou Rahmanian, Lexing Ying, Subhajit Sanyal. AISTATS 2024.
    [paper] [code] [poster]
  • An Efficient Algorithm for Entropic Optimal Transport under Martingale-type Constraints.
    Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Tesi Xiao, Lexing Ying. Preprint, 2025.
    [arXiv]
  • A Sinkhorn-type Algorithm for Constrained Optimal Transport.
    Xun Tang, Holakou Rahmanian, Michael Shavlovsky, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying. ArXiv preprint.
    [paper]
  • A One-Sample Decentralized Proximal Algorithm for Non-Convex Stochastic Composite Optimization.
    Tesi Xiao*, Xuxing Chen*, Krishnakumar Balasubramanian, Saeed Ghadimi. UAI 2023.
    [paper] [poster]
  • A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization.
    Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi. NeurIPS 2022.
    [paper] [poster]
  • Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions.
    Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi. Operations Research Letters.
    [paper]
  • Statistical Inference for Polyak-Ruppert Averaged Stochastic Zeroth-order Gradient Algorithm.
    Yanhao Jin*, Tesi Xiao*, Krishnakumar Balasubramanian. ArXiv preprint.
    [paper]

Recommender Systems & Ranking

  • RewardRank: Optimizing True Learning-to-Rank Utility.
    Gaurav Bhatt, Kiran Koshy Thekumparampil, Tanmay Gangwani, Tesi Xiao, Leonid Sigal. Preprint, 2025.
    [arXiv]
  • Orbit: A Framework for Designing and Evaluating Multi-objective Rankers.
    Chenyang Yang, Tesi Xiao*, Michael Shavlovsky*, Christian Kästner, Tongshuang Wu. IUI 2025.
    [paper] [arXiv] [slides]
  • Towards Sequential Counterfactual Learning to Rank.
    Tesi Xiao, Branislav Kveton, Sumeet Katariya, Tanmay Gangwani, Anshuka Rangi. SIGIR-AP 2023.
    [paper]
  • Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction.
    Tesi Xiao, Xia Xiao, Ming Chen, Youlong Cheng. CIKM DL4SR Workshop 2022.
    [paper]

Deep Learning

  • Beyond Test-Time Memory: State-Space Optimal Control for LLM Reasoning.
    Peihao Wang, Shan Yang, Xijun Wang, Tesi Xiao, Xin Liu, Changlong Yu, Yu Lou, Pan Li, Zhangyang Wang, Ming Lin. ICML 2026.
    [arXiv] [code]
  • How Does Noise Help Robustness? Explanation and Exploration under the Neural SDE Framework.
    Xuanqing Liu, Tesi Xiao, Si Si, Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh. CVPR 2020.
    [paper]
    Early version: Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise, arXiv 2019. [arXiv]