|
Alex Ayoub
I am a PhD candidate in Computing Science at the
University of Alberta,
advised by Csaba Szepesvari and Dale Schuurmans.
My research focuses on reinforcement learning theory, efficient reasoning in language models,
and decision-focused machine learning systems.
I have worked with Google DeepMind, Amazon, Netflix, Spotify, Morgan Stanley, and Huawei Noah's Ark Lab.
Email /
CV /
Scholar /
LinkedIn /
GitHub
|
|
Research
I work on sample-efficient reinforcement learning with function approximation,
theory-driven algorithm design, and practical methods for large-scale reasoning and recommendation.
Recent work includes discounted reinforcement learning for efficient reasoning,
regression correction in RL, and objective design for recommender systems.
|
|
ICLR 2026
|
Learning to Reason Efficiently with Discounted Reinforcement Learning
A. Ayoub, K. Asadi, D. Schuurmans, C. Szepesvari, K. Bouyarmane
ICLR, 2026
|
|
NeurIPS 2025
|
Eluder Dimension: Localise It!
A. Bakhtiari*, A. Ayoub*, S. Robertson, D. Janz, C. Szepesvari
NeurIPS, 2025 (Spotlight, top 3%)
|
|
RLC 2025
|
Rectifying Regression in Reinforcement Learning
A. Ayoub*, D. Szepesvari*, A. Bakhtiari, C. Szepesvari, D. Schuurmans
Reinforcement Learning Conference, 2025
|
|
WWW 2025
|
Does Weighting Improve Matrix Factorization for Recommender Systems?
A. Ayoub, S. Robertson, D. Liang, H. Steck, N. Kallus
The Web Conference, 2025
|
|
NeurIPS 2024
|
Almost Free: Self-Concordance in Natural Exponential Families and an Application to Bandits
S. Liu*, A. Ayoub*, F. Sentenac, X. Tan, C. Szepesvari
NeurIPS, 2024
|
|
RLC 2024
|
Mitigating the Curse of Horizon in Monte-Carlo Returns
A. Ayoub, D. Szepesvari, F. Zanini, B. Chan, D. Gupta, B. Castro da Silva, D. Schuurmans
Reinforcement Learning Conference, 2024
|
|
ICML 2024
|
Switching the Loss Reduces the Cost in Batch Reinforcement Learning
A. Ayoub, K. Wang, V. Liu, S. Robertson, J. McInerney, D. Liang, N. Kallus, C. Szepesvari
ICML, 2024
|
|
AISTATS 2024
|
Exploration via Linearly Perturbed Loss Minimisation
D. Janz*, S. Liu*, A. Ayoub*, C. Szepesvari
AISTATS, 2024 (Oral, top 1%)
|
|
NeurIPS 2023
|
Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Z. Zhang, J. Kirschner, J. Zhang, F. Zanini, A. Ayoub, D. Schuurmans
NeurIPS, 2023
|
|
TMLR 2023
|
Resmax: An Alternative Soft-Greedy Operator for Reinforcement Learning
E. Miahi, R. MacQueen, A. Ayoub, A. Masoumzadeh, M. White
Transactions on Machine Learning Research, 2023
|
|
ICML 2021
|
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
H. Ishfaq*, Q. Cui*, V. Nguyen*, A. Ayoub*, Z. Yang, Z. Wang, D. Precup, L. F. Yang
ICML, 2021
|
|
ICML 2020
|
Model-Based Reinforcement Learning with Value-Targeted Regression
A. Ayoub, Z. Jia, C. Szepesvari, M. Wang, L. Yang
ICML, 2020
|
Professional Experience
Google DeepMind, Student Researcher - Frontier AI Unit (London, UK), Aug 2025-Present
Amazon, Applied Scientist Intern - Generative AI and LLM Reasoning (Seattle, WA), Jun 2025-Aug 2025
Netflix, Machine Learning Research Intern (Los Gatos, CA), Jun 2024-Nov 2024 and Jun 2023-Aug 2023
Morgan Stanley, Machine Learning Research Intern (New York, NY, remote), Feb 2024-May 2024
Spotify, Research Scientist Intern (New York, NY), Jun 2022-Aug 2022
Huawei Noah's Ark Lab, Research Scientist Intern (Edmonton, AB), Oct 2021-May 2023
|
Education
University of Alberta, PhD in Computing Science, Sep 2021-May 2026 (Expected)
Supervisors: Csaba Szepesvari and Dale Schuurmans
University of Alberta, MSc in Computing Science, Sep 2019-Sep 2021
Thesis: Towards Sample Efficient Reinforcement Learning with Function Approximation (nominated for Outstanding Thesis Award)
Florida State University, BSc in Computational Science and Applied Mathematics, Jun 2015-May 2019
|
Service, Awards, and Skills
Awards and Service: Inaugural Netflix Graduate Research Fellowship; Co-organizer, RL Theory Workshop (2023, 2024); Workflow Chair, ICML 2022; University of Alberta Graduate Research Assistant Fellowship; Florida State University Research Assistant Grant; FSU President's List (2018).
Technical Skills: Python, C/C++, MATLAB, PyTorch, JAX, CuPy, Mathematica.
Teaching: Teaching Assistant, University of Alberta (Sep 2019-Present).
|
|