Publications

Conferences:


  1. NeurIPS
    Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers
    Vasan, G., Elsayed, M., Azimi, S. A., He, J., Shahriar, F., Bellinger, C., White, M., & Mahmood, A. R.,
    Neural Information Processing Systems (NeurIPS), 2024
  1. RLC
    Weight Clipping for Deep Continual and Reinforcement Learning
    Elsayed, M., Lan, Q., Lyle, C., & Mahmood, A. R.,
    Reinforcement Learning Conference (RLC), 2024
  1. ICML
    Revisiting Scalable Hessian Diagonal Approximations for Applications in Reinforcement Learning
    Elsayed, M., Farrahi, H., Dangel, F., & Mahmood, A. R.,
    International Conference on Machine Learning (ICML), 2024
  1. ICLR
    Addressing Loss of Plasticity and Catastrophic Forgetting in Continual Learning
    Elsayed, M., & Mahmood, A. R.,
    International Conference on Learning Representations (ICLR), 2024

Workshops and Preprints:


  1. arXiv
    Streaming Deep Reinforcement Learning Finally Works
    Elsayed, M., Vasan, G., & Mahmood, A. R.,
    arXiv preprint arXiv:2410.14606, 2024
  1. NeurIPS
    Deep Reinforcement Learning Without Experience Replay, Target Networks, or Batch Updates
    Elsayed, M., Vasan, G., & Mahmood, A. R.,
    NeurIPS Workshop on Fine-Tuning in Modern Machine Learning, 2024
  1. NeurIPS
    Utility-based Perturbed Gradient Descent: An Optimizer for Continual Learning
    Elsayed, M., & Mahmood, A. R.,
    NeurIPS Workshop on Optimization for Machine Learning, 2022
  1. NeurIPS
    HesScale: Scalable Computation of Hessian Diagonals
    Elsayed, M., & Mahmood, A. R.,
    NeurIPS Workshop on Higher-Order Optimization in Machine Learning, 2022
  1. NeurIPS
    ULTRA: A Reinforcement Learning Generalization Benchmark for Autonomous Driving
    Elsayed, M., Hassanzadeh, K., Nguyen, N. M., Alban, M., Zhu, X., Graves, D., & Luo, J.,
    NeurIPS Workshop on Machine Learning for Autonomous Driving, 2020

Theses:


  1. Thesis
    Investigating Generate and Test for Online Representation Search with Softmax Outputs
    Elsayed, M.,
    Master's Thesis, University of Alberta, 2022