Abdullah Akgül

Postdoctoral Researcher · University of Southern Denmark

AbdullahAkgul.jpeg

ADIN Lab · Odense, Denmark

I am a Postdoctoral Researcher at the University of Southern Denmark, working in the ADIN Lab on sample-efficient reinforcement learning. My research builds probabilistic algorithms that make agents learn faster by treating uncertainty as a signal rather than noise. Three of my algorithms rank first in sample efficiency on their respective benchmarks, with publications at NeurIPS, ICML, ICLR, and TMLR.

My work covers three reinforcement learning settings. For offline RL, MOMBO (NeurIPS 2024) replaces high-variance Monte Carlo Bellman targets with deterministic moment matching, achieving the best convergence rate on D4RL locomotion benchmarks alongside a provably tighter suboptimality bound than existing methods. For non-stationary environments, EPPO (TMLR 2025) equips standard policy optimization with an evidential critic that simultaneously preserves the network capacity to keep learning and directs exploration toward regions where dynamics have shifted. For online RL, DAIF (ICML 2026) integrates Active Inference into distributional RL by formulating quantile regression as Bayesian inference under a Normal-Inverse-Gamma model, enabling Expected Free Energy-driven exploration without a learned dynamics model. Across all three settings, the finding is consistent: explicit probabilistic representations make agents measurably faster and more robust.

news

Jun 01, 2026 Paper “Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits” accepted to UAI 2026 (Conference on Uncertainty in Artificial Intelligence).
May 18, 2026 Successfully defended PhD thesis Probabilistic Reinforcement Learning for Sample-Efficient Control at the University of Southern Denmark.
Apr 30, 2026 Paper DAIF “Distributional Active Inference” accepted to ICML 2026.
Feb 01, 2026 Started as Research Assistant – Postdoctoral Researcher at the University of Southern Denmark, continuing research on probabilistic reinforcement learning for sample-efficient control.
Jan 26, 2026 Paper iS-QL “Bridging the Performance-Gap between Target-free and Target-based Reinforcement Learning” accepted to ICLR 2026.

selected publications

  1. ICLR
    isql.png
    Bridging the performance-gap between target-free and target-based reinforcement learning
    T. Vincent, Y. Tripathi, T. Faust, A. Akgül, and 4 more authors
    In International Conference on Learning Representations, 2026
  2. ICML
    daif.png
    Distributional Active Inference
    A. Akgül, G. Baykal, M. Haußmann, M. M. Çelikok, and 1 more author
    In International Conference on Machine Learning, 2026
  3. NeurIPS
    mombo_mm_vs_mc.png
    Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
    A. Akgül, M. Haußmann, and M. Kandemir
    In Neural Information Processing Systems, 2024
  4. ICLR
    etp.png
    Evidential Turing Processes
    M. Kandemir, A. Akgül, M. Haußmann, and G. Unal
    In International Conference on Learning Representations, 2022
  5. FL-NeurIPS
    bfl.png
    How to Combine Variational Bayesian Networks in Federated Learning
    A. Ozer, K.B. Buldu, A. Akgül, and G. Unal
    In Workshop on Federated Learning: Recent Advances and New Challenges (in Conjunction with NeurIPS 2022), 2022
  6. L4DC
    cddp.png
    Continual Learning of Multi-modal Dynamics with External Memory
    A. Akgül, G. Unal, and M. Kandemir
    In Proceedings of The 6th Annual Learning for Dynamics and Control Conference, 2024
  7. TMLR
    eppo.png
    Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
    A. Akgül, G. Baykal, M. Haußmann, and M. Kandemir
    Transactions on Machine Learning Research, 2025