Abdullah Akgül

Postdoctoral Researcher · University of Southern Denmark

AbdullahAkgul.jpeg

🔍 Open to Research Scientist & MLE roles

ADIN Lab · Odense, Denmark

📄 Download CV (PDF)

🎨 Download Portfolio (PDF)

I build probabilistic algorithms that make reinforcement learning agents learn faster by treating uncertainty as a signal rather than noise. I am a Postdoctoral Researcher at the University of Southern Denmark (ADIN Lab). Three of my algorithms — MOMBO on offline RL, EPPO on non-stationary control, and DAIF on online control — rank first in sample efficiency on their respective benchmarks, with publications at NeurIPS, ICML, ICLR, and TMLR.

My background spans industry deployment, probabilistic modeling, and reinforcement learning. Before my PhD, I built a fraud detection system using deep metric learning at Vakifbank, and published on federated Bayesian networks, uncertainty quantification, and continual learning of dynamical systems. Across all of it, the same question recurs: how should a model represent what it does not know, and how should that uncertainty shape its decisions? My PhD focuses that question on reinforcement learning across three settings: offline learning from fixed datasets, rapid adaptation to non-stationary dynamics, and sample-efficient online exploration, where probabilistic representations consistently prove to be the decisive ingredient.

I am actively looking for research scientist or machine learning engineer roles in industry, where I can apply these methods to real-world sequential decision-making, robotics, and autonomous systems problems. Beyond research, I contribute open-source codebases (ObjectRL, MOMBO, EPPO) adopted by the research community, serve as a reviewer for NeurIPS, TNNLS, WACV, and EWRL, and have mentored MSc students whose work led to publications at TMLR and NeurIPS.

news

Jun 01, 2026 Paper “Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits” accepted to UAI 2026 (Conference on Uncertainty in Artificial Intelligence).
May 18, 2026 Successfully defended PhD thesis Probabilistic Reinforcement Learning for Sample-Efficient Control at the University of Southern Denmark.
Apr 30, 2026 Paper DAIF “Distributional Active Inference” accepted to ICML 2026.
Feb 01, 2026 Started as Research Assistant – Postdoctoral Researcher at the University of Southern Denmark, continuing research on probabilistic reinforcement learning for sample-efficient control.
Jan 26, 2026 Paper iS-QL “Bridging the Performance-Gap between Target-free and Target-based Reinforcement Learning” accepted to ICLR 2026.

selected projects

ICML
Distributional Active Inference
2026• First Author• Featured Work
Best average rank across 19 continuous control tasks on three benchmark suites, with up to +62% sample efficiency over the next-best baseline. Integrates Active Inference into distributional RL without a world model. ICML 2026.
NeurIPS
MOMBO: Deterministic Uncertainty Propagation for Offline RL
2024• First Author• Featured Work
Best convergence rate (avg AULC rank 1.2) across all 12 D4RL offline benchmarks. Deterministic moment matching replaces Monte Carlo Bellman targets, with provably tighter suboptimality bounds. NeurIPS 2024.
TMLR
EPPO: Evidential Proximal Policy Optimization
2025• First Author• Featured Work
State-of-the-art in non-stationary control: average rank 1.5 across 10+ environments. Evidential critic simultaneously preserves plasticity and drives directed exploration from a single probabilistic framework. TMLR 2025.
arXiv
ObjectRL: An Object-Oriented Reinforcement Learning Codebase
2025• 2nd Author• Featured Work
Extending SAC to a new algorithm takes roughly 5 lines: just override the two methods that change. Full OOP codebase where encapsulation, inheritance, and polymorphism map directly to RL algorithm components. arXiv 2025.

selected publications

  1. ICLR
    isql.png
    Bridging the performance-gap between target-free and target-based reinforcement learning
    T. Vincent, Y. Tripathi, T. Faust, A. Akgül, and 4 more authors
    In International Conference on Learning Representations, 2026
  2. ICML
    daif.png
    Distributional Active Inference
    A. Akgül, G. Baykal, M. Haußmann, M. M. Çelikok, and 1 more author
    In International Conference on Machine Learning, 2026
  3. NeurIPS
    mombo_mm_vs_mc.png
    Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning
    A. Akgül, M. Haußmann, and M. Kandemir
    In Neural Information Processing Systems, 2024
  4. ICLR
    etp.png
    Evidential Turing Processes
    M. Kandemir, A. Akgül, M. Haußmann, and G. Unal
    In International Conference on Learning Representations, 2022
  5. L4DC
    cddp.png
    Continual Learning of Multi-modal Dynamics with External Memory
    A. Akgül, G. Unal, and M. Kandemir
    In Proceedings of The 6th Annual Learning for Dynamics and Control Conference, 2024
  6. TMLR
    eppo.png
    Overcoming Non-stationary Dynamics with Evidential Proximal Policy Optimization
    A. Akgül, G. Baykal, M. Haußmann, and M. Kandemir
    Transactions on Machine Learning Research, 2025