Abdullah Akgül
Postdoctoral Researcher · University of Southern Denmark
ADIN Lab · Odense, Denmark
I am a Postdoctoral Researcher at the University of Southern Denmark, working in the ADIN Lab on sample-efficient reinforcement learning. My research builds probabilistic algorithms that make agents learn faster by treating uncertainty as a signal rather than noise. Three of my algorithms rank first in sample efficiency on their respective benchmarks, with publications at NeurIPS, ICML, ICLR, and TMLR.
My work covers three reinforcement learning settings. For offline RL, MOMBO (NeurIPS 2024) replaces high-variance Monte Carlo Bellman targets with deterministic moment matching, achieving the best convergence rate on D4RL locomotion benchmarks alongside a provably tighter suboptimality bound than existing methods. For non-stationary environments, EPPO (TMLR 2025) equips standard policy optimization with an evidential critic that simultaneously preserves the network capacity to keep learning and directs exploration toward regions where dynamics have shifted. For online RL, DAIF (ICML 2026) integrates Active Inference into distributional RL by formulating quantile regression as Bayesian inference under a Normal-Inverse-Gamma model, enabling Expected Free Energy-driven exploration without a learned dynamics model. Across all three settings, the finding is consistent: explicit probabilistic representations make agents measurably faster and more robust.
news
| Jun 01, 2026 | Paper “Weighted Sequential Bayesian Inference for Non-Stationary Linear Contextual Bandits” accepted to UAI 2026 (Conference on Uncertainty in Artificial Intelligence). |
|---|---|
| May 18, 2026 | Successfully defended PhD thesis Probabilistic Reinforcement Learning for Sample-Efficient Control at the University of Southern Denmark. |
| Apr 30, 2026 | Paper DAIF “Distributional Active Inference” accepted to ICML 2026. |
| Feb 01, 2026 | Started as Research Assistant – Postdoctoral Researcher at the University of Southern Denmark, continuing research on probabilistic reinforcement learning for sample-efficient control. |
| Jan 26, 2026 | Paper iS-QL “Bridging the Performance-Gap between Target-free and Target-based Reinforcement Learning” accepted to ICLR 2026. |