PhD student at Linköping University in the AI division (AIICS), working on LLM reasoning and alignment.
My research focuses on understanding how large language models think, from analyzing their internal geometry to improving how they learn through reinforcement learning and verifiable rewards.
Website: hodfa840.github.io
- LLM interpretability: detecting reasoning failures from hidden-state geometry (VANE)
- LLM safety: type-aware unlearning of memorized secrets (PSTU)
- RL for language models: GRPO, DPO, SDPO, and flow matching approaches to LLM fine-tuning
- Verifiable rewards: grammar-guided RLVF for low-resource languages (geysir)



