hodfa840 (Hoda Fakhar) · GitHub
Skip to content
View hodfa840's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report hodfa840

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
hodfa840/README.md

Hi, I'm Hoda

PhD student at Linköping University in the AI division (AIICS), working on LLM reasoning and alignment.

My research focuses on understanding how large language models think, from analyzing their internal geometry to improving how they learn through reinforcement learning and verifiable rewards.

Website: hodfa840.github.io

What I work on

  • LLM interpretability: detecting reasoning failures from hidden-state geometry (VANE)
  • LLM safety: type-aware unlearning of memorized secrets (PSTU)
  • RL for language models: GRPO, DPO, SDPO, and flow matching approaches to LLM fine-tuning
  • Verifiable rewards: grammar-guided RLVF for low-resource languages (geysir)

Selected work

Project What it does
VANE Predicts LLM reasoning correctness from hidden-state trajectory geometry. No training required.
PSTU Removes memorized secrets from LLMs via per-type saliency in weight space. Training-free, runs in seconds.
geysir RLVF for Icelandic NLP — Greynir grammar parser as a verifiable reward oracle. Compares GRPO, DPO, SDPO, and Delta-DPO.
ml-observability-argus Production MLOps platform: drift detection, automated retraining & root-cause analysis on NYC taxi data.

Get in touch

hoda.fakhar@liu.se

Pinned Loading

  1. vane vane Public

    VANE: training-free LLM reasoning failure detection via geometric analysis of hidden-state trajectories (curvature, jerk, geodesic deviation) — up to +23.7pp accuracy gain over log-prob selection.

    Python

  2. pstu pstu Public

    Per-Secret-Type Unlearning (PSTU): training-free LLM secret removal that adapts unlearning intensity per secret type, achieving 0 memorized secrets with <3% perplexity increase.

    Jupyter Notebook

  3. geysir geysir Public

    RLVF for Icelandic text generation — Greynir grammar parser as a verifiable reward oracle. Compares GRPO, DPO, SDPO, and Delta-DPO.

    Python

  4. -RetailMind-Self-Healing-LLM-for-Store-Intelligence -RetailMind-Self-Healing-LLM-for-Store-Intelligence Public

    RetailMind: Self-Healing LLM for Store Intelligence

    Python

  5. face-recognition-app-flask-python face-recognition-app-flask-python Public

    Flask-based facial biometrics dashboard with real-time gender, age, emotion, and ethnicity analysis using DeepFace (VGG-Face, ArcFace) — supports webcam and image upload.

    HTML

  6. ml-observability-argus ml-observability-argus Public

    Production MLOps platform: drift detection, automated retraining & root-cause analysis on NYC taxi data. FastAPI · scikit-learn · MLflow · Streamlit

    Python