Sunbelt Computer Software

PL/B Language Development and Support

holarissun (Hao Sun) · GitHub

holarissun

Follow

🎯

Focusing

Hao Sun holarissun

🎯

Focusing

Follow

PhD in Reinforcement Learning, LLM Alignment, RLHF

128 followers · 37 following

University of Cambridge
https://holarissun.github.io/
@HolarisSun

Achievements

Achievements

Highlights

Pro

Pinned Loading

RewardModelingBeyondBradleyTerry RewardModelingBeyondBradleyTerry Public

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Python 73 4
RewardShifting RewardShifting Public

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

Python 29 4
Prompt-OIRL Prompt-OIRL Public

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

Python 45 7
embedding-based-llm-alignment embedding-based-llm-alignment Public

Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Python 22 2
YunyiShen/ARM-FI YunyiShen/ARM-FI Public

Active reward modeling with last layer Fisher Information (ICML'25)

Python 7
InverseRLmeetsLLMs InverseRLmeetsLLMs Public

10 1