We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.
You must be logged in to block users.
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Hi! I'm Aryaman, a Ph.D. student at Stanford NLP. See my website for more.
Stanford NLP Python library for understanding and improving PyTorch models via interventions
Python 888 109
Stanford NLP Python library for Representation Finetuning (ReFT)
Python 1.6k 133
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
Python 54 8
Framework for performing mechanistic evaluations of language model architectures on synthetic tasks
Python 13 1
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
Python 205 41
ADAG: Transluce's MLP neuron-level circuit tracing library
Python 31 4
There was an error while loading. Please reload this page.