Cached layer activations for steering vector experiments
Abdullah
amirali1985
AI & ML interests
Mechanistic interpretability, high dimensional geometry, persona role playing.
Recent Activity
new activity about 2 hours ago
thoughtworks/gemma_psychometrics_personas_responses:can y you put a license on this please? updated a dataset about 3 hours ago
thoughtworks/gemma_psychometrics_personas_responses updated a dataset about 15 hours ago
stride-influence/qwen-leakage-math-sweep