Lexsi/audit-harden-undefended-SFT-qwen3-4b-dolly
4B • Updated • 25
Frontier research around Safe and aligned intelligence
Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution
$C$-$ΔΘ$: Circuit-Restricted Weight Arithmetic for Selective Refusal