Hi everyone,
I’m preparing my first arXiv submission and need an endorsement for the cs.LG category.
The paper studies transformer computation through the geometry of residual update trajectories. In brief, it shows:
• reasoning tokens occupy higher-dimensional task-aligned subspaces than syntactic or factual continuations
• projecting FFN updates into these subspaces causally improves reasoning confidence
• aligned reasoning trajectories emerge consistently across depth and across independently trained models
The work focuses on mechanistic interpretability using open-weights models (TinyLlama, Phi-2, Qwen).
I’m happy to share the PDF draft here or via DM. I’m only asking for an endorsement, i.e. confirmation that the work fits cs.LG, not a full review.
Thank you very much for your time.
