arxiv:2601.05167
Langlin Huang
shrango
AI & ML interests
LLM Reasoning, Machine Translation
Recent Activity
upvoted a paper about 3 hours ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories upvoted a paper 1 day ago
Process Rewards with Learned Reliability upvoted a paper 9 days ago
G-Zero: Self-Play for Open-Ended Generation from Zero Data