arxiv:2605.09959
Jiaxin Huang
teapot123
AI & ML interests
None yet
Recent Activity
upvoted a paper about 6 hours ago
You Only Need Minimal RLVR Training: Extrapolating LLMs via Rank-1 Trajectories upvoted a paper 1 day ago
Process Rewards with Learned Reliability authored a paper 7 days ago
Generating Training Data with Language Models: Towards Zero-Shot
Language Understanding