arxiv:2508.19652
Haitao Mi
haitaominlp
AI & ML interests
Large Language Models
Recent Activity
upvoted a paper about 2 hours ago
TRON: Targeted Rule-Verifiable Online Environments for Visual Reasoning RL upvoted a paper 4 months ago
Kimi K2.5: Visual Agentic Intelligence upvoted a paper 4 months ago
Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning