Hejian Sang's picture

Hejian Sang

pb09204048

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Beyond GRPO and On-Policy Distillation: An Empirical Sparse-to-Dense Reward Principle for Language-Model Post-Training

authored a paper about 1 month ago

TIP: Token Importance in On-Policy Distillation

upvoted a paper about 1 month ago

TIP: Token Importance in On-Policy Distillation

View all activity

Organizations

commented 2 papers 3 months ago

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 47 •

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Paper • 2601.18734 • Published Jan 26 • 7 •