Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation Paper • 2605.26844 • Published 8 days ago • 22
Not All Disagreement Is Learnable: Token Teachability in On-Policy Distillation Paper • 2605.26844 • Published 8 days ago • 22
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring Paper • 2605.16882 • Published 18 days ago • 2
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring Paper • 2605.16882 • Published 18 days ago • 2
Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs Paper • 2409.10994 • Published Sep 17, 2024 • 1
Unconstrained Model Merging for Enhanced LLM Reasoning Paper • 2410.13699 • Published Oct 17, 2024 • 1
InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning Paper • 2502.11573 • Published Feb 17, 2025 • 9
InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models Paper • 2509.22536 • Published Sep 26, 2025 • 2
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 24 days ago • 52
E-PMQ: Expert-Guided Post-Merge Quantization with Merged-Weight Anchoring Paper • 2605.16882 • Published 18 days ago • 2
InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion Paper • 2505.13893 • Published May 20, 2025
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models Paper • 2505.13878 • Published May 20, 2025
InfiAlign: A Scalable and Sample-Efficient Framework for Aligning LLMs to Enhance Reasoning Capabilities Paper • 2508.05496 • Published Aug 7, 2025 • 9
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios Paper • 2411.02708 • Published Nov 5, 2024 • 1
InfiR2: A Comprehensive FP8 Training Recipe for Reasoning-Enhanced Language Models Paper • 2509.22536 • Published Sep 26, 2025 • 2
Geometry Conflict: Explaining and Controlling Forgetting in LLM Continual Post-Training Paper • 2605.09608 • Published 24 days ago • 52