DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone Paper • 2511.15927 • Published Nov 19, 2025
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 148
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Article • Published Nov 19, 2025 • 34
Challenging Common Assumptions about Catastrophic Forgetting Paper • 2207.04543 • Published Jul 10, 2022
Towards Modular LLMs by Building and Reusing a Library of LoRAs Paper • 2405.11157 • Published May 18, 2024 • 31