Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems 📝 74 Who needs 1T parameters? Olympiad proofs with a 4B model
Running on CPU Upgrade Featured 3.19k The Smol Training Playbook 📚 3.19k The secrets to building world-class LLMs
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 43
Running on CPU Upgrade Agents Featured 1.35k Open ASR Leaderboard 🏆 1.35k Compare speech‑to‑text models across multiple benchmarks
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech • Apr 16, 2025 • 79
view article Article Efficient Request Queueing – Optimizing LLM Performance tngtech • Apr 2, 2025 • 26