10 47

srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

liked a model 2 months ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

liked a Space 3 months ago

lm-provers/qed-nano-blogpost

View all activity

Organizations

liked a model about 1 month ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 22 days ago • 5.02M • • 4.37k

liked a model 2 months ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated Mar 20 • 13k • 121

liked a Space 3 months ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

liked a dataset 3 months ago

google/mobile-actions

Viewer • Updated Dec 18, 2025 • 9.65k • 1.82k • 270

liked a model 5 months ago

ai21labs/AI21-Jamba2-3B

Text Generation • Updated Feb 2 • 816 • 42

liked a Space 6 months ago

The Smol Training Playbook

📚

3.19k

The secrets to building world-class LLMs

upvoted a collection 7 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 43

liked a dataset 7 months ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 19.9k • 1.01k

upvoted an article 7 months ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 209

liked a Space 7 months ago

Open ASR Leaderboard

🏆

1.35k

Compare speech‑to‑text models across multiple benchmarks

liked a dataset 9 months ago

neerajaabhyankar/hindustani-raag-small

Viewer • Updated Mar 20, 2024 • 1.25k • 14 • 3

upvoted 2 articles 10 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

tngtech

•

Apr 16, 2025

• 79

Article

Efficient Request Queueing – Optimizing LLM Performance

tngtech

•

Apr 2, 2025

• 26

updated a Space 10 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

published a Space 10 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

liked a model 11 months ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

Updated Jan 28 • 3.39M • 891

liked 2 datasets 11 months ago

vidore/colpali_train_set

Viewer • Updated Jun 20, 2025 • 119k • 9.85k • 91

llamaindex/vdr-multilingual-train

Viewer • Updated Jan 10, 2025 • 424k • 2.29k • 30

liked 2 models 11 months ago

unsloth/Nanonets-OCR-s-GGUF

Image-Text-to-Text • 3B • Updated Jul 3, 2025 • 4.12k • 64

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 249k • 1.59k

srivatsa

AI & ML interests

Recent Activity

Organizations

srivatsa92's activity

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

The Smol Training Playbook

Let's talk about LLM evaluation

Open ASR Leaderboard

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Efficient Request Queueing – Optimizing LLM Performance

GPU VRAM Estimator

GPU VRAM Estimator