🏗️ Building on HF

Daniel Bourke PRO

mrdbourke

24 340 1101

https://www.mrdbourke.com

AI & ML interests

Computer vision. Small on-device models. VLMs. High-quality tutorials.

Recent Activity

liked a model about 18 hours ago

egeorcun/lucida

liked a model about 18 hours ago

baseten/GLM-5.2-Vision-NVFP4

liked a model 4 days ago

poolside/Laguna-S-2.1

View all activity

Organizations

upvoted 2 articles 11 days ago

Article

Welcome Inkling by Thinking Machines

burtenshaw, merve, pcuenq, ariG23498

•

13 days ago

• 130

Article

What building Shippy taught us about building agents

allenai

•

12 days ago

• 14

upvoted an article 19 days ago

Article

Native-speed vLLM transformers modeling backend

hmellor, lysandre

•

20 days ago

• 60

upvoted a paper 19 days ago

Gemma 4 Technical Report

Paper • 2607.02770 • Published 26 days ago • 73

upvoted 3 articles about 1 month ago

Article

GLM-5.2: Built for Long-Horizon Tasks

zai-org

•

Jun 17

• 136

Article

I fine-tuned a model for free from one prompt, with TRL and the Google Colab CLI

sergiopaniego

•

Jun 15

• 4

Article

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

Samoed

•

Jun 12

• 22

upvoted an article about 2 months ago

Article

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

nvidia

•

Jun 4

• 72

upvoted 2 collections about 2 months ago

Ideogram 4

Collection

8 items • Updated Jun 4 • 67

Verbatim RAG v1

Collection

Hallucination free RAG and out SOTA state-of-the-art extractors • 8 items • Updated Jun 2 • 9

upvoted 2 articles about 2 months ago

Article

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

nvidia

•

Jun 1

• 87

Article

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

JetBrains

•

Jun 1

• 34

upvoted an article 2 months ago

Article

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

ibm-research

•

May 27

• 18

upvoted a collection 2 months ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 18 items • Updated May 28 • 35

upvoted 6 articles 2 months ago

Article

Running AI agents to automate outreach at scale

nielsr

•

Apr 27

• 15

Article

Relaunching PapersWithCode with new features

nielsr

•

May 24

• 12

Article

Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia

matthew-d-white

•

May 22

• 5

Article

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

sergiopaniego, ariG23498

•

May 25

• 134

Article

Introducing the Ettin Reranker Family

tomaarsen

•

May 19

• 55

Article

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

ibm-granite

•

May 14

• 33

Daniel Bourke PRO

AI & ML interests

Recent Activity

Organizations

mrdbourke's activity

Welcome Inkling by Thinking Machines

What building Shippy taught us about building agents

Native-speed vLLM transformers modeling backend

GLM-5.2: Built for Long-Horizon Tasks

I fine-tuned a model for free from one prompt, with TRL and the Google Colab CLI

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

ITBench-AA: Frontier Models Score Below 50% on the First Benchmark for Agentic Enterprise IT Tasks — by Artificial Analysis and IBM

Running AI agents to automate outreach at scale

Relaunching PapersWithCode with new features

Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia

Harness, Scaffold, and the AI Agent Terms Worth Getting Right

Introducing the Ettin Reranker Family

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality