Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
sophiamyang
's Collections
benchmark
benchmark
updated
Aug 5, 2024
Upvote
-
lmlmcat/cmmlu
Updated
Jul 13, 2023
•
24.1k
•
75
nlp-waseda/JMMLU
Updated
Feb 27, 2024
•
878
•
13
HAERAE-HUB/KMMLU
Viewer
•
Updated
Mar 5, 2024
•
244k
•
15.4k
•
98
openai/openai_humaneval
Viewer
•
Updated
Jan 4, 2024
•
164
•
298k
•
387
google-research-datasets/mbpp
Viewer
•
Updated
Jan 4, 2024
•
1.4k
•
215k
•
230
nuprl/MultiPL-E
Viewer
•
Updated
Jul 15, 2025
•
12.7k
•
72.8k
•
66
openai/gsm8k
Benchmark
•
Updated
Mar 23
•
17.6k
•
956k
•
1.33k
Upvote
-
Share collection
View history
Collection guide
Browse collections