
Zhu

zzzhu

zhuzil

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published 9 days ago • 22

upvoted a paper 7 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

Paper • 2605.18984 • Published 9 days ago • 22

upvoted a paper 8 days ago

MMSkills: Towards Multimodal Skills for General Visual Agents

Paper • 2605.13527 • Published 13 days ago • 117

authored a paper 12 days ago

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Paper • 2605.13062 • Published 14 days ago • 33

upvoted 2 papers 13 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 14 days ago • 59

Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling

Paper • 2605.13062 • Published 14 days ago • 33

authored a paper 14 days ago

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization

Paper • 2605.10780 • Published 15 days ago • 33

upvoted a paper 14 days ago

Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization

Paper • 2605.10780 • Published 15 days ago • 33

upvoted a paper about 1 month ago

AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation

Paper • 2604.18240 • Published Apr 20 • 16

authored a paper about 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

upvoted 2 papers about 2 months ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published Mar 29 • 147

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Paper • 2603.25804 • Published Mar 26 • 30

authored 2 papers 2 months ago

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

updated a dataset 2 months ago

zzzhu/VTC-Bench

Viewer • Updated Mar 21 • 680 • 1.39k • 1

New activity in zzzhu/VTC-Bench 2 months ago

Improve dataset card: add metadata, GitHub link, and benchmark description

#2 opened 2 months ago by nielsr

upvoted a paper 2 months ago

VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining

Paper • 2603.15030 • Published Mar 16 • 21

liked a dataset 2 months ago

zzzhu/VTC-Bench

Viewer • Updated Mar 21 • 680 • 1.39k • 1

published a dataset 2 months ago

zzzhu/VTC-Bench

Viewer • Updated Mar 21 • 680 • 1.39k • 1

liked a model 2 months ago

meituan-longcat/LongCat-Image

Text-to-Image • Updated Dec 16, 2025 • 22.3k • 242
