5 103 33

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

upvoted a paper about 4 hours ago

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

upvoted a paper 1 day ago

Qwen3.5-Omni Technical Report

liked a dataset 3 days ago

walledai/AdvBench

View all activity

Organizations

None yet

upvoted a paper about 4 hours ago

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation

Paper • 2604.18168 • Published 1 day ago • 85

upvoted a paper 1 day ago

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 5 days ago • 40

liked a dataset 3 days ago

walledai/AdvBench

Viewer • Updated Jul 4, 2024 • 520 • 10.2k • 97

upvoted 2 papers 6 days ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published 9 days ago • 69

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published 11 days ago • 75

upvoted 2 papers 11 days ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 13 days ago • 51

HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents

Paper • 2604.07430 • Published 14 days ago • 184

upvoted a paper 14 days ago

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 16 days ago • 109

upvoted a paper 22 days ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 26 days ago • 155

upvoted a paper about 1 month ago

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation

Paper • 2603.12267 • Published Mar 12 • 13

liked a model about 2 months ago

Qwen/Qwen3.5-0.8B

Image-Text-to-Text • 0.9B • Updated Mar 2 • 2.8M • 506

upvoted a paper 2 months ago

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published Feb 4 • 268

upvoted an article 3 months ago

Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Jan 5

•

upvoted a paper 3 months ago

Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40

upvoted 2 papers 4 months ago

DreamOmni3: Scribble-based Editing and Generation

Paper • 2512.22525 • Published Dec 27, 2025 • 15

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published Dec 10, 2025 • 74

upvoted a paper 5 months ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 177

liked a dataset 5 months ago

yenopoya/thousand-voices-trauma

Updated Oct 24, 2025 • 20 • 4

upvoted a paper 6 months ago

FARMER: Flow AutoRegressive Transformer over Pixels

Paper • 2510.23588 • Published Oct 27, 2025 • 59

upvoted a paper 8 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27, 2025 • 32

Ha-Yeong Choi

AI & ML interests

Recent Activity

Organizations

Ha0's activity

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR