LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation • Paper • 2603.20192 • Published 4 days ago • 22
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models • Paper • 2603.17051 • Published 7 days ago • 92
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing • Paper • 2603.19228 • Published 5 days ago • 64
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model • Paper • 2603.18524 • Published 6 days ago • 54
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing • Paper • 2603.19224 • Published 5 days ago • 17
LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition • Paper • 2603.17965 • Published 6 days ago • 5
LoST: Level of Semantics Tokenization for 3D Shapes • Paper • 2603.17995 • Published 6 days ago • 30
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA • Paper • 2603.10256 • Published 14 days ago • 20
Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context • Paper • 2603.15653 • Published 18 days ago • 11
SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation • Paper • 2603.14152 • Published 10 days ago • 6
SegviGen: Repurposing 3D Generative Model for Part Segmentation • Paper • 2603.16869 • Published 7 days ago • 18
GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering • Paper • 2603.15616 • Published 8 days ago • 5
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods • Paper • 2603.15026 • Published 9 days ago • 8
ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer • Paper • 2603.15478 • Published 9 days ago • 24
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning • Paper • 2603.12257 • Published 12 days ago • 31
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing • Paper • 2603.11593 • Published 13 days ago • 25
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers • Paper • 2603.12245 • Published 12 days ago • 18