xziayro's picture

xziayro

xziayro

·

xziayro

AI & ML interests

None yet

Recent Activity

liked a Space about 18 hours ago

black-forest-labs/FLUX.2-dev

new activity about 22 hours ago

GAIR/daVinci-MagiHuman:https://github.com/sandai/MagiCompiler seems to be private

liked a model 2 days ago

Alibaba-DAMO-Academy/Lumos-1

View all activity

Organizations

upvoted 2 papers 2 days ago

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

Paper • 2603.20192 • Published 4 days ago • 22

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published 7 days ago • 92

upvoted a paper 4 days ago

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published Mar 20, 2025 • 42

upvoted 4 papers 5 days ago

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Paper • 2603.19228 • Published 5 days ago • 64

3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model

Paper • 2603.18524 • Published 6 days ago • 54

EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing

Paper • 2603.19224 • Published 5 days ago • 17

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 8 days ago • 77

upvoted 4 papers 6 days ago

Attention Residuals

Paper • 2603.15031 • Published 9 days ago • 157

LaDe: Unified Multi-Layered Graphic Media Generation and Decomposition

Paper • 2603.17965 • Published 6 days ago • 5

LoST: Level of Semantics Tokenization for 3D Shapes

Paper • 2603.17995 • Published 6 days ago • 30

ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA

Paper • 2603.10256 • Published 14 days ago • 20

upvoted 3 papers 7 days ago

Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Paper • 2603.15653 • Published 18 days ago • 11

SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation

Paper • 2603.14152 • Published 10 days ago • 6

SegviGen: Repurposing 3D Generative Model for Part Segmentation

Paper • 2603.16869 • Published 7 days ago • 18

upvoted 3 papers 8 days ago

GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering

Paper • 2603.15616 • Published 8 days ago • 5

Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods

Paper • 2603.15026 • Published 9 days ago • 8

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Paper • 2603.15478 • Published 9 days ago • 24

upvoted 3 papers 9 days ago

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning

Paper • 2603.12257 • Published 12 days ago • 31

WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing

Paper • 2603.11593 • Published 13 days ago • 25

One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers

Paper • 2603.12245 • Published 12 days ago • 18