digital-human
updated
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
Paper
•
2412.01106
•
Published
•
24
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper
•
2412.04448
•
Published
•
10
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Paper
•
2412.14963
•
Published
•
6
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human
Animation Models
Paper
•
2502.01061
•
Published
•
222
Pippo: High-Resolution Multi-View Humans from a Single Image
Paper
•
2502.07785
•
Published
•
10
X-Dancer: Expressive Music to Human Dance Video Generation
Paper
•
2502.17414
•
Published
•
14
Motion Anything: Any to Motion Generation
Paper
•
2503.06955
•
Published
•
35
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based
Spatiotemporal Diffusion for Audio-driven Talking Portrait
Paper
•
2503.12963
•
Published
•
7
ChatAnyone: Stylized Real-time Portrait Video Generation with
Hierarchical Motion Diffusion Model
Paper
•
2503.21144
•
Published
•
27
MoCha: Towards Movie-Grade Talking Character Synthesis
Paper
•
2503.23307
•
Published
•
138
AvatarArtist: Open-Domain 4D Avatarization
Paper
•
2503.19906
•
Published
•
8
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation
with Hybrid Guidance
Paper
•
2504.01724
•
Published
•
68
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
•
2504.02542
•
Published
•
51
FantasyTalking: Realistic Talking Portrait Generation via Coherent
Motion Synthesis
Paper
•
2504.04842
•
Published
•
35
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High
Resolution
Paper
•
2505.00497
•
Published
•
17
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation
Paper
•
2505.10238
•
Published
•
10
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video
Diffusion Transformers
Paper
•
2506.00830
•
Published
•
7
FantasyPortrait: Enhancing Multi-Character Portrait Animation with
Expression-Augmented Diffusion Transformers
Paper
•
2507.12956
•
Published
•
24
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for
Audio-Driven Portrait Animation
Paper
•
2508.11255
•
Published
•
11
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive
Simulation
Paper
•
2508.19209
•
Published
•
42
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time
Autoregressive Video Generation
Paper
•
2508.19320
•
Published
•
29
Kling-Avatar: Grounding Multimodal Instructions for Cascaded
Long-Duration Avatar Animation Synthesis
Paper
•
2509.09595
•
Published
•
48
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Paper
•
2512.04677
•
Published
•
167
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
•
2512.11253
•
Published
•
32
KlingAvatar 2.0 Technical Report
Paper
•
2512.13313
•
Published
•
40