- Blog posts (1675)
[Original] DINet: A Deformation Inpainting Network for Realistic Face Visual Dubbing on High-Resolution Video
DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video
2025-03-05 23:54:19
33
[Original] VideoReTalking: Audio-Based Lip-Sync Video Editing
VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
2025-03-05 23:47:18
44
[Original] Dimitra: An Audio-Driven Diffusion Model for Expressive Talking Head Generation
Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation
2025-03-05 23:41:06
31
[Original] FLAP: Fully Controllable Audio-Driven Portrait Video Generation via a 3D Head-Conditioned Diffusion Model
FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model
2025-03-05 23:36:31
33
[Original] ARTalk: Speech-Driven 3D Head Animation via an Autoregressive Model
ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
2025-03-05 23:31:59
126
[Original] Avat3r: A Large Animatable Gaussian Reconstruction Model for High-Fidelity 3D Head Avatars
Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars
2025-03-01 20:53:14
143
[Original] InsTaG: Learning a Personalized 3D Talking Head from a Few Seconds of Video
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
2025-03-01 13:29:34
183
[Original] MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation
2025-02-23 23:14:16
78
[Original] Diffused Heads: Diffusion Models Beat GANs at Talking-Face Generation
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
2025-02-23 23:11:08
48
[Original] IP_LAP: Identity-Preserving Talking Face Generation with Landmark and Appearance Priors
Identity-Preserving Talking Face Generation with Landmark and Appearance Priors
2025-02-23 23:09:00
33
[Original] Audio2Head: Audio-Driven Digital Human Generation with Natural Head Motion
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
2025-02-23 23:04:12
28
[Original] MuseTalk: High-Quality Real-Time Lip Synchronization in Latent Space
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
2025-02-23 23:01:41
438
[Original] SkyReels-A1: DiT-Based Highly Expressive Portrait Animation
SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
2025-02-20 16:01:11
54
[Original] SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
2025-02-19 21:42:53
334
[Original] LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
2025-02-17 21:33:45
145
[Original] X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention
X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention
2025-02-17 21:31:05
152
[Original] DaGAN: Talking Head Video Generation with a Depth-Aware Generative Adversarial Network
Depth-Aware Generative Adversarial Network for Talking Head Video Generation
2025-02-17 21:22:17
16
[Original] DiffTalk: Tsinghua Releases the First Diffusion-Based Audio-Driven Digital Human
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation
2025-02-14 23:37:25
34
[Original] EMO: Generating Expressive Portrait Videos with an Audio-to-Video Diffusion Model under Weak Conditions
EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
2025-02-14 00:11:43
385
[Original] Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion
Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion
2025-02-14 00:04:12
313
[Original] VividTalk: One-Shot Audio-Driven Talking Head Generation Based on a 3D Hybrid Prior
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
2025-02-11 00:28:11
172
[Original] LSP: Real-Time Photorealistic Talking-Head Animation
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
2025-02-11 00:25:42
21
[Original] EAMM: One-Shot Emotional Talking Face via an Audio-Based Emotion-Aware Motion Model
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model
2025-02-10 22:33:18
127
[Original] OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models
2025-02-07 20:46:25
85
[Original] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single-Image Talking Face Animation
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
2025-01-31 17:58:51
135
[Original] wav2lip: Audio-Driven Lip-Sync Generation!
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild
2025-01-31 17:50:51
132
[Original] SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation
SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation
2025-01-31 17:21:48
65
[Original] Joint Learning of Depth and Appearance for Portrait Image Animation
Joint Learning of Depth and Appearance for Portrait Image Animation
2025-01-24 00:41:16
40
[Original] EMO2: Emotion-Expression-Driven, Speech-Controlled Avatar Video Generation
EMO2: End-Effector Guided Audio-Driven Avatar Video Generation
2025-01-24 00:30:39
302
[Original] UniAvatar: Audio-Driven Digital Human Generation with Controllable Motion and Lighting
UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control
2025-01-11 15:11:40
59
[Original] MoEE: Audio-Driven Portrait Animation Based on an Emotion Mixture Model
MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation
2025-01-09 23:31:48
69
[Original] LES-Talker: Fine-Grained Emotion Editing for Digital Humans Based on a Linear Emotion Space
LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space
2025-01-08 20:08:04
184
[Original] Multi-Scale Codebooks for Talking Head Generation That Synergize Motion and Appearance Information
Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
2025-01-08 20:01:29
40
[Original] GLCF: The Strongest Detector for Generated Digital-Human Video
GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection
2025-01-03 20:32:26
49
[Original] VQTalker: Multilingual Talking Avatar Generation via Facial Motion Tokenization
VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
2025-01-03 20:29:53
71
[Original] GaussianSpeech: Audio-Driven 3DGS Avatars
GaussianSpeech: Audio-Driven Gaussian Avatars
2025-01-03 20:25:05
204