  • Blog (1675)
  • Favorites
  • Following

[Original] aaaaaaaaa

aaaaa。

2023-07-02 12:55:40 248

[Original] DINet: A Deformation Inpainting Network for Realistic Visual Face Dubbing on High-Resolution Video

DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video

2025-03-05 23:54:19 33

[Original] VideoReTalking: Audio-Based Lip-Sync Video Editing

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

2025-03-05 23:47:18 44

[Original] Dimitra: An Audio-Driven Diffusion Model for Expressive Talking-Head Generation

Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation

2025-03-05 23:41:06 31

[Original] FLAP: Fully Controllable Audio-Driven Portrait Video Generation via a 3D Head-Conditioned Diffusion Model

FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model

2025-03-05 23:36:31 33

[Original] ARTalk: Speech-Driven 3D Head Animation via an Autoregressive Model

ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model

2025-03-05 23:31:59 126

[Original] Avat3r: A Large Animatable Gaussian Reconstruction Model for High-Fidelity 3D Head Avatars

Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars

2025-03-01 20:53:14 143

[Original] InsTaG: Learning Personalized 3D Talking Heads from a Few Seconds of Video

InsTaG: Learning Personalized 3D Talking Head from Few-Second Video

2025-03-01 13:29:34 183

[Original] MEAD: A Large-Scale Audio-Visual Dataset for Emotional Talking-Face Generation

MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation

2025-02-23 23:14:16 78

[Original] Diffused Heads: Diffusion Models Beat GANs at Talking-Face Generation

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

2025-02-23 23:11:08 48

[Original] IP_LAP: Identity-Preserving Talking-Face Generation with Landmark and Appearance Priors

Identity-Preserving Talking Face Generation with Landmark and Appearance Priors

2025-02-23 23:09:00 33

[Original] Audio2Head: Audio-Driven Digital Human Generation with Natural Head Motion

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

2025-02-23 23:04:12 28

[Original] MuseTalk: High-Quality Real-Time Lip Sync in Latent Space

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

2025-02-23 23:01:41 438

[Original] SkyReels-A1: Highly Expressive Portrait Animation Based on DiT

SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers

2025-02-20 16:01:11 54

[Original] SayAnything: Audio-Driven Lip Sync with Conditional Video Diffusion

SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion

2025-02-19 21:42:53 334

[Original] LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

2025-02-17 21:33:45 145

[Original] X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention

X-Portrait: Expressive Portrait Animation with Hierarchical Motion Attention

2025-02-17 21:31:05 152

[Original] TPSM: Applying the Thin-Plate Spline Motion Model to Image Animation

Thin-Plate Spline Motion Model for Image Animation

2025-02-17 21:28:25 30

[Original] MakeItTalk: Speaker-Aware Talking-Head Animation

MakeItTalk: Speaker-Aware Talking-Head Animation

2025-02-17 21:25:43 37

[Original] DaGAN: Talking-Head Video Generation with a Depth-Aware Generative Adversarial Network

Depth-Aware Generative Adversarial Network for Talking Head Video Generation

2025-02-17 21:22:17 16

[Original] DiffTalk: Tsinghua Introduces the First Diffusion-Based Audio-Driven Digital Human

DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation

2025-02-14 23:37:25 34

[Original] EMO: Generating Expressive Portrait Videos with an Audio-to-Video Diffusion Model under Weak Conditions

EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

2025-02-14 00:11:43 385

[Original] Playmate: Flexible Control of Portrait Animation via 3D-Implicit-Space-Guided Diffusion

Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion

2025-02-14 00:04:12 313

[Original] VividTalk: One-Shot Audio-Driven Talking-Head Generation Based on a 3D Hybrid Prior

VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior

2025-02-11 00:28:11 172

[Original] LSP: Real-Time Photorealistic Talking-Head Animation

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation

2025-02-11 00:25:42 21

[Original] EVP: Audio-Driven Emotional Video Portraits

Audio-Driven Emotional Video Portraits

2025-02-10 22:35:34 27

[Original] EAMM: One-Shot Emotional Talking Faces via an Audio-Based Emotion-Aware Motion Model

EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model

2025-02-10 22:33:18 127

[Original] OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

2025-02-07 20:46:25 85

[Original] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single-Image Talking-Face Animation

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

2025-01-31 17:58:51 135

[Original] wav2lip: Audio-Driven Lip-Sync Generation!

A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild

2025-01-31 17:50:51 132

[Original] SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking-Head Animation

SyncAnimation: A Real-Time End-to-End Framework for Audio-Driven Human Pose and Talking Head Animation

2025-01-31 17:21:48 65

[Original] Joint Learning of Depth and Appearance for Portrait Image Animation

Joint Learning of Depth and Appearance for Portrait Image Animation

2025-01-24 00:41:16 40

[Original] EMO2: Emotion-Expression-Driven, Speech-Controlled Avatar Video Generation

EMO2: End-Effector Guided Audio-Driven Avatar Video Generation

2025-01-24 00:30:39 302

[Original] UniAvatar: Audio-Driven Digital Human Generation with Controllable Motion and Lighting

UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control

2025-01-11 15:11:40 59

[Original] MoEE: Audio-Driven Portrait Animation Based on an Emotion Mixture Model

MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation

2025-01-09 23:31:48 69

[Original] LES-Talker: Fine-Grained Emotion Editing for Digital Humans in a Linear Emotion Space

LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space

2025-01-08 20:08:04 184

[Original] Talking-Head Generation with Multi-Scale Codebooks That Synergize Motion and Appearance

Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation

2025-01-08 20:01:29 40

[Original] GLCF: The Strongest Detector for Generated Digital-Human Videos

GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection

2025-01-03 20:32:26 49

[Original] VQTalker: Multilingual Talking Avatar Generation via Facial Motion Tokenization

VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization

2025-01-03 20:29:53 71

[Original] GaussianSpeech: Audio-Driven 3DGS Avatars

GaussianSpeech: Audio-Driven Gaussian Avatars

2025-01-03 20:25:05 204
