Capture, Learning, and Synthesis of 3D Speaking Styles: Paper Reading Notes (VOCA)
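Abstract: The authors built a 4D face dataset (3D mesh sequences + synchronized audio): 29 minutes, 60 fps, 12 speakers. They then trained a neural network on this dataset (the original sentence is "we then train a neural network on our dataset that factors identity from facial motion"; I am not sure how to interpret it). The learned model, VOCA, takes any speech signal as input (not limited to English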
Original post, 2020-12-31 16:58:22