小湉湉 - CSDN Blog

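Original: AIGC Technology Survey

Since encoding means adding noise, decoding should remove it. The DDPM decoder is therefore no longer an uninterpretable neural network, but a network that predicts a series of denoising results.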

2025-05-29 14:19:13 726 4
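
To make the "encoding adds noise, decoding removes it" idea from the excerpt above concrete, here is a minimal sketch. It assumes the common DDPM setup with a linear beta schedule and a network that predicts the added noise. The schedule values and all function names are illustrative assumptions, not taken from the post, and the true noise stands in for a trained network's prediction.

```python
import math
import random


def make_alpha_bar(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    alpha_bar, running = [], 1.0
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        running *= 1.0 - beta
        alpha_bar.append(running)
    return alpha_bar


def add_noise(x0, noise, a_bar):
    """Encoding: x_t = sqrt(a_bar) * x0 + sqrt(1 - a_bar) * noise."""
    return [math.sqrt(a_bar) * x + math.sqrt(1.0 - a_bar) * n
            for x, n in zip(x0, noise)]


def estimate_x0(x_t, predicted_noise, a_bar):
    """Decoding direction: invert add_noise given a noise prediction."""
    return [(x - math.sqrt(1.0 - a_bar) * n) / math.sqrt(a_bar)
            for x, n in zip(x_t, predicted_noise)]


random.seed(0)
alpha_bar = make_alpha_bar()
x0 = [0.5, -1.0, 2.0]                      # toy "clean" sample
noise = [random.gauss(0.0, 1.0) for _ in x0]
t = 500                                    # an arbitrary intermediate step
x_t = add_noise(x0, noise, alpha_bar[t])
# A trained network would predict the noise from (x_t, t); here we
# plug in the true noise to show that the inversion is exact.
x0_hat = estimate_x0(x_t, noise, alpha_bar[t])
```

With a perfect noise prediction the clean sample is recovered exactly. In practice the prediction is imperfect, which is why sampling proceeds over many small denoising steps.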

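There are two extremes: either sum up a pile of losses, or consider only the TTS loss (for example, a single L1 loss). One option is to take content information (which requires an ASR model) and pitch information (which requires an F0 model) into account -> StarGANv2-VC and similar. The other is to treat the system purely as a TTS model when computing the loss, without checking whether the content of the generated audio matches the source audio -> PPG/SSL-based VC, the FragmentVC series, and so on. FreeVC.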

2025-03-11 17:37:19 581

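Original: SQL Study Notes

Syntax:
SELECT select_list               # 7️⃣
FROM table1 alias                # 1️⃣
join_type JOIN table2            # 2️⃣
ON join_condition                # 3️⃣
WHERE filter_condition           # 4️⃣
GROUP BY group_list              # 5️⃣
HAVING filter_condition          # 6️⃣
ORDER BY sort_list               # 8️⃣
LIMIT start_index, row_count;    # 9️⃣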

2024-11-19 14:55:46 856 2
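
The numbered comments in the SQL excerpt above appear to follow the logical order in which the clauses are evaluated (FROM first, LIMIT last). The sketch below runs one query that uses every clause, via Python's built-in `sqlite3` module. The `dept` and `emp` tables and their contents are made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dept (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE emp (id INTEGER PRIMARY KEY, dept_id INTEGER, salary INTEGER);
INSERT INTO dept VALUES (1, 'tts'), (2, 'asr'), (3, 'kws');
INSERT INTO emp VALUES (1, 1, 100), (2, 1, 300), (3, 2, 50), (4, 2, 70), (5, 3, 400);
""")

query = """
SELECT d.name, AVG(e.salary) AS avg_salary   -- 7: build the output columns
FROM emp e                                   -- 1: start from the left table
INNER JOIN dept d                            -- 2: bring in the right table
ON e.dept_id = d.id                          -- 3: keep matching row pairs
WHERE e.salary > 60                          -- 4: filter individual rows
GROUP BY d.name                              -- 5: form groups
HAVING AVG(e.salary) > 100                   -- 6: filter whole groups
ORDER BY avg_salary DESC                     -- 8: sort the result
LIMIT 0, 2;                                  -- 9: skip 0 rows, return 2
"""
rows = conn.execute(query).fetchall()
print(rows)
```

Because WHERE runs before GROUP BY, the row with salary 50 is dropped before any average is computed. HAVING then removes the `asr` group, whose remaining average is 70.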

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Multilingual Synthesis and Few-Shot Synthesis in Practice

2022-10-31 19:50:00 1965

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Streaming Speech Synthesis: Inside the Technology and Hands-On Practice

2022-10-31 17:25:29 5312

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Synthesis

2022-10-28 21:43:39 3973 2

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Translation

2022-10-28 20:24:58 2015

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Recognition - Customized Recognition

2022-10-28 19:56:17 1963

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Recognition - Streaming Service

2022-10-28 16:44:05 5422 2

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Recognition - Streaming Service - Model Part

2022-10-28 16:22:58 2953

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Recognition - Transformer

2022-10-28 16:06:13 2752

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Speech Recognition - Deepspeech2

2022-10-28 15:35:14 3772

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Voiceprint Retrieval System and Practice

2022-10-27 15:49:13 2094

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Voice Wake-Up (Keyword Spotting)

2022-10-27 15:16:30 3071

Original: [PaddlePaddle PaddleSpeech Speech Technology Course] - Sound Classification

2022-10-27 14:05:21 3329 1

Original: Handling SSML Input with PaddleSpeech

Trying out SSML input handling in PaddleSpeech

2022-10-26 15:36:22 814

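Original: Mixed Chinese-English TTS with Multiple Cartoon Voices and Dialects in PaddleSpeech

🎉 Mixed Chinese-English TTS with multiple cartoon voices and dialects in PaddleSpeech #2492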

2022-10-26 14:28:45 2682

Original: Training Your Own TTS Model with PaddleSpeech

2022-10-26 13:40:28 4204 3

Original: [SpeechX - A Unified High-Performance Speech Deployment Tool] SpeechX Architecture

2022-10-26 11:06:57 1193

Original: PaddleSpeech Full-Pipeline Voiceprint Recognition System PP-VPR

Full-pipeline voiceprint recognition system

2022-10-26 10:07:30 925

Original: PaddleSpeech Streaming Speech Recognition System PP-ASR

Streaming speech recognition system

2022-10-26 10:07:16 2893

Original: PaddleSpeech Streaming Speech Synthesis System PP-TTS

Streaming speech synthesis system

2022-10-25 17:09:59 4114

Original: [SpeechX - A Unified High-Performance Speech Deployment Tool] SpeechX Custom ASR

An introduction to PaddleSpeech SpeechX

2022-10-25 16:52:37 501

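Original: Rule-Based Chinese Text Front End Design for Speech Synthesis [2]

Rule-based Chinese text front end design for speech synthesis. Following the usual pipeline, the text front end is divided into 4 stages.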

2022-10-25 16:26:52 877

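Original: Rule-Based Chinese Text Front End Design for Speech Synthesis

An introduction to classic speech synthesis model architectures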

2022-10-25 15:52:07 647

Original: An Introduction to Classic Speech Synthesis Model Architectures

2022-10-25 15:36:49 1406

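Original: PaddleSpeech TTS Design Elements - Experiment Output Directory

Each time you run an experiment, you need to specify an output directory. The directory structure is as follows: it is best to follow this convention.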

2022-10-25 12:45:07 276

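Original: PaddleSpeech TTS Design Elements - Training Components

This post mainly covers the training-related components of PaddleSpeech TTS and why we designed them this way. If you are familiar with chainer, you can see that we were influenced by chainer's design style. chainer is not the only source, though: we also drew on libraries dedicated to solving training problems, such as torch lightning, and on domain-specific libraries such as detectron2, whose designs were made to ease their own model development. The overall design principles are simplicity and intuitiveness, strong extensibility, and a gentle learning curve (this last point needs qualification: some parts of the design are a little hard to pick up at first, but once you understand them they are very pleasant to use).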

2022-10-25 12:18:47 818

Original: PaddleSpeech TTS Design Elements - Data Components

A description of the data formats and data-processing conventions used in PaddleSpeech TTS.

2022-10-25 12:04:36 890

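Original: PaddleSpeech TTS Design Elements - Configuration Components

This post mainly covers configuration files and their parsing, command-line argument parsing, the choice of how configuration is represented inside the program, and the considerations behind these choices.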

2022-10-25 11:59:00 829

Original: CTC Loss for Speech Recognition

2022-10-25 10:40:31 908

Original: An Overview of GAN Vocoders for Speech Synthesis

2022-10-24 20:59:36 1002

Original: g2p Dictionary Design for Speech Synthesis

Design principles of the PaddleSpeech g2p pronunciation dictionary

2022-10-24 20:17:28 1726

Original: 211. Add and Search Word - Data structure design (python+cpp) (an upgraded prefix tree)

Problem: Design a data structure that supports the following two operations: void addWord(word), bool search(word). search(word) can search a literal word or a regular expression string containing only le...

2018-11-22 23:28:50 349
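
The excerpt for problem 211 is cut off, so the sketch below relies on an assumption: a pattern consists of lowercase letters plus `.`, where `.` matches any single letter. Under that assumption, a prefix tree with a depth-first search at each wildcard is one way to support both operations. The class and method names are illustrative.

```python
class WordDictionary:
    """Prefix tree of nested dicts; the key "$" marks the end of a word."""

    def __init__(self):
        self.root = {}

    def add_word(self, word):
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node["$"] = True

    def search(self, pattern):
        def dfs(node, i):
            if i == len(pattern):
                return "$" in node
            ch = pattern[i]
            if ch == ".":
                # Wildcard: try every child except the end-of-word marker.
                return any(dfs(child, i + 1)
                           for key, child in node.items() if key != "$")
            return ch in node and dfs(node[ch], i + 1)

        return dfs(self.root, 0)


words = WordDictionary()
for w in ("bad", "dad", "mad"):
    words.add_word(w)
```

A literal lookup costs time proportional to the pattern length. Each `.` may fan out to every child of the current node, so patterns made mostly of wildcards are the slow case.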

Original: 692. Top K Frequent Words (python+cpp) (trie-based counting)

Problem: Given a non-empty list of words, return the k most frequent elements. Your answer should be sorted by frequency from highest to lowest. If two words have the same frequency, then the word with th...

2018-11-22 19:58:50 453
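
The tie-breaking rule in the excerpt for problem 692 is truncated. The sketch below assumes ties are broken alphabetically (the smaller word first). It counts with `collections.Counter` and a sort rather than the trie approach the title mentions, so treat it as a baseline and not as the post's solution.

```python
from collections import Counter


def top_k_frequent(words, k):
    """Return the k most frequent words, ties broken alphabetically (assumed)."""
    counts = Counter(words)
    # Sort by descending count, then by the word itself.
    return sorted(counts, key=lambda w: (-counts[w], w))[:k]


print(top_k_frequent(["i", "love", "leetcode", "i", "love", "coding"], 2))
```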

Original: 421. Maximum XOR of Two Numbers in an Array (python+cpp) (includes a prefix-tree solution)

Problem: Given a non-empty array of numbers, a0, a1, a2, … , an-1, where 0 ≤ ai < 2^31. Find the maximum result of ai XOR aj, where 0 ≤ i, j < n. Could you do this in O(n) runtime? Example: Input: [...

2018-11-22 17:13:07 295
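
For problem 421, one prefix-tree approach stores every number as a fixed-width bit string and then, for each number, greedily walks toward the opposite bit at every level. The sketch below assumes 31-bit non-negative inputs, matching the bound in the excerpt. The function name is illustrative and this is not necessarily the post's own code.

```python
def find_maximum_xor(nums, bits=31):
    """Maximum XOR of any pair in nums, using a binary prefix tree."""
    root = {}
    # Insert every number, most significant bit first.
    for num in nums:
        node = root
        for shift in range(bits - 1, -1, -1):
            node = node.setdefault((num >> shift) & 1, {})

    best = 0
    for num in nums:
        node, current = root, 0
        for shift in range(bits - 1, -1, -1):
            bit = (num >> shift) & 1
            if (1 - bit) in node:
                # Taking the opposite bit sets this bit in the XOR result.
                current |= 1 << shift
                node = node[1 - bit]
            else:
                node = node[bit]
        best = max(best, current)
    return best


print(find_maximum_xor([3, 10, 5, 25, 2, 8]))
```

Each number is inserted and queried in a fixed number of steps (the bit width), so the total work is linear in the array length.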

Original: 648. Replace Words (python+cpp) (need to study prefix trees)

Problem: In English, we have a concept called root, which can be followed by some other words to form another longer word - let’s call this word successor. For example, the root an, followed by other, whi...

2018-11-22 11:19:49 316
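
The excerpt for problem 648 stops before the actual task. The sketch below assumes the task is: given a list of roots and a sentence, replace every word with the shortest root that is a prefix of it, leaving other words unchanged. A prefix tree makes the lookup stop at the first root found. All names are illustrative.

```python
def replace_words(roots, sentence):
    """Replace each word by its shortest root prefix, if one exists (assumed task)."""
    trie = {}
    for root in roots:
        node = trie
        for ch in root:
            node = node.setdefault(ch, {})
        node["$"] = root  # store the complete root at its last node

    def shortest_root(word):
        node = trie
        for ch in word:
            # Stop as soon as a root ends here or the path runs out.
            if "$" in node or ch not in node:
                break
            node = node[ch]
        return node.get("$", word)

    return " ".join(shortest_root(w) for w in sentence.split())


print(replace_words(["cat", "bat", "rat"], "the cattle was rattled by the battery"))
```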

Original: 208. Implement Trie (Prefix Tree) (python+cpp)

Problem: Implement a trie with insert, search, and startsWith methods. Example: Trie trie = new Trie(); trie.insert("apple"); trie.search("apple"); // returns true trie.search("app...

2018-11-22 09:40:48 378
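
A minimal sketch of the three methods named in the excerpt for problem 208, using nested dictionaries as trie nodes. The `"$"` end-of-word marker and the snake_case method names are choices made here, not taken from the post.

```python
class Trie:
    """Prefix tree built from nested dicts; "$" marks a complete word."""

    def __init__(self):
        self.root = {}

    def insert(self, word):
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node["$"] = True

    def _walk(self, prefix):
        """Return the node reached by prefix, or None if the path breaks."""
        node = self.root
        for ch in prefix:
            if ch not in node:
                return None
            node = node[ch]
        return node

    def search(self, word):
        node = self._walk(word)
        return node is not None and "$" in node

    def starts_with(self, prefix):
        return self._walk(prefix) is not None


trie = Trie()
trie.insert("apple")
```

`search` and `starts_with` share the same walk. The only difference is whether the final node must carry the end-of-word marker.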

Original: 769. Max Chunks To Make Sorted (python+cpp)

Problem: Given an array arr that is a permutation of [0, 1, ..., arr.length - 1], we split the array into some number of “chunks” (partitions), and individually sort each chunk. After concatenating them...

2018-11-21 21:34:58 153
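
The question in the excerpt for problem 769 is truncated. The sketch below assumes it asks for the largest number of chunks such that sorting each chunk and concatenating them yields the fully sorted array. Since the input is a permutation of 0..n-1, a chunk can end at index i exactly when the largest value seen so far equals i.

```python
def max_chunks_to_sorted(arr):
    """Count prefixes whose maximum equals their last index (assumed task)."""
    chunks = 0
    running_max = -1
    for i, value in enumerate(arr):
        running_max = max(running_max, value)
        if running_max == i:
            # Values 0..i are all inside arr[0..i], so a cut here is safe.
            chunks += 1
    return chunks


print(max_chunks_to_sorted([1, 0, 2, 3, 4]))
```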

Original: 781. Rabbits in Forest (python+cpp)

Problem: In a forest, each rabbit has some color. Some subset of rabbits (possibly all of them) tell you how many other rabbits have the same color as them. Those answers are placed in an array. Return th...

2018-11-21 21:00:28 265
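
The excerpt for problem 781 ends mid-sentence. The sketch below assumes the goal is the minimum number of rabbits that could be in the forest. Rabbits that answer x belong to color groups of size x + 1, so each distinct answer needs enough full groups to hold everyone who gave it.

```python
from collections import Counter


def num_rabbits(answers):
    """Minimum rabbit count consistent with the answers (assumed task)."""
    total = 0
    for answer, count in Counter(answers).items():
        group_size = answer + 1
        groups = -(-count // group_size)  # ceiling division
        total += groups * group_size
    return total


print(num_rabbits([1, 1, 2]))
```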
