Whisper Node 项目下载及安装教程-优快云博客

Whisper Node 项目下载及安装教程

1. 项目介绍

Whisper Node 是一个基于 Node.js 的 OpenAI Whisper 转录工具的绑定库。它允许用户在本地使用 CPU 进行语音转录，支持多种输出格式（如 JSON、TXT、SRT、VTT），并且针对 CPU 进行了优化，包括 Apple Silicon ARM 架构。

2. 项目下载位置

Whisper Node 项目托管在 GitHub 上，可以通过以下命令进行克隆：

git clone https://github.com/ariym/whisper-node.git

3. 项目安装环境配置

3.1 系统要求

Node.js 14.x 或更高版本
npm 6.x 或更高版本
适用于 Windows 的用户需要安装 make 命令（可以从此处下载）

3.2 环境配置示例

以下是配置环境的步骤：

安装 Node.js 和 npm

确保你的系统已经安装了 Node.js 和 npm。可以通过以下命令检查版本：
```
node -v
npm -v
```
安装 make 命令（仅限 Windows）

如果你使用的是 Windows 系统，需要安装 make 命令。下载并安装后，确保 make 命令可以在命令行中运行。

4. 项目安装方式

4.1 安装依赖

进入项目目录并安装依赖：

cd whisper-node
npm install

4.2 下载 Whisper 模型

你可以选择下载 Whisper 模型，或者使用自定义路径的模型。以下是下载模型的命令：

npx whisper-node download

5. 项目处理脚本

5.1 使用示例

以下是一个简单的使用示例，将音频文件转录为 JSON 格式：

import whisper from 'whisper-node';

const transcript = await whisper("example/sample.wav");
console.log(transcript);
// 输出: [ [start, end, speech] ]

5.2 完整选项列表

你可以使用更多的选项来定制转录过程：

import whisper from 'whisper-node';

const filePath = "example/sample.wav"; // 必需
const options = {
  modelName: "base.en", // 默认
  // modelPath: "/custom/path/to/model.bin", // 使用自定义目录中的模型（不能与 'modelName' 一起使用）
  whisperOptions: {
    language: 'auto', // 默认（使用 'auto' 进行自动检测）
    gen_file_txt: false, // 输出 .txt 文件
    gen_file_subtitle: false, // 输出 .srt 文件
    gen_file_vtt: false, // 输出 .vtt 文件
    word_timestamps: true, // 为每个单词添加时间戳
    // timestamp_size: 0, // 不能与 word_timestamps:true 一起使用
  }
};

const transcript = await whisper(filePath, options);

通过以上步骤，你就可以成功下载并安装 Whisper Node 项目，并开始使用它进行语音转录。

创作声明：本文部分内容由AI辅助生成（AIGC），仅供参考