向量数据库索引选型：IVF_FLAT、HNSW、ANNOY 的性能对比与实操（Java）

原创于 2025-12-06 09:28:34 发布 · 333 阅读

8 ·

CC 4.0 BY-SA版权

文章标签：

#数据库 #java #支持向量机

数据库同时被 3 个专栏收录

70 篇文章

订阅专栏

人工智能AI

55 篇文章

订阅专栏

Java 进阶

27 篇文章

订阅专栏

1. 向量索引概述

向量索引是向量数据库的核心组件，用于加速高维向量的相似性搜索。在选择索引类型时，需要权衡搜索速度、召回率和构建成本三个关键指标。本文将对比三种主流向量索引：IVF_FLAT、HNSW 和 ANNOY，并提供 Java 实操示例。

2. 三种索引原理对比

2.1 IVF_FLAT（倒排文件 + 扁平索引）

原理：

将向量空间划分为多个聚类（通过 K-means 算法）
每个聚类对应一个倒排列表，存储属于该聚类的向量
搜索时，先找到查询向量最近的 N 个聚类，然后在这些聚类中进行暴力搜索

特点：

构建速度快，内存占用低
召回率较高（接近暴力搜索）
搜索速度中等，适合中小规模向量库

2.2 HNSW（分层导航小世界网络）

原理：

构建多层图结构，上层图包含少量节点，下层图包含所有节点
每层图中的节点连接遵循小世界特性（短路径连接）
搜索时从顶层开始，快速定位到候选区域，然后逐层细化

特点：

搜索速度极快，尤其适合大规模向量库
召回率高，接近暴力搜索
构建成本高，内存占用大

2.3 ANNOY（近似最近邻 Oh Yeah）

原理：

基于树结构，通过随机超平面递归划分向量空间
构建多棵树（森林），搜索时综合多棵树的结果
支持动态添加向量

特点：

构建速度快，内存占用中等
搜索速度较快，适合中等规模向量库
召回率略低于 IVF_FLAT 和 HNSW

3. 性能对比

索引类型	搜索速度	召回率	构建速度	内存占用	适用场景
IVF_FLAT	中等	高	快	低	中小规模，要求高召回率
HNSW	极快	高	慢	高	大规模，要求高速搜索
ANNOY	较快	中	快	中等	中等规模，平衡速度与召回率

4. Java 实操示例

4.1 环境准备

使用 Faiss-Java 库进行向量索引操作，Faiss 是 Facebook 开源的高效向量搜索库。

Maven 依赖：

<dependency>
    <groupId>com.facebook</groupId>
    <artifactId>faiss</artifactId>
    <version>1.7.4</version>
</dependency>

4.2 IVF_FLAT 示例

import com.facebook.faiss.Index;
import com.facebook.faiss.IndexIVFFlat;
import com.facebook.faiss.IndexFlatL2;
import com.facebook.faiss.MetricType;

import java.util.Random;

public class IVFFlatExample {
    public static void main(String[] args) {
        // 配置参数
        int d = 128; // 向量维度
        int nb = 100000; // 数据库向量数量
        int nq = 1000; // 查询向量数量
        int nlist = 100; // 聚类数量

        // 生成随机向量数据
        Random random = new Random(42);
        float[] xb = new float[nb * d];
        float[] xq = new float[nq * d];
        for (int i = 0; i < nb * d; i++) {
            xb[i] = random.nextFloat();
        }
        for (int i = 0; i < nq * d; i++) {
            xq[i] = random.nextFloat();
        }

        // 1. 构建索引
        IndexFlatL2 quantizer = new IndexFlatL2(d); // 量化器，使用 L2 距离
        IndexIVFFlat index = new IndexIVFFlat(quantizer, d, nlist, MetricType.METRIC_L2);
        index.train(nb, xb); // 训练聚类中心
        index.add(nb, xb); // 添加向量到索引

        // 2. 执行搜索
        int k = 10; // 返回 Top-k 结果
        long[] I = new long[nq * k]; // 存储结果 ID
        float[] D = new float[nq * k]; // 存储结果距离
        index.search(nq, xq, k, D, I);

        // 3. 输出结果
        System.out.println("IVF_FLAT 搜索结果示例：");
        for (int i = 0; i < 5; i++) {
            System.out.print("查询 " + i + " 的 Top-5 结果：");
            for (int j = 0; j < 5; j++) {
                System.out.print(I[i * k + j] + "(" + D[i * k + j] + ") ");
            }
            System.out.println();
        }

        // 4. 释放资源
        index.close();
        quantizer.close();
    }
}

4.3 HNSW 示例

import com.facebook.faiss.Index;
import com.facebook.faiss.IndexHNSWFlat;
import com.facebook.faiss.MetricType;

import java.util.Random;

public class HNSWExample {
    public static void main(String[] args) {
        // 配置参数
        int d = 128;
        int nb = 100000;
        int nq = 1000;
        int M = 16; // HNSW 图中每个节点的最大连接数
        int efConstruction = 200; // 构建时的搜索范围

        // 生成随机向量数据
        Random random = new Random(42);
        float[] xb = new float[nb * d];
        float[] xq = new float[nq * d];
        for (int i = 0; i < nb * d; i++) {
            xb[i] = random.nextFloat();
        }
        for (int i = 0; i < nq * d; i++) {
            xq[i] = random.nextFloat();
        }

        // 1. 构建索引
        IndexHNSWFlat index = new IndexHNSWFlat(d, M, MetricType.METRIC_L2);
        index.setHnswEfConstruction(efConstruction); // 设置构建参数
        index.add(nb, xb); // HNSW 不需要显式训练

        // 2. 执行搜索
        int k = 10;
        int efSearch = 16; // 搜索时的搜索范围
        index.setHnswEfSearch(efSearch);
        long[] I = new long[nq * k];
        float[] D = new float[nq * k];
        index.search(nq, xq, k, D, I);

        // 3. 输出结果
        System.out.println("HNSW 搜索结果示例：");
        for (int i = 0; i < 5; i++) {
            System.out.print("查询 " + i + " 的 Top-5 结果：");
            for (int j = 0; j < 5; j++) {
                System.out.print(I[i * k + j] + "(" + D[i * k + j] + ") ");
            }
            System.out.println();
        }

        // 4. 释放资源
        index.close();
    }
}

4.4 ANNOY 示例

注意：Faiss 不直接支持 ANNOY，我们使用 annoy-java 库。

Maven 依赖：

<dependency>
    <groupId>com.spotify</groupId>
    <artifactId>annoy</artifactId>
    <version>0.3.2</version>
</dependency>

import com.spotify.annoy.AnnoyIndex;

import java.util.Random;

public class AnnoyExample {
    public static void main(String[] args) {
        // 配置参数
        int d = 128;
        int nb = 100000;
        int nq = 1000;
        int nTrees = 10; // 构建的树数量

        // 生成随机向量数据
        Random random = new Random(42);
        float[][] xb = new float[nb][d];
        float[][] xq = new float[nq][d];
        for (int i = 0; i < nb; i++) {
            for (int j = 0; j < d; j++) {
                xb[i][j] = random.nextFloat();
            }
        }
        for (int i = 0; i < nq; i++) {
            for (int j = 0; j < d; j++) {
                xq[i][j] = random.nextFloat();
            }
        }

        // 1. 构建索引
        AnnoyIndex index = new AnnoyIndex(d, "angular"); // 使用 angular 距离
        for (int i = 0; i < nb; i++) {
            index.addItem(i, xb[i]);
        }
        index.build(nTrees); // 构建索引

        // 2. 执行搜索
        int k = 10;
        System.out.println("ANNOY 搜索结果示例：");
        for (int i = 0; i < 5; i++) {
            long[] resultIds = index.getNNsByVector(xq[i], k, -1); // -1 表示搜索所有树
            System.out.print("查询 " + i + " 的 Top-5 结果：");
            for (int j = 0; j < 5; j++) {
                System.out.print(resultIds[j] + " ");
            }
            System.out.println();
        }

        // 3. 释放资源
        index.unload();
    }
}

5. 选型建议

小规模向量库（<100 万）：
- 优先考虑 IVF_FLAT，兼顾速度和召回率
- 若追求极致构建速度，可选择 ANNOY
大规模向量库（>1000 万）：
- 优先考虑 HNSW，搜索速度优势明显
- 若内存有限，可调整 HNSW 的 M 和 efConstruction 参数
动态更新场景：
- ANNOY 支持动态添加向量，无需重建索引
- HNSW 也支持动态更新，但性能会有一定下降
- IVF_FLAT 动态更新需要重新训练聚类中心
高召回率要求：
- 优先考虑 IVF_FLAT 或 HNSW
- 调整 IVF_FLAT 的 nlist 参数或 HNSW 的 efSearch 参数