[Worth Bookmarking] Vector Databases in One Article: 6 Tools Compared + Hands-On Python Code

Original article, published 2025-11-05 11:23:58 · 518 views

License: CC BY-SA 4.0


1. What Is a Vector Database?

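A vector database is a database system built specifically to store and retrieve high-dimensional vectors. It plays a key role in semantic search, recommendation systems, image recognition, natural language processing (NLP), and generative AI (for example, RAG). Unlike traditional databases, which match on exact values, a vector database supports approximate nearest neighbor (ANN) search and can quickly find the most similar vectors in a very large dataset.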
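
To make "finding the most similar vectors" concrete, here is a minimal sketch of exact (brute-force) nearest-neighbor search by cosine similarity, using only NumPy. The function name `top_k_cosine` and the toy 4-dimensional vectors are assumptions made for illustration. ANN indexes exist to avoid this full scan over every stored vector, accepting a small loss of accuracy in exchange for speed.

```python
import numpy as np

def top_k_cosine(query, vectors, k=3):
    """Exact nearest-neighbor search: score every stored vector against the query."""
    # After normalization, a dot product equals cosine similarity.
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    scores = v @ q
    # Indices of the k highest scores, best match first.
    order = np.argsort(-scores)[:k]
    return order, scores[order]

# Toy 4-dimensional "embeddings" (hypothetical data); real ones have hundreds of dimensions.
vectors = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.9, 0.1, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
])
query = np.array([1.0, 0.0, 0.1, 0.0])

ids, scores = top_k_cosine(query, vectors, k=2)
print(ids, scores)
```

The cost of this scan grows linearly with the number of stored vectors, which is why it stops being practical at large scale.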

2. Overview of Mainstream Vector Databases

The mainstream vector databases today, and what distinguishes each:

Pinecone

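Type: Commercial cloud service

Features: Automatic scaling, low latency, hybrid search support

Strengths: Easy to use; well suited to rapid prototyping

Weaknesses: Closed source, limited customization, relatively expensive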

Weaviate

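Type: Open source + cloud service

Features: GraphQL queries, built-in embedding models, strong filtering

Strengths: Supports multi-tenancy and hybrid search

Weaknesses: Somewhat steep learning curve; self-hosting is complex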

Qdrant

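Type: Open source + cloud service

Features: Implemented in Rust, strong performance, supports vector quantization (compression)

Strengths: Well suited to real-time search and edge deployment

Weaknesses: Relatively small ecosystem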

Milvus

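Type: Open source

Features: Scales to billions of vectors, multiple index types

Strengths: Strong enterprise-grade scalability

Weaknesses: Complex to deploy; distributed cluster deployments generally run on Kubernetes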

FAISS

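Type: Local library

Features: High performance, flexible index configuration

Strengths: Well suited to local experiments and embedded systems

Weaknesses: No server-side features and no metadata filtering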

ChromaDB

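Type: Open source

Features: Lightweight, easy to integrate

Strengths: Well suited to quick prototypes and small projects

Weaknesses: Not suited to large-scale production environments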
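
Quantization, listed above as a Qdrant feature, trades a little precision for a much smaller memory footprint. The following is a minimal sketch of the general idea of scalar quantization in plain NumPy. It is not the implementation used by Qdrant or any other database in this list; the function names, the single global min/max range, and the random test data are all assumptions made for illustration.

```python
import numpy as np

def quantize_uint8(vectors):
    """Map float32 values to 8-bit codes using one global min/max range."""
    lo, hi = float(vectors.min()), float(vectors.max())
    scale = (hi - lo) / 255.0 or 1.0  # fall back to 1.0 if all values are equal
    codes = np.round((vectors - lo) / scale).astype(np.uint8)
    return codes, lo, scale

def dequantize(codes, lo, scale):
    """Approximately reconstruct the original float32 values."""
    return codes.astype(np.float32) * scale + lo

# Hypothetical data: 1,000 random 384-dimensional vectors.
rng = np.random.default_rng(0)
vectors = rng.normal(size=(1000, 384)).astype(np.float32)

codes, lo, scale = quantize_uint8(vectors)
restored = dequantize(codes, lo, scale)

max_err = float(np.abs(vectors - restored).max())
print(f"float32: {vectors.nbytes} bytes, uint8: {codes.nbytes} bytes")
print(f"max reconstruction error: {max_err:.4f}")
```

Each value shrinks from 4 bytes to 1, and the reconstruction error per value is bounded by roughly half the quantization step (`scale / 2`).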

3. Hands-On: Python Example Code

The examples below use sentence-transformers to generate embeddings, insert them into a database, and run a similarity query.

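Example: Pinecone

# Assumes a current `pinecone` client (v3 or later); the older pinecone.init() entry point has been removed.
from pinecone import Pinecone, ServerlessSpec
from sentence_transformers import SentenceTransformer

pc = Pinecone(api_key="your-api-key")
index_name = "demo-index"
# The cloud and region are placeholder assumptions; use the values for your own project.
pc.create_index(
    name=index_name,
    dimension=384,  # all-MiniLM-L6-v2 produces 384-dimensional vectors
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)
index = pc.Index(index_name)

model = SentenceTransformer("all-MiniLM-L6-v2")
texts = ["AI is transforming the world", "Machine learning is powerful"]
embeddings = model.encode(texts)

# Convert NumPy arrays to plain lists before sending them to Pinecone.
index.upsert(vectors=[
    ("id1", embeddings[0].tolist(), {"source": "blog"}),
    ("id2", embeddings[1].tolist(), {"source": "article"}),
])

query_embedding = model.encode(["Artificial intelligence applications"])
results = index.query(vector=query_embedding[0].tolist(), top_k=2, include_metadata=True)
print(results)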

Example: FAISS

import faiss
import numpy as np
from sentence_transformers import SentenceTransformer

# Load the model and generate embeddings
model = SentenceTransformer("all-MiniLM-L6-v2")
texts = ["AI is transforming the world", "Machine learning is powerful"]
embeddings = model.encode(texts)
# FAISS expects float32 input
embeddings = np.array(embeddings).astype("float32")
# Create an exact (flat) index that uses L2 distance
d = embeddings.shape[1]  # vector dimension
index = faiss.IndexFlatL2(d)
# Add the vectors to the index
index.add(embeddings)
# Search for the most similar vectors
query = model.encode(["Artificial intelligence applications"])
query = np.array(query).astype("float32")
distances, indices = index.search(query, k=2)
# Print the results
for i, idx in enumerate(indices[0]):
    print(f"Match {i+1}: {texts[idx]} (Distance: {distances[0][i]:.4f})")
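
One detail that helps when reading the FAISS output: `IndexFlatL2` reports squared Euclidean distances, while the Pinecone and Qdrant examples use cosine similarity. If the vectors are normalized to unit length, the two are directly related by ‖a − b‖² = 2 − 2·cos(a, b), so both metrics produce the same ranking. The sketch below checks this identity numerically with NumPy; the random vectors are stand-ins for real embeddings and are an assumption of this example.

```python
import numpy as np

# Two random 384-dimensional vectors standing in for embeddings (hypothetical data).
rng = np.random.default_rng(42)
a = rng.normal(size=384)
b = rng.normal(size=384)

# Normalize both vectors to unit length.
a /= np.linalg.norm(a)
b /= np.linalg.norm(b)

cosine = float(a @ b)
squared_l2 = float(np.sum((a - b) ** 2))

print(f"squared L2 distance: {squared_l2:.6f}")
print(f"2 - 2 * cosine:      {2 - 2 * cosine:.6f}")
```

A smaller squared L2 distance therefore corresponds to a higher cosine similarity, provided the vectors were normalized first.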

Example: Qdrant

from qdrant_client import QdrantClient
from qdrant_client.models import PointStruct, Distance, VectorParams
from sentence_transformers import SentenceTransformer

# In-memory local mode: no server required, and data is lost when the process exits
client = QdrantClient(":memory:")
model = SentenceTransformer("all-MiniLM-L6-v2")
texts = ["AI is transforming the world", "Machine learning is powerful"]
embeddings = model.encode(texts)
client.create_collection(
    collection_name="demo",
    vectors_config=VectorParams(size=384, distance=Distance.COSINE),
)
# Convert NumPy arrays to plain lists so the point models validate cleanly
points = [
    PointStruct(id=i, vector=embeddings[i].tolist(), payload={"text": texts[i]})
    for i in range(len(texts))
]
client.upsert(collection_name="demo", points=points)
query = model.encode(["Artificial intelligence applications"])[0]
# Assumes a recent qdrant-client; older releases exposed this as client.search(...)
results = client.query_points(collection_name="demo", query=query.tolist(), limit=2).points
print(results)