使用AI实现自然语言接口与NebulaGraph交互

本文链接：https://blog.youkuaiyun.com/dgay_hua/article/details/146476458

NebulaGraph是一款开源、分布式、可扩展的图数据库，专为支持超大规模图数据的查询而设计，实现了毫秒级的查询延迟。它采用了nGQL作为查询语言，nGQL是一种类似SQL的声明式图查询语言，适用于开发者和运维人员。本文将展示如何利用大型语言模型（LLM）为NebulaGraph数据库提供自然语言接口。

技术背景介绍

在处理图数据时，传统的数据库系统难以应对大规模复杂图数据的查询需求。NebulaGraph通过其高效的图存储和查询技术，能够在分布式环境中处理庞大的图数据，使得其在社交网络、金融风控、推荐系统等场景中表现优异。

核心原理解析

使用大型语言模型（LLM）如OpenAI的模型，通过自然语言处理技术将人类的自然语言转化为结构化的nGQL语句，使得用户可以通过日常语言与NebulaGraph进行互动。这种接口大大降低了用户查询复杂图数据库的门槛。

代码实现演示

我们将在以下步骤中实现该功能：

设置NebulaGraph环境：

可以使用Docker容器启动NebulaGraph集群。
```
curl -fsSL nebula-up.siwei.io/install.sh | bash
```

创建数据库及模式(schema)：

# 安装Jupyter插件
%pip install --upgrade --quiet ipython-ngql
%load_ext ngql

# 连接NebulaGraph
%ngql --address 127.0.0.1 --port 9669 --user root --password nebula

# 创建空间
%ngql CREATE SPACE IF NOT EXISTS langchain(partition_num=1, replica_factor=1, vid_type=fixed_string(128));
%ngql USE langchain;

# 创建模式
%%ngql
CREATE TAG IF NOT EXISTS movie(name string);
CREATE TAG IF NOT EXISTS person(name string, birthdate string);
CREATE EDGE IF NOT EXISTS acted_in();
CREATE TAG INDEX IF NOT EXISTS person_index ON person(name(128));
CREATE TAG INDEX IF NOT EXISTS movie_index ON movie(name(128));

插入数据：

%%ngql
INSERT VERTEX person(name, birthdate) VALUES "Al Pacino":("Al Pacino", "1940-04-25");
INSERT VERTEX movie(name) VALUES "The Godfather II":("The Godfather II");
INSERT VERTEX movie(name) VALUES "The Godfather Coda: The Death of Michael Corleone":("The Godfather Coda: The Death of Michael Corleone");
INSERT EDGE acted_in() VALUES "Al Pacino"->"The Godfather II":();
INSERT EDGE acted_in() VALUES "Al Pacino"->"The Godfather Coda: The Death of Michael Corleone":();

查询图数据：

使用Langchain的NebulaGraphQAChain接口：

from langchain.chains import NebulaGraphQAChain
from langchain_community.graphs import NebulaGraph
from langchain_openai import ChatOpenAI

graph = NebulaGraph(
    space="langchain",
    username="root",
    password="nebula",
    address="127.0.0.1",
    port=9669,
    session_pool_size=30,
)

chain = NebulaGraphQAChain.from_llm(
    ChatOpenAI(temperature=0), graph=graph, verbose=True
)

result = chain.run("Who played in The Godfather II?")
print(result)