LangChain进行格式化输出

最新推荐文章于 2025-06-12 10:41:46 发布

字节自动化测试

最新推荐文章于 2025-06-12 10:41:46 发布

阅读量704

点赞数 24

分类专栏：软件测试自动化测试程序员文章标签： langchain 学习程序员程序人生深度学习测试工具软件测试

本文链接：https://blog.youkuaiyun.com/software_test010/article/details/147426929

版权

程序员同时被 3 个专栏收录

1446 篇文章

订阅专栏

软件测试

1441 篇文章

订阅专栏

自动化测试

848 篇文章

订阅专栏

📝 面试求职： 「面试试题小程序」，内容涵盖测试基础、Linux操作系统、MySQL数据库、Web功能测试、接口测试、APPium移动端测试、Python知识、Selenium自动化测试相关、性能测试、性能测试、计算机网络知识、Jmeter、HR面试，命中率杠杠的。（大家刷起来…）

📝 职场经验干货：

软件测试工程师简历上如何编写个人信息（一周8个面试）

软件测试工程师简历上如何编写专业技能（一周8个面试）

软件测试工程师简历上如何编写项目经验（一周8个面试）

软件测试工程师简历上如何编写个人荣誉（一周8个面试）

软件测试行情分享（这些都不了解就别贸然冲了.）

软件测试面试重点，搞清楚这些轻松拿到年薪30W+

软件测试面试刷题小程序免费使用（永久使用）

LangChain是一个强大的框架，可以帮助开发者更高效地构建基于大型语言模型（LLM）的应用。在实际开发中，我们常常希望模型的输出不是一段纯文本，而是结构化、格式化的数据（如JSON、表格等），以便下游进一步处理。

一、为什么需要格式化输出？

结构化数据易于处理
可以直接转换为数据库记录、前端展示等。
方便后续逻辑判断
如根据意图分类或信息抽取结果做不同操作。
提升交互体验
让用户或系统能更好地理解和利用LLM的输出内容。

二、LangChain中的输出解析器（Output Parser）

https://python.langchain.com/api_reference/core/output_parsers.html
LangChain提供了多种内置的OutputParser，也支持自定义。常用的有：

StrOutputParser
：直接返回字符串（默认）
JsonOutputParser
：解析JSON格式内容

三、代码实战——让LLM返回JSON格式的测试用例

1、安装依赖

pip install -U langchain langchain_core langchain-openai openai pydantic

2、使用LangSmith观察结果

https://smith.langchain.com/

import os  

os.environ["LANGSMITH_TRACING"] = "true"  
os.environ["LANGSMITH_API_KEY"] = "lsv2_xxx"

3、编写想要输出的json格式

from pydantic import BaseModel, Field  

class TestCase(BaseModel):  
    Method: str = Field(description="测试方法名称,例如：等价类划分、边界值分析、场景法、状态转换、安全测试、性能测试 等")  
    caseDesc: str = Field(description="用例描述")  
    caseStep: str = Field(description="用例步骤")  
    expectResult: str = Field(description="用例预期结果")

4、拼接提示词

from langchain_core.prompts import PromptTemplate 
from langchain_core.output_parsers.json import JsonOutputParser

parser = JsonOutputParser(pydantic_object=TestCase)  

template = """你是一名资深测试工程师。请根据下述模块说明，使用适当的测试方法，编写完整的功能测试用例，数量不少于10条：  
模块说明: {module_desc}  
{format_instructions}  
"""  

prompt = PromptTemplate(  
    template=template,  
    input_variables=["module_desc"],  
    partial_variables={"format_instructions": parser.get_format_instructions()},  
)

5、调用大模型

from langchain_openai import ChatOpenAI  


llm = ChatOpenAI(  
    model="gpt-4.1",  
    api_key="sk-xxx",  
    base_url="https://oneapi.qunhequnhe.com/v1"  
)  

chain = prompt | llm | parser  
chain.invoke({"module_desc": "登录场景"})

四、查看结果

1、查看运行结果

2、查看LangSmith结果

可以看到合并后的提示词：

你是一名资深测试工程师。请根据下述模块说明，使用适当的测试方法，编写完整的功能测试用例，数量不少于10条：
模块说明: 登录场景
The output should be formatted as a JSON instance that conforms to the JSON schema below.

As an example, for the schema {"properties": {"foo": {"title": "Foo", "description": "a list of strings", "type": "array", "items": {"type": "string"}}}, "required": ["foo"]}
the object {"foo": ["bar", "baz"]} is a well-formatted instance of the schema. The object {"properties": {"foo": ["bar", "baz"]}} is not well-formatted.

Here is the output schema:
"""
{"properties": {"Method": {"description": "测试方法名称,例如：等价类划分、边界值分析、场景法、状态转换、安全测试、性能测试 等", "title": "Method", "type": "string"}, "caseDesc": {"description": "用例描述", "title": "Casedesc", "type": "string"}, "caseStep": {"description": "用例步骤", "title": "Casestep", "type": "string"}, "expectResult": {"description": "用例预期结果", "title": "Expectresult", "type": "string"}}, "required": ["Method", "caseDesc", "caseStep", "expectResult"]}
"""