探索定制输出解析器：构建智能输出的钥匙-优快云博客

引言

在使用大型语言模型（LLM）或聊天模型时，默认的输出格式可能无法满足特定需求。这时，自定义输出解析器就成为了一种解决方案。本文将介绍如何实现自定义解析器，探讨两种主要实现方法，并提供实用的代码示例。

主要内容

自定义解析器的实现方式

使用 Runnable Lambda 或 Runnable Generator

这是大多数场景下推荐的实现方式。它们使得解析器与模型在一个管道中协同工作，实现简单、直观。

继承解析基类

这是一个较为复杂的方式，适用于需要更细致控制的场景。通过继承 BaseOutputParser 或类似类实现。

可运行 Lambdas 和生成器

我们将实现一个简单的解析器，它将模型输出的字母大小写反转。例如，“Meow” 将被转换为 “mEOW”。

from typing import Iterable
from langchain_anthropic.chat_models import ChatAnthropic
from langchain_core.messages import AIMessage, AIMessageChunk

model = ChatAnthropic(model_name="claude-2.1")

def parse(ai_message: AIMessage) -> str:
    """Parse the AI message."""
    return ai_message.content.swapcase()

chain = model | parse
chain.invoke("hello")  # 输出: 'hELLO!'

流式解析

通过 RunnableGenerator 实现流式解析。

from langchain_core.runnables import RunnableGenerator

def streaming_parse(chunks: Iterable[AIMessageChunk]) -> Iterable[str]:
    for chunk in chunks:
        yield chunk.content.swapcase()

streaming_parse = RunnableGenerator(streaming_parse)

chain = model | streaming_parse
for chunk in chain.stream("tell me about yourself in one sentence"):
    print(chunk, end="|", flush=True)

继承解析基类

通过继承 BaseOutputParser 实现解析器，如下所示的布尔值解析器：

from langchain_core.exceptions import OutputParserException
from langchain_core.output_parsers import BaseOutputParser

class BooleanOutputParser(BaseOutputParser[bool]):
    """Custom boolean parser."""
    true_val: str = "YES"
    false_val: str = "NO"

    def parse(self, text: str) -> bool:
        cleaned_text = text.strip().upper()
        if cleaned_text not in (self.true_val.upper(), self.false_val.upper()):
            raise OutputParserException(
                f"BooleanOutputParser expected output value to either be "
                f"{self.true_val} or {self.false_val} (case-insensitive). "
                f"Received {cleaned_text}."
            )
        return cleaned_text == self.true_val.upper()