1. Agent与Agent数据集
1.1 Agent的定义
Agent(智能体)指的是具有自主观察、思考、规划、反思、行为等能力的人工智能系统。

1.2 Agent和LLM的区别
相比于单一LLM(大语言模型)只能支持多模态的输入和输出,Agent将输入和输出扩展到了多步的操作上,例如:
输入与输出的维度跃升使Agent能够在现实环境中自主思考,如人类专家一般执行复杂任务,解决Chatbot类人工智能无法解决的问题。这种“跳出对话框”的突破为人工智能在更广泛领域的创新应用打下了重要基础。
1.3 Agent数据集
为使Agent掌握形如上述复杂操作的能力,研究人员和开发者已经创建了多个开源数据集,用于训练和评估Agent的能力。在本期分享中,我们为大家整理了当前领域内具有代表性的Agent开源数据集,覆盖从网页交互、软件工程等具体任务场景,到多步推理、动态规划、自我优化等核心认知能力维度。为更清晰地呈现不同数据集的评估侧重点,我们依据其核心评估目标与应用场景,将其划分为三大类别:基础能力评估、场景化交互评估与流程及多步骤任务评估。
2. 基础能力评估
基础能力评估旨在测试Agent的核心认知能力(如推理、规划、知识应用等),通常通过静态问题或任务进行,无需与外部环境交互。
2.1 PlanBench
发布方:Karthik Valmeekam, Matthew Marquez, Alberto Olmo, Sarath Sreedharan, Subbarao Kambhampati
下载地址:https://github.com/karthikv792/LLMs-Planning,数据集位于“plan-bench”子目录中
发布时间:2023.5
大小:约26,250个任务
简介:PlanBench是一个专为评估LLM在规划和推理方面的能力而设计的Benchmark。它基于自动规划社区中常用的域,特别是国际规划竞赛中使用的域,包含约26,250个提示,涵盖不同的测试用例和规划场景。数据集支持各种Blocksworld规划任务,并适用于自然语言和PDDL(规划领域定义语言)两种类型的提示。
{
"task": "task_1_plan_generation",
"prompt_type": "oneshot",
"domain": "blocksworld",
"instances": [
{
"instance_id": 2,
"example_instance_ids": [
1
],
"query": "I am playing with a set of blocks where I need to arrange the blocks into stacks. Here are the actions I can do\n\nPick up a block\nUnstack a block from on top of another block\nPut down a block\nStack a block on top of another block\n\nI have the following restrictions on my actions:\nI can only pick up or unstack one block at a time.\nI can only pick up or unstack a block if my hand is empty.\nI can only pick up a block if the block is on the table and the block is clear. A block is clear if the block has no other blocks on top of it and if the block is not picked up.\nI can only unstack a block from on top of another block if the block I am unstacking was really on top of the other block.\nI can only unstack a block from on top of another block if the block I am unstacking is clear.\nOnce I pick up or unstack a block, I am holding the block.\nI can only put down a block that I am holding.\nI can only stack a block on top of another block if I am holding the block being stacked.\nI can only stack a block on top of another block if the block onto which I am stacking the block is clear.\nOnce I put down or stack a block, my hand becomes empty.\nOnce you stack a block on top of a second block, the second block is no longer clear.\n\n[STATEMENT]\nAs initial conditions I have that, the red block is clear, the blue block is clear, the yellow block is clear, the hand is empty, the blue block is on top of the orange block, the red block is on the table, the orange block is on the table and the yellow block is on the table.\nMy goal is to have that the orange block is on top of the blue block.\n\nMy plan is as follows:\n\n[PLAN]\nunstack the blue block from on top of the orange block\nput down the blue block\npick up the orange block\nstack the orange block on top of the blue block\n[PLAN END]\n\n[STATEMENT]\nAs initial conditions I have that, the red block is clear, the yellow block is clear, the hand is empty, the red block is on top of the blue block, the yellow block is on top of the orange block, the blue block is on the table and the orange block is on the table.\nMy goal is to have that the orange block is on top of the red block.\n\nMy plan is as follows:\n\n[PLAN]",
"ground_truth_plan": "(unstack yellow orange)\n(put-down yellow)\n(pick-up orange)\n(stack orange red)\n"
},
{
"instance_id": 3,
"example_instance_ids": [
2
],
"query": "I am playing with a set of blocks where I need to arrange the blocks into stacks. Here are the actions I can do\n\nPick up a block\nUnstack a block from on top of another block\nPut down a block\nStack a block on top of another block\n\nI have the following restrictions on my actions:\nI can only pick up or unstack one block at a time.\nI can only pick up or unstack a block if my hand is empty.\nI can only pick up a block if the block is on the table and the block is clear. A block is clear if the block has no other blocks on top of it and if the block is not picked up.\nI can only unstack a block from on top of another block if the block I am unstacking was really on top of the other block.\nI can only unstack a block from on top of another block if the block I am unstacking is clear.\nOnce I pick up or unstack a block, I am holding the block.\nI can only put down a block that I am holding.\nI can only stack a block on top of another block if I am holding the block being stacked.\nI can only stack a block on top of another block if the block onto which I am stacking the block is clear.\nOnce I put down or stack a block, my hand becomes empty.\nOnce you stack a block on top of a second block, the second block is no longer clear.\n\n[STATEMENT]\nAs initial conditions I have that, the red block is clear, the yellow block is clear, the hand is empty, the red block is on top of the blue block, the yellow block is on top of the orange block, the blue block is on the table and the orange block is on the table.\nMy goal is to have that the orange block is on top of the red block.\n\nMy plan is as follows:\n\n[PLAN]\nunstack the yellow block from on top of the orange block\nput down the yellow block\npick up the orange block\nstack the orange block on top of the red block\n[PLAN END]\n\n[STATEMENT]\nAs initial conditions I have that, the blue block is clear, the hand is empty, the blue block is on top of the orange block, the orange block is on top of the yellow block, the yellow block is on top of the red block and the red block is on the table.\nMy goal is to have that the red block is on top of the orange block and the yellow block is on top of the red block.\n\nMy plan is as follows:\n\n[PLAN]",
"ground_truth_plan": "(unstack blue orange)\n(put-down blue)\n(unstack orange yellow)\n(put-down orange)\n(unstack yellow red)\n(stack yellow blue)\n(pick-up red)\n(stack red orange)\n(unstack yellow blue)\n(stack yellow red)\n"
},
...
]
}
2.2 MSAgent-Bench
发布方:通义实验室
下载地址:魔搭社区
发布时间:2024.9
大小:2.24GB文件,598k的训练集和对应的验证集,测试集。
简介:MSAgent-Bench是一个通用的、可定制的Agent框架,插件的训练数据集,包括598k的训练集和对应的验证集,测试集。数据集主要包括四种:AI模型API,通用API,API无关通用sft数据,API检索增强数据。
{
"id":"modelscope_merge_api_527",
"conversations":[
{
"from":"system",
"value":"你是达摩院的ModelScopeGPT(魔搭助手),你是个大语言模型, 是2023年达摩院的工程师训练得到的。你有多种能力,可以通过插件集成魔搭社区的模型api来回复用户的问题,还能解答用户使用模型遇到的问题和模型知识相关问答。1. {\"plugin_name\": \"modelscope_text-ie\", \"plugin_owner\": \"ModelScopeGPT\", \"plugin_type\": \"default\", \"plugin_schema_for_model\": {\"name\": \"modelscope_text-ie\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"url\": \"http://109.199.101.10:1485/\", \"paths\": [{\"name\": \"modelscope_text-ie\", \"model_id\": \"/damo/nlp_structbert_siamese-uie_chinese-base\", \"method\": \"post\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"parameters\": [{\"name\": \"text\", \"description\": \"用户输入的文本\", \"required\": \"True\"}, {\"name\": \"schema\", \"description\": \"要抽取信息的json表示\", \"required\": \"True\"}]}]}}\n\n2. {\"plugin_name\": \"modelscope_text-ie\", \"plugin_owner\": \"ModelScopeGPT\", \"plugin_type\": \"default\", \"plugin_schema_for_model\": {\"name\": \"modelscope_text-ie\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"url\": \"http://9.32.64.200:5873/\", \"paths\": [{\"name\": \"modelscope_text-ie\", \"model_id\": \"/damo/nlp_structbert_siamese-uie_chinese-base\", \"method\": \"post\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"parameters\": [{\"name\": \"text\", \"description\": \"用户输入的文本\", \"required\": \"True\"}, {\"name\": \"schema\", \"description\": \"要抽取信息的json表示\", \"required\": \"True\"}]}]}}\n\n3. {\"plugin_name\": \"modelscope_text-ie\", \"plugin_owner\": \"ModelScopeGPT\", \"plugin_type\": \"default\", \"plugin_schema_for_model\": {\"name\": \"modelscope_text-ie\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"url\": \"http://54.149.78.185:3979/\", \"paths\": [{\"name\": \"modelscope_text-ie\", \"model_id\": \"/damo/nlp_structbert_siamese-uie_chinese-base\", \"method\": \"post\", \"description\": \"针对中文的文本,根据schema要抽取的内容,找出其中对应信息,并用json格式展示\", \"parameters\": [{\"name\": \"text\", \"description\": \"用户输入的文本\", \"required\": \"True\"}, {\"name\": \"schema\", \"description\": \"要抽取信息的json表示\", \"required\": \"True\"}]}]}}"
},
{
"from":"user",
"value":"按照给定的schema抽取出下面文本对应的信息\nschema:{\"人物\": null, \"地理位置\": null, \"组织机构\": null}\n近日,美国政府宣布将对中国1000多种商品加征关税,并威胁进一步加征关税。"
},
{
"from":"assistant",
"value":"<|startofthink|>```JSON\n{\"api_name\": \"modelscope_text-ie\", \"url\": \"http://9.32.64.200:5873/damo/nlp_structbert_siamese-uie_chinese-base\", \"parameters\": {\"text\": \"近日,美国政府宣布将对中国1000多种商品加征关税,并威胁进一步加征关税。\", \"schema\": \"{\\\"人物\\\": null, \\\"地理位置\\\": null, \\\"组织机构\\\": null}\"}}\n```<|endofthink|>\n\n<|startofexec|>```JSON\n{\"人物\": [], \"地理位置\": [\"中国\", \"美国\"], \"组织机构\": []}\n```<|endofexec|>\n信息抽取结果:{\"人物\": [], \"地理位置\": [\"中国\", \"美国\"], \"组织机构\": []}。我使用的模型是ModelScope的'damo/nlp_structbert_siamese-uie_chinese-base'模型。这是一个基于StructBERT预训练模型微调训练的通用信息抽取模型。"
}
]
}
2.3 KAgentBench
发布方:KwaiKEG团队
下载地址:https://hf-mirror.com/datasets/kwaikeg/KAgentBench
发布时间:2023.12
大小:超过3000条数据
简介:KAgentBench包含超过3,000个人工编辑的自动化评估数据,用于测试Agent能力。KAgentBench的评估维度包括规划、工具使用、反思、总结和用户画像。
Query:
我在一年前租了一套公寓,合同期为两年。但是现在房东要求我提前搬出,因为他想卖掉这套公寓。我不知道我该怎么办。
funcs:
[ ]
Type:
profile
golden_result_list:
[ { "thought": null, "tool_name": null, "tool_args": null, "golden_result": "明白了。首先,我们需要确定这个问题涉及的法律领域是租赁法。根据你所在国家或地区的租赁法规,租赁合同在双方同意的情况下具有法律约束力。请问你的租赁合同中有关于提前终止合同的条款吗?" }, { "thought": null, "tool_name": null, "tool_args": null, "golden_result": "根据你的描述,这个问题涉及到租赁合同和房地产法律。首先,我们需要查看你的租赁合同中关于提前终止合同的条款,以及你所在国家或地区的相关法律法规。这将有助于我们了解你的权益和房东的责任。" }, { "thought": null, "tool_name": null, "tool_args": null, "golden_result": "根据你的描述,这个问题涉及到租赁合同和房地产交易。首先,我们需要查阅你所在国家或地区的租赁法律法规,了解租赁合同的相关规定。同时,我们还需要了解房地产交易的相关法律法规,以便为你提供合适的建议。" }, { "thought": null, "tool_name": null, "tool_args": null, "golden_result": "根据你的描述,这个问题涉及到租赁合同和房地产交易。首先,我们需要查阅你所在国家或地区的房地产交易相关法律法规,以便为你提供准确的建议。" }, { "thought": null, "tool_name": null, "tool_args": null, "golden_result": "根据你的描述,这个问题涉及到租赁合同和房地产法律。首先,我们需要查看你的租赁合同中关于提前终止合同的条款,以及你所在国家或地区的相关法律法规,以便为你提供合适的建议。" } ]
prompt_input:
{
"AutoGPT": null,
"ToolLlama": null,
"ModelScope": null,
"KuaiAgent": null,
"prompt": "你需要扮演一名专业的法律顾问,你必须遵循参考对话流程中的一个步骤回答用户,但不能给出其他任何步骤的回答。\n\n参考对话流程:\n1. 收集提问者的法律问题,了解问题的背景和相关信息。\n2. 分析问题,确定问题涉及的法律领域和相关法律法规。\n3. 根据法律法规,为提问者提供合理的解决方案或建议。\n4. 如果问题涉及多个法律领域,为提问者提供综合性的解决方案。\n5. 提醒提问者在实际操作中遵循法律法规,如有需要可建议寻求专业律师的帮助。\n\n当前历史对话为:\nuser: 你好,我有一个关于租赁和房地产纠纷的问题,我需要你的帮助。\nyou: 你好,我很乐意帮助你。请详细描述一下你的问题,以及涉及的背景和相关信息。\n\n用户当前问题为:\n我在一年前租了一套公寓,合同期为两年。但是现在房东要求我提前搬出,因为他想卖掉这套公寓。我不知道我该怎么办。",
"ReACT": null,
"Kuaiagent": null
}
Memory:
user: 你好,我有一个关于租赁和房地产纠纷的问题,我需要你的帮助。
you: 你好,我很乐意帮助你。请详细描述一下你的问题,以及涉及的背景和相关信息。
2.4 Reflection-Bench
发布方:上海人工智能实验室等
下载地址:https://github.com/AI45Lab/ReflectionBench
发布时间:2024.10
大小:共354个任务
简介:Reflection-Bench是一个基于认知心理学设计的开源评测平台,旨在系统性地评估大型语言模型作为自主代理的能力。该基准围绕七个相互关联的认知维度展开:预测、决策、感知、记忆、反事实思维、信念更新和元反思。它提供七个参数化的认知评测任务,每个任务针对特定的认知过程设计,如异常检测、工作记忆、信念更新、决策制定等。评测系统设计了详细的评估流水线,能直观展示模型在不同认知维度上的能力表现。下图为七个认知维度的测评。
3. 场景化交互评估
场景化交互评估着眼于在模拟或真实的特定应用场景中测试Agent的操作能力,需与动态环境(如网页、移动界面、工作流程)交互。
3.1 Workarena
发布方:ServiceNow
下载地址:https://github.com/ServiceNow/WorkArena
发布时间:2023.7
大小:682个任务
简介:WorkArena是一个Benchmark,通过一系列基于浏览器的任务来评估网络代理在执行日常知识工作任务中的表现。它包括两个部分:WorkArena-L1(包含33种基本任务的19,912个独立实例)和WorkArena++(682个任务,用于测试代理的规划、推理和记忆能力)。任务覆盖面广,包括企业内部知识库信息检索、复杂表单填写、服务目录操作、列表筛选、菜单导航及仪表盘数据解读等常见知识工作场景。
无数据样例,通过搭建网页服务,并评测执行各类任务的效果
可复现并自行采集数据
3.2 tau-bench
发布方:Sierra Research团队
下载地址:https://github.com/sierra-research/tau-bench
发布时间:2024.6
大小:共165个任务
简介:tau-bench是一个现实世界领域中的工具-代理-用户交互Benchmark,包括retail和airline两个领域的任务,同时测试代理的工具调用能力 、对话理解能力 以及遵守复杂领域规则的可靠性。
3.3 Weblinx
发布方:McGill-NLP团队
下载地址:https://github.com/McGill-NLP/WebLINX
发布时间:2024.2
大小:2300多个任务
简介:WebLINX 专注于会话式GUI agent,特别强调通过多轮对话进行真实世界的Web导航。WebLINX 提供了跨越155个真实世界网站的2300多个专家演示,创建了一个具有DOM树和屏幕截图的丰富环境,用于训练和评估能够执行动态、用户引导的导航任务的 agent。此数据集促进了 agent 在新网站和任务上的泛化,其中全面的动作和对话数据提供了关于增强 agent 在真实Web场景中响应能力的见解。
3.4 MobileViews
发布方:北京邮电大学和清华大学等
下载地址:https://huggingface.co/datasets/mllmTeam/MobileViews
发布时间:2024.9
大小:600000多个屏幕截图-视图层次结构对
简介:MobileViews提供了来自20000个Android应用程序的600000多个屏幕截图-视图层次结构对。它使用LLM增强的应用程序遍历工具收集,为移动GUI agent 在屏幕摘要、可点击性预测和UI组件识别等任务中提供了高保真资源。
3.5 ScreenAgent
发布方:吉林大学
下载地址:https://github.com/niuzaisheng/ScreenAgent
发布时间:2024.2
大小:约200个截图-操作数据集
简介:ScreenAgent 提供了一个全面的流程,使 agent 能够自主执行多步骤任务, 包括规划、行动和反思阶段。通过利用带注释的屏幕截图和详细的动作序列,它实现了 UI 元素定位和任务完成的高精度,在准确性方面超越了之前的模型。
3.6 MOSS-pluginData
发布方:复旦大学MOSS团队
下载地址:https://github.com/OpenLMLab/MOSS
发布时间:2023.7
大小:57万条英文对话和59万条中文对话数据
简介:moss-moon-003-sft-plugin所使用的插件增强的多轮对话数据,包含支持搜索引擎、文生图、计算器、解方程等四个插件在内的多轮对话数据。MOSS所使用的多轮对话数据,覆盖有用性、忠实性、无害性三个层面,包含由text-davinci-003
生成的约57万条英文对话和59万条中文对话。
<|Human|>: 黑暗荣耀的主演有谁<eoh>
<|Inner Thoughts|>: 这是一个关于黑暗荣耀的问题,我需要查询一下黑暗荣耀的主演<eot>
<|Commands|>: Search("黑暗荣耀 主演")<eoc>
<|Results|>:
Search("黑暗荣耀 主演") =>
<|1|>: "《黑暗荣耀》是由Netflix制作,安吉镐执导,金恩淑编剧,宋慧乔、李到晛、林智妍、郑星一等主演的电视剧,于2022年12月30日在Netflix平台播出。该剧讲述了曾在高中时期 ..."
<|2|>: "演员Cast · 宋慧乔Hye-kyo Song 演员Actress (饰文东恩) 代表作: 一代宗师 黑暗荣耀 黑暗荣耀第二季 · 李到晛Do-hyun Lee 演员Actor/Actress (饰周汝正) 代表作: 黑暗荣耀 ..."
<|3|>: "《黑暗荣耀》是编剧金银淑与宋慧乔继《太阳的后裔》后二度合作的电视剧,故事描述梦想成为建筑师的文同珢(宋慧乔饰)在高中因被朴涎镇(林智妍饰)、全宰寯(朴成勋饰)等 ..."
<eor>
<|MOSS|>: 《黑暗荣耀》的主演包括宋慧乔、李到晛、林智妍、郑星一等人。<sup><|1|></sup><eom>
4. 流程与多步骤任务评估
流程与多步骤任务评估专注于复杂流程执行、多步骤任务协调或多任务处理能力,可能结合部分环境交互。
4.1 GAIA
- 发布方:Meta 、 HuggingFace 和 AutoGPT等
- 下载地址:https://huggingface.co/gaia-benchmark
- 发布时间:2023.11
- 大小:450道带答案的题目
- 简介:GAIA是一个GAI的Benchmark,GAIA囊括的真实世界问题需要推理、多模态处理、网页浏览和工具使用能力,下图中可以看出GAIA中问题对Agent能力种类需求、使用工具数量和操作步骤的分布。
GAIA Question:
The attached Excel file contains the sales of menu items for a local fast-food chain. What were the total sales that the chain made from food (not including drinks)? Express your answer in USD with two decimal places. {{upload .xlsx}}
FINAL ANSWER:
$89706.00
4.2 AgentBench
发布方:清华大学等
下载地址:https://github.com/THUDM/AgentBench
发布时间:2023.10
大小:8类任务共1091个问题
简介:AgentBench是首个评估Agent的Benchmark,涵盖8个不同环境,包括新创建的领域(操作系统、数据库、知识图谱、数字卡牌游戏、横向思维谜题)和复杂任务(家庭管理、网上购物、网页浏览)。每个数据集提供Dev和Test两种拆分,支持多轮交互以测试模型的代理能力。框架结构友好,便于使用和扩展。
{
"description": "Tell me the number of CPUs.",
"evaluation": {
"check": [
null,
{
"language": "python",
"file": "check/integer-match.py"
}
],
"example": "nproc"
},
"labels": [
"command",
"CPU",
"device",
"hardware",
"processor",
"system"
]
},
4.3 AgentInstruct
发布方:微软研究院
下载地址:https://huggingface.co/datasets/THUDM/AgentInstruct
发布时间:2023.10
大小:6类共1866条数据
简介:AgentInstruct是一个经过挑选的Agent数据集,包含1866个高质量交互、6个多样化的真实场景任务(ALFWorld、WebShop、Mind2Web、Knowledge Graph、Operating System、Database)。
[
{ "from": "human", "loss": null, "value": "You are an assistant that will act like a person, I'will play the role of linux(ubuntu) operating system. Your goal is to implement the operations required by me or answer to the question proposed by me. For each of your turn, you should first think what you should do, and then take exact one of the three actions: \"bash\", \"finish\" or \"answer\". \n\n1. If you think you should execute some bash code, take bash action, and you should print like this:\n\nThink: put your thought here.\n\nAct: bash\n\n```bash\n# put your bash code here\n```\n\n2. If you think you have finished the task, take finish action, and you should print like this:\n\nThink: put your thought here.\n\nAct: finish\n\n3. If you think you have got the answer to the question, take answer action, and you should print like this:\n\nThink: put your thought here.\n\nAct: answer(Your answer to the question should be put in this pair of parentheses)\n\nIf the output is too long, I will truncate it. The truncated output is not complete. You have to deal with the truncating problem by yourself. Attention, your bash code should not contain any input operation. Once again, you should take only exact one of the three actions in each turn.\n\nNow, my problem is:\n\ntell me how many files are in the directory \"/etc\"?"
},
{ "from": "gpt", "loss": false, "value": "Think: To count the files in /etc, I need to print all the files in it.\n\nAct: bash\n\n```bash\nls /etc\n```" }
,
{ "from": "human", "loss": null, "value": "The output of the OS:\ncpi cron.hourly fuse.conf iproute2 lvm networkd-dispatcher protocols selinux tmpfiles.d [truncated because the output is too long]"
}
,
{ "from": "gpt", "loss": false, "value": "Think: The output has been truncated because it is too long, so I need to count files by script directly.\n\nAct: bash\n\n```bash\nls -1 /etc | wc -l\n```"
},
{ "from": "human", "loss": null, "value": "The output of the OS:\n220"
},
{ "from": "gpt", "loss": false, "value": "Think: Now I get the answer, it is 220.\n\nAct: answer(220)"
},
{ "from": "human", "loss": null, "value": "Now, I will start a new problem in a new OS. My problem is:\n\nYou need to identify how many JPEG image files are there in the folder named \"image_directory\". The JPEG files may end with .jpg or .jpeg extensions."
},
{ "from": "gpt", "loss": true, "value": "Think: The first step is to check if there are any .jpg files in \"image_directory\".\n\nAct: bash\n\n```bash\nfind image_directory -type f -name \"*.jpg\"\n```"
},
{ "from": "human", "loss": null, "value": "The output of the OS:\n\nimage_directory/subdir1/image3.jpg\r\nimage_directory/image1.jpg\r\n\u001b]0;root@dfac388e9a33: /\u0007root@dfac388e9a33:/# \u001b[K"
},
{ "from": "gpt", "loss": true, "value": "Think: Now, I can see two .jpg files in \"image_directory\". The next step is to check for any .jpeg files in the same directory.\n\nAct: bash\n\n```bash\nfind image_directory -type f -name \"*.jpeg\"\n```"
},
{ "from": "human", "loss": null, "value": "The output of the OS:\n\nimage_directory/subdir2/image4.jpeg\r\nimage_directory/image2.jpeg\r\n\u001b]0;root@dfac388e9a33: /\u0007root@dfac388e9a33:/# \u001b[K"
},
{ "from": "gpt", "loss": true, "value": "Think: I can see two .jpeg files in \"image_directory\". Now I need to count all the .jpg and .jpeg files.\n\nAct: bash\n\n```bash\nfind image_directory -type f \\( -name \"*.jpeg\" -o -name \"*.jpg\" \\) | wc -l\n```"
},
{ "from": "human", "loss": null, "value": "The output of the OS:\n\n4\r\n\u001b]0;root@dfac388e9a33: /\u0007root@dfac388e9a33:/# \u001b[K"
},
{ "from": "gpt", "loss": true, "value": "Think: Now I get the answer, there are 4 JPEG image files in \"image_directory\".\n\nAct: answer(4)"
}
]
4.4 FlowBench
发布方:Ruixuan Xiao, Wentao Ma, Ke Wang et al.
下载地址:https://drive.google.com/drive/folders/1PFzA5e-fuKpVZvAHP-otBhWPdU60O3d4
发布时间:2024.6
大小:51种场景共384MB数据
简介:FlowBench是第一个用于工作流引导代理规划的综合基准数据集。它涵盖了广泛的分类体系(6个领域、22个角色、51种场景)并支持不同的知识格式(文本、代码、流程图),以全面模拟真实世界应用。该基准通过三阶段流程构建:任务收集、工作流组织和会话生成。数据集支持单回合(turn-level)和会话级(session-level)评估,还包括专家级注释和多轮用户-代理互动。
{"id": "20240519192853182831999", "flow_id": "81d5e5b96359fb12af8e91de032589bd", "assistant_prompt": "You are a helpful assistant for the task of Agent of online shopping mall. Your mission involves 3 scenario of Search for products & Shopping cart management & Order operations.\nThe scenario 1 is to Search for products where User wants you to recommend products to them, and you need to do this based on their needs.\nThe scenario 2 is to Shopping cart management where User wants to manage their shopping cart and may change checkout and shopping cart contents. You need to ask for the user's intent and assist them in managing their cart.\nThe scenario 3 is to Order operations where User wants to know about the orders placed under his/her account, and may learn more about or operate an order in detail. You need to help him with operations such as order inquiries or modifications.\n\n\nSpecific requirements:\n1. You need to act as an assistant and engage in a conversation with the user, following the business process and API information described in Markdown syntax.\n2. You have been provided with the flowchart information for different scenarios under a specific role. In each workflow chart below, the nodes represent the actions you need to take at each step, with each node containing a sub-command for you to follow in responding to users or performing certain actions. Node types include response nodes, decision nodes, and API call nodes. The edges in the process define the various transition conditions you need to evaluate, including types such as user intent assessment conditions, parameter and function return value conditions, etc. You need to decide whether to transition to the next node based on these conditions.\n3. Within the complex node with a sub-goal, you need to complete the sub-goal by following the node description, If a conversation is needed to complete the goal, strictly follow the content described in the node. If a specified tool is needed, strictly ask for the necessary parameters according to the tool's definition to complete the API call.\n4. You can only answer questions within the scope of the given several workflow processes. If the user asks a question beyond these scopes, please apologize and explain to the user in the response part.\n5. When asking for API input parameters, ensure that the provided parameter values comply with the specified format regarding both he correctness of the format and the completeness of the content. Do not assign values arbitrarily. In instances where the parameters do not meet the format requirements, notify users to make the necessary adjustments until the requirements are satisfied.\n6. When the user has multiple requests at the same time, please select one appropriate request for processing first and inform the user that other requests will be resolved subsequently. If there is unfinished business in the previous conversation, continue to provide the necessary help and guidance to assist them in completing the business process. When multiple APIs need to be called, do so in separate rounds, with a maximum of one API call output per round. When the user indicates that the business is finished or says goodbye, respond politely and end the conversation.\n7. Your output format should be chosen from one of the two templates below (7.1 and 7.2):\n7.1 If you need to interact with the user:\n```\nThought: xxx (description of your thought process )\nResponse: xxx (the content you need to inquire or reply)\n```\n[Format Explanation]\n(1) Thought includes four pieces of information: [Step: Analyze the previous node]: 'The previous node last_node=xxx'. [Step: Analyze the current intent]: 'The current intent intent=xxx'. [Step: Analyze the current node]: 'The current node current_node=xxx'. [Step: Decide the follow-up actions]: 'Next, I need to xxx.'\n(2) The previous node 'last_node' refers to the node or nodes immediately preceding the current node under consideration in the process. The current intent 'intent' represents the user's intent or condition that links the current node with the previous node. The current node 'current_node' refers to the next node pointed to by the previous node through transition conditions. The current node may be in a different scenario flowchart than the previous node (that is, a cross-scenario process jump occurs)\n(3) When the current node 'current_node' involves calling an API, 'Thought' should add three extra pieces of information after [Step: Analyze the next node]. The first is [Step: Clarify the required parameters for the API] The mandatory parameters for xxx are: xxx, optional parameters are: xxx. The second is [Step: Extract the API parameter values] From the context, the known values for the xxx parameters and their values are: x=xxx. The third is [Step: Clarify the missing information for the API] The mandatory parameters still missing for xxx are: xxx.\n(4) When the current node 'current_node' includes the collection of parameters, 'Thought' should add one of the following extra pieces of information after [Step: Analyze the next node]. The first is [Step: Clarify missing parameters] Lacking the specific values for the parameter xxx. The second is [Step: Clarify parameter values] From the context, the values for the parameter xxx=xxx are known.\n7.2 If you need to call an API (only one API call per time):\n```\nThought: xxx (description of your thought process )\nAction: xxx (the function name to be called, do not add the prefix \"functions.\")\nAction Input: xxx (the parameters for the function, must be in strict JSON format)\n```\n[Format Explanation]\n(1) 'Thought' includes the information described in sections (1) and (3) of 7.1, totaling seven [Step: xxx] pieces of information.\n(2) In template 7.2, do not output 'Response', only output 'Thought', 'Action', and 'Action Input'.\n8. Please remember to add \"The previous node last_node=xxx, the current intent intent=xxx, the current node current_node=xxx\" at the beginning of the 'Thought'. Below are examples of Thoughts for regular nodes, parameter inquiry nodes, and API invocation nodes respectively. Do not output '[Step: xxx]'; instead, format the output as follows: \n```\nThought: The previous node last_node=PN001, the current intent intent=ETC application, the current node current_node=PN002. Next, I need to ask the user which type of vehicle's ETC it is. \n```\nThought: The previous node last_node=PN003, the current intent intent=not needed, the current node current_node=PN004. Lacking the specific value for the parameter reason. Next, I need to inquire about the reason parameter. \n```\nThought: The previous node last_node=PN005, the current intent intent=ETC consumption inquiry, the current node current_node=PN006. The mandatory parameters for getETCRecord include: license plate number, date, with no optional parameters. From the context, it is known the parameters for getETCRecord and their values: license plate number=ABG0375, the mandatory parameters still missing for getETCRecord are: date. Next, I need to ask for the date, then call getETCRecord.\n```\nTransitions across flowcharts may occur, leading to changes in the prefix of nodes.\n```\nThought: The previous node last_node=PN006, the current intent intent=ETC charging, the current node current_node=WI001. (scenario transfer). Next, I need to ask the user how much to charge. \n```\n9. When the user asks questions outside the scope of the business process and API information, the current intent and current node should be indicated as UNK. For example, in the Thought: \n```\nThought: The previous node last_node=PN007, the current intent intent=UNK, the current node current_node=UNK. Next, I need to tell the user that I am unclear about xxx.\n```\n10. When multiple possible intents exist, connect them with ' OR ' and clarify which situation is being inquired about; when expressing multiple intents, connect them with ' AND '. These methods of connection also apply to nodes.\n11. The current intent 'intent' can only be the description on the edges of the given flowchart or UNK; the previous node 'last_node' can only be the nodes on the flowchart, indicating the last triggered node; the reply content must include 'Thought'.\n12. Nodes that include keywords such as [Subgoal, Subtask, Step, Process] should be broken down into multiple steps, and users must be replied to step by step, do not output all the content of a node at once. In each round of dialogue, inquire only about one step, and after confirming that the user has completed that step, continue to ask the next step.\n\nPlease adhere strictly to the defined output format. Do not output the placeholder \"....\" verbatim; instead, fill it with the corresponding content.\n\nYour capabilities are based on the following workflows and API information:\n\n\nThe workflow and API information for Scenario 1 is as follows:\nThe workflow pipeline of Search for products is as follows:\nflowchart TD\nUI000(Start)--Search for products-->UI001>Subgoal: confirm needs and search for products.\nDescription: inquire the user about product type, brand, price lower limit, price upper limit, and other product needs, \ncall searchProducts and check whether suitable products can be found based on the returned is_finding]\nUI001--No suitable product found-->UI002[Subgoal: ask to adjust the demand.\nDescription: inform the user that no products meet the given needs, and ask if the user would like to change product requirements]\nUI002--Do not change requirements-->UI005[Subgoal: apologies to the user\nDescription:tell the user that there are no suitable products and apologize to the user]\nUI001--Suitable product found-->UI003[Subgoal: show goods to the user\nDescription: inform the user of the product according to the returned apartment_message and ask if the user is satisfied with the product]\nUI003--User is satisfied-->UI006[Ask if the user is willing to buy]\nUI006--User will buy-->UI007>Subgoal: add products to the shopping cart\nDescription: call addCart to add the current item to the shopping cart, and inform the user of the results of joining based on the returned add_result]\nUI006--User will not buy-->UI008[Inquiry the user about reasons for unwilling to buy, and collect parameters: is_reason.]\nUI003--User is unsatisfied-->UI004[Subgoal: persuade the user to consider other items\nDescription: inform the user that there are a wide variety of products, which can fully satisfy the user's experience, and ask if they want to know about other products]\nUI004--Yes-->UI001\nUI002--Change requirements-->UI001\nUI004--NO-->UI005\nUI008-->UI004\n\nThe auxiliary parameter information in the workflow is as follows:\n\"is_reason\": \"Reasons for not willing to purchase products\"\n\nThe API tool information in this workflow scenario is as follows:\nAPI: searchProducts\nAPI Desciption: Search for and find suitable products based on product type, brand, lower price limit, upper price limit, etc.\nInput parameters: {\"product_types\": {\"type\": \"string\", \"description\": \"Product type, default is \\\"Snacks\\\"\", \"required\": false}, \"product_names\": {\"type\": \"string\", \"description\": \"Brand, default is \\\"Three Squirrels\\\"\", \"required\": false}, \"lower_limit\": {\"type\": \"string\", \"description\": \"Price floor, value + yuan, default \\\"18 yuan\\\"\", \"required\": false}, \"upper_limit\": {\"type\": \"string\", \"description\": \"Price ceiling, value + yuan, default \\\"31 yuan\\\"\", \"required\": false}}\nOutput parameters: {\"is_finding\": {\"type\": \"string\", \"description\": \"Whether the corresponding product can be found, enumeration type, [not found, can be found]\"}, \"product_messages\": {\"type\": \"object\", \"description\": \"Product information sheet. If the corresponding product is not found, it will be empty.\", \"properties\": {\"product_id\": {\"type\": \"string\", \"description\": \"Product name. For example, \\\"specialty of crispy baked buns slices with fresh milk\\\"\"}, \"product_price\": {\"type\": \"string\", \"description\": \"The actual price of the product, value + yuan, such as \\\"25 yuan\\\"\"}, \"product_positions\": {\"type\": \"string\", \"description\": \"Product origin, such as \\\"Chongqing Inner Mongolia\\\"\"}}}}\n\nAPI: addCart\nAPI Desciption: Put the current item into the shopping cart and return the results\nInput parameters: {\"product_id\": {\"type\": \"string\", \"description\": \"Product name, such as \\\"specialty of crispy baked buns slices with fresh milk\\\"\", \"required\": true}, \"product_num\": {\"type\": \"integer\", \"description\": \"Quantity, positive integer\", \"required\": true}}\nOutput parameters: {\"add_result\": {\"type\": \"string\", \"description\": \"Adding result, enumeration type, [Add successfully, quantity exceeds the limit, the product name is wrong, it is removed from shelves, out of stock]\"}}\n\n\nThe workflow and API information for Scenario 2 is as follows:\nThe workflow pipeline of Shopping cart management is as follows:\nflowchart TD\nFU000(Start)--Cart management-->FU001>Subgoal: confirm the shopping cart Items.\nDescription: call lookShopCart to get the information of the shopping cart, and tell the user the contents of the shopping cart based on the returned value of lookShopCart.]\nFU001-->FU002[Ask the user whether to check out, continue purchasing, or modify the shopping cart?]\nFU002--Modify the shopping cart-->FU003>Subgoal: change the shopping cart.\nDescription: call changeShopCart to modify the shopping cart. If the user wants to modify the model, please delete it and purchase it again.\nThen tell the user the content of the shopping cart based on the returned value of changeShopCart.]\nFU003-->FU002\nFU002--Check out-->FU004>Subgoal: notify the amount.\nDescription: inquiry the user about the product they checked out based on the shopping cart checkout function shopCartCheckout, and call shopCartCheckout after the user confirms twice, and tell the user the amount based on the returned momey]\nFU004-->FU005[Ask the user whether to use account balance or use a link to pay?]\nFU005--Use account balance-->FU006>Subgoal: verify account amount and pay\nDescription: call useAccountbBalance, verify account balance and pay, and check whether it is yes based on the returned pay_result?]\nFU006--Is yes-->FU009\nFU006--Is not yes-->FU007[Tell the user that the balance is insufficient and ask if they want to use the link to pay?]\nFU007 & FU005--Use the link to pay-->FU008>Subgoal: call the link payment function\nDescription: call linkPay to get payment information and payment link, and judge whether the returned result_pay is success?]\nFU008--The result is success-->FU009[Let the user know that the payment is successful and the merchant is being notified.]\nFU008--The result isn't success-->FU010[Inform the user of the payment link based on the returned pay_link and ask the user to pay.]\nFU010-->FU011[Ask the user if the payment is completed?]\nFU011--Payment is completed-->FU008\nFU011 & FU007--Can't pay-->FU012[The user needs to pay within 24 hours. Otherwise, the order will be canceled.]\n\nThe API tool information in this workflow scenario is as follows:\nAPI: lookShopCart\nAPI Desciption: Get the information of the shopping cart based on the user's information\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name is made up of 4 to 12 letters and numbers and can have spaces. If not filled in, the system will access the currently bound account to obtain it.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account is made up of 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will obtain it by accessing the currently bound account.\", \"required\": false}}\nOutput parameters: {\"shopping_cart_content\": {\"type\": \"object\", \"description\": \"Shopping cart content, empty means there are no items\", \"properties\": {\"item_name\": {\"type\": \"string\", \"description\": \"Name of items, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"quantity\": {\"type\": \"integer\", \"description\": \"Quantity, non-negative integer. 0 indicates the status error\"}, \"category\": {\"type\": \"string\", \"description\": \"model, it can be the model size, such as \\\"medium\\\" or \\\"small\\\", and can also be the shoe size, such as \\\"42\\\" and \\\"37\\\". If there is no model, write \\\"\\\".\"}, \"unit_price\": {\"type\": \"string\", \"description\": \"Unit price, max_retry_num = 5¥+nonnegative floating point number, like\\\"¥299.00\\\", \\\"¥137.00\\\"\"}, \"amount\": {\"type\": \"string\", \"description\": \"Amount, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"product_status\": {\"type\": \"string\", \"description\": \"Product status, enumeration type, [out of stock, off the shelf or normal]\"}}}, \"cart_total_amount\": {\"type\": \"string\", \"description\": \"Total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥299.00\\\", \\\"¥137.00\\\"\"}}\n\nAPI: changeShopCart\nAPI Desciption: Modify the shopping cart according to the modification request.\nInput parameters: {\"modification_list\": {\"type\": \"object\", \"description\": \"Modify list, cannot be empty\", \"properties\": {\"item_to_modify\": {\"type\": \"string\", \"description\": \"Product names that need to be modified, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"modified_quantity\": {\"type\": \"integer\", \"description\": \"The modified quantity, a non-negative integer, 0 means deleting the product\"}}, \"required\": true}}\nOutput parameters: {\"shopping_cart_content\": {\"type\": \"object\", \"description\": \"Shopping cart content, empty means there are no items\", \"properties\": {\"item_name\": {\"type\": \"string\", \"description\": \"Item names, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"quantity\": {\"type\": \"integer\", \"description\": \"Quantity, non-negative integer, 0 indicates status error\"}, \"category\": {\"type\": \"string\", \"description\": \"model, it can be the model size, such as \\\"medium\\\" or \\\"small\\\", and can also the shoe size, such as \\\"42\\\" and \\\"37\\\".\"}, \"unit_price\": {\"type\": \"string\", \"description\": \"Unit price, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥299.00\\\", \\\"¥137.00\\\"\"}, \"amount\": {\"type\": \"string\", \"description\": \"Anount, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"product_status\": {\"type\": \"string\", \"description\": \"Product status, enumeration type, [out of stock, off the shelf or normal]\"}}}, \"cart_total_amount\": {\"type\": \"string\", \"description\": \"The total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, such as \\\"¥299.00\\\" and \\\"¥137.00\\\"\"}}\n\nAPI: shopCartCheckout\nAPI Desciption: Inquiry the user for products to check out and submit the order\nInput parameters: {\"submitted_product_name\": {\"type\": \"list\", \"description\": \"Submitted product name cannot be empty\", \"items\": {\"type\": \"string\", \"description\": \"Item names, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"required\": true}}\nOutput parameters: {\"money\": {\"type\": \"string\", \"description\": \"Amount to be paid, max_retry_num = 5¥+nonnegative floating point number, like\\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"order_num\": {\"type\": \"string\", \"description\": \"Payment order number, 32-bit lowercase md5 code, like \\\"8bb52003873ffd66d94057cab0b40f53\\\"\"}}\n\nAPI: useAccountbBalance\nAPI Desciption: Use the account to pay and verify whether it succeeds\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will obtain it by accessing the currently bound account.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account, 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will access the currently bound account to obtain it.\", \"required\": false}}\nOutput parameters: {\"pay_result\": {\"type\": \"string\", \"description\": \"Payment result, enumeration type, [yes, no]\"}}\n\nAPI: linkPay\nAPI Desciption: Link payment, get the payment link\nInput parameters: {\"order_num\": {\"type\": \"string\", \"description\": \"Payment order number, 32-digit lowercase md5 code, such as \\\"8bb52003873ffd66d94057cab0b40f53\\\"\", \"required\": true}}\nOutput parameters: {\"pay_link\": {\"type\": \"string\", \"description\": \"Payment link, URL, e.g. \\\"alipay.cn/s/orderNum=8bb52003873ffd66d94057cab0b40f53\\\", the format is\\\"alipay.cn/s/orderNum=\\\"+order number. If the payment result is success, it will be empty.\"}, \"result_pay\": {\"type\": \"string\", \"description\": \"Payment result, enumeration type, [successes, failure]\"}}\n\n\nThe workflow and API information for Scenario 3 is as follows:\nThe workflow pipeline of Order operations is as follows:\nflowchart TD\nUU000(Start)--Check order-->UU001>Subgoal: get order information.\nDescription: call lookOrders to get brief order information. \nIf the returned account_details shows a normal account, tell the user rough order information based on the returned order_info. \nIf the account is abnormal, let the user provide the correct information.]\nUU001-->UU002[Confirm if the user wants to operate on a certain order?]\nUU002 & UU007--NO-->UU003[Say goodbye to the user]\nUU002 & UU007--Yes-->UU004[Subgoal: confirm the current state of the order.\nDescription: ask the user which order to operate on among returned of lookOrders, and check whether the corresponding current_status is received?]\nUU004--Goods received-->UU005[Tell the user that the product has been received, and ask if the user wants to return it?]\nUU005--Not return the good-->UU006[Explain to the user that received orders can only be returned.]\nUU006 & UU009 & UU010-->UU007[Ask if there are other orders needed to be operated?]\nUU005--Return the good-->UU010\nUU004--Goods not received-->UU008[Confirm whether to check order information, cancel the order, expedite the order, or change the address?]\nUU008--Check order information-->UU009>Subgoal: get order details and inform the user.\nDescription: call viewDetails to obtain detailed order information, and deliver the information to the user based on the returned value of viewDetails.]\nUU008--Not check order information-->UU010>Subgoal: notify the result of the operation\nDescription: call orderOperate to do order operation, and tell the result to the user according to the returned result_order]\n\nThe API tool information in this workflow scenario is as follows:\nAPI: lookOrders\nAPI Desciption: Obtain brief information about orders placed under the user's account based on user information\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will access the currently bound account to obtain it.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account, 6 to 12 letters and numbers, both uppercase and lowercase are available. If left blank, the system will access the currently bound account to obtain it.\", \"required\": false}, \"fetch_scope\": {\"type\": \"list\", \"description\": \"Get the range. If written, it must not be empty. If not written, all will be selected.\", \"items\": {\"type\": \"string\", \"description\": \"Current status, enumeration type, [received, not shipped, in transit]\"}, \"required\": false}}\nOutput parameters: {\"order_info\": {\"type\": \"object\", \"description\": \"Order information, empty means there is no order or the account is abnormal\", \"properties\": {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\"}, \"current_status\": {\"type\": \"string\", \"description\": \"Current status, enumeration type, [received, not shipped, in transit]\"}, \"payment_time\": {\"type\": \"string\", \"description\": \"Payment time, the format is year+month+day+hour+minute+second, for example \\\"November 12, 2044 18:11:03\\\"\"}, \"express_number\": {\"type\": \"string\", \"description\": \"Express number, 8~15 letters + numbers, the format depends on the express company, such as \\\"SF1662397783616\\\", \\\"YT7457447178601\\\"\"}}}, \"account_details\": {\"type\": \"string\", \"description\": \"Account information, enumeration type, [normal account, abnormal account, wrong username]\"}}\n\nAPI: orderOperate\nAPI Desciption: The user operates the order and returns the results\nInput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\", \"required\": true}, \"operation_name\": {\"type\": \"string\", \"description\": \"Operation term, enumeration type, [return, cancel, reminder, modification]\", \"required\": true}, \"modified_address\": {\"type\": \"string\", \"description\": \"The format of the modified address is: province city + district county + road name + road number + community name + building number + floor number + household number, for example, \\\"Floor 15, Building 1, Fenghua Community, No. 22 Gucheng Road, Xi'an County, Shaanxi Province. The modification is needed.\", \"required\": false}, \"modified_recipient\": {\"type\": \"string\", \"description\": \"The modified recipient, such as \\\"Mike Din\\\", the modification should be written\", \"required\": false}, \"modified_phone_number\": {\"type\": \"string\", \"description\": \"The modified mobile phone number, a 13-digit mobile phone number, such as \\\"13801234567\\\", the modification must be written\", \"required\": false}}\nOutput parameters: {\"result_order\": {\"type\": \"string\", \"description\": \"As a result, if the goods have been received for return, it will be \\\"the merchant is reviewing\\\"; if the goods have not been received for a return or the address is modified, it will be \\\"intercepting, and the interception may fail\\\"; if it is canceled, it will be \\\"canceled\\\"; if it is an order expedited, it is \\\"has been expedited\\\".\"}}\n\nAPI: viewDetails\nAPI Desciption: Get order details\nInput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\", \"required\": true}}\nOutput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\", empty means the order number is wrong\"}, \"item_name\": {\"type\": \"string\", \"description\": \"Name of items, such as \\\"Purple Dress\\\", and \\\"Nestle Coffee\\\", empty means the order number is wrong\"}, \"progress_info\": {\"type\": \"object\", \"description\": \"Progress information, empty means the order number is wrong\", \"properties\": {\"change_time\": {\"type\": \"string\", \"description\": \"Change the time, in the format of year+month+day+hour+minute+second, such as \\\"November 12, 2044 18:11:03\\\"\"}, \"change_reason\": {\"type\": \"string\", \"description\": \"Reason for change, such as \\\"Order has been created\\\", \\\"Order has been paid\\\", \\\"Merchant has shipped\\\", \\\"Courier company has picked up\\\", \\\"Arrived at XXX location\\\", \\\"Sending from XXX to XXX\\\", \\\"Goods are being delivered\\\", \\\"The goods have been delivered\\\", \\\"The goods have been signed for\\\"\"}, \"express_number\": {\"type\": \"string\", \"description\": \"Express number, 8~15 letters + numbers, the format depends on the express company, such as \\\"SF1662397783616\\\", \\\"YT7457447178601\\\"\"}}}, \"is_order_correct\": {\"type\": \"string\", \"description\": \"Whether the order is correct, enumeration type, [correct order, wrong order]\"}}\n\n\nPlease note that the current time is:2030-11-22 15:19:50 Friday", "gt_actions_string": "Action: searchProducts\nAction Input: {\"product_types\": \"tent, backpack\", \"product_names\": \"\", \"lower_limit\": \"0 yuan\", \"upper_limit\": \"500 yuan\"}\n\nAction: addCart\nAction Input: {\"product_id\": \"high durability waterproof tent, large capacity multi-compartment backpack\", \"product_num\": 1}\n\nAction: shopCartCheckout\nAction Input: {\"submitted_product_name\": [\"high durability waterproof tent\", \"large capacity multi-compartment backpack\"]}\n\nAction: linkPay\nAction Input: {\"order_num\": \"8bb52003873ffd66d94057cab0b40f53\"}", "user_target": "User profile and background: The user is planning a camping trip and is in search of durable outdoor equipment that can withstand various weather conditions. The user shows interest in finding specific items such as a tent and a backpack, with a focus on durability and sustainability. They are also concerned about the order tracking and expedited shipping options after making a purchase.\n\nDialogue task scenario description: The dialogue encompasses a comprehensive interaction that starts with the user's request for product recommendations (Search for products) and transitions through the process of adding selected items to the shopping cart, proceeding to checkout, and finally addressing inquiries regarding order tracking and expedited shipping (Shopping cart management & Order operations).\n\nUser goals and demands:\n1. **Find durable and sustainable outdoor equipment**: The user is looking for a tent and a backpack suitable for camping, with each item priced under $500.\n - **API parameters needed**: \n - `product_types`: \"tent, backpack\"\n - `lower_limit`: \"0 yuan\"\n - `upper_limit`: \"500 yuan\"\n2. **Complete the purchase of selected items**: After finding satisfactory products, the user decides to buy both the tent and the backpack.\n - **API parameters needed**:\n - `product_id`: \"high durability waterproof tent, large capacity multi-compartment backpack\"\n - `product_num`: 1\n3. **Understand and utilize the order tracking and expedited shipping options**: Post-purchase, the user wishes to learn how to track their order and inquire about the possibility of expedited shipping.\n - **API parameters potentially needed for future actions**:\n - Not directly mentioned, but would involve interactions with APIs related to order tracking and modifications for expedited shipping, such as `lookOrders` and `orderOperate` with possible operations like `reminder` (for expedited shipping) and viewing detailed order information through `viewDetails`.", "apis": "[{'apiCode': 'lookOrders', 'description': \"Obtain brief information about orders placed under the user's account based on user information\", 'parameters': {'username': {'type': 'string', 'description': 'User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will access the currently bound account to obtain it.', 'required': False}, 'user_account': {'type': 'string', 'description': 'User account, 6 to 12 letters and numbers, both uppercase and lowercase are available. If left blank, the system will access the currently bound account to obtain it.', 'required': False}, 'fetch_scope': {'type': 'list', 'description': 'Get the range. If written, it must not be empty. If not written, all will be selected.', 'items': {'type': 'string', 'description': 'Current status, enumeration type, [received, not shipped, in transit]'}, 'required': False}}, 'response': {'data': {'type': 'object', 'description': '返回数据', 'properties': {'order_info': {'type': 'object', 'description': 'Order information, empty means there is no order or the account is abnormal', 'properties': {'order_id': {'type': 'string', 'description': 'Order number, 3 letters + 12 numbers, such as \"ABC012345678901\"'}, 'current_status': {'type': 'string', 'description': 'Current status, enumeration type, [received, not shipped, in transit]'}, 'payment_time': {'type': 'string', 'description': 'Payment time, the format is year+month+day+hour+minute+second, for example \"November 12, 2044 18:11:03\"'}, 'express_number': {'type': 'string', 'description': 'Express number, 8~15 letters + numbers, the format depends on the express company, such as \"SF1662397783616\", \"YT7457447178601\"'}}}, 'account_details': {'type': 'string', 'description': 'Account information, enumeration type, [normal account, abnormal account, wrong username]'}}}}}, {'apiCode': 'orderOperate', 'description': 'The user operates the order and returns the results', 'parameters': {'order_id': {'type': 'string', 'description': 'Order number, 3 letters + 12 numbers, such as \"ABC012345678901\"', 'required': True}, 'operation_name': {'type': 'string', 'description': 'Operation term, enumeration type, [return, cancel, reminder, modification]', 'required': True}, 'modified_address': {'type': 'string', 'description': 'The format of the modified address is: province city + district county + road name + road number + community name + building number + floor number + household number, for example, \"Floor 15, Building 1, Fenghua Community, No. 22 Gucheng Road, Xi\\'an County, Shaanxi Province. The modification is needed.', 'required': False}, 'modified_recipient': {'type': 'string', 'description': 'The modified recipient, such as \"Mike Din\", the modification should be written', 'required': False}, 'modified_phone_number': {'type': 'string', 'description': 'The modified mobile phone number, a 13-digit mobile phone number, such as \"13801234567\", the modification must be written', 'required': False}}, 'response': {'data': {'type': 'object', 'description': '返回数据', 'properties': {'result_order': {'type': 'string', 'description': 'As a result, if the goods have been received for return, it will be \"the merchant is reviewing\"; if the goods have not been received for a return or the address is modified, it will be \"intercepting, and the interception may fail\"; if it is canceled, it will be \"canceled\"; if it is an order expedited, it is \"has been expedited\".'}}}}}, {'apiCode': 'viewDetails', 'description': 'Get order details', 'parameters': {'order_id': {'type': 'string', 'description': 'Order number, 3 letters + 12 numbers, such as \"ABC012345678901\"', 'required': True}}, 'response': {'data': {'type': 'object', 'description': '返回数据', 'properties': {'order_id': {'type': 'string', 'description': 'Order number, 3 letters + 12 numbers, such as \"ABC012345678901\", empty means the order number is wrong'}, 'item_name': {'type': 'string', 'description': 'Name of items, such as \"Purple Dress\", and \"Nestle Coffee\", empty means the order number is wrong'}, 'progress_info': {'type': 'object', 'description': 'Progress information, empty means the order number is wrong', 'properties': {'change_time': {'type': 'string', 'description': 'Change the time, in the format of year+month+day+hour+minute+second, such as \"November 12, 2044 18:11:03\"'}, 'change_reason': {'type': 'string', 'description': 'Reason for change, such as \"Order has been created\", \"Order has been paid\", \"Merchant has shipped\", \"Courier company has picked up\", \"Arrived at XXX location\", \"Sending from XXX to XXX\", \"Goods are being delivered\", \"The goods have been delivered\", \"The goods have been signed for\"'}, 'express_number': {'type': 'string', 'description': 'Express number, 8~15 letters + numbers, the format depends on the express company, such as \"SF1662397783616\", \"YT7457447178601\"'}}}, 'is_order_correct': {'type': 'string', 'description': 'Whether the order is correct, enumeration type, [correct order, wrong order]'}}}}}]", "function_agent": {"functions_with_response": [{"name": "searchProducts", "description": "Search for and find suitable products based on product type, brand, lower price limit, upper price limit, etc.", "parameters": {"type": "object", "properties": {"product_types": {"type": "string", "description": "Product type, default is \"Snacks\""}, "product_names": {"type": "string", "description": "Brand, default is \"Three Squirrels\""}, "lower_limit": {"type": "string", "description": "Price floor, value + yuan, default \"18 yuan\""}, "upper_limit": {"type": "string", "description": "Price ceiling, value + yuan, default \"31 yuan\""}}, "required": []}, "response": {"properties": {"is_finding": {"type": "string", "description": "Whether the corresponding product can be found, enumeration type, [not found, can be found]"}, "product_messages": {"type": "object", "description": "Product information sheet. If the corresponding product is not found, it will be empty.", "properties": {"product_id": {"type": "string", "description": "Product name. For example, \"specialty of crispy baked buns slices with fresh milk\""}, "product_price": {"type": "string", "description": "The actual price of the product, value + yuan, such as \"25 yuan\""}, "product_positions": {"type": "string", "description": "Product origin, such as \"Chongqing Inner Mongolia\""}}, "required": []}}, "type": "object", "required": []}}, {"name": "addCart", "description": "Put the current item into the shopping cart and return the results", "parameters": {"type": "object", "properties": {"product_id": {"type": "string", "description": "Product name, such as \"specialty of crispy baked buns slices with fresh milk\""}, "product_num": {"type": "integer", "description": "Quantity, positive integer"}}, "required": ["product_id", "product_num"]}, "response": {"properties": {"add_result": {"type": "string", "description": "Adding result, enumeration type, [Add successfully, quantity exceeds the limit, the product name is wrong, it is removed from shelves, out of stock]"}}, "type": "object", "required": []}}, {"name": "lookShopCart", "description": "Get the information of the shopping cart based on the user's information", "parameters": {"type": "object", "properties": {"username": {"type": "string", "description": "User name is made up of 4 to 12 letters and numbers and can have spaces. If not filled in, the system will access the currently bound account to obtain it."}, "user_account": {"type": "string", "description": "User account is made up of 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will obtain it by accessing the currently bound account."}}, "required": []}, "response": {"properties": {"shopping_cart_content": {"type": "object", "description": "Shopping cart content, empty means there are no items", "properties": {"item_name": {"type": "string", "description": "Name of items, such as \"Purple Dress\" and \"Nescafé\""}, "quantity": {"type": "integer", "description": "Quantity, non-negative integer. 0 indicates the status error"}, "category": {"type": "string", "description": "model, it can be the model size, such as \"medium\" or \"small\", and can also be the shoe size, such as \"42\" and \"37\". If there is no model, write \"\"."}, "unit_price": {"type": "string", "description": "Unit price, max_retry_num = 5¥+nonnegative floating point number, like\"¥299.00\", \"¥137.00\""}, "amount": {"type": "string", "description": "Amount, max_retry_num = 5¥+nonnegative floating point number, like \"¥3299.00\", \"¥837.00\""}, "product_status": {"type": "string", "description": "Product status, enumeration type, [out of stock, off the shelf or normal]"}}, "required": []}, "cart_total_amount": {"type": "string", "description": "Total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, like \"¥299.00\", \"¥137.00\""}}, "type": "object", "required": []}}, {"name": "changeShopCart", "description": "Modify the shopping cart according to the modification request.", "parameters": {"type": "object", "properties": {"modification_list": {"type": "object", "description": "Modify list, cannot be empty", "properties": {"item_to_modify": {"type": "string", "description": "Product names that need to be modified, such as \"Purple Dress\" and \"Nescafé\""}, "modified_quantity": {"type": "integer", "description": "The modified quantity, a non-negative integer, 0 means deleting the product"}}, "required": []}}, "required": ["modification_list"]}, "response": {"properties": {"shopping_cart_content": {"type": "object", "description": "Shopping cart content, empty means there are no items", "properties": {"item_name": {"type": "string", "description": "Item names, such as \"Purple Dress\" and \"Nescafé\""}, "quantity": {"type": "integer", "description": "Quantity, non-negative integer, 0 indicates status error"}, "category": {"type": "string", "description": "model, it can be the model size, such as \"medium\" or \"small\", and can also the shoe size, such as \"42\" and \"37\"."}, "unit_price": {"type": "string", "description": "Unit price, max_retry_num = 5¥+nonnegative floating point number, like \"¥299.00\", \"¥137.00\""}, "amount": {"type": "string", "description": "Anount, max_retry_num = 5¥+nonnegative floating point number, like \"¥3299.00\", \"¥837.00\""}, "product_status": {"type": "string", "description": "Product status, enumeration type, [out of stock, off the shelf or normal]"}}, "required": []}, "cart_total_amount": {"type": "string", "description": "The total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, such as \"¥299.00\" and \"¥137.00\""}}, "type": "object", "required": []}}, {"name": "shopCartCheckout", "description": "Inquiry the user for products to check out and submit the order", "parameters": {"type": "object", "properties": {"submitted_product_name": {"type": "array", "description": "Submitted product name cannot be empty", "items": {"type": "string", "description": "Item names, such as \"Purple Dress\" and \"Nescafé\""}}}, "required": ["submitted_product_name"]}, "response": {"properties": {"money": {"type": "string", "description": "Amount to be paid, max_retry_num = 5¥+nonnegative floating point number, like\"¥3299.00\", \"¥837.00\""}, "order_num": {"type": "string", "description": "Payment order number, 32-bit lowercase md5 code, like \"8bb52003873ffd66d94057cab0b40f53\""}}, "type": "object", "required": []}}, {"name": "useAccountbBalance", "description": "Use the account to pay and verify whether it succeeds", "parameters": {"type": "object", "properties": {"username": {"type": "string", "description": "User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will obtain it by accessing the currently bound account."}, "user_account": {"type": "string", "description": "User account, 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will access the currently bound account to obtain it."}}, "required": []}, "response": {"properties": {"pay_result": {"type": "string", "description": "Payment result, enumeration type, [yes, no]"}}, "type": "object", "required": []}}, {"name": "linkPay", "description": "Link payment, get the payment link", "parameters": {"type": "object", "properties": {"order_num": {"type": "string", "description": "Payment order number, 32-digit lowercase md5 code, such as \"8bb52003873ffd66d94057cab0b40f53\""}}, "required": ["order_num"]}, "response": {"properties": {"pay_link": {"type": "string", "description": "Payment link, URL, e.g. \"alipay.cn/s/orderNum=8bb52003873ffd66d94057cab0b40f53\", the format is\"alipay.cn/s/orderNum=\"+order number. If the payment result is success, it will be empty."}, "result_pay": {"type": "string", "description": "Payment result, enumeration type, [successes, failure]"}}, "type": "object", "required": []}}, {"name": "lookOrders", "description": "Obtain brief information about orders placed under the user's account based on user information", "parameters": {"type": "object", "properties": {"username": {"type": "string", "description": "User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will access the currently bound account to obtain it."}, "user_account": {"type": "string", "description": "User account, 6 to 12 letters and numbers, both uppercase and lowercase are available. If left blank, the system will access the currently bound account to obtain it."}, "fetch_scope": {"type": "array", "description": "Get the range. If written, it must not be empty. If not written, all will be selected.", "items": {"type": "string", "description": "Current status, enumeration type, [received, not shipped, in transit]"}}}, "required": []}, "response": {"properties": {"order_info": {"type": "object", "description": "Order information, empty means there is no order or the account is abnormal", "properties": {"order_id": {"type": "string", "description": "Order number, 3 letters + 12 numbers, such as \"ABC012345678901\""}, "current_status": {"type": "string", "description": "Current status, enumeration type, [received, not shipped, in transit]"}, "payment_time": {"type": "string", "description": "Payment time, the format is year+month+day+hour+minute+second, for example \"November 12, 2044 18:11:03\""}, "express_number": {"type": "string", "description": "Express number, 8~15 letters + numbers, the format depends on the express company, such as \"SF1662397783616\", \"YT7457447178601\""}}, "required": []}, "account_details": {"type": "string", "description": "Account information, enumeration type, [normal account, abnormal account, wrong username]"}}, "type": "object", "required": []}}, {"name": "orderOperate", "description": "The user operates the order and returns the results", "parameters": {"type": "object", "properties": {"order_id": {"type": "string", "description": "Order number, 3 letters + 12 numbers, such as \"ABC012345678901\""}, "operation_name": {"type": "string", "description": "Operation term, enumeration type, [return, cancel, reminder, modification]"}, "modified_address": {"type": "string", "description": "The format of the modified address is: province city + district county + road name + road number + community name + building number + floor number + household number, for example, \"Floor 15, Building 1, Fenghua Community, No. 22 Gucheng Road, Xi'an County, Shaanxi Province. The modification is needed."}, "modified_recipient": {"type": "string", "description": "The modified recipient, such as \"Mike Din\", the modification should be written"}, "modified_phone_number": {"type": "string", "description": "The modified mobile phone number, a 13-digit mobile phone number, such as \"13801234567\", the modification must be written"}}, "required": ["order_id", "operation_name"]}, "response": {"properties": {"result_order": {"type": "string", "description": "As a result, if the goods have been received for return, it will be \"the merchant is reviewing\"; if the goods have not been received for a return or the address is modified, it will be \"intercepting, and the interception may fail\"; if it is canceled, it will be \"canceled\"; if it is an order expedited, it is \"has been expedited\"."}}, "type": "object", "required": []}}, {"name": "viewDetails", "description": "Get order details", "parameters": {"type": "object", "properties": {"order_id": {"type": "string", "description": "Order number, 3 letters + 12 numbers, such as \"ABC012345678901\""}}, "required": ["order_id"]}, "response": {"properties": {"order_id": {"type": "string", "description": "Order number, 3 letters + 12 numbers, such as \"ABC012345678901\", empty means the order number is wrong"}, "item_name": {"type": "string", "description": "Name of items, such as \"Purple Dress\", and \"Nestle Coffee\", empty means the order number is wrong"}, "progress_info": {"type": "object", "description": "Progress information, empty means the order number is wrong", "properties": {"change_time": {"type": "string", "description": "Change the time, in the format of year+month+day+hour+minute+second, such as \"November 12, 2044 18:11:03\""}, "change_reason": {"type": "string", "description": "Reason for change, such as \"Order has been created\", \"Order has been paid\", \"Merchant has shipped\", \"Courier company has picked up\", \"Arrived at XXX location\", \"Sending from XXX to XXX\", \"Goods are being delivered\", \"The goods have been delivered\", \"The goods have been signed for\""}, "express_number": {"type": "string", "description": "Express number, 8~15 letters + numbers, the format depends on the express company, such as \"SF1662397783616\", \"YT7457447178601\""}}, "required": []}, "is_order_correct": {"type": "string", "description": "Whether the order is correct, enumeration type, [correct order, wrong order]"}}, "type": "object", "required": []}}], "function_prompt": "Action: searchProducts\nAction Input: {\"product_types\": \"tent, backpack\", \"product_names\": \"\", \"lower_limit\": \"0 yuan\", \"upper_limit\": \"500 yuan\"}\nfunction:\n{\"is_finding\":\"can be found\",\"product_messages\":[{\"product_id\":\"high durability waterproof tent\",\"product_price\":\"450 yuan\",\"product_positions\":\"Chongqing\"}\n\nAction: addCart\nAction Input: {\"product_id\": \"high durability waterproof tent, large capacity multi-compartment backpack\", \"product_num\": 1}\nfunction:\n{\"add_result\": \"Add successfully\"}\n\nAction: shopCartCheckout\nAction Input: {\"submitted_product_name\": [\"high durability waterproof tent\", \"large capacity multi-compartment backpack\"]}\nfunction:\n{\"money\":\"¥800.00\",\"order_num\":\"8bb52003873ffd66d94057cab0b40f53\"}\n\nAction: linkPay\nAction Input: {\"order_num\": \"8bb52003873ffd66d94057cab0b40f53\"}\nfunction:\n{\"pay_link\":\"alipay.cn/s/orderNum=8bb52003873ffd66d94057cab0b40f53\",\"result_pay\":\"success\"}"}, "assistant_agent": {"assistant_prompt": "You are a helpful assistant for the task of Agent of online shopping mall. Your mission involves 3 scenario of Search for products & Shopping cart management & Order operations.\nThe scenario 1 is to Search for products where User wants you to recommend products to them, and you need to do this based on their needs.\nThe scenario 2 is to Shopping cart management where User wants to manage their shopping cart and may change checkout and shopping cart contents. You need to ask for the user's intent and assist them in managing their cart.\nThe scenario 3 is to Order operations where User wants to know about the orders placed under his/her account, and may learn more about or operate an order in detail. You need to help him with operations such as order inquiries or modifications.\n\n\nSpecific requirements:\n1. You need to act as an assistant and engage in a conversation with the user, following the business process and API information described in Markdown syntax.\n2. You have been provided with the flowchart information for different scenarios under a specific role. In each workflow chart below, the nodes represent the actions you need to take at each step, with each node containing a sub-command for you to follow in responding to users or performing certain actions. Node types include response nodes, decision nodes, and API call nodes. The edges in the process define the various transition conditions you need to evaluate, including types such as user intent assessment conditions, parameter and function return value conditions, etc. You need to decide whether to transition to the next node based on these conditions.\n3. Within the complex node with a sub-goal, you need to complete the sub-goal by following the node description, If a conversation is needed to complete the goal, strictly follow the content described in the node. If a specified tool is needed, strictly ask for the necessary parameters according to the tool's definition to complete the API call.\n4. You can only answer questions within the scope of the given several workflow processes. If the user asks a question beyond these scopes, please apologize and explain to the user in the response part.\n5. When asking for API input parameters, ensure that the provided parameter values comply with the specified format regarding both he correctness of the format and the completeness of the content. Do not assign values arbitrarily. In instances where the parameters do not meet the format requirements, notify users to make the necessary adjustments until the requirements are satisfied.\n6. When the user has multiple requests at the same time, please select one appropriate request for processing first and inform the user that other requests will be resolved subsequently. If there is unfinished business in the previous conversation, continue to provide the necessary help and guidance to assist them in completing the business process. When multiple APIs need to be called, do so in separate rounds, with a maximum of one API call output per round. When the user indicates that the business is finished or says goodbye, respond politely and end the conversation.\n7. Your output format should be chosen from one of the two templates below (7.1 and 7.2):\n7.1 If you need to interact with the user:\n```\nThought: xxx (description of your thought process )\nResponse: xxx (the content you need to inquire or reply)\n```\n[Format Explanation]\n(1) Thought includes four pieces of information: [Step: Analyze the previous node]: 'The previous node last_node=xxx'. [Step: Analyze the current intent]: 'The current intent intent=xxx'. [Step: Analyze the current node]: 'The current node current_node=xxx'. [Step: Decide the follow-up actions]: 'Next, I need to xxx.'\n(2) The previous node 'last_node' refers to the node or nodes immediately preceding the current node under consideration in the process. The current intent 'intent' represents the user's intent or condition that links the current node with the previous node. The current node 'current_node' refers to the next node pointed to by the previous node through transition conditions. The current node may be in a different scenario flowchart than the previous node (that is, a cross-scenario process jump occurs)\n(3) When the current node 'current_node' involves calling an API, 'Thought' should add three extra pieces of information after [Step: Analyze the next node]. The first is [Step: Clarify the required parameters for the API] The mandatory parameters for xxx are: xxx, optional parameters are: xxx. The second is [Step: Extract the API parameter values] From the context, the known values for the xxx parameters and their values are: x=xxx. The third is [Step: Clarify the missing information for the API] The mandatory parameters still missing for xxx are: xxx.\n(4) When the current node 'current_node' includes the collection of parameters, 'Thought' should add one of the following extra pieces of information after [Step: Analyze the next node]. The first is [Step: Clarify missing parameters] Lacking the specific values for the parameter xxx. The second is [Step: Clarify parameter values] From the context, the values for the parameter xxx=xxx are known.\n7.2 If you need to call an API (only one API call per time):\n```\nThought: xxx (description of your thought process )\nAction: xxx (the function name to be called, do not add the prefix \"functions.\")\nAction Input: xxx (the parameters for the function, must be in strict JSON format)\n```\n[Format Explanation]\n(1) 'Thought' includes the information described in sections (1) and (3) of 7.1, totaling seven [Step: xxx] pieces of information.\n(2) In template 7.2, do not output 'Response', only output 'Thought', 'Action', and 'Action Input'.\n8. Please remember to add \"The previous node last_node=xxx, the current intent intent=xxx, the current node current_node=xxx\" at the beginning of the 'Thought'. Below are examples of Thoughts for regular nodes, parameter inquiry nodes, and API invocation nodes respectively. Do not output '[Step: xxx]'; instead, format the output as follows: \n```\nThought: The previous node last_node=PN001, the current intent intent=ETC application, the current node current_node=PN002. Next, I need to ask the user which type of vehicle's ETC it is. \n```\nThought: The previous node last_node=PN003, the current intent intent=not needed, the current node current_node=PN004. Lacking the specific value for the parameter reason. Next, I need to inquire about the reason parameter. \n```\nThought: The previous node last_node=PN005, the current intent intent=ETC consumption inquiry, the current node current_node=PN006. The mandatory parameters for getETCRecord include: license plate number, date, with no optional parameters. From the context, it is known the parameters for getETCRecord and their values: license plate number=ABG0375, the mandatory parameters still missing for getETCRecord are: date. Next, I need to ask for the date, then call getETCRecord.\n```\nTransitions across flowcharts may occur, leading to changes in the prefix of nodes.\n```\nThought: The previous node last_node=PN006, the current intent intent=ETC charging, the current node current_node=WI001. (scenario transfer). Next, I need to ask the user how much to charge. \n```\n9. When the user asks questions outside the scope of the business process and API information, the current intent and current node should be indicated as UNK. For example, in the Thought: \n```\nThought: The previous node last_node=PN007, the current intent intent=UNK, the current node current_node=UNK. Next, I need to tell the user that I am unclear about xxx.\n```\n10. When multiple possible intents exist, connect them with ' OR ' and clarify which situation is being inquired about; when expressing multiple intents, connect them with ' AND '. These methods of connection also apply to nodes.\n11. The current intent 'intent' can only be the description on the edges of the given flowchart or UNK; the previous node 'last_node' can only be the nodes on the flowchart, indicating the last triggered node; the reply content must include 'Thought'.\n12. Nodes that include keywords such as [Subgoal, Subtask, Step, Process] should be broken down into multiple steps, and users must be replied to step by step, do not output all the content of a node at once. In each round of dialogue, inquire only about one step, and after confirming that the user has completed that step, continue to ask the next step.\n\nPlease adhere strictly to the defined output format. Do not output the placeholder \"....\" verbatim; instead, fill it with the corresponding content.\n\nYour capabilities are based on the following workflows and API information:\n\n\nThe workflow and API information for Scenario 1 is as follows:\nThe workflow pipeline of Search for products is as follows:\nflowchart TD\nUI000(Start)--Search for products-->UI001>Subgoal: confirm needs and search for products.\nDescription: inquire the user about product type, brand, price lower limit, price upper limit, and other product needs, \ncall searchProducts and check whether suitable products can be found based on the returned is_finding]\nUI001--No suitable product found-->UI002[Subgoal: ask to adjust the demand.\nDescription: inform the user that no products meet the given needs, and ask if the user would like to change product requirements]\nUI002--Do not change requirements-->UI005[Subgoal: apologies to the user\nDescription:tell the user that there are no suitable products and apologize to the user]\nUI001--Suitable product found-->UI003[Subgoal: show goods to the user\nDescription: inform the user of the product according to the returned apartment_message and ask if the user is satisfied with the product]\nUI003--User is satisfied-->UI006[Ask if the user is willing to buy]\nUI006--User will buy-->UI007>Subgoal: add products to the shopping cart\nDescription: call addCart to add the current item to the shopping cart, and inform the user of the results of joining based on the returned add_result]\nUI006--User will not buy-->UI008[Inquiry the user about reasons for unwilling to buy, and collect parameters: is_reason.]\nUI003--User is unsatisfied-->UI004[Subgoal: persuade the user to consider other items\nDescription: inform the user that there are a wide variety of products, which can fully satisfy the user's experience, and ask if they want to know about other products]\nUI004--Yes-->UI001\nUI002--Change requirements-->UI001\nUI004--NO-->UI005\nUI008-->UI004\n\nThe auxiliary parameter information in the workflow is as follows:\n\"is_reason\": \"Reasons for not willing to purchase products\"\n\nThe API tool information in this workflow scenario is as follows:\nAPI: searchProducts\nAPI Desciption: Search for and find suitable products based on product type, brand, lower price limit, upper price limit, etc.\nInput parameters: {\"product_types\": {\"type\": \"string\", \"description\": \"Product type, default is \\\"Snacks\\\"\", \"required\": false}, \"product_names\": {\"type\": \"string\", \"description\": \"Brand, default is \\\"Three Squirrels\\\"\", \"required\": false}, \"lower_limit\": {\"type\": \"string\", \"description\": \"Price floor, value + yuan, default \\\"18 yuan\\\"\", \"required\": false}, \"upper_limit\": {\"type\": \"string\", \"description\": \"Price ceiling, value + yuan, default \\\"31 yuan\\\"\", \"required\": false}}\nOutput parameters: {\"is_finding\": {\"type\": \"string\", \"description\": \"Whether the corresponding product can be found, enumeration type, [not found, can be found]\"}, \"product_messages\": {\"type\": \"object\", \"description\": \"Product information sheet. If the corresponding product is not found, it will be empty.\", \"properties\": {\"product_id\": {\"type\": \"string\", \"description\": \"Product name. For example, \\\"specialty of crispy baked buns slices with fresh milk\\\"\"}, \"product_price\": {\"type\": \"string\", \"description\": \"The actual price of the product, value + yuan, such as \\\"25 yuan\\\"\"}, \"product_positions\": {\"type\": \"string\", \"description\": \"Product origin, such as \\\"Chongqing Inner Mongolia\\\"\"}}}}\n\nAPI: addCart\nAPI Desciption: Put the current item into the shopping cart and return the results\nInput parameters: {\"product_id\": {\"type\": \"string\", \"description\": \"Product name, such as \\\"specialty of crispy baked buns slices with fresh milk\\\"\", \"required\": true}, \"product_num\": {\"type\": \"integer\", \"description\": \"Quantity, positive integer\", \"required\": true}}\nOutput parameters: {\"add_result\": {\"type\": \"string\", \"description\": \"Adding result, enumeration type, [Add successfully, quantity exceeds the limit, the product name is wrong, it is removed from shelves, out of stock]\"}}\n\n\nThe workflow and API information for Scenario 2 is as follows:\nThe workflow pipeline of Shopping cart management is as follows:\nflowchart TD\nFU000(Start)--Cart management-->FU001>Subgoal: confirm the shopping cart Items.\nDescription: call lookShopCart to get the information of the shopping cart, and tell the user the contents of the shopping cart based on the returned value of lookShopCart.]\nFU001-->FU002[Ask the user whether to check out, continue purchasing, or modify the shopping cart?]\nFU002--Modify the shopping cart-->FU003>Subgoal: change the shopping cart.\nDescription: call changeShopCart to modify the shopping cart. If the user wants to modify the model, please delete it and purchase it again.\nThen tell the user the content of the shopping cart based on the returned value of changeShopCart.]\nFU003-->FU002\nFU002--Check out-->FU004>Subgoal: notify the amount.\nDescription: inquiry the user about the product they checked out based on the shopping cart checkout function shopCartCheckout, and call shopCartCheckout after the user confirms twice, and tell the user the amount based on the returned momey]\nFU004-->FU005[Ask the user whether to use account balance or use a link to pay?]\nFU005--Use account balance-->FU006>Subgoal: verify account amount and pay\nDescription: call useAccountbBalance, verify account balance and pay, and check whether it is yes based on the returned pay_result?]\nFU006--Is yes-->FU009\nFU006--Is not yes-->FU007[Tell the user that the balance is insufficient and ask if they want to use the link to pay?]\nFU007 & FU005--Use the link to pay-->FU008>Subgoal: call the link payment function\nDescription: call linkPay to get payment information and payment link, and judge whether the returned result_pay is success?]\nFU008--The result is success-->FU009[Let the user know that the payment is successful and the merchant is being notified.]\nFU008--The result isn't success-->FU010[Inform the user of the payment link based on the returned pay_link and ask the user to pay.]\nFU010-->FU011[Ask the user if the payment is completed?]\nFU011--Payment is completed-->FU008\nFU011 & FU007--Can't pay-->FU012[The user needs to pay within 24 hours. Otherwise, the order will be canceled.]\n\nThe API tool information in this workflow scenario is as follows:\nAPI: lookShopCart\nAPI Desciption: Get the information of the shopping cart based on the user's information\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name is made up of 4 to 12 letters and numbers and can have spaces. If not filled in, the system will access the currently bound account to obtain it.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account is made up of 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will obtain it by accessing the currently bound account.\", \"required\": false}}\nOutput parameters: {\"shopping_cart_content\": {\"type\": \"object\", \"description\": \"Shopping cart content, empty means there are no items\", \"properties\": {\"item_name\": {\"type\": \"string\", \"description\": \"Name of items, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"quantity\": {\"type\": \"integer\", \"description\": \"Quantity, non-negative integer. 0 indicates the status error\"}, \"category\": {\"type\": \"string\", \"description\": \"model, it can be the model size, such as \\\"medium\\\" or \\\"small\\\", and can also be the shoe size, such as \\\"42\\\" and \\\"37\\\". If there is no model, write \\\"\\\".\"}, \"unit_price\": {\"type\": \"string\", \"description\": \"Unit price, max_retry_num = 5¥+nonnegative floating point number, like\\\"¥299.00\\\", \\\"¥137.00\\\"\"}, \"amount\": {\"type\": \"string\", \"description\": \"Amount, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"product_status\": {\"type\": \"string\", \"description\": \"Product status, enumeration type, [out of stock, off the shelf or normal]\"}}}, \"cart_total_amount\": {\"type\": \"string\", \"description\": \"Total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥299.00\\\", \\\"¥137.00\\\"\"}}\n\nAPI: changeShopCart\nAPI Desciption: Modify the shopping cart according to the modification request.\nInput parameters: {\"modification_list\": {\"type\": \"object\", \"description\": \"Modify list, cannot be empty\", \"properties\": {\"item_to_modify\": {\"type\": \"string\", \"description\": \"Product names that need to be modified, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"modified_quantity\": {\"type\": \"integer\", \"description\": \"The modified quantity, a non-negative integer, 0 means deleting the product\"}}, \"required\": true}}\nOutput parameters: {\"shopping_cart_content\": {\"type\": \"object\", \"description\": \"Shopping cart content, empty means there are no items\", \"properties\": {\"item_name\": {\"type\": \"string\", \"description\": \"Item names, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"quantity\": {\"type\": \"integer\", \"description\": \"Quantity, non-negative integer, 0 indicates status error\"}, \"category\": {\"type\": \"string\", \"description\": \"model, it can be the model size, such as \\\"medium\\\" or \\\"small\\\", and can also the shoe size, such as \\\"42\\\" and \\\"37\\\".\"}, \"unit_price\": {\"type\": \"string\", \"description\": \"Unit price, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥299.00\\\", \\\"¥137.00\\\"\"}, \"amount\": {\"type\": \"string\", \"description\": \"Anount, max_retry_num = 5¥+nonnegative floating point number, like \\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"product_status\": {\"type\": \"string\", \"description\": \"Product status, enumeration type, [out of stock, off the shelf or normal]\"}}}, \"cart_total_amount\": {\"type\": \"string\", \"description\": \"The total amount in the shopping cart, max_retry_num = 5¥+nonnegative floating point number, such as \\\"¥299.00\\\" and \\\"¥137.00\\\"\"}}\n\nAPI: shopCartCheckout\nAPI Desciption: Inquiry the user for products to check out and submit the order\nInput parameters: {\"submitted_product_name\": {\"type\": \"list\", \"description\": \"Submitted product name cannot be empty\", \"items\": {\"type\": \"string\", \"description\": \"Item names, such as \\\"Purple Dress\\\" and \\\"Nescafé\\\"\"}, \"required\": true}}\nOutput parameters: {\"money\": {\"type\": \"string\", \"description\": \"Amount to be paid, max_retry_num = 5¥+nonnegative floating point number, like\\\"¥3299.00\\\", \\\"¥837.00\\\"\"}, \"order_num\": {\"type\": \"string\", \"description\": \"Payment order number, 32-bit lowercase md5 code, like \\\"8bb52003873ffd66d94057cab0b40f53\\\"\"}}\n\nAPI: useAccountbBalance\nAPI Desciption: Use the account to pay and verify whether it succeeds\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will obtain it by accessing the currently bound account.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account, 6 to 12 letters and numbers, both uppercase and lowercase are ok. If left blank, the system will access the currently bound account to obtain it.\", \"required\": false}}\nOutput parameters: {\"pay_result\": {\"type\": \"string\", \"description\": \"Payment result, enumeration type, [yes, no]\"}}\n\nAPI: linkPay\nAPI Desciption: Link payment, get the payment link\nInput parameters: {\"order_num\": {\"type\": \"string\", \"description\": \"Payment order number, 32-digit lowercase md5 code, such as \\\"8bb52003873ffd66d94057cab0b40f53\\\"\", \"required\": true}}\nOutput parameters: {\"pay_link\": {\"type\": \"string\", \"description\": \"Payment link, URL, e.g. \\\"alipay.cn/s/orderNum=8bb52003873ffd66d94057cab0b40f53\\\", the format is\\\"alipay.cn/s/orderNum=\\\"+order number. If the payment result is success, it will be empty.\"}, \"result_pay\": {\"type\": \"string\", \"description\": \"Payment result, enumeration type, [successes, failure]\"}}\n\n\nThe workflow and API information for Scenario 3 is as follows:\nThe workflow pipeline of Order operations is as follows:\nflowchart TD\nUU000(Start)--Check order-->UU001>Subgoal: get order information.\nDescription: call lookOrders to get brief order information. \nIf the returned account_details shows a normal account, tell the user rough order information based on the returned order_info. \nIf the account is abnormal, let the user provide the correct information.]\nUU001-->UU002[Confirm if the user wants to operate on a certain order?]\nUU002 & UU007--NO-->UU003[Say goodbye to the user]\nUU002 & UU007--Yes-->UU004[Subgoal: confirm the current state of the order.\nDescription: ask the user which order to operate on among returned of lookOrders, and check whether the corresponding current_status is received?]\nUU004--Goods received-->UU005[Tell the user that the product has been received, and ask if the user wants to return it?]\nUU005--Not return the good-->UU006[Explain to the user that received orders can only be returned.]\nUU006 & UU009 & UU010-->UU007[Ask if there are other orders needed to be operated?]\nUU005--Return the good-->UU010\nUU004--Goods not received-->UU008[Confirm whether to check order information, cancel the order, expedite the order, or change the address?]\nUU008--Check order information-->UU009>Subgoal: get order details and inform the user.\nDescription: call viewDetails to obtain detailed order information, and deliver the information to the user based on the returned value of viewDetails.]\nUU008--Not check order information-->UU010>Subgoal: notify the result of the operation\nDescription: call orderOperate to do order operation, and tell the result to the user according to the returned result_order]\n\nThe API tool information in this workflow scenario is as follows:\nAPI: lookOrders\nAPI Desciption: Obtain brief information about orders placed under the user's account based on user information\nInput parameters: {\"username\": {\"type\": \"string\", \"description\": \"User name, 4 to 12 letters and numbers, can have spaces. If not filled in, the system will access the currently bound account to obtain it.\", \"required\": false}, \"user_account\": {\"type\": \"string\", \"description\": \"User account, 6 to 12 letters and numbers, both uppercase and lowercase are available. If left blank, the system will access the currently bound account to obtain it.\", \"required\": false}, \"fetch_scope\": {\"type\": \"list\", \"description\": \"Get the range. If written, it must not be empty. If not written, all will be selected.\", \"items\": {\"type\": \"string\", \"description\": \"Current status, enumeration type, [received, not shipped, in transit]\"}, \"required\": false}}\nOutput parameters: {\"order_info\": {\"type\": \"object\", \"description\": \"Order information, empty means there is no order or the account is abnormal\", \"properties\": {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\"}, \"current_status\": {\"type\": \"string\", \"description\": \"Current status, enumeration type, [received, not shipped, in transit]\"}, \"payment_time\": {\"type\": \"string\", \"description\": \"Payment time, the format is year+month+day+hour+minute+second, for example \\\"November 12, 2044 18:11:03\\\"\"}, \"express_number\": {\"type\": \"string\", \"description\": \"Express number, 8~15 letters + numbers, the format depends on the express company, such as \\\"SF1662397783616\\\", \\\"YT7457447178601\\\"\"}}}, \"account_details\": {\"type\": \"string\", \"description\": \"Account information, enumeration type, [normal account, abnormal account, wrong username]\"}}\n\nAPI: orderOperate\nAPI Desciption: The user operates the order and returns the results\nInput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\", \"required\": true}, \"operation_name\": {\"type\": \"string\", \"description\": \"Operation term, enumeration type, [return, cancel, reminder, modification]\", \"required\": true}, \"modified_address\": {\"type\": \"string\", \"description\": \"The format of the modified address is: province city + district county + road name + road number + community name + building number + floor number + household number, for example, \\\"Floor 15, Building 1, Fenghua Community, No. 22 Gucheng Road, Xi'an County, Shaanxi Province. The modification is needed.\", \"required\": false}, \"modified_recipient\": {\"type\": \"string\", \"description\": \"The modified recipient, such as \\\"Mike Din\\\", the modification should be written\", \"required\": false}, \"modified_phone_number\": {\"type\": \"string\", \"description\": \"The modified mobile phone number, a 13-digit mobile phone number, such as \\\"13801234567\\\", the modification must be written\", \"required\": false}}\nOutput parameters: {\"result_order\": {\"type\": \"string\", \"description\": \"As a result, if the goods have been received for return, it will be \\\"the merchant is reviewing\\\"; if the goods have not been received for a return or the address is modified, it will be \\\"intercepting, and the interception may fail\\\"; if it is canceled, it will be \\\"canceled\\\"; if it is an order expedited, it is \\\"has been expedited\\\".\"}}\n\nAPI: viewDetails\nAPI Desciption: Get order details\nInput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\"\", \"required\": true}}\nOutput parameters: {\"order_id\": {\"type\": \"string\", \"description\": \"Order number, 3 letters + 12 numbers, such as \\\"ABC012345678901\\\", empty means the order number is wrong\"}, \"item_name\": {\"type\": \"string\", \"description\": \"Name of items, such as \\\"Purple Dress\\\", and \\\"Nestle Coffee\\\", empty means the order number is wrong\"}, \"progress_info\": {\"type\": \"object\", \"description\": \"Progress information, empty means the order number is wrong\", \"properties\": {\"change_time\": {\"type\": \"string\", \"description\": \"Change the time, in the format of year+month+day+hour+minute+second, such as \\\"November 12, 2044 18:11:03\\\"\"}, \"change_reason\": {\"type\": \"string\", \"description\": \"Reason for change, such as \\\"Order has been created\\\", \\\"Order has been paid\\\", \\\"Merchant has shipped\\\", \\\"Courier company has picked up\\\", \\\"Arrived at XXX location\\\", \\\"Sending from XXX to XXX\\\", \\\"Goods are being delivered\\\", \\\"The goods have been delivered\\\", \\\"The goods have been signed for\\\"\"}, \"express_number\": {\"type\": \"string\", \"description\": \"Express number, 8~15 letters + numbers, the format depends on the express company, such as \\\"SF1662397783616\\\", \\\"YT7457447178601\\\"\"}}}, \"is_order_correct\": {\"type\": \"string\", \"description\": \"Whether the order is correct, enumeration type, [correct order, wrong order]\"}}\n\n\nPlease note that the current time is:2030-11-22 15:19:50 Friday"}, "user_agent": {"first_turn_hint": "Notably, You cannot put forward all the goals and requirements in a single round too quickly, nor can you put forward too many irrelevant content. The rhythm of your dialogue can refer to the following template:\n Dialog template of user: Hello! I'm planning a camping trip and I'm looking for durable outdoor equipment that can withstand various weather conditions. Can you recommend some products that are known for their durability and sustainability? Additionally, I would like to know how I can track my order status and if there are options for expedited shipping.\nDialog template of user: Those options sound spot on! I'll take both. Could you guide me on how to track the status of my order once placed and if there's a way to get expedited shipping?\nDialog template of user: Yes, let's proceed to checkout. Also, could you please explain how the tracking and expedited shipping options work after I've made the purchase? Thank you!\nDialog template of user: Yes, please proceed with the purchase. And once that's done, I'd appreciate a walk-through on setting up the tracking and exploring the expedited shipping options.\nDialog template of user: Absolutely, I confirm the purchase of both the tent and the backpack. Let's finalize this and then move on to the tracking and shipping details. Thanks!\nDialog template of user: Using the payment link sounds good to me. Could you send it over? Also, I'm eager to learn about the tracking and expedited shipping once we wrap up the payment. Thanks!\n\n", "every_turn_hint": "The following is the user profile, task background and user goals:\nUser profile and background: The user is planning a camping trip and is in search of durable outdoor equipment that can withstand various weather conditions. The user shows interest in finding specific items such as a tent and a backpack, with a focus on durability and sustainability. They are also concerned about the order tracking and expedited shipping options after making a purchase.\n\nDialogue task scenario description: The dialogue encompasses a comprehensive interaction that starts with the user's request for product recommendations (Search for products) and transitions through the process of adding selected items to the shopping cart, proceeding to checkout, and finally addressing inquiries regarding order tracking and expedited shipping (Shopping cart management & Order operations).\n\nUser goals and demands:\n1. **Find durable and sustainable outdoor equipment**: The user is looking for a tent and a backpack suitable for camping, with each item priced under $500.\n - **API parameters needed**: \n - `product_types`: \"tent, backpack\"\n - `lower_limit`: \"0 yuan\"\n - `upper_limit`: \"500 yuan\"\n2. **Complete the purchase of selected items**: After finding satisfactory products, the user decides to buy both the tent and the backpack.\n - **API parameters needed**:\n - `product_id`: \"high durability waterproof tent, large capacity multi-compartment backpack\"\n - `product_num`: 1\n3. **Understand and utilize the order tracking and expedited shipping options**: Post-purchase, the user wishes to learn how to track their order and inquire about the possibility of expedited shipping.\n - **API parameters potentially needed for future actions**:\n - Not directly mentioned, but would involve interactions with APIs related to order tracking and modifications for expedited shipping, such as `lookOrders` and `orderOperate` with possible operations like `reminder` (for expedited shipping) and viewing detailed order information through `viewDetails`.\n\nThe following is the information corresponding to some API parameters that the user role may provide during the conversation. If the conversation involves related parameters, you need to comply with the value in 'Action Input'.\nAction: searchProducts\nAction Input: {\"product_types\": \"tent, backpack\", \"product_names\": \"\", \"lower_limit\": \"0 yuan\", \"upper_limit\": \"500 yuan\"}\n\nAction: addCart\nAction Input: {\"product_id\": \"high durability waterproof tent, large capacity multi-compartment backpack\", \"product_num\": 1}\n\nAction: shopCartCheckout\nAction Input: {\"submitted_product_name\": [\"high durability waterproof tent\", \"large capacity multi-compartment backpack\"]}\n\nAction: linkPay\nAction Input: {\"order_num\": \"8bb52003873ffd66d94057cab0b40f53\"}\n\nDo not introduce any goals beyond those specified or suggest API call parameters not already provided. Avoid other topics and be concise. After completing the goals, say goodbye to the assistant and end the conversation with '[DONE]'\n."}, "datetime": "2030-11-22 15:19:50 Friday"}
5. 总结
在本期Agent开源数据集分享中,我们整理了多个Agent开源数据集,涵盖网页操作、软件工程等多个任务和计划、多步推理、反思等多个核心能力。值得注意的是,相比LLM,由于Agent的数据具有跨模态、轮数多、绝对正确性弱等特点,故这些开源数据以BenchMark为主,且往往需要进一步提取与整合,才能得到格式化、标准化的Agent数据。未来,我们将致力于Agent数据的收集和构建,为Agent在各个领域落地提供数据支撑。
整数智能信息技术(杭州)有限责任公司,起源自浙江大学计算机创新技术研究院,致力于成为AI行业的数据合伙人。整数智能也是中国人工智能产业发展联盟、ASAM协会、浙江省人工智能产业技术联盟成员,其提供的智能数据工程平台(MooreData Platform)与数据集构建服务(ACE Service),满足了智能驾驶(Automobile AI)、生成式人工智能(Generative AI)、具身智能(Embodied AI)等数十个人工智能应用场景对于先进的智能标注工具以及高质量数据的需求。
目前公司已合作海内外顶级科技公司与科研机构客户2000余家,拥有知识产权数十项,通过ISO 9001、ISO 27001、ISO 27701等国际认证,多次参与人工智能领域的标准与白皮书撰写,也受到《CCTV财经频道》《新锐杭商》《浙江卫视》《苏州卫视》等多家新闻媒体报道。