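AI Agents for Beginners: Building an Agent from Scratch (Worth Bookmarking)

1. Introduction

In this article, we look at how to build an agent from scratch in Python. The agent decides what to do based on user input, selects an appropriate tool, and executes the corresponding task. Let's get started!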

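2. What Is an Agent?

An agent is an autonomous entity that perceives its environment, makes decisions, and takes actions to achieve a specific goal. AI agents vary in complexity, from simple reactive agents that only respond to stimuli to advanced agents that learn and adapt over time. Common agent types include:

  • Reactive Agents: respond directly to changes in the environment and keep no internal memory.
  • Model-Based Agents: use an internal model of the world to make decisions.
  • Goal-Based Agents: plan their actions around achieving a specific goal.
  • Utility-Based Agents: evaluate candidate actions with a utility function and pick the one with the best expected outcome.

Examples include chatbots, recommendation systems, and self-driving cars, each of which relies on a different type of agent to carry out its tasks efficiently and intelligently.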
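To make the difference between agent types concrete, here is a minimal sketch that contrasts a reactive agent with a utility-based one. The thermostat scenario, the thresholds, and the function names are illustrative assumptions and are not part of the tutorial's code.

```python
# Illustrative only: a toy thermostat, not part of the tutorial's code.

def reactive_agent(temperature):
    """Reactive: maps the current percept straight to an action, no memory."""
    if temperature < 18:
        return "heat"
    if temperature > 24:
        return "cool"
    return "idle"


def utility_based_agent(temperature, target=21.0):
    """Utility-based: scores each candidate action and picks the best one."""
    # Hypothetical effect of each action on the temperature, in degrees.
    effects = {"heat": +2.0, "cool": -2.0, "idle": 0.0}

    def utility(action):
        # Higher utility means the predicted temperature is closer to target.
        return -abs((temperature + effects[action]) - target)

    return max(effects, key=utility)


print(reactive_agent(23))       # idle
print(utility_based_agent(23))  # cool
```

At 23 degrees the reactive agent stays idle because no threshold is crossed, while the utility-based agent chooses to cool because that action brings the predicted temperature closest to the target.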

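The core components of the agent are:

  • Models: the agent's brain, responsible for processing the input and producing a response.
  • Tools: predefined functions the agent can execute in response to a user request.
  • Toolbox: the collection of tools available to the agent.
  • System Prompt: the set of instructions that tells the agent how to handle user input and choose the right tool.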

3. Implementation

Now let's roll up our sleeves and start building!


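  • Prerequisites

The complete code for this tutorial is available in the AI Agents GitHub repository.

GitHub: https://github.com/vsingh9076/AI-Agents/tree/main/build-agent-from-scratch

Before running the code, make sure your system meets the following prerequisites: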

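  • Python environment setup

Run the AI agent inside a Python virtual environment. Create and activate one as follows:

python -m venv ai_agents_env
source ai_agents_env/bin/activate  # On Windows: ai_agents_env\Scripts\activate

Next, install the dependencies. From the repository directory, install everything listed in requirements.txt:

pip install -r requirements.txt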
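
If you are unsure whether the virtual environment is actually active, a quick check from Python can help. The sketch below relies only on the standard library; the helper name in_virtualenv is an assumption made for illustration.

```python
import sys


def in_virtualenv():
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the environment directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if in_virtualenv():
    print(f"Virtual environment active: {sys.prefix}")
else:
    print("No virtual environment active; activate ai_agents_env first.")
```
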
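  • Setting up Ollama locally

Ollama is used to run and manage large language models locally. To install and configure it, visit the official Ollama website and download the installer for your operating system.

Ollama website: https://ollama.com/

After installing it according to the instructions on the website, run the following command to check that Ollama is installed correctly:

ollama --version

Next, pull a model. Some agent implementations require a specific model, which you can pull with:

ollama pull mistral  # Replace 'mistral' with the model needed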
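
Before running the agent, it can be useful to confirm from Python that the model has been pulled. The sketch below queries the same local endpoint that the article's curl example uses (http://localhost:11434/api/tags). It assumes the reply looks like {"models": [{"name": "mistral:latest"}, ...]}; the helper names fetch_tags and model_is_available are hypothetical.

```python
import json
import urllib.request

# Default local Ollama address, as used elsewhere in this article.
OLLAMA_TAGS_URL = "http://localhost:11434/api/tags"


def fetch_tags(url=OLLAMA_TAGS_URL, timeout=5):
    """Fetch the list of locally available models from a running Ollama server."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


def model_is_available(tags_payload, model_name):
    """Check whether model_name appears in an /api/tags style payload.

    Names are compared without the tag suffix, so "mistral" matches
    "mistral:latest". The payload shape is an assumption (see above).
    """
    wanted = model_name.split(":", 1)[0]
    for entry in tags_payload.get("models", []):
        if entry.get("name", "").split(":", 1)[0] == wanted:
            return True
    return False


# Demonstration with a hand-written sample payload (no server required).
sample = {"models": [{"name": "mistral:latest"}, {"name": "llama2:7b"}]}
print(model_is_available(sample, "mistral"))    # True
print(model_is_available(sample, "codellama"))  # False
```

With a running server, model_is_available(fetch_tags(), "mistral") performs the same check against the real model list; fetch_tags raises an OSError subclass if the server cannot be reached.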

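4. Defining the Model Class

The overall flow implemented in this article is as follows:

[Figure: flow diagram of the implementation]

Besides Python, we also need a few libraries. This tutorial uses requests, json (part of the standard library), and termcolor, plus dotenv for managing environment variables.

pip install requests termcolor python-dotenv

First, we need a model that processes user input. We will create an OllamaModel class that talks to the local API to generate responses.

Here is a simple implementation: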

from termcolor import colored
import os
from dotenv import load_dotenv

load_dotenv()

### Models
import requests
import json
import operator


class OllamaModel:
    def __init__(self, model, system_prompt, temperature=0, stop=None):
        """
        Initializes the OllamaModel with the given parameters.

        Parameters:
        model (str): The name of the model to use.
        system_prompt (str): The system prompt to use.
        temperature (float): The temperature setting for the model.
        stop (str): The stop token for the model.
        """
        self.model_endpoint = "http://localhost:11434/api/generate"
        self.temperature = temperature
        self.model = model
        self.system_prompt = system_prompt
        self.headers = {"Content-Type": "application/json"}
        self.stop = stop

    def generate_text(self, prompt):
        """
        Generates a response from the Ollama model based on the provided prompt.

        Parameters:
        prompt (str): The user query to generate a response for.

        Returns:
        dict: The response from the model as a dictionary.
        """
        # Ollama reads sampling parameters such as temperature and stop
        # from the "options" object, not from the top level of the payload.
        options = {"temperature": self.temperature}
        if self.stop:
            options["stop"] = [self.stop]

        payload = {
            "model": self.model,
            "format": "json",
            "prompt": prompt,
            "system": self.system_prompt,
            "stream": False,
            "options": options
        }

        try:
            request_response = requests.post(
                self.model_endpoint,
                headers=self.headers,
                data=json.dumps(payload)
            )
            print("REQUEST RESPONSE", request_response)
            request_response_json = request_response.json()
            # The model's answer is a JSON string inside the 'response' field
            response = request_response_json['response']
            response_dict = json.loads(response)
            print(f"\n\nResponse from Ollama model: {response_dict}")
            return response_dict
        except requests.RequestException as e:
            response = {"error": f"Error in invoking model! {str(e)}"}
            return response
        except (KeyError, json.JSONDecodeError) as e:
            # Missing 'response' field, or a reply that is not valid JSON
            response = {"error": f"Error in parsing model response! {str(e)}"}
            return response

The class is initialized with the model, system_prompt, temperature, and stop parameters. Its generate_text method sends a request to the model API and returns the response as a dictionary.
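
Because the request asks for format "json", the model's answer arrives as a JSON string inside the response field and has to be decoded a second time. The sketch below isolates that step so it can be tried without a running server. The helper name parse_model_response and the sample replies are assumptions for illustration, not part of the tutorial's code.

```python
import json


def parse_model_response(api_json):
    """Turn an /api/generate style reply into a dict, never raising.

    api_json is the decoded HTTP body; its "response" field is expected
    to hold a JSON string because the request asked for format="json".
    """
    raw = api_json.get("response")
    if raw is None:
        return {"error": "Reply has no 'response' field"}
    try:
        parsed = json.loads(raw)
    except (json.JSONDecodeError, TypeError) as exc:
        return {"error": f"Model did not return valid JSON: {exc}"}
    if not isinstance(parsed, dict):
        return {"error": "Model returned JSON that is not an object"}
    return parsed


# Hand-written sample replies (no server required).
good = {"response": '{"tool_choice": "reverse_string", "tool_input": "abc"}'}
bad = {"response": "Sure! Here is the answer..."}

print(parse_model_response(good)["tool_choice"])  # reverse_string
print("error" in parse_model_response(bad))       # True
```

Returning an error dictionary instead of raising keeps the caller's logic simple: it can always call .get("tool_choice") on the result.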

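5. Creating the Agent's Tools

The next step is to create the tools the agent can use. These tools are simple Python functions that each perform a specific task. Below are a basic calculator and a string reverser: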

def basic_calculator(input_str):
    """
    Perform a numeric operation on two numbers based on the input string or dictionary.

    Parameters:
    input_str (str or dict): Either a JSON string representing a dictionary with keys 'num1', 'num2', and 'operation',
                            or a dictionary directly. Example: '{"num1": 5, "num2": 3, "operation": "add"}'
                            or {"num1": 67869, "num2": 9030393, "operation": "divide"}

    Returns:
    str: The formatted result of the operation, or an error message if the
         input is invalid, the operation is unsupported, or the calculation
         fails (e.g., division by zero).
    """
    try:
        # Handle both dictionary and string inputs
        if isinstance(input_str, dict):
            input_dict = input_str
        else:
            # Clean and parse the input string
            input_str_clean = input_str.replace("'", "\"")
            input_str_clean = input_str_clean.strip().strip("\"")
            input_dict = json.loads(input_str_clean)

        # Validate required fields
        if not all(key in input_dict for key in ['num1', 'num2', 'operation']):
            return "Error: Input must contain 'num1', 'num2', and 'operation'"

        num1 = float(input_dict['num1'])  # Convert to float to handle decimal numbers
        num2 = float(input_dict['num2'])
        operation = input_dict['operation'].lower()  # Make case-insensitive
    except (json.JSONDecodeError, KeyError, AttributeError):
        return "Invalid input format. Please provide valid numbers and operation."
    except (ValueError, TypeError):
        return "Error: Please provide valid numerical values."

    # Define the supported operations with error handling
    operations = {
        'add': operator.add,
        'plus': operator.add,  # Alternative word for add
        'subtract': operator.sub,
        'minus': operator.sub,  # Alternative word for subtract
        'multiply': operator.mul,
        'times': operator.mul,  # Alternative word for multiply
        'divide': operator.truediv,
        'floor_divide': operator.floordiv,
        'modulus': operator.mod,
        'power': operator.pow,
        'lt': operator.lt,
        'le': operator.le,
        'eq': operator.eq,
        'ne': operator.ne,
        'ge': operator.ge,
        'gt': operator.gt
    }

    # Check if the operation is supported
    if operation not in operations:
        return f"Unsupported operation: '{operation}'. Supported operations are: {', '.join(operations.keys())}"

    try:
        # Special handling for division by zero
        if (operation in ['divide', 'floor_divide', 'modulus']) and num2 == 0:
            return "Error: Division by zero is not allowed"

        # Perform the operation
        result = operations[operation](num1, num2)

        # Format result based on type
        if isinstance(result, bool):
            result_str = "True" if result else "False"
        elif isinstance(result, float):
            # Handle floating point precision
            result_str = f"{result:.6f}".rstrip('0').rstrip('.')
        else:
            result_str = str(result)

        return f"The answer is: {result_str}"
    except Exception as e:
        return f"Error during calculation: {str(e)}"


def reverse_string(input_string):
    """
    Reverse the given string.

    Parameters:
    input_string (str): The string to be reversed.

    Returns:
    str: The reversed string.
    """
    # Check if input is a string
    if not isinstance(input_string, str):
        return "Error: Input must be a string"

    # Reverse the string using slicing
    reversed_string = input_string[::-1]

    # Format the output
    result = f"The reversed string is: {reversed_string}"

    return result

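These functions perform specific tasks based on the input they receive: basic_calculator handles arithmetic and comparison operations, while reverse_string reverses a given string.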
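
The calculator's central idea is a dispatch table: a dictionary that maps operation names to functions from the operator module. The condensed sketch below shows just that technique. The names OPERATIONS and apply_operation are illustrative assumptions, and the table is deliberately shorter than the one in the tutorial.

```python
import operator

# Map operation names to functions; several names may share one function.
OPERATIONS = {
    "add": operator.add,
    "plus": operator.add,
    "subtract": operator.sub,
    "multiply": operator.mul,
    "divide": operator.truediv,
}


def apply_operation(name, a, b):
    """Look up name in the table and apply it, returning a message string."""
    func = OPERATIONS.get(name.lower())
    if func is None:
        return f"Unsupported operation: '{name}'"
    if func is operator.truediv and b == 0:
        return "Error: Division by zero is not allowed"
    return f"The answer is: {func(a, b)}"


print(apply_operation("plus", 15, 7))     # The answer is: 22
print(apply_operation("divide", 100, 5))  # The answer is: 20.0
print(apply_operation("sqrt", 4, 0))      # Unsupported operation: 'sqrt'
```

Adding a new operation then only requires a new dictionary entry, with no extra if/elif branch.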

6. Creating the Toolbox

The ToolBox class stores every tool the agent can use and provides a description of each one:

class ToolBox:
    def __init__(self):
        self.tools_dict = {}

    def store(self, functions_list):
        """
        Stores the literal name and docstring of each function in the list.

        Parameters:
        functions_list (list): List of function objects to store.

        Returns:
        dict: Dictionary with function names as keys and their docstrings as values.
        """
        for func in functions_list:
            self.tools_dict[func.__name__] = func.__doc__
        return self.tools_dict

    def tools(self):
        """
        Returns the dictionary created in store as a text string.

        Returns:
        str: Dictionary of stored functions and their docstrings as a text string.
        """
        tools_str = ""
        for name, doc in self.tools_dict.items():
            tools_str += f"{name}: \"{doc}\"\n"
        return tools_str.strip()

This class lets the agent know which tools are available and what each one is for.
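
The toolbox works because every Python function carries its own name and docstring in the __name__ and __doc__ attributes. The sketch below shows that mechanism in isolation. The functions shout and describe are made up for illustration, and the fallback text for a missing docstring is an addition that the tutorial's ToolBox does not have.

```python
def shout(text):
    """Return text in upper case."""
    return text.upper()


def describe(functions):
    """Build 'name: "docstring"' lines, one per function."""
    lines = []
    for func in functions:
        # __doc__ is None when a function has no docstring.
        doc = (func.__doc__ or "No description provided.").strip()
        lines.append(f'{func.__name__}: "{doc}"')
    return "\n".join(lines)


print(describe([shout]))  # shout: "Return text in upper case."
```

Since the docstrings end up in the system prompt, writing them clearly directly affects how well the model chooses tools.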

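7. Creating the Agent Class

The agent needs to think, decide which tool to use, and then execute it. The implementation consists of a system prompt and the Agent class.

The system prompt is as follows: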

agent_system_prompt_template = """You are an intelligent AI assistant with access to specific tools. Your responses must ALWAYS be in this JSON format:
{{
    "tool_choice": "name_of_the_tool",
    "tool_input": "inputs_to_the_tool"
}}

TOOLS AND WHEN TO USE THEM:

1. basic_calculator: Use for ANY mathematical calculations
   - Input format: {{"num1": number, "num2": number, "operation": "add/subtract/multiply/divide"}}
   - Supported operations: add/plus, subtract/minus, multiply/times, divide
   - Example inputs and outputs:
     Input: "Calculate 15 plus 7"
     Output: {{"tool_choice": "basic_calculator", "tool_input": {{"num1": 15, "num2": 7, "operation": "add"}}}}

     Input: "What is 100 divided by 5?"
     Output: {{"tool_choice": "basic_calculator", "tool_input": {{"num1": 100, "num2": 5, "operation": "divide"}}}}

2. reverse_string: Use for ANY request involving reversing text
   - Input format: Just the text to be reversed as a string
   - ALWAYS use this tool when user mentions "reverse", "backwards", or asks to reverse text
   - Example inputs and outputs:
     Input: "Reverse of 'Howwwww'?"
     Output: {{"tool_choice": "reverse_string", "tool_input": "Howwwww"}}

     Input: "What is the reverse of Python?"
     Output: {{"tool_choice": "reverse_string", "tool_input": "Python"}}

3. no tool: Use for general conversation and questions
   - Example inputs and outputs:
     Input: "Who are you?"
     Output: {{"tool_choice": "no tool", "tool_input": "I am an AI assistant that can help you with calculations, reverse text, and answer questions. I can perform mathematical operations and reverse strings. How can I help you today?"}}

     Input: "How are you?"
     Output: {{"tool_choice": "no tool", "tool_input": "I'm functioning well, thank you for asking! I'm here to help you with calculations, text reversal, or answer any questions you might have."}}

STRICT RULES:

1. For questions about identity, capabilities, or feelings:
   - ALWAYS use "no tool"
   - Provide a complete, friendly response
   - Mention your capabilities

2. For ANY text reversal request:
   - ALWAYS use "reverse_string"
   - Extract ONLY the text to be reversed
   - Remove quotes, "reverse of", and other extra text

3. For ANY math operations:
   - ALWAYS use "basic_calculator"
   - Extract the numbers and operation
   - Convert text numbers to digits

Here is a list of your tools along with their descriptions:
{tool_descriptions}

Remember: Your response must ALWAYS be valid JSON with "tool_choice" and "tool_input" fields."""
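
The doubled braces in the template matter. The template is later filled in with str.format, which treats single braces as replacement fields, so literal JSON braces have to be written as {{ and }}. The sketch below uses a shortened stand-in template (not the tutorial's full prompt) to show the effect.

```python
# A shortened stand-in for the tutorial's template (illustrative only).
template = """Reply in this JSON format:
{{"tool_choice": "name_of_the_tool", "tool_input": "inputs_to_the_tool"}}

Available tools:
{tool_descriptions}"""

prompt = template.format(
    tool_descriptions='reverse_string: "Reverse the given string."'
)
print(prompt)

# Single braces around JSON would be read as a replacement field and fail.
try:
    '{"tool_choice": "x"} {tool_descriptions}'.format(tool_descriptions="...")
except (KeyError, ValueError) as exc:
    print(f"Unescaped braces fail: {type(exc).__name__}")
```

After formatting, each {{ and }} pair collapses to a single brace, so the model sees ordinary JSON in its instructions.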

The Agent class is implemented as follows:

class Agent:
    def __init__(self, tools, model_service, model_name, stop=None):
        """
        Initializes the agent with a list of tools and a model.

        Parameters:
        tools (list): List of tool functions.
        model_service (class): The model service class with a generate_text method.
        model_name (str): The name of the model to use.
        """
        self.tools = tools
        self.model_service = model_service
        self.model_name = model_name
        self.stop = stop

    def prepare_tools(self):
        """
        Stores the tools in the toolbox and returns their descriptions.

        Returns:
        str: Descriptions of the tools stored in the toolbox.
        """
        toolbox = ToolBox()
        toolbox.store(self.tools)
        tool_descriptions = toolbox.tools()
        return tool_descriptions

    def think(self, prompt):
        """
        Runs the generate_text method on the model using the system prompt template and tool descriptions.

        Parameters:
        prompt (str): The user query to generate a response for.

        Returns:
        dict: The response from the model as a dictionary.
        """
        tool_descriptions = self.prepare_tools()
        agent_system_prompt = agent_system_prompt_template.format(tool_descriptions=tool_descriptions)

        # Create an instance of the model service with the system prompt
        if self.model_service == OllamaModel:
            model_instance = self.model_service(
                model=self.model_name,
                system_prompt=agent_system_prompt,
                temperature=0,
                stop=self.stop
            )
        else:
            model_instance = self.model_service(
                model=self.model_name,
                system_prompt=agent_system_prompt,
                temperature=0
            )

        # Generate and return the response dictionary
        agent_response_dict = model_instance.generate_text(prompt)
        return agent_response_dict

    def work(self, prompt):
        """
        Parses the dictionary returned from think and executes the appropriate tool.

        Parameters:
        prompt (str): The user query to generate a response for.

        Returns:
        The response from executing the appropriate tool or the tool_input if no matching tool is found.
        """
        agent_response_dict = self.think(prompt)
        tool_choice = agent_response_dict.get("tool_choice")
        tool_input = agent_response_dict.get("tool_input")

        for tool in self.tools:
            if tool.__name__ == tool_choice:
                response = tool(tool_input)
                print(colored(response, 'cyan'))
                return response

        # No matching tool (e.g. "no tool"): the tool_input is the answer itself
        print(colored(tool_input, 'cyan'))
        return tool_input

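The class has three main methods:

  • prepare_tools: stores the tools and returns their descriptions.
  • think: decides which tool to use based on the user's prompt.
  • work: executes the chosen tool and returns the result.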
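
The think-then-work flow can be tried without a running model by swapping in a stub. The sketch below is a simplified illustration under that assumption: FakeModel and dispatch are hypothetical names, and the stub returns a fixed decision instead of calling Ollama.

```python
def reverse_string(text):
    """Reverse the given string."""
    return text[::-1]


class FakeModel:
    """Hypothetical stand-in for OllamaModel that returns a fixed decision."""

    def __init__(self, decision):
        self.decision = decision

    def generate_text(self, prompt):
        return self.decision


def dispatch(decision, tools):
    """Run the tool named in decision, or fall back to the raw tool_input."""
    by_name = {tool.__name__: tool for tool in tools}
    tool = by_name.get(decision.get("tool_choice"))
    tool_input = decision.get("tool_input")
    if tool is None:
        # "no tool" (or an unknown name): the input already is the answer.
        return tool_input
    return tool(tool_input)


tools = [reverse_string]

model = FakeModel({"tool_choice": "reverse_string", "tool_input": "Python"})
print(dispatch(model.generate_text("Reverse Python"), tools))  # nohtyP

model = FakeModel({"tool_choice": "no tool", "tool_input": "Hello!"})
print(dispatch(model.generate_text("Hi"), tools))  # Hello!
```

Looking tools up in a dictionary keyed by __name__ is equivalent to the loop in work; either way, the model's tool_choice string must match a function name exactly.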

8. Running the Agent

Finally, let's put everything together and run the agent. Inside the script's main entry point, initialize the Agent and start accepting user input:

# Example usage
if __name__ == "__main__":
    """
    Instructions for using this agent:

    Example queries you can try:
    1. Calculator operations:
       - "Calculate 15 plus 7"
       - "What is 100 divided by 5?"
       - "Multiply 23 and 4"
    2. String reversal:
       - "Reverse the word 'hello world'"
       - "Can you reverse 'Python Programming'?"
    3. General questions (will get direct responses):
       - "Who are you?"
       - "What can you help me with?"

    Ollama Commands (run these in terminal):
    - Check available models:    'ollama list'
    - Check running models:      'ps aux | grep ollama'
    - List model tags:           'curl http://localhost:11434/api/tags'
    - Pull a new model:          'ollama pull mistral'
    - Run model server:          'ollama serve'
    """
    tools = [basic_calculator, reverse_string]

    # Uncomment below to run with OpenAI
    # (requires an OpenAIModel class, which is not shown in this article)
    # model_service = OpenAIModel
    # model_name = 'gpt-3.5-turbo'
    # stop = None

    # Using Ollama with llama2 model
    # (the model must be pulled first, e.g. 'ollama pull llama2')
    model_service = OllamaModel
    model_name = "llama2"  # Can be changed to other models like 'mistral', 'codellama', etc.
    stop = "<|eot_id|>"

    agent = Agent(tools=tools, model_service=model_service, model_name=model_name, stop=stop)

    print("\nWelcome to the AI Agent! Type 'exit' to quit.")
    print("You can ask me to:")
    print("1. Perform calculations (e.g., 'Calculate 15 plus 7')")
    print("2. Reverse strings (e.g., 'Reverse hello world')")
    print("3. Answer general questions\n")

    while True:
        prompt = input("Ask me anything: ")
        if prompt.lower() == "exit":
            break
        # Must stay inside the loop so that every prompt is handled
        agent.work(prompt)
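
An interactive loop built on input() is awkward to test automatically. One option, sketched here as an assumption rather than as part of the tutorial's code, is to move the loop into a function that takes any iterable of prompts and a handler. The names run_session and echo_handler are hypothetical; in the real script, agent.work would play the role of the handler.

```python
def run_session(prompts, handle):
    """Feed prompts to handle until 'exit' is seen; return the replies."""
    replies = []
    for prompt in prompts:
        if prompt.strip().lower() == "exit":
            break
        replies.append(handle(prompt))
    return replies


# A hypothetical handler standing in for agent.work.
def echo_handler(prompt):
    return f"handled: {prompt}"


print(run_session(["Calculate 15 plus 7", "exit", "never reached"], echo_handler))
# ['handled: Calculate 15 plus 7']
```
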

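9. Conclusion

In this article, we explored what an agent is and built one step by step. We set up a virtual environment, defined the model, created basic tools, and built a structured toolbox to support the agent's capabilities. Finally, we brought everything together by running the agent.

This structured approach provides a solid foundation for building interactive agents that can automate tasks and make informed decisions. As AI agents continue to evolve, their applications will extend across industries, driving efficiency and innovation.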
