刷屏的 Nano Banana 有多强？5 大图像功能 + 完整代码

最新推荐文章于 2025-11-21 06:22:29 发布

转载最新推荐文章于 2025-11-21 06:22:29 发布 · 189 阅读

CC 4.0 BY-SA版权

原文链接：https://mp.weixin.qq.com/s?__biz=MjM5OTE4MzcyNA==&mid=2648848779&idx=1&sn=c7709b0820e2ffef29f3b899a850edff&chksm=be7491fd7723538696aae10212811d4ca43517e85dcaa5364b91e18b2847eb2d77317f28fb2a&scene=126&sessionid=0

部署运行你感兴趣的模型镜像

一 Nano Banana 介绍

谷歌最近发布了，这是一种强大的图像生成和编辑新模型，代号为 Nano Banana。该模型引入了用于创建和作图像的最先进的功能，解锁了广泛的新应用程序。Gemini 2.5 Flash Image

二功能介绍

核心功能：

图像创建
图片编辑
照片修复
多个输入图像
对话式图像编辑

三代码实践

来自谷歌自身的《如何使用 Nano Banana 进行构建：完整的开发人员教程》文章。下面会用python代码调用API，需要先安装依赖库，以python为例：

pip install -U google-genai
# Install the Pillow library for image manipulation
pip install Pillow

3.1 图像创建

使用 Nano Banana 从描述性文本提示生成一个或多个图像。对所有 API 请求使用模型 ID gemini-2.5-flash-image-preview。

代码：

from google import genai
from PIL import Image
from io import BytesIO

# Configure the client with your API key
client = genai.Client(api_key="YOUR_API_KEY")

prompt = """Create a photorealistic image of an orange cat
with a green eyes, sitting on a couch."""

# Call the API to generate content
response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=prompt,
)

# The response can contain both text and image data.
# Iterate through the parts to find and save the image.
for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = Image.open(BytesIO(part.inline_data.data))
        image.save("cat.png")

生成图：

3.2 使用文本和图像输入进行图像编辑

提供现有图像以及执行编辑的文本提示。该模型擅长保持输入图像的字符和内容一致性。

代码：

from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client(api_key="YOUR_API_KEY")

prompt = """Using the image of the cat, create a photorealistic,
street-level view of the cat walking along a sidewalk in a
New York City neighborhood, with the blurred legs of pedestrians
and yellow cabs passing by in the background."""

image = Image.open("cat.png")

# Pass both the text prompt and the image in the 'contents' list
response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=[prompt, image],
)

for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = Image.open(BytesIO(part.inline_data.data))
        image.save("cat2.png")

生成图

3.3 用 Nano Banana 修复照片

该模型的强大应用之一是照片修复。只需一个简单的提示，它就可以恢复和着色旧照片，并获得令人印象深刻的效果。

代码：

from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client(api_key="YOUR_API_KEY")

prompt = "Restore and colorize this image from 1932"

image = Image.open("lunch.jpg")  # "Lunch atop a Skyscraper, 1932"

response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=[prompt, image],
)

for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = Image.open(BytesIO(part.inline_data.data))
        image.save("lunch-restored.png")

修复图前后对比：

3.4 使用多个输入图像

您可以提供多个图像作为输入，以执行更复杂的编辑任务。

from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client(api_key="YOUR_API_KEY")

prompt = "Make the girl wear this t-shirt. Leave the background unchanged."

image1 = Image.open("girl.png")
image2 = Image.open("tshirt.png")

response = client.models.generate_content(
    model="gemini-2.5-flash-image-preview",
    contents=[prompt, image1, image2],
)

for part in response.candidates[0].content.parts:
    if part.text is not None:
        print(part.text)
    elif part.inline_data is not None:
        image = Image.open(BytesIO(part.inline_data.data))
        image.save("girl-with-tshirt.png")

合成图：

3.5 对话式图像编辑

为了进行迭代优化，可以使用聊天会话来维护多个请求的上下文。这允许您以对话方式编辑图像。

from google import genai
from PIL import Image
from io import BytesIO

client = genai.Client(api_key="YOUR_API_KEY")

# Create a chat
chat = client.chats.create(
    model="gemini-2.5-flash-image-preview"
)

# Make the first image edit
response1 = chat.send_message(
    [
        "Change the cat to a bengal cat, leave everything else the same",
        Image.open("cat.png"),
    ]
)
# display / save image...

# Continue chatting and editing
response2 = chat.send_message("The cat should wear a funny party hat")
# display / save image...