如何高效合并音视频文件(时间短消耗资源少)(二)

英语字幕

1
00:00:06,480 --> 00:00:08,400
Good morning. We have a banger for you

2
00:00:08,400 --> 00:00:09,840
today. We're going to launch chatbt

3
00:00:09,840 --> 00:00:11,519
agent. But before jumping into that, I'd

4
00:00:11,519 --> 00:00:12,559
like to ask the team to introduce

5
00:00:12,559 --> 00:00:14,080
themselves. Starting with Yosh.

6
00:00:14,080 --> 00:00:17,840
Hi, I'm Yash. I work on agent team and

7
00:00:17,840 --> 00:00:20,080
before that I used to work on operator.

8
00:00:20,080 --> 00:00:22,560
Hi, I'm Jing. I work on agents research

9
00:00:22,560 --> 00:00:24,400
previously on deep research.

10
00:00:24,400 --> 00:00:26,000
Hi, I'm Casey. I'm a researcher on

11
00:00:26,000 --> 00:00:27,920
agents formerly operator.

12
00:00:27,920 --> 00:00:30,560
Hi, I'm Issa. I'm a researcher on agent

13
00:00:30,560 --> 00:00:32,640
formerly on deep research.

14
00:00:32,640 --> 00:00:34,880
So we we started launching agents

15
00:00:34,880 --> 00:00:36,800
earlier this year. Uh we launched deep

16
00:00:36,800 --> 00:00:38,879
research, we launched operator and

17
00:00:38,879 --> 00:00:40,160
people were very excited about this.

18
00:00:40,160 --> 00:00:42,480
People could see that now uh AI was

19
00:00:42,480 --> 00:00:44,640
going off to do complex tasks for them.

20
00:00:44,640 --> 00:00:46,079
But it became clear to us that what

21
00:00:46,079 --> 00:00:48,000
people really wanted was for us to bring

22
00:00:48,000 --> 00:00:49,760
those capabilities and more together.

23
00:00:49,760 --> 00:00:51,920
People wanted a unified agent that could

24
00:00:51,920 --> 00:00:55,039
go off, use its own computer and do real

25
00:00:55,039 --> 00:00:57,360
complex tasks for them, that could uh

26
00:00:57,360 --> 00:00:59,359
seamlessly transition from thinking

27
00:00:59,359 --> 00:01:01,520
about something to taking actions to

28
00:01:01,520 --> 00:01:03,359
using lots of tools using the terminal,

29
00:01:03,359 --> 00:01:05,360
clicking around the web, even producing

30
00:01:05,360 --> 00:01:06,880
things like spreadsheets and slides and

31
00:01:06,880 --> 00:01:08,960
and much more. And wanted people want to

32
00:01:08,960 --> 00:01:10,159
be able to do this over a long time

33
00:01:10,159 --> 00:01:12,159
horizon and a sort of for universal

34
00:01:12,159 --> 00:01:13,840
tasks. So the team has been working

35
00:01:13,840 --> 00:01:16,400
super hard to bring that together. And

36
00:01:16,400 --> 00:01:18,080
today we have chat with the agent. Um,

37
00:01:18,080 --> 00:01:19,680
it's probably easier to show it to you

38
00:01:19,680 --> 00:01:21,439
than to keep talking about it. It is one

39
00:01:21,439 --> 00:01:23,360
of the feel the aon moments for me to

40
00:01:23,360 --> 00:01:25,280
watch it work. So, let's take a look.

41
00:01:25,280 --> 00:01:27,840
Awesome. Thanks, Sam. Hello, everyone.

42
00:01:27,840 --> 00:01:29,920
Very excited to share chat GBD agent

43
00:01:29,920 --> 00:01:31,600
with everybody. And as Sam said, let's

44
00:01:31,600 --> 00:01:33,759
just dive right into the demo. Okay, so

45
00:01:33,759 --> 00:01:36,159
we are on Chad GBD as we all know and

46
00:01:36,159 --> 00:01:39,119
love. And to turn on the agent mode, you

47
00:01:39,119 --> 00:01:40,880
just click the tools menu and select

48
00:01:40,880 --> 00:01:43,280
agent. You can also just type agent in

49
00:01:43,280 --> 00:01:45,040
the composer bar and it'll take you to

50
00:01:45,040 --> 00:01:47,520
agent mode. Um, Edward and I have a

51
00:01:47,520 --> 00:01:49,360
wedding to go to later this year. Uh,

52
00:01:49,360 --> 00:01:51,119
it's for one of our mutual friends.

53
00:01:51,119 --> 00:01:52,560
Should we should we have the Asian

54
00:01:52,560 --> 00:01:53,280
planet?

55
00:01:53,280 --> 00:01:55,680
Yeah, let's do it. I need an outfit. And

56
00:01:55,680 --> 00:01:56,799
don't forget the gift.

57
00:01:56,799 --> 00:01:58,719
Okay, great. We won't forget the gift.

58
00:01:58,719 --> 00:02:00,240
Um, it's a little bit of a longer

59
00:02:00,240 --> 00:02:01,680
prompt, so I have it copied in my

60
00:02:01,680 --> 00:02:02,799
buffer, so I'm just going to go ahead

61
00:02:02,799 --> 00:02:05,759
and paste it. Um, okay. So, let's see.

62
00:02:05,759 --> 00:02:07,360
Let's see what it says. Our friends are

63
00:02:07,360 --> 00:02:08,640
getting married later this year, as I

64
00:02:08,640 --> 00:02:10,720
said, Minia and Sarah. And we want the

65
00:02:10,720 --> 00:02:12,879
agent to help us find an outfit that

66
00:02:12,879 --> 00:02:15,520
matches the dress code. uh propose a few

67
00:02:15,520 --> 00:02:17,840
options. Nice mid luxury taking into

68
00:02:17,840 --> 00:02:21,040
account venue and weather. We also want

69
00:02:21,040 --> 00:02:23,280
to find us some hotels and as Edward

70
00:02:23,280 --> 00:02:25,760
said, don't forget the gift. Um so let's

71
00:02:25,760 --> 00:02:27,840
see and

72
00:02:27,840 --> 00:02:30,319
send the prompt away. As Sam said, agent

73
00:02:30,319 --> 00:02:32,640
uses a computer. Uh so in the beginning

74
00:02:32,640 --> 00:02:34,959
it sets up its environment. It it you

75
00:02:34,959 --> 00:02:38,000
know it'll take a minute or two or not

76
00:02:38,000 --> 00:02:39,680
really 5 seconds to set up its

77
00:02:39,680 --> 00:02:41,440
environment. And in this case, as you

78
00:02:41,440 --> 00:02:43,840
see, it understands the prompt. It's

79
00:02:43,840 --> 00:02:46,319
asking for me for a clarification. I'm

80
00:02:46,319 --> 00:02:48,000
just going to let it just continue and

81
00:02:48,000 --> 00:02:51,120
work. Anyway, um I think it got confused

82
00:02:51,120 --> 00:02:54,239
by saying, "Oh, where's the um what

83
00:02:54,239 --> 00:02:55,680
exactly is the time of the date of the

84
00:02:55,680 --> 00:02:57,200
wedding?" I think it'll figure out using

85
00:02:57,200 --> 00:02:59,840
the website. Okay, cool. So, now it's

86
00:02:59,840 --> 00:03:01,760
kicked off. It's starting the process,

87
00:03:01,760 --> 00:03:03,920
the prompt, and it's open up a browser.

88
00:03:03,920 --> 00:03:04,959
And to walk you through what's

89
00:03:04,959 --> 00:03:06,800
happening, here's

90
00:03:06,800 --> 00:03:09,040
Yeah. So, as mentioned, we gave the

91
00:03:09,040 --> 00:03:10,879
agent access to its own virtual

92
00:03:10,879 --> 00:03:13,280
computer, and the computer has many

93
00:03:13,280 --> 00:03:14,720
different tools installed, and it can

94
00:03:14,720 --> 00:03:16,239
choose which to use as it's working

95
00:03:16,239 --> 00:03:18,640
through the task. So, in chat GPT, you

96
00:03:18,640 --> 00:03:21,360
can see a visualization of the agent's

97
00:03:21,360 --> 00:03:23,680
computer screen, and you can see

98
00:03:23,680 --> 00:03:25,519
overlaid its chain of thought in text,

99
00:03:25,519 --> 00:03:27,200
and that's what it's thinking as it's

100
00:03:27,200 --> 00:03:28,480
working through the task and deciding

101
00:03:28,480 --> 00:03:30,799
what to do next. We gave the agent

102
00:03:30,799 --> 00:03:32,400
access to two different ways to browse

103
00:03:32,400 --> 00:03:34,560
the internet. First, we gave it a text

104
00:03:34,560 --> 00:03:36,159
browser, and this is similar to the deep

105
00:03:36,159 --> 00:03:38,000
research tool. And this is what lets it

106
00:03:38,000 --> 00:03:40,159
really efficiently and quickly read many

107
00:03:40,159 --> 00:03:43,440
web pages um um and search for them. And

108
00:03:43,440 --> 00:03:45,040
we also gave it access to a visual

109
00:03:45,040 --> 00:03:46,319
browser. And this is similar to the

110
00:03:46,319 --> 00:03:48,239
operator tool. And this is what lets it

111
00:03:48,239 --> 00:03:50,159
actually interact with the UI of a web

112
00:03:50,159 --> 00:03:52,720
page. So it can um drag things. It can

113
00:03:52,720 --> 00:03:54,879
use the cursor to click around. It can

114
00:03:54,879 --> 00:03:57,280
open UI components. It can fill out

115
00:03:57,280 --> 00:03:59,920
forms and enter text and text areas.

116
00:03:59,920 --> 00:04:02,560
It's very flexible. So those two tools

117
00:04:02,560 --> 00:04:04,720
are very complimentary. And then we also

118
00:04:04,720 --> 00:04:06,720
gave it access to its own terminal so

119
00:04:06,720 --> 00:04:08,720
that it can run code and it can also

120
00:04:08,720 --> 00:04:10,640
generate and analyze files like slide

121
00:04:10,640 --> 00:04:12,879
decks and spreadsheets. And then through

122
00:04:12,879 --> 00:04:14,560
the terminal it's also able to call

123
00:04:14,560 --> 00:04:17,840
APIs. So both public APIs and APIs to

124
00:04:17,840 --> 00:04:19,840
access your private data sources like

125
00:04:19,840 --> 00:04:22,479
Google Drive, Google Calendar, GitHub,

126
00:04:22,479 --> 00:04:25,360
SharePoint and many others um and only

127
00:04:25,360 --> 00:04:26,960
if you explicitly connect them similar

128
00:04:26,960 --> 00:04:28,960
to deep research connectors. And then it

129
00:04:28,960 --> 00:04:31,680
also has access to the image gen API so

130
00:04:31,680 --> 00:04:34,240
it can create nice visuals for um slide

131
00:04:34,240 --> 00:04:36,080
decks and other things as it's working

132
00:04:36,080 --> 00:04:38,240
through its tasks.

133
00:04:38,240 --> 00:04:40,800
How is deciding which tools to use here?

134
00:04:40,800 --> 00:04:42,560
Yes, we train the model to move between

135
00:04:42,560 --> 00:04:44,160
these capabilities with reinforcement

136
00:04:44,160 --> 00:04:46,080
learning. This is the first model we

137
00:04:46,080 --> 00:04:48,880
trained that has access to this unified

138
00:04:48,880 --> 00:04:52,000
tool box. A text browser, a GUI browser

139
00:04:52,000 --> 00:04:53,840
and a terminal all in one virtual

140
00:04:53,840 --> 00:04:57,120
machine. To guide its learning, we

141
00:04:57,120 --> 00:04:59,360
created hard tasks that require using

142
00:04:59,360 --> 00:05:01,919
all these tools. This allows the model

143
00:05:01,919 --> 00:05:04,000
not only to learn how to use these

144
00:05:04,000 --> 00:05:06,160
tools, but also when to use which tool

145
00:05:06,160 --> 00:05:08,400
depending on the task at hand. At the

146
00:05:08,400 --> 00:05:10,400
beginning of the training, the model

147
00:05:10,400 --> 00:05:12,880
might attempt to use all these tools to

148
00:05:12,880 --> 00:05:15,600
solve a relatively simple problem. Over

149
00:05:15,600 --> 00:05:17,840
time, as we reward the model for solving

150
00:05:17,840 --> 00:05:20,560
problems correctly and efficiently, the

151
00:05:20,560 --> 00:05:24,080
model will have smarter tool choice.

152
00:05:24,080 --> 00:05:27,360
For example, if you ask a model to uh

153
00:05:27,360 --> 00:05:29,039
find a restaurant with specific

154
00:05:29,039 --> 00:05:31,919
requirements and make a reservation, the

155
00:05:31,919 --> 00:05:34,479
model may typically just start a deep

156
00:05:34,479 --> 00:05:36,160
research in the text browser to find

157
00:05:36,160 --> 00:05:39,039
some candidates, then switch to the GUI

158
00:05:39,039 --> 00:05:42,160
browser to view photos of food, uh check

159
00:05:42,160 --> 00:05:45,600
availability, and complete the booking.

160
00:05:45,600 --> 00:05:48,000
Similarly, for creative task like

161
00:05:48,000 --> 00:05:50,160
creating an artifact, the model will

162
00:05:50,160 --> 00:05:51,680
first search online for public

163
00:05:51,680 --> 00:05:54,479
resources, then switch to the terminal

164
00:05:54,479 --> 00:05:57,039
to do some code editing to compile the

165
00:05:57,039 --> 00:05:59,919
artifact and finally verify the final

166
00:05:59,919 --> 00:06:02,960
outputs in the GUI browser. With this,

167
00:06:02,960 --> 00:06:05,600
we truly feel like we brought together

168
00:06:05,600 --> 00:06:08,240
the best of deep research and operator

169
00:06:08,240 --> 00:06:11,759
and added some extra sparkle.

170
00:06:11,759 --> 00:06:14,000
That's right. Yeah. So to put this

171
00:06:14,000 --> 00:06:15,520
project in context, I want to give a bit

172
00:06:15,520 --> 00:06:18,000
of history. So a few months ago, we

173
00:06:18,000 --> 00:06:20,960
shipped operator in January and this was

174
00:06:20,960 --> 00:06:23,120
our agent that lets you do online tasks

175
00:06:23,120 --> 00:06:25,759
like book reservations and um send

176
00:06:25,759 --> 00:06:27,840
emails and then two weeks later we

177
00:06:27,840 --> 00:06:29,919
shipped deep research and deep research

178
00:06:29,919 --> 00:06:31,919
is a tool that lets you do in-depth

179
00:06:31,919 --> 00:06:35,759
internet research and output highquality

180
00:06:35,759 --> 00:06:39,280
um um research reports. And after launch

181
00:06:39,280 --> 00:06:41,039
we realized that actually these two

182
00:06:41,039 --> 00:06:42,319
approaches are actually deeply

183
00:06:42,319 --> 00:06:44,160
complimentary.

184
00:06:44,160 --> 00:06:46,400
Um for example operator has some trouble

185
00:06:46,400 --> 00:06:48,720
reading super long articles. Um it has

186
00:06:48,720 --> 00:06:50,400
to scroll. It takes a long time. But

187
00:06:50,400 --> 00:06:51,759
that's something that deep research is

188
00:06:51,759 --> 00:06:56,240
good at. Conversely operator uh uh deep

189
00:06:56,240 --> 00:06:58,240
research isn't as good at interacting

190
00:06:58,240 --> 00:07:00,319
with web pages interactive elements

191
00:07:00,319 --> 00:07:03,199
visual uh highly visual web pages but

192
00:07:03,199 --> 00:07:04,800
that's something that operator excels

193
00:07:04,800 --> 00:07:08,639
at. So uh yeah we felt these approaches

194
00:07:08,639 --> 00:07:11,120
were complimentary and then we we were

195
00:07:11,120 --> 00:07:13,120
also looking at some customer feedback.

196
00:07:13,120 --> 00:07:14,880
So for example one of our most highly

197
00:07:14,880 --> 00:07:17,120
requested features for deep research was

198
00:07:17,120 --> 00:07:18,960
the ability to log into websites and

199
00:07:18,960 --> 00:07:20,960
access authenticated sources. That's

200
00:07:20,960 --> 00:07:22,880
something that operator can do.

201
00:07:22,880 --> 00:07:24,000
I've been waiting for that for a long

202
00:07:24,000 --> 00:07:24,560
time.

203
00:07:24,560 --> 00:07:26,160
Yeah.

204
00:07:26,160 --> 00:07:28,479
Um another thing is that we were looking

205
00:07:28,479 --> 00:07:29,840
at the prompts that people were trying

206
00:07:29,840 --> 00:07:31,520
for operator and we saw that they were

207
00:07:31,520 --> 00:07:32,880
actually more deep research type

208
00:07:32,880 --> 00:07:35,199
prompts. for example, plan a trip and

209
00:07:35,199 --> 00:07:38,240
then book it. And so, yeah, we we really

210
00:07:38,240 --> 00:07:39,360
feel like we're bringing the best of

211
00:07:39,360 --> 00:07:41,440
both worlds here. And on a personal

212
00:07:41,440 --> 00:07:42,800
note, we've all been friends for a

213
00:07:42,800 --> 00:07:44,160
while, and it's really exciting to be

214
00:07:44,160 --> 00:07:46,479
working together. So, speaking of

215
00:07:46,479 --> 00:07:48,960
matches made in heaven, how is the

216
00:07:48,960 --> 00:07:50,319
wedding planning going?

217
00:07:50,319 --> 00:07:51,759
It's amazing to watch. This is an

218
00:07:51,759 --> 00:07:53,599
example of a task I hate doing. This can

219
00:07:53,599 --> 00:07:55,520
like ruin like, you know, multiple hours

220
00:07:55,520 --> 00:07:56,960
for me as I get sucked into these rabbit

221
00:07:56,960 --> 00:07:58,160
holes. So, just watching this as you

222
00:07:58,160 --> 00:07:59,520
guys have been talking click through

223
00:07:59,520 --> 00:08:01,199
this and just like do the whole thing is

224
00:08:01,199 --> 00:08:03,360
really quite remarkable. Yeah, totally.

225
00:08:03,360 --> 00:08:06,560
Um, looks like it started off by

226
00:08:06,560 --> 00:08:08,560
figuring out the weather. One of the

227
00:08:08,560 --> 00:08:11,280
cool features, um, is that, you know, as

228
00:08:11,280 --> 00:08:12,560
some of these tasks may take a little

229
00:08:12,560 --> 00:08:14,160
bit longer, you can just go back and see

230
00:08:14,160 --> 00:08:15,759
what it was doing. So, that's what we're

231
00:08:15,759 --> 00:08:17,199
exactly going to do. Looks like it went

232
00:08:17,199 --> 00:08:18,720
through the website to use the text

233
00:08:18,720 --> 00:08:21,039
browser. Interestingly, for that, now

234
00:08:21,039 --> 00:08:22,400
it's looking through the suits for

235
00:08:22,400 --> 00:08:23,919
Edward. I think it'll find something

236
00:08:23,919 --> 00:08:25,360
good. Here you can see it switched over

237
00:08:25,360 --> 00:08:27,199
to actually a visual browser to make

238
00:08:27,199 --> 00:08:28,960
sure suit will look really good on

239
00:08:28,960 --> 00:08:31,280
Edward.

240
00:08:31,280 --> 00:08:34,560
And now looks like yeah, it's got

241
00:08:34,560 --> 00:08:36,880
chugging along, figuring out what to do.

242
00:08:36,880 --> 00:08:39,599
Um, and still on suits and now probably

243
00:08:39,599 --> 00:08:41,919
getting to the gifts section. Um, okay,

244
00:08:41,919 --> 00:08:43,279
cool. So, this is going to take a while.

245
00:08:43,279 --> 00:08:44,959
As Sam said, these tasks sometimes can

246
00:08:44,959 --> 00:08:46,160
take a long time. So, it's going to

247
00:08:46,160 --> 00:08:47,680
continue doing hopefully much faster

248
00:08:47,680 --> 00:08:49,760
than we will do. Um, should we do

249
00:08:49,760 --> 00:08:51,600
something else while it's doing it? I

250
00:08:51,600 --> 00:08:53,519
think the team really wanted the um

251
00:08:53,519 --> 00:08:55,279
stickers, some stickers for the for the

252
00:08:55,279 --> 00:08:56,480
launch. Should we do that?

253
00:08:56,480 --> 00:08:57,279
Yeah, cool.

254
00:08:57,279 --> 00:08:59,040
All right. So, we have a team mascot,

255
00:08:59,040 --> 00:09:00,320
which is one of our colleagues, Bunny

256
00:09:00,320 --> 00:09:03,279
Doodle. really really cute tell you. Um

257
00:09:03,279 --> 00:09:06,080
and we're going to try and bring um get

258
00:09:06,080 --> 00:09:08,480
some laptop stickers for everybody. Uh

259
00:09:08,480 --> 00:09:10,480
one of the favorite features for agent

260
00:09:10,480 --> 00:09:13,120
is given that trajectories can take 15

261
00:09:13,120 --> 00:09:15,040
minutes, 20 minutes, 30 minutes

262
00:09:15,040 --> 00:09:17,120
depending on the complexity of the task.

263
00:09:17,120 --> 00:09:19,120
Um a lot of times the you might need to

264
00:09:19,120 --> 00:09:20,560
help the agent. Agent might need to ask

265
00:09:20,560 --> 00:09:22,480
you clarifications, confirmations and

266
00:09:22,480 --> 00:09:25,040
things like that. Um so I love to use it

267
00:09:25,040 --> 00:09:26,640
on the go. So I'm going to use my mobile

268
00:09:26,640 --> 00:09:28,160
phone to actually send the query this

269
00:09:28,160 --> 00:09:30,240
time and then see how it goes.

270
00:09:30,240 --> 00:09:32,880
Okay, so let's see. Okay, so we are on

271
00:09:32,880 --> 00:09:35,519
Chad Gibbdi. Uh I have already selected

272
00:09:35,519 --> 00:09:38,560
the agent mode. I've also inputed our uh

273
00:09:38,560 --> 00:09:40,560
cute mascot and I'm going to quickly

274
00:09:40,560 --> 00:09:43,040
paste a query. So query says make some

275
00:09:43,040 --> 00:09:45,279
swag for the team one by one laptop

276
00:09:45,279 --> 00:09:47,920
stickers and order 500 of them. I'll

277
00:09:47,920 --> 00:09:52,959
also say I like sticker mule

278
00:09:52,959 --> 00:09:55,279
which we have used in the past and send

279
00:09:55,279 --> 00:09:57,200
it off.

280
00:09:57,200 --> 00:10:00,080
Okay. So, just like it was doing on the

281
00:10:00,080 --> 00:10:02,080
web, it's going to take some time, think

282
00:10:02,080 --> 00:10:04,080
about like what's it doing, and it'll

283
00:10:04,080 --> 00:10:07,120
kick off kick off the query. And as it's

284
00:10:07,120 --> 00:10:08,880
going, it'll take some time to kick it

285
00:10:08,880 --> 00:10:11,200
off. Is it Oh, there we go. So, it'll

286
00:10:11,200 --> 00:10:12,480
start working on it. Looks like it's

287
00:10:12,480 --> 00:10:14,720
starting to create the anime art. It'll

288
00:10:14,720 --> 00:10:16,640
probably use image that Isa referred

289
00:10:16,640 --> 00:10:18,399
earlier on to create hopefully an anime

290
00:10:18,399 --> 00:10:20,240
art. We'll see how it comes out. While

291
00:10:20,240 --> 00:10:21,760
that's going, anything else we want to

292
00:10:21,760 --> 00:10:22,399
do?

293
00:10:22,399 --> 00:10:24,720
Oh, yeah. I also need a pair of shoes

294
00:10:24,720 --> 00:10:26,320
because my shoes got damaged.

295
00:10:26,320 --> 00:10:27,360
How did they get damaged?

296
00:10:27,360 --> 00:10:28,560
Uh, by the rain

297
00:10:28,560 --> 00:10:30,000
in SF.

298
00:10:30,000 --> 00:10:30,800
Yes.

299
00:10:30,800 --> 00:10:32,160
Cool. All right. Uh, well, let's get

300
00:10:32,160 --> 00:10:34,240
Edward a pair of shoes as well. So, oh,

301
00:10:34,240 --> 00:10:40,320
can you also find us um pair of men's

302
00:10:40,320 --> 00:10:43,519
dress black shoes in size

303
00:10:43,519 --> 00:10:44,240
9.5?

304
00:10:44,240 --> 00:10:46,000
9.5.

305
00:10:46,000 --> 00:10:47,920
So, one of the key capabilities of the

306
00:10:47,920 --> 00:10:49,920
model is being able to interrupt. I

307
00:10:49,920 --> 00:10:51,920
think you know as trajectories take long

308
00:10:51,920 --> 00:10:53,760
time or whatever time it's really

309
00:10:53,760 --> 00:10:56,720
important for us to for it to feel very

310
00:10:56,720 --> 00:10:59,120
multi-turn so the users can interject

311
00:10:59,120 --> 00:11:01,120
user can direct it user can give it more

312
00:11:01,120 --> 00:11:02,640
guidance less guidance whatever we want

313
00:11:02,640 --> 00:11:04,320
to do and that's what we're doing here

314
00:11:04,320 --> 00:11:07,040
we essentially the the model was

315
00:11:07,040 --> 00:11:08,720
chugging along figuring out all the

316
00:11:08,720 --> 00:11:10,240
things that we had asked before and in

317
00:11:10,240 --> 00:11:12,320
this case we essentially said hey can

318
00:11:12,320 --> 00:11:16,000
you also uh get us a pair of men's black

319
00:11:16,000 --> 00:11:18,160
shoes and now it's thinking and soon

320
00:11:18,160 --> 00:11:19,839
enough hopefully it'll take that into

321
00:11:19,839 --> 00:11:22,000
account and keep going uh into its

322
00:11:22,000 --> 00:11:23,600
trajectory. There we go. So, it said

323
00:11:23,600 --> 00:11:25,120
acknowledge the interruption. It said,

324
00:11:25,120 --> 00:11:26,880
"Okay, cool. I'll also research men's

325
00:11:26,880 --> 00:11:29,600
black shoes in size 9.5." Um, and then

326
00:11:29,600 --> 00:11:31,680
it'll probably get on its way. Um, but

327
00:11:31,680 --> 00:11:33,120
maybe Issa can tell us a little bit more

328
00:11:33,120 --> 00:11:34,240
about how that works.

329
00:11:34,240 --> 00:11:36,320
Yeah, sure. So, as you can see, the

330
00:11:36,320 --> 00:11:38,079
agent is very collaborative, and this

331
00:11:38,079 --> 00:11:39,920
was really important to us when we were

332
00:11:39,920 --> 00:11:41,200
training the model and building the

333
00:11:41,200 --> 00:11:42,880
product. If you were asking another

334
00:11:42,880 --> 00:11:44,399
person to do a task for you that would

335
00:11:44,399 --> 00:11:45,519
take them a really long time to

336
00:11:45,519 --> 00:11:46,959
complete, you'd probably give them some

337
00:11:46,959 --> 00:11:48,800
instructions to start and then they

338
00:11:48,800 --> 00:11:50,640
might ask you some clarifying questions

339
00:11:50,640 --> 00:11:52,320
and then they'd start the task and maybe

340
00:11:52,320 --> 00:11:53,600
realize, oh, they need more

341
00:11:53,600 --> 00:11:55,440
clarification from you or they need your

342
00:11:55,440 --> 00:11:56,880
permission to sign into something or do

343
00:11:56,880 --> 00:11:58,560
something on your behalf and then you

344
00:11:58,560 --> 00:12:00,240
might realize, oh, I forgot to mention

345
00:12:00,240 --> 00:12:02,640
this thing or um what's your status? How

346
00:12:02,640 --> 00:12:04,240
are you doing? Can I help redirect you

347
00:12:04,240 --> 00:12:05,760
if you're getting along the wrong path

348
00:12:05,760 --> 00:12:07,760
or something? And so similarly for these

349
00:12:07,760 --> 00:12:09,680
really longrunning agentic tasks, it's

350
00:12:09,680 --> 00:12:11,519
very important that both the user and

351
00:12:11,519 --> 00:12:13,600
the agent are able to initiate

352
00:12:13,600 --> 00:12:15,519
communication with each other so that um

353
00:12:15,519 --> 00:12:17,200
the agent is able to most effectively

354
00:12:17,200 --> 00:12:19,360
help you with your tasks. And so this is

355
00:12:19,360 --> 00:12:20,560
something that we actually trained into

356
00:12:20,560 --> 00:12:22,320
the model. We trained it to be able to

357
00:12:22,320 --> 00:12:24,160
ask clarifying questions, not every

358
00:12:24,160 --> 00:12:26,240
single time like deep research. Um we

359
00:12:26,240 --> 00:12:28,800
also asked it we also trained it to be

360
00:12:28,800 --> 00:12:30,560
interruptible as Yash just showed. And

361
00:12:30,560 --> 00:12:32,000
also sometimes it will ask you for

362
00:12:32,000 --> 00:12:33,519
clarification and confirmation

363
00:12:33,519 --> 00:12:35,680
mid-trajectory.

364
00:12:35,680 --> 00:12:38,079
Yeah. And part of working with agent is

365
00:12:38,079 --> 00:12:40,480
that well sometimes it'll make mistakes.

366
00:12:40,480 --> 00:12:42,079
And that's why we felt it was important

367
00:12:42,079 --> 00:12:44,079
to train the model to ask you for

368
00:12:44,079 --> 00:12:45,920
confirmation at the last step of

369
00:12:45,920 --> 00:12:49,279
important steps. Um so for example maybe

370
00:12:49,279 --> 00:12:51,519
before it's going to send the email um

371
00:12:51,519 --> 00:12:53,440
it'll ask you to take a look at the

372
00:12:53,440 --> 00:12:54,720
draft and whether it makes sense and

373
00:12:54,720 --> 00:12:56,079
whether there are any embarrassing

374
00:12:56,079 --> 00:12:59,200
typos. Um, and if there are, then you

375
00:12:59,200 --> 00:13:01,360
can either ask it to fix it or you can

376
00:13:01,360 --> 00:13:03,440
directly take over the browser and jump

377
00:13:03,440 --> 00:13:06,079
right into the um, agents environment

378
00:13:06,079 --> 00:13:09,040
and correct it yourself. And that way it

379
00:13:09,040 --> 00:13:10,720
feels collaborative and you can um,

380
00:13:10,720 --> 00:13:13,680
really work with the agent.

381
00:13:13,680 --> 00:13:15,120
Should we look at maybe one more demo?

382
00:13:15,120 --> 00:13:17,279
We've got this uh, sort of fun tradition

383
00:13:17,279 --> 00:13:19,600
in live streams of using uh, using our

384
00:13:19,600 --> 00:13:21,120
newest models to sort of evaluate

385
00:13:21,120 --> 00:13:23,040
themselves or do something kind of meta.

386
00:13:23,040 --> 00:13:24,240
Anything like that we could do?

387
00:13:24,240 --> 00:13:27,440
Yeah, let's do it.

388
00:13:27,440 --> 00:13:28,320
So um

389
00:13:28,320 --> 00:13:29,440
I think people would love to know how

390
00:13:29,440 --> 00:13:30,320
good the model is.

391
00:13:30,320 --> 00:13:33,920
Yes. So this is a prompt we previously

392
00:13:33,920 --> 00:13:36,880
gave the a agent yesterday. So basically

393
00:13:36,880 --> 00:13:38,959
it asks the model to pull its own

394
00:13:38,959 --> 00:13:40,959
evalution number from our Google job

395
00:13:40,959 --> 00:13:43,440
connector and make some slides. So we

396
00:13:43,440 --> 00:13:44,959
want to keep it simple like no

397
00:13:44,959 --> 00:13:47,360
introduction no conclusion just present

398
00:13:47,360 --> 00:13:50,000
the results with in the charts. As you

399
00:13:50,000 --> 00:13:52,160
can see now the model is connecting to

400
00:13:52,160 --> 00:13:55,120
the Google Drive API and uh then search

401
00:13:55,120 --> 00:13:57,600
within API it right now it looks like

402
00:13:57,600 --> 00:13:59,920
the first result is very relevant. So

403
00:13:59,920 --> 00:14:02,720
it's reading the first result.

404
00:14:02,720 --> 00:14:04,959
Now it's reading the first result uh in

405
00:14:04,959 --> 00:14:07,920
details. Uh let's accelerate this uh

406
00:14:07,920 --> 00:14:12,800
replay. So then the model might read

407
00:14:12,800 --> 00:14:15,279
from the result again and write some

408
00:14:15,279 --> 00:14:16,959
code.

409
00:14:16,959 --> 00:14:19,519
So here you can see that the model is

410
00:14:19,519 --> 00:14:21,920
using the image generation model called

411
00:14:21,920 --> 00:14:24,480
image generation tool to generate some

412
00:14:24,480 --> 00:14:28,079
decorations for the slides.

413
00:14:28,079 --> 00:14:30,160
And let's see what's the first slide the

414
00:14:30,160 --> 00:14:33,399
model made.

415
00:14:33,920 --> 00:14:35,920
So here the model is writing some code

416
00:14:35,920 --> 00:14:38,399
that will be compiled to be the final

417
00:14:38,399 --> 00:14:41,120
slides. So this is the first slide the

418
00:14:41,120 --> 00:14:44,160
model make in this demo which looks okay

419
00:14:44,160 --> 00:14:46,240
but it's not polished enough.

420
00:14:46,240 --> 00:14:48,240
One of the key feature in reinforcement

421
00:14:48,240 --> 00:14:50,160
learning is that the model will re

422
00:14:50,160 --> 00:14:52,240
review its own results and refine the

423
00:14:52,240 --> 00:14:55,120
results to to deliver a good final

424
00:14:55,120 --> 00:14:57,839
results. Let's see what's the finally

425
00:14:57,839 --> 00:15:00,320
what the model give us.

426
00:15:00,320 --> 00:15:04,000
We can click skip and then the model

427
00:15:04,000 --> 00:15:07,519
give us a good uh PowerPoint file. So

428
00:15:07,519 --> 00:15:09,040
it's a real PowerPoint that you can

429
00:15:09,040 --> 00:15:14,040
download and open it in any software.

430
00:15:14,639 --> 00:15:19,279
Let's open it in uh in the office. So

431
00:15:19,279 --> 00:15:22,160
let's present the slides the model just

432
00:15:22,160 --> 00:15:23,839
generated.

433
00:15:23,839 --> 00:15:27,120
First are two intelligence benchmarks.

434
00:15:27,120 --> 00:15:30,480
Humanities last exam is a benchmark that

435
00:15:30,480 --> 00:15:33,519
measures AI's ability to solve a broad

436
00:15:33,519 --> 00:15:37,120
range of subjects on hard problems. We

437
00:15:37,120 --> 00:15:40,320
evaluate the models with two settings

438
00:15:40,320 --> 00:15:43,440
with and without tool use.

439
00:15:43,440 --> 00:15:45,920
We can see that the agent modes the raw

440
00:15:45,920 --> 00:15:48,720
intelligence is already pretty nice and

441
00:15:48,720 --> 00:15:50,880
with access to all tools nearly double

442
00:15:50,880 --> 00:15:54,720
the performance to 42%.

443
00:15:54,720 --> 00:15:56,720
When evaluating models on humanity's

444
00:15:56,720 --> 00:15:59,360
last exam, especially with the browsing

445
00:15:59,360 --> 00:16:01,759
ability, we have a two-layer

446
00:16:01,759 --> 00:16:04,399
decontamination that ensure that the

447
00:16:04,399 --> 00:16:07,680
model doesn't cheat on this benchmark.

448
00:16:07,680 --> 00:16:10,079
Front TMS is a benchmark that measures

449
00:16:10,079 --> 00:16:11,839
advanced mathematical reasoning ability

450
00:16:11,839 --> 00:16:13,680
of models.

451
00:16:13,680 --> 00:16:16,000
Different from our baseline of mini and

452
00:16:16,000 --> 00:16:18,560
03 which use Python with function

453
00:16:18,560 --> 00:16:21,440
coding. We give the agent model all

454
00:16:21,440 --> 00:16:23,440
available tools like a browser, a

455
00:16:23,440 --> 00:16:26,320
computer and a terminal. The agent

456
00:16:26,320 --> 00:16:29,360
achieves new state art of 27% on this

457
00:16:29,360 --> 00:16:31,440
benchmark with the help of all these

458
00:16:31,440 --> 00:16:34,440
tools.

459
00:16:34,639 --> 00:16:36,880
Next, we evaluated the model on two

460
00:16:36,880 --> 00:16:39,519
agentic benchmarks. Web arena is a

461
00:16:39,519 --> 00:16:41,519
benchmark that measures web agents

462
00:16:41,519 --> 00:16:43,600
ability so to solve real world web

463
00:16:43,600 --> 00:16:47,279
tasks. The agent model improves over

464
00:16:47,279 --> 00:16:51,360
previous O3 model that powers the core.

465
00:16:51,360 --> 00:16:54,399
Browse comp is a benchmark we introduced

466
00:16:54,399 --> 00:16:56,240
earlier this year that measures the

467
00:16:56,240 --> 00:16:58,880
browsing agents ability to search and

468
00:16:58,880 --> 00:17:02,320
find uh how to locate information.

469
00:17:02,320 --> 00:17:03,839
The agent model significantly

470
00:17:03,839 --> 00:17:06,160
outperforms 03 and deep research on this

471
00:17:06,160 --> 00:17:11,679
benchmark achieving 69% pass rate.

472
00:17:11,679 --> 00:17:14,559
Finally, we care about how the users

473
00:17:14,559 --> 00:17:16,959
will benefit from our model in the real

474
00:17:16,959 --> 00:17:19,919
world. Spreadsheet bench is a benchmark

475
00:17:19,919 --> 00:17:21,919
that measures the model's ability to

476
00:17:21,919 --> 00:17:24,400
edit spreadsheets derived from the real

477
00:17:24,400 --> 00:17:28,079
world use case. Here the agent model

478
00:17:28,079 --> 00:17:30,480
with the liberal office and the computer

479
00:17:30,480 --> 00:17:34,000
tool can already solve 30% of the task

480
00:17:34,000 --> 00:17:36,480
when we give the model the access to the

481
00:17:36,480 --> 00:17:39,840
raw Excel file in the terminal which

482
00:17:39,840 --> 00:17:44,000
further boost the performance to 45%.

483
00:17:44,000 --> 00:17:46,000
Finally we evated the model on an

484
00:17:46,000 --> 00:17:48,000
internal banking benchmark. The bench

485
00:17:48,000 --> 00:17:49,760
this benchmark evaluated the model's

486
00:17:49,760 --> 00:17:52,559
ability to to conduct first to third

487
00:17:52,559 --> 00:17:55,679
year investment bank uh banking analyst

488
00:17:55,679 --> 00:17:58,799
tasks such as like putting together a

489
00:17:58,799 --> 00:18:00,559
three statement financial model for

490
00:18:00,559 --> 00:18:04,000
Fortune uh 500 company in this

491
00:18:04,000 --> 00:18:06,160
benchmark. The agent model significantly

492
00:18:06,160 --> 00:18:08,080
outperforms the previous deep research

493
00:18:08,080 --> 00:18:11,760
and all three models. As you can see

494
00:18:11,760 --> 00:18:13,919
this model is one of the most powerful

495
00:18:13,919 --> 00:18:16,080
model we've ever trained.

496
00:18:16,080 --> 00:18:18,960
It's not only good on benchmarks, it's

497
00:18:18,960 --> 00:18:22,480
also capable of reasoning, browsing, and

498
00:18:22,480 --> 00:18:24,720
tackling real world tasks at a level

499
00:18:24,720 --> 00:18:28,480
that we cannot imagine three months ago.

500
00:18:28,480 --> 00:18:31,600
That's right. Um, as Edward said, um, we

501
00:18:31,600 --> 00:18:32,799
think we've trained a very powerful

502
00:18:32,799 --> 00:18:35,280
model and a lot of the power comes from

503
00:18:35,280 --> 00:18:38,240
its ability to browse the internet. And

504
00:18:38,240 --> 00:18:40,240
as we know, the internet can be a scary

505
00:18:40,240 --> 00:18:42,400
place. There are all sorts of hackers

506
00:18:42,400 --> 00:18:45,120
trying to steal your information, scams,

507
00:18:45,120 --> 00:18:48,480
uh fishing attempts. Um and agent isn't

508
00:18:48,480 --> 00:18:51,120
immune to all these things. Um one

509
00:18:51,120 --> 00:18:53,360
particular thing we're worried about is

510
00:18:53,360 --> 00:18:55,520
a new uh attack called prompt

511
00:18:55,520 --> 00:18:57,120
injections.

512
00:18:57,120 --> 00:18:59,840
This is where let's say you ask agent to

513
00:18:59,840 --> 00:19:02,080
buy you a book and you give it your

514
00:19:02,080 --> 00:19:04,400
credit card information to do that.

515
00:19:04,400 --> 00:19:06,240
Agent might stumble upon a malicious

516
00:19:06,240 --> 00:19:08,559
website that asks it, "Oh, enter your

517
00:19:08,559 --> 00:19:10,400
credit card information here. it'll help

518
00:19:10,400 --> 00:19:12,799
you with your task. An agent, which is

519
00:19:12,799 --> 00:19:15,200
trained to be helpful, might decide

520
00:19:15,200 --> 00:19:18,080
that's a good idea.

521
00:19:18,080 --> 00:19:19,760
We've done a lot of work to try to

522
00:19:19,760 --> 00:19:22,320
ensure that this doesn't happen. We've

523
00:19:22,320 --> 00:19:24,240
trained our model to ignore suspicious

524
00:19:24,240 --> 00:19:27,120
instructions on on suspicious websites.

525
00:19:27,120 --> 00:19:29,039
We've also have uh we also have layers

526
00:19:29,039 --> 00:19:32,000
of monitors that kind of peer over the

527
00:19:32,000 --> 00:19:33,760
agent's shoulder and watch it as it's

528
00:19:33,760 --> 00:19:36,480
going um and stop the trajectory if

529
00:19:36,480 --> 00:19:38,799
anything looks suspicious. We can even

530
00:19:38,799 --> 00:19:41,919
update these in real time if new attacks

531
00:19:41,919 --> 00:19:44,160
are found in the wild.

532
00:19:44,160 --> 00:19:45,919
That said though, you know, this is a

533
00:19:45,919 --> 00:19:47,760
cutting edge product. This is a new

534
00:19:47,760 --> 00:19:50,000
surface and we can't stop everything.

535
00:19:50,000 --> 00:19:51,280
And so that's why I feel it's very

536
00:19:51,280 --> 00:19:52,559
important for the audience to be aware

537
00:19:52,559 --> 00:19:55,360
of the risks involved in using agent.

538
00:19:55,360 --> 00:19:57,440
And um we encourage users to be

539
00:19:57,440 --> 00:19:59,520
proactive in kind of thinking about how

540
00:19:59,520 --> 00:20:01,120
they share their information. You know,

541
00:20:01,120 --> 00:20:02,880
if it's highly sensitive information,

542
00:20:02,880 --> 00:20:06,799
maybe don't share that. um maybe um uh

543
00:20:06,799 --> 00:20:08,799
use our features like takeover mode to

544
00:20:08,799 --> 00:20:10,799
directly input your credit credit card

545
00:20:10,799 --> 00:20:12,880
information into the browser instead of

546
00:20:12,880 --> 00:20:15,679
um giving it to agent. Um we feel like

547
00:20:15,679 --> 00:20:18,640
we've built a very powerful product but

548
00:20:18,640 --> 00:20:20,480
again it's important for our users to

549
00:20:20,480 --> 00:20:21,760
understand the risk involved.

550
00:20:21,760 --> 00:20:23,280
Yeah, I really want to emphasize that I

551
00:20:23,280 --> 00:20:25,520
think this is a new level of capability

552
00:20:25,520 --> 00:20:27,120
in AI. It's a new way to use AI, but

553
00:20:27,120 --> 00:20:28,799
there will be a new set of attacks that

554
00:20:28,799 --> 00:20:30,799
come with that. And society and the

555
00:20:30,799 --> 00:20:33,120
technology will have to evolve and learn

556
00:20:33,120 --> 00:20:34,320
how we're going to mitigate things that

557
00:20:34,320 --> 00:20:36,159
we can't even really imagine yet. Uh, as

558
00:20:36,159 --> 00:20:37,360
people start doing more and more work

559
00:20:37,360 --> 00:20:39,679
this way. Before I wrap up, should we

560
00:20:39,679 --> 00:20:41,840
check in on some of the tasks you kicked

561
00:20:41,840 --> 00:20:42,080
off?

562
00:20:42,080 --> 00:20:46,159
Yeah, let's do it. Um, okay. So, I am

563
00:20:46,159 --> 00:20:48,240
going to open a new tab and make sure

564
00:20:48,240 --> 00:20:51,840
that we can see the progress of our um,

565
00:20:51,840 --> 00:20:55,679
stickers as well. Okay. Let's see. All

566
00:20:55,679 --> 00:20:58,159
right. So, sounds like stickers are

567
00:20:58,159 --> 00:21:00,880
ready. Let me see what it actually Okay.

568
00:21:00,880 --> 00:21:03,200
So, cool thing. This is sort of the end

569
00:21:03,200 --> 00:21:06,720
end result of the took about 7 minutes.

570
00:21:06,720 --> 00:21:08,480
Highly likely figured out everything.

571
00:21:08,480 --> 00:21:09,840
We'll go back and look at the trajectory

572
00:21:09,840 --> 00:21:11,679
and see how it did. But at the end

573
00:21:11,679 --> 00:21:13,679
result, it looks like it's added to the

574
00:21:13,679 --> 00:21:15,360
cart. This is the subtotal. I can just

575
00:21:15,360 --> 00:21:17,360
go ahead and look at it and then figure

576
00:21:17,360 --> 00:21:20,000
out uh I can just take over at this

577
00:21:20,000 --> 00:21:21,600
point as Casey said to enter my credit

578
00:21:21,600 --> 00:21:23,039
card information and then place the

579
00:21:23,039 --> 00:21:25,200
order really quickly. model is asking

580
00:21:25,200 --> 00:21:27,120
for confirmations, etc. as it's supposed

581
00:21:27,120 --> 00:21:29,280
to do. Let's just quickly browse through

582
00:21:29,280 --> 00:21:31,039
the trajectory and see what it actually

583
00:21:31,039 --> 00:21:33,280
did. Oh, it looks like it generated some

584
00:21:33,280 --> 00:21:35,840
stickers. Oh, look at that. That's what

585
00:21:35,840 --> 00:21:38,880
it generated sticker. Cool. So, yeah,

586
00:21:38,880 --> 00:21:40,640
that's the task. I think I can at this

587
00:21:40,640 --> 00:21:42,559
point finish up by myself or I can ask

588
00:21:42,559 --> 00:21:43,919
the model to actually go ahead and do it

589
00:21:43,919 --> 00:21:46,720
for me as well. Let's check on the

590
00:21:46,720 --> 00:21:49,840
wedding. Okay, great. Looks like it just

591
00:21:49,840 --> 00:21:52,720
finished in the nick of time. Uh, okay,

592
00:21:52,720 --> 00:21:55,520
cool. So in this case, as as we said, we

593
00:21:55,520 --> 00:21:57,840
were looking for hotel, stress, uh

594
00:21:57,840 --> 00:22:01,919
suits, and also shoes. So it's come out

595
00:22:01,919 --> 00:22:03,520
with a pretty comprehensive report. It

596
00:22:03,520 --> 00:22:05,840
looks like wedding venue, date, when it

597
00:22:05,840 --> 00:22:10,240
is with the Zilla links, dress codes. It

598
00:22:10,240 --> 00:22:11,600
figured out like what the suit

599
00:22:11,600 --> 00:22:12,960
recommendation should be, where you can

600
00:22:12,960 --> 00:22:14,799
buy. Now I can go ahead and buy myself

601
00:22:14,799 --> 00:22:17,120
or I can ask the agent to go and buy for

602
00:22:17,120 --> 00:22:20,960
me. Um also figured out footwear hurdle

603
00:22:20,960 --> 00:22:23,360
options. It actually looked through all

604
00:22:23,360 --> 00:22:27,120
the oop sorry it looked through all the

605
00:22:27,120 --> 00:22:29,360
availability. You can see actually it

606
00:22:29,360 --> 00:22:31,440
gives screenshots of what it checked. In

607
00:22:31,440 --> 00:22:33,120
this case we use booking.com and it's

608
00:22:33,120 --> 00:22:35,280
able to do that. Also has gift

609
00:22:35,280 --> 00:22:37,360
suggestions etc. And next step I can ask

610
00:22:37,360 --> 00:22:39,760
it as you said the agent says hey if you

611
00:22:39,760 --> 00:22:41,520
need assistance purchasing any item or

612
00:22:41,520 --> 00:22:42,960
have any further adjustments let me know

613
00:22:42,960 --> 00:22:44,880
so we can do that. Um, and I want to

614
00:22:44,880 --> 00:22:46,320
show one last demo which we didn't

615
00:22:46,320 --> 00:22:48,640
really run live but I think it's really

616
00:22:48,640 --> 00:22:51,280
cool and especially because the folks

617
00:22:51,280 --> 00:22:52,880
who are getting married are really into

618
00:22:52,880 --> 00:22:57,679
MLB. U so we asked the agent uh to go

619
00:22:57,679 --> 00:22:59,679
and build an optimal itinary for

620
00:22:59,679 --> 00:23:02,640
visiting all 30 MLB stadiums in just in

621
00:23:02,640 --> 00:23:05,200
case you're thinking of a satical uh and

622
00:23:05,200 --> 00:23:08,159
then design the optimal route prioritize

623
00:23:08,159 --> 00:23:10,960
Hello Kitty nights and whatnot and

624
00:23:10,960 --> 00:23:12,400
present a final plan as a detailed

625
00:23:12,400 --> 00:23:13,520
spreadsheet. I'll really quickly run

626
00:23:13,520 --> 00:23:15,440
through this. Um I think it's just so

627
00:23:15,440 --> 00:23:18,240
fun to see. So again like as we have

628
00:23:18,240 --> 00:23:20,720
thrown shown throughout the the live

629
00:23:20,720 --> 00:23:23,919
stream it uses a multitude of tools uses

630
00:23:23,919 --> 00:23:26,240
container the terminal use using the

631
00:23:26,240 --> 00:23:28,799
browser working through all the details.

632
00:23:28,799 --> 00:23:30,400
It'll probably use again back to the

633
00:23:30,400 --> 00:23:33,200
browser figuring out Hello Kitty nights

634
00:23:33,200 --> 00:23:36,559
and then sports stadium and whatnot. Oh

635
00:23:36,559 --> 00:23:39,520
let's see did I miss the Oh go map.

636
00:23:39,520 --> 00:23:42,080
building a map using code to actually

637
00:23:42,080 --> 00:23:43,919
build it out and then overall we get

638
00:23:43,919 --> 00:23:46,159
like a pretty solid result I think at

639
00:23:46,159 --> 00:23:48,880
the end takes 25 minutes to work where

640
00:23:48,880 --> 00:23:50,400
does the season start and what not you

641
00:23:50,400 --> 00:23:51,919
have a spreadsheet that you can quickly

642
00:23:51,919 --> 00:23:55,760
view inside just right inside Chad GBD

643
00:23:55,760 --> 00:23:57,919
you can map the journey cool looking map

644
00:23:57,919 --> 00:24:00,400
I guess and that's it so this is Chad

645
00:24:00,400 --> 00:24:02,240
GBD agent we hope you really like it and

646
00:24:02,240 --> 00:24:04,000
over to Sam

647
00:24:04,000 --> 00:24:05,919
amazing work all of you and and to your

648
00:24:05,919 --> 00:24:07,440
teams this is I think uh really

649
00:24:07,440 --> 00:24:08,720
something that's going to help people

650
00:24:08,720 --> 00:24:10,720
get worked done uh and have more time to

651
00:24:10,720 --> 00:24:12,240
do the things they want to do. Um I

652
00:24:12,240 --> 00:24:13,520
think it's it's really amazing how much

653
00:24:13,520 --> 00:24:15,360
you've brought together to deliver this

654
00:24:15,360 --> 00:24:17,760
experience and watching the agent sort

655
00:24:17,760 --> 00:24:19,120
of use the internet, make these

656
00:24:19,120 --> 00:24:20,640
spreadsheets, make PowerPoints, whatever

657
00:24:20,640 --> 00:24:22,960
else uh and do all this work is is quite

658
00:24:22,960 --> 00:24:26,000
amazing. We're going live today for pro

659
00:24:26,000 --> 00:24:28,880
plus and team users. Pro users will get

660
00:24:28,880 --> 00:24:30,720
uh 400 queries a month plus some team

661
00:24:30,720 --> 00:24:32,720
users will get 40 a month. Uh the

662
00:24:32,720 --> 00:24:34,000
rollout should be finished by the end of

663
00:24:34,000 --> 00:24:36,159
the day for pro and very soon for plus

664
00:24:36,159 --> 00:24:38,400
and team users. will try to be live for

665
00:24:38,400 --> 00:24:40,799
enterprise and edu by the end of this

666
00:24:40,799 --> 00:24:43,360
month. As Casey mentioned, although this

667
00:24:43,360 --> 00:24:45,360
is an extremely exciting new technology,

668
00:24:45,360 --> 00:24:48,080
there are new risks. Uh people learned

669
00:24:48,080 --> 00:24:49,520
how to use the internet generally pretty

670
00:24:49,520 --> 00:24:50,880
safely, although of course there are

671
00:24:50,880 --> 00:24:52,880
still scammers and other attacks. People

672
00:24:52,880 --> 00:24:54,559
are going to need to learn to use AI

673
00:24:54,559 --> 00:24:56,080
agents. Uh and societyy's going to need

674
00:24:56,080 --> 00:24:57,919
to learn to build up defenses against

675
00:24:57,919 --> 00:25:00,080
attacks on AI agents as well. So we're

676
00:25:00,080 --> 00:25:02,080
starting with a very robust system, lots

677
00:25:02,080 --> 00:25:04,240
of warnings. We will relax that over

678
00:25:04,240 --> 00:25:05,679
time as people get more comfortable with

679
00:25:05,679 --> 00:25:07,600
it. But we do want people to treat this

680
00:25:07,600 --> 00:25:09,919
as a new technology and a new risk

681
00:25:09,919 --> 00:25:12,080
surface and use all of the caution that

682
00:25:12,080 --> 00:25:14,799
Casey talked about. Um, but that said,

683
00:25:14,799 --> 00:25:16,720
we hope you'll love it. Uh, this is

684
00:25:16,720 --> 00:25:18,159
still very early. We will improve it

685
00:25:18,159 --> 00:25:20,640
rapidly and we're excited to see where

686
00:25:20,640 --> 00:25:22,640
it all goes. So, congrats again. Thank

687
00:25:22,640 --> 00:25:26,440
you very much. Hope you enjoy.
字幕中英文转换的网址

中文字幕:

1
00:00:06,480 --> 00:00:08,400
早上好。我们为您准备了美味佳肴。

2
00:00:08,400 --> 00:00:09,840
今天。我们将推出 ChatBT

3
00:00:09,840 --> 00:00:11,519
经纪人。但在开始之前,我

4
00:00:11,519 --> 00:00:12,559
喜欢请团队介绍

5
00:00:12,559 --> 00:00:14,080
他们自己。从 Yosh 开始。

6
00:00:14,080 --> 00:00:17,840
嗨,我是 Yash。我在代理团队工作,

7
00:00:17,840 --> 00:00:20,080
在此之前我曾从事过操作员工作。

8
00:00:20,080 --> 00:00:22,560
你好,我是 Jing。我负责经纪人研究

9
00:00:22,560 --> 00:00:24,400
之前曾进行过深入研究。

10
00:00:24,400 --> 00:00:26,000
嗨,我是 Casey。我是一名研究员

11
00:00:26,000 --> 00:00:27,920
代理商原为运营商。

12
00:00:27,920 --> 00:00:30,560
你好,我是Issa。我是一名特工研究员

13
00:00:30,560 --> 00:00:32,640
以前进行过深入研究。

14
00:00:32,640 --> 00:00:34,880
所以我们开始推出代理

15
00:00:34,880 --> 00:00:36,800
今年早些时候。我们推出了深度

16
00:00:36,800 --> 00:00:38,879
研究,我们推出了运营商和

17
00:00:38,879 --> 00:00:40,160
人们对此感到非常兴奋。

18
00:00:40,160 --> 00:00:42,480
人们可以看到,现在人工智能

19
00:00:42,480 --> 00:00:44,640
去为他们完成复杂的任务。

20
00:00:44,640 --> 00:00:46,079
但我们清楚地认识到

21
00:00:46,079 --> 00:00:48,000
人们真正想要的是让我们带来

22
00:00:48,000 --> 00:00:49,760
将这些功能和更多功能结合在一起。

23
00:00:49,760 --> 00:00:51,920
人们想要一个统一的代理,可以

24
00:00:51,920 --> 00:00:55,039
出发,使用自己的计算机并进行实际操作

25
00:00:55,039 --> 00:00:57,360
对他们来说很复杂的任务,这可能呃

26
00:00:57,360 --> 00:00:59,359
无缝过渡到思考

27
00:00:59,359 --> 00:01:01,520
关于某事采取行动

28
00:01:01,520 --> 00:01:03,359
使用终端中的大量工具,

29
00:01:03,359 --> 00:01:05,360
在网络上点击,甚至制作

30
00:01:05,360 --> 00:01:06,880
比如电子表格和幻灯片

31
00:01:06,880 --> 00:01:08,960
以及更多。并希望人们想要

32
00:01:08,960 --> 00:01:10,159
能够长期做到这一点

33
00:01:10,159 --> 00:01:12,159
地平线和一种普遍的

34
00:01:12,159 --> 00:01:13,840
任务。因此团队一直在努力

35
00:01:13,840 --> 00:01:16,400
很难把这些结合起来。而且

36
00:01:16,400 --> 00:01:18,080
今天我们和经纪人聊了聊。嗯,

37
00:01:18,080 --> 00:01:19,680
给你看可能更容易

38
00:01:19,680 --> 00:01:21,439
而不是继续谈论它。这是

39
00:01:21,439 --> 00:01:23,360
感受我此刻的感受

40
00:01:23,360 --> 00:01:25,280
观察它的工作原理。那么,让我们来看看吧。

41
00:01:25,280 --> 00:01:27,840
太棒了!谢谢,Sam。大家好。

42
00:01:27,840 --> 00:01:29,920
非常高兴与 GBD 代理分享聊天

43
00:01:29,920 --> 00:01:31,600
和大家一起。正如萨姆所说,让我们

44
00:01:31,600 --> 00:01:33,759
直接进入演示。好的,

45
00:01:33,759 --> 00:01:36,159
众所周知,我们位于乍得 GBD,

46
00:01:36,159 --> 00:01:39,119
爱。要打开代理模式,你

47
00:01:39,119 --> 00:01:40,880
只需单击工具菜单并选择

48
00:01:40,880 --> 00:01:43,280
代理人。您也可以直接输入代理人

49
00:01:43,280 --> 00:01:45,040
作曲家栏,它会带你到

50
00:01:45,040 --> 00:01:47,520
代理模式。嗯,爱德华和我有一个

51
00:01:47,520 --> 00:01:49,360
今年晚些时候要去参加婚礼。呃,

52
00:01:49,360 --> 00:01:51,119
这是我们共同的朋友之一的礼物。

53
00:01:51,119 --> 00:01:52,560
我们应该有亚洲

54
00:01:52,560 --> 00:01:53,280
行星?

55
00:01:53,280 --> 00:01:55,680
好的,我们开始吧。我需要一套衣服。还有

56
00:01:55,680 --> 00:01:56,799
别忘了礼物。

57
00:01:56,799 --> 00:01:58,719
好的,太好了。我们不会忘记礼物的。

58
00:01:58,719 --> 00:02:00,240
嗯,有点长

59
00:02:00,240 --> 00:02:01,680
提示,所以我把它复制到我的

60
00:02:01,680 --> 00:02:02,799
缓冲区,所以我要继续

61
00:02:02,799 --> 00:02:05,759
然后粘贴。嗯,好的。那么,我们看看。

62
00:02:05,759 --> 00:02:07,360
让我们看看它说了什么。我们的朋友是

63
00:02:07,360 --> 00:02:08,640
今年晚些时候结婚,因为我

64
00:02:08,640 --> 00:02:10,720
米妮娅和莎拉说道。我们希望

65
00:02:10,720 --> 00:02:12,879
经纪人帮我们找到一套

66
00:02:12,879 --> 00:02:15,520
符合着装要求。呃,推荐几个

67
00:02:15,520 --> 00:02:17,840
选项。不错的中型豪华酒店,

68
00:02:17,840 --> 00:02:21,040
考虑到场地和天气。我们还希望

69
00:02:21,040 --> 00:02:23,280
帮我们找到一些酒店,就像爱德华

70
00:02:23,280 --> 00:02:25,760
说,别忘了礼物。嗯,那我们

71
00:02:25,760 --> 00:02:27,840
看到和

72
00:02:27,840 --> 00:02:30,319
把提示发送出去。正如 Sam 所说,

73
00:02:30,319 --> 00:02:32,640
使用电脑。呃,一开始

74
00:02:32,640 --> 00:02:34,959
它会设置它的环境。它会

75
00:02:34,959 --> 00:02:38,000
知道这需要一两分钟还是不知道

76
00:02:38,000 --> 00:02:39,680
只需 5 秒钟即可设置

77
00:02:39,680 --> 00:02:41,440
环境。在这种情况下,正如你

78
00:02:41,440 --> 00:02:43,840
瞧,它理解了提示。它

79
00:02:43,840 --> 00:02:46,319
要求我澄清。我

80
00:02:46,319 --> 00:02:48,000
就让它继续下去吧

81
00:02:48,000 --> 00:02:51,120
工作。总之,嗯,我觉得搞混了

82
00:02:51,120 --> 00:02:54,239
说“哦,那个什么

83
00:02:54,239 --> 00:02:55,680
正是日期的时间

84
00:02:55,680 --> 00:02:57,200
婚礼?“我想它会弄清楚使用

85
00:02:57,200 --> 00:02:59,840
网站。好的,很酷。所以,现在

86
00:02:59,840 --> 00:03:01,760
开始了。它正在启动这个过程,

87
00:03:01,760 --> 00:03:03,920
提示,然后打开一个浏览器。

88
00:03:03,920 --> 00:03:04,959
并引导你了解

89
00:03:04,959 --> 00:03:06,800
正在发生的事情,这里是

90
00:03:06,800 --> 00:03:09,040
是的。正如之前提到的,我们给了

91
00:03:09,040 --> 00:03:10,879
代理访问自己的虚拟

92
00:03:10,879 --> 00:03:13,280
计算机,并且计算机有很多

93
00:03:13,280 --> 00:03:14,720
安装了不同的工具,它可以

94
00:03:14,720 --> 00:03:16,239
选择使用哪个

95
00:03:16,239 --> 00:03:18,640
完成任务。因此,在聊天 GPT 中,你

96
00:03:18,640 --> 00:03:21,360
可以看到代理的可视化

97
00:03:21,360 --> 00:03:23,680
电脑屏幕上,你可以看到

98
00:03:23,680 --> 00:03:25,519
用文字覆盖其思路,

99
00:03:25,519 --> 00:03:27,200
这就是它的想法,因为它

100
00:03:27,200 --> 00:03:28,480
完成任务并决定

101
00:03:28,480 --> 00:03:30,799
下一步该做什么?我们给了经纪人

102
00:03:30,799 --> 00:03:32,400
可以使用两种不同的方式浏览

103
00:03:32,400 --> 00:03:34,560
互联网。首先,我们给它一个文本

104
00:03:34,560 --> 00:03:36,159
浏览器,这类似于深度

105
00:03:36,159 --> 00:03:38,000
研究工具。这就是它

106
00:03:38,000 --> 00:03:40,159
真正高效、快速地阅读许多

107
00:03:40,159 --> 00:03:43,440
网页,嗯,嗯,然后搜索它们。还有

108
00:03:43,440 --> 00:03:45,040
我们还允许它访问视觉

109
00:03:45,040 --> 00:03:46,319
浏览器。这类似于

110
00:03:46,319 --> 00:03:48,239
操作员工具。这就是它

111
00:03:48,239 --> 00:03:50,159
实际与网页的 UI 进行交互

112
00:03:50,159 --> 00:03:52,720
页面。所以它可以拖动东西。它可以

113
00:03:52,720 --> 00:03:54,879
使用光标点击。它可以

114
00:03:54,879 --> 00:03:57,280
打开 UI 组件。它可以填写

115
00:03:57,280 --> 00:03:59,920
表格并输入文本和文本区域。

116
00:03:59,920 --> 00:04:02,560
它非常灵活。所以这两个工具

117
00:04:02,560 --> 00:04:04,720
非常赞赏。然后我们也

118
00:04:04,720 --> 00:04:06,720
让它访问自己的终端,

119
00:04:06,720 --> 00:04:08,720
它可以运行代码,也可以

120
00:04:08,720 --> 00:04:10,640
生成并分析幻灯片等文件

121
00:04:10,640 --> 00:04:12,879
卡片和电子表格。然后通过

122
00:04:12,879 --> 00:04:14,560
它还可以调用终端

123
00:04:14,560 --> 00:04:17,840
API。因此,公共 API 和 API

124
00:04:17,840 --> 00:04:19,840
访问您的私人数据源,例如

125
00:04:19,840 --> 00:04:22,479
Google 云端硬盘、Google 日历、GitHub、

126
00:04:22,479 --> 00:04:25,360
SharePoint 和许多其他

127
00:04:25,360 --> 00:04:26,960
如果你明确地将它们联系起来

128
00:04:26,960 --> 00:04:28,960
深入研究连接器。然后它

129
00:04:28,960 --> 00:04:31,680
也可以访问图像生成 API,因此

130
00:04:31,680 --> 00:04:34,240
它可以为幻灯片创建漂亮的视觉效果

131
00:04:34,240 --> 00:04:36,080
甲板和其他东西在工作时

132
00:04:36,080 --> 00:04:38,240
通过其任务。

133
00:04:38,240 --> 00:04:40,800
如何决定在这里使用哪些工具?

134
00:04:40,800 --> 00:04:42,560
是的,我们训练模型在

135
00:04:42,560 --> 00:04:44,160
这些能力通过强化

136
00:04:44,160 --> 00:04:46,080
学习。这是我们的第一个模型

137
00:04:46,080 --> 00:04:48,880
接受过培训的人员可以访问这个统一

138
00:04:48,880 --> 00:04:52,000
工具箱。一个文本浏览器,一个 GUI 浏览器

139
00:04:52,000 --> 00:04:53,840
以及一个虚拟的终端

140
00:04:53,840 --> 00:04:57,120
机器。为了指导它的学习,我们

141
00:04:57,120 --> 00:04:59,360
创建需要使用

142
00:04:59,360 --> 00:05:01,919
所有这些工具。这使得模型

143
00:05:01,919 --> 00:05:04,000
不仅要学习如何使用这些

144
00:05:04,000 --> 00:05:06,160
工具,以及何时使用哪种工具

145
00:05:06,160 --> 00:05:08,400
取决于手头的任务。在

146
00:05:08,400 --> 00:05:10,400
训练开始时,模型

147
00:05:10,400 --> 00:05:12,880
可能会尝试使用所有这些工具来

148
00:05:12,880 --> 00:05:15,600
解决一个相对简单的问题。结束

149
00:05:15,600 --> 00:05:17,840
时间,因为我们奖励模型解决

150
00:05:17,840 --> 00:05:20,560
正确有效地解决问题,

151
00:05:20,560 --> 00:05:24,080
模型将有更智能的工具选择。

152
00:05:24,080 --> 00:05:27,360
例如,如果你要求一个模特呃

153
00:05:27,360 --> 00:05:29,039
找到有特定

154
00:05:29,039 --> 00:05:31,919
要求并进行预订,

155
00:05:31,919 --> 00:05:34,479
模型通常可能只是开始深度

156
00:05:34,479 --> 00:05:36,160
在文本浏览器中搜索

157
00:05:36,160 --> 00:05:39,039
一些候选人,然后切换到 GUI

158
00:05:39,039 --> 00:05:42,160
浏览器查看食物照片,呃检查一下

159
00:05:42,160 --> 00:05:45,600
确认是否有空位,并完成预订。

160
00:05:45,600 --> 00:05:48,000
同样,对于创造性任务,

161
00:05:48,000 --> 00:05:50,160
创建一个工件,模型将

162
00:05:50,160 --> 00:05:51,680
首先在网上搜索公众

163
00:05:51,680 --> 00:05:54,479
资源,然后切换到终端

164
00:05:54,479 --> 00:05:57,039
进行一些代码编辑来编译

165
00:05:57,039 --> 00:05:59,919
工件并最终验证最终

166
00:05:59,919 --> 00:06:02,960
在 GUI 浏览器中输出。这样,

167
00:06:02,960 --> 00:06:05,600
我们真的感觉我们团结在一起

168
00:06:05,600 --> 00:06:08,240
深度研究和运营商的最佳

169
00:06:08,240 --> 00:06:11,759
并增添了一些额外的光彩。

170
00:06:11,759 --> 00:06:14,000
没错。是的。所以这么说吧

171
00:06:14,000 --> 00:06:15,520
项目背景,我想提供一点

172
00:06:15,520 --> 00:06:18,000
历史。几个月前,我们

173
00:06:18,000 --> 00:06:20,960
一月份发货了操作员,这是

174
00:06:20,960 --> 00:06:23,120
我们的代理可让您执行在线任务

175
00:06:23,120 --> 00:06:25,759
比如预订并发送

176
00:06:25,759 --> 00:06:27,840
两周后我们

177
00:06:27,840 --> 00:06:29,919
进行了深入研究和深入研究

178
00:06:29,919 --> 00:06:31,919
是一个可以让你深入

179
00:06:31,919 --> 00:06:35,759
互联网研究和高质量输出

180
00:06:35,759 --> 00:06:39,280
嗯嗯研究报告。发布后

181
00:06:39,280 --> 00:06:41,039
我们意识到实际上这两个

182
00:06:41,039 --> 00:06:42,319
方法实际上很深刻

183
00:06:42,319 --> 00:06:44,160
免费。

184
00:06:44,160 --> 00:06:46,400
嗯,比如说操作员遇到了一些麻烦

185
00:06:46,400 --> 00:06:48,720
阅读超长文章。嗯,它有

186
00:06:48,720 --> 00:06:50,400
滚动。这需要很长时间。但是

187
00:06:50,400 --> 00:06:51,759
这是需要深入研究的

188
00:06:51,759 --> 00:06:56,240
擅长。相反,运算符呃呃深

189
00:06:56,240 --> 00:06:58,240
研究并不擅长互动

190
00:06:58,240 --> 00:07:00,319
带有网页交互元素

191
00:07:00,319 --> 00:07:03,199
视觉呃高度视觉化的网页,但是

192
00:07:03,199 --> 00:07:04,800
这是运营商擅长的

193
00:07:04,800 --> 00:07:08,639
嗯。嗯,是的,我们觉得这些方法

194
00:07:08,639 --> 00:07:11,120
是免费的,然后我们

195
00:07:11,120 --> 00:07:13,120
还查看了一些客户的反馈。

196
00:07:13,120 --> 00:07:14,880
例如,我们最受推崇的

197
00:07:14,880 --> 00:07:17,120
深入研究所要求的功能是

198
00:07:17,120 --> 00:07:18,960
登录网站的能力和

199
00:07:18,960 --> 00:07:20,960
访问经过身份验证的来源。

200
00:07:20,960 --> 00:07:22,880
操作员可以做的事情。

201
00:07:22,880 --> 00:07:24,000
我已经等待很久了

202
00:07:24,000 --> 00:07:24,560
时间。

203
00:07:24,560 --> 00:07:26,160
是的。

204
00:07:26,160 --> 00:07:28,479
嗯,另一件事是,我们正在寻找

205
00:07:28,479 --> 00:07:29,840
在人们尝试的提示下

206
00:07:29,840 --> 00:07:31,520
对于操作员,我们看到他们

207
00:07:31,520 --> 00:07:32,880
实际上是更深入的研究类型

208
00:07:32,880 --> 00:07:35,199
提示。例如,计划一次旅行,

209
00:07:35,199 --> 00:07:38,240
然后预订。所以,是的,我们真的

210
00:07:38,240 --> 00:07:39,360
感觉我们正在带来最好的

211
00:07:39,360 --> 00:07:41,440
两个世界。在个人方面

212
00:07:41,440 --> 00:07:42,800
请注意,我们都是朋友了

213
00:07:42,800 --> 00:07:44,160
而这真的非常令人兴奋

214
00:07:44,160 --> 00:07:46,479
一起工作。所以,说到

215
00:07:46,479 --> 00:07:48,960
天作之合,

216
00:07:48,960 --> 00:07:50,319
婚礼筹划进行得如何?

217
00:07:50,319 --> 00:07:51,759
看起来棒极了。这是

218
00:07:51,759 --> 00:07:53,599
我讨厌做某件事的例子。这可以

219
00:07:53,599 --> 00:07:55,520
就像毁掉几个小时一样

220
00:07:55,520 --> 00:07:56,960
对我来说,当我被这些兔子吸进去时

221
00:07:56,960 --> 00:07:58,160
洞。所以,当你看着这个的时候,

222
00:07:58,160 --> 00:07:59,520
伙计们一直在谈论点击

223
00:07:59,520 --> 00:08:01,199
这就像做整件事一样

224
00:08:01,199 --> 00:08:03,360
真的非常了不起。是的,完全是。

225
00:08:03,360 --> 00:08:06,560
嗯,看起来它开始于

226
00:08:06,560 --> 00:08:08,560
了解天气。其中之一

227
00:08:08,560 --> 00:08:11,280
很酷的功能,嗯,你知道,作为

228
00:08:11,280 --> 00:08:12,560
其中一些任务可能需要一点时间

229
00:08:12,560 --> 00:08:14,160
再过一会儿,你就可以回去看看

230
00:08:14,160 --> 00:08:15,759
它在做什么。所以,这就是我们要做的

231
00:08:15,759 --> 00:08:17,199
确实会这么做。看起来

232
00:08:17,199 --> 00:08:18,720
通过网站使用文本

233
00:08:18,720 --> 00:08:21,039
浏览器。有趣的是,现在

234
00:08:21,039 --> 00:08:22,400
它正在检查西装

235
00:08:22,400 --> 00:08:23,919
爱德华。我想它会找到一些东西

236
00:08:23,919 --> 00:08:25,360
很好。在这里你可以看到它切换了

237
00:08:25,360 --> 00:08:27,199
实际上是一个可视化浏览器

238
00:08:27,199 --> 00:08:28,960
穿上这套西装一定会很好看

239
00:08:28,960 --> 00:08:31,280
愛德華。

240
00:08:31,280 --> 00:08:34,560
现在看起来是的,它有

241
00:08:34,560 --> 00:08:36,880
努力前行,思考该做什么。

242
00:08:36,880 --> 00:08:39,599
嗯,现在仍然穿着西装,可能

243
00:08:39,599 --> 00:08:41,919
去礼品区吧。嗯,好的,

244
00:08:41,919 --> 00:08:43,279
太棒了。所以,这需要一段时间。

245
00:08:43,279 --> 00:08:44,959
正如 Sam 所说,这些任务有时可以

246
00:08:44,959 --> 00:08:46,160
需要很长时间。所以,它将会

247
00:08:46,160 --> 00:08:47,680
继续做,希望能更快

248
00:08:47,680 --> 00:08:49,760
比我们做的要多。嗯,我们应该

249
00:08:49,760 --> 00:08:51,600
在它做这件事的时候还做了其他什么?我

250
00:08:51,600 --> 00:08:53,519
我认为球队真的想要

251
00:08:53,519 --> 00:08:55,279
贴纸,一些贴纸

252
00:08:55,279 --> 00:08:56,480
发射。我们应该这么做吗?

253
00:08:56,480 --> 00:08:57,279
是的,很酷。

254
00:08:57,279 --> 00:08:59,040
好的。我们有一个球队吉祥物,

255
00:08:59,040 --> 00:09:00,320
这是我们的一位同事,Bunny

256
00:09:00,320 --> 00:09:03,279
涂鸦。真的很可爱告诉你。嗯

257
00:09:03,279 --> 00:09:06,080
我们将努力

258
00:09:06,080 --> 00:09:08,480
给大家一些笔记本电脑贴纸。呃

259
00:09:08,480 --> 00:09:10,480
代理最喜欢的功能之一

260
00:09:10,480 --> 00:09:13,120
假设轨迹可能需要 15

261
00:09:13,120 --> 00:09:15,040
分钟、20分钟、30分钟

262
00:09:15,040 --> 00:09:17,120
取决于任务的复杂性。

263
00:09:17,120 --> 00:09:19,120
嗯,很多时候你可能需要

264
00:09:19,120 --> 00:09:20,560
帮助经纪人。经纪人可能需要询问

265
00:09:20,560 --> 00:09:22,480
您的澄清、确认和

266
00:09:22,480 --> 00:09:25,040
诸如此类。嗯,所以我喜欢用它

267
00:09:25,040 --> 00:09:26,640
在路上。所以我要用我的手机

268
00:09:26,640 --> 00:09:28,160
手机实际发送查询

269
00:09:28,160 --> 00:09:30,240
时间,然后看看进展如何。

270
00:09:30,240 --> 00:09:32,880
好的,那我们看看。好的,我们继续

271
00:09:32,880 --> 00:09:35,519
Chad Gibbdi。呃,我已经选好了

272
00:09:35,519 --> 00:09:38,560
代理模式。我还输入了我们的呃

273
00:09:38,560 --> 00:09:40,560
可爱的吉祥物,我要快点

274
00:09:40,560 --> 00:09:43,040
粘贴一个查询。查询说做一些

275
00:09:43,040 --> 00:09:45,279
为团队逐一赠送笔记本电脑

276
00:09:45,279 --> 00:09:47,920
贴纸,并订购500张。我会

277
00:09:47,920 --> 00:09:52,959
还说我喜欢贴纸骡子

278
00:09:52,959 --> 00:09:55,279
我们过去使用过并发送

279
00:09:55,279 --> 00:09:57,200
把它关掉。

280
00:09:57,200 --> 00:10:00,080
好的。所以,就像在

281
00:10:00,080 --> 00:10:02,080
网络,这需要一些时间,想想

282
00:10:02,080 --> 00:10:04,080
它在做什么,它会

283
00:10:04,080 --> 00:10:07,120
开始开始查询。因为它是

284
00:10:07,120 --> 00:10:08,880
继续,这需要一些时间

285
00:10:08,880 --> 00:10:11,200
关掉。是吗?哦,我们走了。所以,它会

286
00:10:11,200 --> 00:10:12,480
开始着手吧。看起来

287
00:10:12,480 --> 00:10:14,720
开始创作动画艺术。它将

288
00:10:14,720 --> 00:10:16,640
可能使用 Isa 提到的图像

289
00:10:16,640 --> 00:10:18,399
希望能够制作一部动画

290
00:10:18,399 --> 00:10:20,240
艺术。我们拭目以待。

291
00:10:20,240 --> 00:10:21,760
就这样,还有什么我们想做的

292
00:10:21,760 --> 00:10:22,399
做?

293
00:10:22,399 --> 00:10:24,720
哦,是的。我还需要一双鞋

294
00:10:24,720 --> 00:10:26,320
因为我的鞋子损坏了。

295
00:10:26,320 --> 00:10:27,360
它们是怎么受损的?

296
00:10:27,360 --> 00:10:28,560
呃,因为下雨

297
00:10:28,560 --> 00:10:30,000
在旧金山。

298
00:10:30,000 --> 00:10:30,800
是的。

299
00:10:30,800 --> 00:10:32,160
酷。好吧。呃,好吧,我们开始吧

300
00:10:32,160 --> 00:10:34,240
爱德华也给我买了一双鞋。所以,哦,

301
00:10:34,240 --> 00:10:40,320
你也可以找到我们嗯一对男士的

302
00:10:40,320 --> 00:10:43,519
穿着黑色鞋子尺码

303
00:10:43,519 --> 00:10:44,240
9.5304
00:10:44,240 --> 00:10:46,000
9.5.

305
00:10:46,000 --> 00:10:47,920
因此,

306
00:10:47,920 --> 00:10:49,920
模型能够中断。我

307
00:10:49,920 --> 00:10:51,920
你知道,因为轨迹需要很长时间

308
00:10:51,920 --> 00:10:53,760
时间或任何时间,它真的

309
00:10:53,760 --> 00:10:56,720
对我们来说很重要,因为感觉非常

310
00:10:56,720 --> 00:10:59,120
多轮,以便用户可以插入

311
00:10:59,120 --> 00:11:01,120
用户可以直接它用户可以给它更多

312
00:11:01,120 --> 00:11:02,640
指导 更少指导 无论我们想要什么

313
00:11:02,640 --> 00:11:04,320
我们要做的事情,这就是我们在这里做的事情

314
00:11:04,320 --> 00:11:07,040
我们本质上的模型是

315
00:11:07,040 --> 00:11:08,720
努力弄清楚所有

316
00:11:08,720 --> 00:11:10,240
我们之前问过的事情

317
00:11:10,240 --> 00:11:12,320
在这种情况下,我们基本上说,嘿,可以

318
00:11:12,320 --> 00:11:16,000
你也给我们买一双男士黑色

319
00:11:16,000 --> 00:11:18,160
鞋子,现在它正在思考,很快

320
00:11:18,160 --> 00:11:19,839
希望它能考虑到这一点

321
00:11:19,839 --> 00:11:22,000
帐户并继续进入其

322
00:11:22,000 --> 00:11:23,600
轨迹。就是这样。所以,它说

323
00:11:23,600 --> 00:11:25,120
承认打扰。它说,

324
00:11:25,120 --> 00:11:26,880
“好的,很酷。我也会研究一下男士的

325
00:11:26,880 --> 00:11:29,600
9.5码的黑色鞋子。嗯,然后

326
00:11:29,600 --> 00:11:31,680
它可能会继续前进。嗯,但是

327
00:11:31,680 --> 00:11:33,120
也许 Issa 可以告诉我们更多

328
00:11:33,120 --> 00:11:34,240
关于它是如何运作的。

329
00:11:34,240 --> 00:11:36,320
是的,当然。所以,正如你所看到的,

330
00:11:36,320 --> 00:11:38,079
经纪人非常合作,而且

331
00:11:38,079 --> 00:11:39,920
对我们来说真的很重要

332
00:11:39,920 --> 00:11:41,200
训练模型并构建

333
00:11:41,200 --> 00:11:42,880
产品。如果你问的是另一个

334
00:11:42,880 --> 00:11:44,399
为您完成一项任务的人

335
00:11:44,399 --> 00:11:45,519
花了很长时间

336
00:11:45,519 --> 00:11:46,959
完成,你可能会给他们一些

337
00:11:46,959 --> 00:11:48,800
开始的说明,然后他们

338
00:11:48,800 --> 00:11:50,640
可能会问你一些澄清问题

339
00:11:50,640 --> 00:11:52,320
然后他们就开始任务,也许

340
00:11:52,320 --> 00:11:53,600
意识到,哦,他们需要更多

341
00:11:53,600 --> 00:11:55,440
你需要澄清,或者他们需要你的

342
00:11:55,440 --> 00:11:56,880
允许登录或做某事

343
00:11:56,880 --> 00:11:58,560
为你做一些事情,然后你

344
00:11:58,560 --> 00:12:00,240
可能会意识到,哦,我忘了说

345
00:12:00,240 --> 00:12:02,640
这件事,或者你的状态怎么样?

346
00:12:02,640 --> 00:12:04,240
你好吗?我可以帮你转接一下吗?

347
00:12:04,240 --> 00:12:05,760
如果你走错了路

348
00:12:05,760 --> 00:12:07,760
或者其他什么?同样,对于这些

349
00:12:07,760 --> 00:12:09,680
真正长期运行的代理任务,它是

350
00:12:09,680 --> 00:12:11,519
非常重要的是,用户和

351
00:12:11,519 --> 00:12:13,600
代理人能够发起

352
00:12:13,600 --> 00:12:15,519
互相沟通,以便

353
00:12:15,519 --> 00:12:17,200
代理人能够最有效地

354
00:12:17,200 --> 00:12:19,360
帮助你完成任务。所以这是

355
00:12:19,360 --> 00:12:20,560
我们实际上训练过的东西

356
00:12:20,560 --> 00:12:22,320
模型。我们训练它能够

357
00:12:22,320 --> 00:12:24,160
提出澄清问题,不是每个

358
00:12:24,160 --> 00:12:26,240
像深入研究这样的一次性研究。嗯,我们

359
00:12:26,240 --> 00:12:28,800
还问了它我们还训练它

360
00:12:28,800 --> 00:12:30,560
就像 Yash 刚才展示的那样,是可中断的。并且

361
00:12:30,560 --> 00:12:32,000
有时它还会要求你

362
00:12:32,000 --> 00:12:33,519
澄清和确认

363
00:12:33,519 --> 00:12:35,680
中段轨迹。

364
00:12:35,680 --> 00:12:38,079
是的。和经纪人合作的一部分是

365
00:12:38,079 --> 00:12:40,480
有时它会犯错误。

366
00:12:40,480 --> 00:12:42,079
这就是为什么我们觉得这很重要

367
00:12:42,079 --> 00:12:44,079
训练模型来向你询问

368
00:12:44,079 --> 00:12:45,920
最后一步确认

369
00:12:45,920 --> 00:12:49,279
重要的步骤。嗯,比如说

370
00:12:49,279 --> 00:12:51,519
在发送电子邮件之前

371
00:12:51,519 --> 00:12:53,440
它会要求你看一下

372
00:12:53,440 --> 00:12:54,720
草案以及它是否有意义,

373
00:12:54,720 --> 00:12:56,079
是否有任何尴尬

374
00:12:56,079 --> 00:12:59,200
拼写错误。嗯,如果有的话,那么你

375
00:12:59,200 --> 00:13:01,360
您可以要求它修复它,或者您可以

376
00:13:01,360 --> 00:13:03,440
直接接管浏览器并跳转

377
00:13:03,440 --> 00:13:06,079
直接进入代理环境

378
00:13:06,079 --> 00:13:09,040
并自己纠正。这样

379
00:13:09,040 --> 00:13:10,720
感觉合作,你可以,嗯,

380
00:13:10,720 --> 00:13:13,680
真正与代理商合作。

381
00:13:13,680 --> 00:13:15,120
我们是否应该再看一个演示?

382
00:13:15,120 --> 00:13:17,279
我们有这个呃,有点有趣的传统

383
00:13:17,279 --> 00:13:19,600
在直播中使用我们的

384
00:13:19,600 --> 00:13:21,120
最新模型的评估

385
00:13:21,120 --> 00:13:23,040
他们自己或者做一些元的事情。

386
00:13:23,040 --> 00:13:24,240
我们能做类似的事情吗?

387
00:13:24,240 --> 00:13:27,440
是的,我们开始吧。

388
00:13:27,440 --> 00:13:28,320
只有一个

389
00:13:28,320 --> 00:13:29,440
我想人们很想知道

390
00:13:29,440 --> 00:13:30,320
这个模型很好。

391
00:13:30,320 --> 00:13:33,920
是的。这是我们之前提出的一个提示。

392
00:13:33,920 --> 00:13:36,880
昨天给了经纪人。所以基本上

393
00:13:36,880 --> 00:13:38,959
它要求模型自己

394
00:13:38,959 --> 00:13:40,959
来自我们 Google 工作的评估编号

395
00:13:40,959 --> 00:13:43,440
连接器并制作一些幻灯片。所以我们

396
00:13:43,440 --> 00:13:44,959
想要保持简单,就像没有

397
00:13:44,959 --> 00:13:47,360
引言 没有结论 只是提出

398
00:13:47,360 --> 00:13:50,000
图表中的结果。正如你

399
00:13:50,000 --> 00:13:52,160
现在可以看到模型正在连接到

400
00:13:52,160 --> 00:13:55,120
Google Drive API 然后搜索

401
00:13:55,120 --> 00:13:57,600
在 API 中它现在看起来像

402
00:13:57,600 --> 00:13:59,920
第一个结果非常相关。所以

403
00:13:59,920 --> 00:14:02,720
它正在读取第一个结果。

404
00:14:02,720 --> 00:14:04,959
现在它正在读取第一个结果

405
00:14:04,959 --> 00:14:07,920
细节。呃,让我们加速这个呃

406
00:14:07,920 --> 00:14:12,800
重播。那么模型可能会读

407
00:14:12,800 --> 00:14:15,279
从结果中再次写出一些

408
00:14:15,279 --> 00:14:16,959
代码。

409
00:14:16,959 --> 00:14:19,519
所以在这里你可以看到模型是

410
00:14:19,519 --> 00:14:21,920
使用名为

411
00:14:21,920 --> 00:14:24,480
图像生成工具来生成一些

412
00:14:24,480 --> 00:14:28,079
幻灯片的装饰。

413
00:14:28,079 --> 00:14:30,160
让我们看看第一张幻灯片是什么

414
00:14:30,160 --> 00:14:33,399
模型制作。

415
00:14:33,920 --> 00:14:35,920
所以这里的模型正在写一些代码

416
00:14:35,920 --> 00:14:38,399
将被编译为最终版本

417
00:14:38,399 --> 00:14:41,120
幻灯片。这是第一张幻灯片

418
00:14:41,120 --> 00:14:44,160
此演示中的模型看起来不错

419
00:14:44,160 --> 00:14:46,240
但还不够精致。

420
00:14:46,240 --> 00:14:48,240
强化的关键特征之一

421
00:14:48,240 --> 00:14:50,160
学习是模型将重新

422
00:14:50,160 --> 00:14:52,240
审查自己的结果并改进

423
00:14:52,240 --> 00:14:55,120
取得好成绩

424
00:14:55,120 --> 00:14:57,839
结果。让我们看看最终结果如何

425
00:14:57,839 --> 00:15:00,320
模型给了我们什么。

426
00:15:00,320 --> 00:15:04,000
我们可以点击跳过,然后点击模型

427
00:15:04,000 --> 00:15:07,519
给我们一个好的PowerPoint文件。所以

428
00:15:07,519 --> 00:15:09,040
这是一个真正的 PowerPoint,你可以

429
00:15:09,040 --> 00:15:14,040
下载并在任何软件中打开它。

430
00:15:14,639 --> 00:15:19,279
我们在办公室里打开它吧。所以

431
00:15:19,279 --> 00:15:22,160
让我们展示一下幻灯片模型

432
00:15:22,160 --> 00:15:23,839
生成。

433
00:15:23,839 --> 00:15:27,120
首先是两个情报基准。

434
00:15:27,120 --> 00:15:30,480
人文学科的期末考试是

435
00:15:30,480 --> 00:15:33,519
衡量人工智能解决广泛问题的能力

436
00:15:33,519 --> 00:15:37,120
一系列关于难题的主题。我们

437
00:15:37,120 --> 00:15:40,320
用两种设置评估模型

438
00:15:40,320 --> 00:15:43,440
无论是否使用工具。

439
00:15:43,440 --> 00:15:45,920
我们可以看到代理模式原始

440
00:15:45,920 --> 00:15:48,720
智力已经相当不错了,

441
00:15:48,720 --> 00:15:50,880
所有工具的使用率几乎翻倍

442
00:15:50,880 --> 00:15:54,720
性能提升至42%443
00:15:54,720 --> 00:15:56,720
在评估人类的模型时

444
00:15:56,720 --> 00:15:59,360
上次考试,尤其是浏览

445
00:15:59,360 --> 00:16:01,759
能力,我们有两层

446
00:16:01,759 --> 00:16:04,399
净化,确保

447
00:16:04,399 --> 00:16:07,680
模型在这个基准上没有作弊。

448
00:16:07,680 --> 00:16:10,079
前 TMS 是衡量

449
00:16:10,079 --> 00:16:11,839
高级数学推理能力

450
00:16:11,839 --> 00:16:13,680
模型。

451
00:16:13,680 --> 00:16:16,000
与我们的迷你基准不同,

452
00:16:16,000 --> 00:16:18,560
03 使用 Python 函数

453
00:16:18,560 --> 00:16:21,440
编码。我们给代理模型所有

454
00:16:21,440 --> 00:16:23,440
可用的工具,如浏览器、

455
00:16:23,440 --> 00:16:26,320
计算机和终端。代理

456
00:16:26,320 --> 00:16:29,360
在这方面取得了 27% 的新状态

457
00:16:29,360 --> 00:16:31,440
借助所有这些

458
00:16:31,440 --> 00:16:34,440
工具。

459
00:16:34,639 --> 00:16:36,880
接下来,我们在两个方面评估了模型

460
00:16:36,880 --> 00:16:39,519
代理基准。Web 竞技场是一个

461
00:16:39,519 --> 00:16:41,519
衡量网络代理的基准

462
00:16:41,519 --> 00:16:43,600
能够解决现实世界的网络问题

463
00:16:43,600 --> 00:16:47,279
任务。代理模型改进了

464
00:16:47,279 --> 00:16:51,360
为核心提供动力的先前的 O3 模型。

465
00:16:51,360 --> 00:16:54,399
浏览公司是我们推出的基准

466
00:16:54,399 --> 00:16:56,240
今年早些时候,

467
00:16:56,240 --> 00:16:58,880
浏览代理搜索能力和

468
00:16:58,880 --> 00:17:02,320
查找呃如何定位信息。

469
00:17:02,320 --> 00:17:03,839
代理模型显著

470
00:17:03,839 --> 00:17:06,160
优于03并对此进行深入研究

471
00:17:06,160 --> 00:17:11,679
基准测试通过率为69%472
00:17:11,679 --> 00:17:14,559
最后,我们关心的是用户

473
00:17:14,559 --> 00:17:16,959
将在现实中受益于我们的模型

474
00:17:16,959 --> 00:17:19,919
世界。电子表格工作台是一个基准

475
00:17:19,919 --> 00:17:21,919
衡量模型的能力

476
00:17:21,919 --> 00:17:24,400
编辑来自真实

477
00:17:24,400 --> 00:17:28,079
世界用例。这里是代理模型

478
00:17:28,079 --> 00:17:30,480
拥有自由的办公室和电脑

479
00:17:30,480 --> 00:17:34,000
工具已经可以解决 30% 的任务

480
00:17:34,000 --> 00:17:36,480
当我们让模型访问

481
00:17:36,480 --> 00:17:39,840
终端中的原始 Excel 文件

482
00:17:39,840 --> 00:17:44,000
进一步提升性能至45%483
00:17:44,000 --> 00:17:46,000
最后,我们在

484
00:17:46,000 --> 00:17:48,000
内部银行基准。基准

485
00:17:48,000 --> 00:17:49,760
该基准评估了该模型的

486
00:17:49,760 --> 00:17:52,559
能够进行第一到第三

487
00:17:52,559 --> 00:17:55,679
年度投资银行 uh 银行分析师

488
00:17:55,679 --> 00:17:58,799
诸如组装

489
00:17:58,799 --> 00:18:00,559
三表财务模型

490
00:18:00,559 --> 00:18:04,000
财富 500 强公司

491
00:18:04,000 --> 00:18:06,160
基准。代理模型显著

492
00:18:06,160 --> 00:18:08,080
优于之前的深入研究

493
00:18:08,080 --> 00:18:11,760
以及所有三个模型。正如你所见

494
00:18:11,760 --> 00:18:13,919
这个模型是最强大的模型之一

495
00:18:13,919 --> 00:18:16,080
我们曾经训练过的模型。

496
00:18:16,080 --> 00:18:18,960
它不仅在基准测试中表现出色,而且

497
00:18:18,960 --> 00:18:22,480
还具有推理、浏览和

498
00:18:22,480 --> 00:18:24,720
在一定程度上解决现实世界的任务

499
00:18:24,720 --> 00:18:28,480
这是三个月前我们无法想象的。

500
00:18:28,480 --> 00:18:31,600
没错。嗯,就像爱德华说的,嗯,我们

501
00:18:31,600 --> 00:18:32,799
我认为我们已经训练了一支非常强大的

502
00:18:32,799 --> 00:18:35,280
模型,很大一部分力量来自于

503
00:18:35,280 --> 00:18:38,240
浏览互联网的能力。并且

504
00:18:38,240 --> 00:18:40,240
我们知道,互联网可能是一个可怕的

505
00:18:40,240 --> 00:18:42,400
那里有各种各样的黑客

506
00:18:42,400 --> 00:18:45,120
试图窃取您的信息、诈骗、

507
00:18:45,120 --> 00:18:48,480
呃,钓鱼尝试。嗯,经纪人没有

508
00:18:48,480 --> 00:18:51,120
对所有这些事情都免疫。嗯,一个

509
00:18:51,120 --> 00:18:53,360
我们特别担心的是

510
00:18:53,360 --> 00:18:55,520
一种名为“prompt”的新攻击

511
00:18:55,520 --> 00:18:57,120
注射。

512
00:18:57,120 --> 00:18:59,840
假设你要求代理人

513
00:18:59,840 --> 00:19:02,080
给你买一本书,你给它你的

514
00:19:02,080 --> 00:19:04,400
信用卡信息即可实现这一点。

515
00:19:04,400 --> 00:19:06,240
代理可能会偶然发现恶意

516
00:19:06,240 --> 00:19:08,559
网站询问,“哦,输入你的

517
00:19:08,559 --> 00:19:10,400
信用卡信息在这里。这会有帮助

518
00:19:10,400 --> 00:19:12,799
完成你的任务。代理

519
00:19:12,799 --> 00:19:15,200
受过培训,可以提供帮助,可能会决定

520
00:19:15,200 --> 00:19:18,080
这是个好主意。

521
00:19:18,080 --> 00:19:19,760
我们做了很多工作,试图

522
00:19:19,760 --> 00:19:22,320
确保这种情况不会发生。我们已经

523
00:19:22,320 --> 00:19:24,240
训练我们的模型忽略可疑

524
00:19:24,240 --> 00:19:27,120
有关可疑网站的说明。

525
00:19:27,120 --> 00:19:29,039
我们也有呃,我们也有层

526
00:19:29,039 --> 00:19:32,000
监视着

527
00:19:32,000 --> 00:19:33,760
特工的肩膀,看着它

528
00:19:33,760 --> 00:19:36,480
如果

529
00:19:36,480 --> 00:19:38,799
任何事看起来都很可疑。我们甚至可以

530
00:19:38,799 --> 00:19:41,919
如果有新的攻击,请实时更新这些

531
00:19:41,919 --> 00:19:44,160
在野外发现。

532
00:19:44,160 --> 00:19:45,919
尽管如此,你知道,这是一个

533
00:19:45,919 --> 00:19:47,760
尖端产品。这是一个新的

534
00:19:47,760 --> 00:19:50,000
表面,我们无法阻止一切。

535
00:19:50,000 --> 00:19:51,280
所以我觉得这非常

536
00:19:51,280 --> 00:19:52,559
让观众意识到这一点很重要

537
00:19:52,559 --> 00:19:55,360
使用代理所涉及的风险。

538
00:19:55,360 --> 00:19:57,440
我们鼓励用户

539
00:19:57,440 --> 00:19:59,520
积极思考如何

540
00:19:59,520 --> 00:20:01,120
他们分享信息。你知道,

541
00:20:01,120 --> 00:20:02,880
如果是高度敏感的信息,

542
00:20:02,880 --> 00:20:06,799
也许不要分享这个。嗯也许嗯呃

543
00:20:06,799 --> 00:20:08,799
使用我们的功能(例如接管模式)

544
00:20:08,799 --> 00:20:10,799
直接输入您的信用卡

545
00:20:10,799 --> 00:20:12,880
信息到浏览器中,而不是

546
00:20:12,880 --> 00:20:15,679
嗯,把它交给经纪人。嗯,我们觉得

547
00:20:15,679 --> 00:20:18,640
我们已经打造了一款非常强大的产品,但是

548
00:20:18,640 --> 00:20:20,480
再次强调,对于我们的用户来说

549
00:20:20,480 --> 00:20:21,760
了解所涉及的风险。

550
00:20:21,760 --> 00:20:23,280
是的,我真的想强调一下

551
00:20:23,280 --> 00:20:25,520
认为这是一种新的能力水平

552
00:20:25,520 --> 00:20:27,120
在人工智能领域。这是一种使用人工智能的新方法,但是

553
00:20:27,120 --> 00:20:28,799
将会有一系列新的攻击

554
00:20:28,799 --> 00:20:30,799
随之而来。社会和

555
00:20:30,799 --> 00:20:33,120
技术必须不断发展和学习

556
00:20:33,120 --> 00:20:34,320
我们将如何缓解

557
00:20:34,320 --> 00:20:36,159
我们甚至还无法想象。呃,因为

558
00:20:36,159 --> 00:20:37,360
人们开始做越来越多的工作

559
00:20:37,360 --> 00:20:39,679
这边走。在我结束之前,我们应该

560
00:20:39,679 --> 00:20:41,840
检查你踢出的一些任务

561
00:20:41,840 --> 00:20:42,080
离开?

562
00:20:42,080 --> 00:20:46,159
好的,我们开始吧。嗯,好的。所以我

563
00:20:46,159 --> 00:20:48,240
打开新标签页并确保

564
00:20:48,240 --> 00:20:51,840
我们可以看到我们的进展,

565
00:20:51,840 --> 00:20:55,679
还有贴纸。好的。我看看。所有

566
00:20:55,679 --> 00:20:58,159
对。所以,听起来贴纸

567
00:20:58,159 --> 00:21:00,880
准备好了。让我看看它到底怎么样。好的。

568
00:21:00,880 --> 00:21:03,200
太棒了。这算是个结局了

569
00:21:03,200 --> 00:21:06,720
最终结果耗时约 7 分钟。

570
00:21:06,720 --> 00:21:08,480
很可能已经弄清楚了一切。

571
00:21:08,480 --> 00:21:09,840
我们将回过头来看一下轨迹

572
00:21:09,840 --> 00:21:11,679
看看效果如何。但最后

573
00:21:11,679 --> 00:21:13,679
结果,它看起来像是被添加到

574
00:21:13,679 --> 00:21:15,360
购物车。这是小计。我可以

575
00:21:15,360 --> 00:21:17,360
继续看,然后弄清楚

576
00:21:17,360 --> 00:21:20,000
我可以接手这个

577
00:21:20,000 --> 00:21:21,600
正如凯西所说,输入我的信用

578
00:21:21,600 --> 00:21:23,039
卡信息,然后放置

579
00:21:23,039 --> 00:21:25,200
订购非常快。模特正在询问

580
00:21:25,200 --> 00:21:27,120
确认等,因为它应该

581
00:21:27,120 --> 00:21:29,280
要做。我们先快速浏览一下

582
00:21:29,280 --> 00:21:31,039
看看它实际上

583
00:21:31,039 --> 00:21:33,280
确实。哦,看起来它生成了一些

584
00:21:33,280 --> 00:21:35,840
贴纸。哦,看看这个。这就是

585
00:21:35,840 --> 00:21:38,880
它生成了贴纸。很酷。所以,是的

586
00:21:38,880 --> 00:21:40,640
这就是任务。我想我可以

587
00:21:40,640 --> 00:21:42,559
我自己完成,或者我可以问

588
00:21:42,559 --> 00:21:43,919
真正继续执行的模型

589
00:21:43,919 --> 00:21:46,720
对我来说也是如此。让我们检查一下

590
00:21:46,720 --> 00:21:49,840
婚礼。好的,太好了。看起来

591
00:21:49,840 --> 00:21:52,720
及时完成了。嗯,好吧,

592
00:21:52,720 --> 00:21:55,520
很酷。所以在这种情况下,正如我们所说的,我们

593
00:21:55,520 --> 00:21:57,840
正在寻找酒店,压力很大,呃

594
00:21:57,840 --> 00:22:01,919
西装,还有鞋子。所以它出来了

595
00:22:01,919 --> 00:22:03,520
一份相当全面的报告。它

596
00:22:03,520 --> 00:22:05,840
看起来像婚礼场地、日期、时间

597
00:22:05,840 --> 00:22:10,240
是与 Zilla 链接,着装规范。它

598
00:22:10,240 --> 00:22:11,600
弄清楚了这套衣服

599
00:22:11,600 --> 00:22:12,960
建议应该是,你可以

600
00:22:12,960 --> 00:22:14,799
买。现在我可以自己买了

601
00:22:14,799 --> 00:22:17,120
或者我可以请代理去买

602
00:22:17,120 --> 00:22:20,960
我。嗯,也解决了鞋类障碍

603
00:22:20,960 --> 00:22:23,360
选项。它实际上查看了所有

604
00:22:23,360 --> 00:22:27,120
哎呀,抱歉,它查看了所有的

605
00:22:27,120 --> 00:22:29,360
可用性。你实际上可以看到

606
00:22:29,360 --> 00:22:31,440
提供检查结果的屏幕截图。在

607
00:22:31,440 --> 00:22:33,120
在这种情况下,我们使用 booking.com,它是

608
00:22:33,120 --> 00:22:35,280
能够做到这一点。也有天赋

609
00:22:35,280 --> 00:22:37,360
建议等。下一步我可以问

610
00:22:37,360 --> 00:22:39,760
正如你所说,经纪人说,嘿,如果你

611
00:22:39,760 --> 00:22:41,520
需要协助购买任何物品或

612
00:22:41,520 --> 00:22:42,960
有任何进一步的调整请告诉我

613
00:22:42,960 --> 00:22:44,880
这样我们就可以做到。嗯,我想

614
00:22:44,880 --> 00:22:46,320
展示最后一个我们没有展示的演示

615
00:22:46,320 --> 00:22:48,640
真的现场直播,但我认为这真的

616
00:22:48,640 --> 00:22:51,280
很酷,尤其是因为人们

617
00:22:51,280 --> 00:22:52,880
即将结婚的人真的很喜欢

618
00:22:52,880 --> 00:22:57,679
MLB。所以我们叫经纪人去

619
00:22:57,679 --> 00:22:59,679
并制定最佳行程

620
00:22:59,679 --> 00:23:02,640
参观所有 30 个 MLB 体育场

621
00:23:02,640 --> 00:23:05,200
如果你正在考虑一个讽刺的呃和

622
00:23:05,200 --> 00:23:08,159
然后设计最优路线,优先考虑

623
00:23:08,159 --> 00:23:10,960
Hello Kitty 之夜等等

624
00:23:10,960 --> 00:23:12,400
提出最终计划作为详细的

625
00:23:12,400 --> 00:23:13,520
电子表格。我会很快运行

626
00:23:13,520 --> 00:23:15,440
通过这个。嗯,我觉得这太

627
00:23:15,440 --> 00:23:18,240
很有趣。所以再次像我们一样

628
00:23:18,240 --> 00:23:20,720
在整个直播中展示

629
00:23:20,720 --> 00:23:23,919
流它使用多种工具使用

630
00:23:23,919 --> 00:23:26,240
集装箱终端使用使用

631
00:23:26,240 --> 00:23:28,799
浏览器处理所有细节。

632
00:23:28,799 --> 00:23:30,400
它可能会再次使用回到

633
00:23:30,400 --> 00:23:33,200
浏览器搞清楚 Hello Kitty 之夜

634
00:23:33,200 --> 00:23:36,559
然后还有体育场等等。哦

635
00:23:36,559 --> 00:23:39,520
让我们看看我是否错过了 Oh go 地图。

636
00:23:39,520 --> 00:23:42,080
使用代码构建地图来实际

637
00:23:42,080 --> 00:23:43,919
将其构建出来然后我们总体上得到

638
00:23:43,919 --> 00:23:46,159
我认为这是一个相当可靠的结果

639
00:23:46,159 --> 00:23:48,880
最终需要 25 分钟才能完成

640
00:23:48,880 --> 00:23:50,400
赛季开始了,你

641
00:23:50,400 --> 00:23:51,919
有一个电子表格,你可以快速

642
00:23:51,919 --> 00:23:55,760
查看内部,恰好位于 Chad GBD 内部

643
00:23:55,760 --> 00:23:57,919
你可以绘制旅程很酷的地图

644
00:23:57,919 --> 00:24:00,400
我想就是这样了,这就是乍得

645
00:24:00,400 --> 00:24:02,240
GBD 代理我们希望您真的喜欢它,

646
00:24:02,240 --> 00:24:04,000
交给 Sam

647
00:24:04,000 --> 00:24:05,919
你们都做得很棒,

648
00:24:05,919 --> 00:24:07,440
团队这是我认为呃真的

649
00:24:07,440 --> 00:24:08,720
一些能够帮助人们的东西

650
00:24:08,720 --> 00:24:10,720
完成工作,有更多的时间

651
00:24:10,720 --> 00:24:12,240
做他们想做的事。嗯,我

652
00:24:12,240 --> 00:24:13,520
想想这真是太神奇了

653
00:24:13,520 --> 00:24:15,360
你们齐心协力完成了这项任务

654
00:24:15,360 --> 00:24:17,760
体验和观察代理排序

655
00:24:17,760 --> 00:24:19,120
使用互联网,使这些

656
00:24:19,120 --> 00:24:20,640
电子表格、制作 PowerPoint 等等

657
00:24:20,640 --> 00:24:22,960
否则呃,做所有这些工作是相当

658
00:24:22,960 --> 00:24:26,000
太棒了。我们今天要为专业版直播

659
00:24:26,000 --> 00:24:28,880
plus 和团队用户。Pro 用户将获得

660
00:24:28,880 --> 00:24:30,720
呃,每月 400 个查询,加上一些团队

661
00:24:30,720 --> 00:24:32,720
用户每月可获得 40 美元。呃

662
00:24:32,720 --> 00:24:34,000
部署工作应在年底前完成

663
00:24:34,000 --> 00:24:36,159
Pro 版即将面世,Plus 版也即将面世

664
00:24:36,159 --> 00:24:38,400
和团队用户。将尝试直播

665
00:24:38,400 --> 00:24:40,799
企业和教育机构

666
00:24:40,799 --> 00:24:43,360
月。正如 Casey 提到的,尽管这

667
00:24:43,360 --> 00:24:45,360
是一项极其令人兴奋的新技术,

668
00:24:45,360 --> 00:24:48,080
有新的风险。呃,人们学到了

669
00:24:48,080 --> 00:24:49,520
如何使用互联网一般很漂亮

670
00:24:49,520 --> 00:24:50,880
安全地,当然也有

671
00:24:50,880 --> 00:24:52,880
诈骗者和其他攻击。人们

672
00:24:52,880 --> 00:24:54,559
需要学习使用人工智能

673
00:24:54,559 --> 00:24:56,080
特工。呃,社会需要

674
00:24:56,080 --> 00:24:57,919
学会建立防御机制

675
00:24:57,919 --> 00:25:00,080
攻击人工智能代理。所以我们

676
00:25:00,080 --> 00:25:02,080
从一个非常强大的系统开始,很多

677
00:25:02,080 --> 00:25:04,240
警告。我们将放宽

678
00:25:04,240 --> 00:25:05,679
随着人们越来越习惯

679
00:25:05,679 --> 00:25:07,600
但我们确实希望人们能够

680
00:25:07,600 --> 00:25:09,919
作为一项新技术和新风险

681
00:25:09,919 --> 00:25:12,080
表面并采取所有谨慎措施

682
00:25:12,080 --> 00:25:14,799
凯西说过。嗯,不过话说回来,

683
00:25:14,799 --> 00:25:16,720
希望你会喜欢。呃,这是

684
00:25:16,720 --> 00:25:18,159
还为时过早。我们会改进

685
00:25:18,159 --> 00:25:20,640
我们很高兴看到

686
00:25:20,640 --> 00:25:22,640
一切顺利。所以,再次祝贺。谢谢

687
00:25:22,640 --> 00:25:26,440
非常感谢。希望你喜欢。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值