Robot Knows the Right Question to Ask When It's Confused

A robot that asks smart questions
Researchers at Brown University have developed a system called FETCH-POMDP that lets a robot understand the implicit meaning of common gestures and combine them with human language to more accurately understand what a person wants. By introducing social feedback into an object-fetching task, the system effectively reduces errors.

http://spectrum.ieee.org/automaton/robotics/artificial-intelligence/robot-knows-the-right-question-to-ask-when-its-confused


Brown University researcher works with a Baxter robot. Photo: Nick Dentamaro/Brown University

Last week, we (and most of the rest of the internet) covered some research from MIT that uses a brain interface to help robots correct themselves when they’re about to make a mistake. This is very cool, very futuristic stuff, but it only works if you wear a very, very silly hat that can classify your brain waves in 10 milliseconds flat.

At Brown University, researchers in Stefanie Tellex’s lab are working on a more social approach to helping robots more accurately interact with humans. By enabling a robot to model its own confusion in an interactive object-fetching task, the robot can ask relevant clarifying questions when necessary to help understand exactly what humans want. No hats required. 

Whether you ask a human or a robot to fetch you an object, it’s a simple task to perform if the object is unique in some way, and a more complicated task to perform if it involves several similar objects. Say you’re a mechanic, and you want an assistant to bring you a tool. You can point at a shelf of tools and say, “Bring me that tool.” Your assistant, if they’re human, will look where you point, and if there are only a handful of tools on the shelf, they’ll probably be able to infer what tool you mean. But if the shelf is mostly full, especially if it’s full of objects that are similar, your assistant might not be able to determine exactly what tool you’re talking about, so they’ll ask you to clarify somehow, perhaps by pointing at a tool and saying, “Is this the one you mean?”

To be useful in situations like these, your assistant has to have an understanding of ambiguity and uncertainty: They have to be able to tell when there is (or is not) enough information to complete a task, and then take the right action to help get more information when necessary, whether that uncertainty comes from the assistant not really getting something, or just from you not being specific enough about what you want. For robot assistants, it’s a much more difficult problem than it is for human assistants, because of the social components involved. Pointing, gestures, gaze, and language cues are all tricks that humans use to communicate information that robots are generally quite terrible at interpreting.
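
To make the ambiguity concrete, here is a minimal sketch (not the Brown system, and with entirely made-up names and numbers) that treats a pointing gesture as a cone of plausible directions and decides to ask a question only when more than one tool on the shelf survives the filter:

```python
# Hypothetical shelf layout: tool name -> bearing (degrees) from the assistant's
# point of view. All names and numbers here are illustrative assumptions.
shelf = {"wrench": 10.0, "screwdriver": 12.0, "hammer": 55.0}

def plausible_targets(pointing_angle, tolerance=5.0):
    """Return every tool whose bearing falls inside the pointing 'cone'."""
    return [name for name, bearing in shelf.items()
            if abs(bearing - pointing_angle) <= tolerance]

candidates = plausible_targets(pointing_angle=11.0)
if len(candidates) > 1:
    print(f"Ambiguous ({candidates}) -- ask: 'Is this the one you mean?'")
elif candidates:
    print(f"Only the {candidates[0]} fits -- just fetch it.")
else:
    print("Nothing matches the gesture -- ask for clarification.")
```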

At Brown, they’ve created a painfully acronym’d system called “FEedback To Collaborative Handoff Partially Observable Markov Decision Process,” or FETCH-POMDP. It’s able to understand the implicit meaning of common gestures, and merge those meanings with what the person making the gestures is saying to improve its understanding of what the person wants. Assuming that the person is being cooperative (not lying about what they want), the system is able to model its own confusion and ask questions only when necessary for accuracy so as not to be unduly bothersome. 
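
The paper formulates this as a POMDP solved by a planner; purely as a sketch of the intuition rather than the authors’ implementation, the snippet below keeps a Bayesian belief over candidate items, folds in assumed likelihoods for a noisy speech transcript and a pointing gesture, and asks a clarifying question only when no single item is confident enough. The item names, likelihood values, and the 0.8 threshold are all illustrative assumptions.

```python
# Illustrative belief tracker, loosely inspired by the idea behind FETCH-POMDP:
# fuse noisy language and gesture observations, and only ask when uncertain.

items = ["spoon", "fork", "bowl"]
belief = {item: 1.0 / len(items) for item in items}  # uniform prior

def update(belief, likelihoods):
    """Bayes rule: multiply the prior by observation likelihoods, renormalize."""
    posterior = {item: belief[item] * likelihoods.get(item, 0.05) for item in belief}
    total = sum(posterior.values())
    return {item: p / total for item, p in posterior.items()}

# Assumed observation models (made up): how likely each item is given what was
# heard ("hand me the spoon") and where the person pointed (near spoon and fork).
speech_likelihood  = {"spoon": 0.7, "fork": 0.2, "bowl": 0.1}
gesture_likelihood = {"spoon": 0.5, "fork": 0.4, "bowl": 0.1}

belief = update(belief, speech_likelihood)
belief = update(belief, gesture_likelihood)

best_item, confidence = max(belief.items(), key=lambda kv: kv[1])
if confidence > 0.8:   # confident enough: just fetch the item
    print(f"Fetching the {best_item} (belief {confidence:.2f}).")
else:                  # otherwise, point at the best guess and ask a yes/no question
    print(f"Point at the {best_item} and ask: 'Is this the one you mean?'")
```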

To test out the FETCH-POMDP system, the Brown researchers asked people who had no idea what was going on to ask a Baxter robot to fetch things for them. A third of the time, the robot asked no questions; a third of the time, the robot always asked clarifying questions; and the final third of the time, the robot only asked questions when it decided that a question was necessary. The researchers expected that the robot would be fastest when asking no questions, and most accurate when always asking questions, but it turned out that the intelligent questioning approach managed to be both the fastest and most accurate. This is because human-robot interaction is messy: people’s spoken answers to the robot’s questions were sometimes transcribed incorrectly (confusing “yes” with “hand,” for example), so more questions meant more chances for misunderstanding.
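
A quick back-of-the-envelope model (with invented numbers, not the study’s data) shows why asking more questions isn’t automatically better: every question is another chance for the robot to mishear the answer, so a confirming question only pays off when the robot is genuinely unsure.

```python
# Hypothetical numbers, just to illustrate the trade-off described above.
p_hear_answer = 0.90   # chance a spoken "yes"/"no" reply is transcribed correctly
p_lucky_guess = 0.50   # if the reply is misheard, assume a coin-flip outcome

# Accuracy after asking one confirming question: a correctly heard answer
# resolves the ambiguity; a misheard one leaves the robot guessing.
accuracy_if_ask = p_hear_answer * 1.0 + (1 - p_hear_answer) * p_lucky_guess  # 0.95

for prior_confidence in (0.60, 0.97):   # robot's confidence before asking
    better = "ask" if accuracy_if_ask > prior_confidence else "don't ask"
    print(f"confidence {prior_confidence:.2f}: ask -> {accuracy_if_ask:.2f}, "
          f"don't ask -> {prior_confidence:.2f} ({better})")
```

Under these assumed numbers, asking helps a robot that is only 60 percent sure, but hurts one that is already 97 percent sure, and it always costs extra time.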

Interestingly, the participants in the trials also ascribed all kinds of capabilities to the robot which it didn’t actually have:

During trials, many users used prepositional phrases in order to describe items, such as “Hand me the spoon to the left of the bowl.” Although the language model in this work did not account for referential phrases, the agent was able to use intelligent social feedback to figure out what the human desired. This may explain why many users reported that they thought the robot did understand prepositional phrases. Methods exist to interpret referential language, but problems in understanding will still occur. Our model will help correct those mistakes, regardless of the exact method of state estimation and language understanding.

The researchers do plan to update their model to include referential phrasing like this, and they also want to add things like eye tracking to improve accuracy even more. Adding the ability to place objects (along with picking them up) has the potential to make this system much more useful in the workplace, especially if you have a spaceship that needs fixing.

“Reducing Errors in Object-Fetching Interactions through Social Feedback,” by David Whitney, Eric Rosen, James MacGlashan, Lawson L.S. Wong, and Stefanie Tellex from Brown University will be presented at ICRA 2017 in Singapore.
