Robust Constrained Learning-based NMPC enabling reliable mobile robot path tracking

最新推荐文章于 2024-08-23 09:38:30 发布

Vic_Hao

最新推荐文章于 2024-08-23 09:38:30 发布

阅读量586

点赞数

CC 4.0 BY-SA版权

分类专栏：机器人论文阅读

本文链接：https://blog.youkuaiyun.com/weixin_42018112/article/details/93540837

机器人论文阅读专栏收录该内容

33 篇文章

订阅专栏

文章旨在实现未知干扰下的鲁棒约束、高性能路径跟踪。思路是从简单且高不确定性的过程模型学习准确、低不确定性模型，用VO做定位。与传统约束NMPC不同，本文学习干扰模型加强过程模型，实时应用鲁棒约束。介绍了NMPC、鲁棒约束NMPC、不确定轨迹预测及高斯过程干扰模型等内容。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

Introduction

这篇文章的目的就是achieving robust constrained, high performance path-tracking in spite of unknown disturbances.

这篇文章的思路：simple process model, high model uncertainty $→learn\rightarrow^{learn}$ accurate, low-uncertainty model

这篇文章用VO做Localization.

这篇文章和传统contrained NMPC有如下两方面的不同：

传统方法中process model是预先设计好并且不变的，在这篇文章中learn到disturbance model来加强process model，使得process model可以predict the mean and uncertainty of effects.
传统contrained NMPC没有考虑模型的不确定性，这篇文章apply robust constraints in real time considering the learned uncertainty. We provide robust constraint satisfaction when uncertainty is high and increased performance as uncertainty is reduced through learning.

这篇文章的主要创新点就是：

use learned models
account for model uncertainty

在这里插入图片描述
上面就是本文整体控制框图。RC-LB-NMPC主要包含两个主要的部分：

the robust constrained, path-tracking NMPC algorithm based on an a priori process
the GP-based disturbance model

Mathematical Formulation

先大概介绍一下NMPC吧：
At a given sample time, NMPC finds a sequence of control inputs that optimizes the plant behavior over a prediction horizon based on current state. The first input in the optimal sequence is then applied to the system. The entire process is repeated at the next sample time for the new system state.

Robust Constrained NMPC

首先肯定是要讲一下状态转移model

The true system is approximate by the sum of an a priori model and an experienced-based, learned model:
$x_{k+1} = f(x_{k}, u_{k}) + g(a_{k})$
where:
$f(⋅)f(\cdot)$ ——a known nonlinear process model representing our knowledge of $ftrue(⋅)f_{true}(\cdot)$
$g(⋅)g(\cdot)$ —— an (initially unknown) disturbance model representing discrepancies between the a priori model and the actual system behavior. $g(⋅)g(\cdot)$ is modeled as GP. For simplicity, $ak=(xkˉ,uk)a_{k} = (\bar{x_{k}}, u_{k})$

再来讲一下cost function

定义the cost function to be minimized over the next $K$ time-steps as:
$J(xˉ,u)=(xd−xˉ)TQ(xd−xˉ)+(ud−u)TR(ud−u)J(\bar{x}, u) = (x_{d} - \bar{x})^{T}Q(x_{d} - \bar{x}) + (u_{d} - u)^{T}R(u_{d} - u)$
其中：
$Q$ 是半正定矩阵， $R$ 是正定矩阵
$x_{d} = (x_{d, k+1}, ..., x_{d, k+K})$ ——a sequence of desired states
$x = (x_{k+1}, ..., x_{k+K})$ ——a sequence of uncertain predicted states, $xˉ\bar{x}$ is the sequence of mean values based on $x$
$u_{d} = (u_{d, k}, ..., u_{d, k+K-1})$ ——a sequence of desired inputs
$u = (u_{k}, ..., u_{k+K-1})$ ——a sequence of inputs

接下来就是要定义robust constraint了
从state和input两个角度定义

基于以上基础，我们就可以formulate the following constrained optimization problem:

$xopt,uopt=argminx,uJ(xˉ,u){x_{opt}, u_{opt}} = \underset{x,u}{arg min}J(\bar{x}, u)$ $\bar{x}_{k+i+1} = f(\bar{x}_{k+i} , u_{k+i}) + g(a_{k+i}), i=0, ..., K-1$ $ci(xˉ,u)>0c_{i}(\bar{x}, u) > 0$

整个算法的流程：
在这里插入图片描述
在算法收敛之后，we apply the first element of the resulting optimal control input sequence for one time-step, and start all over at the next time-step.

Predicting uncertain trajectories

state都是正态分布的，所以使用Sigma-Point Transform来iteratively predict state sequences.

定义state $zi=(xˉk+i,μ(ak+i))∈R2nz_{i} = (\bar{x}_{k+i}, \mu(a_{k+i})) \in R^{2n}$ representing the mean state and disturbance at time $k + i$ with uncertainty $Pi=diag(∑k+i,∑gp(ak+i))P_{i} = diag(\sum_{k+i}, \sum_{gp}(a_{k+i}))$