[Paper Close Reading] A Decision Tree Method for Temporal Logic Inference: A Decision Tree Approach to Data Classification using Signal Temporal Logic

Permalink: https://blog.youkuaiyun.com/WSQ_2000/article/details/123480407

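Preface: structure learning methods based on decision trees have become fairly common in the recent literature, so today I will read this paper closely.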

Main reference:
Bombara, G., Vasile, C.-I., Penedo, F., Yasuoka, H., & Belta, C. (2016, April 11). A Decision Tree Approach to Data Classification using Signal Temporal Logic. Proceedings of the 19th International Conference on Hybrid Systems: Computation and Control. https://doi.org/10.1145/2883817.2883843

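Overview

For formula learning, this paper proposes the following new methods:

Binary decision trees
Impurity measures

It validates them on the following case studies:

Anomaly detection in maritime surveillance
Anomaly monitoring in a fuel control system

Running notes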

1 Introduction

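This paper proposes a decision-tree-based framework for temporal logic inference. Why call it a framework? Because it bundles a family of different algorithms, and you can choose among them depending on the situation.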

2 Related Work

Literature roundup

Parameter learning

[1] Parametric Identification of Temporal Properties
[2] Querying Parametric Temporal Logic Properties on Embedded Systems
[3] Mining Requirements From Closed-Loop Control Models
[4] System design of stochastic models using robustness of temporal properties

DAG-based methods

[1] Anomaly detection in cyber-physical systems: A formal methods approach
[2] Temporal logic inference for classification and prediction from data

Methods that separate structure learning from parameter learning

[1] Data-Driven Statistical Learning of Temporal Logic Properties
[2] Temporal Logic Based Monitoring of Assisted Ventilation in Intensive Care Patients

Spatial Logic Inference

[1] Learning and detecting emergent behavior in networks of cardiac myocytes
[2] A formal methods approach to pattern synthesis in reaction diffusion systems

3 Signal Temporal Logic

The definitions of STL and PSTL used here match those in this reference, so I will not repeat them.

4 Problem Formulation

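As before, the problem addressed in this paper is: given labeled signals $s^i$, find a temporal logic formula $\phi$ that minimizes the misclassification rate:
$\operatorname{MCR}(\phi):=\frac{\left|\left\{s^{i} \mid\left(s^{i} \models \phi \wedge l^{i}=C_{n}\right) \vee\left(s^{i} \not \models \phi \wedge l^{i}=C_{p}\right)\right\}\right|}{N}$
Here $C_n$ and $C_p$ are the labels of the negative and positive classes, $l^i$ is the label of signal $s^i$, and $N$ is the total number of signals.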
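To make the MCR definition concrete, here is a minimal Python sketch. It assumes the satisfaction verdicts $s^i \models \phi$ have already been computed and are passed in as booleans; the function name `misclassification_rate` and the label strings `"Cp"` / `"Cn"` are illustrative choices, not taken from the paper.

```python
from typing import Sequence


def misclassification_rate(satisfied: Sequence[bool], labels: Sequence[str],
                           pos: str = "Cp", neg: str = "Cn") -> float:
    """Fraction of signals whose label disagrees with the formula's verdict."""
    if len(satisfied) != len(labels) or not labels:
        raise ValueError("need one verdict per label, and at least one signal")
    errors = sum(
        1
        for sat, lab in zip(satisfied, labels)
        # A signal is misclassified if it satisfies phi but is negative,
        # or violates phi but is positive.
        if (sat and lab == neg) or (not sat and lab == pos)
    )
    return errors / len(labels)


# Hypothetical verdicts and labels for five signals.
satisfied = [True, True, False, False, True]
labels = ["Cp", "Cn", "Cn", "Cp", "Cp"]
mcr = misclassification_rate(satisfied, labels)
```

In this toy data the second and fourth signals are misclassified, so two of five signals count toward the numerator.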

5 Learning Decision Trees

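Key idea: establish the correspondence between STL subformulas and the decision tree.
Binary decision tree: every non-leaf node splits the data between two child nodes, and each leaf node gives the class of the data. In the paper's words: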

In this paper, we focus on binary decision trees, where every non-terminal node splits the data into two children nodes and every leaf node predicts a label.

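The tree-growing algorithm has three meta-parameters:
1. Candidate splits: a set of formulas $\phi$
2. Best-split selection: how to choose the best $\phi^*$
3. Stopping condition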
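For the second meta-parameter, decision-tree learners commonly score a candidate split by how much it reduces label impurity. The sketch below uses entropy-based information gain as one such score; that it matches the exact measure used in the paper is an assumption, and the helper names `entropy` and `information_gain` are illustrative.

```python
import math
from collections import Counter
from typing import Sequence


def entropy(labels: Sequence[str]) -> float:
    """Shannon entropy (in bits) of the label distribution; 0 for an empty set."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(parent: Sequence[str], left: Sequence[str],
                     right: Sequence[str]) -> float:
    """Entropy of the parent minus the size-weighted entropy of the two children."""
    n = len(parent)
    weighted = (len(left) / n) * entropy(left) + (len(right) / n) * entropy(right)
    return entropy(parent) - weighted


# Hypothetical labels at a node, and two candidate ways to split them.
parent = ["Cp", "Cp", "Cn", "Cn"]
perfect = information_gain(parent, ["Cp", "Cp"], ["Cn", "Cn"])
useless = information_gain(parent, ["Cp", "Cn"], ["Cp", "Cn"])
```

A split that separates the classes completely gets the highest gain, while one that leaves both children as mixed as the parent gets none, so picking the formula with the largest score is a natural way to choose $\phi^*$.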

5.1 Parameterized Learning Algorithm

The first algorithm, buildTree, describes how the decision tree is constructed.
[Figure: pseudocode of the buildTree algorithm]

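To follow the pseudocode, first understand what its functions mean:

$buildTree(\phi^{path},S,h)$ takes three arguments: $\phi^{path}$ is the formula represented by the path from the root to the current node, $S$ is a set of labeled signals, and $h$ is the depth of the current node. It returns the decision tree rooted at the current node.
$stop(\phi^{path},h,S)$ also takes three arguments and decides whether they meet the stopping condition of the recursion.
$leaf(c)$ creates a leaf node carrying the label $c$.
$non\_terminal(\phi)$ creates a non-leaf node holding a primitive (atomic) formula $\phi$, drawn from the set of primitives $\mathcal{P}$.
$J(S,S_\top,S_\perp)$ takes the signals and their partition into satisfying and violating subsets and scores that split; the best subformula $\phi^*$ at the node is the candidate that maximizes $J$ (that is, it selects the best split).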
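The roles of these functions can be sketched in a small runnable toy. This is not the paper's algorithm: it assumes one-dimensional signals stored as lists of floats, a fixed hypothetical primitive set (`F(x > c)` and `G(x > c)` over the whole signal, with a small threshold grid instead of parameter optimization), a misclassification-gain `J`, and a depth limit of 3 in `stop`. Because each recursive call only receives the signals that already followed the path, the split is evaluated on the primitive alone.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

Signal = List[float]
Labeled = Tuple[Signal, str]


@dataclass
class Primitive:
    name: str
    holds: Callable[[Signal], bool]


@dataclass
class Node:
    label: Optional[str] = None      # set on leaves
    phi: Optional[Primitive] = None  # set on non-terminal nodes
    left: Optional["Node"] = None    # subtree for signals satisfying phi
    right: Optional["Node"] = None   # subtree for signals violating phi


def eventually_above(c: float) -> Primitive:
    return Primitive(f"F(x > {c})", lambda s: max(s) > c)


def always_above(c: float) -> Primitive:
    return Primitive(f"G(x > {c})", lambda s: min(s) > c)


def majority(S: List[Labeled]) -> str:
    return Counter(lab for _, lab in S).most_common(1)[0][0]


def errors(S: List[Labeled]) -> int:
    """Signals outside the majority class (0 for an empty set)."""
    if not S:
        return 0
    return len(S) - Counter(lab for _, lab in S).most_common(1)[0][1]


def J(S, S_top, S_bot) -> float:
    """Misclassification gain: drop in majority-vote errors after the split."""
    return (errors(S) - errors(S_top) - errors(S_bot)) / len(S)


def stop(path: List[str], h: int, S: List[Labeled], max_depth: int = 3) -> bool:
    return h >= max_depth or errors(S) == 0


def build_tree(path: List[str], S: List[Labeled], h: int,
               primitives: List[Primitive]) -> Node:
    if stop(path, h, S):
        return Node(label=majority(S))
    best, best_gain, best_split = None, 0.0, None
    for phi in primitives:
        S_top = [x for x in S if phi.holds(x[0])]
        S_bot = [x for x in S if not phi.holds(x[0])]
        if not S_top or not S_bot:
            continue  # a split that sends everything one way is useless
        gain = J(S, S_top, S_bot)
        if gain > best_gain:
            best, best_gain, best_split = phi, gain, (S_top, S_bot)
    if best is None:  # no primitive improves on a plain majority vote
        return Node(label=majority(S))
    S_top, S_bot = best_split
    return Node(
        phi=best,
        left=build_tree(path + [best.name], S_top, h + 1, primitives),
        right=build_tree(path + ["not " + best.name], S_bot, h + 1, primitives),
    )


def classify(node: Node, s: Signal) -> str:
    while node.label is None:
        node = node.left if node.phi.holds(s) else node.right
    return node.label


# Hypothetical data: positive signals spike above 1, negative ones stay low.
primitives = [make(c) for c in (1.0, 2.0, 3.0)
              for make in (eventually_above, always_above)]
S = [([0.0, 0.5, 0.2], "Cn"), ([0.1, 0.9, 0.3], "Cn"),
     ([0.0, 2.5, 0.1], "Cp"), ([1.5, 3.5, 1.2], "Cp")]
tree = build_tree([], S, 0, primitives)
prediction = classify(tree, [0.2, 1.4, 0.0])
```

On this data the first primitive that separates the two classes is kept at the root and both children become pure leaves; ties between equally good primitives are broken by list order, which is a simplification of this sketch.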