Answers to the Exercises on the Data Foundations of Machine Learning

Latest recommended article published on 2022-09-25 12:15:26

酌酒不舍昼夜

Category: Smale assignments

Article link: https://blog.youkuaiyun.com/XxxHsiung/article/details/116401760

Link to the original exercises

1. Sets, relations, tuples

Exercise 1.1: How many elements does $\{0, 1, \{0, 1\}, \{1, 2\}\}$ have? In machine learning, what are the advantages and disadvantages of sets of this form?

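1. It has $4$ elements: $0$, $1$, $\{0, 1\}$, and $\{1, 2\}$.
2. Advantage: in label-based classification, each element of the set represents one class. In traditional single-label classification, every training sample has exactly one relevant label, such as the numbers $0$ and $1$ in the set above, and that label is drawn from a set of non-overlapping labels. In multi-label classification, a set of this form can also represent samples that carry several labels, such as $\{0, 1\}$ and $\{1, 2\}$. Disadvantage: as the number of labels grows, the amount of information grows with it and data processing becomes more cumbersome.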

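Exercise 1.2: What is the cardinality of $\emptyset$? What about $\{\emptyset\}$?

$\emptyset$ is the empty set, so its cardinality is $0$. $\{\emptyset\}$ is a set containing exactly one element, namely $\emptyset$, so its cardinality is $1$.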
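The two cardinalities can be checked with a small Java sketch. It assumes Java 9 or later (for `Set.of`), and the class and method names (`CardinalityDemo`, `emptySet`, `setOfEmptySet`) are hypothetical, chosen only for illustration.

```java
import java.util.Set;

class CardinalityDemo {
    // The cardinality of a finite set is its number of elements.
    static int cardinality(Set<?> s) {
        return s.size();
    }

    // Models the empty set.
    static Set<Object> emptySet() {
        return Set.of();
    }

    // Models the set whose only element is the empty set.
    static Set<Set<Object>> setOfEmptySet() {
        Set<Object> empty = Set.of();
        return Set.of(empty);
    }
}
```

Calling `cardinality` on the first set gives 0 and on the second gives 1, because the second set has one element even though that element is itself empty.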

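Exercise 1.3: In multi-label learning the output is a vector. Does the corresponding learner count as a function?

I think it does. Each output vector belongs to a set of non-overlapping label vectors, which plays the role of the function's range, while the set of all samples is the function's domain. Every sample in the domain corresponds to exactly one vector in the range, so the learner is a function.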
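To make the argument concrete, here is a minimal Java sketch of a multi-label learner modeled as a function from a feature vector to a 0/1 label vector. The thresholding rule and the names `MultiLabelLearnerSketch` and `LEARNER` are assumptions made up for illustration, not a real learning algorithm.

```java
import java.util.function.Function;

class MultiLabelLearnerSketch {
    // Hypothetical learner: label j is 1 exactly when feature j exceeds 0.5.
    // The point is only that each input maps to one output vector.
    static final Function<double[], int[]> LEARNER = x -> {
        int[] y = new int[x.length];
        for (int j = 0; j < x.length; j++) {
            y[j] = x[j] > 0.5 ? 1 : 0;
        }
        return y;
    };
}
```

Because the mapping is deterministic, applying `LEARNER` twice to the same sample always yields the same label vector, which is what the function property requires.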

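Exercise 1.4: Can a tuple express only the data part of an object, or can it express the object completely (both data and methods/functions)? Illustrate with a concrete program.

It can express the object completely. The class below is a 3-tuple (name, gender, age) bundled with its accessor methods.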

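// A generic 3-tuple (name, gender, age) together with its accessor methods.
public class Child<N, G, A> {

    // Data components of the tuple.
    private final N name;
    private final G gender;
    private final A age;

    public Child(N name, G gender, A age) {
        this.name = name;
        this.gender = gender;
        this.age = age;
    }

    // Method components: one accessor per data component.
    public N getName() {
        return name;
    }

    public G getGender() {
        return gender;
    }

    public A getAge() {
        return age;
    }
}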
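One way to see that a tuple can carry a method as well as data is to make a function one of its components. The following is a minimal sketch under that reading; `Pair`, `TupleWithMethodDemo`, `counter`, and `applyMethod` are hypothetical names introduced only for this example.

```java
import java.util.function.Function;

// A plain 2-tuple; either component may be data or a function.
final class Pair<A, B> {
    final A first;
    final B second;

    Pair(A first, B second) {
        this.first = first;
        this.second = second;
    }
}

class TupleWithMethodDemo {
    // Builds the tuple (start, increment): data first, method second.
    static Pair<Integer, Function<Integer, Integer>> counter(int start) {
        return new Pair<>(start, x -> x + 1);
    }

    // Applies the tuple's method component to its data component.
    static int applyMethod(Pair<Integer, Function<Integer, Integer>> t) {
        return t.second.apply(t.first);
    }
}
```

Here `applyMethod(counter(41))` evaluates the stored function on the stored value, so the tuple alone is enough to describe both the state and the behavior.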

Exercise 1.5: Define a binary tree.

Let $\Sigma = \{\mathrm{l}, \mathrm{r}\}$ be the alphabet and $\phi$ be a null node. A binary tree is a triple $(\mathbf{V}, r, c)$, where $\mathbf{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $r \in \mathbf{V}$ is the root, and $c \colon (\mathbf{V} \cup \{\phi\}) \times \Sigma^* \to \mathbf{V} \cup \{\phi\}$ satisfying
$\forall v \in \mathbf{V}$, $\exists! s \in \Sigma^*$ s.t. $c(r, s) = v$.
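The function $c$ can be sketched in Java by storing the one-letter children and following a path string letter by letter. This is only an illustration: nodes are assumed to be integer ids, $\phi$ is encoded as `-1`, and the names `BinaryTreeSketch`, `setChild`, and `example` are hypothetical. The sketch also assumes $c(\phi, s) = \phi$, which the definition above does not state explicitly.

```java
import java.util.HashMap;
import java.util.Map;

class BinaryTreeSketch {
    // Encodes the null node phi (an assumption of this sketch).
    static final int PHI = -1;

    // One-letter child table: key "parent:letter" -> child id.
    private final Map<String, Integer> child = new HashMap<>();

    void setChild(int parent, char letter, int node) {
        child.put(parent + ":" + letter, node);
    }

    // c(v, s): start at v and follow the letters of s ('l' or 'r').
    int c(int v, String s) {
        int current = v;
        for (char letter : s.toCharArray()) {
            if (current == PHI) {
                return PHI; // assumed: phi has no children
            }
            current = child.getOrDefault(current + ":" + letter, PHI);
        }
        return current;
    }

    // Example tree: root 0, left child 1, right child 2, and 3 = right child of 1.
    static BinaryTreeSketch example() {
        BinaryTreeSketch t = new BinaryTreeSketch();
        t.setChild(0, 'l', 1);
        t.setChild(0, 'r', 2);
        t.setChild(1, 'r', 3);
        return t;
    }
}
```

In the example tree, the empty string leads from the root to itself and the string `lr` leads to node 3, so each node is reached from the root by exactly one string, as the definition requires.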

Exercise 1.6: Define a weighted undirected graph.

A weighted undirected graph is a tuple $G_w = (\mathbf{V}, \mathbf{E}, w)$, where $\mathbf{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $\mathbf{E} \subseteq \mathbf{V} \times \mathbf{V}$ is the set of edges with $\langle v_i, v_j \rangle \in \mathbf{E}$ iff $\langle v_j, v_i \rangle \in \mathbf{E}$, and $w \colon \mathbf{E} \to \mathbb{R}^+ \cup \{0\}$ is the edge weight function with $w(\langle v_i, v_j \rangle) = w(\langle v_j, v_i \rangle)$.
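A minimal Java sketch of this structure stores every edge in both directions so that the symmetry condition holds by construction. Integer node ids and the names `WeightedUndirectedGraphSketch`, `addEdge`, `hasEdge`, and `weight` are assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

class WeightedUndirectedGraphSketch {
    // weights.get(i).get(j) is the weight of the edge <i, j>, if present.
    private final Map<Integer, Map<Integer, Double>> weights = new HashMap<>();

    // Adds <i, j> and <j, i> together with the same non-negative weight.
    void addEdge(int i, int j, double w) {
        if (w < 0) {
            throw new IllegalArgumentException("weight must be non-negative");
        }
        weights.computeIfAbsent(i, k -> new HashMap<>()).put(j, w);
        weights.computeIfAbsent(j, k -> new HashMap<>()).put(i, w);
    }

    boolean hasEdge(int i, int j) {
        Map<Integer, Double> row = weights.get(i);
        return row != null && row.containsKey(j);
    }

    double weight(int i, int j) {
        if (!hasEdge(i, j)) {
            throw new IllegalArgumentException("no such edge");
        }
        return weights.get(i).get(j);
    }
}
```

Storing both directions trades some memory for a simple guarantee: an edge and its reverse are always present together and always carry the same weight.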

2. Alphabets, binary trees, trees

Exercise 9: Taking $\phi$ into account, rewrite Definition 6 to resolve its problem; see its discussion d).

Definition 6. A tree is a triple $(\mathbf{V}, r, p)$, where $\mathbf{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $r \in \mathbf{V}$ is the root, and $p \colon \mathbf{V} \setminus \{r\} \to \mathbf{V}$ is the parent function satisfying
a) $\forall k \geq 1$, $p^k(v) \neq v$, and
b) $\forall v \in \mathbf{V} \setminus \{r\}$, $\exists! k \geq 1$ s.t. $p^k(v) = r$.

A tree is a triple $(\mathbf{V}, r, p)$, where $\mathbf{V} = \{v_1, \dots, v_n\}$ is the set of nodes, $r \in \mathbf{V}$ is the root, $\phi \notin \mathbf{V}$ is a null node, and $p \colon \mathbf{V} \cup \{\phi\} \to \mathbf{V} \cup \{\phi\}$ is the parent function satisfying
a) $\forall v \in \mathbf{V}$, $\forall k \geq 1$, $p^k(v) \neq v$;
b) $\forall v \in \mathbf{V} \setminus \{r\}$, $\exists! k \geq 1$ s.t. $p^k(v) = r$; and
c) $p(r) = \phi$, $p(\phi) = \phi$.
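The conditions on $p$ can be checked mechanically. The sketch below assumes nodes are numbered `0..n-1`, the parent function is given as an array with `parent[v]` standing for $p(v)$, and $\phi$ is encoded as `-1`. The names `ParentTreeSketch` and `isTree` are hypothetical.

```java
class ParentTreeSketch {
    // Encodes the null node phi (an assumption of this sketch).
    static final int PHI = -1;

    // Returns true when parent[] describes a tree rooted at root:
    // p(root) = phi, and every other node reaches root by iterating p.
    static boolean isTree(int[] parent, int root) {
        int n = parent.length;
        if (root < 0 || root >= n || parent[root] != PHI) {
            return false; // violates condition c)
        }
        for (int v = 0; v < n; v++) {
            int current = v;
            int steps = 0;
            while (current != root) {
                current = parent[current];
                steps++;
                // Falling off to phi (or an invalid id) means root is never
                // reached; more than n steps means the walk is stuck in a cycle.
                if (current < 0 || current >= n || steps > n) {
                    return false;
                }
            }
        }
        return true;
    }
}
```

If every node reaches the root and the root's parent is $\phi$, no node can return to itself, so condition a) follows from the same walk that checks condition b).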

3. Finite state automata

Exercise 3.1: Redefine the binary tree in the style of an automaton.

A binary tree is a 5-tuple $(\Sigma, \mathbf{V}', r, \phi, c)$, where
a) $\Sigma$ is the alphabet, $\Sigma = \{\mathrm{l}, \mathrm{r}\}$;
b) $\mathbf{V}'$ is the set of states, $\mathbf{V}' = \mathbf{V} \cup \{\phi\}$, $\mathbf{V} = \{v_1, \dots, v_n\}$;
c) $r \in \mathbf{V}'$ is the root;
d) $\phi \in \mathbf{V}'$ is the terminal state;
e) $c \colon \mathbf{V}' \times \Sigma^* \to \mathbf{V}'$ is the transition function, satisfying $\forall v \in \mathbf{V}$, $\exists! s \in \Sigma^*$ s.t. $c(r, s) = v$.

Exercise 3.2: Redefine the tree in the style of an automaton.

A tree is a 5-tuple $(\Sigma, \mathbf{V}', r, \phi, p)$, where
a) $\Sigma$ is the alphabet, $\Sigma = \{k\}$;
b) $\mathbf{V}'$ is the set of states, $\mathbf{V}' = \mathbf{V} \cup \{\phi\}$, $\mathbf{V} = \{v_1, \dots, v_n\}$;
c) $r \in \mathbf{V}'$ is the root;
d) $\phi \in \mathbf{V}'$ is the terminal state;
e) $p \colon \mathbf{V}' \times \Sigma^* \to \mathbf{V}'$ is the transition function, satisfying $\forall v \in \mathbf{V}$, $\exists! s \in \Sigma^*$ s.t. $p(v, s) = r$.
I am not sure whether the tree should use the function $p$ or the function $c$ here.