JUST-Are Meta-Paths Necessary

zhou_zjuer

Article link: https://blog.youkuaiyun.com/baidu_40614951/article/details/97246220

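In heterogeneous graphs, traditional methods are biased toward highly visible domains. The authors propose the JUST algorithm, which probabilistically balances domain selection during random walks to avoid this bias and to sidestep the uncertainty of choosing meta-paths. The algorithm weighs the probability of jumping to another domain against that of staying in the current one, and controls it with an exponential decay function.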

Abstract

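In a heterogeneous graph, whose nodes come from different domains, the traditional random walk drifts toward highly visible domains, that is, domains whose nodes hold a dominant share of the edges.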

In the context of a heterogeneous graph, which contains nodes from different domains, classical random walks are biased towards highly visible domains where nodes are associated with a dominant number of paths.

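Meta-paths can be used to guide the random walk, but they require prior knowledge or extra computation. The authors therefore propose JUST (JUmp and STay) to remove this bias from the random walk.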

Intro

Current research relies mainly on meta-paths to sample node sequences for network embedding.

Concretely, a meta-path is a sequence of node types, and that sequence encodes (that is, captures) the composite relation among the nodes involved.

Specifically, a meta-path is defined as a sequence of node types encoding key composite relations among the involved node types.

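For example, in a bibliographic network with authors (A), papers (P), and venues (V), A-P-A means that two authors co-authored a paper, while A-P-V-P-A means that two authors each published a paper at the same venue.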
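To make the notion of a meta-path instance concrete, here is a minimal sketch that checks whether a node sequence follows a given meta-path. It uses the usual bibliographic convention (A = author, P = paper, V = venue). The toy graph, the node names, and the helper `matches_meta_path` are all made up for illustration and do not come from the paper.

```python
# Toy bibliographic graph; every node name here is hypothetical.
node_type = {
    "a1": "A", "a2": "A", "a3": "A",
    "p1": "P", "p2": "P", "p3": "P",
    "v1": "V",
}
edges = {
    ("a1", "p1"), ("a2", "p1"), ("a2", "p2"),
    ("a3", "p3"), ("p1", "v1"), ("p3", "v1"),
}

def connected(u, v):
    """Edges are undirected, so check both orientations."""
    return (u, v) in edges or (v, u) in edges

def matches_meta_path(path, meta_path):
    """Return True if `path` is an instance of `meta_path`, e.g. "A-P-A"."""
    types = meta_path.split("-")
    if len(path) != len(types):
        return False
    # Every node must have the type the meta-path prescribes at its position.
    if any(node_type[n] != t for n, t in zip(path, types)):
        return False
    # Consecutive nodes must actually be linked in the graph.
    return all(connected(u, v) for u, v in zip(path, path[1:]))

coauthors = matches_meta_path(["a1", "p1", "a2"], "A-P-A")
same_venue = matches_meta_path(["a1", "p1", "v1", "p3", "a3"], "A-P-V-P-A")
not_instance = matches_meta_path(["a1", "p2", "a2"], "A-P-A")  # a1 did not write p2
```

A meta-path-guided walk only ever follows sequences for which such a check succeeds, which is why the choice of meta-path shapes the sampled corpus so strongly.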

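However, how to select meta-paths for a given graph remains unclear. Because the choice of meta-paths has a large effect on the quality of the learned embeddings, meta-paths are a double-edged sword.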

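The algorithm proposed in the paper aims to balance the probability with which each domain is chosen during the random walk.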

Specifically, when performing random walks over a heterogeneous graph, we choose the next node either by jumping to one of the other data domains, or staying in the same data domain. The key idea of our solution is to probabilistically balance these two options, in order to avoid the above-mentioned bias.

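Two decisions matter:

Jump or stay?
- An exponential decay function controls the probability of staying.
If the walk jumps, where to?
- The target is chosen with respect to the m most recently visited domains.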

Notation

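Heterogeneous graph: $G = (V, E)$. For every $v \in V$ there is a mapping function $\phi$ with $\phi(v) = q$, where $q$ denotes the domain the node belongs to.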

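$e_{he}$: a heterogeneous edge, whose two endpoints lie in different domains.
$e_{ho}$: a homogeneous edge, whose two endpoints lie in the same domain.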
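The notation maps directly onto simple data structures. The sketch below stores $\phi$ as a dictionary and splits an edge list into heterogeneous and homogeneous edges. The toy graph and the helper name `split_edges` are assumptions for illustration only.

```python
# Hypothetical toy graph: phi maps each node to its domain.
phi = {
    "a1": "author", "a2": "author",
    "p1": "paper", "p2": "paper",
    "v1": "venue",
}
edge_list = [("a1", "p1"), ("a2", "p1"), ("a1", "a2"), ("p1", "p2"), ("p2", "v1")]

def split_edges(edge_list, phi):
    """Partition edges by whether their endpoints share a domain."""
    hetero = [(u, v) for u, v in edge_list if phi[u] != phi[v]]  # e_he
    homo = [(u, v) for u, v in edge_list if phi[u] == phi[v]]    # e_ho
    return hetero, homo

e_he, e_ho = split_edges(edge_list, phi)
```

Here `e_he` holds the author-paper and paper-venue links, while `e_ho` holds the author-author and paper-paper links.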

The JUST algorithm

Jump or stay
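$Pr_{stay}(v_i) = \alpha^{l}$, where $\alpha \in [0, 1]$ is the initial stay probability and $l$ is the number of nodes visited consecutively in the current domain.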
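The exponential decay is easy to see numerically. The following sketch evaluates the stay probability for growing $l$. The function name and its two boolean flags are hypothetical, and the two boundary cases (no same-domain neighbor, no other-domain neighbor) are an assumption about how a node with only one option should be handled, not something stated in the notes above.

```python
def stay_probability(alpha, l, has_same_domain_neighbor=True,
                     has_other_domain_neighbor=True):
    """Probability of staying in the current domain after l consecutive visits."""
    if not has_same_domain_neighbor:
        return 0.0  # assumption: nothing to stay on, so the walk must jump
    if not has_other_domain_neighbor:
        return 1.0  # assumption: nowhere to jump to, so the walk must stay
    return alpha ** l  # exponential decay in l

probs = [stay_probability(0.5, l) for l in range(1, 5)]
# probs == [0.5, 0.25, 0.125, 0.0625]
```

The longer the walk lingers in one domain, the less likely it is to stay, which is what keeps a highly visible domain from absorbing the walk.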
Where to Jump

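Which domain q should the walk jump to? One is picked at random from the domains outside the m most recently visited ones, which balances the distribution of sampled nodes across domains.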
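A bounded queue is a natural fit for the history of the m most recent domains. The sketch below samples a target domain outside that history. The function name, the `candidate_domains` argument (domains reachable from the current node), and the fallback used when every candidate is already in the history are assumptions made for illustration.

```python
import random
from collections import deque

def sample_target_domain(candidate_domains, q_hist, current_domain, rng=random):
    """Pick a jump target, preferring domains not in the recent history."""
    fresh = [q for q in candidate_domains if q not in q_hist]
    if not fresh:
        # Assumed fallback: every candidate was visited recently,
        # so accept any domain other than the current one.
        fresh = [q for q in candidate_domains if q != current_domain]
    return rng.choice(fresh)

# History of the m = 2 most recently visited domains (oldest first).
q_hist = deque(["author", "paper"], maxlen=2)
target = sample_target_domain(["author", "paper", "venue"], q_hist, "paper")
q_hist.append(target)  # the oldest entry is dropped automatically
```

With `maxlen=m`, appending the new domain evicts the oldest one, so the history never needs manual trimming. The sketch assumes at least one candidate differs from the current domain.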

Truncated Random Walk

The random walk stops once its length reaches $L_{MAX}$.

Pseudocode

// Input: heterogeneous graph G = (V,E), initial stay probability a, number of
// memorized domains m, number of random walks per node r, maximum walk length L
Initialize W = NULL
for i = 1 to r do
    for each v in V do
        Initialize a random walk w by adding v
        Initialize Qhist by adding phi(v)
        while |w| < L do
            Pick a Jump or Stay decision // stay in the current domain or jump to another
            if Stay then
                Continue w by staying
            else if Jump then
                Sample a target domain q
                Continue w by jumping to q
                Update Qhist
            end if
        end while
        Add w to W
    end for
end for
return W
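The pseudocode can be turned into a short runnable sketch. This is one possible reading, not the authors' implementation: the adjacency-dictionary input, the function name `just_walks`, the uniform choice of the next node within the chosen domain, the handling of nodes with only one kind of neighbor, and the fallback when all reachable domains are in the history are assumptions.

```python
import random
from collections import deque

def just_walks(adj, phi, alpha=0.5, m=1, r=1, max_len=10, seed=0):
    """Generate JUST-style walks. adj: node -> neighbors, phi: node -> domain."""
    rng = random.Random(seed)
    walks = []
    for _ in range(r):
        for start in adj:
            walk = [start]
            q_hist = deque([phi[start]], maxlen=m)  # last m visited domains
            l = 1  # nodes visited consecutively in the current domain
            while len(walk) < max_len:
                cur = walk[-1]
                same = [v for v in adj[cur] if phi[v] == phi[cur]]
                other = [v for v in adj[cur] if phi[v] != phi[cur]]
                if not same and not other:
                    break  # isolated node: the walk cannot continue
                if not same:
                    p_stay = 0.0  # assumption: forced jump
                elif not other:
                    p_stay = 1.0  # assumption: forced stay
                else:
                    p_stay = alpha ** l
                if rng.random() < p_stay:
                    walk.append(rng.choice(same))  # Stay
                    l += 1
                else:
                    domains = sorted({phi[v] for v in other})
                    # Prefer domains outside the history; otherwise any reachable one.
                    fresh = [q for q in domains if q not in q_hist] or domains
                    q = rng.choice(fresh)
                    walk.append(rng.choice([v for v in other if phi[v] == q]))  # Jump
                    q_hist.append(q)
                    l = 1
            walks.append(walk)
    return walks

# Hypothetical toy graph: two authors, one paper, one venue.
adj = {
    "a1": ["a2", "p1"],
    "a2": ["a1", "p1"],
    "p1": ["a1", "a2", "v1"],
    "v1": ["p1"],
}
phi = {"a1": "A", "a2": "A", "p1": "P", "v1": "V"}
walks = just_walks(adj, phi, alpha=0.5, m=2, r=2, max_len=5)
```

Each of the 4 nodes starts `r = 2` walks, so `walks` holds 8 sequences of 5 nodes each. No meta-path is supplied anywhere; the domain balance comes only from `alpha` and `m`.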

A walkthrough of the code is available here.