Note
This is the second post of the Graph Neural Networks (GNNs) series.
Convolutional graph neural networks (ConvGNNs)
Convolutional graph neural networks (ConvGNNs) generalize the operation of convolution from grid data to graph data. The main idea is to generate a node $v$'s representation by aggregating its own features $\mathbf{x}_v$ and its neighbors' features $\mathbf{x}_u$, where $u \in N(v)$. Different from RecGNNs, ConvGNNs stack a fixed number of graph convolutional layers with different weights to extract high-level node representations.
ConvGNNs fall into two categories:
- spatial-based GCN: Spatial-based approaches inherit ideas from RecGNNs and define graph convolutions by information propagation.
- spectral-based GCN: Spectral-based approaches define graph convolutions by introducing filters from the perspective of graph signal processing, where the graph convolutional operation is interpreted as removing noise from graph signals.
Spatial-based methods have developed rapidly in recent years due to their attractive efficiency, flexibility, and generality. In this post we focus mainly on spatial-based GCNs and leave spectral-based GCNs to the next post. Let's get started.
GCN Framework

As shown in the figure above, the input to a GCN is the entire graph. In each convolutional layer, a convolution operation is performed over the neighbors of each node, and the center node's representation is updated with the result. An activation function such as ReLU is then applied before the output is passed to the next convolutional layer. This process repeats until the number of layers reaches the desired depth (a hyper-parameter).
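The layer-by-layer process above can be sketched in a few lines of numpy. This is a minimal illustration, not the exact formulation from any specific paper: it uses simple mean aggregation over each node's neighborhood (with self-loops), and the toy graph, feature sizes, and helper name `gcn_layer` are all made up for the example.

```python
import numpy as np

def gcn_layer(A, H, W):
    """One graph-convolution layer: each node averages its own and its
    neighbors' features, applies a linear map W, then a ReLU."""
    A_hat = A + np.eye(A.shape[0])             # add self-loops
    D_inv = np.diag(1.0 / A_hat.sum(axis=1))   # degree normalization
    return np.maximum(D_inv @ A_hat @ H @ W, 0.0)

# Toy graph: 3 nodes, edges 0-1 and 1-2
A = np.array([[0., 1., 0.],
              [1., 0., 1.],
              [0., 1., 0.]])
H = np.random.randn(3, 4)    # input node features
W1 = np.random.randn(4, 8)   # layer-1 weights
W2 = np.random.randn(8, 2)   # layer-2 weights (distinct from W1)

H1 = gcn_layer(A, H, W1)
H2 = gcn_layer(A, H1, W2)    # stacked layers, each with its own weights
print(H2.shape)              # (3, 2)
```

Note that each call to `gcn_layer` receives a different weight matrix, which is exactly the point the next section makes about GCN versus RecGNN.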
GCN vs. RecGNN
The main difference between GCN and RecGNN is that each convolutional layer of a GCN has its own unique weights, whereas in a RecGNN the same weights are shared across all layers.
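The parameter-sharing distinction can be made concrete with a tiny sketch (the layer count and dimensions are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
num_layers, dim = 3, 4

# GCN: every layer gets its own, independently trained weight matrix
gcn_weights = [rng.standard_normal((dim, dim)) for _ in range(num_layers)]

# RecGNN: one weight matrix is reused at every propagation step
shared_W = rng.standard_normal((dim, dim))
recgnn_weights = [shared_W] * num_layers

print(len({id(W) for W in gcn_weights}))     # 3 distinct parameter sets
print(len({id(W) for W in recgnn_weights}))  # 1 shared parameter set
```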
What is Convolution
In mathematics, convolution is an operation on two functions $f$ and $g$ that produces a third function $(f * g)$ expressing how the shape of one is modified by the other.
The term convolution is defined as the integral of the product of the two functions after one is reversed and shifted. The mathematical definition is the following:
$$
\begin{array}{c}
(f * g)(t)=\int_{-\infty}^{\infty} f(\tau)\, g(t-\tau)\, d\tau \quad (\text{continuous}) \\
(f * g)(t)=\sum_{\tau=-\infty}^{\infty} f(\tau)\, g(t-\tau) \quad (\text{discrete})
\end{array}
$$
The convolution formula can be described as a weighted average of the function $f(\tau)$ at the moment $t$, where the weighting is given by $g(-\tau)$ shifted by the amount $t$. As $t$ changes, the weighting function emphasizes different parts of the input function.
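The discrete formula can be checked directly in a few lines. The nested loop below is a literal transcription of the sum $\sum_\tau f(\tau)\, g(t-\tau)$, and its output matches `np.convolve`, which implements the same flip-and-shift sum (the input values are arbitrary):

```python
import numpy as np

f = np.array([1., 2., 3.])
g = np.array([0., 1., 0.5])

def conv(f, g):
    """Direct evaluation of (f*g)(t) = sum_tau f(tau) * g(t - tau)."""
    n = len(f) + len(g) - 1
    out = np.zeros(n)
    for t in range(n):
        for tau in range(len(f)):
            if 0 <= t - tau < len(g):   # skip terms outside g's support
                out[t] += f[tau] * g[t - tau]
    return out

print(conv(f, g))         # [0.  1.  2.5 4.  1.5]
print(np.convolve(f, g))  # same result
```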

As shown in the figure above, the filter is moved over one pixel at a time, and this process is repeated until all possible locations in the image have been filtered. At each step, the convolution takes a weighted average of the pixel values of the center pixel and its neighbors. Since the center pixel changes at each step, the convolution emphasizes different parts of the image.
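That sliding-window process can be sketched on a toy 2-D "image" with a $3 \times 3$ averaging filter. The function name `conv2d` and the ramp image are illustrative only; real CNN libraries implement this far more efficiently.

```python
import numpy as np

def conv2d(image, kernel):
    """Slide the kernel over every valid position and take the weighted
    sum of the pixel neighborhood (no padding, stride 1)."""
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1
    ow = image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

image = np.arange(25, dtype=float).reshape(5, 5)  # a 5x5 linear ramp
mean_filter = np.full((3, 3), 1.0 / 9)            # uniform weights -> local average
out = conv2d(image, mean_filter)
print(out.shape)   # (3, 3)
```

Because the toy image is a linear ramp, averaging each $3 \times 3$ neighborhood reproduces its center pixel, so `out` equals `image[1:4, 1:4]`.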
Spatial-based ConvGNNs

Analogous to the convolutional operation of a conventional CNN on an image, spatial-based methods define graph convolutions based on a node’s spatial relations.
Images can be considered a special form of graph, with each pixel representing a node. Each pixel is directly connected to its nearby pixels, as illustrated in the figure above (left). A filter is applied to a $3 \times 3$ patch by taking the weighted aver