References:
Motivation for graph convolutions
Enumerating the desirable traits of image convolutions, we arrive at the following properties we would ideally like our graph convolutional layer to have:
- Computational and storage efficiency (requiring no more than O ( V + E ) O(V+E) O(V+E) time and memory);
- Fixed number of parameters (independent of input graph size);
- Localization (acting on a local neighborhood of a node);
- Ability to specify arbitrary importances to different neighbors;
- Applicability to inductive problems (arbitrary, unseen graph structures).
Towards a viable graph convolution
A graph of n n n nodes:
- a set of node features: ( h ⃗ 1 , h ⃗ 2 , . . . , h ⃗ n ) (\vec{h}_1,\vec{h}_2,..., \vec{h}_n) (h1,h2,...,hn)
- an adjacency matrix A A A: A i j = 1 A_{ij}=1 Aij=1 if i i i and j j j are connected, and 0 otherwise
- A graph convolutional layer then computes a set of new node features ( h ⃗ 1 ′ , h ⃗ 2 ′ , . . . , h ⃗ n ′ ) (\vec{h}_1',\vec{h}_2',..., \vec{h}_n') (h