Paper Notes: IntentGC

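These notes cover IntentGC, a scalable graph convolution framework that fuses heterogeneous information to address the sparsity of user-item interactions on websites. The framework collects rich user behavior and item information, uses graph convolutional networks to model explicit preferences and heterogeneous relationships together, learns the importance of different relationships automatically, and introduces a faster graph convolution model, IntentNet, for large-scale applications.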


IntentGC: a Scalable Graph Convolution Framework Fusing Heterogeneous Information for Recommendation
  • LINK: https://arxiv.org/abs/1907.12377

  • CLASSIFICATION: RECOMMENDER-SYSTEM, HETEROGENEOUS NETWORK, GCN

  • YEAR: Submitted on 24 Jul 2019

  • FROM: KDD 2019

  • WHAT PROBLEM TO SOLVE: The sparsity of user-item interactions (i.e., explicit preferences) on websites remains a big challenge for predicting users’ behaviors. Although research efforts have been made to utilize some auxiliary information (e.g., social relations between users) to solve the problem, the existing rich heterogeneous auxiliary relationships are still not fully exploited. Moreover, previous works relied on linearly combined regularizers and suffered from heavy parameter tuning.

    We find that all previous works only captured one type of auxiliary information for users and/or one type for items in the model, while ignoring plenty of additional heterogeneous relationships on the graph.

    image.png
  • SOLUTION: In this work, we collect abundant relationships from common user behaviors and item information, and propose a novel framework named IntentGC to leverage both explicit preferences and heterogeneous relationships by graph convolutional networks. In addition to the capability of modeling heterogeneity, IntentGC can learn the importance of different relationships automatically by the neural model in a nonlinear sense. To apply IntentGC to web-scale applications, we design a faster graph convolutional model named IntentNet by avoiding unnecessary feature interactions.

  • CORE POINT:

    • Difference between IntentGC and many state-of-the-art GCN models

      1. Those models consider only item information, ignoring users and auxiliary objects.
      2. To scale up, GraphSAGE needs to sample many clustered mini-graphs of items for embedding reuse. However, it is hard to find such clustered mini-graphs that contain both users and items, due to the sparsity issue mentioned above.
      3. Those methods are designed for homogeneous networks, while the user-item graphs studied in this work are heterogeneous.
    • Innovative points of IntentGC

      1. Fully exploiting auxiliary information

        To facilitate modeling and improve robustness, we translate auxiliary relationships of first-order proximity into more robust weighted relationships of second-order proximity.

        With different types of auxiliary objects, we can generate heterogeneous relationships of second-order proximity. IntentGC automatically determines the weights of different types of relationships in training.

        image.png
      2. Faster graph convolution

        The key idea of IntentNet is to avoid unnecessary feature interactions by dividing the functionality of graph convolution into two components: a vector-wise convolution component for neighborhood feature propagation and a fully-connected network component for node feature interaction.

      3. Dual graph convolution in heterogeneous networks

        First, we take advantage of two independent IntentNets that separately operate on user nodes and item nodes. After nonlinear projection through the fully-connected network in the respective IntentNet, the obtained embeddings of users and items can be deemed to form a common space. Then, with training guided by explicit preferences, relevance can be assessed between users and items in the space.

    • Usage of auxiliary information

      Unlike previous works that capture auxiliary relationships in the objective function with a regularizer, which is linear and depends heavily on handcrafted parameter tuning, our method can automatically learn the importance of different auxiliary relationships through a nonlinear neural network.

      We note that auxiliary information could also be designed as node input features. However, nodes sharing some input features would not be near in the high-level embedding space due to the complex neural network projection.

    • User-Item recommendation

      We can formulate the user-item recommendation problem as a link prediction problem on a graph, as follows:

      Input: A HIN $G = (V, E)$ based on historical data.

      Output: A predicted edge set $\hat{E}^p_{label}$, which is the prediction of the real edge set $E^p_{label}$ on $G^p$.
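
To make the input/output formulation concrete, here is a minimal sketch of how a predicted edge set could be produced once some user-item scoring function is available. The function `predict_edges`, the dot-product score, the top-`k` cutoff, the exclusion of already-known edges, and the toy embeddings are all assumptions for illustration; the notes do not specify how the predicted set is selected.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def predict_edges(users, items, known_edges, score, k=1):
    """Return a predicted edge set: the top-k unseen items per user.

    `score(u, v)` is any relevance function; higher means more relevant.
    """
    predicted = set()
    for u in users:
        # Only rank items the user has no historical edge to (assumption).
        candidates = [v for v in items if (u, v) not in known_edges]
        candidates.sort(key=lambda v: score(u, v), reverse=True)
        predicted.update((u, v) for v in candidates[:k])
    return predicted

# Hypothetical toy embeddings in a shared user-item space.
z_user = {"u1": [1.0, 0.0], "u2": [0.0, 1.0]}
z_item = {"a": [0.9, 0.1], "b": [0.1, 0.9], "c": [0.5, 0.4]}
known = {("u1", "a")}

predicted = predict_edges(
    z_user, z_item, known,
    score=lambda u, v: dot(z_user[u], z_item[v]),
)
print(sorted(predicted))  # [('u1', 'c'), ('u2', 'b')]
```

Here `u1` already has an edge to `a`, so the best remaining item `c` is predicted instead.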

    • Methodology

      • Network Translation

        In this paper, we utilize the second-order proximity to capture the similarity between two users (or items), which is measured by the number of common auxiliary neighbors of the same type shared by them.
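
The translation step described above can be sketched as counting shared auxiliary neighbors. This is a minimal illustration under assumptions: the function name `second_order_edges`, the edge-list input format, and the toy user-query data are hypothetical, and the paper may apply additional weighting or filtering that is not shown here.

```python
from collections import defaultdict
from itertools import combinations

def second_order_edges(first_order_edges):
    """Translate (node, auxiliary) edges into weighted node-node edges.

    The weight of (n1, n2) is the number of auxiliary nodes of this type
    that both n1 and n2 are connected to.
    """
    # Group nodes by the auxiliary node they connect to.
    nodes_by_aux = defaultdict(set)
    for node, aux in first_order_edges:
        nodes_by_aux[aux].add(node)

    # Every pair of nodes under the same auxiliary node shares one neighbor.
    weights = defaultdict(int)
    for nodes in nodes_by_aux.values():
        for n1, n2 in combinations(sorted(nodes), 2):
            weights[(n1, n2)] += 1
    return dict(weights)

# Hypothetical toy data: users linked to the search queries they submitted.
user_query_edges = [
    ("u1", "q_shoes"), ("u2", "q_shoes"),
    ("u1", "q_socks"), ("u2", "q_socks"), ("u3", "q_socks"),
]
user_user = second_order_edges(user_query_edges)
print(user_user)  # {('u1', 'u2'): 2, ('u1', 'u3'): 1, ('u2', 'u3'): 1}
```

Running the function once per auxiliary type (one edge list per type) would yield one weighted relationship per type, which matches the idea of heterogeneous second-order relationships.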

      • Faster Convolutional Network: IntentNet

        image.png
        • Vector-wise convolution operation

          During representation learning, the convolution operation has two main tasks. One is to learn the interactions between the self node and its neighborhood, which determines how the neighborhood boosts the results. The other is to learn the interactions between different dimensions of the embedding space, which automatically extracts useful combinatory features.

          A key insight is that the interaction between feature $h_i$ in $h^{k-1}_u$ and $h_j$ ($j \neq i$) in $h^{k-1}_{N(u)}$ is less informative.

          Based on this observation, we designed a vector-wise convolution function in the following:

          image.png image.png image.png

          Each local filter can be viewed as learning how the self node and its neighborhood interact in a vector-wise manner. Using multiple local filters ensures a rich information extraction capability.
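
Since the equations are not reproduced in these notes, the following is only a sketch of one plausible form of a vector-wise convolution, not the paper's exact definition. It assumes that each local filter holds two scalar weights (one for the self vector, one for the neighborhood vector), that filter outputs are combined with scalar weights `thetas`, that the neighborhood is aggregated by a plain mean, and that the nonlinearity is a sigmoid. All names and values are hypothetical.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def mean_aggregate(neighbors):
    """Average the neighbor embeddings dimension by dimension (assumed aggregator)."""
    return [sum(col) / len(neighbors) for col in zip(*neighbors)]

def vector_wise_conv(h_self, h_neigh, filters, thetas):
    """One vector-wise convolution step (assumed form).

    h_self, h_neigh: node and aggregated-neighborhood embeddings, length m.
    filters: L pairs (w_self, w_neigh) of scalar weights, one per local filter.
    thetas: L scalar weights that combine the filter outputs.
    """
    m = len(h_self)
    out = [0.0] * m
    for (w_self, w_neigh), theta in zip(filters, thetas):
        # Dimension d of the node is mixed only with dimension d of the
        # neighborhood, so no cross-dimension interaction is computed.
        for d in range(m):
            out[d] += theta * sigmoid(w_self * h_self[d] + w_neigh * h_neigh[d])
    return [sigmoid(x) for x in out]

# Hypothetical toy inputs: m = 4 dimensions, 2 neighbors, L = 3 filters.
h_self = [0.1, -0.2, 0.3, 0.4]
h_neigh = mean_aggregate([[0.2, 0.0, -0.1, 0.5], [0.4, 0.2, 0.1, 0.3]])
filters = [(0.5, 0.5), (1.0, 0.3), (0.2, 0.8)]
thetas = [0.3, 0.3, 0.4]
h_next = vector_wise_conv(h_self, h_neigh, filters, thetas)
print(h_next)
```

Under these assumptions the step costs about m * L operations after aggregation, because the weights are scalars rather than an m-by-m matrix. Cross-dimension feature interaction is left to the fully-connected layers.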

        • IntentNet

          The core idea of IntentNet is to divide the work of graph convolution into two components: vector-wise convolution for learning the neighborhood’s utility, and fully-connected layers for extracting the node-level combinatory features.

          In practice, IntentGC is not only more efficient than conventional GCNs but also more effective in performance. A probable reason is that IntentGC can avoid useless feature interactions and is more robust to overfitting.

        • Complexity

          Let $m$ denote the size of the representation vectors in each layer, $\rho$ the size of the neighborhood (ρ-neighborhood), and $L$ the number of local filters.

          The complexity of the convolution operation:

          1. IntentNet: $O(m \cdot (\rho + L)) \approx O(m)$
          2. GraphSAGE: $O(m \cdot (\rho + m)) \approx O(m^2)$

          q-stacked graph convolution:

          1. IntentNet: $O(\rho^{q-1} \cdot m + m^2)$
          2. GraphSAGE: $O(\rho^{q-1} \cdot m^2)$
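
To get a feel for the gap between these bounds, the expressions can be evaluated for concrete sizes. The values below (m = 128, ρ = 10, L = 3, q = 3) are hypothetical and chosen only for illustration; constant factors hidden by the O-notation are ignored, so the numbers indicate orders of magnitude rather than measured cost.

```python
def intentnet_conv(m, rho, L):
    """Single convolution, IntentNet: m * (rho + L)."""
    return m * (rho + L)

def graphsage_conv(m, rho):
    """Single convolution, GraphSAGE: m * (rho + m)."""
    return m * (rho + m)

def intentnet_stacked(m, rho, q):
    """q-stacked convolution, IntentNet: rho^(q-1) * m + m^2."""
    return rho ** (q - 1) * m + m ** 2

def graphsage_stacked(m, rho, q):
    """q-stacked convolution, GraphSAGE: rho^(q-1) * m^2."""
    return rho ** (q - 1) * m ** 2

# Hypothetical sizes, not taken from the paper.
m, rho, L, q = 128, 10, 3, 3

print(intentnet_conv(m, rho, L))      # 1664
print(graphsage_conv(m, rho))         # 17664
print(intentnet_stacked(m, rho, q))   # 29184
print(graphsage_stacked(m, rho, q))   # 1638400
```

With these sizes the stacked GraphSAGE bound is roughly 56 times the IntentNet bound, because the m² factor is multiplied by ρ^(q−1) in one case and only added in the other.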
        • Heterogeneous relationships

          image.png image.png
      • Dual Graph Convolution in HIN

        We employ two IntentNets, IntentNet$_u$ and IntentNet$_v$, for users and items respectively. By iteratively running q rounds of convolutional forward propagation as in Eq (1), Eq (5) and Eq (4), plus additional dense forward propagation via the fully-connected layers, we obtain the final user and item representations $z_u$ and $z_v$ from IntentNet$_u$ and IntentNet$_v$ respectively.

        Triplet loss function:

        image.png
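
The loss formula itself is not reproduced in these notes. As a rough sketch, a common max-margin triplet form is assumed here: the score of a negative item should fall below the score of a positive item by at least a margin, with scores taken as dot products of the final representations. The margin value, the dot-product score, and the toy vectors are assumptions, not the paper's settings.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def triplet_loss(z_user, z_pos, z_neg, margin=0.1):
    """Max-margin triplet loss (assumed form).

    Zero when the positive item outscores the negative item by at least
    `margin`; otherwise grows linearly with the violation.
    """
    return max(0.0, dot(z_user, z_neg) - dot(z_user, z_pos) + margin)

# Hypothetical toy embeddings for one user, one clicked item, one sampled negative.
z_u = [1.0, 0.0]
z_pos = [0.9, 0.1]
z_neg = [0.2, 0.8]

well_ranked = triplet_loss(z_u, z_pos, z_neg)   # positive already ahead
mis_ranked = triplet_loss(z_u, z_neg, z_pos)    # roles swapped: violation
print(well_ranked, mis_ranked)
```

In the first call the positive item already leads by more than the margin, so the loss is zero and contributes no gradient. In the second call the ranking is violated and the loss is positive.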
      • The IntentGC Framework

        image.png
    • Experiments

      • Datasets

        image.png
      • Compared methods

        DeepWalk, GraphSAGE, DSPR, Metapath2vec++, BiNE, IntentGC(Single), IntentGC(All)

      • Hyper-parameter settings

        image.png
      • System architecture

        image.png
  • EXISTING PROBLEMS: The Taobao dataset is not available.

  • IMPROVEMENT IDEAS: None so far.
