Rumor Detection: "Data Fusion Oriented Graph Convolution Network Model for Rumor Detection"

Paper information

Title: Data Fusion Oriented Graph Convolution Network Model for Rumor Detection
Authors: Erxue Min, Yu Rong, Yatao Bian, Tingyang Xu, Peilin Zhao, Junzhou Huang, Sophia Ananiadou
Source: IEEE Transactions on Network and Service Management, 2020
Paper: download
Code: download

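1 Introduction
The paper considers not only static features, such as users' basic information and text content, but also dynamic features, such as rumor propagation relations. The authors also optimize the feature fusion module and the pooling module to improve model performance.
Contributions: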

Considering the real dataset from social media, we extract static features such as users’ basic information and text contents, as well as dynamic features such as rumor propagation relations, and propose the data fusion method.
GCN is introduced into the rumor detection task to represent the rumor propagation pattern. We select a suitable graph convolution operator to update the node vectors, and improve the feature fusion and pooling modules.
Experiments based on the Sina Weibo dataset validate the performance of the proposed GCN-based model for rumor detection.

2 Main
The overall framework is as follows:

[Figure: overall framework of the model]
The model consists of the following four modules:

the feature extraction module
the feature fusion module
the graph convolution module
the pooling module

2.1 Feature Extraction Module
2.1.1 Features of User Basic Information
Common user basic information fields:

[Figure: common user basic information fields]
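These features are included because they correlate with the rumor label. For example, posts from users whose gender is female are reported to have a higher probability of being rumors.
Feature preprocessing:
For gender, a one-hot vector is used.
For follower counts, the usual choice is Min-Max normalization, but for ordinary users (those with few followers) this pushes most values close to $0$. The paper therefore applies a $\text{log}$ transform first: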
$x^{*}=\left\{\begin{array}{ll}\frac{\log x-\log x_{\min }}{\log x_{\max }-\log x_{\min }} & x>0 \\0 & x=0\end{array}\right. \quad\quad\quad(2)$
where $x$ is the follower count before normalization, $x^{*}$ is the normalized value, and $x_{\min }$ and $x_{\max }$ are the minimum and maximum follower counts in the data.
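A minimal sketch of Eq. (2) next to plain Min-Max normalization, to show why the log transform helps. The function names `log_min_max` and `plain_min_max` are hypothetical. The sketch also assumes that $x_{\min}$ and $x_{\max}$ are taken over the positive counts only, since $\log 0$ is undefined. The source does not state this.

```python
import math

def plain_min_max(followers):
    """Ordinary Min-Max normalization, shown only for comparison."""
    lo, hi = min(followers), max(followers)
    if hi == lo:
        return [0.0 for _ in followers]
    return [(x - lo) / (hi - lo) for x in followers]

def log_min_max(followers):
    """Log-scaled Min-Max normalization in the spirit of Eq. (2).

    Assumption: x_min and x_max are computed over positive counts only.
    """
    positive = [x for x in followers if x > 0]
    if not positive:
        return [0.0 for _ in followers]
    log_min = math.log(min(positive))
    log_max = math.log(max(positive))
    span = log_max - log_min
    result = []
    for x in followers:
        if x <= 0 or span == 0:
            # x = 0 maps to 0 by Eq. (2); span == 0 is a degenerate case
            # (all positive counts equal) that this sketch also maps to 0.
            result.append(0.0)
        else:
            result.append((math.log(x) - log_min) / span)
    return result

followers = [0, 10, 100, 1000, 1_000_000]
print(plain_min_max(followers))
print(log_min_max(followers))
```

With this toy input, plain Min-Max maps a user with 1000 followers to 0.001, while the log version maps the same user to about 0.4, so ordinary users are no longer squeezed toward zero by a few very large accounts.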
2.1.2 User Similarity Feature
To capture user similarity, a user-event matrix $M$ is first constructed. With $N_{1}$ users and $N_{2}$ events, $M \in R^{N_{1} \times N_{2}}$. $M$ is expected to be sparse, so the paper applies singular value decomposition (SVD):
$A=U \Sigma V^{T}\quad\quad\quad(3)$
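where $A$ is the matrix to be decomposed, $U$ is the matrix of left singular vectors, $\Sigma$ is a diagonal matrix whose diagonal entries are the singular values, and $V$ is the matrix of right singular vectors. Following the way SVD is used in recommender systems, we keep the top $N$ singular values and multiply $U$ by $\Sigma$ to obtain the user vectors, which reduces the dimensionality. Each user then has an $N$-dimensional vector representation. The closer two user vectors are, the more events the two users took part in together. By the same idea, a user-user matrix can be built whose entries count the events in which both users participated. The same method then produces a second set of user vectors, which is combined with the vectors from the user-event matrix decomposition to form the user similarity feature.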
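The truncated-SVD step can be sketched with NumPy as follows. This is an illustration under assumptions, not the authors' code. The helper name `user_vectors_from_svd` and the toy matrix are hypothetical, the user-user matrix is assumed to be $M M^{T}$, and the two sets of vectors are assumed to be combined by concatenation, which the source does not specify.

```python
import numpy as np

def user_vectors_from_svd(A, n_components):
    """Return the first n_components columns of U * Sigma, one row per user."""
    U, s, _ = np.linalg.svd(A, full_matrices=False)
    k = min(n_components, s.shape[0])
    # Broadcasting scales column j of U by the j-th singular value.
    return U[:, :k] * s[:k]

# Toy user-event matrix: 4 users x 5 events, 1 = the user took part in the event.
M = np.array([
    [1, 1, 0, 0, 0],
    [1, 1, 1, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 1, 1, 1],
], dtype=float)

# Vectors from the user-event matrix.
event_based = user_vectors_from_svd(M, n_components=2)

# User-user matrix: entry (i, j) counts the events shared by users i and j.
# Assumption: computed here as M @ M.T.
C = M @ M.T
cooccurrence_based = user_vectors_from_svd(C, n_components=2)

# Assumption: the two sets of vectors are combined by concatenation.
user_similarity = np.concatenate([event_based, cooccurrence_based], axis=1)
print(user_similarity.shape)  # (4, 4)

# Users 0 and 1 share two events, users 0 and 2 share none.
d01 = np.linalg.norm(user_similarity[0] - user_similarity[1])
d02 = np.linalg.norm(user_similarity[0] - user_similarity[2])
print(d01 < d02)
```

In this toy case the users who share events end up closer together than the users who share none, which matches the intuition described above. For a large sparse matrix, a sparse truncated solver would likely be preferable to a full dense SVD.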
2.1.3 Representation of Text Content
Text representations are extracted with the $BERT_{base}$ Chinese model.
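2.1.4 Feature Fusion Module
Directly concatenating the features makes training unstable. In Fig. 3, $x \in R^{N \times D_{1}}$ and $x^{\prime} \in R^{N \times D_{2}}$ denote two different feature matrices, where $N$ is the number of nodes and $D_{1}$ and $D_{2}$ are the feature dimensions. Each feature is first passed through a two-layer MLP, and batch normalization (BN) is then applied:
$\begin{array}{l}\mu \leftarrow \frac{1}{m} \sum\limits_{i=1}^{m} h_{i} \\\sigma^{2} \leftarrow \frac{1}{m} \sum\limits_{i=1}^{m}\left(h_{i}-\mu\right)^{2} \\\hat{h}_{i} \leftarrow \frac{h_{i}-\mu}{\sqrt{\sigma^{2}+\varepsilon}} \\w_{i} \leftarrow \gamma \hat{h}_{i}+\beta\end{array}$
where $h_{i}$ is the $i$-th MLP output in a batch of size $m$, $\varepsilon$ is a small constant for numerical stability, and $\gamma$ and $\beta$ are learnable parameters.
Finally, the normalized features are concatenated.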
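The fusion steps above (two-layer MLP, BN, then concat) can be sketched in NumPy as a forward pass. This is only a sketch under assumptions: the weights are random rather than trained, the ReLU activation, the hidden and output sizes, and the helper names `mlp2`, `batch_norm` and `branch` are all hypothetical, and the BN statistics are taken over the $N$ nodes.

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp2(x, w1, b1, w2, b2):
    """Two-layer MLP. The ReLU in between is an assumption."""
    hidden = np.maximum(x @ w1 + b1, 0.0)
    return hidden @ w2 + b2

def batch_norm(h, gamma, beta, eps=1e-5):
    """BN as in the formulas above, with statistics over the rows of h."""
    mu = h.mean(axis=0)
    var = h.var(axis=0)  # biased variance, i.e. the 1/m form
    h_hat = (h - mu) / np.sqrt(var + eps)
    return gamma * h_hat + beta

def branch(features, d_hidden, d_out):
    """MLP followed by BN for one feature type (random, untrained weights)."""
    d_in = features.shape[1]
    w1 = rng.normal(scale=0.1, size=(d_in, d_hidden))
    w2 = rng.normal(scale=0.1, size=(d_hidden, d_out))
    h = mlp2(features, w1, np.zeros(d_hidden), w2, np.zeros(d_out))
    # gamma = 1 and beta = 0 are the usual initial values of the learnable parameters.
    return batch_norm(h, gamma=np.ones(d_out), beta=np.zeros(d_out))

N, D1, D2 = 6, 4, 768           # hypothetical sizes
x = rng.normal(size=(N, D1))    # e.g. user features
x_prime = 50.0 * rng.normal(size=(N, D2))  # e.g. text features on a larger scale

fused = np.concatenate([branch(x, 16, 8), branch(x_prime, 16, 8)], axis=1)
print(fused.shape)  # (6, 16)
```

After BN both branches have zero-mean columns on a comparable scale, so neither feature type dominates the concatenated vector purely because of its raw magnitude. That is one plausible reading of why the paper normalizes before concatenating.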
2.1.5 Graph Convolution Module