COVID-19 假新闻检测与零售场景下的视觉注意力研究
一、COVID - 19 假新闻检测
- 特征提取算法
特征提取是假新闻检测的基础步骤,其算法流程如下:
Algorithm 1: Feature extraction
Data: Truthful news directory dT, Fake news directory dF
Result: Scaled train and test features datasets Strain, Stest, train and test
principal components PCtrain, PCtest
DT, LT ←load data and labels(dT);
DF, LF ←load data and labels(dF);
D ←DT ∪DF;
L ←LT ∪LF;
Dtrain, Ltrain, Dtest, Ltest ←train test split(D, L, test size = 20%);
ME ←train Doc2Vec LM(Dtrain);
Etrain, Etest ←encode documents(Dtrain, Dtest, ME);
f1train, f1test ←get documents length(Dtrain, Dtest);
f2train, f2test ←get exclamation ratio(Dtrain, Dtest);
f3train, f3test ←get caps percentage(Dtrain, Dtest);
f