1. 稀疏矩阵(sparse matrix)
2. feature selection or feature filtering
参考文献:
A. Caliskan-Islam, R. Harang, A. Liu, A. Narayanan, C. Voss, F. Yamaguchi, and R. Greenstadt. 2015. De-anonymizing Programmers via Code Stylometry. In 24th USENIX Security Symposium (USENIX Security 15). USENIX Association, Washington, D.C., 255–270. https://www.usenix.org/conference/usenixsecurity15/
technical-sessions/presentation/caliskan-islam
X. Meng. 2016. Fine-grained Binary Code Authorship Identification. In Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering (Seattle, WA, USA) (FSE 2016). ACM, New York, NY, USA, 1097–1099. https://doi.org/10.1145/2950290.2983962
N. Rosenblum, X. Zhu, and B. P. Miller. 2011. Who Wrote This Code? Identifying the Authors of Program Binaries. In Proceedings of the 16th European Conference on Research in Computer Security (Leuven, Belgium) (ESORICS’11). Springer-Verlag, Berlin, Heidelberg, 172–189. http://dl.acm.org/citation.cfm?id=2041225.2041239