文本相似度
1. 编辑距离,集合相似度def get_jaccard_distance(seq1, seq2): "seq1 and seq2 are two sequences, return value 0 means equal, 1 means totally different" set1, set2 = set(seq1), set(seq2) return 1 - len(set1 & set2) / float(len(set1 | set2))def ge
原创
2022-02-18 17:08:30 ·
220 阅读 ·
0 评论