Hash Trees

lewutian

于 2009-08-10 15:23:00 发布

阅读量568

点赞数

分类专栏： Algorithm 文章标签： traversal tree branch c

Algorithm 专栏收录该内容

115 篇文章

订阅专栏

An Hash tree stores all candidate k-itemsets and their counts. The root is empty and its children are the frequent 1-itemsets. Any node at depth = k will denote and frequent k-itemset. An example for an hash tree for C₂ = 12, 13, 15, 23, 25, 35 is shown below

An internal node v at level m contains, bucket pointers. These tell which branch is the next one to be traversed. The hash of the m^thitem is used to decide this.

Join step using Hash Tree

Only the frequent k-1 itemsets, who have common parents should be considered for the joining step. So checking all k-1 itemsets in L_k-1is avoided.

Prune step using Hash Tree

To determine if a k-1 itemset is frequent, we have to look only for those itemsets who have common parents, and thus avoid going through all k-1 itemsets in L_k-1.

Added advantages of Hash Trees

Enumeration is replaced by tree traversal, i.e. there is no need to enumerate all k-subsets of transactions. Such considerations are limit by tree traversal.