The R*-tree: An Efficient Robust Access Method for Points and Rectangles

R*树是一种基于最小边界矩形(目录矩形)重叠区域的空间访问方法。它通过实验优化,旨在最小化内部节点中每个包围矩形的面积。该文批判了仅考虑面积最小化的旧方法,提出结合多种优化标准的工程方法。文中介绍了性能测试标准,重新评估了R树,并提出了插入优化、选择子树、R*树分裂和强制重新插入等原则,以改善检索性能和整体结构。尽管存在一些疑问,如点插入的处理,但R*树展现了优化多方面因素的潜力。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

 

4/8/10
The R*-tree: An Efficient Robust Access Method for Points and Rectangles
This paper elaborates further on the Guttman's R-tree and discuss several drawbacks of the Guttman's version. Then it put forward a new variant of R-tree: the R* tree which largely improve the performance of spatial access.
Summary of the paper:
  1. Summarize of the main property of R-tree
    Its spatial access method(SAM) is mainly based on overlapping regions by using minimum bound rectangles(directory rectangles).
    The R-tree is based on a heuristic optimization and the optimization criterion it persues is to minimize the area of each enclosing rectangle in the inner nodes.
  2. Be Critical about the old method
    It points out that the old method only consider one aspect in optimization that is to minimize area, instead the paper suggests we can incorporate various optimization criteria together and to adopt the engineering approach which work out the best solution by experiments.
  3. A standard performance test is introduced.
  4. Re-evaluate the R-tree
    Many facts contribute to good retrieval performance, but they may affect each other in a complex ways.
    It list 4 principles for optimizations: these principle may have negative effect on each other
    • Minimize area
    • Minimize overlapping
    • Minimize margin
    • Storage utilization should be optimized
  5. The core of optimization: Insertion
    The paper take insight into some insertion variants of the R-tree, elaborate the idea each of them takes and states some problems with these variants such as small seeds, biased to one group and ignorance of geometric properties.
    It introduce Greene's-Split which in my eyes, it makes another standard in evaluating the distance between two rectangles. Of course, it is still not perfect.
  6. The R* tree's solution
    • Choose Subtree
      Instead of focus on only one aspect, it takes three aspects into consideration. And it works out a optimized combination through great amount of experiments. I think it is a suggestive trial in doing research which consider several facts and use them according to different classification, in this case, the leaf and non-leaf.
      Since the quadratic cost for choosing subtree maybe undesirable in practice, a approximation algorithm is given.
    • Split of R* tree
      Again the splitting process takes area-value, margin-value and overlap-value into account.
    • Forced Reinsert
      This optimization is somewhat different from earlier approach. Considering the R-tree is comparatively static once the entry have been inserted, it cannot guarantee overall performance after thousands of insertion. Instead, a reconstruction of the tree is necessary.
      By utilizing forced reinsertion, it improves overall structure and require less splits.
  7. Performance of the R* tree
    Compare between variants of R-tree on different test cases. Since it is mainly about testing, I read through this part roughly.
  8. My Questions
  •  
    • What does quadratic query rectangles mean? To minimize margin will make the directory rectangles more quadratic, can't understand.
    • Why R* tree's implementation of choose subtree routine can reduce disc access? Disc access is equal to log(N)-1, R* tree implementation could not reduce the height of the tree I think.
    • How to insert a point into a R-tree? The point has no size and it will never cause enlargement of area, the algorithm seems doesn't work...
      So I can hardly understand the point access method(PAM) part in the paper.
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值