宁波大学信息科学与工程学院,浙江,宁波,315211
网络出版:2020-01-25,
纸质出版:2020
移动端阅览
施炜杰, 董一鸿, 钱江波, 等. SCBT-index:基于谱编码的子图索引算法[J]. 电子学报, 2020,48(1):110-117.
SCBT-index: Subgraph Indexing Algorithm Based on Spectral Coding[J]. Acta Electronica Sinica, 2020, 48(1): 110-117.
施炜杰, 董一鸿, 钱江波, 等. SCBT-index:基于谱编码的子图索引算法[J]. 电子学报, 2020,48(1):110-117. DOI: 10.3969/j.issn.0372-2112.2020.01.013.
SCBT-index: Subgraph Indexing Algorithm Based on Spectral Coding[J]. Acta Electronica Sinica, 2020, 48(1): 110-117. DOI: 10.3969/j.issn.0372-2112.2020.01.013.
随着图模型规模的扩大,单机算法难以适应大规模数据集下的子图查询.而现有的分布式算法基于无索引的简单遍历,join过程容易出现内存溢出,而且查询图分布异常时易出现负载不均衡.提出了一种基于谱编码的二叉索引树(SCBT-index),首先对数据图中的顶点谱编码,根据编码信息构建二叉索引树.然后对查询图使用最小查询计划进行分解,最后join过程使用3个剪枝策略:基于拓扑结构的预剪枝、序列化join和基于分布式下的join优化.实验结果表明,SCBT-index在图集下的综合性能优于现有主流算法,单图下的查询时间为现有算法的1/2到1/4.
With the expansion of graph scale
single-machine algorithms can hardly adapt to the sub-graph queries in large-scale data sets. As existing distributed algorithms are based on simple traversal without index
the join process is prone to memory overflow in the distributed algorithms and load imbalance occurs when the query graph distribution is abnormal. Therefore
a binary index tree based on spectral coding named SCBT-index is proposed. Firstly
for vertex spectrum coding in the data graph
a binary index tree is constructed according to the coding information. Then
the query graph is decomposed using the minimum query plan. Finally
three pruning strategies are used in the join process: Structure matching based on topological structure
serialized join and the distributed join optimization. The experimental results show the comprehensive performance of SCBT-index under the graph set is better than that of the popular algorithms. In addition
the query time under the single graph is 1/2 to 1/4 of that of the existing algorithms.
0
浏览量
103
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621