电子学报 ›› 2013, Vol. 41 ›› Issue (5): 966-976.DOI: 10.3969/j.issn.0372-2112.2013.05.021

• 综述评论 • 上一篇    下一篇

不确定数据查询处理

蒋涛1, 高云君2, 张彬1, 周傲英3, 乐光学1   

  1. 1. 嘉兴学院数理与信息工程学院,浙江嘉兴 314001;
    2. 浙江大学计算机学院,浙江杭州310027;
    3. 华东师范大学软件学院,上海 200062
  • 收稿日期:2012-07-25 修回日期:2013-03-18 出版日期:2013-05-25 发布日期:2013-05-25
  • 通讯作者: 高云君男,1977年生,博士,副研究员.研究方向为空间数据库、GIS系统、Skyline查询. E-mail:gaoyj@zju.edu.cn
  • 作者简介:蒋 涛 男,1973年生于湖南衡阳,博士,讲师.研究方向为时空数据库查询、Skyline计算、不确定数据处理.
  • 基金资助:
    国家自然科学基金(No.61003049);浙江省自然科学基金(No.LY12F02047,No.LY12F02019);浙江省公益性技术应用研究计划(No.2011C23130);中央高校基本科研业务费专项资金(No.2010QNA5051,No.2012QNA5018);浙江大学紫金计划重点项目;嘉兴市科技计划基金(No.2011AY1005);浙江省优秀青年教师项目(No.70611011);嘉兴学院博士启动项目(No.70510010)

Query Processing on Uncertain Data

JIANG Tao1, GAO Yun-jun2, ZHANG Bin1, ZHOU Ao-ying3, YUE Guang-xue1   

  1. 1. College of Mathematics and Information Engineering,Jiaxing University.Jiaxing,Zhejiang 314001,China;
    2. College of Computer Science and Technology,Zhejiang University.Hangzhou,Zhejiang 310027,China;
    3. Shanghai Key Laboratory of Trustworthy Computing,Software Engineering Institute,East China Normal University.Shanghai 200062,China
  • Received:2012-07-25 Revised:2013-03-18 Online:2013-05-25 Published:2013-05-25

摘要: 数据的不确定性在现实世界中的经济、军事、物流、金融、电信等领域普遍存在.不确定数据广泛应用于环境维护、市场分析、基于位置的服务LBS以及数量经济研究等应用.由于这些应用的重要性以及收集和累积的不确定数据数量的快速增长,查询这些数据已经成为一个重要的任务,并日益受到广大数据库研究者的关注.本文介绍了不确定数据查询的基本原理,并对不确定数据的近邻查询、逆向近邻查询、排序查询、Top-k查询以及连接查询进行了详细的讨论.同时对这些技术的优缺点进行了分析、对比.最后给出了未来的研究方向.

关键词: 不确定数据, 近邻, 逆向近邻, 连接, 查询处理

Abstract: Data uncertainty is pervasive in various fields,for example,economy,military,logistic,finance and telecommunication,etc.Uncertain data are inherent in some important applications,such as environmental surveillance,market analysis,Location-Based Service(LBS),and quantitative economics research.Due to the importance of those applications and the rapidly increasing amount of uncertain data collected and accumulated,querying large collections of uncertain data has become an important task and has received more and more attention from the database community in recent years.This paper introduces the principle of uncertain data query,and surveys the advance of the research on uncertain data query processing,including Nearest Neighbor(NN)query,Reverse Nearest Neighbor(RNN)query,Ranking query,top-k query and join query.By a detailed comparison,the pros and cons of the techniques are discussed.In the end,the problems in current research and some future research issues are outlined.

Key words: uncertain data, nearest neighbor, reverse nearest neighbor, join, query processing

中图分类号: