[1] Zhang Jun, Wang Yongli. Efficient storage and query method for multidimensional uncertain sensor data[J]. Journal of Nanjing University of Science and Technology (Natural Science Edition), 2014, 38(06): 750.

Efficient storage and query method for multidimensional uncertain sensor data

Journal of Nanjing University of Science and Technology (Natural Science Edition) [ISSN: 1005-9830 / CN: 32-1397/N]

Volume: 38
Issue: 2014, No. 06
Page: 750
Publication date: 2014-12-31

Article Info

Title:
Efficient storage and query method for multidimensional uncertain sensor data
Author(s):
Zhang Jun1, Wang Yongli2
1. Jiangsu Police Institute, Nanjing 210031, China; 2. School of Computer Science and Engineering, Nanjing University of Science and Technology (NUST), Nanjing 210094, China
Keywords:
multidimensional sensors; data; storage; query; multidimensional array-tree; Bayesian network; graph data structure; probabilistic graphical model; real data sets; synthetic data sets
CLC number:
TP311.13
Abstract:
Traditional database management technology cannot manage uncertain data efficiently. To address this problem, a multidimensional array B-tree (MB-tree) is designed. The MB-tree is a graph data structure based on a Bayesian network, which serves as the probabilistic graphical model for storing and querying uncertain data. The MB-tree models massive multidimensional sensor data and answers queries over it. The predictability and structural correlation of the multidimensional data are proved. The performance of the MB-tree is tested on real data sets and synthetic data sets, and the accuracy with which the MB-tree encodes the underlying joint distribution is verified. Compared with similar graphical models, query processing with the MB-tree is on average about 4 times as fast (an improvement of about 3 times).
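
The abstract does not give the MB-tree's construction, so the following is only a minimal sketch of the general idea it builds on: a Bayesian network stores a joint distribution over uncertain readings as small local conditional tables and answers probabilistic queries from them. The three binary variables (`temp`, `humidity`, `alarm`), the chain structure, and every probability value are hypothetical and chosen purely for illustration; this is not the authors' data structure or algorithm.

```python
from itertools import product

# Hypothetical chain-structured network: temp -> humidity -> alarm.
PARENTS = {"temp": (), "humidity": ("temp",), "alarm": ("humidity",)}
ORDER = ("temp", "humidity", "alarm")

# CPT[var][parent_values] = P(var = 1 | parents); values are made up.
CPT = {
    "temp": {(): 0.3},
    "humidity": {(0,): 0.2, (1,): 0.7},
    "alarm": {(0,): 0.1, (1,): 0.8},
}


def joint(assignment):
    """P(assignment) as the product of each variable's local conditional."""
    p = 1.0
    for var in ORDER:
        key = tuple(assignment[parent] for parent in PARENTS[var])
        p_one = CPT[var][key]
        p *= p_one if assignment[var] == 1 else 1.0 - p_one
    return p


def query(target, evidence):
    """P(target | evidence), summing the joint over unobserved variables."""

    def mass(fixed):
        free = [v for v in ORDER if v not in fixed]
        total = 0.0
        for values in product((0, 1), repeat=len(free)):
            total += joint({**fixed, **dict(zip(free, values))})
        return total

    return mass({**evidence, **target}) / mass(evidence)


def stored_parameters():
    """Number of probabilities the factored representation keeps."""
    return sum(len(table) for table in CPT.values())
```

For example, `query({"alarm": 1}, {"temp": 1})` evaluates to 0.7 × 0.8 + 0.3 × 0.1, which is about 0.59, and `stored_parameters()` returns 5, compared with the 7 free parameters of a full joint table over three binary variables. Brute-force enumeration is used here only for clarity; it grows exponentially with the number of variables and says nothing about how the MB-tree itself processes queries.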

Memo:
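Received: 2014-04-10; Revised: 2014-11-18
Funding: National Natural Science Foundation of China (61170035); China Postdoctoral Science Foundation Special Funded Project (200902517); Fundamental Research Funds for the Central Universities (30920130112006); Major Program of the Natural Science Foundation of Jiangsu Province (BK2011022); Natural Science Foundation of Jiangsu Province (BK2011702); Special Fund for Key Discipline Construction of Jiangsu Province (Public Security Technology)
About the authors: Zhang Jun (1973-), male, lecturer; research interests: information technology, database technology, and security and protection technology; E-mail: zhang-jun@jspi.cn. Corresponding author: Wang Yongli (1974-), male, Ph.D., associate professor; research interests: database technology, context awareness, Internet of Things data processing, and pattern recognition; E-mail: yongliwang@njust.edu.cn.
Citation format: Zhang Jun, Wang Yongli. Efficient storage and query method for multidimensional uncertain sensor data[J]. Journal of Nanjing University of Science and Technology, 2014, 38(6): 750-756.
Submission website: http://zrxuebao.njust.edu.cn
Last Update: 2014-12-31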