[1]崔诗程,李千目,戈 峰.基于Lucene的全文检索架构设计[J].南京理工大学学报(自然科学版),2015,39(06):692.
 Cui Shicheng,Li Qianmu,Ge Feng.Full-text search architecture design based on Lucene[J].Journal of Nanjing University of Science and Technology,2015,39(06):692.
点击复制

基于Lucene的全文检索架构设计
分享到:

《南京理工大学学报》(自然科学版)[ISSN:1005-9830/CN:32-1397/N]

卷:
39卷
期数:
2015年06期
页码:
692
栏目:
出版日期:
2015-12-31

文章信息/Info

Title:
Full-text search architecture design based on Lucene
作者:
崔诗程1李千目1戈 峰2
1.南京理工大学 计算机科学与工程学院,江苏 南京 210094;
2.南京信息技术研究院 计算技术研究所,江苏 南京 210036
Author(s):
Cui Shicheng1Li Qianmu1Ge Feng2
1.School of Computer Science and Engineering,NUST,Nanjing 210094,China;
2.Institute of Computation Technology,Nanjing Information Technology Academe,Nanjing 210036,China
关键词:
全文检索 分布式并行计算 子节点服务器 根节点服务器
Keywords:
full-text search distributed parallel computing child-node servers root-node servers
分类号:
TP391.3
摘要:
为在海量数据中快速定位所需信息,解决因数据结构化、半结构化差异造成的检索困难,该文提出了一种基于Lucene的全文检索架构。根据分布式并行计算的设计原理,将检索任务分发给每个子节点服务器并行完成检索工作,最终由根节点服务器汇总结果。子节点服务器也采用了并行化的设计理念。验证性实验显示该文基于Lucene的全文检索架构与传统全文检索架构相比检索耗时降低55%以上。
Abstract:
In order to locate needed information in massive data and solve the search problem caused by the difference between structured and unstructured data,a full-text search architecture based on Lucene is proposed here.According to the design principle of the distributed parallel computing,the search tasks are dispatched to every child-node server,and the root-node server took responsibility for gathering results.Every child-node server adopts the design concept of parallel.Verification experiments show that compared with the traditional full-text search architecture,the search consuming time of the full-text search architecture based on Lucene proposed here decreases by 55% at least.

参考文献/References:

[1] 谭文堂,贺明科,李阜.基于Lucene.Net的分布式全文检索系统[J].计算机应用与软件,2009,26(9):142-145.
Tan Wentang,He Mingke,Li Fu.Distributed full-text search system based on Lucene.Net[J].Computer Applications and Software,2009,26(9):142-145.
[2]张丽霞.基于Lucene的全文检索系统设计与实现[D].武汉:华中科技大学计算机学院,2013.
[3]王莉云,王华,陈刚,等.基于Lucene的全文检索系统的设计与实现[J].计算机工程与设计,2007,28(24):9-11.
Wangliyun,Wang hua,Chen gang,et al.Design and implementation of full text search engine based on Lucene[J].Computer Engineering and Design,2007,28(24):9-11.
[4]郭永利,卢颖颖.基于Lucene对文件全文检索的研究与应用[J].微型电脑应用,2014,30(1):51-54.
Guo Yongli,Lu Yingying.Research and application of full-text retrieval technology for document based on Lucene[J].Microcomputer Applications,2014,30(1):51-54.
[5]李永春,丁华福.Lucene的全文检索的研究与应用[J].计算机技术与发展,2010,20(2):12-15.
Li Yongchun,Ding Huafu.Research and application of full text search based on Lucene[J].Computer Technology and Development,2010,20(2):12-15.
[6] Li Shengdong,Lv Xueqiang,Ling Feng,et al.Study on efficiency of full-text retrieval based on Lucene[A].International Conference on Information Engineering and Computer Science(ICIECS 2009)[C].Wuhan:IEEE,2009:1-4.
[7]Zhao Wei.The design and research of literary retrieval system based on Lucene[A].2011 International Conference on Electronic and Mechanical Engineering and Information Technology(EMEIT)[C].Harbin:IEEE,2011:4146-4148.
[8] Huang Hua,Gao Shu,Shao Chaojie.Distributed search engine design and implementation based on Lucene[A].2010 International Conference on Computer Design and Applications(ICCDA)[C].Qinhuangdao:IEEE,2010:25-27.
[9]Zhang Yong,Li Jianlin.Research and improvement of search engine based on Lucene[A].2009 International Conference on Intelligent Human-Machine Systems and Cybernetics(IHMSC'09)[C].Hangzhou:IEEE,2009:270-273.
[10]宋佳,诸云强,刘润达.一种基于Lucene改进的全文检索工具包[J].计算机工程与应用,2008,44(4):172-175.
Song Jia,Zhu Yunqiang,Liu Runda.Enhanced full text retrieval kit based on Lucene[J].Computer Engineering and Applications,2008,44(4):172-175.
[11]Li Bo,Zhang Jingjie,Chen Mingyu,et al.DIFTSAS:A distributed full text search and analysis system for big data[A].2013 IEEE 16th International Conference on Computational Science and Engineering(CSE 2013)[C].Sydney,Australia:IEEE,2013:1303-1309.
[12]Sun Lincheng.A large-scale full-text search engine using dot Luence[A].2011 3ed IEEE International Conference on Communication Software and Networks(ICCSN)[C].Xi'an:IEEE,2011:793-795.
[13]Zhang Hongbin,Liu Juefu.Search engine design based on Web service and Lucene[A].2009 Wase Interna-tional Conference on Information Engineering(ICIE 2009,VOL II)[C].Taiyuan:IEEE,2009:458-461.

备注/Memo

备注/Memo:
收稿日期:2015-05-14 修回日期:2015-06-19
基金项目:国家自然科学基金(61272419); 江苏省未来网络前瞻性研究项目(BY2013095-3-02); 江苏省产学研前瞻性项目(BY2014089; BY2013039; BY2013037); 连云港国际合作项目(CH1304)
作者简介:崔诗程(1992-),男,硕士生,主要研究方向:全文检索、数据挖掘,E-mail:shicheng.cui@foxmail.com。
引文格式:崔诗程,李千目,戈峰.基于Lucene的全文检索架构设计[J].南京理工大学学报,2015,39(6):692-697.
投稿网址:http://zrxuebao.njust.edu.cn
DOI:10.14177/j.cnki.32-1397n.2015.39.06.010
更新日期/Last Update: 2015-12-31