[1] YUAN Jia-bin, PU Hai-chen. E-mail Information Classifier of Neural Network Based on Genetic Algorithm Optimization [J]. Journal of Nanjing University of Science and Technology (Natural Science), 2008, (01): 78-82.

E-mail Information Classifier of Neural Network Based on Genetic Algorithm Optimization

Journal of Nanjing University of Science and Technology (Natural Science) [ISSN: 1005-9830 / CN: 32-1397/N]

Volume:
Issue:
2008, No. 01
Pages:
78-82
Column:
Publication date:
2008-02-28

Article Info

Title:
E-mail Information Classifier of Neural Network Based on Genetic Algorithm Optimization
Author(s):
YUAN Jia-bin (袁家斌); PU Hai-chen (浦海晨)
College of Information Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, Jiangsu, China
Keywords:
e-mail classifier; feature selection; genetic algorithm; artificial neural network
CLC number:
TP183;TP393.098
Abstract:
In the context of anti-spam research, this paper analyzes the feature selection methods used in preprocessing e-mail information and the ways machine learning can be applied to e-mail information classifiers. Because the feature vectors of e-mail messages are very large, the GA-CHI feature selection method is proposed as a preprocessing step that transforms complex e-mail information into a form that machine learning can handle easily. Starting from an e-mail information classifier based on a BP neural network, a genetic algorithm is used to optimize the neural network classifier in order to further improve the classification of Chinese e-mail. Experimental analysis of the system shows that the proposed method classifies e-mail information effectively.
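
The abstract names CHI-based feature selection but this page gives no details of the GA-CHI method itself. As a rough illustration only, the following sketch computes the standard chi-square (CHI) statistic for term/class association and ranks terms by it. The genetic-algorithm stage is not shown, and the function names `chi_square` and `rank_terms`, the toy messages, and the labels are all hypothetical, not taken from the paper.

```python
def chi_square(a, b, c, d):
    """Standard CHI statistic from a 2x2 term/class contingency table.

    a: spam messages containing the term
    b: non-spam messages containing the term
    c: spam messages without the term
    d: non-spam messages without the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # Degenerate table (term in all or no messages, or one class empty).
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def rank_terms(docs, labels, top_k):
    """Return the top_k terms by CHI score (ties broken alphabetically).

    docs: list of sets of terms; labels: 1 for spam, 0 for non-spam.
    """
    vocab = sorted({term for doc in docs for term in doc})
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    scores = {}
    for term in vocab:
        a = sum(1 for doc, y in zip(docs, labels) if y == 1 and term in doc)
        b = sum(1 for doc, y in zip(docs, labels) if y == 0 and term in doc)
        scores[term] = chi_square(a, b, n_pos - a, n_neg - b)
    return sorted(vocab, key=lambda t: (-scores[t], t))[:top_k]


# Toy data (hypothetical): two spam and two non-spam messages.
docs = [{"free", "offer"}, {"free", "win"}, {"meeting", "report"}, {"report", "free"}]
labels = [1, 1, 0, 0]
print(rank_terms(docs, labels, 2))
```

Keeping only the highest-scoring terms shrinks the feature vector before it reaches a classifier; how the paper combines this score with a genetic algorithm is not stated on this page.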

References:

[1] Mitchell T. Machine Learning [M]. Beijing: China Machine Press, 2003.
[2] Kwak N, Choi C. Input feature selection for classification problems [J]. IEEE Trans Neural Networks, 2002, 13(1): 221-230.
[3] Liu H, Yu L. Toward integrating feature selection algorithms for classification and clustering [J]. IEEE Trans on Knowledge and Data Engineering, 2005, 17(3): 1-12.
[4] Yang Y M, Liu X. A re-examination of text categorization methods [J]. Proceedings of SIGIR-99, 22nd ACM International Conference on Research and Development in Information Retrieval, Berkeley, US: 1999, 42-49.
[5] Mladenic D, Grobelnik M. Feature selection on hierarchy of web documents [J]. Decision Support Systems, 2003, 35: 452-475.
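[6] Haykin S. Neural Network Principles (神经网络原理) [M]. Translated by Ye Shiwei, Shi Zhongzhi. Beijing: China Machine Press, 2004.
[7] Zhao Yun, Liu Weiyi. Feature selection method based on genetic algorithm [J]. Computer Engineering and Applications, 2005, 15: 52-54.
[8] Zhang Jinping, Liu Jie, Li Yungong. A novel genetic algorithm with dynamic population and asymmetric crossover [J]. Journal of Nanjing University of Science and Technology, 2007, 31(4): 444-448.
[9] Liu Ying, Gu Yanfeng, Zhang Ye. Feature selection method for hyperspectral images based on an improved genetic algorithm [J]. Journal of Harbin Institute of Technology, 2005, 37(6): 733-735.
[10] Du Fuyin, Xu Yang. Predictive fuzzy control based on recurrent neural networks [J]. Journal of Southwest Jiaotong University, 2006, 41(6): 733-736.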
[11] Haber R E, Alique J R, Alique A, et al. Controlling a complex electromechanical process on the basis of a neurofuzzy approach [J]. Future Generation Computer Systems, 2005, 21(7): 1083-1095.

Similar articles:

[1] Zhao Haitao, Jin Zhong. An Improved Optimal Discriminant Plane [J]. Journal of Nanjing University of Science and Technology, 2000, (01): 88.
[2] Huang Wei, Chen Hao, Guo Yajuan, et al. Mobile malware detection approach using ensemble classification [J]. Journal of Nanjing University of Science and Technology, 2016, 40(01): 35.
[3] Wang Zhanhong. Network intrusion detection by using combination optimizing features and classifier parameters [J]. Journal of Nanjing University of Science and Technology, 2017, 41(01): 59. [doi:10.14177/j.cnki.32-1397n.2017.41.01.008]
[4] Zhang Qianjin, Wang Huadong. Speech emotion recognition model based on kernel canonical correlation analysis and support vector machine [J]. Journal of Nanjing University of Science and Technology, 2017, 41(02): 191. [doi:10.14177/j.cnki.32-1397n.2017.41.02.009]
[5] Zhang Jiahuan, Li Leijun, Li Meizheng, et al. Research on multi-label subspace based on variable precision neighborhood rough sets [J]. Journal of Nanjing University of Science and Technology, 2019, 43(04): 414. [doi:10.14177/j.cnki.32-1397n.2019.43.04.006]
[6] Chen Hong, Ma Yingcang, Yang Xiaofei, et al. Least squares multi-label feature selection algorithm with label information [J]. Journal of Nanjing University of Science and Technology, 2019, 43(04): 423. [doi:10.14177/j.cnki.32-1397n.2019.43.04.007]

Memo

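Funding: National "863" Program (2005AA103). Author biography: YUAN Jia-bin (b. 1968), male, associate professor, master's supervisor, postdoctoral researcher; main research interest: information security. E-mail: jbyuan@nuaa.edu.cn.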
Last Update: 2012-12-05