[1]曾毓敏.基于倒谱修正模型的语音基音检测算法[J].南京理工大学学报(自然科学版),2007,(04):503-508.
 ZENG Yu-min,WU Zhen-yang.Speech Pitch Detection Algorithm Based on Modified Cepstrum Model[J].Journal of Nanjing University of Science and Technology,2007,(04):503-508.
点击复制

基于倒谱修正模型的语音基音检测算法
分享到:

《南京理工大学学报》(自然科学版)[ISSN:1005-9830/CN:32-1397/N]

卷:
期数:
2007年04期
页码:
503-508
栏目:
出版日期:
2007-08-30

文章信息/Info

Title:
Speech Pitch Detection Algorithm Based on Modified Cepstrum Model
作者:
曾毓敏1 2 吴镇扬1
1. 东南大学信息科学与工程学院, 江苏南京210096; 2. 南京师范大学物理科学与技术学院, 江苏南京210097
Author(s):
ZENG Yu-min12WU Zhen-yang1
1.School of Information Science and Engineering,Southeast University,Nanjing 210096,China;2.School of Physics and Technology,Nanjing Normal University,Nanjing 210097,China
关键词:
基音 倒谱 线性预测编码 预测残差
Keywords:
pitch cepstrum linear pred ictive cod ing pred ictive pesidua l
分类号:
TN912.3
摘要:
该文提出了一种基于修正倒谱模型的改进的倒谱基音检测算法。该算法首先对分帧语音进行10阶线性预测编码(LPC)分析和逆滤波,获得LPC预测残差;然后对残差信号进行倒谱分析,倒谱分析中采用了离散傅里叶变换频谱的高频分量置零的计算措施;最后根据倒谱的特征求得浊音语音的基音周期。仿真检测结果表明:该算法无论对纯净语音,还是对不同加噪情况下的含噪语音,其基音检测结果都明显优于传统倒谱基音检测算法,并且也明显优于基于平均幅度差函数的基音检测算法,而略优于基于自相关函数的基音检测算法。
Abstract:
An improved speech pitch detect ion a lgorithm based on mod if ied cepstrum model is proposed. In the proposed algorithm, a ten-order LPC ( linear predictive coding ) analysis is performed on a segmented speech, and the segmented speech is f iltered by the inverse filter to g ive theLPC pred ict ive residua.l The cepstrum of the pred ict ive residual is calcu lated w ith the simp le method of the h igh frequency spectral components of DFT being set to zero. The pitch period o f the vo iced speech is extracted from the cepstrum of pred ict ive residua.l The simu lated p itch detection results show that the pitch extract ion error of the proposed a lgorithm is significantly low er than that of the conventional cepstrum based a lgorithm bo th for clean speech and d ifferent no isy speech. The performance o f the proposed algorithm is alsomuch better than that of the average magn itude difference funct ion based p itch detection algorithm and slightly better than that of the basic autocorrelat ion function based algorithm.

参考文献/References:

[ 1] Rab ine r L, ChengM, RosenbergA, et a .l A com pa rative perform ance study o f several pitch detection a lgorithm s [ J] . IEEE Trans on Acoustics, Speech, and S ignal Processing, 1976, 24 ( 5): 399- 417.
[ 2] No llA M. Cepstrum pitch dete rm ina tion [ J] . Journa l o f the Acoustic Soc ie ty of Ame rica, 1967, 41 ( 2 ): 293- 309.
[ 3] Kadambe S, Boudreaux-Barte ls G. App lication o f the w avelet transform for p itch detection of speech signals [ J] . IEEE Trans on Inform ation Theo ry, 1992, 38 ( 2): 917- 924.
[ 4] H uang D, L inW, Raha rdja S. Speech p itch de tection in no isy env ironm ent us ing m ult-i rate adap tive lo ssless FIR filters [ A ]. Proceed ing s of the Inte rna tiona l Symposium on C ircu its and Systems 2004 ( ISCAS . 04) [ C ]. [ S. .l ]: IEEE, 2004. Ⅲ - 429- 432.
[ 5] Xu X, M iyanag aY. A robust pitch de tection in no isy speech w ith band-pass filtering on m odulation spec tra [ A]. Proceed ings of In ternational Sym po sium on Communications and Inform ation T echno logy ( ISC IT2005) [ C] . [ S. .l ]: IEEE, 2005. 266- 269.
[ 6] Ahm adi S, Span ias A. Cepstrum-based p itch de tection using a new statistica l V /UV c lassifica tion a lgor ithm [ J]. IEEE T rans on Speech and Audio Processing, 1999, 7 ( 3): 333- 338.
[ 7] H odgson L, Je rniganM, W ills B. Nonlinearmu ltiplicative cepstra l ana lysis for pitch extraction in speech [ A] . Proceed ings o f Internationa l Con ference on Acoustics, Speech, and S ignal Pro cessing ( ICASSP-90) [ C] . A lbuquerque, USA: IEEE, 1990. 257- 260.
[ 8] Nadeu C, Pascua l J, H ernando J. P itch de term ina tion using the cepstrum of the one- sided autoco rre lation sequence [ A]. Pro ceedings o f Interna tiona l Con ference on Acoustics, Speech, and S ignal Process ing ( ICASSP- 91) [ C ]. Toronto, Canada: IEEE, 1991. 3 677- 3 680.
[ 9] Andrew sM, Picone J, Deg roat R. Robust p itch de term ina tion v ia SVD based cepstra l methods [ A ]. Proceedings o f International Conference on Acoustics, Speech, and S igna l Processing ( ICASSP-90) [ C]. A-l buquerque, USA: IEEE, 1990. 253- 256.
[ 10] 杨行峻, 迟惠生, 李爱军, 等. 语音信号数字处理 [M ]. 北京: 电子工业出版社, 2000. 54- 62.
[ 11] Ve rhelstW, Steenhaut O. A new m ode l for the shorttim e comp lex cepstrum of vo iced speech [ J] . IEEE T rans on Acoustics, Speech, and S igna l Processing, 1986, 34 ( 1): 43- 51.
[ 12] SPIB. No iseX92 no ise database [ EB /OL]. H ttp: / / spib. rice. edu /spib / select_noise. htm ,l 2002- 11- 15.

相似文献/References:

[1]梁国龙,张 瑶,付 进.基于自适应滤波与倒谱联合分析的时延估计方法[J].南京理工大学学报(自然科学版),2014,38(01):147.
 Liang Guolong,Zhang Yao,Fu Jin.Time delay estimation based on adaptive filtering and cepstrum analysis method[J].Journal of Nanjing University of Science and Technology,2014,38(04):147.

备注/Memo

备注/Memo:
基金项目: 国家973计划项目( 2002CB312102); 江苏省高校自然科学基础研究项目( 07KJD510110)
作者简介: 曾毓敏( 1962- ), 男, 副教授, 博士生, 主要研究方向: 语音与音频信号处理, E-m a il:zengyum in@ n jnu.edu. cn;
通讯作者: 吴镇扬( 1949- ), 男, 教授, 博士生导师, 主要研究方向: 视觉与听觉信号处理, 通信信号处理, E-m ail:zhenyang@ seu. edu. cn。
更新日期/Last Update: 2007-08-30