Near-field localization algorithm of multiple sound sources based on approximated kernel density estimator
Papers | Updated: 2024-06-05
Journal on Communications, Vol. 38, Issue 1, Pages 106-116 (2017)
Author affiliation:
School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, Jiangsu, China
Funding:
The National Natural Science Foundation of China (61171167); The National Natural Science Foundation of China (61401203); The Natural Science Foundation of Jiangsu Province (BK20130776)
Yu-zhuo FANG, Zhi-yong XU, Zhao ZHAO. Near-field localization algorithm of multiple sound sources based on approximated kernel density estimator[J]. Journal on Communications, 2017, 38(1): 106-116. DOI: 10.11959/j.issn.1000-436x.2017013.
For near-field localization of multiple sound sources in reverberant environments, an algorithm model based on an approximated kernel density estimator (KDE) was proposed. Multi-stage (MS) sub-band processing was introduced to effectively resolve the spatial aliasing caused by wide spacing. A spatial likelihood function (SLF) was built for multi-dimensional fusion using two operators, sum (S) and product (P). Four algorithms were then derived: S-KDE, P-KDE, S-KDEMS, and P-KDEMS. A comprehensive comparison of two statistical indicators, root mean square error (RMSE) and percentage of SLF (PSLF), the latter denoting recognition capability, confirmed P-KDEMS as a near-field localization algorithm for multiple sound sources with high robustness and recognition.
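The difference between the sum (S) and product (P) fusion operators can be illustrated with a toy sketch. This is not the paper's implementation: it uses a plain Gaussian KDE rather than the approximated one, a single source, and noise-free time differences of arrival (TDOAs). The microphone layout, source position, grid, bandwidth, and all function names are hypothetical choices made only for illustration.

```python
import itertools
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed room-temperature value

def expected_tdoa(points, mic_a, mic_b):
    """TDOA (seconds) each candidate point would produce at one microphone pair."""
    d_a = np.linalg.norm(points - mic_a, axis=1)
    d_b = np.linalg.norm(points - mic_b, axis=1)
    return (d_a - d_b) / SPEED_OF_SOUND

def pair_likelihood(points, mic_a, mic_b, tdoa_estimates, bandwidth):
    """Gaussian KDE of a pair's TDOA estimates, evaluated at each candidate's expected TDOA."""
    expected = expected_tdoa(points, mic_a, mic_b)           # shape (P,)
    diff = expected[:, None] - tdoa_estimates[None, :]       # shape (P, K)
    return np.exp(-0.5 * (diff / bandwidth) ** 2).mean(axis=1)

def fuse(pair_likelihoods, operator):
    """Combine per-pair likelihood maps into one SLF with the chosen operator."""
    stacked = np.stack(pair_likelihoods)
    if operator == "sum":
        return stacked.sum(axis=0)
    if operator == "prod":
        return stacked.prod(axis=0)
    raise ValueError(f"unknown operator: {operator}")

# Hypothetical toy setup: four microphones at the corners of a 4 m x 4 m area.
MICS = np.array([[0.0, 0.0], [4.0, 0.0], [4.0, 4.0], [0.0, 4.0]])
SOURCE = np.array([1.0, 2.5])
_axis = np.linspace(0.0, 4.0, 41)                            # 0.1 m grid step
GRID = np.array(list(itertools.product(_axis, _axis)))       # candidate positions

def locate(operator, bandwidth=5e-5):
    """Return the grid point that maximizes the fused SLF."""
    likelihoods = []
    for a, b in itertools.combinations(range(len(MICS)), 2):
        true_tdoa = expected_tdoa(SOURCE[None, :], MICS[a], MICS[b])[0]
        # Stand-in for measured TDOAs: the true value with small fixed offsets.
        estimates = true_tdoa + np.array([-1e-5, 0.0, 1e-5])
        likelihoods.append(
            pair_likelihood(GRID, MICS[a], MICS[b], estimates, bandwidth)
        )
    return GRID[np.argmax(fuse(likelihoods, operator))]

print("sum fusion peak: ", locate("sum"))
print("prod fusion peak:", locate("prod"))
```

In this idealized case both operators peak at the same grid point. The practical difference is in the shape of the fused map: the sum keeps a response wherever any single pair is consistent with a candidate, while the product suppresses candidates that any one pair rejects, which gives sharper peaks but is less forgiving of an outlier pair.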