Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification
|更新时间:2024-10-14
|
Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification
Issue 11, (2005)
作者机构:
1. 南京邮电大学通信与信息工程学院
2. 南京邮电大学通信与信息工程学院,江苏,南京,210003
作者简介:
基金信息:
DOI:
CLC:TN912.3
Published:2005
稿件说明:
移动端阅览
ZHANG Ling-hua, YANG Zhen, ZHENG Bao-yu. Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification[J]. 2005, (11).
DOI:
ZHANG Ling-hua, YANG Zhen, ZHENG Bao-yu. Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification[J]. 2005, (11).DOI:
Hybrid architecture based on fuzzy classifier and multiplayer feed-forward neural network for speaker identification
摘要
提出了基于模糊超椭球聚类算法的说话人辨认新方法。该算法首先将某一类的训练数据分成若干子类
对每一子类在其中心周围定义具有超椭球区域的模糊规则。实验表明
该系统可以较快的聚类速度取得与HMM 相当的识别效果。进一步的研究表明
基于模糊超椭球聚类算法的说话人辨认系统与传统的基于HMM的识别方法存在一个共同的缺点
即抗噪性能较差。为此
通过引入多层前馈神经网络(MLFNN)与模糊超椭球分类器构成混合模型
使系统的识别性能和抗噪能力显著提高。
Abstract
A novel method for speaker identification was proposed which was based on a fuzzy classifier with hyperellipsoidal regions. First
the training data for each class were divided into several clusters. Then
for each cluster
a fuzzy rule with a hyperellipsoidal region was defined around a cluster center. The evaluation experiments had been conducted to compare the fuzzy hyperellipsoidal classifier with the HMM. It was found that the former classifier can achieve a comparable speaker identification performance to the latter with higher clustering speed. Further research showed that both fuzzy hyperellipsoidal classifier and the HMM worsened the recognition ability when the test data contained noise. To overcome this problem
a hybrid architecture based on fuzzy classifier and multilayer feed-forward neural network (MLFNN) was developed for speaker recognition. The experimental results showed that the new method can achieve a much better identification performance and robustness to the additive noise.