Academic Journals Database
Disseminating quality controlled scientific knowledge

Bit Rate Reduction of FS-1015 Speech Coder Using Fuzzy ARTMAP and KSOFM Neural Networks

Author(s): Mansour Sheikhan | Davood Gharavian | Ali Eslamzadeh

Journal: Majlesi Journal of Electrical Engineering
ISSN 2008-1413

Volume: 3;
Issue: 1;
Start page: 19;
Date: 2009;
Original page

Keywords: Speech coder | Neural networks | Vector quantization | Fuzzy ARTMAP.

The speech spectrum is very sensitive to linear predictive coding (LPC) parameters, so small quantization errors may cause unstable synthesis filter. Line spectral pairs (LSPs) are more efficient representations than LPC parameters. On the other hand, artificial neural networks (ANNs) have been used successfully to improving the quality and also reduction the computational complexity of speech coders. This work proposes an efficient technique to reduce the bit rate of FS-1015 speech coder, while improving the performance. In this way, LSP parameters are used instead of the LPC parameters. In addition, neural vector quantizers based on Kohonen self-organizing feature map (KSOFM), with a modified-supervised training algorithm, and fuzzy ARTMAP are also employed to reduce the bit rate. By using the mentioned neural vector quantizer models, the quality of synthesized speech, in terms of mean opinion score (MOS), is improved 0.13 and 0.26, respectively. The execution time of proposed models, as compared to FS-1015 standard, is also reduced 27% and 43%, respectively.
Why do you need a reservation system?      Save time & money - Smart Internet Solutions