Author(s): Swati D. Bhutekar | M. B. Chandak
Journal: International Journal of Computer Applications
ISSN 0975-8887
Volume: ncipet;
Issue: 14;
Date: 2012;
Keywords: Extraction of Emotion in Speech | Database | Recording
ABSTRACT
This paper describes the factors involved in designing and recording large speech databases for applications that require speech synthesis. Given the growing demand for customized and domain-specific voices in corpus-based synthesis systems, good practices should be established for creating these databases, which are a key factor in the quality of the resulting speech synthesizer. The paper focuses on the factors affecting the design of the recording prompts, the speaker selection procedure, the recording setup, and the quality control of the resulting database. One way to find the emotion in speech is as follows: once the speech has been recorded from the user, it is converted into text; at the same time, the stressed word in the speech is recorded, and the frequency of that word is then determined in order to record the corresponding emotion.
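The last step of the abstract (locating the stressed word and finding its frequency) can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state: word boundaries are taken as already available from the speech-to-text step, the "stressed" word is approximated as the word with the highest short-time energy, and "frequency" is read as the fundamental frequency, estimated here by a simple autocorrelation peak. The function names and the 75 to 400 Hz search range are hypothetical choices for illustration, and the mapping from frequency to an emotion label is left out because the abstract does not specify it.

```python
import math


def short_time_energy(samples):
    """Mean squared amplitude of one word segment."""
    return sum(s * s for s in samples) / len(samples)


def estimate_pitch_hz(samples, sample_rate, fmin=75.0, fmax=400.0):
    """Estimate the fundamental frequency from the autocorrelation peak.

    fmin and fmax are assumed bounds for speech pitch, not values
    taken from the paper.
    """
    min_lag = max(1, int(sample_rate / fmax))
    max_lag = min(int(sample_rate / fmin), len(samples) - 1)
    best_lag, best_score = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        score = sum(samples[i] * samples[i + lag]
                    for i in range(len(samples) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag else None


def stressed_word_pitch(word_segments, sample_rate):
    """Pick the highest-energy word (assumed to be the stressed one)
    and return (word, estimated pitch in Hz).

    word_segments is a list of (word, samples) pairs, assumed to come
    from a recognizer that also reports word boundaries.
    """
    word, samples = max(word_segments,
                        key=lambda ws: short_time_energy(ws[1]))
    return word, estimate_pitch_hz(samples, sample_rate)


def synth_tone(freq_hz, amplitude, sample_rate, n_samples):
    """Synthetic sine tone standing in for a recorded word (demo data only)."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n_samples)]


if __name__ == "__main__":
    rate = 8000
    segments = [
        ("this", synth_tone(120.0, 0.2, rate, 800)),
        ("great", synth_tone(200.0, 0.9, rate, 800)),
    ]
    print(stressed_word_pitch(segments, rate))
```

Real recordings would need framing, voicing detection, and a more robust pitch tracker than a single autocorrelation peak; the sketch only shows the order of the steps.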