Volume 5, Issue 1 (March 2009)                   IJEEE 2009, 5(1): 1-12 | Back to browse issues page

XML Print


Abstract:   (19226 Views)
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in applications dealing with images, it is still in its infancy in speech processing. Age classification, on the other hand, is also concerned as a useful tool in different applications, like issuing different permission levels for different aging groups. This paper concentrates on a comparative study of gender and age classification algorithms applied to speech signal. Experimental results are reported for the Danish Emotional Speech database (DES) and English Language Speech Database for Speaker Recognition (ELSDSR). The Bayes classifier using sequential floating forward selection (SFFS) for feature selection, probabilistic Neural Networks (PNNs), support vector machines (SVMs), the K nearest neighbor (K-NN) and Gaussian mixture model (GMM), as different classifiers, are empirically compared in order to determine the best classifier for gender and age classification when speech signal is processed. It is proven that gender classification can be performed with an accuracy of 95% approximately using speech signal either from both genders or male and female separately. The accuracy for age classification is about 88%.
Full-Text [PDF 412 kb]   (6321 Downloads)    
Type of Study: Research Paper |
Received: 2009/03/10 | Accepted: 2009/03/10

Rights and permissions
Creative Commons License This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.