Survey on Phoneme Recognition using Support Vector Machine

Fathima Nazarath P. A, Adi Shankara Institute of Engineering and Technology

Phoneme Recognition, Mel Frequency Cepstral Coefficient (MFCC), Support Vector Machine (SVM)

Automatic Speech Recognition (ASR) is a process in which speech signal is converted into a sequence of words, other linguistic units by making use of an algorithm which is implemented as a computer program. The speech recognition system would support many valuable applications that require human interaction with machine. The major objective with which ASR works is the development of the techniques and a system that enables the computers to recognize speech as input. Most precisely speech recognition means phoneme recognition. Good phonetic decoding leads to good word decoding, and the ability to recognize the English phones accurately will undoubtedly provide the basis for an accurate word recognizer. In this work, a detailed survey on a classification technique called support vector machine (SVM) is carried out. Linear SVM are mainly used where linear separation between two classes is possible and in nonlinear SVM’s kernel functions are used for conversion purpose. Among two types of SVM classifier soft margin SVM is used to improve the performance when error occurs and least square SVM is used for large scale classification.
    [1] Eray, Osman, Sezai Tokat, and Serdar Iplikci. "An application of speech recognition with support vector machines." 2018 6th International Symposium on Digital Forensic and Security (ISDFS). IEEE, 2018. [2] Aida-zade, Kamil, Anar Xocayev, and Samir Rustamov. "Speech recognition using support vector machines." 2016 IEEE 10th International Conference on Application of Information and Communication Technologies (AICT). IEEE, 2016. [3] Chen, JinBiao, and Shiqing Zhang. "Manifold learning-based phoneme recognition." 2009 International Conference on Image Analysis and Signal Processing. IEEE, 2009. [4] TombaloĞlu, Burak, and Hamit Erdem. "A SVM based speech to text converter for Turkish language." 2017 25th Signal Processing and Communications Applications Conference (SIU). IEEE, 2017. [5] Sonkamble, Balwant A., and D. D. Doye. "An overview of speech recognition system based on the support vector machines." 2008 International Conference on Computer and Communication Engineering. IEEE, 2008. [6] Fletcher, Tristan. "Support vector machines explained." Tutorial paper (2009). [7] Suykens, Johan AK, and Joos Vandewalle. "Least squares support vector machine classifiers." Neural processing letters9.3 (1999): 293-300. [8] Theodoridis, Sergios, and Konstantinos Koutroumbas. "Pattern recognition and neural networks." Advanced Course on Artificial Intelligence. Springer, Berlin, Heidelberg, 1999. [9] Solera-Urena, Rubén, et al. "Real-time robust automatic speech recognition using compact support vector machines." IEEE Transactions on Audio, Speech, and Language Processing 20.4 (2012): 1347-1361. [10] Cui, Jianguo, et al. "The application of support vector machine in pattern recognition." 2007 IEEE International Conference on Control and Automation. IEEE, 2007. [11] Salomon, Jesper. "Support vector machines for phoneme classification." Master of Science, School of Artificial Intelligence, Division of Informatics, University of Edinburgh (2001). [12] Cutajar, Michelle, et al. "Discrete wavelet transforms with multiclass SVM for phoneme recognition." Eurocon 2013. IEEE, 2013. [13] Mporas, Iosif, et al. "Recognition of greek phonemes using support vector machines." Hellenic Conference on Artificial Intelligence. Springer, Berlin, Heidelberg, 2006.
Paper ID: GRDCF013043
Published in: Conference : National Conference on Emerging Research Trend in Electrical and Electronics Engineering (ERTE’19)
Page(s): 187 - 192