|
[1] L. Weigelt, S. Sadoff and J. Miller, “ Plosive/fricative distinction: the voiceless case,” J. Acoust. Soc. Am., vol. 87, no. 6, pp. 2729-2737, 1990. [2] S. B. Davis and P. Mermelstein, “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” in IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357366, 1980. [3] A. V. Oppenheim and R. W. Schafer, Discrete-time Signal Processing. Pearson Education, 2009. [4] H. Sakoe and S. Chiba, “Dynamic programming algorithm optimization for spoken word recognition,” IEEE Trans. Acoustics, Speech and Signal Processing, vol. 26, no. 1, pp. 43 49, 1978. [5] J. Makhoul, “Linear prediction: A tutorial review,” in Proc. IEEE, vol. 63 no. 5, pp. 561 580, 1975. [6] H. Kamata, H. Oka, and Y. Ishida, “Estimation of vocal tract transfer function considering the glottis open and close characteristics,” in Proc. IEEE Pacific Rim Conf. Commun. Comput. Signal Process., vol. 1, pp. 137-140, 1993. [7] J. Schroeter, “Techniques for estimating vocal-tract shapes from the speech signal,” IEEE Trans. Speech Audio Process, vol. 2, no. 1, pp. 133-150, 1994. [8] P. Ladefoged, A Course in Phonetics, 5th ed, Boston, MA: Thomson Wadsworth, 2006. [9] Florian Keiler, Daniel Arfib, and Udo Zölzer, “Efficient linear prediction for digital audio effects,” in Proc. Int. Conf. on Digital Audio Effects, Dec. 2000. [10] N. Levinson, “The Wiener RMS error criterion in filter design and prediction.” J. Math. Phys, vol. 25, pp. 261278, 1947. [11] P. Escudero, P. Boersma, A. S. Rauber, and R. A. Bion, “A cross-dialect acoustic description of vowels: Brazilian and European Portuguese,” J. Acoust. Soc. Am., vol. 126, no. 3, pp. 1379-1393, 2009. [12] F. Itakura and S. Saito, “Digital filtering techniques for speech analysis and synthesis,” in Proc. 7th Int. Conf. Acoust., 1971. [13] N. Anderson, “On the calculation of filter coefficients for maximum entropy spectral analysis,” IEEE Modern Spectral Analysis, New York, 1978. [14] Nayland College - Mathematics: Comparing Box plots. Available: http://maths.nayland.school.nz/Year_11/AS1.10_Multivar_data/11_Comparing_Boxplots.htm [15] J. A. Hanley, and B. J. McNeil, “The meaning and use of the area under a receiver operating characteristic (ROC) curve,” Radiology, vol. 143, no. 1, pp. 29-36, 1982. [16] M. A. Aizerman, “Theoretical foundations of the potential function method in pattern recognition learning,” Automation and Remote Control, vol. 25, pp. 821-837, 1964. [17] J. Mercer, “Functions of positive and negative type, and their connection with the theory of integral equations,” Philos. Trans. R. Soc. A Math. Phys. Eng. Sci., vol. 209, no. 441-458, pp. 415-446, Jan. 1909. [18] R. Fletcher, Practical Methods of Optimization; 2nd ed, Wiley- Interscience, 1987. [19] C. J. C. Burges, “A tutorial on support vector machines for pattern recognition,” Data Min. Knowl. Discov., vol. 2, no. 2, pp. 121-167, Jun. 1998. [20] A. J. Smola and B. Schölkopf, “A tutorial on support vector regression,” Statistics and Computing Archive, vol. 14, no. 3, pp. 199-222, Aug. 2004. [21] B. S. Atal and S. L. Hanauer, “Speech analysis and synthesis by linear prediction of the speech wave,” J. Acoust. Soc. Am., vol. 50, no. 2B, pp. 637–655, 1971. [22] 李俊毅,「語音評分」, 國立清華大學, 2002. [23] C.-W. Hsu and C.-J. Lin, “A comparison of methods for multi-class support vector machines,” IEEE Trans. Neural Networks, vol. 13, no. 2, pp. 415-425, 2002. [24] B. E. Boser, I. M. Guyon, and V. N. Vapnik, “A train algorithm for optimal margin classifiers,” in Proc. Fifth Annual Workshop on Computational Learning Theory, pp. 144-152, 1992. [25] 何育澤, 「基於支持向量機之混合聲響辨認」, 國立清華大學, 2014. [26] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, É. Duchesnay, “Scikit-learn: Machine Learning in Python,” J. Machine Learning Research, vol. 12, pp. 2825-2830, 2011. [27] A. M. Kondoz and B. G. Evans, “A high quality voice coder with integrated echo canceller and voice activity detector for VSAT systems,” in Proc. 3rd Eur. Conf. Satellite Commun., pp. 196-200, 1993. [28] L. R. Rabiner and M. R. Sambur, “An algorithm for determining the endpoints of isolated utterances,” Bell Syst. Tech. J., vol. 54, no. 2, pp. 297-315, 1975. [29] R. Bachu, S. Kopparthi, B. Adapa, and B. Barkana, Separation of voiced and unvoiced using zero crossing rate and energy of the speech signal,” in Proc. Am. Soc. for Eng. Education Zone Conf., pp. 17, 2008. [30] A. Bala, “Voice command recognition system based on mfcc and dtw,” Int. J. Engineering Science and Technology, vol. 2, no. 12, pp. 7335-7342, 2010. [31] M. Diogo, M. Eskenazi, J. Magalhaes, and S. Cavaco, “Robust scoring of voice exercises in computer-based speech therapy systems,” in Signal Processing Conf., pp. 393-397, Aug. 2016. [32] O. Mich, A. Neri, and D. Giuliani, “The effectiveness of a computer assisted pronunciation training system for young foreign language learners,” in Proc. CALL Conf., Taylor & Francis, 2006.
|