|
[1]M. Annamaria, T. Heittola, and T. Virtanen. "TUT database for acoustic scene classification and sound event detection," 24th European Signal Processing Conference. Vol. 2016. 2016. [2]J. Schröder, J. Anemüller, and S. Goetze. "Performance Comparision of GMM, HMM and DNN Based Approches For Acoustic Event Detection Within Task 3 of the DCASE 2016 Challenge," Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016), 2016. [3]S. Adavanne, G. Parascandolo, P. Pertila, T. Heittola, T. Virtanen. “Sound event detection in multichannel audio using spatial and harmonic features," Proceedings of the Detection and Classification of Acoustic Scenes and Events 2016 Workshop (DCASE2016), 2016. [4]E. Cakir, T. Heittola, H. Huttunen, and T. Virtanen, "Polyphonic sound event detection using multi label deep neural networks," 2015 international joint conference on neural networks (IJCNN). IEEE, 2015. [5]E. Cakir, T. Heittola, H. Huttunen, and T. Virtanen, "Multi-label vs. combined single-label sound event detection with deep neural networks," Signal Processing Conference (EUSIPCO), 2015 23rd European. IEEE, 2015. [6]A. Mesaros, T. Heittola, and T. Virtanen. "Metrics for Polyphonic Sound Event Detection," Applied Sciences 6.6 (2016): 162. [7]O. Gencoglu, T. Virtanen, and H. Huttunen, "Recognition of acoustic events using deep neural networks," 2014 22nd European Signal Processing Conference (EUSIPCO). IEEE, 2014. [8]Detection and Classification of Acoustic Scenes and Events 2016. [Online]. Available: http://www.cs.tut.fi/sgn/arg/dcase2016/ [9]Y. Shao and D.-L. Wang, “Robust Speaker Identification Using Auditory Features And Coputational Auditory Scene Analysis,” 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008). [10]T. Irino and R.D. Patterson , “A Dynamic Compressive Gammachirp Auditory Filterbank,” IEEE Transactions on Audio, Speech, and Language Processing ( Volume: 14, Issue: 6, Nov. 2006 ). [11]H.Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. Am., vol. 87, no. 4, pp. 1738-1752, Apr. 1990. [12]H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. on Speech and Audio Proc., vol. 2, no. 4, pp. 578-589, Oct. 1994. [13]S. T. Neely, J. Rodriguez, Y.-W. Liu, W. Jesteadt, and M. P. Gorga (2009), “A computational model of loudness density,” unpublished manuscript. [14]G. Hinton, L. Deng, D. Yu, G. Dahl, M. Abdel-rahman, N. Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara Sainath, and Brian Kingsbury, “Deep Neural Networks for Acoustic Modeling in Speech Recognition,” IEEE Signal Processing Magazine ( Volume: 29, Issue: 6, Nov. 2012 ). [15]M. Tanaka and M. Okutomi, “A Novel Inference of a Restricted Boltzmann Machine,” International Conference on Pattern Recognition (ICPR2014), August, 2014. [16]C. M. Bishop, “Neural Networks for Pattern Recognition (Oxford Press),” (1995). [17]D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, 1986. [18]K. Kumar, C. Kim and R. M. Stern , “Delta-spectral Cepstral Coefficients For Robust Speech Recognition,” Acoustics, Speech, and Signal Processing, 1988. ICASSP-88. [19]S. R. Chetupalli , A. Gopalakrishnan and T. V. Sreenivas, “Feature Selection and Model Optimization for Semi-supervised Speaker Spotting,” Signal Processing Conference (EUSIPCO), 2016 24th European. [20]M. Zhang and Z. Zhou, “Multilabel neural networks with applications to functional genomics and text categorization,” IEEE Trans. Knowledge and Data Engineering, vol. 18, no. 10, pp. 1338–1351, 2006. [21]何育澤,“基於支持向量機之混合聲響辨認,”國立清華大學, 2014. [22]T. Heittola, A. Mesaros, A. Eronen, and T. Virtanen, “Context-dependent sound event detection,” EURASIP J. Audio, Speech, Music Process., vol. 2013, no. 1, p. 1, Jan. 2013. [23]D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, “Detection and Classification of Acoustic Scenes and Events,” IEEE TranSactions on MultiImedia, VOL. 17, NO. 10, October 2015. [24]Q. Kong, I. Sobieraj, W. Wang, M. D. Plumbley, “Deep Neural Network Baseline For DCASE Challenge 2016,” Detection and Classification of Acoustic Scenes and Events 2016. [25]R. F. Lyon, “Cascades of two-pole–two-zero asymmetric resonators are good models of peripheral auditory function,” Journal of the Acoustical Society of America, vol. 130 (2011), pp. 3893-3904. [26]R. F. Lyon, M. Rehn, S. Bengio, T. C. Walters, G. Chechik, “Sound retrieval and ranking using sparse auditory representations,” Neural Computation, Volume 22 Issue 9, September 2010, Pages 2390-2416. [27]C. Clavel, T. Ehrette, G. Richard, “Events Detection for an Audio-Based Surveillance System,” Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on Multimedia and Expo. [28]J. K¨urby, R. Grzeszick, A. Plinge, and G. A. Fink, “Bag-of-Features Acoustic Event Detection For Sensor Networks,” Detection and Classification of Acoustic Scenes and Events 2016. [29]D. P. W. Ellis, “Prediction-driven computational auditory scene analysis,” Doctoral Dissertation, Massachusetts Institute of Technology Cambridge, MA, USA, 1996. [30]C.-W. Wu and Y.-W. Liu, “Event-related sounds in residential environment: Classification and outlier rejection,” in National Computer Symposiums, Taichung, Taiwan, 2013. [31]J. Salamon, C. Jacoby, J. P. Bello, “A Dataset and Taxonomy for Urban Sound Research,” MM '14 Proceedings of the 22nd ACM international conference on Multimedia, Pages 1041-1044, Orlando, Florida, USA — November 03 - 07, 2014. [32]C.-C. Chang and C.-J. Lin, “LIBSVM : a library for support vector machines,” ACM Transactions on Intelligent Systems and Technology, 2:27:1--27:27, 2011. [33]Y.-H. Lai, C.-H. Wang , S.-Y. Hou , B.-Y. Chen , Y. Tsao , Y.-W. Liu, “DCASE Report for Task 3: Sound Event Detection in Read Life Audio,” Detection and Classification of Acoustic Scenes and Events 2016, 2016. [34]S. Sigtia, A. M. Stark, S. Krstulovic and M. D. Plumbley, “Automatic Environmental Sound Recognition: Performance versus Computational Cost,” IEEE/ACM Transactions on Audio, Speech, and Language Processing), Volume: 24, Issue: 11, Nov. 2016.
|