Author (Chinese): 于庭妍
Author (English): Yu, Ting-Yan
Thesis Title (Chinese): 應用深度學習模型與注意力機制於軟體錯誤預測之研究
Thesis Title (English): Analysis of Applying Deep Learning Model with Attention Mechanism for Software Defect Prediction
Advisor (Chinese): 黃慶育
Advisor (English): Huang, Chin-Yu
Committee Members (Chinese): 林振緯、蘇銓清、林其誼
Committee Members (English): Lin, Jenn-Wei; Sue, Chuan-Ching; Lin, Chi-Yi
Degree: Master's
Institution: National Tsing Hua University
Department: Department of Computer Science
Student ID: 106062634
Year of Publication (ROC calendar): 108 (2019)
Graduating Academic Year: 107
Language: English
Number of Pages: 87
Keywords (Chinese): 軟體工程、錯誤預測、深度學習、卷積神經網路、注意力機制、自注意力機制
Keywords (English): Software engineering; Defect prediction; Deep learning; Convolutional Neural Network; Attention mechanism; Self-Attention mechanism
Advances in science and technology have made software development increasingly complex and software projects increasingly large. To meet quality requirements and complete development schedules on time, numerous software engineering techniques and methods have been proposed. Software defect prediction is one such technique: it improves program reliability by helping developers locate the defects that cause errors and failures, and it saves testing time. Over the past 15 years, most defect prediction studies have been based on size and complexity metrics. More recently, with the development of machine learning, a growing number of machine-learning-based defect prediction studies have been conducted. Most of these techniques predict defects using features extracted from labeled historical defect data.
Existing traditional features are manually extracted from labeled historical defect data. However, programs also carry semantic and structural information that is important for modeling program functionality, and traditional features cannot capture it. Choosing effective features is therefore critical to building an accurate prediction model. In this study, we constructed a deep learning model called Defect Prediction via Self-Attention Mechanism (DPSAM) to extract semantic features of programs and predict defects automatically. We parsed programs into abstract syntax trees (ASTs) and encoded them into token vectors; using these input features, we trained a self-attention model to extract the programs' semantic features and predict defects. There are two kinds of defect prediction: within-project defect prediction (WPDP), in which the training data and testing data come from the same project, and cross-project defect prediction (CPDP), in which they come from different projects.
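As an illustration of this pipeline, the sketch below is a minimal, hypothetical example, not the thesis's actual DPSAM implementation: it parses a program into an AST, flattens it into node-type tokens, encodes them as integer IDs, and classifies the sequence with a single self-attention layer. The thesis works on Java projects; Python's built-in ast module and PyTorch stand in here so the example is self-contained, and all names and hyperparameters are assumptions.

```python
# Illustrative sketch only -- not the thesis's DPSAM implementation.
import ast
import torch
import torch.nn as nn

def program_to_tokens(source: str) -> list[str]:
    """Parse source into an AST and flatten it to node-type tokens."""
    tree = ast.parse(source)
    return [type(node).__name__ for node in ast.walk(tree)]

def encode(tokens: list[str], vocab: dict[str, int]) -> torch.Tensor:
    """Map token strings to integer IDs, adding unseen tokens to the vocab.
    IDs start at 1 so that 0 can serve as the padding index."""
    ids = [vocab.setdefault(t, len(vocab) + 1) for t in tokens]
    return torch.tensor([ids])               # shape (1, seq_len)

class DefectClassifier(nn.Module):
    """Embed token IDs, apply one self-attention layer, classify."""
    def __init__(self, vocab_size=1000, embed_dim=64, num_heads=4):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.out = nn.Linear(embed_dim, 2)   # defective vs. clean

    def forward(self, token_ids):
        x = self.embed(token_ids)            # (batch, seq, embed)
        attended, _ = self.attn(x, x, x)     # self-attention: Q = K = V
        return self.out(attended.mean(dim=1))

vocab: dict[str, int] = {}
ids = encode(program_to_tokens("def f(x):\n    return x + 1"), vocab)
logits = DefectClassifier()(ids)             # (1, 2) class scores
```

In practice, DPSAM's AST traversal, vocabulary handling, and network depth differ; the sketch only shows how a self-attention layer can consume an encoded token sequence end to end.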
We evaluated prediction performance on 10 open-source projects using F1 score, precision, recall, and accuracy; the F1 score combines both precision and recall into a single performance score. In addition, to examine the trade-off between performance and training time, our proposed approach was compared against selected deep-learning-based methods. In WPDP, DPSAM improved the F1 score by 25% and 42%, and improved accuracy by factors of 1.27 and 1.17, over the selected machine-learning-based and deep-learning-based methods, respectively. In CPDP, DPSAM improved the F1 score by 31% and 54%, and improved accuracy by factors of 1.25 and 1.18, over the same two baselines.
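For reference, the F1 score and its components follow the standard confusion-matrix definitions, where TP, FP, and FN denote true positives, false positives, and false negatives:

```latex
\mathrm{Precision} = \frac{TP}{TP + FP}, \qquad
\mathrm{Recall} = \frac{TP}{TP + FN}, \qquad
F_1 = \frac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}.
```

As a hypothetical worked example: a predictor that flags 40 files as defective, 30 of them correctly, out of 50 truly defective files has precision 30/40 = 0.75, recall 30/50 = 0.60, and F1 = 2(0.75)(0.60)/(0.75 + 0.60) ≈ 0.67.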
LIST OF FIGURES
LIST OF TABLES
LIST OF SYMBOLS
ABSTRACT
CHINESE ABSTRACT (中文摘要)

1. INTRODUCTION

2. RELATED WORK
2.1 OVERVIEW OF SOFTWARE DEFECT PREDICTION
2.2 DEEP LEARNING IN DEFECT PREDICTION
2.3 ATTENTION MECHANISM

3. DEFECT PREDICTION VIA SELF-ATTENTION MECHANISM
3.1 PARSING SOURCE CODE
3.2 DATA PREPROCESSING
3.2.1 Encoding Tokens
3.2.2 Handling Imbalance
3.3 TRAINING THE SELF-ATTENTION MECHANISM AND PREDICTING DEFECTS

4. EXPERIMENT AND DISCUSSION
4.1 DATASET
4.2 EVALUATION METRICS
4.3 BASELINE METHODS
4.4 EXPERIMENTAL RESULTS
4.4.1 Within-Project Defect Prediction (WPDP)
4.4.2 Cross-Project Defect Prediction (CPDP)
4.5 OBSERVATIONS AND DISCUSSION
4.6 RESEARCH QUESTIONS
4.6.1 RQ1: Does the Self-Attention mechanism outperform machine-learning-based methods in defect prediction?
4.6.2 RQ2: How much time do deep-learning-based methods consume in WPDP and CPDP?
4.6.3 RQ3: Does the Self-Attention mechanism outperform deep-learning-based methods in cross-project defect prediction?
4.6.4 RQ4: Does the Self-Attention mechanism outperform deep-learning-based methods in within-project defect prediction?
4.6.5 RQ5: Do different parameter settings in the Self-Attention mechanism affect model performance?
4.6.6 RQ6: What is the benefit of the Self-Attention mechanism?
4.6.7 RQ7: Does the Self-Attention mechanism outperform traditional methods in defect prediction?
4.7 THREATS TO VALIDITY

5. CONCLUSION AND FUTURE WORK

6. REFERENCES

APPENDIX A. CROSS-PROJECT DEFECT PREDICTION RESULTS
A.1 F1 SCORE
A.2 ACCURACY
A.3 PRECISION
A.4 RECALL
(The full text of this thesis has not been authorized for public release.)