通過卷積自編碼器神經網路之小數據整合於化工製程建模研究_

帳號：guest(216.73.216.88) 離開系統

字體大小：

詳目顯示

第 1 筆 / 共 1 筆

/1頁

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士論文系統

、以作者查詢全國書目

論文基本資料
摘要
外文摘要
論文目次
參考文獻
電子全文

作者(中文):	劉梓堂
作者(外文):	Liu, Tzu-Tang
論文名稱(中文):	通過卷積自編碼器神經網路之小數據整合於化工製程建模研究
論文名稱(外文):	Process Modeling With Small Data Integration via Deep Convolutional Autoencoder-based Embedding Model
指導教授(中文):	姚遠
指導教授(外文):	Yao, Yuan
口試委員(中文):	汪上曉康嘉麟
口試委員(外文):	Wong, Shan-Hill Kang, Jia-Lin
學位類別:	碩士
校院名稱:	國立清華大學
系所名稱:	化學工程學系
學號:	108032538
出版年(民國):	110
畢業學年度:	109
語文別:	中文
論文頁數:	55
中文關鍵詞:	前饋全連接神經網路、卷積自動編碼器、製程建模、小數據、整合分析
外文關鍵詞:	Feedforward fully connected neural network、Convolutional Autoencoder,、Process modeling、Small data、Integrated analysis
相關次數:	推薦:0 點閱:270 評分: 下載:0 收藏:0

隨著數據科學的興起，大數據正在成為學術研究和商業業務發展的流行趨勢。但是，對於高附加價值產業，不一定有足夠的數據可用於建立一個可靠的數據驅動模型。因此如何整合多個不同但類似的任務收集的小數據並通過在任務之間共享信息來構建準確的模型是一個研究挑戰。而其中一個例子是針對不同操作條件配置的雙螺桿擠出機製程進行建模。
卷積神經網絡（CNN）是計算機視覺中常用的深度學習技術。在這項工作中，採用了卷積自動編碼器（一種深度圖像去噪模型）來描述雙螺桿擠出過程中的定性因素，即螺桿元件的幾何形狀。具體而言，通過卷積自動編碼器嵌入來提取這些定性因素中包含的信息；然後將嵌入值以及定量條件合併輸入到前饋全連接神經網絡模型，以實現過程輸出的預測。與傳統的卷積自動編碼器不同，該模型同時考量了自動編碼器的重構損失和最終預測的回歸損失進行迭代訓練，從而確保了模型的可解釋性。
在本次研究中以雙螺桿押出過程的數值模擬用於說明所提出模型的可行性。從研究的結果之下，表現出該模型具有良好的解釋性和預測準確性。特別是對於包含未知定性因素的過程模擬條件下，即使僅收集了有限數量種類的螺桿元件製程數據，該模型仍根據不同螺桿間的相似性而做出合理的預測。

Big data is becoming a popular trend of research and business development. Nevertheless, for high-value process industries, sufficient data is not necessarily available for data-driven process modeling. How to integrate small data collected from several different tasks and build an accurate process model by sharing the information between tasks is a challenge research topic. A typical example is the modeling of a twinscrew extruder for screw configuration.
Convolutional neural network (CNN) has been a common deep learning technique used in computer vision. In this work, a convolutional autoencoder, a deep image denoising model, is adopted to describe the qualitative factors, i.e. the geometries of the screw elements, in a twin-screw extrusion process. In detail, the information contained in these qualitative factors is extracted by convolutional autoencoder embedding; then the embedding codes are connected to a fully connected feedforward neural network, together with the quantitative process conditions, to achieve the prediction of the process outputs. Different from the conventional convolutional autoencoders, the proposed model is trained using both the reconstruction loss of autoencoder and the regression loss of final prediction, ensuring the model interpretability.
Numerical simulations of a twin-screw extrusion process are used to illustrate the feasibility of the proposed model. In the studied case, it shows that this model has both good interpretability and prediction accuracy. Specifically, for the process contain qualitative factors with extrapolate values, the model can still make reasonable predictions, given that only a limited amount of data was collected for each screw configuration.

摘要 1
Abstract 2
目錄 3
圖目錄 4
表目錄 6
第一章緒論 7
1-1前言 7
1-2研究背景(文獻與動機) 8
1-3文章架構 9
第二章研究理論 10
2-1前饋全連接神經網路 10
2-2卷積神經網路 12
2-3處理定性因子之神經網路模型 14
2-3-1自動編碼器 14
2-3-2卷積自編碼器 16
第三章案例分析與實驗方法 14
3-1雙螺桿押出機製程 18
3-2少量數據於雙螺桿押出機製程案例 18
3-2-1 三十種螺桿元件製程之小數據情況 21
3-2-2 額外導入外插製程小數據情況 27
3-3 實驗方法 28
第四章實驗數據與討論 35
4-1 三十種螺桿元件製程之小數據情況實驗結果 35
4-2 額外導入外插製程小數據情況實驗結果 41
4-2-1 二十九種螺桿元件製程之小數據情況 41
4-2-2 二十八種螺桿元件製程之小數據情況 46
第五章結論 53
第六章參考文獻 54

1. Qian, Peter Z. G., Huaiqing Wu, and CF Jeff Wu. "Gaussian process models for computer experiments with qualitative and quantitative factors." Technometrics 50.3 (2008): 383-396.
2. G. Hinton, L. Deng, D. Yu, G. Dahl, A.-r. Mohamed, N. Jaitly, et al., "Deep neural networks for acoustic modeling in speech recognition," IEEE Signal processing magazine, vol. 29, 2012.
3. D. Cireşan, U. Meier, and J. Schmidhuber, "Multi-column deep neural networks for image classification," arXiv preprint arXiv:1202.2745, 2012.
4. T. N. Sainath, O. Vinyals, A. Senior, and H. Sak, "Convolutional, long short-term memory, fully connected deep neural networks," in 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015, pp. 4580-4584.
5. W. Sun, S. Shao, R. Zhao, R. Yan, X. Zhang, and X. Chen, "A sparse auto-encoder-based deep neural network approach for induction motor faults classification," Measurement, vol. 89, pp. 171-178, 2016.
6. M. Kim, W. Lee, J. Yoon, and O. Jo, "Building Encoder and Decoder with Deep Neural Networks: On the Way to Reality," arXiv preprint arXiv:1808.02401, 2018.
7. Y. Shan, T. R. Hoens, J. Jiao, H. Wang, D. Yu, and J. Mao, "Deep crossing: Web-scale modeling without manually crafted combinatorial features," in Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, 2016, pp. 255-262.
8. Pan, S. J., & Yang, Q. (2009). A survey on transfer learning. IEEE Transactions on knowledge and data engineering, 22(10), 1345-1359.
9. Y.-C. Chuang, T. Chen, Y. Yao, and D. S. H. Wong, "Transfer learning for efficient meta-modeling of process simulations," Chemical Engineering Research and Design, vol. 138, pp. 546-553, 2018.
10. C. M. Jaeckle and J. F. MacGregor, "Product transfer between plants using historical process data," AIChE journal, vol. 46, pp. 1989-1997, 2000.
11. W. Yan, S. Hu, Y. Yang, F. Gao, and T. Chen, "Bayesian migration of Gaussian process regression for rapid process modeling and optimization," Chemical Engineering Journal, vol. 166, pp. 1095-1103, 2011.
12. Houlsby, N., Giurgiu, A., Jastrzebski, S., Morrone, B., De Laroussilhe, Q., Gesmundo, A., ... & Gelly, S. (2019, May). Parameter-efficient transfer learning for NLP. In International Conference on Machine Learning (pp. 2790-2799). PMLR.[
13. W. S. McCulloch and W. Pitts, "A logical calculus of the ideas immanent in nervous activity," The bulletin of mathematical biophysics, vol. 5, pp. 115-133, 1943.
14. Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
15. Boureau, Y-Lan, Jean Ponce, and Yann LeCun. "A theoretical analysis of feature pooling in visual recognition." Proceedings of the 27th international conference on machine learning (ICML-10). 2010.
16. D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning representations by back-propagating errors," Cognitive modeling, vol. 5, p. 1, 1988.
17. M. Bierdel and K. Kohlgruber, "Co-Rotating Twin-screw Extruders: Fundamentals, Technology, and Applications," Hanser, Munich, 2007.
18. M. Booy, "Geometry of fully wiped twin‐screw equipment," Polymer Engineering & Science, vol. 18, pp. 973-984, 1978.
19. A. Gaspar‐Cunha, J. A. Covas, and B. Vergnes, "Defining the configuration of co‐rotating twin‐screw extruders with multiobjective evolutionary algorithms," Polymer Engineering & Science, vol. 45, pp. 1159-1173, 2005.
20. D. B. Todd, "Residence time distribution in twin‐screw extruders," Polymer Engineering & Science, vol. 15, pp. 437-443, 1975.
21. J. Joo and T. Kwon, "Analysis of residence time distribution in the extrusion process including the effect of 3‐D circulatory flow," Polymer Engineering & Science, vol. 33, pp. 959-970, 1993.

(此全文20260801後開放外部瀏覽)
電子全文
中英文摘要

推文
推薦
評分
引用網址
轉寄

top

詳目顯示

相關論文