Author: Chang, Qi-Lin
Thesis title: Optimization of Graph Convolutional Network Using Improved Simplified Swarm Optimization Algorithm and Natural Gradient Descent
Advisor: Yeh, Wei-Chang
Oral defense committee: Lai, Chyh-Ming; Liu, Ta-Sheng
Keywords: Graph Neural Networks; Graph Convolutional Networks; Improved Simplified Swarm Algorithm; Natural Gradient Descent; Neural Network Model Training
Neural network architectures such as Convolutional Neural Networks have demonstrated excellent performance on many real-world problems. However, early model designs focused mostly on data in Euclidean spaces, and suitable deep learning models for tasks involving non-Euclidean data were lacking. To address this gap, the Graph Neural Network (GNN) model was introduced, providing a framework applicable to most graph data.
While GNN-based models excel at handling graph data, the optimizers used in current architectures rely predominantly on traditional gradient descent (GD) methods. These methods have been shown to be susceptible to getting stuck in local optima, depending heavily on the initial solution, and converging inefficiently. Although natural gradient descent (NGD) can mitigate these issues by accounting for the underlying structure of the parameters, its effectiveness remains limited. Beyond NGD, metaheuristic (MH) algorithms have been proposed for training NN architectures, but existing research has focused only on data in Euclidean spaces. To train GNNs effectively, this study proposes a hybrid algorithm, iSSOβ-Optimizer-KFACϵ, which combines the Improved Simplified Swarm Optimization (iSSO) algorithm with NGD, and validates its feasibility across various datasets. The contributions of this research can be summarized as follows: (1) the first application of MH algorithms to train the weight and bias parameters of GNN models; (2) the proposal of a combination of iSSO and NGD, which outperforms iSSO with traditional GD; (3) exploration of the challenges faced by iSSO in training NN models with a large number of parameter updates, along with the introduction of iSSOβ to address this issue; and (4) achieving state-of-the-art performance on the Cora dataset.
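Abstract (in Chinese)
Abstract (in English)
Table of Contents
Chapter 1 Introduction
1.1 Research Background and Motivation
1.2 Research Objectives
1.3 Thesis Organization
Chapter 2 Literature Review
2.1 Graph Neural Networks
2.2 Graph Convolutional Networks
2.3 Optimizers
2.4 Natural Gradient Descent
2.5 Improved Simplified Swarm Optimization
Chapter 3 Methodology
3.1 Particle Encoding
3.2 Solution Initialization
3.3 Model Evaluation Function
3.4 Algorithm Description
3.4.1 iSSOβ-Optimizer-KFACϵ Notation
3.4.2 iSSOβ-Optimizer-KFACϵ Update Mechanism
Chapter 4 Experiments and Result Analysis
4.1 Datasets
4.2 Hyperparameter Optimization
4.2.1 Experiment 1: Optimizers and Natural Gradient Descent
4.2.2 Experiment 2: iSSOβ Parameter Settings
4.2.3 Experiment 3: iSSOβ-Optimizer
4.3 Comparison of Experimental Results
4.3.1 Cora: ANOVA Test
4.3.2 Citeseer: ANOVA Test
4.3.3 Pubmed: ANOVA Test
4.3.4 Summary
Chapter 5 Conclusions and Future Research
5.1 Conclusions
5.2 Directions for Future Research
References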
