|
[1] L. D. Harmon, "Artificial neuron," Science, vol. 129, no. 3354, pp. 962-963, 1959. [2] Y. LeCun et al., "Backpropagation applied to handwritten zip code recognition," In: Neural computation, vol. 1, no. 4, pp. 541-551, 1989. [3] W. Zaremba, I. Sutskever, and O. Vinyals, "Recurrent neural network regularization," In: arXiv preprint arXiv:1409.2329, 2014. [4] Geoffrey E. Hinton, Li Deng, Dong Yu, George E. Dahl, Abdel rahman Mohamed, Navdeep Jaitly, Andrew Senior, Vincent Vanhoucke, Patrick Nguyen, Tara N. Sainath, and Brian Kingsbury, "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012. [5] Alex Krizhevsky, Ilya Sutskever, and Geoff Hinton, "Imagenet classification with deep convolutional neural networks,” Advances in Neural Information Processing Systems 25, pp. 1106-1114, 2012. [6] Pierre Sermanet, Soumith Chintala, and Yann LeCun, "Convolutional neural networks applied to house numbers digit classification, " International Conference on Pattern Recognition (ICPR 2012), 2012. [7] Q. V. Le, W. Y. Zou, S. Y. Yeung and A. Y. Ng, "Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis," CVPR 2011, pp. 3361-3368, 2011. [8] M. M. Bronstein, J. Bruna, Y. LeCun, A. Szlam and P. Vandergheynst, "Geometric Deep Learning: Going beyond Euclidean data," IEEE Signal Processing Magazine, vol. 34, no. 4, pp. 18-42, 2017. [9] G. Tesauro, D. S. Touretzky, and T. Leen, Advances in neural information processing systems 7. MIT press, 1995. [10] Yoshua Bengio, Learning Deep Architectures for AI, 2009. [11] Z. Wu, S. Pan, F. Chen, G. Long, C. Zhang and P. S. Yu, "A Comprehensive Survey on Graph Neural Networks," IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 1, pp. 4-24, 2021. [12] Gori, M., Monfardini, G., and Scarselli, F., "A new model for learning in graph domains," Proceedings. 2005 IEEE International Joint Conference on Neural Networks, vol. 2, pp. 729-734, 2005. [13] F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner and G. Monfardini, "The Graph Neural Network Model," IEEE Transactions on Neural Networks, vol. 20, no. 1, pp. 61-80, 2009. [14] C. Gallicchio and A. Micheli, "Graph Echo State Networks," The 2010 International Joint Conference on Neural Networks (IJCNN), pp. 1-8, 2010. [15] Pham, P., Nguyen, L.T.T., Pedrycz, W. et al., "Deep learning, graph-based text representation and classification: a survey, perspectives and challenges," Artificial Intelligence Review, vol. 56, pp. 4893-4927, 2023. [16] Justin Gilmer, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl., "Neural message passing for Quantum chemistry," Proceedings of the 34th International Conference on Machine Learning, vol. 70, 2017. [17] Rex Ying, Ruining He, Kaifeng Chen, Pong Eksombatchai, William L. Hamilton, and Jure Leskovec, "Graph Convolutional Neural Networks for Web-Scale Recommender Systems," Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018. [18] S. Rahmani, A. Baghbani, N. Bouguila and Z. Patterson, "Graph Neural Networks for Intelligent Transportation Systems: A Survey," IEEE Transactions on Intelligent Transportation Systems, vol. 24, no. 8, pp. 8846-8885, 2023. [19] Kipf, T. N. & Welling, M., "Semi-Supervised Classification with Graph Convolutional Networks, " Proceedings of the 5th International Conference on Learning Representations, 2017. [20] L. Zhao et al., "T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction," IEEE Transactions on Intelligent Transportation Systems, vol. 21, no. 9, pp. 3848-3858, 2020. [21] Wei-Lin Chiang, Xuanqing Liu, Si Si, Yang Li, Samy Bengio, Cho-Jui Hsieh, " Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks," arXiv preprint arXiv: 1905.07953, 2019. [22] Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Partha Talukdar, "Composition-based Multi-Relational Graph Convolutional Networks," Proceedings of the 8th International Conference on Learning Representations, 2020. [23] Ming Chen, Zhewei Wei, Zengfeng Huang, Bolin Ding, Yaliang Li, "Simple and Deep Graph Convolutional Networks," Proceedings of the 37th International Conference on Machine Learning, vol. 119, pp. 1725-1735, 2020. [24] Lu H, Uddin S., "Disease Prediction Using Graph Machine Learning Based on Electronic Health Data: A Review of Approaches and Trends," Healthcare, vol. 11, no. 7, pp. 1031, 2023. [25] J. Zhang, X. Shi, S. Zhao, and I. King, "STAR-GCN: Stacked and reconstructed graph convolutional networks for recommender systems, " Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 4264–4270, 2019. [26] Lv S, Guo D, Xu J, Tang D, Duan N, Gong M, Shou L, Jiang D, Cao G, Hu S, "Graph-Based Reasoning over Heterogeneous External Knowledge for Commonsense Question Answering," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 05, pp. 8449-8456, 2020. [27] Yao, L., Mao, C., & Luo, Y., "Graph Convolutional Networks for Text Classification," Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 1, pp. 7370-7377, 2019. [28] Shoujin Wang, Liang Hu, Yan Wang, Xiangnan He, Quan Z. Sheng, Mehmet A. Orgun, Longbing Cao, Francesco Ricci, Philip S. Yu, "Graph Learning based Recommender Systems: A Review," Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, pp. 4644-4652, 2019. [29] Sebastian Ruder, "An overview of gradient descent optimization algorithms," arXiv preprint arXiv: 1609.04747, 2016. [30] Shun-ichi Amari, "Natural Gradient Works Efficiently in Learning," Neural Comput, vol. 10, no. 2, pp. 251–276, 1998. [31] Ruder, S., "An overview of gradient descent optimization algorithms," arXiv preprint arXiv: 1609.04747, 2016. [32] Razvan Pascanu, Yoshua Bengio, "Revisiting Natural Gradient for Deep Networks," arXiv preprint arXiv:1301.3584, 2013. [33] Weihua Liu, Xiabi Liu, "A Novel Structured Natural Gradient Descent for Deep Learning," arXiv preprint arXiv:2109.10100, 2021. [34] Mohammad Rasool Izadi, Yihao Fang, Robert Stevenson, Lizhen Lin, "Optimization of Graph Neural Networks with Natural Gradient Descent," 2020 IEEE International Conference on Big Data (Big Data), pp. 171-179, 2020. [35] M. Črepinšek, S.-H. Liu, and M. Mernik, "Exploration and exploitation in evolutionary algorithms: A survey," ACM computing surveys (CSUR), vol. 45, no. 3, pp. 1-33, 2013. [36] G. Xu, "An adaptive parameter tuning of particle swarm optimization algorithm," Applied Mathematics and Computation, vol. 219, no. 9, pp. 4560-4569, 2013. [37] X.-S. Yang, Nature-inspired optimization algorithms. Academic Press, 2020. [38] D. Zang, J. Ding, J. Cheng, D. Zhang, and K. Tang, "A hybrid learning algorithm for the optimization of convolutional neural network," International Conference on Intelligent Computing, pp. 694-705, 2017. [39] Wael, Korani., Malek, Mouhoub., Samira, Sadaoui., "Optimizing Neural Network Weights using Nature-Inspired Algorithms," arXiv preprint arXiv:2105.09983, 2021. [40] Kaveh M, Mesgari MS., "Application of Meta-Heuristic Algorithms for Training Neural Networks and Deep Learning Architectures: A Comprehensive Review," Neural Processing Letters, vol. 55, pp. 4519-4622, 2023. [41] W.-C. Yeh, "An improved simplified swarm optimization," Knowledge-Based Systems, vol. 82, pp. 60-69, 2015. [42] CHIANG, PO-HUNG, "Convolution Neural Network Weight Optimization Using Improved Simplified Swarm Optimization," National Tsing Hua University, Taiwan. [43] James Martens and Roger Grosse, "Optimizing neural networks with kronecker-factored approximate curvature," International conference on machine learning, pp. 2408-2417, 2015. [44] Y. LeCun, Y. Bengio, and G. Hinton, "Deep learning," nature, vol. 521, no. 7553, pp. 436-444, 2015. [45] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," Advances in neural information processing systems, vol. 25, pp. 1097-1105, 2012. [46] G. Hinton et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups," IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 82-97, 2012. [47] Hang Li, "Deep learning for natural language processing: advantages and challenges," National Science Review, vol. 5, issue 1, pp. 24–26, 2018. [48] Sylvester, J. J., "On an Application of the New Atomic Theory to the Graphical Representation of the Invariants and Covariants of Binary Quantics, with Three Appendices, " American Journal of Mathematics, vol. 1, no. 1, 1878, pp. 64-104. [49] Thulasiraman, K.; Swamy, M. N. S., Graphs: Theory and Algorithms, pp.97-125, 1992. [50] D. von Winterfeldt, Ward Edwards, Decision Analysis and Behavioral Research, Cambridge University Press. pp. 63-89, 1986. [51] Guan, F., Zhu, T., Zhou, W. et al., "Graph neural networks: a survey on the links between privacy and security," Artificial Intelligence Review, vol. 57, no. 2, 2024. [52] Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun, "Graph neural networks: A review of methods and applications," AI Open, vol. 1, pp. 57-81, 2020. [53] M. A. Khamsi, An Introduction to Metric Spaces and Fixed Point Theory. New York: Wiley, 2001. [54] Keyulu Xu, Weihua Hu, Jure Leskovec, Stefanie Jegelka, "How Powerful are Graph Neural Networks? " arXiv preprint arXiv:1810.00826, 2018. [55] Yi Ma, Jianye Hao, Yaodong Yang, Han Li, Junqi Jin, Guangyong Chen, " Spectral-based Graph Convolutional Network for Directed Graphs," arXiv preprint arXiv:1907.08990, 2019. [56] Thomas N. Kipf, Max Welling, "Variational Graph Auto-Encoders," arXiv preprint arXiv:1611.07308, 2017. [57] D. H. Hubel and T. N. Wiesel, "Receptive fields, binocular interaction and functional architecture in the cat's visual cortex," The Journal of physiology, vol. 160, no. 1, pp. 106-154, 1962. [58] Fukushima, Kunihiko, "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position," Biological Cybernetics, vol 36, no. 4, pp.193-202, 1980. [59] Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998. [60] Alzubaidi, L., Zhang, J., Humaidi, A.J. et al., "Review of deep learning: concepts, CNN architectures, challenges, applications, future directions, " Journal of Big Data, vol.8, no.53, 2021. [61] Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun, "Spectral networks and locally connected networks on graphs," International Conference on Learning Representations (ICLR), 2014. [62] Michael Defferrard, Xavier Bresson, and Pierre Vandergheynst, "Convolutional neural networks on graphs with fast localized spectral filtering," Advances in Neural Information Processing Systems, 2016. [63] H. Robbins and S. Monro, "A stochastic approximation method," The annals of mathematical statistics, pp. 400-407, 1951. [64] N. Qian, "On the momentum term in gradient descent learning algorithms," Neural networks, vol. 12, no. 1, pp. 145-151, 1999. [65] J. Duchi, E. Hazan, and Y. Singer, "Adaptive subgradient methods for online learning and stochastic optimization," Journal of machine learning research, vol. 12, no. 7, 2011. [66] T. Tieleman and G. Hinton., Lecture 6.5—rmsprop: Divide the gradient by a running average of its recent magnitude, COURSERA: Neural Networks for Machine Learning, 2012. [67] D. P. Kingma and J. Ba, "Adam: A method for stochastic optimization," arXiv preprint arXiv:1412.6980, 2014. [68] M. Kuusela, T. Raiko, A. Honkela and J. Karhunen, "A gradient-based algorithm competitive with variational Bayesian EM for mixture of Gaussians," 2009 International Joint Conference on Neural Networks, pp. 1688-1695, 2009. [69] James Martens, "New insights and perspectives on the natural gradient method," arXiv preprint arXiv:1412.1193, 2014. [70] Weihua Liu, Xiabi Liu. "A Novel Structured Natural Gradient Descent for Deep Learning," arXiv preprint arXiv: 2109.10100, 2021. [71] Zhang, Guodong and Martens, James and Grosse, Roger B., "Fast Convergence of Natural Gradient Descent for Over-Parameterized Neural Networks," Advances in Neural Information Processing Systems, vol. 32, 2019. [72] J. Kennedy and R. Eberhart, "Particle swarm optimization," Proceedings of ICNN'95 - International Conference on Neural Networks, pp. 1942-1948, 1995. [73] W.-C. Yeh, W.-W. Chang, and Y. Y. Chung, "A new hybrid approach for mining breast cancer pattern using discrete particle swarm optimization and statistical method," Expert Systems with Applications, vol. 36, no. 4, pp. 8204-8211, 2009. [74] W.-C. Yeh, "A two-stage discrete particle swarm optimization for the problem of multiple multi-level redundancy allocation in series systems," Expert Systems with Applications, vol. 36, no. 5, pp. 9192-9200, 2009. [75] C.-L. Huang, "A particle-based simplified swarm optimization algorithm for reliability redundancy allocation problems," Reliability Engineering & System Safety, vol. 142, pp. 221-230, 2015. [76] W.-C. Yeh, S.-Y. Tan, "Simplified Swarm Optimization for the Heterogeneous Fleet Vehicle Routing Problem with Time-Varying Continuous Speed Function," Electronics. vol. 10, no.15, p.1775, 2021. [77] W.-C. Yeh, Y.-P. Lin, Y.-C. Liang, C.-M. Lai, C.-L. Huang, "Convolution neural network hyperparameter optimization using simplified swarm optimization," arXiv preprint arXiv: 2103.03995, 2021. [78] D. Karaboga, B. Akay, "A comparative study of artificial bee colony algorithm," Applied Mathematics and Computation, vol. 214, issue 1, pp.108-132, 2009. [79] Chyh-Ming Lai, Chun-Chih Chiu, Yuh-Chuan Shih, Hsin-Ping Huang, "A hybrid feature selection algorithm using simplified swarm optimization for body fat prediction," Computer Methods and Programs in Biomedicine, vol. 226, 2022. [80] X. Zhang, W.-c. Yeh, Y. Jiang, Y. Huang, Y. Xiao, and L. Li, "A case study of control and improved simplified swarm optimization for economic dispatch of a stand-alone modular microgrid," Energies, vol. 11, no. 4, pp. 793, 2018. [81] C.-M. Lai, W.-C. Yeh, and C.-Y. Chang, "Gene selection using information gain and improved simplified swarm optimization," Neurocomputing, vol. 218, pp. 331-338, 2016. [82] McCallum, A.; Nigam, K.; Rennie, J.; and Seymore, K., "Automating the construction of internet portals with machine learning," Information Retrieval, vol. 3, pp. 127-163, 2000. [83] X. Glorot and Y. Bengio, "Understanding the difficulty of training deep feedforward neural networks," Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, vol. 9, pp. 249-256, 2010. [84] Giles, C. & Bollacker, Kurt & Lawrence, Steve, "CiteSeer: An Automatic Citation Indexing System, " Proceedings of 3rd ACM Conference on Digital Libraries, 2000. [85] Zhilin Yang, William Cohen, and Ruslan Salakhudinov, "Revisiting semi-supervised learning with graph embeddings," International conference on machine learning, " pp. 40-48., 2016 [86] Jie Chen, Tengfei Ma, and Cao Xiao, "Fastgcn: fast learning with graph convolutional networks via importance sampling," arXiv preprint arXiv:1801.10247, 2018. [87] Ron Levie et al., "Cayleynets: Graph convolutional neural networks with complex rational spectral filters," IEEE Transactions on Signal Processing, vol. 67, no. 1, pp. 97-109, 2018. [88] Inselberg, A., "The plane with parallel coordinates," The Visual Computer, vol. 1, pp. 69-91, 1985. [89] Fisher, R. A., The Design of Experiments, 1935. [90] Anderson, T. W., Darling, D. A., "Asymptotic Theory of Certain “Goodness of Fit” Criteria Based on Stochastic Processes," The Annals of Mathematical Statistics, vol. 23, no. 2, pp.193-212, 1952. [91] Bartlett, M. S., "Properties of Sufficiency and Statistical Tests," Proceedings of the Royal Society of London. Series A, Mathematical and Physical Sciences, vol. 160, no. 901, pp.268-282, 1937. [92] Tukey, J. W., "Comparing Individual Means in the Analysis of Variance," Biometrics, vol. 5, no. 2, pp. 99-114, 1949. [93] Welch, B. L., "On the Comparison of Several Mean Values: An Alternative Approach," Biometrika, vol. 38, no. 3/4, pp. 330-336, 1951. [94] Games, P. A., & Howell, J. F., "Pairwise Multiple Comparison Procedures with Unequal N’s and/or Variances: A Monte Carlo Study," Journal of Educational Statistics, vol. 1, no. 2, pp.113-125, 1976.
|