
Detailed Record

Author (Chinese): 李哲宇
Author (English): Lee, Che-Yu
Title (Chinese): 以遞歸卷積神經網路擷取財經新聞知識預測股價
Title (English): Extracting Information from Financial News to Predict Stock Price Using Recurrent Convolutional Neural Networks
Advisor (Chinese): 蘇豐文
Advisor (English): Soo, Von-Wun
Committee members (Chinese): 陳宜欣, 林哲群
Committee members (English): Chen, Yi-Shin; Lin, Che-Chun
Degree: Master's
University: National Tsing Hua University
Department: Institute of Information Systems and Applications
Student ID: 104065531
Publication year (ROC calendar): 106 (2017)
Graduation academic year: 105
Language: English
Number of pages: 61
Keywords (Chinese): 機器學習、卷積神經網路、股價預測、深度學習、遞歸神經網路、財經新聞、詞嵌入
Keywords (English): Machine Learning, Convolutional Neural Networks, Stock Forecast, Deep Learning, Recurrent Neural Networks, Word Embedding, Financial News
Statistics:
  • Recommendations: 0
  • Views: 640
  • Rating: *****
  • Downloads: 117
  • Bookmarks: 0
Abstract (Chinese): Stock price prediction has long attracted wide interest, yet the uncertainty of its driving factors makes it a persistently difficult problem. In this work, we combine word embeddings, the time-series forecasting strength of long short-term memory (LSTM) networks, and the feature-extraction strength of convolutional neural networks, and propose a recurrent convolutional neural network to predict the rise and fall of stock prices. We further combine technical analysis indicators with this model, and the results show that the combined model yields higher returns than the technical analysis models alone. In addition, compared with a plain LSTM network, the model achieves lower prediction error on stock prices. Finally, through the feature-extraction property of the convolutional network, we are also able to extract financial knowledge from the news.
Abstract (English): People have long been interested in making profits from financial market prediction, but stock market forecasting has always been a frustrating problem because of its uncertainty and volatility. We take a different approach with a model named recurrent convolutional neural network (RCN), which combines the advantages of convolutions, sequence modeling, and word embeddings for stock price analysis and knowledge extraction. We combine technical analysis indicators with the RCN, and the results suggest that technical analysis models augmented with the RCN perform better. In addition, a further experiment indicates that the prediction error of the RCN is lower than that of long short-term memory (LSTM) networks. Moreover, we are able to extract information from financial news during the training process.
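The abstract does not specify which technical analysis indicators are used. As a hedged illustration of the kind of signal such indicators produce, the following sketch computes a simple moving-average (SMA) crossover rule, one common technical indicator; the window sizes are arbitrary example choices, not the thesis's actual settings:

```python
def sma(prices, window):
    """Simple moving average; None until a full window of prices is available."""
    return [
        None if i + 1 < window else sum(prices[i + 1 - window:i + 1]) / window
        for i in range(len(prices))
    ]

def crossover_signals(prices, short=3, long=5):
    """+1 (buy) when the short SMA crosses above the long SMA,
    -1 (sell) when it crosses below, 0 otherwise."""
    s, l = sma(prices, short), sma(prices, long)
    signals = [0] * len(prices)
    for i in range(1, len(prices)):
        if None in (s[i - 1], l[i - 1]):
            continue  # not enough history for both averages yet
        prev_diff = s[i - 1] - l[i - 1]
        diff = s[i] - l[i]
        if prev_diff <= 0 < diff:
            signals[i] = 1
        elif prev_diff >= 0 > diff:
            signals[i] = -1
    return signals
```

For a price series that rises and then falls, such as `[1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1]`, the rule emits a single sell signal shortly after the peak, once the short average drops below the long one.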
Table of contents:
1 Introduction
1.1 Stock Price Analysis
1.1.1 Fundamental Analysis
1.1.2 Technical Analysis
1.2 Bringing Machine Learning Into Play
1.2.1 Neural Networks Learning
1.2.2 Knowledge Extraction
1.3 Motivation and Problem Description
1.3.1 Prediction and Knowledge Extraction
2 Related Work
2.1 Applications of Machine Learning to Financial Analysis
2.1.1 Stock Price Forecasting
2.1.2 Portfolio Selection and Optimization
2.1.3 Deep Learning in Finance
2.2 Recurrent Neural Networks in Different Fields
2.2.1 Recurrent Neural Networks on Text Summarization and Speech Recognition
2.2.2 Recurrent Neural Networks on Music Generation
2.3 Applications of Convolutional Neural Networks
2.3.1 Convolutional Neural Networks on Image Processing and Recognition
2.3.2 Sentiment Analysis
3 Methodology
3.1 Word Embedding – Word2vec
3.1.1 Neural Network of word2vec
3.1.2 Continuous Bag of Words and Skip-gram
3.1.3 Evaluation of the word2vec bin
3.2 Convolutional Neural Networks
3.2.1 Convolutions and Feature Extraction
3.2.2 Pooling and Flattening
3.2.3 Fully-Connected Layers
3.3 Long Short-Term Memory Networks
3.3.1 Recurrent Neural Network and Its Limitations
3.3.2 Long Short-Term Memory Network Architecture
4 Evaluation
4.1 Experimental Setups
4.1.1 Input Data
4.1.2 Models and Evaluation Metrics
4.1.3 Presented Model
4.1.4 Combination of Technical Analysis Models and RCN
4.2 Hyperparameters Tuning
4.2.1 Grid Search
4.2.2 Tuned Hyperparameters
4.3 Results
4.3.1 Comparison
4.3.2 Evaluation of Learned Filters
5 Discussion and Conclusion
5.1 Discussion
5.2 Conclusion
References
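Section 3.1.2 of the outline covers the Continuous Bag of Words and skip-gram training objectives of word2vec. As a hedged sketch (the window size and whitespace tokenization here are illustrative assumptions, not the thesis's actual preprocessing), skip-gram training pairs are built by pairing each word with every neighbor inside a context window:

```python
def skipgram_pairs(tokens, window=2):
    """For each position, emit (center, context) pairs for every
    neighbor within `window` positions of the center word."""
    pairs = []
    for i, center in enumerate(tokens):
        lo = max(0, i - window)
        hi = min(len(tokens), i + window + 1)
        for j in range(lo, hi):
            if j != i:  # a word is never its own context
                pairs.append((center, tokens[j]))
    return pairs
```

For example, `skipgram_pairs("stocks rose after earnings".split(), window=1)` yields six pairs, including `("stocks", "rose")` and `("earnings", "after")`; CBOW uses the same windows but predicts the center word from the context instead.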