Responsive image
博碩士論文 etd-0706124-091114 詳細資訊
Title page for etd-0706124-091114
論文名稱
Title
運用文字探勘分析網路新聞情緒預測歐元匯率
Using text-mining technique to analyze online news to predict EUR foreign exchange
系所名稱
Department
畢業學年期
Year, semester
語文別
Language
學位類別
Degree
頁數
Number of pages
61
研究生
Author
指導教授
Advisor
召集委員
Convenor
口試委員
Advisory Committee
口試日期
Date of Exam
2024-07-22
繳交日期
Date of Submission
2024-08-06
關鍵字
Keywords
文字探勘、新聞情緒、匯率預測、LSTM、VARMA
Text-mining, News sentiment, Exchange rate prediction, LSTM, VARMA
統計
Statistics
本論文已被瀏覽 437 次,被下載 18
The thesis/dissertation has been browsed 437 times, has been downloaded 18 times.
中文摘要
外匯市場是全球流動性最強的金融市場,匯率波動對企業和國家的經濟競爭力有重要影響,不論企業是否進行跨國交易,匯率波動都會影響其經營成本和利潤。隨著網路技術的發展,網路新聞成為影響外匯市場的重要因素,然而人工分析大量新聞的速度常常落後於市場反應,導致決策延誤。因此,本研究運用文字探勘技術分析網路新聞情緒,以預測隔日歐元兌美元的匯率變動,並探討其有效性。
研究期間為2020年1月1日至2023年12月29日的歐元兌美元收盤匯率資料,並從Investing.com蒐集相應期間內的「FOREX」分類之新聞文章,利用Bert-base進行情感分析,將情緒分數納入VARMA和LSTM模型中進行匯率預測,並評估加入新聞情緒前後兩模型的預測效果。研究發現,加入新聞情緒變數後,LSTM模型的預測準確度顯著提高,優於傳統的VARMA模型。因此,新聞情緒對匯率的波動具有有效的預測能力,能夠有效預測隔日歐元匯率。
本研究證實了新聞情緒在預測匯率變動中的重要性,並表明LSTM模型結合情緒分析在匯率預測中的優越性,企業可利用這一方法應對市場變化,減少匯率風險。本研究提供了文字探勘技術在金融市場中的應用範例,對未來的研究具有參考價值。
Abstract
The foreign exchange market is the most liquid financial market globally, with exchange rate fluctuations significantly impacting economic competitiveness. These fluctuations affect operating costs and profits, regardless of cross-border transactions. Online news has become a crucial factor influencing the market, but manual news analysis often lags behind market reactions, causing delayed decisions. This study uses text mining techniques to analyze online news sentiment to predict next-day EUR/USD exchange rate changes and assess its effectiveness.
The research covers EUR/USD closing exchange rate data from January 1, 2020, to December 29, 2023, and collects news articles categorized under "FOREX" from Investing.com. Sentiment analysis is conducted using Bert-base, with sentiment scores incorporated into VARMA and LSTM models for exchange rate prediction. The study evaluates model performance with and without news sentiment. Results show that incorporating news sentiment significantly enhances the LSTM model's prediction accuracy, outperforming the traditional VARMA model. Thus, news sentiment effectively predicts exchange rate fluctuations and the next day's EUR exchange rate.
This study confirms the importance of news sentiment in predicting exchange rate changes and demonstrates the superiority of combining sentiment analysis with the LSTM model. Businesses can use this method to respond to market changes and reduce exchange rate risks. This study provides a valuable example of applying text mining techniques in the financial market, offering reference value for future research.
目次 Table of Contents
論文審定書 i
誌謝 ii
摘要 iii
Abstract iv
目錄 v
圖次 vii
表次 viii
第一章、 緒論 1
第一節、 研究背景 1
第二節、 研究動機 2
第三節、 研究目的 3
第二章、 文獻探討 5
第一節、 文字探勘與情緒分析 5
第二節、 新聞情緒對於市場影響 5
第三節、 BERT 6
第四節、 長短期記憶模型(LSTM) 9
第三章、 研究方法 14
第一節、 研究標的 15
第二節、 解釋變數 15
(一). 通膨差異(INF) 15
(二). 實質即期利率差分(INT) 17
(三). 恐慌指數(Volatility Index,VIX) 17
(四). MSCI Europe Index(MSCI) 17
第三節、 新聞情緒(News_sent) 19
第四節、 實驗模型 23
(一). 傳統時間序列模型-向量自回歸移動平均模型(VARMA) 23
(二). 神經網路模型-長短期記憶模型(LSTM) 26
第五節、 評估預測表現 27
(一). 平均平方誤差(Mean-Square Error , MSE) 28
(二). 均方根誤差(Root Mean Squared Error , RMSE) 28
(三). 平均絕對誤差(Mean Absolute Error , MAE) 28
(四). 平均絕對百分比誤差(Mean Absolute Percentage Error , MAPE) 29
第四章、 實驗分析 30
第一節、 資料敘述統計 30
第二節、 平穩性檢定 35
第三節、 實驗模型分析結果 37
第五章、 結論與建議 42
第一節、 研究結論 42
第二節、 研究限制與建議 43
參考文獻 44
參考文獻 References
1. Adebiyi, A., Adewumi, A., & Ayo, C. (2014). Stock price prediction using the ARIMA model. In Proceedings—UKSim-AMSS 16th International Conference on Computer Modelling and Simulation, UKSim 2014. https://doi.org/10.1109/UKSim.2014.67
2. Aggarwal, R., & Harper, J. T. (2010). Foreign exchange exposure of “domestic” corporations. Journal of International Money and Finance, 29(8), 1619–1636. https://doi.org/10.1016/j.jimonfin.2010.05.003
3. Akpınar, M., & Yumusak, N. (2013). Forecasting household natural gas consumption with ARIMA model: A case study of removing cycle. 1–6. https://doi.org/10.1109/ICAICT.2013.6722753
4. Anaroua, A., Adhikari, H., & Choi, Y. (2023). Analysis of US Airline Stock Performance using Latent Dirichlet Allocation (LDA). https://doi.org/10.21203/rs.3.rs-3084031/v1
5. Armstrong, J. S. (1985). Long-Range Forecasting: From Crystal Ball to Computer. Wiley. https://archive.org/details/longrangeforecas0002arms
6. Athanasopoulos, G., Poskitt, D. S., & Vahid, F. (2012). Two Canonical VARMA Forms: Scalar Component Models Vis-à-Vis the Echelon Form. Econometric Reviews, 31(1), 60–83. https://doi.org/10.1080/07474938.2011.607088
7. Bhandari, H., Rimal, B., Pokhrel, N. R., Rimal, R., & Dahal, K. (2022). LSTM-SDM: An integrated framework of LSTM implementation for sequential data modeling. Software Impacts, 14. https://doi.org/10.1016/j.simpa.2022.100396
8. Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. J. Mach. Learn. Res., 3(null), 993–1022.
9. Bollen, J., Mao, H., & Zeng, X. (2011). Twitter mood predicts the stock market. Journal of Computational Science, 2(1), 1–8. https://doi.org/10.1016/j.jocs.2010.12.007
10. Bothos, E., Apostolou, D., & Mentzas, G. (2010). Using Social Media to Predict Future Events with Agent-Based Markets. IEEE Intelligent Systems, 25, 50–58. https://doi.org/10.1109/MIS.2010.152
11. Bowman, S. R., Angeli, G., Potts, C., & Manning, C. D. (2015). A large annotated corpus for learning natural language inference. In L. Màrquez, C. Callison-Burch, & J. Su (Eds.), Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp. 632–642). Association for Computational Linguistics. https://doi.org/10.18653/v1/D15-1075
12. Box, G. E., Jenkins, G. M., Reinsel, G. C., & Ljung, G. M. (2015). Time series analysis: Forecasting and control. John Wiley & Sons.
13. Cassel, G. (1916). The Present Situation of the Foreign Exchanges. The Economic Journal, 26(103), 319–323. https://doi.org/10.2307/2221918
14. Chaboud, A., Rime, D., & Sushko, V. (2023). The foreign exchange market. BIS Working Papers. https://www.bis.org/publ/work1094.htm
15. Chen, M., Guo, Z., Abbass, K., & Huang, W. (2022). Analysis of the impact of investor sentiment on stock price using the latent dirichlet allocation topic model. Frontiers in Environmental Science, 10. https://doi.org/10.3389/fenvs.2022.1068398
16. Chen, S., & Hsu, C. (2019). Do stock markets have predictive content for exchange rate movements? Journal of Forecasting, 38(7), 699–713. https://doi.org/10.1002/for.2592
17. Coudert, V., Mignon, V., & Penot, A. (2008). Oil price and the dollar. Energy Studies Review, 15. https://doi.org/10.15173/esr.v15i2.508
18. Crawford, G., & Fratantoni, M. (2003). Assessing the forecasting performance of regime-switching, ARIMA and GARCH models of house prices. Real Estate Economics, 31, 223–243. https://doi.org/10.1111/1540-6229.00064
19. Feng, W., Guan, N., Li, Y., Zhang, X., & Luo, Z. (2017). Audio visual speech recognition with multimodal recurrent neural networks. 2017 International Joint Conference on Neural Networks (IJCNN), 681–688. https://doi.org/10.1109/IJCNN.2017.7965918
20. Feunou, B. (2009). No-Arbitrage VARMA Term Structure Models with Macroeconomic Variables. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.1360864
21. Fischer, T., & Krauss, C. (2018). Deep learning with long short-term memory networks for financial market predictions. European Journal of Operational Research, 270(2), 654–669. https://doi.org/10.1016/j.ejor.2017.11.054
22. Freeman, J., Hays, J., & Stix, H. (2000). Democracy and markets: The case of exchange rates. American Journal of Political Science, 44. https://doi.org/10.2307/2669258
23. García, F., Guijarro, F., Oliver, J., & Tamošiūnienė, R. (2023). Foreign Exchange Forecasting Models: ARIMA and LSTM Comparison. ITISE 2023, 81. https://doi.org/10.3390/engproc2023039081
24. Ghalayini, L. (2013). Modeling and Forecasting the US Dollar/Euro Exchange Rate. International Journal of Economics and Finance, 6(1), p194. https://doi.org/10.5539/ijef.v6n1p194
25. Gillam, L., Ahmad, K., Ahmad, S., Cheng, D., Oliveira, P., & Manomaisupat, P. (2003). Economic News and Stock Market Correlation: A Study of the UK.
26. Giot, P. (2005). Relationships between implied volatility indices and stock index returns. Journal of Portfolio Management, 31(3), 92–100.
27. Glorot, X., Bordes, A., & Bengio, Y. (2010). Deep Sparse Rectifier Neural Networks. 15.
28. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT press.
29. Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. The Journal of Machine Learning Research, 3(null), 1157–1182.
30. Hai, Z., Chang, K., Kim, J.-J., & Yang, C. C. (2014). Identifying Features in Opinion Mining via Intrinsic and Extrinsic Domain Relevance. IEEE Transactions on Knowledge and Data Engineering, 26(3), 623–634. https://doi.org/10.1109/TKDE.2013.26
31. Halder, S. (2022). FinBERT-LSTM: Deep Learning based stock price prediction using News Sentiment Analysis (arXiv:2211.07392). arXiv. https://doi.org/10.48550/arXiv.2211.07392
32. Hau, H., & Rey, H. (2006). Exchange Rates, Equity Prices, and Capital Flows. The Review of Financial Studies, 19(1), 273–317.
33. Henckel, T. (2004). Default on Government Debt and Exchange Rate Dynamics. Econometric Society, Econometric Society 2004 Australasian Meetings.
34. Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.
35. Hodson, T. O. (2022). Root-mean-square error (RMSE) or mean absolute error (MAE): When to use them or not. Geoscientific Model Development, 15(14), 5481–5487. https://doi.org/10.5194/gmd-15-5481-2022
36. How, D. N. T., Loo, C. K., & Sahari, K. S. M. (2016). Behavior recognition for humanoid robots using long short-term memory. International Journal of Advanced Robotic Systems, 13(6), 1729881416663369. https://doi.org/10.1177/1729881416663369
37. Huang, X., Zhang, W., Tang, X., Zhang, M., Surbiryala, J., Iosifidis, V., Liu, Z., & Zhang, J. (2021). LSTM Based Sentiment Analysis for Cryptocurrency Prediction (arXiv:2103.14804). arXiv. https://doi.org/10.48550/arXiv.2103.14804
38. Keynes, J. M. (1923). A Tract on Monetary Reform. Macmillan. https://books.google.com.tw/books?id=8HsuAAAAYAAJ
39. King, M., & Rime, D. (2010). The $4 Trillion Question: What Explains FX Growth Since the 2007 Survey? BIS Quarterly Review.
40. Konchady, M. (2006). Text Mining Application Programming. Charles River Media. https://books.google.com.tw/books?id=m9JQAAAAMAAJ
41. Kumar, M., & M., T. (2014). Forecasting stock index returns using ARIMA-SVM, ARIMA-ANN, and ARIMA-random forest hybrid models. International Journal of Banking, Accounting and Finance, 5, 284. https://doi.org/10.1504/IJBAAF.2014.064307
42. Lin, L., & Kuo, C.-J. (2008). The Relation between Exchange Rate Volatility and Firm Valuation: Polynomial Distributed Lag Model. Journal of Economics and Management, 4, 145–162.
43. Liu, B. (2022). Sentiment analysis and opinion mining. Springer Nature.
44. Mashadihasanli, T. (2022). Stock Market Price Forecasting Using the Arima Model: An Application to Istanbul, Turkiye. Journal of Economic Policy Researches, 9(2), 439–454.
45. Miśkiewicz, J. (2021). Network Analysis of Cross-Correlations on Forex Market during Crises. Globalisation on Forex Market. Entropy, 23, 352. https://doi.org/10.3390/e23030352
46. Naik, D. A., Mythreyan, S., & Seema, S. (2022). Relevance Feature Discovery in Text Mining Using NLP. 2022 3rd International Conference for Emerging Technology (INCET), 1–6. https://doi.org/10.1109/INCET54531.2022.9824807
47. Namouri, H., Jawadi, F., Ftiti, Z., & Hachicha, N. (2018). Threshold effect in the relationship between investor sentiment and stock market returns: A PSTR specification. Applied Economics, 50(5), 559–573. https://doi.org/10.1080/00036846.2017.1335387
48. Nanthiya, D., Gopal, S. B., Balakumar, S., Harisankar, M., & Midhun, S. P. (2023). Gold price prediction using ARIMA model. 2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN), 1–6. https://doi.org/10.1109/ViTECoN58111.2023.10157017
49. Palangi, H., Deng, L., Shen, Y., Gao, J., He, X., Chen, J., Song, X., & Ward, R. (2016). Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(4), 694–707. https://doi.org/10.1109/TASLP.2016.2520371
50. Pang, B., & Lee, L. (2008). Opinion Mining and Sentiment Analysis. Foundations and Trends® in Information Retrieval, 2(1–2), 1–135. https://doi.org/10.1561/1500000011
51. Pornwattanavichai, A., Maneeroj, S., & Boonsiri, S. (2022). BERTFOREX: Cascading Model for Forex Market Forecasting Using Fundamental and Technical Indicator Data Based on BERT. IEEE Access, 10, 23425–23437. https://doi.org/10.1109/ACCESS.2022.3152152
52. Shin, K.-H., Jung, J.-W., Chang, K.-H., Kim, K., Jung, W., Lee, D.-I., & You, C.-H. (2022). Dynamical prediction of two meteorological factors using the deep neural network and the long short-term memory (ΙΙ). Journal of the Korean Physical Society, 80, 1–17. https://doi.org/10.1007/s40042-022-00472-4
53. Shumway, R. H., Stoffer, D. S., & Stoffer, D. S. (2000). Time series analysis and its applications (Vol. 3). Springer.
54. Simionescu, M. (2013). The Use of Varma Models in Forecasting Macroeconomic Indicators. ECONOMICS & SOCIOLOGY, 6(2), 94–102. https://doi.org/10.14254/2071-789X.2013/6-2/9
55. Simon, D. P. (2003). The Nasdaq volatility index during and after the bubble. Journal of Derivatives, 11(2), 9–24.
56. Smales, L. A. (2015). Time-variation in the impact of news sentiment. International Review of Financial Analysis, 37, 40–50. https://doi.org/10.1016/j.irfa.2014.11.019
57. Sousa, M. G., Sakiyama, K., Rodrigues, L. D. S., Moraes, P. H., Fernandes, E. R., & Matsubara, E. T. (2019). BERT for Stock Market Sentiment Analysis. 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), 1597–1601. https://doi.org/10.1109/ICTAI.2019.00231
58. Srivastava, N., Hinton, G., Krizhevsky, A., Sutskever, I., & Salakhutdinov, R. (2014). Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 1929–1958.
59. Tandel, S. S., Jamadar, A., & Dudugu, S. (2019). A Survey on Text Mining Techniques. 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS), 1022–1026. https://doi.org/10.1109/ICACCS.2019.8728547
60. Vuong, G. T. H., Nguyen, M. H., & Keung Wong, W. (2022). CBOE volatility index (VIX) and corporate market leverage. Cogent Economics & Finance, 10(1), 2111798. https://doi.org/10.1080/23322039.2022.2111798
61. Whaley, R. E. (1993). Derivatives on market volatility. The Journal of Derivatives, 1(1), 71–84.
62. Y. Ayyappa, B. V. Kumar, S. P. Priya, S. Akhila, T. P. Vardhan Reddy, & S. M. Goush. (2023). Forecasting Equity Prices using LSTM and BERT with Sentiment Analysis. 2023 International Conference on Inventive Computation Technologies (ICICT), 643–648. https://doi.org/10.1109/ICICT57646.2023.10134443
63. Yıldırım, D. C., Toroslu, I. H., & Fiore, U. (2021). Forecasting directional movement of Forex data using LSTM with technical and macroeconomic indicators. Financial Innovation, 7(1), 1. https://doi.org/10.1186/s40854-020-00220-2
64. 周佩親. (n.d.). 探討重大新聞對於匯率之影響-以新台幣兌美元為例 [國立政治大學]. In 國際經營與貿易研究所: Vol. 碩士. 臺灣博碩士論文知識加值系統. https://hdl.handle.net/11296/uy7cs9
65. 曾子維. (2021). 基於新聞情緒建構LSTM神經網路之匯率預測模型與量化交易策略 [國立臺灣師範大學]. In 管理研究所: Vol. 碩士. https://hdl.handle.net/11296/682jx5
66. 李揚生. (2017). 文字探勘與財經新聞: 新聞是否會改變市場行為 [國立中山大學]. In 財務管理學系研究所: Vol. 碩士. 臺灣博碩士論文知識加值系統. https://hdl.handle.net/11296/2px56t
67. 王彥鈞. (2017). 不同市場狀態下新聞情緒的預測能力:以台灣五十指數為例 [國立中央大學]. In 財務金融學系: Vol. 碩士. https://hdl.handle.net/11296/3cea6s
68. 謝人俊 & 林耀傑. (2020). 數位金融環境下銀行經營型態的演變與對策. 中央銀行金融業務檢查處.
69. 陳裕菘, 謝邦昌, 李勝輝, & 陳郁婷. (2014). 運用文字探勘與資料採礦技術建立匯率預測模型-以人民幣兌新台幣為例. Journal of Data Analysis, 9(1). https://doi.org/10.6338/JDA.201402_9(1).0007
電子全文 Fulltext
本電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
論文使用權限 Thesis access permission:校內校外完全公開 unrestricted
開放時間 Available:
校內 Campus: 已公開 available
校外 Off-campus: 已公開 available


紙本論文 Printed copies
紙本論文的公開資訊在102學年度以後相對較為完整。如果需要查詢101學年度以前的紙本論文公開資訊,請聯繫圖資處紙本論文服務櫃台。如有不便之處敬請見諒。
開放時間 available 已公開 available

QR Code