An Analysis of Covid-19 Pandemic Outbreak on Economy using Neural Network and Random Forest
محورهای موضوعی : Machine learningMd. Nahid Hasan 1 , Tanvir Ahmed 2 , Md. Ashik 3 , Md. Jahid Hasan 4 , Tahaziba Azmin 5 , Jia Uddin 6
1 - Department of Computer Science and Engineering, Brac University, 66 Mohakhali, Dhaka-1212
2 - Department of Computer Science and Engineering, Brac University, 66 Mohakhali, Dhaka-1212
3 - Department of Computer Science and Engineering, Brac University, 66 Mohakhali, Dhaka-1212
4 - Department of Computer Science and Engineering, Brac University, 66 Mohakhali, Dhaka-1212
5 - Department of Computer Science and Engineering, Brac University, 66 Mohakhali, Dhaka-1212
6 - Artifical Intelligence and Big Data Department, Endicott College, Woosong University, Daejeon, South Korea
کلید واژه: Multi-layer perceptron (MLP), Long Short-Term Memory (LSTM), Random Forest, Economic Recession, Machine learning (ML), Covid-19,
چکیده مقاله :
The pandemic disease outbreaks are causing a significant financial crisis affecting the worldwide economy. Machine learning techniques are urgently required to detect, predict and analyze the economy for early economic planning and growth. Consequently, in this paper, we use machine learning classifiers and regressors to construct an early warning model to tackle economic recession due to the cause of covid-19 pandemic outbreak. A publicly available database created by the National Bureau of Economic Research (NBER) is used to validate the model, which contains information about national revenue, employment rate, and workers' earnings of the USA over 239 days (1 January 2020 to 12 May 2020). Different techniques such as missing value imputation, k-fold cross validation have been used to pre-process the dataset. Machine learning classifiers- Multi-layer Perceptron- Neural Network (MLP-NN) and Random Forest (RF) have been used to predict recession. Additionally, machine learning regressors-Long Short-Term Memory (LSTM) and Random Forest (RF) have been used to detect how much recession a country is facing as a result of positive test cases of covid-19 pandemic. Experimental results demonstrate that the MLP-NN and RF classifiers have exhibited average 88.33% and 85% of recession (where 95%, 81%, 89% and 85%, 81%, 89% for revenue, employment rate and workers earnings, respectively) and average 90.67% and 93.67% of prediction accuracy for LSTM and RF regressors (where 92%, 90%, 90%, and 95%, 93%, 93% respectively).
The pandemic disease outbreaks are causing a significant financial crisis affecting the worldwide economy. Machine learning techniques are urgently required to detect, predict and analyze the economy for early economic planning and growth. Consequently, in this paper, we use machine learning classifiers and regressors to construct an early warning model to tackle economic recession due to the cause of covid-19 pandemic outbreak. A publicly available database created by the National Bureau of Economic Research (NBER) is used to validate the model, which contains information about national revenue, employment rate, and workers' earnings of the USA over 239 days (1 January 2020 to 12 May 2020). Different techniques such as missing value imputation, k-fold cross validation have been used to pre-process the dataset. Machine learning classifiers- Multi-layer Perceptron- Neural Network (MLP-NN) and Random Forest (RF) have been used to predict recession. Additionally, machine learning regressors-Long Short-Term Memory (LSTM) and Random Forest (RF) have been used to detect how much recession a country is facing as a result of positive test cases of covid-19 pandemic. Experimental results demonstrate that the MLP-NN and RF classifiers have exhibited average 88.33% and 85% of recession (where 95%, 81%, 89% and 85%, 81%, 89% for revenue, employment rate and workers earnings, respectively) and average 90.67% and 93.67% of prediction accuracy for LSTM and RF regressors (where 92%, 90%, 90%, and 95%, 93%, 93% respectively).
[1] M. K. Goyal and A. K. Gupta, “Integrated risk of pandemic: Covid-19 impacts, resilience and recommendations,” Berlin: Springer, 2020.
[2] D. Hanna and Y. Huang, “The impact of sars on asian economies,” Asian Economic Papers, vol. 3, no. 1, 2004, pp. 102–112.
[3] W.Y. Lin, Y.H. Hu, and C.F. Tsai, “Machine learning in financial crisis prediction: a survey,” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 42, no. 4, 2011, pp. 421– 436.
[4] M. R. Keogh-Brown and R. D. Smith, “The economic impact of sars: how does the reality match the predictions?,” Health policy, vol. 88, no. 1, 2008, pp. 110–120.
[5] R. D. Smith, M. R. Keogh-Brown, and T. Barnett, “Estimating the economic impact of pandemic influenza: an application of the computable general equilibrium model to the uk,” Social science and medicine, vol. 73, no. 2, 2011, pp. 235–244.
[6] M. Fern´andez-Delgado, E. Cernadas, S. Barro, and D. Amorim, “Do we need hundreds of classifiers to solve real world classification problems?,” The journal of machine learning research, vol. 15, no. 1, 2014, pp. 3133–3181.
[7] S. Balcaen and H. Ooghe, “35 years of studies on business failure: an overview of the classic statistical methodologies and their related problems,” The British Accounting Review, vol. 38, no. 1, 2006, pp. 63–93.
[8] P. R. Kumar and V. Ravi, “Bankruptcy prediction in banks and firms via statistical and intelligent techniques–a review,” European journal of operational research, vol. 180, no. 1, 2007, pp. 1–28.
[9] S. J. Fong, G. Li, N. Dey, R. G. Crespo, and E. Herrera-Viedma, “Composite monte carlo decision making under high uncertainty of novel coronavirus epidemic using hybridized deep learning and fuzzy rule induction,” Applied Soft Computing, vol. 93, 2020, pp. 106282.
[10] K. Bluwstein, M. Buckmann, A. Joseph, M. Kang, S. Kapadia, and O. Simsek, “Credit growth, the yield curve and financial crisis prediction: evidence from a machine learning approach,” 2020.
[11] R. Nyman, and P. Ormerod, “Predicting economic recessions using machine learning algorithms,” arXiv preprint arXiv:1701.01428, 2017.
[12] L. Breiman, “Random forests,” Machine learning, vol. 45, no. 1, 2001, pp. 5–32.
[13] U. Gr¨omping, “Variable importance assessment in regression: linear regression versus random forest,” The American Statistician, vol. 63, no. 4, 2009, pp. 308–319.
[14] Z. Car, S. B. Šegota, N. Anđelić, I. Lorencin, and V.Mrzljak, "Modeling the spread of COVID-19 infection using a multilayer perceptron", Computational and mathematical methods in medicine, Hindawi, vol. 2020, 2020.
[15] J. A. Ohlson, “Financial ratios and the probabilistic prediction of bankruptcy”, Journal of accounting research, vol. 18, no. 1, 1980, pp. 109–131.
[16] R. C. West, “A factor-analytic approach to bank condition”, Journal of Banking and Finance, vol. 9, no. 2, 1985, pp. 253–266.
[17] A. F. Atiya, “Bankruptcy prediction for credit risk using neural networks: A survey and new results,” IEEE Transactions on neural networks, vol. 12, no. 4, 2001, pp. 929–935.
[18] J. H. Min and Y.-C. Lee, “Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters,” Expert systems with applications, vol. 28, no. 4, 2005, pp. 603–614.
[19] K.-S. Shin, T. S. Lee, and H.-j. Kim, “An application of support vector machines in bankruptcy prediction model,” Expert systems with applications, vol. 28, no. 1, 2005, pp. 127–135.
[20] S. Sarkar, and R. S. Sriram, “Bayesian models for early warning of bank failures”, Management Science, vol. 47, no. 11, 2001, pp. 1457–1475.
[21] E. Fedorova, E. Gilenko, and S. Dovzhenko, “Bankruptcy prediction for russian companies: Application of combined classifiers”, Expert Systems with Applications, vol. 40, no. 18, 2013, pp. 7285–7293. [22] J. Abell´an and C. J. Mantas, “Improving experimental studies about ensembles of classifiers for bankruptcy prediction and credit scoring,” Expert Systems with Applications, vol. 41, no. 8, 2014, pp. 3825–3830.
[23] R. Chetty, J. N. Friedman, N. Hendren, M. Stepner, and T. O. I. Team, “The economic impacts of covid-19: Evidence from a new public database built using private sector data,” National Bureau of Economic Research, Working Paper 27431, June 2020.
[24] R. M. Simon, J. Subramanian, M.-C. Li, and S. Menezes, “Using crossvalidation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data,” Briefings in bioinformatics, vol. 12, no. 3, 2011, pp. 203–214.
[25] Haykin S., “Neural networks and learning machines,” 3/E: Pearson Education India, 2010.
[26] I. Syarif, A. Prugel-Bennett, and G. Wills, “Svm parameter optimization using grid search and genetic algorithm to improve classification performance,” Telkomnika, vol. 14, no. 4, 2016, pp. 1502.
[27] L. Fraiwan, K. Lweesy, N. Khasawneh, H. Wenz, and H. Dickhaus, “Automated sleep stage identification system based on time–frequency analysis of a single EEG channel and random forest classifier,” Computer methods and programs in biomedicine, vol. 108, no. 1, 2012, pp. 10-19.
[28] V. Y. Kulkarni and P. K. Sinha, “Pruning of random forest classifiers: A survey and future directions,” in 2012 International Conference on Data Science and Engineering, IEEE, 2012, pp. 64–68. [29] Y. E. Cakra and B. D. Trisedya, “Stock price prediction using linear regression based on sentiment analysis,” in 2015 international conference on advanced computer science and information systems, IEEE, 2015, pp. 147–154.
[30] B. Wang, L. Gao, and Z. Juan, “Travel mode detection using gps data and socioeconomic attributes based on a random forest classifier,” IEEE Transactions on Intelligent Transportation Systems, vol. 19, no. 5, 2017, pp. 1547–1558.
[31] F. A. Gers, J. Schmidhuber, and F. Cummins, “Neural Nets WIRN Vietri-99,” Continual prediction using LSTM with forget gates: Springer, 1999.
[32] K. Suri and R. Gupta, “Transfer learning for semg-based hand gesture classification using deep learning in a master-slave architecture,” in 3rd International Conference on Contemporary Computing and Informatics (IC3I), IEEE, 2018, pp. 178–183.
[33] S. Polyzos, A. Samitas, and A. E. Spyridou, “Tourism demand and the covid-19 pandemic: An lstm approach,” Tourism Recreation Research, vol. 46, no. 2, 2021, pp. 175–187.
[34] L. Breiman, “Bagging predictors,” Machine learning, vol. 24, no. 2, 1996, pp. 123–140.
[35] J. Kevric and A. Subasi, “Comparison of signal decomposition methods in classification of EEG signals for motor-imagery BCI system,” Biomedical Signal Processing and Control, vol. 31, 2017, pp. 398–406.
[36] Md. Monirul Islam, Mohammod Abul Kashem, Jia Uddin, “Fish Survival Prediction in an Aquatic Environment Using Random Forest Model,” IAES International Journal of Artificial Intelligence (IJ-AI), Indonesia, 2021, vol. 10, no. 3, pp. 614-622.
[37] J. Ferdoush, B. N. Mahmud, A. Chakrabarty, J. Uddin, “A Short-Term Hybrid Forecasting Model for Time Series Electrical-Load Data using Random Forest and Bidirectional Long Short-Term Memory” International Journal of Electrical and Computer Engineering, Indonesia, 2020, vol. 11, no. 1, pp. 763-771.
[38] R. Islam, J. Uddin, J.M. Kim, “An Acoustic Emission Sensor based Fault Diagnosis of Induction Motors using Gabor filter and Multiclass SVM”, Journal of Ad-hoc and Sensors Wireless Networks, Old city publisher, 2016, vol. 34, no. 1, pp. 273-287.