Universepg Journal Article Details

Review Article | Open Access | nt. J. Mat. Math. Sci., 2020; 2(1), 8-15 | doi: 10.34104/ijmms.020.08015

Fitting ARIMA Model and Forecasting for the Tea Production, and Internal Consumption of Tea (Per year) and Export of Tea

Md. Aliul Islam*

Mst. Sharmin Akter Sumy ,

Md. Alhaz Uddin ,

Md. Sazzad Hossain

Abstract

Every year, in Bangladesh the production has increased by 1.84 % and contributes 1.37 in export in the world tea trade and earnings nearing 1775 million Taka. The internal consumers of the country are presently consuming about 98% of its produce. Consumption is increasing day by day mainly due to the rapid increase in population. This study considered the published secondary data on yearly tea production in Bangladesh over the period 1990 to 2015 by the Bangladesh Tea Board. We are forecasting the production of tea and internal consumption of tea by using Auto-Regressive Integrated Moving Average (ARIMA) for the next 5 years. Run test, Jarque, and Bera test criteria are used for the adequacy of the fitted model which is followed by the residual analysis. The comparison between the original series and the forecasted series are showing the same trends which indicate the fitted model is statistically well and appropriate for the forecast of the production of tea in Bangladesh.

Keywords

INTRODUCTION

Tea is one of the best export items of Bangladesh. But exporting tea of the country has been declined day by day for the calm growth of production rising consumption and strong competition from other tea exporting courtiers in the world. Bangladesh attester substantial changes for exporting tea to several countries over time (Uddin et al., 2020). It is necessary to search new customers and explore an incursive policy for cope with similar changes in the world tea market.

This paper gives several suggestions and examines present situation for raising tea export. Domestic tea consumption has increased steadily. Likely to remain this trend and even may increase day by day. In the face of rising domestic consumption, a sustained increase in tea production can be stable -

level of tea export maintained (Islam et al., 2019). Various aspects of tea production and export have been dealt with, among others, by Khalid (1975), Majid (1991) and Sabur (2002).

Objectives of the Study:

• Investigate different properties of time series plot.

• To test the non-stationarity of the series by Graphical methods

• Examination of the present status of production, domestic consumption and export of tea of the country.

• Select an appropriate multivariate time series model for the data.

METHODOLOGY

Dickey-Fuller unit root test - Dickey and Fuller suggested that under the null hypothesis the estimated coefficient of in the model above follows the tau ( ) statistic is known as Dickey-Fuller test.

Allowing the various possibilities DF test is estimated in three different forms, under different null hypothesis

is a random walk:

is a random walk with drift: .

is a random walk with drift around a stochastic trend- .

Now we have to test the following hypothesis-

i.e., the time series data of production of rice of X is non-stationary,

i.e., the time series data of production of rice of X is stationary.

Where, X is a particular series concerned in this report.

The appropriate test statistic is tau statistic. This test is also known as Dickey-Fuller (DF) test and is defined by,

Decision: If the computed the DF critical value, then we do not accept the null hypothesis, otherwise accept.

Augmented Dickey-Fuller test - In case of the error term are correlated. Dickey and Fuller have developed a test named as the Augmented Dickey-Fuller (ADF) test. The following regression model is estimate by the ADF test:

Where is a pure white noise error and where

We estimate the model (3.3) and test the following hypothesis

i.e. the time series is non-stationary.

i.e. the time series is stationary.

The appropriate test statistic under,

Decision: If the computed the DF critical value, then we is not accept the null hypothesis, otherwise accept.

Box-Jenkins Methodology - The puissant work of Box-Jenkins expelled professional intentness away from the stationary serially correlated deviations from deterministic trend paradigm toward the paradigm. It can manage any series, stationary or not with or without seasonal elements thats why it is popular.

The following five basic steps consist in the Box-Jenkins methodology:

1. Identification of appropriate model: We test the Correlogram to conclude on the suitable orders of the and components, for this we used the different procedure to get a stationary time series. After a point, the Correlogram of a process is zero, that of a process declined geometrically. Different patterns are showed by the Correlogram of process (but all dampers after a while). On this basis, one arrives at a tentative model. This step includes more of a judgment procedure than the use of any clear-cut rules.

2. ¬Estimation of the model: The next step is the estimation of the tentative model identified in step-2. The estimation of model is straight forward. We can estimate OLS by minimizing the error sum of squares . In case of models, Box-Jenkins expressed a grid-search procedure. In that procedure we compute by successive substitution for each value of the parameters and choose the set of values of the parameters that minimizes the error sum of squares . For models, both the AR and MA parts are to be estimated with the procedure discussed above.

3. Diagnostic Checking: It is an advisable to check that the model does actually given an adequate statement on the date when a has been fitted to bestowed time series. There are two measures after used such as reflecting the nearness of fir and the number of parameters estimated. One is called the Akaike information criterion (AIC) and the other one is Schewatz Bayesian information criterion (BIC). If is the total numeral estimated parameters

Here is the sample size.

If is the residual aggregate squares, then

If we consider different models we select the one which has the smallest AIC of BIC. The two criteria can lead to several conclusions.

4. Forecasting: Imagine that we estimate the model which has observations, and then we want to forecast . That is told a k-periods ahead forecast. First we want to write out the evolution for and then replace all future values with their factors and by zero (since the expected value is zero). We also replace all by the predicted residuals.

Moving average process or MA (q) - A series is said to be a moving average method of order q if,

Where, and are constants. This series is also called as MA (q) process

Autoregressive process or AR (p) process - A series is said to be a autoregressive method of order p if,

Where, ~ WN (0, ) and are constants. The series is said as AR (p) process

ARMA (p,q) process - The time series is an ARMA (p,q) process if it stationary and satisfies for every t,

Where, and the polynomials have no common factors.

ARIMA (p,d,q) process - If the ARMA (p,q) process is also integrated process than it is known as ARMA process of order (p,d,q). Here d represents the number of difference the needed to be stationary. ARIMA process is used when data is non-stationary.

Analysis and Findings -A time series is told to be stationary if its mean, variance and covariance function does not depend on time, and otherwise it is non-stationary. The time series plot of the data is given below:

Fig 1: Time series Plot of the data.

Comments: From the above time plot we see that the mean and variance of the tea production, internal consumption, export and export price of Bangladesh from 1990-2014 is not same over time. That is the mean and variance has changed over time. So from the properties of non-stationary time series we can tell that the above time series is non-stationary process. We know that in the time series analysis it is possible to make the non-stationary time series into stationary time series. In order to make the above non stationary time series into stationary process we have to make some differences of the tea production.

Fig 2: Time series plot of first difference of the data.

Comments: From the above figure we see that variance are almost stable here and the production of tea changes equally over time i.e, mean and variance are about to same. So, we can ask that the series is Stationary.

The Auto-Correlation Function of ARMA - We get the following acf of ist difference of tea production,

Fig 3: ACF of Tea Production after taking one difference.

We get the following acf of ist difference of Export of tea,

Fig 4: ACF of export of Tea after taking one difference.

The Partial Auto-Correlation Function of an ARMA

Comments: From the above figure of ACF curve of first difference on tea production is statistically significant at the lag 1. So we can mention that the production of tea follows AR (1) Process.

We get the following pacf of ist difference of tea production,

Fig 5: PACF of tea production after taking one difference (thousand metric tons).

Comments: From the above figure of Partial auto-correlation function of tea production we can mention that the series is a Moving Average Process of order 0.

We get the following pacf of export of tea,

Fig 6: PACF of exported of tea (thousand metric tons).

Comments: From the above figures it is evident that of ACF and PACF curve of first difference on tea production is Statistically significant at the lag 1.Therefore the time series of production of tea follows ARIMA (1,1.0). Again, From the above figures it is evident that of ACF and PACF curve of first difference on export tea from Bangladesh is Statistically significant at the lag 1.Therefore the time series of production of tea follows ARIMA (1,1.0).

Unit root test - The stationarity of a time series can be tested directly with a unit root test. Two tests are available, such as DF test and ADF test. DF is useful when errors are uncorrelated but when errors are correlated then ADF test is useful.

Dickey Fuller Test – Data = Tea production

Consider the hypothesis

At 5% level of significance we will conduct the test.

P-Value of the Dickey-Fuller test.

P-Value: 0.01

This is less than level of significance (0.05)

So, we are not accepting the null hypothesis.

Comments: Since the null hypothesis is not accepted so we can say that the series of tea production after taking first differences is stationary.

Data = Export tea

Consider the hypothesis:

At 5% level of significance we will conduct the test.

We get the following P-Value of the Dickey-Fuller test.

P-Value: 0.0106

This is less than level of significance (0.05)

So, we are not accepting the null hypothesis.

Comments: Since the null hypothesis is not accepted so we can ask that the series of export tea after taking first differences is stationary.

Augmented Dickey- Fuller Test - Consider the hypothesis,

: The tea production is non-stationary.

: The tea production is stationary.

Data: Tea production

Dickey-Fuller = -0.9184, Lag order = 1, p-value = 0.9326

At 5% level of significance the p-value of Augmented Dickey-Fuller Test is 0.9326, which is greater than 0.05. So the null hypothesis is not rejected.

That means, the tea production is non-stationary.

Augmented Dickey- Fuller Test - Consider the hypothesis,

: The export of tea is non- stationary.

: The export of tea is stationary.

Data: Export of Tea

Dickey-Fuller = -3.529, Lag order = 2, p-value = 0.04172

At 5% level of significance the p-value of Augmented Dickey-Fuller Test is 0.04172, which is smaller than 0.05. So the null hypothesis is rejected.

That means, the export of tea is stationary.

Fitted ARIMA Model for tea data - The fitted ARIMA (1,1,0) model for the production of tea which we find from R-Console is given below:

X (t) = 46.9224+3.0592*X(t-1)

The fitted ARIMA (1,1,0) model for export of tea which we find from R-Console is given below:

X (t) = 14.486+1.975*X(t-1)

Diagnostic checking for tea production in Bangladesh - At this time we have to perform diagnostic checking for the fitted model. Now from R3.0.2 we have found the diagnostic checking plot, which is given below:

Fig 7: Diagnostic checking for the tea production in Bangladesh.

Comments: From this plot we can observe that the plot of the standardized residual and the ACF plots are correctly specified. From the plot of P-value we can view that all p-values are above the level of significance. So, we can interpret that the residual is white noise, hence the model is also correctly estimated.

Diagnostic checking for the internal consumption of tea - At this point we have to perform diagnostic checking for the fitted model. Now from R3.0.2 we have found the diagnostic checking plot, which is given below:

Fig 8: Diagnostic checking for the fitted model of internal consumption of tea.

Comments: From this plot we can observe that the plot of the standardized residual and the ACF plots are correctly specified. From the plot of P-value we can see that all p-values are above the level of significance. So we, can interpret that the residual is white noise, hence the model is also correctly estimated.

Diagnostic checking for the exported tea from Bangladesh - At this point we have to perform diagnostic checking for the fitted model. We have found the diagnostic checking plot, which is given below:

Fig 9: Diagnostic checking for the fitted model of exported tea.

Comments: From this plot we can notice that the plot of the standardized residual and the ACF plots are correctly specified. From the plot of P-value we can observe that all p-values are above the level of significance. So we, can interpret that the residual is white noise, hence the model is also correctly estimated. From the above analysis we may say that the production of tea and the amount of exported tea are non-stationary process.

After taking successive differences we have made all the non-stationary time series into stationary process. In these analysis, production of tea follows ARIMA (0,1,1) model, export of tea follows ARIMA (0,1,1) model. After diagnostic checking for all types of data of tea we have found that all the models are appropriate.

CONCLUSION

The main concept of this work is forecasting a group of associated variables thus making use of all possible information such as correlation. The correlations among the export variables may be high or comparatively low (Islam et al., 2019). In this project, we focus better forecast which can give strong correlation among these associated export variables. In multivariate time series models, the vector autoregressive model is used for forecasting. For this we find out the forecasted values. We measure the forecasting accuracy by using mean error (ME), mean square error (MSE), Root Mean Squared Error (RMSE), mean absolute error (MAE), mean percent error (MPE), mean absolute percent error (MAPE), mean absolute percent error (MSPE) and so on. We also select a proper model for forecasting on the basis of model selection criterions such as AIC and BIC, HQ and SC of the model. The others highly correlated variables forecast in multivariate forecast are good. The forecasting accuracy ME, MSE, RMSE, MAE, MPE, MAPE and so on all are not so very distance in multivariate forecast. Finally we conclude that before forecasting we should check correlations among the variables, if existing highly correlation then multivariate time series is better than univariate time series. So all over we conclude that if we want to forecast, at first checking correlations among the variables. If there existing highly correlated variables then we should use multivariate forecast for these variables. If there do not exist highly correlated variables then we should use univariate forecast for these variables. Since our case study export variable are highly correlated to production and internal consumption for the selected export variable than univariate time series analysis using ARIMA model.

ACKNOWLEDGEMENT

We are gratefully acknowledged to my Supervisor, Dept. of Statistics, Islamic University, Jhenidah-Kushtia, Bangladesh for logistically supported and Co-author who encouraged and supported to the successful research.

CONFLICTS OF INTEREST

The authors declared no conflicts of the interest with respect to the present research work.

Article References:

Anderson, T.W. (1984). “An introduction to Multivariate Statistical Analysis”, 2nd ed., New York: John Wiley and Sons Inc. https://www.wiley.com/en-us/An+Introduction+to+Multivariate+Statistical+Analysis%2C+3rd+Edition-p-9780471360919
Bangladesh bureau of statistics. Ministry of planning, government of Peoples Republic of Bangladesh, pp. 81 & 327. Dhaka, Bangladesh.
Bangladesh tea research institute, government of peoples republic of Bangladesh, pp. 94-98, Srimangal, Moulvibazar, Bangladesh.
Box, G.E.P., Jankins, G.W, and Reinsel G. (1994). “Time Series Analysis, Forecasting and Control”, edition, Englewood Cliffs, N.J.; Prentice-Hall.
Brokwell P. J. and R. A. Davis (1996) “Introduction to Time Series and Forecasting, Springer, New York.
Chatfield, C. (1993). Calculating Interval Forecasts, Journal of Business and Economic Statistics, 11(2), 121-35. https://econpapers.repec.org/article/besjnlbes/v_3a11_3ay_3a1993_3ai_3a2_3ap_3a121-35.htm
Cromwell J.B., Labys, W.C., & Terraza, M. (1994). “Univariate tests for time series models”, SAGE, London.
Diebold, F.X. (2007) “Elements of Forecasting, edition, Thompson South-Western.
Gebhard K. and J. Wolters (2007). “Introduction to Modern Time Series Analysis”, Springer, New York. https://www.springer.com/gp/book/9783642334351
Gebhard Kirchgässner, Jürgen Wolters (2008). “Introduction to Modern Time Series Analysis”. ISBN 3540687351, 9783540687351, Springer Science & Business Media.
Gujarati, D. (2002). “Basic Econometrics”, 4th edition, McGraw-Hill.
Hamilton, J. (1969). “A New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle”, Econometrica, p. 57, 357-384. https://doi.org/10.2307/1912559
Helmut Lütkepohl (2005). Department of Economics, European University Institute “New Introduction to Multiple Time Series Analysis”, Springer, New York.
Islam KA, Deeba F, and Hassan MKA. (2019). Dust Ion Acoustic Solitary Waves in Multi-Ion Dusty Plasma System with Adiabatic Thermal Change, Aust. J. Eng. Innov. Technol., 1(5), 1-5. https://doi.org/10.34104/ajeit.019.0105
Islam MT and Hossain MS. (2019). Hybridization of Vigenere Technique with the Collaboration of RSA for Secure Communi-cation, Aust. J. Eng. Innov. Technol., 1(6), 6-13. https://doi.org/10.34104/ajeit.019.06013
Lütkepohl, H. (2005). “New Introduction to Multiple Time Series Analysis”, Berlin: Springer.
Lütkepohl, H. and Poskitt, D.S. (1996). “Specification of Echelon-Form VARMA Models”, Biometrika.
Uddin MM, Karim R, Kaysar MA, Dayan MAR, and Islam KA. (2020). Low-Cost Jute-Cotton and Glass Fibre Reinforced Textile Composite Sheet, Int. J. Mat. Math. Sci., 2(1), 1-7. https://doi.org/10.34104/ijmms.020.01007
Wayne A. Fuller (1995). “Introduction to Statistical Time Series”, 2nd ed., first published: Print ISBN: 9780471552390.
Wilfredo Palma, (2007). “Long-Money Time Series Theory and Methods.” ISBN-13: 978-0470114025, ISBN-10: 0470114029, Wiley-Interscience, 1st edition.

Article Info:

Academic Editor

Dr. Toansakul Tony Santiboon, Professor, Curtin University of Technology, Bentley, Australia.

Received

January 14, 2020

Accepted

February 19, 2020

Published

February 27, 2020

Article DOI: 10.34104/ijmms.020.08015

Corresponding author

Md. Aliul Islam*

Dept. of Statistics, Islamic University, Jhenidah-Kushtia, Bangladesh

Cite this article

Islam MA, Sumy MSA, Uddin MA, and Hossain MS. (2020). Fitting ARIMA Model and Forecasting for the Tea Production, and Internal Consumption of Tea (Per year) and Export of Tea, Int. J. Mat. Math. Sci., 2(1), 8-15. https://doi.org/10.34104/ijmms.020.08015