Abstract
In this study, we demonstrate the usefulness of ARIMA-Intervention time series analysis as both an analytical and forecast tool. The data base for this study is from the PACAP-CCER China Database developed by the Pacific-Basin Capital Markets (PACAP) Research Center at the University of Rhode Island (USA) and the SINOFIN Information Service Inc, affiliated with the China Center for Economic Research (CCER) of Peking University (China). These data are recent and not fully explored in any published study. The forecasting analysis indicates the usefulness of the developed model in explaining the rapid decline in the values of the price index of Shanghai A shares during the world economic debacle stating in China in 2008. Explanation of the fit of the model is described using the latest development in statistical validation methods. We note that the use of a simpler technique although parsimonious will not explain the variation properly in predicting daily Chinese stock prices. Furthermore, we infer that the daily stock price index contains an autoregressive component; hence, one can predict stock returns.
1. Introduction
In this study, we report an analysis not heretofore reported in the literature and for the data base collected. Our purpose is twofold; (1) to analyze data by implementing
Following the study of Zhong, Gui and Lui (1999) for the Chinese Bourses (Shanghai and Shenzhen), we selected a newer and finer data base from the PACAP-CCER China Database developed by the Pacific-Basin Capital Markets (PACAP) Research Center at the University of Rhode Island (USA) and the SINOFIN Information Service Inc, affiliated with the China Center for Economic Research (CCER) of Peking University (China). The length of data was for approximately ten years resulting in a time period where analysis can lead to interpretable results. Smaller time periods such as two or three or even four years are usually too small to reduce the effects of disturbances in economic data and more specifically do not produce enough degrees of freedom such that one may identify significant events and factors. A small number of degrees of freedom in sample data may not lead to determining “statistically significant” events even when they exist. The theoretical minimum number of observations for an ARIMA model is p+q+d+1, with the caveat that observations below three result in infinite standard errors. We also require each parameter in a regression equation to have at least one observation (Hyndman and Kostenko, 2007). That would result in a minimum number of observations for our model of five (one for the AR term, one for the difference of price, one for the IV volume, and two for the dummy variable intervention). To further account for randomness in the data we increased the sample size knowing that margins of error decrease proportional to the square of the sample size. [See Hyndman and Kostenko (2007)].
The regression will include for the response variable is the daily price index for the entire data base on daily returns produce by the PACAP-CCER Greater China Database. These data are part of the website created for research on Pacific-Basin nations. [The website and its content is a copyright of PACAP-CCER. All rights reserved. If you have any question or comment, please e.mail to
2. ARIMA Modeling and Intervention Analysis
We follow the methods
As discussed before by Box and Tiao (1975) an intervention model is of the general form:
where
In the transfer function models, the input is a general exogenous times series that influences the output series. We consider a special situation in which the input is a series of indicator variables that rrepresent the occurrence of identifiable unique events that affect the output variable. The identifiable events are the
To identify the correct model for prediction and explanation, ARIMA modeling consists of three steps. First, we identify the order of the differencing, i.e.,
Alternatives to ARIMA do exist for forecasting stock prices. For example, neural networks popularized in recent years due to increased availability of computer software are comparable to ordinary least squares regression (OLS) regression. However, neural networks are a black box method for providing accurate predictions, but contain little or no explanatory power. Hence, they are inappropriate for this study (see Lievano and Kyper, 2006). To analyze interrupted (intervention) series, sophisticated techniques like Fourier analysis and cross-spectrum analysis are available (see Hill and Lewicki, 2007). However, these techniques are more appropriate for complex time series with cyclical components or the simultaneous analysis of two series respectively, and were not deemed appropriate to test here.
3. The Analysis of the ARIMA Intervention Model
We include the financial crisis as the intervention of the time series which results in an ARIMA-Intervention model. As the global financial crisis extended to China in September 2008 and beyond, the intervention was initiated at that time. In the Appendix Table 1, we have the results in tabular form for both the model estimates and the initial
Results of ARIMA-Intervention Analysis for model price = d (price) c ar (1) volume inter
When we evaluate the differenced term, (Δ Yt), the equation for the model becomes:
Last, we evaluate the AR (1) term 0.992153 Yt-1 to form the final estimation model:
What we learn from the estimated model equation is that the intervention is significant and that prices were inversely related to financial crisis. This is what we would expect the results to be and is likely to be corroborated by studies using other methods for analyzing time series data having been related to the economics of the environment. Furthermore, the volume effect, although statistically significant, leads us to conclude that there is only a tiny effect on the volume of transaction on prices. The differencing which was necessary to build a time series that was stationary for modeling purposes was also significant and positive. Last, the significant autoregressive coefficient of the first order, i.e., AR (1), led to the estimated coefficient of 0.492162 for lagged value of Yt, i.e., Yt-1.
4. Additional Analytical Results
In addition the analytical table in the Appendix further gives diagnostic information about the estimated model and its equation. The coefficient of determination and the adjusted coefficient of determination are both extremely large and are the same because of the large number of degrees of freedom, i.e., 0.987864. The standard error of regresion (3.065267) is relatively small. The log likelihood statistic is −5756615 and is the ingredient in likelihood ratio tests. A
The F-statistic of 46137147 with a probability of 0.0000 indicates that the regression coefficients (except for the constant or intercept term) are nonzero. The probability (p-value) indicates the marginal significance of the F-test.
The AIC (Akaike information criterion, 1974) indicates the usefulness of model selection for non-nested alternative. Smaller values for AIC is preferred. As in our illustration, one best chooses the length of the lag distribution by choosing the lowest value of the AIC, 5.078149 is for our estimated model. The Schwarz criterion (1978) is 5.078156. It is an alternative to the AIC and penalizes more greatly for additional coefficients. The two criterions are so similar that either will lead to the conclusion that this model is best. Last, the Hannan-Quinn Criterion (1979) yields an additional test for model selection similar to the other two noted before. The result is again very similar to the AIC and Schwarz criterions already considered.
The well-known Durbin-Watson Statistic is the most universally accepted testing criterion for measuring the magnitude of autocorrelation in the residuals from a model. A statistic of 2.245173 would lead one to conclude that one should accept the hypothesis of the autocorrelation coefficient equal zero. With the absence of autocorrelation in the residuals from forecast, one concludes the previous tests of significance are useful and the model is available for predictive purposes. The Portmanteau Q-statistic calculated along with the autocorrelation function of the residual resulted in the same conclusion. Hence, the estimated model was a good fit for the time series. If we were to fit the model without the intervention the AIC, Schwarz Criterion and Hanna-Quinn Criterion would all have larger values indicating also that the model with the intervention was a better fit to the data. The coefficient of determination adjusted for degrees of freedom was also small for the model without intervention. To further examine these results observe Appendix Table 2 which contains the ARIMA model without use of the intervention term. Note, that in Appendix Table 2 we have the results of the analysis when the model does not contain the intervention term. Observe the three criterions Akaike, Schwarz and Hannan-Quinn produce values larger than existing for these criterions for the ARIMA model with intervention. Though these values appear small, these values are not so relatively small because the left hand size variable is D (PRICE). This variable has an extremely small standard error. The interpretation is obvious at this point that including the intervention term has benefits.
Results of ARIMA-No Intervention Analysis for model price = d (price) c ar (1) volume
5. Summary and Conclusions
The stock market price index in China was modeled by the methods of ARIMA-Intervention analysis and produced a fit for one to analyze and draw conclusion concerning how the index behave over time. The results indicate that the World Financial Crisis included China and affected its stock as well as its manufacturing industry. The intervention effect as measured in this study was alarming and clearly felt by the financial industry in China as well as its other component. The negative effect of the stock market decline is likely to correlate with the decline in Chinese manufacturing during the period and to have negative impacts on the Chinese Macroeconomy. In addition, the results corroborate that daily prices of Chinese equity securities have an autoregressive component. Whether this in a permanent or temporary component of the time series requires a more exhaustive study involving long term modeling of financial time series as exemplified by Ray, Jarrett and Chen (1997) in a study of the Japanese stock market index. One very basic conclusion is that we indicated that the use of intervention analysis is very useful in explaining the dynamics of the impact of serious interruptions in an economy and the changes in the time series of a price index in a precise and detailed manner.
