Sage Journals: Discover world-class research

Abstract

ISTAT has recently released an updated version of short-term statistics on hours worked in Italy, which are used in labor input estimates by the Quarterly National Accounts (QNA). The coverage of these statistics has been expanded from larger-than-ten workers firms to include the entire universe of Italian private firms. To include the updated indicator within estimates by QNA, the series must be reconstructed back to 1995 first quarter (1995q1) due to methodological requirements of QNA. In this paper, we first reconstruct the updated indicator using the Kalman filter and smoother algorithms applied to a state-space representation of a multivariate structural model (SUTSE). Next, we comparatively assess the performance of the new indicator against the non-updated one. This assessment is based on estimates of quarterly per-employee hours worked using temporal disaggregation methods for seven economic sections spanning the non-agricultural private business economy over the period 1995q1 to 2020q4. Compared to the previous indicator, the reconstructed indicator (i) implies improvements in temporal disaggregation model fitting in the majority of economic sections considered; (ii) returns smaller forecast errors in the 64.3% of the estimations, based on MAE; (iii) ensures a higher correlation between the estimated quarterly series to the indicator in the 71.4% of the estimates.

Keywords

National Accounts structural models back-casting temporal disaggregation

1. Introduction

Since mid-eighties, the Italian National Institute of Statistics (ISTAT) has been systematically disseminating Quarterly National Accounts (QNA) data. ISTAT employs an indirect estimation approach for QNA, primarily based on temporal disaggregation (TD) techniques. These techniques employ high-frequency reference indicators and are applied to the corresponding Annual National Accounts (ANA) totals. Historically, estimation approaches have relied on static regression methods such as Chow and Lin (1971), Denton (1971), and Fernàndez (1981). Notably, in 2019, the set of models was expanded to include dynamic regression methods based on autoregressive distributed lag (ADL) models, in line with the approach suggested by Proietti (2006).

Short-term reference indicators used in the production of QNA mainly originate from monthly or quarterly statistics produced by ISTAT surveys (e.g., Industrial Production Index, Labour Force, Services Turnover, etc.). The present analysis focuses on the quarterly estimates of per-employee hours worked by QNA. These estimates are obtained through TD methods using “per-employee hours worked” statistics derived from the ISTAT survey on job vacancies and hours worked (VELA survey) as reference indicator. The estimates of per-employee hours worked are then multiplied by the estimates of employee jobs to obtain employees total hours worked by QNA. Finally, these figures are benchmarked with the corresponding ANA totals to ensure consistency between quarterly and annual data.

Until June 2019, the VELA survey exclusively provided quarterly index statistics for non-agricultural private firms with at least ten workers. These statistics, expressed in levels, serve as the current reference indicator for estimating hours worked within QNA. However, in compliance with regulatory requirements from the European Commission (EC Reg. N. 1165/98 and subsequent modifications), the VELA survey expanded its coverage as of June 2019 to include all private firms with employees. The updated index statistics, available from 2015q1 onwards, have been released in a seasonally non-adjusted form.

The exclusion of micro firms (those with fewer than ten workers) from the short-term indicator used for the estimation of hours worked by QNA can lead to three main inconsistencies. First, there may be inconsistency with respect to ANA totals of hours worked referring to the entire population of resident firms as established by the European system of national and regional accounts ESA 2010 (EU 2013). Second, there could be inconsistency with the estimates of other labor input aggregates (e.g., self-employed hours worked, employment, full-time equivalent) and labor cost by QNA, both relying on short-term indicators (e.g., Labour Force survey indicators or administrative data-based surveys) referring to the whole population of firms. Third, there might be inconsistency with estimates of value added by QNA, with a potential impact on the accuracy of labor productivity measure.

The use of the updated indicator from the VELA survey in principle has the potential to address the drawbacks associated with the use of the indicators excluding micro firms. Moreover, the improved alignment between the coverage of the updated indicator and the definition of National Accounts (NA) aggregates should enhance the performance of TD models. This could lead to a better fit between the quarterly estimates and the annual aggregates, reducing the amount of revisions and improving the quality of the extrapolated observations.

Quarterly series in ISTAT are compiled and estimated at each release over the whole period starting from 1995q1 (ISTAT 2023). Hence, the introduction of the updated indicator from the VELA survey, including data for all private firms with employees from 2015q1 onwards, calls for data reconstruction back to 1995q1 within the estimation of QNA by TD models.

The present contribution aims to accomplish two main objectives: first, reconstructing the updated indicator of per-employee hours worked (expressed in levels) from 2015q1 back to 1995q1; and second, assessing the performance of the reconstructed extended indicator in the estimates of per-employee hours worked by QNA, compared to the performance of the indicator currently in use.

Concerning the reconstruction, we employ a model-based approach, specifically opting for the class of structural time series models (Harvey 1990). In this case, we utilize a bivariate structural model in which both the updated and the non-updated indicators are endogenous. The non-updated indicator, available throughout the entire time span considered (i.e., from 1995q1 to 2021q4), serves as a proxy for the missing data in the extended indicator from 2014q4 back to 1995q1.

The advantage of this approach lies in the statistical treatment of the model that can be carried out by the Kalman filter and smoother algorithms. The latter allows for a diffuse representation of starting conditions within the state-space representation (Koopman 1997) and facilitates the reconstruction of missing observations of the shorter indicator as a by-product of the algorithms. Additionally, this approach enables the overcoming of certain restrictive hypotheses and limitations that are inherent in relatively more naïve back-calculation approaches, such as proportional methods or regression approaches (Caporin and Sartore 2006).

To evaluate the performance of the extended indicator, the empirical strategy involves the application of TD methods to the annual per-employee hours worked by ANA, alternatively using the non-updated and the updated reconstructed indicator as reference indicator. The estimations span is from 1995q1 to 2020q4, covering the non-agricultural business economy (i.e., economic sections from BCDE-to-MN according to NACE rev.2). Indeed, although hours worked estimates within QNA are conducted at the NACE rev. 2 section detail (from section A to T), the use of indicators from the VELA survey is limited to the estimates of sections from B to N. This limitation is attributed to both the coverage of the VELA survey, which is confined to private firms (sections from B to S), and the fact that non-business services (sections from O to S) mainly consist of non-market economy activities.

Multiple TD model specifications, such as the Chow-Lin (1971) and Fernàndez (1981) models, have been employed, due to their frequent application in QNA estimates. The chosen time interval aligns with the current availability of ANA final totals derived from structural statistics, with only provisional ANA figures accessible for the subsequent period (i.e., 2021 onwards) at the time of writing. Performance evaluation of the two indicators relies on both diagnostic statistics of the estimated models and computed statistics measuring the forecasting errors of the fitted model, along with the quality of disaggregated observations. Diagnostic statistics encompass goodness-of-fit measures, such as computed information criteria and coefficients of determination. The forecasting performance of the models is assessed using standard statistics gauging the size of out-of-sample forecast errors. Finally, the quality of temporal disaggregation models is evaluated by examining both the correlations and coherence of signs between the quarterly and annual dynamics of the indicator series and the estimated output series by TD.

The results indicate that the reconstructed indicator enhances the fitting of the temporal disaggregation model in the majority of the examined economic sections when compared to the non-updated one. Additionally, the use of the reconstructed indicator yields smaller forecast errors than those obtained using the non-updated indicator in most estimations, according to Mean Absolute Error (MAE). Moreover, it ensures a higher correlation between the estimated quarterly series and the indicator in 71.4% of the estimates. Both indicators are equally satisfactory in terms of coherence with the dynamics of the disaggregated series, irrespective of the model used.

The remainder of the paper is structured as follows: Section 2 delves into more details on the method employed for data backwards reconstruction and presents the results of the structural model estimations. Section 3 provides descriptive evidence on the indicators and ANA data, along with a detailed comparison of the performance of the restricted and extended indicators in the temporal disaggregation exercise. Finally, Section 4 offers conclusive remarks.

2. Model-Based Backwards Reconstruction of the Indicator

Backwards reconstruction, also known as back-calculation, involves estimating unavailable past values of economic variables at the desired frequency using all available relevant information. However, several factors should be considered when choosing a methodological strategy for back-calculating a time series from existing approaches (e.g., retropolation, interpolation, regression methods). These factors include the availability of a related series, its availability at the desired frequency, the microeconomic versus macroeconomic dimension of the data, and the presence of cross-sectional or time constraints to be respected (United Nations 2018).

The present analysis employs a model-based approach to backwards reconstruction, specifically relying on the class of structural models (Harvey 1990). In particular, we consider a bivariate version of a basic structural model. In this model, both the non-updated indicator (referred to as “10+” hereafter, covering firms larger than ten workers) and the updated indicator (referred to as “TOT” hereafter, covering the totality of private companies) serve as dependent variables. This strategy offers several advantages compared to alternative approaches that utilize related information (i.e., time series) for the series to be reconstructed. First, unlike the Ordinary Least Squares (OLS) framework, which involves regressing the “short” series against the “long” series to estimate a proportional beta coefficient, a structural model-based approach is not constrained by the exogeneity hypothesis of the “x” terms, which can be unrealistic in this context. Second, in contrast to plain retropolation methods, this approach does not assume that the reconstructed and the reference series are linked by a constant proportional relationship over the entire period.

Third, among methods not involving the use of related information, a structural model-based approach outperforms univariate Auto Regressive Integrated Moving Average (ARIMA) models. Indeed, univariate ARIMA models, involving reversing the series, fitting the model, and producing forecasts to predict the past, have a limited backwards forecast capacity (Caporin and Sartore 2006).

The statistical treatment, can be effectively managed by the Kalman filter (KF hereinafter) (Kalman 1960) applied to the model in the state-space form (SSF) (Durbin and Koopman 2012; Koopman et al. 1999) where the reconstructed observations result as a by-product of the smoothing algorithm.

The effectiveness of the KF in the context of missing observations and data reconstruction is well-known (Gómez and Maravall 1994; Harvey 1990; Harvey et al. 1998), finding application in various model classes such as dynamic factor models (Poncela et al. 2021) and structural models (Koopman et al. 2007; Moauro 2001; Moauro and Savio 2005).

Among the advantages of a SSF and the use of KF it is worth mentioning: (i) the availability of a rich set of diagnostic statistics on the quality of the estimated model; (ii) the backwards reconstruction of the time-series as a by-product of the KF smoothing algorithm where the whole available information set over the time interval is efficiently used; (iii) the opportunity for a formal treatment of the initial conditions (initialization of the KF algorithm) that can be optimally estimated (Koopman 1997).

2.1. Structural Models in This Application

According to the structural time series models framework (Harvey 1990), a time series $y_{t}$ is represented in terms of a long-run or trend (mt) plus a cycle ( $ψ_{t}$ ), seasonal ( $γ_{t}$ ), and an irregular ( $ε_{t}$ ) unobserved component, the latter generally represented by a white-noise disturbance. The trend can assume plenty of formulations depending on both its deterministic/stochastic nature and the presence/absence of a slope component ( $β_{t}$ ) which, in turn, can be either deterministic or stochastic. In formulas, the univariate basic structural model (BSM) with local linear trend (LLT) specification includes both a stochastic level and slope components plus a white noise as shown by Equation (1.1) to (1.3):

y_{t} = μ_{t} + ε_{t}, ε_{t} ~ N I D (0, σ_{ε}^{2})

(1.1)

μ_{t} = μ_{t - 1} + β_{t} + η_{t}, η_{t} ~ N I D (0, σ_{η}^{2})

(1.2)

β_{t} = β_{t - 1} + ξ_{t}, ξ_{t} ~ N I D (0, σ_{ξ}^{2})

(1.3)

with $ε_{t}, η_{t}$ and $ξ_{t}$ normally distributed, serially independent and mutually uncorrelated white-noise disturbances, with zero-mean and positive variance $σ^{2}$ . This specification yields an “integrated of order 2” or I(2) model. However, LLT model can assume various forms. When both the level and the slope are deterministic (i.e., $σ_{ξ}^{2}$ = $σ_{η}^{2}$ = 0) LLT collapses into a deterministic linear trend case defined by Equation (1.1) and (1.4):

μ_{t} = μ_{0} + β t

(1.4)

where mt has to be initialized to a starting value m0.

In the absence of the slope component ( $β_{t}$ = 0), the stochastic trend follows an I(1) random walk (RW) process making the LLT turning into a Local level model (LL) defined by Equation (1.1) and (1.5):

μ_{t} = μ_{t - 1} + η_{t}, η_{t} ~ N I D (0, σ_{η}^{2}) .

(1.5)

Alternatively, if the slope is present but has a deterministic form ( $σ_{ξ}^{2}$ = 0), the trend reduces to an I(1) random walk with drift (RWD) specification defined by Equation (1.1) and (1.6):

μ_{t} = μ_{t - 1} + β + η_{t}, η_{t} ~ N I D (0, σ_{η}^{2})

(1.6)

Finally, if a deterministic level ( $σ_{η}^{2}$ = 0) is combined with a stochastic slope ( $σ_{ξ}^{2}$ is positive) following a random walk form (1.3), an I(2) integrated random walk (IRW) model is obtained as defined by Equation (1.1), (1.7) and (1.3):

μ_{t} = μ_{t - 1} + β_{t}

(1.7)

The multivariate generalization of the univariate structural model is known as Seemingly Unrelated Time Series (SUTSE) models (Harvey 1990). In our application, we consider a bivariate SUTSE model with LLT specification plus a seasonal component ( $γ_{t})$ . In formulas:

y_{t} = μ_{t} + γ_{t} + ε_{t}, ε_{t} ~ N I D (0, Σ_{ε})

(2.1)

μ_{t} = μ_{t - 1} + β_{t} + η_{t}, η_{t} ~ N I D (0, Σ_{η})

(2.2)

β_{t} = β_{t - 1} + ξ_{t}, ξ_{t} ~ N I D (0, Σ_{ξ})

(2.3)

where $y_{t} = [y_{1 t,} y_{2 t}]$ with $y_{1 t}$ is the restricted indicator (10+) available over the whole period considered $(t = 1, \dots, T)$ and $y_{2 t}$ is the extended indicator (TOT) to be reconstructed, whose observations are available only for the l periods, whereas the first $t = i, \dots, l$ observations must be estimated. Accordingly, each model component is bivariate that is, $μ_{t} = [μ_{1 t,} μ_{2 t}]$ , $β_{t} = [β_{1 t,} β_{2 t}]$ , $γ_{t} = [γ_{1 t,} γ_{2 t}], ε_{t} = [ε_{1 t,} ε_{2 t}]$ .

The seasonal component ( $γ_{t}$ ) is defined by

γ_{t} = \sum_{j = 1}^{[\frac{S}{2}]} γ_{j, t}

(2.4)

where each $γ_{j, t}$ is generated by

[\begin{matrix} γ_{j, t} \\ γ_{j, t}^{*} \end{matrix}] = [\begin{matrix} c o s λ_{j} & s i n λ_{j} \\ - s i n λ_{j} & c o s λ_{j} \end{matrix}] [\begin{matrix} γ_{j, t - 1} \\ γ_{j, t - 1}^{*} \end{matrix}] + [\begin{matrix} ω_{j, t} \\ ω_{j, t}^{*} \end{matrix}], j = 1, \dots, [\frac{S}{2}];, ω_{t} ~ N I D (0, Σ_{ω})

where: $λ_{j} = 2 π j / s$ denotes the frequency in radians; $γ_{j, t}$ and $γ_{j, t}^{*}$ are bivariate vectors and $ω_{j, t}$ and $ω_{j, t}^{*}$ are mutually uncorrelated disturbances sharing the same variance: Var $[\begin{matrix} ω_{j, t} \\ ω_{j, t}^{*} \end{matrix}] = I_{2} \otimes Σ_{ω}$ . The trigonometric formulation of the stochastic seasonal component Equation (2.4) overcomes some restrictive hypothesis of more naïve formulations like for example, the dummy variable form (Harvey 1990; Koopman et al. 2007).

For each component i, the bivariate (2 × 1) vector of white-noise errors are characterized by zero mean and (2 × 2) unknown variance-covariance matrices ( $Σ_{ε}, Σ_{η}, Σ_{ξ}, Σ_{ω}) .$ The structure of each variance-covariance matrix is such that:

Σ_{i} = [\begin{matrix} σ_{y 1}^{2} & ρ_{y 1, y 2}^{i} σ_{y 1} σ_{y 2} \\ ρ_{y 2, y 1}^{i} σ_{y 2} σ_{y 1} & σ_{y 2}^{2} \end{matrix}]

(3)

For $i = ε, ξ$ , $η, ω$ , where the terms on the main diagonal are the unknown variances of the disturbances and the elements on the secondary diagonal denote the covariance terms.

SUTSE models allow for common factor restrictions (Koopman et al. 2007) which imply the covariance matrices of the relevant disturbances of one (or multiple) component(s) are less than full rank. In the case of a general LLT model, the series may share a common pattern over either the level, slope, seasonal, and/or the irregular component. Common trends arise in case of either common levels and/or common slopes. When restricted version of the LLT model are considered, common trend may arise in turn on levels (LL, RWD) or the slope (IRW) stochastic component. In general, the presence of a common factor in the multivariate LLT model with N series is that the rank of matrices $Σ_{i}$ ( $i = ε, η, ξ$ , $ω$ ) is K < N, with either $ρ_{y 1, y 2}^{i}$ or $ρ_{y 2, y 1}^{i}$ equal to one. However, it must be noted that the existence of a less-than-full rank disturbance covariance matrices [ $Σ_{i}$ ] does not necessarily imply the case of common factor restrictions. Indeed, if a component i turns out to be deterministic for one series ( $σ_{y 1}$ = 0) and stochastic in the other ( $σ_{y 2}$ is positive semi-definite), the rank of (3) anyway decreases from K = 2 to K = 1 in absence of common factors.

In general, the cointegration of two series ensures that a common stochastic trend drives their long-run behavior. This proves favorable within the reconstruction framework, when a “long” series is available as a reference indicator to infer the “short” one. Indeed, cointegration implies that the reconstructed series would, at least, not diverge from the observed series—a convenient aspect in official statistics where transparency is appreciated by the users. In the case of common trends, the observed and reconstructed series would span parallel paths, while cointegrated seasonality would induce aligned seasonal patterns between the two series.

The statistical treatment of structural models in this application is based on the state-space representation (SSF) of the model and the use of the KF (Harvey et al. 1998; Koopman et al. 2007). The SSF is characterized by two equations: the first describes the time series structure (measure equation), the second equation describes the transition of the structural latent components from one state to the following one (transition or state equation). The application of the KF to the SSF allows to compute an optimal estimator of the state variables vector at time t for t = 1,. . .,T, given the information available in t. By contrast, the associated smoothing algorithm allows an optimal estimation of the state vector, given the whole information set, that is, until time T. In the present context, given $y_{1 t,}$ observed at quarterly frequency for T periods ( $t = 1, \dots T$ ) and $y_{2 t}$ observed at the same frequency but only in the last l periods, the first $i = T - l$ observations of $y_{2 t}$ are retrieved by applying the KF and smoother to the SSF-representation of model in Equation (2.1) to (2.4) where both $y_{1 t}$ and $y_{2 t}$ are included as state variables.

2.2. Reconstructing Per-Employee Hours Worked Series

The object of the reconstruction is the series on per-employee hours worked in levels, which is disseminated on a quarterly basis by the VELA survey in index form. The non-updated indicator (10+) refers to companies with at least ten workers and is available from 1995q1 onwards, whereas the updated version (TOT) covers all firms with employees and is available from 2015q1 onwards. The scope of the reconstruction exercise is limited to the non-agricultural business economy (from B to N according to NACE rev.2) aligning with the breakdown of economic activities adopted for the estimates of labor input by QNA (see Appendix, Table A2). In Figure 1, both indicators, in index form, are plotted for the selected sections over their respective available span. The dynamics of the two series appear highly correlated in the overlapping period (2015q1–2022q4) for the majority of sections.

Figure 1.

Per-employee hours worked index according to TOT and 10+ indicator, respectively. 1995q1 to 2022q4 (2015q1–q4 = 100). Non-seasonally adjusted data.

The structural model described by Equation (2.1) to (2.4) is considered, where the bivariate dependent $Y_{t}$ vector comprises both the TOT indicator available from 2015q1 up to 2022q4 and the 10+ indicator, available from 1995q1 up to 2022q4. According to the breakdown of economic activities shown in Table A2, seven separate models are estimated (one for each different section/group of sections considered) where the series enter in levels.

The KF is applied to the SSF representation of Equation (2.1) to (2.4), providing the estimates of the related log-likelihood and the unknown parameters. The smoothing algorithm efficiently estimates the state vector and the error vector using both past and future observations. As a by-product of the latter, the estimation of the observations of the TOT indicator for the 1995q1 to 2014q4 time span (eighty quarterly observations) is obtained.

Using the programming language Ox, model estimation is performed by the STAMP package (Koopman et al. 2007) that uses the Broyden-Fletcher-Goldfarb-Shanno (BFGS) approximation method for the maximization process. Table 1 presents selected estimated parameters for the seven estimated models, namely: the estimated maximum of the sample log-likelihood ( $L$ ), the estimated variances of the disturbances ( ${\hat{σ}}_{η}^{2}, {\hat{σ}}_{ξ}^{2}, {\hat{σ}}_{ω}^{2}$ , ${\hat{σ}}_{ε}^{2}$ ) in Equation (2.1) to (2.4) for both the TOT and the 10+ indicators, respectively, and the estimated correlation between the disturbances of the two series ( $ρ_{T O T, 10 +}^{η}$ , $ρ_{T O T, 10 +}^{ξ}$ , $ρ_{T O T, 10 +}^{ω}$ , $ρ_{T O T, 10 +}^{ε}$ ) for each structural component, that is, level, slope, seasonal, and irregular.

Table 1.

Estimated Parameters From the BiVariate SUTSE Model Fitted to Data From 2015q1 to 2021q4.

		BCDE	F	G-to-I	J	K	L	MN
	L	−230.97	−278.57	−232.39	−258.32	−208.77	−362.73	−237.60
Level	$σ_{η, 10 +}^{2}$	18.66	2.11	8.18	4.38	1.62	47.06	4.81
	$σ_{η, T O T}^{2}$	23.32	18.12	25.01	1.60	4.66	45.33	16.23
	$ρ_{T O T, 10 +}^{η}$	0.99	1.00	1.00	0.69	1.00	1.00	1.00
Slope	$σ_{ξ, 10 +}^{2}$	0.00	0.23	0.00	0.00	0.00	0.00	0.00
	$σ_{ξ, T O T}^{2}$	0.00	0.08	0.00	0.00	0.00	0.00	0.00
	$ρ_{T O T, 10 +}^{ξ}$	0.00	1.00	0.00	0.00	0.00	0.00	0.00
Seasonal	$σ_{ω, 10 +}^{2}$	0.09	0.19	1.27	0.22	0.56	2.27	0.38
	$σ_{ω, T O T}^{2}$	0.08	0.05	0.14	0.29	0.33	2.52	0.61
	$ρ_{T O T, 10 +}^{ω}$	1.00	1.00	1.00	1.00	1.00	1.00	1.00
Irregular	$σ_{ε, 10 +}^{2}$	24.00	43.63	8.93	34.62	22.18	108.55	18.94
	$σ_{ε, T O T}^{2}$	23.14	14.96	11.98	31.15	22.59	34.99	4.94
	$ρ_{T O T, 10 +}^{ε}$	1.00	0.90	1.00	0.93	1.00	0.65	1.00

Note. Estimated Log-likelihood ( $L$ ), estimated correlations ( $ρ_{T O T, 10 +}^{i}$ ), and variances $({\hat{σ}}_{i}^{2}$ ) from the covariance matrices for disturbance i ( $i = ε, ξ$ , $η, ω$ ) from the bivariate SUTSE estimated model.

A brief inspection of Table 1 reveals that overall the level component disturbances are highly correlated—as revealed by $ρ_{T O T, 10 +}^{η}$ which equals unity in all the estimated models, except section J. This implies that the two series share a common trend, at least for the time span over which both the indicators are available, as discussed in Subsection 2.1. Similarly, seasonal patterns share a common trend in the totality of the estimated models, being also the correlation term of the seasonal component ( $ρ_{T O T, 10 +}^{ω}$ ) equal to unity. Moreover, the estimated variance of the disturbances of the slope component ( ${\hat{σ}}_{ξ}^{2}$ ) turns out to be zero for both the series ( ${\hat{σ}}_{ξ, T O T}^{2}, {\hat{σ}}_{ξ, 10 +}^{2}$ ), except for section F, which indicates the presence of a deterministic slope consistent with an random walk with drift specification as a trend. Accordingly, I(1) series are identified, consistent with previous evidence on aggregate employment-related series in the Italian economy (Moauro 2014). Consequently, the long-run trend and the seasonal pattern of the reconstructed series are expected not to diverge from those of the reference indicator, as shown by Figure A2 in Appendix where both are represented in index form.

Table 2 presents diagnostics derived from the expected residuals of the estimated models, along with the coefficient of determination. The diagnostics indicate that the null hypotheses for both normality and homoscedasticity of the residuals cannot be rejected for the entire set the estimated models. Additionally, there is no evidence of first-order autocorrelation in the residuals across all estimates as indicated by both the N-statistics and Durbin-Watson (DW) statistics (Durbin and Watson 1950, 1951) for both the 10+ and TOT series. According to the Ljung-Box Q-statistics (Ljung and Box 1978), the null hypothesis of the absence of autocorrelations in the first eight lags is never significant for the TOT series. However, in the case of the 10+ series, the null hypothesis cannot be rejected in more than one instance (F, GHI, K, L, MN). Overall, the model fitting is deemed satisfactory, as indicated by R² values.

Table 2.

Summary Statistics From the BiVariate SUTSE Estimated Model.^a

	BCDE		F		GHI		J		K		L		MN
	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT
Normality- $χ_{2}^{2}$ (Doornik-Hansen)	2.84	2.65	3.03	2.14	5.10	4.00	1.06	0.03	3.28	5.25	3.90	4.19	0.13	0.52
Heteroskedasticity-F(7,7)	1.56	1.81	0.61	0.94	2.35	2.72	0.88	0.60	0.27	0.25	1.43	2.41	0.72	1.48
Durbin-Watson	1.81	1.56	2.08	1.84	1.77	1.99	2.20	2.35	2.45	2.21	1.48	1.35	1.49	1.56
Auto-correlation(1)-N(0,1/28)	0.02	0.13	−0.10	0.01	0.09	−0.05	−0.20	−0.22	−0.24	−0.13	0.26	0.32	0.25	0.21
Q(8)-Ljung-Box	13.40	4.14	65.73***	6.12	19.94*	4.17	11.16	3.03	74.18***	6.52	33.45***	11.77	20.12**	7.31
R-sq	0.93	0.89	0.94	0.94	0.98	0.95	0.91	0.79	0.95	0.86	0.90	0.72	0.95	0.86

Note. ***, **, and * indicate p-value ⩽.001, ⩽.01, and ⩽.5, respectively. Doornik-Hansen = chi-square test statistics for errors normality; F = F-test statistics for errors homoscedasticity; Durbin-Watson = N(2,4/T)-test statistics for errors autocorrelation; N = N(0,1/T)-test statistics on first-order autocorrelation; Q(p)-Ljung-Box = chi-squared test-statistics on errors autocorrelation (at the first p-lags); R-sq = coefficient of determination.

The STAMP package neglects the option to control for outlier observations—by including intervention variables—when they occur within a time span that is not common to both series. This limitation affects the outcome of the 10+ series, primarily due to disruption caused by the 2008 to 2009 crisis that was not possible to account for.

The reconstruction of the time series based on KF algorithms allows for the assessment of the precision of the estimates based on the estimated standard errors. This feature stands as an advantage over alternative reconstruction methods—such as retropolation and proportional methods. In Figure A1 in the Appendix, the reconstructed series derived from the structural models is depicted, along with 95% confidence interval bands. As anticipated, the precision of the estimates noticeably diminishes when moving from 2014q4 to the beginning of the time span, as indicated by wider confidence bands.

As a further assessment of the quality of the reconstructed series by the structural model, we calculated the correlation between the annual totals by ANA and the annualized indicators reconstructed using two alternative approaches. Namely, (i) the proportional method, which entails retrieving the beta estimated coefficient from the regression of the TOT indicator over 10+ indicator over the common interval and, as a second step, multiplying the beta coefficient by the observations of the 10+ series over the span of 1995q1 to 2014q4 to obtain the missing observations; (ii) a retropolation approach, which applies year-on-year back-growth rates from the 10+ quarterly series to the TOT quarterly series over the blank span from 1995q1 to 2014q4. Table A2 shows that the quarterly series obtained by structural models ensures a higher, or at least comparable, correlation at the annual frequency with ANA totals compared to the alternative methods considered. It is noteworthy that the model-based approach proves preferable in this context, despite the relatively greater data demands of the Kalman Filter (KF) estimation in structural models compared to, for example, linear models (proportional methods), where a smaller amount of parameters has to be estimated. Importantly, the higher the correlation between the high-frequency indicator and the low-frequency series, which serves as the longitudinal benchmark for the high-frequency estimates, the better the TD model should fit and the higher its out-of-sample forecast capacity is expected.

3. Temporal Disaggregation of Per-Employee Hours Worked

After obtaining the reconstruction of the TOT indicator (see Appendix, Figure A2), we assess its performance in a temporal disaggregation exercise. This exercise allows us to determine if and to what extent the transition to the new indicator would enhance the quality of quarterly estimates of per-employee hours worked by QNA, as compared to the current estimates based on the 10+ indicator. Given its inherent higher correlation with ANA totals, the primary assumption of this application is that the quarterly TOT indicator should improve the temporal disaggregation estimates compared to the 10+ series, in terms of both goodness of fit and out-of-sample forecast performance. The latter is a by-product of the statistical approach adopted to derive quarterly estimates of NA aggregates at ISTAT, which relies on the SSF representation of reference models (e.g., Chow-Lin, Fernández, ADL), and the associated use of the Kalman filter (Bisio and Moauro 2018). A preliminary examination of the relationship between ANA per-capita hours worked and the annual averages of TOT and 10+ indicators, spanning from 1995 to 2020, reveals a consistently high correlation for both indicators with the annual totals. Notably, the TOT indicator exhibits higher correlation values across the majority of the sections of interest, as presented in Table 3. Indeed, a visual inspection confirms very similar paths in the annual growth rates between the two indicators throughout the entire period (Figure 2), with the dynamics of the 10+ series during the 2020 crises—particularly in F, GHI, and K sections—closely mirroring that of ANA totals more closely than the TOT indicator.

Table 3.

Correlations Between ANA Per-Employee Hours Worked and Annualized Indicators by Annual Levels (1995–2020).

	BCDE	F	GHI	J	K	L	MN
TOT	0.92	0.87	0.91	0.82	0.75	0.94	0.61
10+	0.79	0.82	0.93	0.74	0.68	0.75	0.65

Figure 2.

Per-employee hours worked by annualized indicators (10+ and TOT) and Annual National Accounts, 1995 to 2020. Annual growth rates.

Temporal disaggregations are carried out over the 1995q1 to 2020q4 interval, using final estimates of ANA that are not subject to structural revisions. Indeed, the restriction of the time span of the application up to 2020 is motivated by the fact that it is the last year for which ANA figures will not undergo future revisions, at the time of writing, consistently to the revisions policy of ANA in ISTAT. The analysis covers the non-agricultural business economy (from BCDE to MN section, according to NACE rev.2), consistent with the breakdown of per-employee hours worked estimates by QNA (Table A2). ANA totals of per-capita hours worked are regressed over, alternatively, the TOT indicator and the 10+, in a non-seasonally adjusted form. The temporal disaggregation models considered are, in turn, Chow and Lin (1971) and Fernàndez (1981) with and without the intercept term, respectively. In other words, four temporal disaggregation model specifications are run for each of the seven economic sections considered, resulting in a set of twenty-eight outcomes by distinct estimated models.

Table 4 displays the main outputs and diagnostic statistics derived by the “best” model specification obtained among the set of four TD model specifications, for each section where the two indicators are used alternatively. For each section, we select the specification with the best performance in terms of information criteria (AIC/BIC) and Log-likelihood out of the four estimated specifications. In this respect, it is worth noting that the same model is selected for each given sector, regardless of the reference indicator used.

Table 4.

Regression Diagnostic Statistics on Quarterly Disaggregation of Per-Employee Hours Worked, Non-Seasonally Adjusted Data, 1995q1 to 2020q4.

Reference indicator	BCDE		F		GHI		J		K		L		MN
Reference indicator	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT
Model	Férnandez with constant	Férnandez with constant	Chow-Lin with constant	Chow-Lin with constant	Férnandez with constant	Férnandez with constant	Chow-Lin with constant	Chow-Lin with constant	Férnandez with constant	Férnandez with constant	Chow-Lin with constant	Chow-Lin with constant	Férnandez	Férnandez
Rho	0.00	0.00	0.90	0.84	0.00	0.00	0.95	0.96	0.00	0.00	0.97	0.84	0.00	0.00
Beta	93.89**	112.34**	4.66**	14.09**	1.09**	94.00**	9.31**	4.44**	214.02**	241.06**	5.04**	10.27**	1.02**	0.95**
Constant	0.88**	0.82**	0.94	0.87**	0.99	0.76**	0.57**	0.84	0.57	253**	0.53**	0.90**
Log-likelihood	−104.75	−103.06	−113.62	−111.64	−107.83	−110.76	−116.13	−111.64	−112.95	−112.73	−133.68	−127.20	−123.13	−121.77
Adj-R squared	1.00	1.00	0.99	0.99	1.00	1.00	0.97	0.98	0.98	0.98	0.98	0.98	0.98	0.98
AIC	1.66	1.53	2.61	2.62	1.90	2.12	2.60	2.24	2.29	2.27	3.87	3.82	3.00	2.89
BIC	1.76	1.63	2.70	2.71	1.99	2.22	2.69	2.34	2.39	2.37	3.97	3.92	3.04	2.94
Durbin-Watson	1.85	1.92	1.92	1.76	2.04	1.70	1.76	2.08	2.83	2.78	1.41	1.27	2.49	1.97
Jarque-Bera	2.15	2.22	0.32	0.11	0.44	8.73*	5.69	0.33	1.16	1.32	41.87**	2.47	0.30	0.94
Q-Ljung Box	3.07	2.88	2.36	2.49	4.44	2.02	2.28	4.20	8.77	9.30	2.79	7.27	13.96**	7.03
H-test	0.86	0.60	1.44	0.04	1.52	2.97**	1.71	1.07	29.28**	1.94*	8.22**	4.10*	0.53	0.05

Note. **, and * indicate p-value ⩽.01, ⩽.01, and ⩽ 0.5, respectively. Rho, Beta, Constant = estimated parameters from the temporal disaggregation model; Adj-R squared = adjusted coefficient of determination; AIC = Akaike Information Criterion; BIC = Bayesian Information Criterion; Durbin-Watson = N(2,4/T)-test statistics for errors autocorrelation; Jarque-Bera (Jarque and Bera, 1987) = normality chi-squared test statistics; Q-Ljung Box = autocorrelation chi-squared test statistics; Q(p)-Ljung-Box = chi-squared test-statistics on errors autocorrelation (at the first p-lags); H-test = F-test statistics for heteroscedasticity.

Overall, the estimates exhibit satisfactory results, in terms of both goodness of fit and linearity and independence of the residuals. These outcomes are qualitatively similar when comparing the use of the two alternative indicators. However, some concerns arise in sections K (financial services) and L (real estates) regarding both non-normality of errors and errors autocorrelation (at the 8-lag), regardless of the indicator used. The absence of autocorrelated errors at the first lag is overall confirmed by Durbin-Watson statistics, independently of the indicator used. Nevertheless, the use of TOT implies slightly higher log-likelihood values and lower values of the information criteria (AIC, BIC) in the majority of cases, except for two relevant sectors (GHI and F). Lastly, the hypothesis of errors homoscedasticity is accepted at the 95% of confidence in the majority of sections (namely, BCDE, F, J, and MN), while it is rejected in a couple of cases (K and L) irrespectively of the indicator used, and for GHI sector when using the TOT indicator.

To compare the out-of-sample forecast performances of the two indicators we compute both the mean absolute error (MAE) and the root mean squared error (RMSE) referred to the difference between the growth rates of ANA totals and the annual growth rates resulting from the sum of the four extrapolated quarters over the annual totals of the previous year. Both the statistics are computed on the last eight annual observations of the series (2013–2020). In essence, both metrics measure the out-of-sample forecasting performance of the indicators in terms of low-frequency (i.e., annual) estimates—with large errors weighted relatively higher by RMSE. Lower values in these metrics imply smaller revisions between the annual totals derived by quarterly figures estimated by the TD model and ANA.

Table 5 reports both MAE and RMSE computed for each temporal disaggregation model specification that has been tested, employing the two indicators under evaluation alternatively. According to MAE, TOT indicator outperforms 10+ indicator in eighteen cases over twenty-eight estimates, while according to RMSE, TOT indicator outperforms 10+ indicator in twenty cases over twenty-eight estimates. Overall, we observe that the use of the TOT indicator results in a smaller forecast error at the annual frequency in the majority of cases when estimating QNA over the business economy sections. Namely, the improvement occurs in the 64.3% and 71.4% of cases, according respectively to MAE and RMSE.

Table 5.

MAE and RMSE of Annual Growth Rates by ANA and Extrapolated Quarters by Temporal Disaggregation Estimates, 1995q1 to 2020q4.

		BCDE		F		GHI		J		K		L		MN
		10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT
Chow-Lin	MAE var.	0.56	0.63	1.01	1.19	0.56	1.21	1.35	0.88	0.34	0.81	1.58	0.91	1.81	1.09
Chow-Lin	RMSE var.	0.82	0.79	1.46	1.45	0.61	2.04	1.51	1.00	0.42	1.00	1.98	1.08	2.20	1.44
Chow-Lin with constant	MAE var.	0.66	0.56	1.18	0.81	1.30	1.49	0.96	0.82	0.64	0.46	2.56	2.02	1.52	1.48
Chow-Lin with constant	RMSE var.	1.01	0.90	1.67	1.08	2.33	2.80	1.57	1.17	0.87	0.69	4.72	2.46	2.92	2.99
Férnandez	MAE var.	0.57	0.57	1.12	1.31	0.68	0.83	1.52	0.87	0.40	0.52	1.51	1.01	2.13	1.16
Férnandez	RMSE var.	0.66	0.71	1.69	1.65	0.73	0.96	1.64	1.05	0.51	0.65	1.98	1.21	2.45	1.39
Férnandez with constant	MAE var.	0.60	0.47	1.42	1.30	1.34	1.52	1.00	0.78	0.73	0.70	2.26	1.93	1.48	1.54
Férnandez with constant	RMSE var.	0.96	0.85	2.08	1.98	2.45	2.86	1.51	1.11	1.04	1.00	4.52	3.98	3.07	3.05

Note. Statistics are computed on non-seasonally adjusted series over the last half of the sample.

Moreover, to assess the quality of the disaggregated series in terms of association to the reference indicator used, we both (i) compute the correlation and (ii) test the coherence of signs between the quarterly disaggregated series and the indicator used, considering both quarter-on-quarter (q-o-q) and year-on-year (y-o-y) growth rates. In particular, the coherence of signs is assessed by a binomial test where the probability of success (i.e., consistency of signs in q-o-q/y-o-y changes between the indicator and the output series in each quarter) under the null hypothesis is p = .5. Differently than MAE/RMSE, both these metrics provide a measure of the quality of TD models at the high (quarterly) frequency.

Table 6 reports the correlations for each of the estimated model, computed over the last half span of the series that is, 2007q4 to 2020q4. Overall, higher correlations, whether based on “q-o-q” or “y-o-y” growth rates, between the disaggregated series and the reference indicator are identified in twenty out of twenty-eight estimated models when the TOT indicator is used (71.4%), as compared to the 10+ indicator. It is worth noting that, irrespective of the model specification, TOT indicator outperforms the alternative one in sections F, J, L, and MN. Conversely, the 10+ indicator ensures higher quality in section GHI, regardless of the model specified. By contrast, a mixed evidence emerges for BCDE and K sections where the superiority of one indicator over the other depends on the specified model.

Table 6.

Correlations Between Q-o-q Growth Rates and Y-o-Y Growth Rates of Indicators and Disaggregated Series by Temporal Disaggregation Estimates Over 1995q1 to 2020q4.

		BCDE		F		GHI		J		K		L		MN
		10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT
Chow-Lin	Q-o-q correlation	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	0.99	1.00
Chow-Lin	Y-o-Y correlation	0.98	0.98	0.95	0.95	0.98	0.96	0.77	0.91	0.93	0.91	0.91	0.95	0.76	0.81
Chow-Lin with constant	Q-o-q correlation	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	0.99	1.00	0.98	1.00
Chow-Lin with constant	Y-o-Y correlation	0.98	0.98	0.94	0.95	0.98	0.96	0.70	0.91	0.91	0.91	0.89	0.95	0.72	0.81
Férnandez	Q-o-q correlation	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	1.00	0.99	0.99
Férnandez	Y-o-Y correlation	0.98	0.98	0.95	0.95	0.98	0.97	0.77	0.91	0.93	0.92	0.91	0.95	0.76	0.81
Férnandez with constant	Q-o-q correlation	1.00	1.00	1.00	1.00	1.00	0.99	1.00	1.00	1.00	1.00	0.98	1.00	0.99	0.99
Férnandez with constant	Y-o-Y correlation	0.98	0.98	0.94	0.96	0.98	0.96	0.67	0.89	0.88	0.87	0.88	0.94	0.72	0.78

Note. Correlation coefficients are computed on non-seasonally adjusted series over the last half of the sample.

Finally, we investigate the extent to which each indicator, respectively, co-moves with the disaggregated series by checking the coherence of signs, considering both their “q-o-q” and “y-o-y” changes. Table 7 reports p-values from a sign test based on a one-sided binomial distribution at the 95% confidence level. The results highlight that dynamics direction of both TOT and 10+ indicators follows that of the disaggregated series, both on a quarterly and annual basis. Specifically, the null hypothesis of a lower-than 0.5 probability of growth rate signs coherence is rejected with 95% of confidence, regardless of the TD specification used for both indicators, across almost all economic sections. The only exception concerns annual changes for MN section, where the null cannot be rejected for TOT indicator in three cases over four and for 10+ only once.

Table 7.

Sign Test on Q-o-q Growth Rates and Y-o-Y Growth Rates of Indicators and Disaggregated Series by Temporal Disaggregation Estimates Over 1995q1 to 2020q4.

		BCDE		F		GHI		J		K		L		MN
		10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT	10+	TOT
Chow-Lin	Q-o-q growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00
Chow-Lin	Y-o-Y growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.01	0.04
Chow-Lin with constant	Q-o-q growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00
Chow-Lin with constant	Y-o-Y growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.04	0.06
Férnandez	Q-o-q growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00
Férnandez	Y-o-Y growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.01	0.05
Férnandez with constant	Q-o-q growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00
Férnandez with constant	Y-o-Y growth rate	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.00	0.06	0.05

Note. p-Values from a sign test from 95% one-sided confidence interval of binomial distribution.

4. Concluding Remarks

The primary objective of this paper is to pave the way for the integration of a recently released short-term indicator on per-employee hours worked into the estimates of QNA hours worked at ISTAT. Due to its extension to micro-firms, the coverage of the new indicator aligns more consistently with the definitions of labor input by NA, offering advantages in terms of both quality and internal consistency among the estimated aggregates by QNA.

To achieve this objective, the first contribution of this work involves the retrospective reconstruction of the new indicator of per-employee hours worked, to align with the methodological requirements of QNA estimation at ISTAT. Specifically, a model-based approach to backwards reconstruction is proposed, where both the new indicator (covering all firms with employees) and its previous version (covering firms with more than ten workers) are the dependent variables in a bivariate SUTSE model (Harvey 1990). Based on the state-space representation of the model, the Kalman filter and smoothing algorithms enable the recovering of missing observations for the short indicator. Model estimates show that the two indicators share common factors over the trend, the seasonal and the irregular component. In comparison to alternative reconstruction methods, such as retropolation and ordinary least squares (OLS), we observe that the model-based approach yields a reconstructed quarterly series that exhibits a higher correlation with the annual totals reported by ANA at the annual frequency.

The second contribution of the analysis involves the assessment of the advantages derived from the use of the reconstructed extended indicator within the estimates of per-employee hours worked by QNA, in comparison to the use of the previous version of the indicator. The evaluation is based on diagnostics obtained from estimated TD models, employing both the Chow and Lin (1971) and Fernàndez (1981) specifications. Quarterly per-employee hours worked by NA are derived alternately using either the reconstructed or the old indicator. Conducting separate model runs for seven economic sections (NACE rev.2) within the non-agricultural business economy, our findings indicate that the reconstructed TOT indicator leads to slightly higher goodness-of-fit statistics compared to the 10+ version in the majority of the estimated models. Moreover, the predictive power of the models in terms of annual growth rates of the annualized quarterly estimated series is higher when the TOT indicator is used in the majority of cases, based on either MAE (64.3% of cases) and RMSE (71.4% of cases) statistics. Smaller out-of-sample forecasting errors imply smaller revisions in the estimated QNA, when the extrapolated quarters are benchmarked to the annual totals as soon as they become available, representing a valuable outcome for the quality of QNA estimates according to international standards (Eurostat 2018, 2021). Moreover, the reconstructed indicator exhibits higher correlation with the disaggregated series—on both quarterly and annual basis—compared to the 10+ indicator in the majority of the estimated models. This aspect stands as a strong advantage for the quality of the empirical estimates, particularly when extrapolated quarters are released while annual totals are not yet available. In such cases, the short-term indicators serve as the sole information guiding the short-term dynamics of NA estimates and orienting data users.

All in all, based on the presented results, this analysis indicates that the introduction of the updated indicator covering all firms with employees would lead to an improvement in the quality of the estimates of per-employee hours worked by QNA in various respects. However, this evidence represents only an initial step toward the incorporation of such a new source of information into official statistics compilation. The adoption of the new short-term indicator should indeed be complemented by an accurate revisions analysis of the estimates by QNA. Moreover, users and practitioners of NA data should promptly receive explanations about the factors behind data revisions and the economic significance of these revisions.

Footnotes

Appendix A

Table A2.

Comparing Correlation Between ANA Totals and Reconstructed TOT Indicator of Per-Employee Hours Worked (in Levels), According to Alternative Reconstruction Approaches, 1995 to 2014.

	BCDE	F	GHI	J	K	L	MN
OLS	0.86	0.75	0.88	0.64	0.48	0.31	0.44
Retropolation	0.86	0.75	0.88	0.64	0.48	0.31	0.44
Structural model	0.94	0.79	0.87	0.73	0.64	0.84	0.46

Acknowledgements

I am grateful to Filippo Moauro, Barbara Guardabascio and the participants of the 2nd Workshop on Time Series Methods for Official Statistics held at OECD in September 2022, for their helpful comments. All mistakes remain mine. The views expressed in this paper reflect only the author’s views and do not necessarily reflect the views of ISTAT.

Funding

The author(s) declared that they received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Laura Bisio

Received: July 2023

Accepted: February 2024

References

Bisio

Moauro

2018. “Temporal Disaggregation by Dynamic Regressions: Recent Developments in Italian Quarterly National Accounts.” Statistica Neerlandica 72 (4): 471–94. DOI: https://doi.org/10.1111/stan.12156.

Caporin

Sartore

2006. “Methodological Aspects of Time Series Back-Calculation.” Department of Economics Research Paper Series No. 56/06, University Ca’Foscari of Venice. https://ssrn.com/abstract=950923.

Chow

G. C.

Lin

1971. “Best Linear Unbiased Interpolation, Distribution and Extrapolation of Time Series by Related Series.” The Review of Economics and Statistics 53: 372–5. DOI: https://doi.org/10.2307/1928739.

Denton

F. T.

1971. “Adjustment of Monthly or Quarterly Series to Annual Totals: An Approach Based on Quadratic Minimization.” Journal of the American Statistical Association 66: 99–102. DOI: https://doi.org/10.1080/01621459.1971.10482227.

Durbin

Koopman

S. J.

2012. Time Series Analysis by State Space Methods. 2nd ed. Oxford: Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199641178.001.0001 (accessed February 2024).

Durbin

Watson

G. S.

1950. “Testing for Serial Correlation in Least Squares Regression. I” Biometrika 37: 409–28. DOI: https://doi.org/10.2307/2332391.

Durbin

Watson

G. S.

1951. “Testing for Serial Correlation in Least Squares Regression. II” Biometrika 38: 159–79. DOI: https://doi.org/10.1093/biomet/38.1-2.159.

EU. 2013. “Regulation (EU) No. 549/2013 on the European System of National and Regional Accounts.” https://eur-lex.europa.eu/legal-content/EN/ALL/?uri=celex:32013R0549 (accessed February 2024).

Eurostat. 2018. “ESS Guidelines on Temporal Disaggregation, Benchmarking and Reconciliation.” https://doi.org/10.2785/846595.

10.

Eurostat. 2021. “Quality Report on National and Regional Accounts.” https://doi.org/10.2785/85993.

11.

Fernàndez

R. B.

1981. “A Methodological Note on the Estimation of Time Series.” The Review of Economics and Statistics 63: 471–6. DOI: https://doi.org/10.2307/1924371.

12.

Gómez

Maravall

1994. “Estimation, Prediction and Interpolation for Nonstationary Series With the Kalman Filter.” Journal of the American Statistical Association 89 (426): 611–62. DOI: https://doi.org/10.2307/2290864.

13.

Harvey

A. C.

1990. Forecasting, Structural Time Series Models and the Kalman Filter. Cambridge: Cambridge University Press. https://doi.org/10.1017/CBO9781107049994.

14.

Harvey

A. C.

Koopman

S. J.

Penzer

1998. “Messy Time Series: A Unified Approach.” Advances in Econometrics 13: 103–43. DOI: https://doi.org/10.12691/ajams-4-3-2.

15.

ISTAT. 2023. “Quarterly National Accounts.” https://www.istat.it/en/archivio/285056 (accessed February 2024).

16.

Jarque

C. M.

Bera

A. K.

1987. “A Test for Normality of Observations and Regression Residuals.” International Statistical Review 55: 163–72. DOI: https://doi.org/10.2307/1403192

17.

Kalman

R. E.

1960. “A New Approach to Linear Filtering and Prediction Problems.” Journal of Basic Engineering 82 (Series D): 35–45. DOI: https://doi.org/10.1115/1.3662552.

18.

Koopman

S. J.

1997. “Exact Initial Kalman Filtering and Smoothing for Non-Stationary Time Series Models.” Journal of the American Statistical Association 9: 1630–8. DOI: https://doi.org/10.2307/2965434.

19.

Koopman

S. J.

Harvey

A. C.

Doornik

J. A.

Shephard

2007. Structural Time Series Analyser, Modeller and Predictor. STAMP 8. London: Timberlake Consultants.

20.

Koopman

S. J.

Shephard

Doornik

J. A.

1999. “Statistical Algorithms for Models in State Space Using SsfPack 2.2.” The Econometrics Journal 2 (1): 107–60. DOI: https://doi.org/10.1111/1368-423X.00023.

21.

Ljung

G. M.

Box

G. E. P.

1978. “On a Measure of a Lack of Fit in Time Series Models.” Biometrika 65 (2): 297–303. DOI: https://doi.org/10.1093/biomet/65.2.297.

22.

Moauro

2001. “Modelling a Change of Classification by a Structural Time Series Approach.” In Advances in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization, edited by Borra

Rocci

Vichi

Schader

Berlin, Heidelberg: Springer. https://doi.org/10.1007/978-3-642-59471-7_37.

23.

Moauro

2014. “Monthly Employment Indicators of the Euro Area and Larger Member States: Real-Time Analysis of Indirect Estimates.” Journal of Forecasting 33(5): 339–49. DOI: https://doi.org/10.1002/for.2295.

24.

Moauro

Savio

2005. “Temporal Disaggregation Using Multivariate Structural Time Series Models.” The Econometrics Journal 8 (2): 214–34. DOI: https://doi.org/10.1111/j.1368-423X.2005.00161.x

25.

Poncela

Ruiz

Miranda

2021. “Factor Extraction Using Kalman Filter and Smoothing: This is Not Just Another Survey.” International Journal of Forecasting 37 (4): 1399–425. DOI: https://doi.org/10.1016/j.ijforecast.2021.01.027.

26.

Proietti

2006. “Temporal Disaggregation by State Space Methods: Dynamic Regression Methods Revisited.” The Econometrics Journal 9 (3): 357–72. DOI: https://doi.org/10.1111/j.1368-423X.2006.00189.x.

27.

United Nations. 2018. Handbook on National Accounting Backcasting Methodology. New York City, NY: United Nations Statistics Division. Draft September 2018 New York. https://unstats.un.org/unsd/nationalaccount/aeg/2018/M12_8iiib_Backcasting.pdf (accessed February 2024).

Reconstructing a Short-Term Indicator by State-Space Models: An Application to Estimate Hours Worked by Quarterly National Accounts

Abstract

Keywords

1. Introduction

2. Model-Based Backwards Reconstruction of the Indicator

2.1. Structural Models in This Application

2.2. Reconstructing Per-Employee Hours Worked Series

3. Temporal Disaggregation of Per-Employee Hours Worked

4. Concluding Remarks

Footnotes

Appendix A

Acknowledgements

Funding

ORCID iD

References