Abstract
While in developing countries around 50% of total employment are informal sector jobs, and much more among the poorest, its statistical measurement is a major issue. Having long remained a black hole of national statistical systems, sound methodologies have been developed to measure the informal sector in its different dimensions (employment, production, and more broadly articulation with the rest of the economy) in a reliable manner. Taking stock on nearly four decades of experiences accumulated around the world, this article presents a system of original surveys, specially designed for this objective: the Informal Sector Mixed (households-businesses) Surveys (ISMSs). After exposing their theoretical principles, the paper addresses the main challenges of implementation in the field, detailing their advantages and limitations. In particular, we question the compatibility of ISMSs with the recommendations of the new Resolution concerning statistics on the informal economy, adopted in 2023 during the 21st International Conference of Labor Statisticians. We also identify avenues to address a certain number of challenges that still arise with this type of survey, both in the methodological field and on the institutional level. We illustrate our discussion with a set of selected empirical results drawn from surveys conducted in various contexts.
Introduction
While most developing countries are adopting economic policies aimed at reducing poverty and establishing models for sustainable growth, the statistical information systems on the informal economy, in particular the informal sector, is still deficient. This weakness severely affects the assessment of the informal sector characteristics and the implementation, monitoring and evaluation of policies towards this sector, in the short run (to face crises, like the COVID-19 pandemics, during which the informal sector was affected the most) as in the long run (to boost structural transformation). Moreover, whatever the assumptions about future growth rates in the global south, the informal sector will not disappear. It will even increase in many countries.
However, the experience accumulated over approximately four decades in conducting informal sector surveys shows that its statistical measurement is not only possible, but contrary to intuition, are no more complex to carry out than other types of surveys, such as labour force surveys (LFS), enterprises surveys (ES) or income-expenditure surveys (IES), to quote only the most classic. Among the different existing options, the purpose of this article is to present the mixed surveys on the informal sector (ISMSs), which combines households and businesses, a tool specially designed at the end of the 1980s for the statistical measurement of the informal sector. Although this type of survey has been conducted in dozens of countries, it still remains partly unknown, even among the community of survey statisticians. While there are numerous technical manuals on traditional surveys, ISMSs are to date very poorly documented. The aim of this paper is to contribute to filling this knowledge gap, so that the ISMSs enter the standard toolbox of the survey statistician. We draw on a large number of ISMSs conducted around the world to present their theoretical principles, the challenges encountered in their implementation in the field, and various concrete empirical examples. While focused on methodological issues, the paper also addresses some institutional considerations.
The paper is organized as follows. After this introduction, Section 2 presents the main principles of the mixed surveys and stresses their comparative advantage over other alternatives in measuring the informal sector. A special attention will be given to one kind of ISMS, the
Main principles of informal sector mixed surveys (ISMSs)
After years of cacophony [3], the definition of the informal economy, and in particular of the informal sector is now well established [4, 5, 6]. According to the 21st ICLS resolution, the definition of the informal sector is as follows:
“
Given the definition, the economic unit should be the starting point. In this context, two kinds of issues arise, in order to minimize sampling errors and measurement errors. First, how to elaborate a sampling frame for a target population which is not registered, thus probably unknown from public authorities? Second, how to get reliable estimators from economic units which do not keep a formal set of accounts? Without positive answers to these two challenges, indirect method should be favoured over direct survey approach.
Sampling strategy
The fundamental intuition of informal sector mixed surveys (ISMSs) is that Informal Economic Units are easier to capture through the jobs of informal sector workers than via the identification of premises in which the activity is run. The main challenge is then to build the implicit population of individuals, jobs and production units, and their relations. The sequence is, starting with the population, to get to the production units through the jobs (which play the role of an intermediary variable). The two-phase design was developed in the late 1980s in México by statisticians of the National Statistics Office (INEGI) in collaboration of researchers of the French research institute for sustainable development (IRD). They proposed a solution to overcome the conventional approach’s main shortcomings (“first strategy” in Fig. 1), that is the difficulty in achieving exhaustive coverage of the informal sector [7] (see also [8]).
Why informal establishment/enterprise surveys (IES), the natural way to measure the informal sector, is not adequate? Two major issues at stake. The first is to get a comprehensive picture of the reference’s population: the Informal Economic Units. Unless the Economic Census sampling frames is conducted and tied in with the Population Census, exhaustive coverage cannot be achieved, especially as regards activities performed at home or in non-fixed premises. Second, given the informal sector’s erratic demographic laws (business creation and mortality rates), updating registers to draw representative samples of the informal sector is ruled out. That is why most of the surveys conducted along these lines have produced unreliable and inconsistent estimators (underestimating totals; overestimating the share of the Informal Economic Units with premises, and consequently overstating the economic performances of the informal sector, as located enterprises usually perform better than non-located ones).

Two alternative sampling strategies for measuring the informal sector. Source: based on Roubaud and Séruzier, 1991 [9].
The Vietnamese case provides an empirical example of the shortcomings generated by the first sampling strategy. In Vietnam, the two kinds of surveys (enterprise survey and mixed survey) have been conducted in parallel, which allows comparing the results. Unfortunately, the Household Economic Unit survey, specially designed to capture household businesses, does not distinguish between their formal and informal status (Table 1). Nevertheless, according to the Household Economic Unit surveys, there were 3.75 million Household Economic Units at the national level in 2007. The corresponding figure drawn for the ISMS was 10.14 million. The IES survey captures just a bit more than one Household Economic Unit out of three. From 2007 to 2015, the coverage rate by the Household Economic Unit survey increased significantly (resp. 37% and 52%), but was still very low (ignoring one Household Economic Unit out of two). In terms of economic performances, the same comparison, restricted to Hanoi and Ho Chi Minh City (the two main Vietnamese cities), shows that total turnover is underestimated by a huge 60% through the Household Economic Unit survey, while the turnover per economic units is over estimated by 26% to 42% in 2007 and 2009. Obviously, these discrepancies would be even higher when considering only informal economic units, as the enterprise surveys only capture the informal sector upper tier.
In order to carry out an ISMS, some conditions are required (Guerrero et al, 2013). First, the phase 1 survey should be an “augmented” LFS (or any other household survey that include a component identifying persons in employment), in two respects. On the one hand, the criterion to identify informal sector jobs (the status of registration of the Economic Unit) should be included in the questionnaire, at least for independent workers (employers, own account workers and dependent contractors). On the other hand, to get the full picture of the informal sector, the filter survey should consider the primary and the secondary job, with the same identifying variables.
If LFSs are the best filter survey (phase 1), other household surveys may be used as second best. However, the same main requirements should be respected: the survey should ensure that all Informal Economic Units are associated with corresponding informal sector jobs, held by independent workers as their head. This condition is not always respected. For instance, the main disadvantage of using the World Bank Living Standard Measurement Surveys (LSMS) to address informal sector measurement is that there is no link between the employment section and the activities of household businesses [10].
Another option would be to rely on an “augmented” LFS only (see paragraph above), not adding a phase 2. Again, this solution is not optimal, for various reasons. Indeed, LFSs provide valuable information on informal sector: informal sector employment, number of Informal Economic Units by sector, size, type of premises, socioeconomic characteristics of Informal Economic Unit’ heads, etc. However LFSs have serious shortcomings in providing statistics on the production within the informal sector. First, measurement of income is based on one or a few questions, with limited reliability. They do not make it possible to elaborate the economic accounts of the Informal Economic Units and other kind of information (see below the list of modules of the
Using LFSs as a filter survey raises two additional sampling issues. When the LFS sampling frame has not been design ex ante to measure the informal sector, the sample size may be too small (depending on Informal Economic Units prevalence). In such a case, one may use the information provided by dependent workers to identify the Informal Economic Unit they are working for. It will make the calculation of the weight more complex but still tractable. However using employees information to track their employer location may be ethically problematic. Another option is to used various previous rounds of the phase 1 survey, as experienced in Palestine. Again, some adjustments should be made to compute the appropriate weights (combination of samples), taking into account the mortality rate. The other issue arises when Informal Economic Units are geographically concentrated in some specific areas. In this case, auxiliary information is useful. Obviously, the best option is to elaborate the LFS sample frame in order to satisfy jointly both the LFS and the informal sector survey objectives.
A final concern is the management of the time lag between phase 1 and phase 2. These periods should be as short as possible. The longer the time period between phase 1 and phase 2, the higher the risk of attrition as a greater proportion of Informal Economic Units will have disappeared. The extreme solution to avoid this problem of attrition is to conduct the two phases back to back (once phase 1 has been filled in and the Informal Economic Units identified, conduct phase 2) or even to integrate the two phases into the same questionnaire. The Peruvian household survey (
An augmented ISMS: The generic scheme of the 1-2-3 Survey
The
The first phase of the
Let us focus on the phase 2 of the
Module A.: The establishment’s characteristics Module B.: Labour force Module C.: Production Module D.: Expenditure and costs Module E.: Customers, suppliers and competitors Module F.: Capital, investment and financing Module G.: Problems and prospects Module S.: Other topics (optional)
Previous to these subject-specific modules, the first page of questionnaire begins with a screening module that to some extend includes the same questions used in the first phase to identify Informal Economic Units. This module is designed to check that information about the Informal Economic Units collected in phase 1 is accurate. Relevant information from phase 1 on Informal Economic Units selected for phase 2 (main characteristics of the Informal Economic Unit – postal address, industry, legal status, type of accounts, registers, type of premises, etc. – and the Informal Economic Unit’s holder – name, age, gender, relation to household head, job status, etc.) is reported
Compared with other informal sector surveys, the
The objective in this section is not to comment in details the results of the figures but to illustrate which kind of information can be obtained through ISMSs, which could not be obtained for other sources. We selected six of them, based on ISMSs conducted in different parts of the world.
Informal sector production and distribution accounts (WAEMU, 2017–2018).
Source: ERI-ESI surveys; based on AFRISTAT (2019) [21]. Note: in Benin the production represents 68% of the turnover, and the value added 71% of the production.
Informal sector production and distribution accounts (WAEMU, 2017–2018).
Source: ERI-ESI surveys; based on AFRISTAT (2019) [21]. Note: in Benin the production represents 68% of the turnover, and the value added 71% of the production.
Table 2 is based on a set of national harmonized surveys on labour market and the informal sector conducted by the NSOs and coordinated by AFRISTAT in the WAEMU (West Africa Economic and Monetary Union, counting 8 countries) in 2017–2018 [21]. The surveys are a local version of the

Macro-dynamics of Informal and formal household economic units production (Peru, 2007–2021). Source: ENAHO surveys; based on INEI (2022) [23]. Note: VA FEU (VA IEU) correspond to the value added of Formal (Informal) Economic Units.
As stressed above, the national household survey (ENAHO) is an integrated
Transitions from the informal to the formal economy are part of the international agenda ([24]; SDG 8). One way to assess these transitions at the micro level is to conduct panel surveys, following the same Economic Units over time. This was done in Vietnam where a 3 waves longitudinal survey were collected in 2007, 2009 and 2014 for a representative sample of Household Economic Units in Hanoi and Ho Chi Minh City [10, 11]. Figure 3 presents the transitions between formal and informal sectors. In this context, a minority of Informal Economic Units got formalized, but an equivalent number of Formal Economic Units got informalized. But for these movers, it corresponds to a transient status, as a half of them returned to their initial sector in the 3rd period. The big picture is that transitions are a complex phenomenon, which should be investigated further with panel data, as the only way to capture the micro-dynamics of Household Economic Units. A complex economic phenomenon indeed, but also a tricky statistical challenge: to define how to identify the same Economic Units (for instance, how to consider an Economic Unit with the same manager but in a different sector from one date to the other?). How to address the attrition process? It shows also the interest of adding formal household economic units in ISMSs, as in Peru since more than 20 years.
Variation of informal sector price indexes, CPI and GDP deflator (Madagascar, 1995–2010) (%)
Source: Rakotomanana (2011) [26].

Micro-dynamics of Informal and formal household economic units (Vietnam, 2007–2014). Source: based on Ralaimidona (2024) [25]. Note: FEU (IEU) correspond to Formal (Informal) Economic Units.
Despite the proliferation of surveys, it is still impossible today to reply to the simplest question concerning the informal sector: “

The previous figures present an extended set of classical indicators on the informal sector (jobs, production, prices…). However, phases 2 and 3 of the
Demand side: Consumption informal sector by quartile of consumption per head (various countries, circa 2000).
Sources:
While Fig. 4 shows that the informal sector is mainly dedicated at satisfying final consumption, the dual question is which part of the household consumption is satisfied by the informal sector? Phase 3 of the
The increase in the number of informal sector surveys over the last thirty years is an indication of their progressive anchorage in the world’s statistical toolbox on the informal sector, to the detriment of enterprise surveys, which statisticians preferred in the past (enterprise surveys have not completely disappeared). While this change in the method of understanding the informal sector is squarely in line with the recommendations of the 1993 ICLS Resolution, a number of difficulties currently encountered – some old, others are more recent and potentially engendered by the new 2023 ICLS Resolution are worth highlighting.
First, concerning the definition and boundaries of the informal sector, the 2023 ICLS Resolution introduced three main changes: a) to get aligned with the SNA concepts, the Informal Economic Unit should be mainly intended for the market instead of partly intended for the market; b) the Informal Economic Unit should be not registered (not covered by formal arrangements), while 1993 Resolution accepted other second best criteria, like the size; c) agriculture is now included in the boundaries of the informal sector. In fact, none of these changes create serious challenges to the ISMSs strategy. The non-registration criterion was already adopted in most of the ISMSs (and systematically in the
Second, the development of platform workers and the introduction of dependent contractor category in the ICSE-18 [29] add an additional degree of complexity to identifying Informal Economic Units in the filter survey (LFSs) It should be noted that it is a common challenge to all LFSs, but does not create specific problems in the second phase. The difficulty does not rely mainly on distinguishing formal and informal dependent contractors, as the criterion for her business to be considered as an Informal Economic Unit is the same as for independent workers (provided that non registration apply to their own business and not the enterprise they are working for), but arises because some of them may declare themselves as employees (of the platform there are working for, for instance). Clear instructions should be given upstream to interviewers to avoid this kind of misclassification and question should be included that enables an identification of this group in the LFSs The other case of misclassification (dependent contractor declaring themselves as independent workers) is not an issue, as, once identified as holding an Informal Economic Unit, they will be surveyed in the second phase. It is worth noting that some dependent contractors might only provide their labour as input. However in most cases, they should invest in their own Economic Unit, not providing capital being one of the contractor main incentives to develop this kind of labour arrangement (for instance, the car of platform drivers).
Third the question of the institutionalisation of the ISMSs within the official National Statistical System is at stake. Today, relatively few countries conduct mixed surveys regularly. Apart from the necessity to disseminate information about the long standing experience on ISMSs and advocating activities to promote them, the time frame to implement these surveys should be considered. Because of their cost and the fact that the structural parameters (technical coefficients) does not change so quickly over time, high frequency survey, annual or infra-annual ISMSs may not be required. The ISMSs could be conducted every 3 to 5 years, and at least at each change in the national account basis. In between, the LFS should be used to update the data, in terms of jobs and economic output [30]. On the one hand, the LFS captures employment in the informal sector. On the other end, in general the LFS provides information on job earnings: both employee wages and independent workers mixed income, which correspond to the two value added main components. Production taxes are missing, but they are marginal as shown by Table 2 (1% in the WAEMU). As seen above, these measures may not be completely reliable (and probably underestimated), but their evolution is a good proxy of the informal sector value added dynamics, provided the bias is constant over time (a plausible hypothesis; Roubaud, 1989). A more ambitious option would be to develop a light phase 2 questionnaire, with a selection of key variables, which could be grafted to the LFS without overloading the survey.
Finally, in this paper, we show that ISMSs are a good (and according to us, the best) candidate to measure the informal sector in its different dimensions. From its inception four decades ago, a lot of experience has been accumulated at the international level to prove it is a robust instrument to be incorporated in the toolbox of the survey statistician. As for other surveys, there are still a number of challenges to be overcome, the main one being the elaboration of appropriate deflators. Toeven better equip the countries to conduct mixed surveys, it would it be beneficial to implement cognitive testing to try to develop more respondent friendly question. Methodological exercises should also be conducted: for instance to assess pros and cons of direct questions vs detailed questions on economic outcomes, the extent of misclassification of dependent contractors, employees and independent workers, the impact of proxy respondent on data quality, or list experiments to assess social desirability biases. But to begin with, taking stock on past experiences, accumulated over the decades but never consolidated, is required urgently.
However the main challenge is not technical but institutional. Because of the lack of institutionalization within National Statistical Offices, ISMSs lost momentum in the recent years. This is particularly the case of the phase 3 of the
Footnotes
Acknowledgments
I thank the two anonymous referees for their valuable comments.
