Abstract
Direct flicker is a perception effect causing discomfort generated from lighting systems. While the short-term flicker indicator,
1. Introduction
Temporal light modulation (TLM), variations in the light intensity or spectral distribution as a function of time, is present in almost all lighting systems including solid-state light sources. Inadequate selection of drivers for lamps or dimmers, and compatibility problems between the connected electrical components can lead to noticeable temporal light artefacts (TLAs), such as direct flicker or stroboscopic effects. TLA refers to the visible portion of TLM responses, i.e., those that can be perceived by the human eye, while TLM responses encompass the full range of temporal responses, including both visible and non-visible effects, which may have physiological impacts even if they are not directly perceptible. Direct flicker, as an important perceptual effect, is defined as the direct perception of temporal changes in light output with a steady gaze. It is considered a type of TLA that may cause unwanted discomfort or health problems.1–5 The direct flicker effect is common in everyday life, which has been widely observed in products from LED holiday light strings to lamps.
The availability of direct flicker metrics, their ease of use, and measurement uncertainty are important for product characterization. Short-term flicker indicator,
Recently, to validate the consistency of TLM measurements and to investigate the measurement uncertainty of TLM quantities, the International Energy Agency (IEA) 4E Solid-State Lighting Annex organized an Interlaboratory Comparison of TLM Measurements of Solid-State Lighting (SSL) Products (IC 2023),8,9 which evaluated direct flicker metrics,
The MP metric uses a calculation method in the frequency domain with shorter measurement time (several seconds), similar to that used in SVM.
11
Specifically, the MP metric uses the relative light waveform as input, weights the DC-normalized Fourier transform (FT) spectral components of the waveform using a perceptual sensitivity weighting function based on experimental data from human vision and sums them in quadrature to derive a metric that predicts the degree of perceived visible modulation.
13
As the MP metric was designed to measure flicker solely from the luminous output of a lamp, and is not necessarily concerned with power line disturbances, it requires a much shorter measurement time (∼2 s) compared to 180 s in
M P can evaluate the flicker from the lamp alone, allowing reproducible measurements with low uncertainties, and is thus suitable for testing of lighting products. However, large variations in MP measurements were observed in IC 2023. 9 Thus, the aim of this study is to investigate the causes of the measurement variations in MP and to improve the calculation method of the MP metric to provide more robust results. Our investigation revealed that the MP calculation method is very sensitive to how the waveform was digitally acquired. It was found that different measurement conditions (e.g. measurement duration, starting phase of waveform, sampling rate) can have a significant impact on the MP value. The experimental analysis on this aspect is discussed in Section 2.
In addition, the choice of the frequency range of the direct flicker sensitivity function is critical for perceptual relevance of the results. Based on the IEEE 1789 Standard on LED Lighting Flicker and Potential Health Concerns, risks of seizures due to flicker at frequencies are considered within the range of about 3 Hz to 70 Hz.16,17 The temporal contrast sensitivity function (TCSF) is used to characterize the relation between flicker visibility and temporal frequency. De Lange 18 and Kelly19,20 measured TCSFs using different visual fields, for several retinal illuminance levels. In 2015, the perceptual modulation (MP) flicker metric was developed at the Lighting Research Center based on Bodington’s visual data. 7 The total perceived modulation is calculated as the square root of the sum of the squares of the perceptual frequency components. The component perceptual strengths are calculated by dividing the modulation amplitudes (Ak, where k indicates the kth frequency component) by the DC level (A0) for Weber–Fechner law scaling, and dividing by the modulation detection threshold (MDTH,k) value for each frequency, as indicated in Equation (1). 7
The experiment of Bodington et al. 13 was conducted in central vision, by directly observing a 600-lumen A-lamp (7° visual angle) illuminating a light grey background (∼90° viewing field, ∼3000 K, 20 cd m−2 to 200 cd m−2). The resultant metric is most applicable for emission measurements, i.e. product characterization. The direct flicker sensitivity curve of MP covers a frequency range 5 Hz to 65 Hz, slightly narrower than the IEEE recommended risky frequency range of ∼3 Hz to ∼70 Hz.16,17
In 2017, Perz et al.
21
conducted another direct flicker experiment using a diffused stimulus covering both peripheral and central vision (large visual field ∼120°), at high correlated colour temperature (CCT of 6500 K) and high luminance level (209 cd m−2). They found that direct flicker can be observed at more than 80 Hz. Based on these data, in 2017, the flicker visibility measure (generally referred to as FVM) was developed by Perz et al.,
21
with a mathematical definition similar to that of MP, calculated as a squared summation of the DC-normalized modulation of each frequency component (
One point worth noting is that both MP and the flicker measure of Perz et al. did not employ windowing techniques to guard against spectral power leakage in their calculations. Converting a time-domain sampled waveform to a power spectrum in the frequency domain by way of a discrete Fourier transform (DFT) assumes that the measured signal is one full or multiple full cycles of a periodic signal. Thus, for the DFT, both the time and frequency domains are circular topologies, and the two ends of the measured signal are assumed to be connected. When the measured waveform signal has an integer number of cycles, the DFT result is correct as the assumption is satisfied. However, when the measured signal does not have an integer number of cycles, the truncated waveform of the measured signal has different characteristics than the original continuous time-domain signal that are generated by the introduction of a sharp transition change to the time-domain signal where the two ends of the measured signal meet. This sharp transition change, or artificial discontinuity, appears in the DFT as additional frequency content that does not exist in the original signal. This can look as if the energy of the dominant frequency of the signal is leaking into other frequency regions, and so it is called spectral leakage. 22 A windowing technique is an established method and is often used in digital signal processing to improve this spectral leakage issue; it consists of multiplying the time-domain signal by a set of numerical weights the same length as the signal, called a window, with the amplitude peaked at the centre with a value of 1 and gradually decreasing symmetrically towards zero at the edges of the window. 22 The window introduces additional spectral components to the original FT spectrum while reducing overall spectral power, so a correction factor is needed to ensure the consistent scale of the result and maintain the perceptual relevance of the direct flicker measure. The ‘window + correction factor for integrated spectral power’ method presented in this paper will not only benefit the MP metric but also the other TLA measures working in the frequency domain, such as Perz et al.’s FVM. The windowing technique has been implemented in SVM (Mvs),11,23,24 but it seems that the correction factor used in Mvs does not fully address the integration of spectral power when an additional windowing function is applied to the waveform data. Instead, it appears to scale the peak amplitudes to match the values from the rectangular-windowed approach. Since window functions can affect the spectral amplitude distribution in various ways beyond just the peak amplitude (such as spectral peak broadening), it might be necessary to consider a correction factor that accounts for the integrated spectral power, rather than just the peak amplitudes, when working with frequency-domain metrics like MP and Perz et al.’s FVM. In the case of Mvs, the correction factor used with ‘findpeaks()’, a MATLAB function employed in the normative MATLAB script provided by the IEC to compute this metric, may serve as a workaround. 11 However, this approach does not fully account for the changes in integrated spectral power introduced by windowing. A correction factor explicitly designed to compensate for these spectral power integration effects is not included in the current implementation. This omission results in an incomplete correction from a spectral energy standpoint.
In this current study, the original MP is revised by adding a Hann window and the above correction factor, and its performance is evaluated. The light waveforms of 11 typical commercial lamps, including 10 LED lamps and one compact fluorescent lamp (CFL), are measured using our customized measurement system at the National Institute of Standards and Technology (NIST). The MP values are first calculated for the measured waveform at different durations and different starting phases using the MATLAB script included in the ASSIST document 7 and then recalculated using the revised MP calculation method, MP,26, to verify that the revision significantly improves MP reproducibility. To broaden the scope of application of the MP metric, the sensitivity curve in the original MP is replaced by Perz’s sensitivity curve, 21 which has a higher sensitivity and a broader frequency range than the original sensitivity curve. Furthermore, an extensive statistical analysis is conducted to give recommendations on measurement duration, sampling rate and the anti-aliasing filter settings, by comparing the statistical variations in MP,26 measured under different conditions. Finally, the performance of the MP,26 metric and the recommended measurement conditions are validated using the measured waveforms of the 11 commercial lamps. This study presents a general method, referred to as the ‘window + correction factor’ method, to improve TLA metrics (particularly MP) implemented in the frequency domain, and a novel method using statistical variations to reveal the effects of different waveform acquisition conditions (e.g. sampling frequency and duration) on TLA metric calculation. Both these methods can be utilized in best-practice recommendations and in regulations and standards for direct flicker evaluation. The MATLAB code for the MP,26 metric is available in Supplemental Material associated with this paper.
2. Variations in measurement results of the original MP
In the current study, the waveforms of light filtered by a V(λ) filter from a total of 11 different lamps (60 Hz AC power operation), including 7 relatively new LED lamps, 3 several year-old LED lamps and 1 CFL, were measured using a photodiode with a built-in amplifier, an anti-aliasing filter (commercial active low-pass filter) and sampled with a commercial high-speed digitizer (maximum sample rate of 20 MHz). The details of the tested lamps, including technical data, time and place of purchase, are given in Table A1. All the lamps were stabilized for 30 min before the measurements. The waveforms measured with the anti-aliasing filter are shown in Figure 1. The measurement duration for all waveforms was 180 s. The 10 LED lamps are indicated as ‘Lamp 1’ to ‘Lamp 10’, and the CFL is shown as ‘Lamp 11’.

Normalized waveforms of the 11 lamps. Waveforms were measured in the laboratory using a cut-off filter at 1 kHz with a 12 dB/octave roll-off and digitized at a 50 kHz sampling rate. Lamps 1 to 7 represent the 7 new LED lamps, Lamps 8 to 10 represent the 3 relatively old LED lamps and Lamp 11 represents the CFL
To investigate the impact of the waveform duration on the MP values, the measured waveforms were cut into different segments of different durations, varying from 1 s to 180 s, always starting from the beginning of the measured waveforms. The waveforms used for the MP calculation were filtered by an anti-aliasing analogue filter with a cut-off frequency at 1 kHz and a roll-off of 12 dB/octave, sampled at 50 kHz. The MP values of the waveforms at different durations were calculated for each lamp and plotted as solid lines in Figure 2. Relatively large variations in MP values can be observed for most lamp waveforms, due to different measurement durations. To compare, the MP,26 values calculated using the revised method and the waveforms measured at the recommended conditions described in the following sections (filtered at a cut-off frequency at 1 kHz and a roll-off of 12 dB/octave, sampled at 10 kHz) were plotted in the same figure as dashed lines. As can be observed, the variations of MP,26 values are reduced to within 0.01 for all 11 lamps, when using waveforms measured over 6 s as recommended in the following sections.

M P values calculated for the waveforms of measurement durations varied from 1 s to 180 s (solid curve), and the corresponding revised metric, MP,26, calculated using the revised method and recommended measurement conditions described in subsequent sections (dashed curve). Waveforms used for the calculations were measured with an anti-aliasing filter featuring a 1 kHz cut-off frequency and a 12 dB/octave roll-off, and were digitized at a sampling rate of 10 kHz.
The impact of starting phase of the waveforms on the MP and the revised MP,26 values was also investigated. The waveforms used for the MP calculation were filtered at 1 kHz cut-off frequency and sampled at 50 kHz, and the waveforms for MP,26 calculation were filtered at 1 kHz cut-off frequency and sampled at 10 kHz as recommended in the following sections. The measured filtered waveforms were cut into segments of the same waveform length (100 s) with varied starting sample points for calculating MP. The total number of the sampling points within one cycle of each waveform is around 833 which can be obtained by dividing the duration of one cycle by the sampling time interval ((1/60 Hz)/(1/50 000 Hz)). The phase interval between the sampling points is around 0.432°. For calculating MP,26, the waveforms were cut into segments of 6 s, which is the recommended duration for MP,26 measurement. Calculating the MP and MP,26 values of waveform segments starting from each sample point of the first cycle gives MP and MP,26 as functions of starting phases, shown as solid and dashed lines in Figure 3. Relatively large variations in MP values can be observed for many of the lamp waveforms, due to different starting phases. The revised MP,26 values, measured at recommended conditions, show virtually no variation due to starting phases. These variations in MP values for different measurement durations and starting phases are evidently due to spectral leakage, which is minimized by the revised calculation method of MP,26, as described in the next section.

M P values calculated for the measurement waveforms of starting phases varied from 1° to 360° (solid curve), and the corresponding revised metric, MP,26, calculated using the revised method and recommended measurement conditions described in subsequent sections (dashed curve). Waveforms used for the calculations were low-pass filtered with a 1 kHz cut-off frequency and a 12 dB/octave roll-off. For MP calculations, the waveforms were sampled at 50 kHz over a duration of 100 s, whereas for MP,26 calculations, they were sampled at 10 kHz over a duration of 6 s
Similar reductions in measurement uncertainty were observed in simulations using waveforms with 100% modulation and MP values well above the visual threshold of 1. Figure 4 shows an example of MP variation for simulated sinusoidal waveforms modulated at 64.1 Hz and 64.9 Hz. The waveforms have a duration of 6 s and were sampled at 10 kHz as recommended in the following sections. The variation of MP values (∼1.8 in absolute values of MP) has been removed by using the revised calculation method. This indicates that the method is effective even when the simulated flicker is very strong and easily perceptible. This result is consistent with what was observed in measurements of a commercial lamp that had very low MP values – i.e. very little flicker. Together, these findings demonstrate that the relative improvement in measurement uncertainty applies across a wide range of flicker levels, from barely perceptible to clearly visible.

M P values calculated for the simulated sine waveforms of starting phases varied from 1° to 360° (solid curve), and the corresponding revised metric, MP,26, calculated using the revised method and recommended measurement conditions described in subsequent sections (dashed curve). Theoretical MP and MP,26 values derived directly from the sensitivity curves are also shown as markers at the 180° starting phase for visual comparison
Although the products shown in Figure 3 exhibit relatively low MP values, products with higher
3. Revised method of MP calculation
3.1 Hann window
Spectral leakage is thought to be the cause of the substantial variation in MP values for the same waveforms when resampled at different measurement durations and starting phases, and due to the varied amplitudes of the circular discontinuities. 22 This effect had not been explicitly considered in the original formulation of MP, nor in similar metrics such as Perz et al.’s FVM, which also lacked the application of proper windowing techniques with window correction factors to mitigate spectral leakage. A windowing technique is often used to improve this spectral leakage issue in digital signal processing, as explained in Section 1. Illustrations of a rectangular window (a weighting of 1 everywhere, so no change to the originally sampled data), a Hann window and a flattop window are shown in Figure 5. This technique allows the two ends of the measured waveform signal to coincide in zero amplitude, resulting in a cyclical time-domain signal with no added discontinuities.

(a) Window functions and (b) their amplitude-normalized DFT spectra for a sine wave with a 30-Hz modulation frequency, sampled at 20 kHz
Several different types of window functions have been developed for digital signal processing, depending on the type of spectral information sought. For example, Figure 5 shows how different windowing techniques (rectangular, Hann and flattop) affect the calculated spectral content for a 30 Hz sinewave. A Hann (or Hanning) window is generally the best option in most cases, balancing spectral resolution (the width of the spectral peak) with reduced spectral leakage. The Hann window has a sinusoidal shape (see Figure 5), which reaches zero at both ends avoiding all cyclical discontinuity in the analysed signal.
To reduce the spectral leakage, a Hann window was added in the MP calculation method before performing the FT. With the addition of the Hann window, the variation in MP values due to the DFT spectral leakage has been largely reduced. Mantela et al. also pointed out the need for windowing in computing TLM metrics. 26
3.2 Window effect correction factor
The added Hann window affects the DFT components of the waveforms in that it decreases the amplitude of the DFT peaks while broadening their bandwidths (see Figure 5). Therefore, if keeping the same perceptual sensitivity weighting function of MP, the addition of the Hann window would result in a different DFT spectrum, and thus a different MP value, termed MP,H. This discrepancy between the MP,H and the original MP with its uniform window, which can also be considered as using a rectangular window, termed MP,R, should be corrected as it does not reflect a difference in perception. The following steps were used to derive the correction factor for the window effect.
The window effect on MP can be quantified by the ratio of the sum of the squared FT components of the Hann windowed waveform to the sum of the squared FT components of the rectangular windowed waveform. This ratio will be used to correct the effect of the Hann window so that values of the revised MP have the same meaning as the original MP values. For easier calculation, the summation of discrete FT frequency components is replaced by an integral of spectral density over frequency. As the use of the continuous formulation simplifies the mathematics with equivalent results, it will be used throughout the following mathematical derivations.
The Hann windowed MP (
In Equation (3), the DC components in the Hann windowed (
Based on the Convolution Theorem, the FT of a convolution of two functions is the pointwise product of their FT. Therefore, the FT of the product of the temporal wave (
From Fubini’s Theorem, 27 a double integral can be computed by using an iterated integral, the square of the integral in the numerator of Equation (5) is therefore given as shown in Equation (6).
Equation (6) can be substituted into the numerator and denominator of Equation (5). As the FT spectrum of the window is left–right symmetric on the frequency axis, the expression in Equation (5) can be rewritten as in Equation (7).
From Equation (7), it is clear that if this is to be equal to 1,
For a unit length window, the Hann window and rectangular window functions in the time domain can be mathematically described as shown in Equations (9a) and (9b).
The FT frequency response of a unit length Hann window and a rectangular window are as given in Equations (10a) and (10b).
By substituting the FT spectra of the window functions (Equations (10a) and (10b)) and eliminating the identical parts in the numerator and denominator in Equation (8),
Using the same method as described above for other window functions leads to different correction factors. For example, the correction factor for a Bartlett (triangular) window is approximately 1.155, and for a flattop window is approximately 1.942.
The Hann window corrected form
As shown in Figure 6, the MP variations due to spectral leakage are significantly reduced after adding the Hann window and its correction factor. The sensitivity curve of the MP metric derived from Bodington et al.’s experiment starts with a relatively high value at 5 Hz modulation frequency and ends with a small but non-zero value at 65 Hz. At modulation frequencies below 5 Hz or above 65 Hz, the weighting values are all set to zero. As a result, the weighting value changes abruptly at 5 Hz and, to a lesser extent at 65 Hz. However, there is often an uncertainty in the dominant frequency of a DFT spectrum of measured waveforms, due to different sampling durations and the phase of the waveform when sampling starts. This uncertainty is especially high for short duration waveforms due to low frequency resolution in the resulting DFT spectrum. When this uncertainty in the dominant frequency occurs at the end of the weighting function’s frequency range (e.g. 5 Hz), it results in a large variability in the MP values, from values near zero to several hundred, depending on the waveform. Therefore, the abrupt change in the weighting function at 5 Hz may result in a potential problem for the evaluation of the direct flicker effect of the TLM using the MP metric without a method to prevent this from happening. Fortunately, the use of the Hann window greatly reduces the variations in MP due to the metric’s sensitivity cut-offs at 5 Hz and 65 Hz. Spectral broadening will result in some of the actual spectral content less than 5 Hz being transferred (leaked) to frequencies greater than 5 Hz and inflating the metric. Likewise, some spectral content above 5 Hz is assigned to frequencies less than 5 Hz and does not contribute to MP. The combined effect is a smooth transition from cut-off to cut-on frequencies. The benefit of using the Hann window is that this leakage is much more limited across frequencies and much less in magnitude for different sampling phases and durations than with a rectangular window. Figure 6 compares MP with and without Hann windowing by plotting the range of MP values obtained for sine waves sampled over a range of phases (100 random phases) from 0 to 2π. The Hann-windowed method reduces MP variation at each frequency to negligible amounts and produces a less abrupt transition at the 5 Hz cut-off frequency. This less abrupt transition at 5 Hz with negligible variations due to starting phase is preferable to the much larger and variable spectral leakage of the rectangular window that extends farther in frequency from the 5 Hz cut-off.

M P with and without Hann window as a function of modulation frequency for 2-s sine waveforms at 100% modulation sampled over a range of starting phases from 0 to 2π. The median and interquartile range of MP values at each frequency for different waveform starting phases are shown as solid lines and shaded regions (red for without Hann window, blue for with Hann window). The interquartile range for the MP with Hann window is indistinguishable from the line
4. Modifications in the sensitivity weighting function
The perceptual sensitivity weighting function in the original MP calculation covers the modulation frequencies from 5 Hz to 65 Hz. Furthermore, the original sensitivity curve of the MP metric is derived under an experimental condition simulating an individual lamp in a small indoor environment, which may underestimate the direct flicker effect in application scenarios with higher light levels and high correlated colour temperature (CCT), where the direct flicker effect tends to be more severe and harmful to health. Therefore, another sensitivity curve from the study of Perz et al., 21 obtained under high CCT (6500 K) and high luminance (209 cd m−2) conditions was adopted because it represents viewing conditions that heighten direct flicker sensitivity and extends the sensitivity range to 80 Hz. This sensitivity curve is incorporated in the revised MP for characterization of the direct flicker effect in a wider range of applications. The sensitivity curve was mathematically derived following the same procedure described in Perz’s publication. 21 In their study, flicker visibility thresholds were first averaged across participants. These mean threshold values were then converted into sensitivity values by taking their reciprocals. The mean sensitivities, expressed on a logarithmic scale as a function of log frequency, were subsequently interpolated using a shape-preserving piecewise cubic interpolation to generate the TCSF.
A comparison of the two sensitivity curves is shown in Figure 7. According to the IEEE Standard 1789 and other literature reviews, flicker causing health issues typically occurs in the modulation frequency range of 3 Hz to 70 Hz.16,17 Accordingly, the revised MP,26 calculation method also incorporates this updated sensitivity weighting curve from 3 Hz (indicated as dotted line in Figure 7) to 80 Hz.

5. Recommended measurement conditions for MP,26 calculation
We have evaluated the variation of MP,26 caused by varied spectral leakage due to different starting phases, based on simulated sinusoidal and square waves. An example of the MP,26 variation at modulation frequencies below 10 Hz at 2 s measurement duration is shown in Figure 8, for sinusoidal and square waves with 100% percent flicker. The median values and interquartile range of MP,26 are indicated. The variation of MP,26 is negligible for sinusoidal waves, but more noticeable for square waves, especially at modulation frequencies below 3 Hz, where the transition in the sensitivity curve is much sharper. (Note that for square waves, MP,26 does not quickly decrease to zero below the 3 Hz cut-off because of the higher frequency harmonics present in square waves.) MP,26 at low modulation frequencies varies more with short measurement durations (e.g. 2 s), where the lower DFT spectral resolution results in higher spectral uncertainty at the dominant frequency. Increasing the measurement duration (e.g. 5 s) would reduce the spectral uncertainty and the MP,26 variations, as shown in Figure 8c. In practice, a duration of 5 s will result in negligible errors.

M P,26 as a function of frequency for sine waves (a, duration 2 s) and square waves (b, duration 2 s; c, duration 5 s) at 100% modulation, sampled over a range of starting phases from 0 to 2π. The median and interquartile range of MP,26 values at each frequency across different starting phases are shown as solid lines and shaded regions (red for sine waves, blue for square waves). For the sine wave and the 5 s square wave, the shaded interquartile ranges are indistinguishable from the lines
Therefore, it is crucial to evaluate and recommend the minimum measurement duration for accurate MP,26 calculation. Besides, the sampling rate might be influential on calculated MP,26 when an anti-aliasing filter is applied in practical measurements. The effects of measurement duration, sampling rate and anti-aliasing filter setting (digital anti-aliasing filters with different cut-off frequencies and filter orders) on MP,26 variation due to different starting phases were investigated at varied modulation frequency below and above 3 Hz, respectively, using simulated square waveforms with random noise added. The signal noise was simulated based on the noise of the dark current measured at the NIST photometry lab during the IC 2023. The conditions are shown numerically in Table 1.
The different measurement conditions used for MP,26 calculation
Since the sampling rate is required to be higher than twice the Nyquist frequency of the anti-aliasing filter, only sampling rates complying with the Nyquist criterion were used to configure the measurement conditions for the MP,26 calculations. The MP,26 values for 100 random starting phases were calculated for each combined measurement condition. A total of 1 296 000 simulated waveforms with varying conditions and random starting phases were generated for modulation frequencies both below and above 4 Hz. The MP,26 results were statistically analysed in RStudio using a generalized linear mixed-effects model (R function glmer). 28 The model accommodated the unbalanced design, which included differing numbers of MP,26 observations across conditions. Residual diagnostics were conducted using the DHARMa package, 29 to verify the model’s validity and robustness. The statistical results are indicated in Table 2. According to the analysis of variance (ANOVA) for the general linear mixed-effect model, the measurement duration, anti-aliasing filter setting (cut-off frequency, filter order and their interaction) and modulation frequency have statistically significant impact on MP,26 variation with a p-value of <0.01.
ANOVA table (Type III Wald chi-square tests) for the impact of different measurement conditions on the MP,26 variation at modulation frequencies below 4 Hz (significant impact factors are in italic)
To find which measurement settings (duration, filter setting) result in significantly different MP,26 variations, pairwise comparisons between factor levels (different durations, filter cut-off frequencies, filter orders, etc.) based on the statistical model output were conducted using the R-function ‘emmeans’ in ‘emmeans’ library. 30 The pairwise comparison results were confirmed based on the raw data by the pairwise t-test using the R-function ‘pairwise.t.test’ with Benjamini and Hochberg (‘BH’) testing corrections. 31 The ‘BH’ methods control the false discovery rate, the expected proportion of false discoveries amongst the rejected hypotheses. 32 The numerical details of corrected p-values for pairwise comparison of duration levels are included in Table A2. The results show that for modulation frequencies below 3 Hz, measurement durations of 5 s or shorter yield a significantly larger MP,26 variation compared to longer measurement durations. Furthermore, for measurement durations longer than 6 s, the MP,26 variation is no longer (statistically) significantly different. Additionally, a measurement duration of 5 s results in practically negligible errors. Therefore, we recommend a minimum measurement duration of 5 s for more accurate MP,26 calculations at modulation frequencies below 3 Hz. No statistically significant difference was found between the different filter settings based on the results of pairwise comparison analysis after the BH correction (though the interaction between the cut-off frequency and filter order was found significant from the ANOVA table), which is reasonable as the evaluated modulation frequencies (≤3 Hz) are very low compared to the filter cut-off frequencies.
The same statistical analysis was performed for waveforms with modulation frequencies above 3 Hz (from 3 Hz to 100 Hz), where the MP,26 sensitivity curve changes smoothly leading to smaller MP,26 variation. The significant impact factors found for modulation frequency below 3 Hz were also found significant above 3 Hz. In addition, the sampling rate was found to be an additional significant impact factor for modulation frequency above 3 Hz. With higher frequency components present in the testing waveforms, the sampling rate becomes critical for successful anti-aliasing.
The pairwise comparison results for modulation frequency above 3 Hz are also shown in Table A2. Based on the pairwise comparison analysis, at a sampling rate of 1 kHz, the MP,26 variation is significantly different from the higher sampling rates. The pairwise comparison result also indicates significant difference in MP,26 measured at 1 s compared to other durations. These results lead us to recommend a minimum measurement duration of 5 s and a sampling rate of at least 2 kHz for MP,26 measurements. Regarding filter settings, a significant difference in MP,26 variation was observed between filters with cut-off frequencies below and above 400 Hz at modulation frequencies above 3 Hz. Therefore, we recommend using a cut-off frequency of ≥400 Hz when the filter’s cut-off is not sufficiently sharp. A filter order of at least 2 is also recommended, as first-order filters produced MP,26 results that differed significantly from those obtained with the other four tested filter orders, while no significant differences were found among orders 2 and higher. A recent study by Li et al. 12 has shown that the sampling rate of a first-order anti-aliasing filter may need more than twice the Nyquist frequency depending on the waveform. The current MP,26 results obtained for the first-order anti-aliasing filter may therefore be different from those for other filter orders due to the same reason. A summary of recommended measurement conditions for the MP,26 calculation is indicated in Table 3.
Recommendation of measurement conditions for MP,26 calculation
Taking the 100% modulated square wave as an example, the MP values are calculated both with the original method and the revised method for modulation frequencies between 3 Hz and 100 Hz with an interval of 0.05 Hz. The uncertainty in MP is estimated by the standard deviation at each modulation frequency. Comparing the MP calculated with the original method using 2 s duration and 1 kHz sampling rate, with the revised MP,26 using the proposed measurement conditions (5 s measurement duration, 2 kHz sampling rate, 400 Hz cut-off frequency and second order anti-aliasing filter), the average uncertainties have been reduced to 5% of the original for modulation frequency between 3 Hz and 100 Hz. The average standard deviation, normalized to the mean calculated MP, was reduced from 38.42% to 0.29%. Additionally, the average absolute uncertainty – initially 5.15 (well above the metric’s visibility threshold of 1) – was reduced to 0.09. The improved reproducibility of MP,26 for typical light sources is illustrated in Figures 2 and 3.
5. Conclusions
In the current study, the light waveforms of 11 typical commercial lamps including 10 LED lamps and 1 CFL were measured. Substantial variations in the original MP values (defined in the ASSIST document) 7 for some types of waveforms were found when using different durations and starting phases, and it was discovered that this was due to the DFT spectral leakage caused by finite duration waveform sampling. The MP variation caused by spectral leakage was greatly improved by adding a Hann window. To maintain the perceptual relevance of the revised MP, a window correction factor was added to account for the window’s effect on integrated spectral power. To broaden the scope of application of the MP metric, the sensitivity weighting function in the MP calculation was replaced by the one derived from the experimental data of Perz et al. in 2017, covering a frequency range of 3 Hz to 80 Hz. Specific waveform measurement conditions, including measurement duration, sampling rate and the anti-aliasing filter settings, were recommended based on an extensive statistical analysis of MP,26 variation due to these effects. A linear mixed-effects model was fitted to the MP,26 results as a function of different measurement effects (duration, sampling rate, filter cut-off frequency and filter order). A pairwise comparison between different levels in each measurement effect was conducted. A measurement duration of ≥5 s, sampling rate of ≥2 kHz, anti-aliasing filter cut-off frequency of ≥400 Hz and filter order ≥2 are recommended for accurate measurement and calculation of the revised MP,26. With MP,26 and the recommended measurement conditions, the average absolute uncertainties for measurement of the sample of 11 commercial lamps have been reduced to less than 5% of the original (from 5.15 to 0.09) for modulation frequency above 3 Hz. Similar percentage reductions in uncertainty were obtained for simulated waveforms having 100% modulation and MP values much higher than the visual threshold of 1, thereby demonstrating that the relative improvement in measurement uncertainty seen in the commercial lamp sample exhibiting very low MP values would also occur for lamps having perceptually high amounts of flicker. This study proposed an approach with ‘window + correction factor for integrated spectral power’ to be used in the frequency-domain TLA metrics without losing the perceptual correlation, and a novel statistical analysis using a linear mixed-effects model for comparing the different measurement conditions for metric calculation, providing a reference for the recommendations in regulations and standards for direct flicker evaluation in practical measurements. The MATLAB code for the MP,26 is provided as Supplemental Material with this paper, and will also be made publicly available in the revision of ASSIST document when it is published. The perceptual threshold of the revised MP,26 is planned to be validated through human visual experiments in future studies.
Supplemental Material
sj-docx-1-lrt-10.1177_14771535261435634 – Supplemental material for Revision of MP calculation method for flicker measurement
Supplemental material, sj-docx-1-lrt-10.1177_14771535261435634 for Revision of MP calculation method for flicker measurement by J Li, A Bierman and Y Ohno in Lighting Research & Technology
Footnotes
Appendix A
Filter order (modulation frequency: below 3 Hz/above 3 Hz)
| Order | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| 1 | NA |
|
|
|
|
| 2 | 0.72 | NA | 0.006 | 0.003 |
|
| 3 | 0.72 | 1.00 | NA | 0.04 |
|
| 4 | 0.72 | 1.00 | 1.00 | NA |
|
| 5 | 0.72 | 1.00 | 1.00 | 1.00 | NA |
The p-values of each contrast are reported in four sub-tables for the four measurement factors analysed in the current study: measurement duration, sampling rate, filter cut-off frequency and filter order. When the p-value after a BH correction is significant (<0.01) it has been printed in bold.
Acknowledgements
The authors thank Naomi Miller from PNNL and Jennifer Veitch from NRC Canada for their advice on sensitivity curve modification. The authors also thank Yuqin Zong from NIST for assistance with the TLM measurements. This study is connected with IEA SSL Annex Interlaboratory Comparison IC 2023, the authors thank experts in IC 2023 Core Team and the IEA SSLC management team for their support.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was carried out as part of the TWINKLE project, financially supported by the Auvergne-Rhône-Alpes Region (grant number 00284356-1).
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
