Abstract
Electromyography (EMG) signal analysis is critical in understanding and treating work-related musculoskeletal disorders (WMSDs). Despite the increasing use of EMG signals combined with machine learning to assess biomechanical risks in various occupational settings, a significant shortage of extensive EMG datasets hinders progress. This shortage is primarily due to stringent data management plans and the limited availability of EMG datasets representing occupational tasks. To address this, our research leverages diffusion models to synthesize EMG signals tailored to manual material handling (MMH) tasks, aiming to enrich occupational EMG data repositories while maintaining data privacy. Using a conditional diffusion model with a residual U-Net architecture, we synthesized EMG signals for MMH activities such as pulling, pushing, and lifting. The synthesized data, evaluated across time and frequency domains, demonstrated fidelity to the original EMG signals, capturing distinct patterns and amplitudes characteristic of different tasks. Our findings highlight the potential of diffusion models in generating high-fidelity EMG data, providing a novel solution to the data scarcity challenge in occupational health research.
Keywords
Introduction
Electromyography (EMG) signal analysis is essential in the identification and treatment of work-related musculoskeletal disorders (WMSDs) as it provides insights into muscle function and fatigue (Campanini et al., 2022; Gazzoni et al., 2016). In recent years, there has been a rise in studies leveraging EMG signals, combined with machine learning models, to evaluate biomechanical risk exposures associated with WMSDs in various occupational settings (Donisi et al., 2023; Mudiyanselage et al., 2021). However, the success of these techniques relies heavily on access to extensive EMG datasets of occupational tasks, which are currently in short supply (Phinyomark & Scheme, 2018). This shortage can result in models that are overly tailored to the limited data, leading to overfitting issues that compromise the generalizability and effectiveness of ergonomic assessments and interventions (Alzubaidi et al., 2023; Bansal et al., 2022).
The reason for data shortage is twofold. First, data management plans aimed at protecting patient confidentiality usually limit the sharing and usage of data gathered from human subject (Pereira, 2020). These plans, while necessary for protecting personal information, pose significant challenges to researchers who require access to a broad range of EMG data for their studies. Secondly, while the majority of EMG datasets are dedicated to fields like sports science (Taborri et al., 2020), prosthetics and robotics (Atzori et al., 2014), and human-computer interaction (Sun et al., 2020), there is a noticeable shortfall in datasets specifically aimed at occupational tasks (Bassani et al., 2021). Therefore, there is a pressing need for innovative methods to enrich the repository of EMG datasets associated with workplace ergonomics, while strictly maintaining user data privacy.
Data synthesis, maintaining the statistical characteristics of the original data without collecting data from real-world events or individuals, provides an effective solution to the challenges posed by data shortage. (Park et al., 2018). Networks like Generative Adversarial Networks (GANs; Goodfellow et al., 2014) and Variational Autoencoders (VAEs; (Kingma & Welling, 2013) have been widely adopted for synthesizing new data (Barsoum et al., 2018; Pu et al., 2016; Qing et al., 2023). However, these generative networks often face challenges in generating time-series signals, such as EMG signals, due to issues like temporal coherence and the complexity of capturing dynamic physiological variations (Lin et al., 2020). In contrast, diffusion model, with its distinctive method of incrementally refining signals from noise, provides an innovative approach for generating time-dependent data (Ho et al., 2020). Its capability to effectively capture the intricate patterns and dynamics inherent in time-series data makes it potentially suitable for generating EMG signals.
This research aims to leverage the inverse process of diffusion models to synthesize EMG signals tailored to occupational settings. Specifically, we focus on widespread work scenario-manual material handling (MMH), characterized by frequent tasks such as pushing and lifting. The goal is to generate EMG signals conditioned on the input task type. This effort seeks to overcome the challenges associated with data privacy concerns and to enrich the pool of available EMG data for occupational studies.
Method
The training dataset used to construct the proposed diffusion model originated from a prior study (Chen, 2022). This dataset includes raw EMG data obtained from participants in four typical movements during MMH: pulling, pushing, squat/stoop lifting. The pulling and pushing tasks involved maneuvering a loaded handcart over a set distance, while the lifting tasks required adopting squat or stoop lifting techniques to raise a load slightly above the floor and maintaining these postures for a period. EMG signals were collected from the lumbar erector spinae. The raw data were first filtered with a fourth order Butterworth filter at cutoff frequencies 10 to 500 Hz, followed by segmenting the continuous EMG recordings into 3-second intervals using sliding windows techniques. These segments were transformed into 2D spectrum images via a short-time Fourier transform.
We then trained a conditional diffusion model with a residual U-Net architecture to synthesize EMG signals of different MMH tasks. This network architecture can efficiently capture long-term dependencies and temporal dynamics within the data due to its deep residual blocks. The diffusion model utilized 2D spectral images as both inputs and outputs, taking advantage of U-Net’s proficiency in image processing. The model’s loss function combined both spectral and temporal components: the spectral loss measured the differences between input and reconstructed spectral images; and the temporal loss aimed to reduce the discrepancies between the original time-series data and its spectral reconversion. After training, the output spectral images were converted back to 1-D signals, mimicking EMG signals in time domain among various activities.
Results
We developed an evaluation framework to analyze our synthesized data across both time and frequency domains. Recognizing that different MMH tasks produce distinct signal patterns and amplitudes, we conducted a class-specific comparison between our synthesized outputs and the original data. In the frequency domain (Figure 1), we observed that for both original data and synthesized data, “pull” and “push” exhibit periodic patterns, reflecting fluctuating muscle efforts required around the lumbar spine. “Squat lifting” and “stoop lifting” tasks are characterized by a broad uniform frequency distribution spanning from 10 to 200 Hz, mirroring the postural and muscle effort maintenance required during these tasks. The intensity of frequencies for “squat lifting” task is approximately 10−3, while for “stoop lifting” task is around 10−4.

Spectral comparison among four tasks.
The spectrums were then converted to the time domain for further evaluations (Figure 2): for both original data and synthesized data, “pull” and “push” signals demonstrate periodic fluctuations around a baseline, with ‘pull’ task marked by an amplitude of approximately 4 × 10−3 and “push” task by 2 × 10−3. Meanwhile, “squat lifting” and “stoop lifting” exhibit a uniform distribution around the zero point, with ‘squat lifting’ task reaching an amplitude of 10−2, significantly higher than “stoop lifting” at 10−3.

Time domain comparison among four tasks.
Through the visual evaluation, it appears the patterns and amplitudes of the synthesized outputs align well with those of the original data, confirming the fidelity of the proposed synthesis process in replicating EMG signal variations in both the frequency and time domains.
Conclusion
This study assessed the effectiveness of conditional diffusion models in generating high-fidelity EMG signals associated with MMH tasks. The results present a novel approach to address the significant challenge of data shortage in occupational health research by generating non-identified EMG signals. Aggregating these non-identified EMG signals could enhance the creation of a more comprehensive dataset covering various occupational tasks. It is important to note that in the current study, the evaluation of the synthesized data relied solely on visual inspection and comparison. Future research should also incorporate analytical scrutiny of the synthesized data using statistical metrics. Additionally, the potential of combining synthesized data with machine learning techniques for creating a robust biomechanical risk exposure assessment tool warrants further investigation.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This manuscript is based on work supported by the National Science Foundation under Grant #2024688.
