Sage Journals: Discover world-class research

Abstract

Trace-based measurements in self-regulated learning (SRL) require a careful balance between context and generalizability, while ensuring robust validity. This review of 58 theoretically grounded articles addresses three prevalent issues in SRL: contextualization, generalization, and validation. We seek to objectively define a context and examine how the design of software systems can facilitate generalizable trace capture and alignment of theoretical constructs. The learning environment, task type, and task duration are the key factors to consider while defining a context. We identify four categories of trace data useful for informing learning system design. Theoretical generalization is demonstrated by mapping trace data to constructs from two distinct SRL models—Zimmerman’s and Winne & Hadwin’s. We highlight that multichannel and multimodal data help capture enhanced trace data, while continuous measures, such as think-aloud protocols, are better suited than questionnaires to validate traces. We outline what validation in trace-based research should entail from the perspective of a modern validation framework and provide the best practices for balancing contextual depth with theoretical generalization in SRL research.

Keywords

computers and learning instructional technologies learning environments learning processes/strategies metacognition technology,measurements statistics self-regulated learning trace data generalization contextualization validation

Self-regulated learning (SRL) is a prominent construct of human learning that has been extensively researched in education (Panadero, 2017; Saint et al., 2022; Viberg et al., 2020; Zhang et al., 2021), and understandably so. Self-regulated learners are likely to succeed academically and are more hopeful about their future (Zimmerman, 2002). Its importance has been highlighted in lifelong learning as well; children and younger people with higher SRL skills are more likely to succeed compared to those with lower levels of self-regulation (Dignath & Büttner, 2008). From a motivational perspective, good SRL skills correlate with higher academic motivation and effective learning ability (Johnson et al., 2023; Pintrich, 2003).

In modern digital learning systems, SRL is often studied using a form of digital interaction data that is commonly referred to as trace data. Schraw (2010), in his taxonomy of measurements, lists trace data as an unobtrusive method of measurement that can be collected through active episodes of learning without consciously alerting the learner. In general, traces are observable evidence of learners’ self-regulatory processes generated in real-time as learners work on a learning task (Howard-Rose & Winne, 1993). The generation of trace data is approximately synchronous with the cognitive operations that the learner applies to the information in the working memory (Winne, 2010). Because of these reasons, trace data has been put forward as a modern means to unlock inferences about learners’ SRL.

One fundamental challenge that accompanies the measurement of SRL is the scope of context. Self-regulation is highly contextual (Winne, 2010), and students’ use of self-regulatory processes differs based on their learning environments (Viberg et al., 2020). While current research has started to explore what to trace (Viberg et al., 2020) and the methods to analyze SRL, these exploratory studies need to translate to confirmatory studies for testing theoretical models (Dawson et al., 2019; Winne, 2014). We require robust findings that can be investigated and that can eventually be sustained in multiple learning setups (Winne & Baker, 2013). We need to look for commonalities that exist between learning environments and how we can capture equivalent learning traces across different environments. Striking the right balance between contextualization and generalization is a requirement of the current time (Du et al., 2023; Quick et al., 2023). Although a number of systematic literature reviews have been put forward in the last decade focusing on the measurement of SRL using trace data (Du et al., 2023; H. Min & M. Nasir, 2020; Saint et al., 2022; Viberg et al., 2020; Wong et al., 2019), we have yet to identify themes of generalization across learning contexts. The closest one addressing this question is the one published by Du et al. (2023), which attempts to answer how indicators of SRL have been operationalized by trace data–based studies and the associated challenges. Their review found generalization and validity as two of the three major challenges in the measurement of SRL using trace data. The review by Viberg et al. (2020) is another relevant one, which was directed toward finding out the current state of research in the field of learning analytics to measure and support SRL in online learning environments. They identified that fewer studies have examined the reflection phase of SRL as compared to the forethought or performance phases. While most research has been directed toward the measurement of SRL rather than supporting it (Viberg et al., 2020), we are still far from identifying themes of generalization across studies in trace-data-based SRL research. Indeed, some efforts toward distinguishing generic SRL processes that researchers measure in SRL have come up (Du et al., 2023), but a lack of theoretical grounding in the field presents a hurdle to such efforts (Matcha et al., 2020). To address these challenges, we have kept theoretical alignment as a primary requirement for studies to be included in our current review. The review by Saint et al. (2022), on the other hand, focused on temporally focused analytics in measuring SRL. They primarily focused on the theoretical underpinnings of these studies and the choice of methods for the temporal modelling of SRL. They found several studies that combined constructs from different theoretical models to model their traces, as opposed to choosing a single theoretical model. The validity of measurement protocols in different study contexts, the generalizability of the findings, and the provision for triangulation with other measurements once again emerged as some of the areas that require emphasis in future studies.

Although previous literature reviews have acknowledged the issues surrounding generalization and contextualization in SRL, none have attempted to address them. To address this challenge of generalizability, the current systematic review compiles operationalizations used by researchers to represent higher-level SRL processes, supplemented with information about the learning contexts in which they were collected, thereby emphasizing the relevance of the context in each study. We performed a coding of traces across two well-known theoretical models in SRL, demonstrating the possibility of theoretical generalization and how combining theoretical perspectives from multiple models can enrich traces of SRL. We categorized traces based on their usage in a context, which can help learning system designers operationalize SRL with more informed traces. We also reiterate the issue of validity in SRL measurements, compile the contemporary ways of triangulation and validation, and offer our recommendations for the same. Researchers who work on challenges related to the interpretation of traces and the SRL constructs they might represent while setting up digital learning systems to effectively measure SRL will find this literature review particularly useful.

Background

Self-Regulated Learning and the Need for Theoretical Grounding

Self-regulated learning is a theoretical umbrella that encompasses cognitive, metacognitive, behavioral, and affective aspects of learning (Panadero, 2017). Over the past decades, researchers have introduced several theoretical models to conceptualize and understand SRL (Boekaerts, 2011; Efklides, 2011; Pintrich, 2000; Winne & Hadwin, 1998; Zimmerman, 1986). Puustinen and Pulkkinen (2001) and Panadero (2017) identified that these SRL models comprise three fundamental phases—(a) preparatory phase, where the learner analyzes the task and sets personal goals and plans; (b) performance phase, where the learner performs the task and monitors their progress; and (c) appraisal phase, where the learner assesses their attainment of set goals and refines their plans for further iterations of the task. These phases are loosely cyclical and recursive in nature, and each phase contains its unique sub-constructs in its respective model (Bernacki, 2017; Panadero, 2017). The conceptual differences among these theories primarily arise because of the difference in emphasis that the proponents of the theories put on different aspects of self-regulation in their respective theoretical models. For example, motivation finds primary emphasis in Zimmerman’s model (Zimmerman & Moylan, 2009) and Pintrich’s (2000) model, while affect or emotion is of special focus in Boekaert’s model (Boekaerts, 2011) of SRL (Panadero, 2017). When choosing a theoretical lens for empirical research, it is worth keeping in mind that each of these models provides alternative views to observe and understand the same learning process. The choice of a theoretical model should thus be motivated by the points of emphasis of the particular model and how well-suited it is for a particular research context or population of learners (Panadero, 2017).

A popular contemporary approach to capture SRL from an event perspective (Winne, 2010; Winne & Perry, 2000) is trace data, where researchers attempt to represent SRL constructs using the actions of learners (Winne, 2010). This fundamental challenge of mapping logged digital actions to higher-level educational constructs is sometimes referred to as going from clicks to constructs (Buckingham Shum & Deakin Crick, 2016). Most trace-based studies in SRL typically involves capturing these low-level clicks and aggregating them to proxies for their approximately synchronous SRL processes (Greene & Azevedo, 2010; Winne, 2010), with the assumption that a complete coding of the traces in a study will be adequate to comprehensively encompass the underlying theoretical model(s) to which the learners’ s self-regulatory behaviors conform (Greene & Azevedo, 2009). This top-down mapping of higher-level SRL constructs to lower-level clicks has been at the forefront of research into SRL in recent years, which has led several trace-based methodologies to come into the fray in a variety of contexts. Some of the most notable ones are pioneered by Winne et. al. (Bernacki et al., 2012; Hadwin et al., 2007; Jamieson-Noel & Winne, 2003) in reading, Azevedo et al. (Greene & Azevedo, 2009, 2010; Taub & Azevedo, 2019) through think-aloud studies in teaching human circulatory system, Aleven et al. (2006) in a math tutoring environment, Siadaty et al. (2016a, 2016b) at workplace, Biswas et al. (Paquette et al., 2020; Roscoe et al., 2013) in an intelligent tutoring system, Saint, Gašević, et al. (2020a), Saint, Whitelock-Wainwright, et al., (2020b) in an online course, and Fan, Lim, et al. (2022a; Fan, van der Graaf, et al., 2022b; Fan et al., 2023) for reading-writing.

It has been repeatedly emphasized that for trace-based research to progress, it is essential to have sound theoretical grounding (Dawson et al., 2019; Rovers et al., 2019; Winne & Baker, 2013). Measures in SRL should be closely aligned with the underlying models, which was often not the case in the review reported by Rovers et al. (2019). This may have contributed to the apparent lack of robust findings that empirically demonstrate benefits in promoting learning in all students, as noted by Winne and Baker (2013). While there has been considerable progress, the lack of refinement of and feedback into theories remains noticeable (Dawson et al., 2019). Dawson et al. (2019) posited that research in learning analytics is supposed to move through a series of maturation phases. In its most mature phases, learning analytics research is supposed to find its way into large-scale projects along with theory building and refinement. However, although there has been a surge in studies focusing on analyses, there has not been a proportionate contribution in terms of application in practice and theory building (Dawson et al., 2019). To overcome this gap, there is a need for a purposeful shift from strictly exploratory models to a more holistic approach to test out theoretical models (Dawson et al., 2019; Winne, 2017). In an effort to make such a purposeful shift, this review kept theoretical grounding as a primary criterion for the selection and review of articles.

Measuring SRL—Trace Data and Associated Challenges

Trace data allows measurement of SRL at a much finer granularity compared to other measures (Rovers et al., 2019). There are several other benefits of trace data as well: (a) Trace data can be captured reliably (Winne, 2020); (b) trace data collection is unobtrusive and does not interrupt a learner’s natural flow of activity; (c) trace data can capture finer temporal changes in learners’ self-regulation, unlike questionnaires (Winne, 2010); (d) trace data can be captured from ecologically valid, real-world learning contexts; and (e) trace data collection is scalable to large, real-world educational settings. A typical trace-based measurement protocol involves abstracting the raw computerized behavioral trace data and aggregating it to represent higher-level SRL processes. This aggregation of raw data can be done either by finding a summary measure of actions over time or by combining sequences of actions (B. Chen et al., 2018). For instance, the proportion of course units accessed before deadline has been interpreted to represent time management SRL process (Q. Li et al., 2020), and the learner reading the general task instructions/scoring rubric of the task, then navigating to read a page relevant to the goal for the first time, was interpreted to represent orientation (Srivastava et al., 2022). Researchers typically hypothesize a list of learner actions or a list of sequences of learner actions that are representative of SRL processes in their respective research contexts. In this review, we refer to such coding approaches as an action library or a pattern library, and broadly as a trace library. Such libraries are aimed at mapping out the entire timeline of learning into chronological self-regulatory processes. This aggregation of actions to a sequence of actions to higher-level SRL processes relies on the basic assumption that such aggregated trace measures complement self-reports and can be used to proxy for learners’ self-regulation (Winne, 2014).

Issues surrounding contextualization and generalization in SRL

Since low-level measurements to trace SRL in online environments are so dependent on the learning context, researchers often struggle to align these measurements with higher dimensions of SRL theories—theories that are generic and are not proposed with any particular learning context in view. The traces that are applicable while measuring SRL during a school classroom assignment in a computer-based learning environment may not be valid in a workplace setting, which involves tracing SRL over several months, despite both studies using the same underlying theoretical model. To reflect this impact of context, modern frameworks of SRL have placed enhanced emphasis on the interplay of context with individual learner characteristics that influence SRL (Ben-Eliyahu & Bernacki, 2015; Efklides, 2011; Winne, 2010). In such a scenario, identifying what a context in trace-based SRL entails becomes imperative. While a broad perspective on “context” in learning can incorporate distal factors like familial issues, emotional well-being, disciplinary nuances, or even governmental policies, we will limit our definition of context for this review to the factors most urgent and immediate to the learner at the performance of a learning activity, which Ben-Eliyahu and Bernacki (2015) would also call a “microsystem.” Such microsystems include the most situated affordances available to the learner during the task, the type of the learning system, the nature and requirements of the learning activity, and the time scale at which a researcher chooses to observe for their research. For our review, we look at the context of a study in three aspects: learning environment, task type, and task duration.

The time scale of observation of a learning activity influences the granularity of the trace data that researchers may choose to record (Ben-Eliyahu & Bernacki, 2015; Newell, 1994). Ben-Eliyahu and Bernacki (2015) revived and proposed the use of bands based on a time scale to measure SRL, an idea originally developed by Newell (1994). They proposed to conceptualize the time scale of observation in SRL as events happening in three kinds of bands on the logarithmic scale of base 10—cognitive (cognitive and metacognitive events, happening over the range of seconds), rational (learner strategies and adaptive metacognitive control strategies, occurring over several minutes to hours), or social bands (e.g., use of self-report questionnaires to identify level of liking for a class, evolving over several weeks or months). This choice of granularity in trace-based measurement of SRL also needs to be informed by theory and the time bands that they aim to capture (Rovers et al., 2019). Before attempting to generalize findings from SRL studies, researchers must consider the contextual nuances of each study, the purpose of integrating a feature in a learning environment, and task constraints. Only then can we focus on reevaluating insights from a study in different contexts and achieve eventual generalization, which remains essential for theory building and validation (Winne & Baker, 2013).

Validity issues in trace data-based measurements

Because self-regulation is a metacognition-heavy phenomenon, it seldom emanates overt, dependable, behavioral indicators that can be used to validate traces. Messick (1987) defined validity as “the integrated evaluative judgement of the degree to which empirical evidence and theoretical rationale support the adequacy and appropriateness of inferences and actions based on test scores.” There are three commonly considered types of validity: content validity, criterion-related validity, and construct validity. In this review, we focus on construct validity only, which is defined, in the context of measurement of SRL, as a set of concerns about whether the measurements, as they are operationally defined, actually represent the SRL processes that researchers intend to measure and no other phenomena (Winne & Perry, 2000).

The use of trace-data-based protocols relies on researchers’ interpretations, under the assumption that such protocols can represent theoretical self-regulatory processes. For example, Wong et al. (2021) used the trace number of course preparatory items accessed in a MOOC as a proxy for the SRL process of planning in Zimmerman’s model of SRL. This interpretation is fundamentally contingent on the notion that the learner visits the course preparatory item only when they intend to plan. But does a learner always undertake an action only when they enter a singular metacognitive state (Winne, 2020)? Is accessing a course preparatory item at a later point in the course also an indicator of planning? Will this interpretation hold in another MOOC or a different learning environment that contains no SRL prompt of the nature that was used in the study of Wong et al. (2021)? These are legitimate questions and deserve attention.

Traditional ways of validating trace libraries for SRL had involved correlating online behavior with survey instruments (Cicchinelli et al., 2018; Jamieson-Noel & Winne, 2003). These questionnaires are collected before or after a learning activity, making them a convenient way to correlate global measures of self-regulation with trace measures. Recently, we have seen the emergence of another class of validation studies that use concurrent measures like students’ verbal accounts (Bernacki et al., 2025; Fan et al., 2023) or students’ in situ self-reports (Salehian Kia et al., 2021). These approaches offer a temporal and granular measure of SRL and allow comparison of two channels of data collected throughout a learning activity, which a questionnaire collected at the beginning or end of a session cannot provide. So, which of these approaches is more suitable, and what are their benefits and limitations? We need to identify the best practices for validating trace-based studies in SRL.

Research Questions

Our review focuses on the possibility of generalizing trace-based codings across similar learning contexts and what such generalization efforts would entail. We also aim to identify ways of validating trace data-based operationalization. The research questions that drove this literature review are: (1) What theoretical models of SRL have been used for the trace-based SRL measurements? (2) In what contexts (learning environment type, task type, task duration) have these studies been done? (3) What kinds of features in the learning environment help measure and support the self-regulation of learners? (4) What operationalizations are used for mapping trace data to SRL processes? (5) How have the researchers investigated/improved the validity of trace-based operationalizations?

Methodology

Search Strategy, Filtering Criteria, and Initial Screening

We searched for the literature in two relevant databases—SCOPUS and Web of Science Core Collection. These databases are two of the largest bibliographic databases and virtually cover all the major journals and conference venues in the field. We experimented with several variations of suitable keywords to generate the set of relevant literature from the databases. Our final keyword search string, inclusion criteria, and the filtering process are provided in Figure 1. We also limited the search to only those articles that contained the words “self-regulated learning,” “self-regulation,” or similar keywords. The objective of this process was to capture all the articles that have created theoretically grounded, trace-based libraries to operationalize SRL as a whole and have demonstrated the applicability of their trace libraries in a population of learners. The first search was performed on July 1, 2021. We later refreshed our corpus with another couple of searches conducted on May 9, 2022, and August 02, 2023. The three searches combined yielded a total of 1325 unique articles. All the articles went through an initial phase of title and abstract screening, following which 477 articles emerged and fit for the second step of filtering, which involved full-text assessment of the articles. A total of 150 papers emerged suitable for further review after the full-text assessment in Step 2.

Figure 1.

Filtering process for our systematic literature review.

Filtering for Theoretical Relevance

The third step of filtering entailed assessing the theoretical grounding and relevance of each of the 150 papers retrieved from the second step of filtering. We developed a coding strategy where we assessed each article on two criteria: (a) whether specific SRL theory/theories is/are mentioned within the article as the grounding theory, and (b) whether the traces are mapped to SRL constructs with adequate justification. Based on the fulfilment of these two criteria, we categorized the papers into one of the following three categories:

High theoretical relevance: This category of papers fulfils both criteria (a) and (b). An example of papers in this category includes the work of Saint, Whitelock-Wainwright, et al. (2020b), who were explicit in highlighting Zimmerman’s SRL model as their theoretical basis, and all of their trace data-based coding schemes were explicitly detailed within their article.

Medium theoretical relevance: This category of papers does not explicitly state an established SRL theory as their theoretical basis (criteria [a]), but they map their actions/patterns to theorized phases/strategies of self-regulated learning (criteria [b]). For example, Roscoe et al. (2013) did not explicitly mention a specific theoretical model used for grounding their trace-data-based mappings, but rather acknowledged the generic three-phase conceptualization of SRL and used it for mapping trace data.

Low theoretical relevance: This category of papers neither explicitly mentions their choice of theory (criteria [a]) nor talks about the mapping and reasoning for operationalization in their articles (criteria [b]).

The filtering and coding in steps 2 and 3 were done by the first and second authors of this paper independently, and discrepancies were resolved through discussion. At the end of step 3, we categorized 42 papers into the high theoretical relevance category, 16 papers into the medium theoretical relevance category, and the rest of the papers into the low theoretical relevance category. We decided to keep the high and medium-relevance categories for our review, prompted by the concept of generic SRL models (Panadero, 2017; Puustinen & Pulkkinen, 2001; Saint et al., 2022), apart from the formal ones like Zimmerman’s. That brought the final count of the papers for review to 58, the complete list of which can be found in the appendix.

Coding of the Articles to Conform to the Dimensions of SRL Models (RQ4)

To investigate the potential of theoretical generalization of traces across models, we coded the SRL traces of our articles separately into Zimmerman’s cyclical model (Zimmerman & Moylan, 2009) and Winne & Hadwin’s COPES model (Winne & Hadwin, 1998). While Zimmerman’s model is the popular choice used for trace-based research in SRL (Saint et al., 2022; this review), Winne & Hadwin’s model is particularly prevalent in modeling self-regulated learning in computer-supported learning environments (Panadero et al., 2016). Winne & Hadwin’s model further represents some of the most contemporary thinking of this field, based on the relevant chapters in the recent editions of the handbook of educational psychology (Greene et al., 2024), the handbook of learning analytics (Winne, 2017), and the handbook of self-regulated learning and performance (Bernacki, 2017). Both models are unique and distinct in several aspects (Panadero, 2017), and generalizing the traces of the articles in our corpus across constructs of both of these models will indicate the generalizability of the theoretical mappings across multiple models, with interpretations from different studies capable of contributing to multiple SRL models.

According to Zimmerman’s model, SRL happens over three phases: forethought, performance, and self-reflection. Each of the three phases contains a set of categories of SRL processes in the second level of the model. Each of these process categories then includes specific lower-level SRL processes (Panadero & Alonso-Tapia, 2014). On the other hand, Winne & Hadwin’s model bifurcates the forethought phase into two and proposes that learning occurs in four phases: task definition, goal setting, enactment, and adaptation. Each phase is characterized by COPES—an interplay of the learner’s conditions, operations, products, evaluations, and standards, which evolve over the four phases. We reviewed the trace-based features used in each of the 58 studies included in our review and conducted two rounds of coding, once for each of the two theoretical models. For coding according to Zimmerman’s model, we reviewed the traces and first mapped them to one of the three phases, and then into one of the process categories within that phase. For coding to Winne & Hadwin’s model, we first categorized each trace-based feature into one of the four phases and then into one of the five facets of COPES that the trace represented. Two authors of this systematic review were involved in this coding process, both of whom have prior experience in the theoretical coding of trace data-based features. For both rounds of coding, after reviewing half of the papers, the authors reconvened to discuss the disagreements and settled on a single mapping for each trace-based feature after extensive discussions. The inter-rater agreements of the two reviewers for Zimmerman’s and Winne & Hadwin’s coding at that point were then measured by using Cohen’s kappa, and the agreements were substantial (k = 0.906 and k = 0.866, respectively). In coding these traces, we took careful consideration of the context of each study, as context influences how a trace could be interpreted. This entire process of theoretical coding forms the basis for answering our RQ4.

Overall Summary of the Findings for Our Research Questions

To answer our five research questions, we conceive a set of categories that summarize the overall findings of each research question. These categories are reported in Table 1. For RQ1, the code grounding SRL theory indicates the popular theoretical models that researchers chose to base their studies on in our review. Theory drives how researchers operationalize higher-level SRL constructs, and it is essential to get an overview of the popular theoretical models. RQ2 relates to the context in which a research study was done. Traces studied in the papers in our database come from different learning systems, in different learning setups, in different grain sizes, and over various time periods—all of which can impact how we observe SRL (Ben-Eliyahu & Bernacki, 2015; Winne, 2017). RQ2 is thus operationalized in three main aspects: (a) learning environment, which indicates the type of platform/software system used for delivering instruction; (b) task type, which aims to categorize what a learner was required to do during the research study. Different tasks come with their own set of affordances and constraints, which also influence the reasoning of trace libraries. Within the task type, we also report the actual task the students perform, the constraints within each task, and the task domain. Further, there is evidence that the choice of a theoretical model needs to suit the duration of the study (Nitta & Baba, 2015), and the duration of the task can further influence the granularity of trace data and interpretation of the same. Much can change if a research study is done over a series of sessions instead of a single learning episode (Winne, 2017). The code task duration was hence employed to reflect the importance of the duration of a learning task. RQ3 focuses on creating broader categories of traces that can help researchers set up learning environments to collect trace data and operationalize theoretical models of SRL. Schraw (2010) provides a taxonomy distinguishing and categorizing the measures that can be captured from computer-based learning environments (CBLEs). We took inspiration from their taxonomy to create the categories of trace data for answering RQ3, represented using the code type of trace, while also taking into account what each trace represents in the context of the learning environment and the learning task. The categorization also reflects the third wave of SRL measurements, where affordances within a learning environment serve to both measure and support SRL (Panadero et al., 2016). Since designers of digital learning systems and researchers are capable of customizing what and how much interaction data they can collect, the answer to RQ3 can benefit researchers in their a priori design choices and in determining which tools and features to include in their learning systems to capture cognitive and metacognitive processes through trace data. RQ4 concerns the actual features derived from the trace data and how they are interpreted and mapped to higher-level SRL processes. This is an important step to improve the validity of trace-based interpretations and a necessary step to generalize these studies so that researchers can gain insights from comparable trace libraries. We employed the code SRL dimensions, which enlists the dimensions of Zimmerman and Winne & Hadwin models, to which we manually categorized the trace-based features from each study in our review. RQ5 concerns the validation of trace-based operationalization. To answer RQ5, we extensively went through the full texts of the articles included in the review, aiming to understand the rationale behind the trace-data-based coding used by experts and the methods they used to gauge the validity of these codings. The code validation method summarizes the answers for this research question.

Table 1

Analysis of papers to address the research questions

Code	Categories
RQ1: What are the theoretical underpinnings of the studies?
Grounding SRL theory	Zimmerman, Winne & Hadwin, Pintrich, generic, combination, others
RQ2: In what contexts were these studies conducted?
Learning environment	MOOC platforms; LMS; CBLE, TELE, ITS, etc.
Task type	Online course, college course, session-based tasks
Task constraints	Time limit, multimedia content, pedagogical limitations, SRL prompts, animated agents, open-ended learning, etc.
Task domain	Programming, science, mathematics, management, physics, chemistry, biology, lesson plan design, medicine, gaming, etc.
Task duration	6–13 weeks; 45 min–4 hrs
RQ3: What kinds of traces in the learning environment help measure and support the self-regulation of learners?
Type of trace	Content interaction, tool interaction, meta-content interaction, external support interaction
RQ4: What operationalizations are used for mapping trace data to higher-level SRL constructs?
SRL dimensions	Task analysis, self-control, self-observation, self-judgement, self-reaction (Zimmerman model) Conditions, operations, products, evaluations, standards (Winne & Hadwin Model)
Tracing cycle	Step, session, semester
RQ5: How have the researchers ensured the validity of trace-based operationalizations?
Validation method	Expert reasoning, triangulation with other sources

Results

RQ1—Grounding SRL Theory

The studies show an inclination toward Zimmerman’s model of SRL, with 12 of the papers opting for the model for its theoretical basis. Winne & Hadwin’s model is another popular choice of grounding SRL theory, with nine studies opting for it. Three of the papers used Pintrich’s SRL theory to inform their choice of operationalizations of trace data. A summary of these results is reported in Table 2. Some of the studies (e.g., Fan et al., 2021; Huang et al., 2023; Quick et al., 2023; Rakovic et al., 2022; Siadaty et al., 2016c) have mentioned highly cited SRL models like that of Siadaty’s micro-level SRL framework (Siadaty et al., 2016a) or Greene and Azevedo’s SRL framework (Greene & Azevedo, 2009) as their grounding theory. Although most of these models are themselves spinoffs of the Zimmerman or Winne & Hadwin models, we have included them under “others” for this review. The majority of papers have, however, opted to map their traces to the generic phases of SRL (planning, enactment, and reflection) rather than explicitly going for a single established theoretical model. These papers have been included under the generic category. Notably, a few studies combined multiple dimensions from multiple theoretical models to inform their study. For example, J. Zheng et al. (2019) combined Winne and Hadwin’s (1998) model and socially shared regulated learning (SSRL) models like Hadwin et al.’s (2018) model for their computer-supported collaborative learning (CSCL) study. These papers have been listed under the combination category.

Table 2

Studies included in the systematic literature review, their grounding theories (RQ1), and their contexts (RQ2)

Studies	Grounding SRL theory						Learning environment			Task type			Task duration
Studies	Zimmerman	Winne & Hadwin	Pintrich	Generic	Combination	Others	MOOC	LMS	CBLE/ITS/App	Academic course	Session-based task	Workplace	5–18 weeks	45 min-2.5 hrs
Ali and Hanna (2021)		X						X		X			X
Bernacki et al. (2012)					X				X		X			X
Bouchet et al. (2012)		X							X		X			X
Cerezo et al. (2020)					X			X		X			X
K.-Z. Chen and Li (2021)				X				X		X			X
Cicchinelli et al. (2018)					X			X		X			X
Fan et al. (2021)	X							X		X			X
Fan, Lim, et al. (2022a)		X							X		X			X
Fan, van der Graaf, et al. (2022b)						X			X		X			X
Fan et al. (2023)						X			X		X			X
Greene et al. (2021)					X			X		X			X
Guo and Trainin (2022)				X				X		X			X
Hadwin et al. (2007)			X						X		X			X
Hatala et al. (2023)								X		X			X
Huang and Lajoie (2021)						X			X		X			X
Huang et al. (2023)		X				X			X		X			X
Jansen et al. (2020)	X						X			X			X
Jamieson-Noel and Winne (2003)				X					X		X			X
Kim et al. (2018)					X			X		X			X
Lan et al. (2019)	X						X			X			X
Leite et al. (2022)					X				X		X			X
S. Li et al. (2018)				X					X		X			X
Q. Li et al. (2020)			X					X		X			X
S. Li et al. (2021)				X					X		X			X
Lim et al. (2023)		X							X		X			X
Maldonado-Mahauad et al. (2018)					X		X			X			X
Min & Jingyan (2017)				X			X			X			X
Ng et al. (2023)				X				X		X			X
Nguyen and Ikeda (2015)				X					X		X			X
Paans et al. (2019)					X				X		X			X
Paquette et al. (2020)				X					X		X			X
Poquet et al. (2023)				X				X		X			X
Qiao et al. (2021)	X							X		X			X
Quick et al. (2023)						X		X		X			X
Rakovic et al. (2022)						X		X		X			X
Rizki et al. (2022)								X		X			X
Roscoe et al. (2013)				X					X		X			X
Saint, Gašević, et al. (2020a)						X		X		X			X
Saint, Whitelock-Wainwright, et al. (2020b)						X		X		X			X
Salehian Kia et al. (2021)		X						X		X			X
Siadaty et al. (2016b)						X			X			X	X
Siadaty et al. (2016c)						X			X			X	X
Srivastava et al. (2022)		X			X				X		X			X
Sun, Liu, et al. (2023a)	X							X		X				X
Sun, Tsai, et al. (2023b)				X					X	X				X
Taub and Azevedo (2019)		X							X		X			X
Taub et al. (2022)		X						X		X			X
Tang (2021)	X						X			X			X
Wang et al. (2022)				X					X		X			X
Wang et al. (2023)	X								X		X			X
Warden et al. (2022)	X								X		X		X
Wong et al. (2021)	X						X			X			X
Yang and Song (2022)	X								X	X			X
Ye and Pennisi (2022)			X					X		X			X
Zhang et al. (2021)	X							X		X			X
Zhang et al. (2022)	X							X		X			X
J. Zheng et al. (2019)					X				X		X			X
J. Zheng et al. (2020)				X					X		X			X
Total:	12	9	3	14	10	10	6	24	28	32	24	2	33	25

RQ2—Context of Studies

Learning environment

The kinds of learning environments used are fairly distributed in nature. Six of the studies have been done exclusively within massive open online courses (MOOCs) on MOOC platforms. Twenty-four studies were conducted on learning management systems (LMS), which were mostly used in the context of courses in higher education. The remaining 28 studies were done on platforms like open-ended learning environments (OELE) and intelligent tutoring systems (ITS). Some examples of these systems are MetaTutor (Taub & Azevedo, 2019), Betty’s Brain (Paquette et al., 2020), and Learn-B (Siadaty et al., 2016c). Most of these CBLEs are bespoke learning environments that have been set up and optimized specifically to study self-regulated learning. The study by Yang and Song (2022) is a notable one, which used mobile apps during a vocabulary learning task. The summary of these findings is listed in Table 2.

Task types

The nature of a task, its objectives and facilities influence the self-regulation of a learner and need to be studied in closer detail. An overview of the distribution of the studies based on task type is presented in Table 2. A total of 32 studies were conducted within academic courses, typically extending over a semester. These studies include those conducted in MOOCs and formal courses in higher education. We found a number of studies that investigated SRL over a specific period, often around assignments in the course, rather than investigating SRL over the entire course (Salehian Kia et al., 2021; Wise & Hsiao, 2019). Such studies were also categorized under academic courses for this review. A total of 24 studies were conducted where self-regulation was investigated over a session or a set of sessions. These tasks range from multisource essay writing (Srivastava et al., 2022), concept mapping after reading (Paquette et al., 2020; Roscoe et al., 2013), an intelligent tutoring system to diagnose a virtual patient (S. Li et al., 2021; Wang et al., 2023), interacting with an ITS while learning the human circulatory system (Bouchet et al., 2012; Taub & Azevedo, 2019), game-based learning environments (Warden et al., 2022), creating lesson plans for students (Huang & Lajoie, 2021; Huang et al., 2023), vocabulary learning on a mobile app (Yang & Song, 2022), among others. A couple of studies were also done in a professional workplace learning context (Siadaty et al., 2016b, 2016c).

Task constraints

Task constraints serve as mental boundaries that shape how learners approach and carry out a task. Even in relatively structured settings like online courses, learner behavior can be influenced by the specific nature of these constraints. Mandatory assessment attempt in each unit of a course (Taub et al., 2022; Zhang et al., 2021, 2022), whether all course materials are released at the start of the course (Q. Li et al., 2020) or not (Kim et al., 2018), how many graded assessments happen during a course (Chen & Li, 2021; Qiao et al., 2021), and whether the modules in a course are sequential (Quick et al., 2023) are all idiosyncrasies of courses that influence the cognitive and metacognitive activities of a learner. These constraints are even more varied in bespoke learning environments (Fan et al., 2023; Paquette et al., 2020; Taub et al., 2022; Wang et al., 2023). A list of constraints accompanying each study in the articles is included under the column “Task constraints” in our supplementary material.

Task domain

Using affordances in a task often requires detailed, domain-specific reasoning, which highlights the importance of considering the task’s domain. The studies in our review span a wide range of knowledge domains, ranging from courses in biology (Greene et al., 2021; Rakovic et al., 2022), project management (Tang, 2021), dentistry (Lan et al., 2019), and agriculture (Ye & Pennisi, 2022) to session-based activities requiring knowledge of physics (Jamieson-Noel & Winne, 2003), environmental science (Roscoe et al., 2013), artificial intelligence and education (Srivastava et al., 2022), medicine (J. Zheng et al., 2019), and business (Warden et al., 2022). A full list of the domain knowledge concerned in each task is listed under “Task domain” in the supplementary document.

Task duration

A total of 25 studies were conducted where the task was completed in a single session or as a series of small sessions. The duration of these tasks ranged from 60 minutes to 4 hours. A number of studies focused on reading and comprehension tasks, which were typically followed by a post-test requiring learners to answer questions based on the material they had read (Bernacki et al., 2012; Fan, Lim, et al., 2022a; Fan et al., 2023; Jamieson-Noel & Winne, 2003; Taub & Azevedo, 2019). Some reading tasks involved creating concept maps after reading the content (Paquette et al., 2020; Roscoe et al., 2013). On the other hand, longer duration tasks ranged anywhere between 5 to 18 weeks. These learning tasks were generally courses in higher education or MOOCs, or studies conducted within these courses. An exception here is the study by Rizki et al. (2022), who collected data from a countrywide learning platform for nine months, but there is no mention of the duration of the courses themselves within their article. Another exception is the study of Yang and Song (2022) on school students’ vocabulary learning, which lasted for 7 months. The findings are summarized in Table 2.

RQ3—Types of Trace Data for Measuring and Supporting SRL

We categorized the traces into four types based on the learning environment design and the purpose they served in their tasks:

Content interaction: Most of the studies we reviewed required learners to read or engage with content and complete related activities, which characterizes this category of trace data. For example, viewing instructional materials, submitting answers (Qiao et al., 2021), and viewing video lectures (Min & Jingyan, 2017) are examples of content interactions necessary for completing a learning task. Note that some content may be optional or supplementary (e.g., optional assessment (Zhang et al., 2021, 2022), supplementary readings (Rakovic et al., 2022)). Some meta-level features were also put into this same category based on the content they were referencing, which include traces like course items completed on time (Wong et al., 2021), login and logout of sessions (Kim et al., 2018), attended class meeting (Greene et al., 2021), and off-task behaviors (Fan, van der Graaf et al., 2022b; Sun, Liu, et al., 2023a).

Tool interaction: Schraw (2010) refers to “palette choices” as an unobtrusive measure where the learner is given access to a palette or list of resources in the learning environment. These resources or tools are typically optional. Purposeful inclusion of such tools within the digital learning environment can cater to the convenience of the learners and enable researchers to capture traces that represent cognitive and metacognitive processes (van der Graaf et al., 2021). Tool interactions comprises interactions with such tools in a learning environment. Examples include traces of text annotations tagged as comment, confusing, errata, question, important or interesting (Fan et al., 2021), accessing the timer to check remaining time (Fan, Lim, et al., 2022a; Sun, Liu, et al., 2023a), search function to search notes or content (Fan et al., 2023), or bookmarks (Huang & Lajoie, 2021).

Meta-content interaction: Some content in a learning environment may not necessarily be a direct way to disseminate knowledge content. Rather, they contain information about the main content, which helps learners plan and orient their strategies. We call this category of trace data meta-content interactions. In the study of Wong et al. (2021), interactions like visit to course overview page, visit to course information page, and visit to grade information page in their MOOC are examples of this kind of trace. This category of traces has often been used as a means to make latent metacognitive processes overt.

External support interaction: Learning environments that facilitate complex tasks often contain external support or help, which can act as a facilitator for problem-solving. One of the most common forms of such support is the discussion forum, which has an almost ubiquitous presence in online courses. Discussion forums (Lan et al., 2019) or an external library of additional information (S. Li et al., 2018) can help learners access more support while performing their tasks. Another kind of external support is hyperlinks to additional sources of information. Then Study CBLE (Bernacki et al., 2012) offers the feature of clicking on a term to obtain its definition. While often optional, these features offer additional support to learners who choose to use them, enabling researchers to link such interactions to higher-level SRL processes like help-seeking (Jansen et al., 2020).

Researchers often characterize similar interactions in different ways to represent distinct higher-level SRL processes. For example, interacting with a page might be classified as thorough reading, skimming (Bouchet et al., 2012), or scrolling back (Jamieson-Noel & Winne, 2003), each reflecting a different SRL process. A few examples of traces under each category, along with their nature of interactions, are given in Table 3. The complete list is provided as a supplementary document.

Table 3

RQ3: Traces in learning environments to measure and support SRL

Category	Freq.	Task type	Type of trace	Example
Contentinteraction	54	MOOC	Access course item, repeat course item	(Wong et al., 2021)
		College course	Access lecture notes, access additional reading, viewed homework without submitting	(Greene et al., 2021)
		Session-based task	Skimming text, reading text	(Bouchet et al., 2012)
		Session-based task	Diagram AOI, text AOI	(Taub & Azevedo, 2019)
Meta-contentInteraction	37	College course	Dashboard access, access index page, access schedule, access learning objectives	(Saint, Whitelock-Wainwright, et al., 2020b)
Meta-contentInteraction	37	Session-based task	View table of contents, view learning goals	(Taub & Azevedo, 2019)
Toolsinteraction	29	College course	Annotation with tags	(Fan et al., 2021)
		College course	Embedded tool for viewing course reserves	(Rakovic et al., 2022)
		Session-based task	Update note, update glossary, highlight, link note to external information	(Hadwin et al., 2007)
Externalsupportinteraction	32	MOOC	Access discussion forum	(Wong et al., 2021)
		Session-based task	Click on a link to obtain the definition of a term	(Bernacki et al., 2012)
			Requesting hints to understand a term	(Huang & Lajoie, 2021)

The words in italics represent the feature in the learning environment, and the complete phrase represents the trace that was used to measure SRL process.

Chunking of content into multiple modules

We have noticed that researchers often deliberately chunk their content into multiple modules or pages, which generates traces when learners navigate between these pages. These design choices help researchers to theoretically hypothesize SRL behaviors even before the commencement of data collection. For instance, chunking introductory content into pages like overview, guidelines, detailed specifications, marking criteria, and how to submit can help researchers operationalize SRL processes based on what the learner was focused on at the time, for how long, and their sequence of actions (Salehian Kia et al., 2021). Had this content been on a single page, it would have required additional software capabilities or other modes of data to capture such processes. A similar approach was seen in Srivastava et al. (2022), where they chunked their content into topical sets of readings. This allowed them to understand a learner’s judgment based on their relevant and irrelevant reading.

Different pedagogies prompt different design choices

The pedagogical choices for delivering instruction can impact the nature of self-regulation, and in turn, may influence what traces we can and may want to capture. We can take the example of the studies by Zhang et al. (2021, 2022), who utilized online mastery-based learning modules (OLMs) to deliver their course, to illustrate this point. A series of OLMs was designed to teach a particular topic. Each OLM was optional, containing an assessment and an instructional component. The learners were presented with the assessment at the start, where they could make an initial attempt at the test and receive a score. The learners were then allowed to go through the instructional component and could take repeated attempts at the assessment component later to improve their scores. From this context, Zhang et al. (2022) used traces like accessing the learning materials after the first mandatory attempt and is the first attempt short as indicators of planning. These interpretations of traces only work where OLMs or pedagogy with a similar instructional format is used. Another example of how pedagogy can influence trace capture is the study by Wong et al. (2021), which used video prompts in their MOOC video content to stimulate self-regulation. The course videos contained questions that were designed to instigate planning, monitoring, or reflection in learners. Another example is that of Ali and Hanna (2021), who used a hybrid learning structure in their college course. Half of their learning tasks were conducted in class, and the other half asynchronously on Moodle. In such a scenario, where only 50% of learning interactions are expected to be captured digitally, researchers have to take the pedagogy into consideration while conceptualizing their traces. Fan et al. (2021) and Saint, Whitelock-Wainwright, et al. (2020b) used a flipped classroom; Cicchinelli et al. (2018) used blended learning; J. Zheng et al. (2019) used a collaborative inquiry-learning environment; and Siadaty et al. (2016b, 2016c) conceptualized traces for a workplace environment. These are all examples to show how the nature of instruction influences not only what traces to capture and features to create but also how to operationalize them to model self-regulated learning.

RQ4—Trace Data Mapping to Higher-Level SRL Constructs

To present our results of the coding of traces to map to the constructs of Zimmerman’s and Winne & Hadwin’s models, we represent each trace in our studies as a 3D tuple:

Trace = {Phase, Process Category / Facet, Process}

Phase ∈{Forethought, Performance, Self-Reflection}, for Zimmerman Model

∈{Task Definition, Goals & Plan, Enactment, Adaptation}, for Winne & Hadwin

Process category/Facet ∈ {Task Analysis, Self-Motivation Beliefs, Self-Control, Self-Observation,

Self-Judgement, Self-Reaction}, for Zimmerman Model

∈{Conditions, Operations, Products, Evaluations, Standards}, for Winne & Hadwin

Process ∈ the set of SRL processes used in the studies in our corpus

The first element of the tuple represents the phase of the task that the trace is representing in its respective theoretical model. The second element represents the process category (for Zimmerman’s model) or the facet of COPES (for Winne & Hadwin’s model). The third element represents the micro-level process that is being operationalized using the trace, as mentioned by the authors in their respective articles. A few examples of these studies categorized into this three-dimensional taxonomy can be found in Table 4. However, we strongly recommend that our readers review the comprehensive table included as a supplementary document, which comprises all the trace-based operationalizations of the articles included in this review. The interpretations form an important part of theory-driven, trace-based conceptualizations and our compilation is meant to help researchers toward this end.

Table 4

Examples of SRL processes across theoretical models (RQ4)

SRL process	Related processes from our articles	Definition
Task analysis and orientation	• Task analysis (Fan et al., 2021; Siadaty et al., 2016b, 2016c; Wang et al., 2023)• Task exploration (Maldonado-Mahauad et al., 2018)• Perception (Qiao et al., 2021)• Orientation (Srivastava et al., 2022)	Analyze and explore the learning task for the first time
Planning	• Planning (Cerezo et al., 2020; Taub et al., 2019;)• Making personal plans (Saint, Gašević, et al., 2020a; Siadaty et al., 2016a)	Planning for the upcoming task
Goal setting	• Goal setting (Fan et al., 2021; Huang et al., 2021)	Set definite learning goals for the task
Information gathering and knowledge acquisition	• Information interpreting (J. Zheng et al., 2019)• Collecting evidence (S. Li et al., 2021)	Acquire task knowledge relevant for the upcoming task
Strategizing	• Strategizing (Taub et al., 2019)• Task strategy (Lan et al., 2019)• Applying appropriate strategy changes (Siadaty et al., 2016b)	Coordinating learning activities to achieve set goals
Time management	• Time management (Jansen et al., 2020)• Time investment (Kim et al., 2018)• Procrastination (Cicchinelli et al., 2018)	Managing time constraints
Monitoring	• Monitoring (Rizki et al.., 2022)• Metacognitive monitoring (Rakovic et al., 2022)• Self-monitoring (Wong et al., 2021)	Comparing the alignment of own processes with goals and constraints
Control and regulation	• Control (Tang, 2021)• Effort regulation (Q. Li et al., 2020)	Adjusting learning processes to align with goals and constraints
Cognitive processes and operations	• Cognitive processes (Taub et al., 2019)• Learning (Jamieson-Noel & Winne, 2003)• Low/high cognition (Fan et al., 2023)	Processes students engage in to facilitate and progress the task
Help-seeking	• Help-seeking (Ali & Hanna, 2021)• Cognitive help-seeking (Greene et al., 2021)	Seeking overt help from available sources
Environmental structuring	• Environmental structuring (Ali & Hanna, 2021)	Creating a congenial environment that can facilitate the performance of the activity
Information processing	• Information organization (J. Zheng et al., 2019)• Hypothesizing (S. Li et al., 2021)	Acquiring information and processing it for the progress of the task
Reflection	• Reflective thinking (J. Zheng et al., 2019)• Self-reflection (S. Li et al., 2018)	Reflecting on one’s own performance
Evaluation and assessment	• Evidence evaluation (Wang et al., 2023)• Hypothesis evaluation (Wang et al., 2023)• Self-judgment (Yang & Song, 2022)	Compare performance against set goals or given criteria
Adaptation	• Adapting (Hatala et al., 2023)• Adjust (Warden et al., 2022)	Adjust strategies to refine the tangible product created and upgrade performance

Coding the articles for the first level of the models (phase level) was easier; our filtering process for this review ensures that each article explicitly indicates the phase of a theoretical model (preparatory, performance, or appraisal) from which the trace is generated. The next level of coding is where it became trickier; although Zimmerman’s model provides a specific and exhaustive list of processes within its process categories, the nomenclature of the processes used by the researchers in their articles is not necessarily the same as the ones provided by Zimmerman and Moylan (2009). We resolved this by coding them into the process category based on their theme and interpretation. For instance, Rakovic et al. (2022) interpreted the trace viewing calendar events as a course information gathering process, an indicator of the preparatory phase. As this process has no literal equivalence in Zimmerman’s model, we code it under strategic planning in the task analysis process category, as the process involves analyzing and planning for the course. This coding process was even trickier for Winne & Hadwin’s model of coding; no paper, even those that mention Winne & Hadwin’s model as their theoretical grounding, makes direct reference to which facet of COPES it traces. For the same example of course information gathering, we coded it to the Conditions facet of Winne & Hadwin’s model, as visiting the course information pages for the first time is likely to update the conditions of the learner.

It has been identified in prior reviews that the performance phase receives more focus in the investigations in SRL (K.-Z. Chen and Li, 2021; Tang, 2021; Viberg et al., 2020), and this finding is reiterated in our review as well. The performance phase has been investigated in all the articles reviewed. A possible reason for this could be that the cognitive actions during this phase become more overt and can be traced with relative ease. On the other hand, the appraisal phase is the least investigated phase, which was highlighted in the review by Viberg et al. (2020) as well. This, in the case of Winne & Hadwin’s model, can be attributed to the peculiarity of the adaptation phase, as the definition of adaptation in Winne & Hadwin’s model is not entirely equivalent to reflection (Greene & Azevedo, 2007).

While compiling the trace-based operationalizations and their corresponding interpretations, we took special care in keeping the context of their studies in view. A list of the SRL processes identified across the studies is given in Table 4.

Use of various tracing cycles in modeling SRL

In trace-based research, the broad goal of learning analysts is to model SRL from the start to the end of a learning task. Often dictated by their research goals, researchers choose different grain sizes to model out the learning task (Ben-Eliyahu & Bernacki, 2015; Rovers et al., 2019; Winne, 2017). This modeling usually starts from the atomic interactions left by the learners in the system in the wake of their learning activity. The atomic interactions are then mapped to SRL processes. Examples include view podcast video (control), post article (evaluation) (Qiao et al., 2021), edit discussion topics (reflection) (Tang, 2021), and set and update progress (evaluation) (Nguyen & Ikeda, 2015). Sometimes, even a pattern of actions such as looking at content AOI and then at the notes AOI (strategizing) (Taub & Azevedo, 2019) or identifying a learning gap by reading and then annotating as “confusing” (goal setting) (Fan et al., 2021) is also used. This approach models a step in a learning activity. This is then often followed by advanced analytical methods to model out the entire learning activity, usually with the temporal aspects of SRL in consideration (Nath et al., 2024; Saint et al., 2022). Another way to model this is to find an aggregate measure of these steps over a session or an entire semester. It could be aggregating over learning activities happening over a single session, such as frequency and duration of visits to review and conclusion pages (reflection) (Paans et al., 2019) in a 45-minute activity (using steps to model a session), or it could be modeling an entire semester over several sessions as in fraction of engaging attempts in the initial attempt in an OLM assessment module (planning) (Zhang et al., 2021) during a semester-long course (modeling semester using sessions, which was modeled using steps). A few of such other examples of modeling a semester (or broadly any activity stretching over an extended period of days) using aggregated measures include proportion of course units accessed before deadline (time management) (Q. Li et al., 2020) and fraction of short attempts among all attempts (limited planning/lower quality study strategies) (Taub et al., 2022). The choice of tracing cycles as a step, session, or semester in the articles included in our review is presented in Appendices A and B.

Use of supplementary tools and software to capture additional traces

A few learning environments have used multiple tools or software to capture trace data. Fan et al. (2021) integrated Alexandria (to host learning materials and activities) and Hypothes.is (a tool to facilitate annotations and highlighting in e-books) with the base Moodle platform. Greene et al. (2021) used the Piazza discussion forum along with their LMS. Salehian Kia et al. (2021) also linked an integrated development environment (IDE) for programming within their LMS. These integrated software systems enabled researchers to capture enhanced trace data that would have otherwise required extensive developmental changes in the native learning system.

RQ5—Methods of Validating Trace Data

We were able to identify two major approaches to validate trace data: (i) expert reasoning and (ii) comparing with other modes of data. A vast majority of the studies relied on expert reasoning for the validation of traces. For instance, Tang (2021) designed content pages corresponding to each SRL phase in Zimmerman’s model for their MOOC study and mapped traces based on task structure and system features that fostered self-regulation. It is to be noted that for our review, studies that used trace-based operationalizations from past studies also come under the category of expert reasoning, since they still indirectly rely on the reasoning of experts. One such example is that of Wong et al. (2021), who cited previous works to justify their choices of operationalization. Researchers who use data-driven techniques look for emerging behavioral patterns in the empirical data and use expert rationale to map those patterns to SRL theory using a bottom-up approach (K.-Z. Chen & Li, 2021; Maldonado-Mahauad et al., 2018) (see the section Theory-driven and data-driven approaches can be combined for creating more comprehensive trace libraries for more on data-driven techniques).

The second approach for validation involves comparison with multimodal data sources. This method involves corroborating digital traces with another measurement of SRL that has been historically relied upon as a valid interpretation of SRL, such as think-aloud protocols, self-report questionnaires, and semistructured interviews. Examples include Fan, van der Graaf, et al. (2022b) and Bernacki et al. (2025), who used think-aloud protocols to validate their trace libraries. In the case of Fan, van der Graaf, et al. (2022b), both trace data and think-aloud data were collected over a 45-minute writing task. They derived theoretically driven SRL processes from trace data and think-aloud data separately, synchronized them on the 45-minute timeline, and measured the match rates of the two channels to check the alignment of SRL processes identified from both channels. Their initial attempt gave them a median match rate of only 38.97%. They enhanced this match rate by improving their trace data library with data-driven approaches, but only up to 54.24%. This gives us an idea of how even theory-driven trace data may be susceptible to validity concerns and how keeping a critical view of the measurements and additional data triangulation efforts is necessary to improve the trace libraries. However, it must be acknowledged that a 100% alignment of trace data and think-aloud data is not realistic, as they may measure different aspects of SRL (Rovers et al., 2019). Also, there is a possibility that some digital events may co-occur with multiple verbalized SRL processes as identified by Bernacki et al. (2025), who undertook their validation study in a learning task sampled from a classroom course in a laboratory setting. Also, think-aloud, like any other measurement, suffers from limitations, including validity concerns (Young, 2009). Further, it is worth mentioning that Fan, van der Graaf, et al. (2022b) managed to detect SRL processes from 96.81% of a 45-minute task using trace data, compared to only 58.34% with think-aloud. It is an overestimation to expect that a learner will continuously verbalize their thinking, especially in long learning sessions, and trace data can overcome this limitation and help to capture much more user actions at a much finer granularity. The vast advantages of trace data outweigh its current limitations.

Rather than asking students to verbalize their actions, Salehian Kia et al. (2021) asked the students to self-report their SRL phase periodically during their activity, which they later used as ground truth. The learners were asked to report every 20 minutes in the learning environment about what they were doing at that moment (planning, enacting, or adapting), and they measured the agreement of these in-situ self-reports and trace data using Cohen’s kappa. On the other hand, Jamieson-Noel and Winne (2003) compared trace data with post hoc self-report surveys on specific SRL strategies and tactics and found considerable differences between the two. Ye and Pennisi (2022) compared their LMS trace data to self-reported questionnaires collected before and after the course. The LMS trace data correlated better with the post-course self-reports. However, there was again very little correlation between the SRL scales of both measurements. Only goal-setting showed correlations between the self-report and the trace data. Other scales showed none and, in some cases, even negative correlations. Cicchinelli et al. (2018) also looked for correlations with pre-course questionnaires and found positive correlations with only the frequency of planning and monitoring processes with trace data.

In summary, studies that have attempted to validate their trace libraries so far using other data sources have reported considerable mismatches between SRL processes derived from trace data and their second data channel. These mismatches were attributed to several factors, including faulty interpretations or over-interpretations (Fan, van der Graaf, et al., 2022b). An example of such misinterpretation is “highlighting during reading,” which was initially interpreted as a high-cognition process (elaboration/organization) but was later amended to a low-cognition process (reading). Additional data-driven efforts helped in fixing these theory-driven trace misinterpretations. This is an example of how a combination of theory-driven and data-driven analyses can improve the validity of our trace-based studies. Among other reasons for reported mismatches between self-report questionnaires and trace data, Salehian Kia et al. (2021) posit that distinguishing planning and adapting phases from an adjacent phase (i.e., the enactment phase) requires more information. They also found that real-time endorsements of highly self-regulated learners tended to disagree with theory-driven trace indicators of the adapting phase, which they reported as enactment. Ye and Pennisi (2022) conducted cluster analysis and semistructured interviews to delve deeper into the reasons for mismatches between self-reports and digital trace data. However, contrary to Salehian Kia et al. (2021), they found that in learners with high self-regulation, self-reports and trace data correlated better than in learners with low self-regulation. Jamieson-Noel and Winne (2003) attributed the mismatches in their study to students’ different criteria for self-reporting as compared to the experts’ reasoning for the corresponding study tactic. Students may not accurately report the use of a tactic after the study. Jamieson-Noel and Winne (2003), based on a prior study (Howard-Rose & Winne, 1993), also mention that students may have a better understanding of their overall study strategy when asked in retrospect in a questionnaire, compared to their granular study tactics. This issue/opinion about the grain size of data for analysis of SRL has been revisited in a recent review (Rovers et al., 2019).

Discussion

The findings of our review indicate that the challenge of balancing contextualization and generalization in trace-based measurements of SRL must be considered from the very beginning of any trace-based research, even before data collection begins. This balance can be achieved in two broad aspects—design and theory. Design of a learning environment that is deployed in a real-world setting (such as a MOOC) should aim to achieve a generic setup that can be replicated by other researchers across similar settings. That will help researchers understand self-regulation at play in authentic situations, which can help in conceptualizing traces that can be reused in other similar learning setups. Bespoke learning environments designed to prompt SRL behavior often come with enhancements that may not be characteristic of a real-world system. The design of such systems, while being contextual, can still be generic in the type of traces that they capture. The category of traces that we identified in our answer to RQ3 can help designers of such systems conceptualize trace data in their learning environments. It is worth noting that categorization of these interactions is also dependent on the instructional design and the design of the learning system. Interaction with a page of text offered as content material may not be equivalent when the same content is provided by a peer on a discussion forum, and our proposed categories of traces will be able to identify such differences. Suitable theoretical grounding depends on the extent to which a learning environment can trace learners’ actions, and our categories of traces are also targeted toward improving this tracing capability of a learning environment if taken into consideration while designing the learning environment.

Generalizability in theory starts when researchers theoretically hypothesize and map traces to the constructs of a theoretical model. Researchers should choose a theoretical model for grounding with considerable care and justification, keeping in mind the nuances each SRL model offers. Once chosen, researchers should ideally aim to exhaustively trace all the SRL processes of the model. This is likely an iterative process, and investing multiple cycles into this process is worthwhile, which will ultimately strengthen the overall research. The SRL processes identified in RQ4 and our process of coding them into SRL models, coupled with the categories of traces in RQ3, can be useful toward designing learning systems that are capable of tracing the SRL processes in different learning contexts (RQ2).

The rest of the discussion section is structured into three main sections: theoretical considerations, design considerations, and validation considerations, presenting our key learnings from this systematic literature review and offering concrete suggestions for future research.

Theoretical Considerations

Technological advancements have come a long way since the first theoretical SRL models were introduced. We can now capture very granular information about the learners’ activities in a learning environment using trace data. When combined with multimodal data sources, we obtain even more granular data that can capture very subtle SRL processes as they are externalized by learners. In light of these advancements, we can and should interpret learners’ actions even more thoroughly, reevaluate SRL models in digital learning contexts, and refine them as needed. Future theoretical models should take multimodal data sources into consideration and offer guidelines on how such models can be operationalized using multimodal data.

Need for the use of holistic trace data mappings in SRL studies

Our review finds that several researchers prefer a consolidated model of SRL, combining dimensions from multiple models. A similar observation was seen in the review by Saint et al. (2022). While such an approach may be convenient for mapping trace data, it may raise questions of construct validity (Saint et al., 2022). Our suggestion for researchers is to first make adequate efforts to fit their context into the constructs of a single theoretical model. They may still decide to integrate dimensions from other theories, depending on their study. However, since constructs of self-regulation are interrelated and co-dependent, studying only specific aspects of SRL (unless warranted by research questions) in theoretical models may come with limitations. There are phases and facets of SRL (as seen in RQ4) that have received less attention from researchers. The meta-analysis by L. Zheng (2016) also indicates that there is more virtue in focusing on all the phases of self-regulation rather than specific ones. Researchers should make purposeful efforts toward comprehensive coding of events in a learning task, rather than mapping only a small set of actions or sequences of actions to SRL processes. One such notable effort is that of Fan, van der Graaf, et al. (2022b), who demonstrated that with extensive efforts, it is possible to generate a great deal of inferences about the learners’ SRL processes based on their traced actions throughout the learning session. Learners with a specific goal in mind have agency over their path to achieving that goal (Hadwin, 2008; Saint et al., 2022). Their actions have reasons, and this notion leads us to believe that with adequate effort, most learner actions can be interpreted and mapped to higher-level self-regulatory processes. Additionally, the use of dimensions of a single theoretical model sets a precedent for future researchers on how to operationalize those dimensions using trace data in another similar context. Prior efforts by Greene and Azevedo (2009) to elaborately operationalize the constructs of Winne & Hadwin’s model can be taken as an example for such holistic mappings, which were later referred to by many researchers. However, it is worth mentioning at this point that the goodwill of the researchers alone may not be sufficient to capture certain self-regulatory processes using trace data. Certain metacognitive processes may not compel a learner to act physically and, in turn, do not leave sufficient traces for researchers to operationalize. Researchers may consider other data channels to operationalize such SRL constructs and supplement trace data. Notable examples in our review include Taub and Azevedo (2019), who used eye-tracking, and J. Zheng et al. (2019), who coded chat messages for certain SRL processes and collaborative self-regulatory processes. Taub et al. (2022) also proposed a trace-data-based measure for operationalizing self-motivation (see section Different pedagogies prompt different design choices). Researchers should make a purposeful decision about the type of study they want to conduct and which SRL dimensions they can analyze, and then specify what the digital learning system should capture as traces. At the same time, this also serves as a motivation for advancements in software development and technological improvements for capturing enhanced traces. Future research questions in SRL should focus on thorough mappings of actions in learners to a theoretical model and offer empirical benefits of such efforts.

A thorough consideration of all aspects is necessary before choosing a theoretical model

Zimmerman’s cyclical model (Zimmerman & Moylan, 2009) and Winne & Hadwin’s model (Winne & Hadwin, 1998) emerged as the two popular models used by researchers in trace-based SRL research. Before we discuss our results further, we must outline the points of similarity and difference between these two models. Both models conceptualize SRL as a process unfolding over phases (Panadero, 2017). In the case of Winne & Hadwin’s model, however, these phases and the processes within them are not as clearly demarcated as they are in Zimmerman’s model. Zimmerman’s model has clearly defined and discrete process categories and processes within them under each of the three phases of SRL, which likely contributes to the popularity of Zimmerman’s model for trace-data-based studies (Saint et al., 2022). On the other hand, Winne & Hadwin’s model conceptualizes SRL as a free-flowing recursive process where each of the five facets of COPES (i.e., conditions, operations, products, evaluations, standards) keeps on evolving throughout a learning activity over four loosely cyclical and recursive phases (Greene & Azevedo, 2007; Panadero, 2017). The Winne & Hadwin model’s relatively flexible conceptualization could be useful to model activities that are unstructured and may involve multiple cycles of SRL (Bernacki, 2017). The model’s grounding in the information processing theory (Greene & Azevedo, 2007) makes it suitable for modeling sessions where the learner acts as an initiator as well as a reactor to stimuli. Instead of attempting to trace one of the three SRL phases, researchers can attempt to trace the dynamic unfolding of COPES using SRL processes without being restricted to a strict chronological structure to comply with and see the phases of SRL emerge in the traced data in retrospection, as it did in a recent study (Nath et al., 2024). Further, the smaller grain size of SRL processes compared to phases makes it more suitable for modeling using trace data (Howard-Rose & Winne, 1993; Jamieson-Noel & Winne, 2003). Few papers in our corpus explicitly investigated the dynamic nature of COPES, and that remains an interesting and open perspective to investigate in the future. The theoretical basis for modeling SRL should fit the learner, the task, and how the learner engages with it. Researchers should have a clear understanding of their context in terms of the task type, constraints, domain, and duration (as identified in our RQ2), along with other factors that can influence self-regulation (Panadero, 2017), before they make their choice for theoretical grounding.

Our review raises another question: Can and should the theoretical constructs of two models be combined conceptually to generate better traces? Let us take the example of Paquette et al. (2021). Their study in Betty’s Brain ITS used the process information seeking, which represents seeking information to progress on the task. Information seeking can occur at different phases of the activity. Depending on whether it is operationalized as reading a page for the first time or reading a related page after reviewing quiz results, information seeking can represent a trace from either the preparatory phase or the appraisal phase, respectively. SRL is cyclical and recursive, and it is not unreasonable to expect SRL processes that we may typically associate with one phase to occur in another, even in structured tasks. Researchers should consider such possibilities when creating their coding frameworks and examine the nuances of each theoretical model before making any selection. The possibility of merging SRL theories should only be considered if the uniqueness of different models can enhance our coding and help in conceptualizing SRL for the task better, rather than in a way that combines redundant constructs. Can we create separate trace libraries for a learning context using two different theoretical models? Is one model more suitable than the other for certain contexts? Can trace libraries created based on various theoretical models be combined? Can insights generated from two theoretical models enhance our understanding of self-regulated learning (SRL) in our learners? Can we generalize and recommend the right theoretical model of SRL for a set of learning tasks (say MOOCs)? These are some questions that future empirical studies should address in this regard.

Theory-driven and data-driven approaches can be combined to create more comprehensive trace libraries

We identified two main approaches to operationalizing SRL processes using trace data: theory-driven and data-driven. In the theory-driven approach, researchers typically predict the types of actions learners may take within a learning environment and map these actions to SRL processes defined by an existing theoretical model (Fan, van der Graaf, et al., 2022b). These top-down mappings are usually established before data collection. For instance, time spent reviewing the syllabus and rubric was interpreted as evidence of forethought—planning and activation in Pintrich’s SRL model (Ye & Pennisi, 2022). The vast majority of studies in our literature corpus followed this theory-driven approach to ground their trace-data analyses.

In contrast, data-driven approaches rely on analytical techniques to infer higher-level SRL processes from raw learning data. For example, Maldonado-Mahauad et al. (2018) used process mining to extract interaction patterns from MOOC log data, which were then interpreted as indicators of various SRL processes. Similarly, K.-Z. Chen and Li (2021) applied lag-sequential analysis (LSA) to identify behavioral patterns in learners categorized as high or low SRL based on questionnaire results. Their findings revealed distinct patterns that differentiated the two groups, as well as some interaction behaviors that were common across both.

Fan, van der Graaf, et al. (2022b) demonstrated how integrating theory-driven and data-driven approaches can improve and enhance the mapping of trace data to SRL processes. They began with a theory-driven SRL process library based on an established model and were able to enhance their initial process library by applying process mining to think-aloud data collected concurrently from learners. This data-driven analysis allowed them to expand and refine their initial trace-based SRL process library. Such studies illustrate that researchers need not rely solely on predefined SRL processes; instead, they can iteratively enhance trace libraries based on data from empirical data collected during or after the learning activity.

Design Considerations

The design of a learning system can both foster self-regulatory processes in learners and support researchers in operationalizing these processes through trace data. The design of the learning environment is a crucial phase in the SRL research cycle, and each step of the design process should be grounded in theoretical justification. Equipping the learning environment with enhanced tracing software, using multimodal solutions to maximize the capture of learner behavior, using carefully hypothesized and theoretically grounded segmentation of the content, and integrating tools based on the generic categories of traces offered in this literature review are some a priori design choices that can help progress toward creating a generic design that can then be repurposed for contextualized SRL studies.

Pedagogy can guide system design choices and generate more informed traces

Pedagogy can serve both as a boundary and a frame of reference for researchers, offering opportunities to capture more meaningful trace data. While capturing trace-based measures is often viewed as a technological and theoretical challenge, the studies discussed in the subsection Different pedagogies prompt different design choices show how instructional and pedagogical design can be leveraged to operationalize SRL processes through learner actions. We suggest that researchers place particular emphasis on pedagogical and instructional design decisions to enable richer and more informative traces of SRL processes.

Additional tools and software can enhance a learning environment’s tracing capabilities

Modern digital learning systems are capable of capturing data traces at unprecedented granularity. Designers of these systems can harness this capability to support the researchers, provided there is clear communication of the research needs. It is advisable that researchers first pre-conceptualize their theoretical underpinnings clearly, which in turn can help them hypothesize ways in which traces can align with their research. This will help them to work with designers of learning systems to incorporate tools to capture their targeted data. There are several examples in our review that used custom-built tools within their learning systems to capture trace data that would otherwise be missed in standard learning systems. Features like linking notes to external information (Hadwin et al., 2007), clicking a link to obtain the definition of a term (Bernacki et al., 2012), and creating concept maps (Roscoe et al., 2013) help externalize the latent mental processes of learners. Additional integrations with learning systems can also help characterize interactions that enhance a trace. For example, characterizing “accessing a page” as reading, skimming, or scrolling back (Bouchet et al., 2012; Jamieson-Noel & Winne, 2003; Zhang et al., 2021) can help differentiate SRL processes. Instead of undertaking a major overhaul of learning systems, researchers can consider integrating lightweight, complementary tools and software to enhance data capture. Our supplementary document provides an exhaustive list of such tools and integrations.

Multimodal data can provide enhanced traces

Apart from multichannel data from added tools, recent studies have shown that valid measurements of self-regulation can be captured using multimodal sources like eye-tracking (Fan, Lim, et al., 2022a). Common trace data is captured only through physical interactions with the learning system, and modalities like eye-tracking can shed light on the “blind spots” between these interactions. In two different studies in the MetaTutor learning environment, Taub and Azevedo (2019) interpreted shifts of eye gaze from text content AOI to Learning Goal AOI as evidence of planning, while Bouchet et al. (2012) used time spent on a page captured using log data to categorize reading behavior during planning. These examples illustrate how the same activity within the same learning environment can be operationalized differently depending on the modality used. Taub and Azevedo (2019) were able to identify distinct SRL processes from eye-tracking data while using log data to identify others. Fan, Lim, et al. (2022a) demonstrated how combining eye-tracking data with log data can deepen the interpretation of SRL behaviors.

Chunking content into multiple pages can make SRL behavior overt

Thematic segregation of content in a learning platform into modules can help externalize learners’ SRL behaviors (see subsection Use of multimodal channels to capture traces). Learner interactions—such as opening a module, highlighting, and annotating the module—can serve as proxies for higher-level SRL processes. These design choices need to be driven by theory and expert hypotheses. Our supplementary material outlines how contemporary researchers have implemented such strategies, offering guidance for future researchers developing trace-based SRL learning environments.

Validation Considerations

Emerging validation studies in SRL indicate plenty of scope for progress in this area. Most studies have reported substantial discrepancies between trace data and the corresponding validation data sources. Researchers have attributed such mismatches to faulty or over-interpretations (Fan, van der Graaf, et al., 2022b), difficulty in distinguishing phases of SRL that tend to co-occur or occur in adjacency (Salehian Kia et al., 2021), and misalignment of students’ own criteria of self-reporting and researchers’ reasoning for a study tactic (Jamieson-Noel & Winne, 2003). Researchers have also identified learners’ internal SRL conditions playing a part in these mismatches (Salehian Kia et al., 2021; Ye & Pennisi, 2022). Studies have also highlighted that SRL processes operationalized using traces may co-occur with multiple verbalized SRL processes (Bernacki, et al., 2025; Fan, van der Graaf, et al., 2022b), opening up the possibility of a many-to-many mapping of trace data to self-reported SRL processes. In light of these findings, we need to reevaluate what validation should entail in trace-based studies in SRL.

A look at validation of traces in SRL using the lens of a modern validation framework

When examining contemporary validation theories, Cizek (2020) highlights a critical limitation that these theories fail to distinguish between two incompatible yet pertinent concerns: (a) the intended meaning of the measurement and (b) the intended use of the measurement. Cizek (2020) notes that validity concerns encompass both these issues, and each requires independent consideration. These issues seem to extend into trace-based measurements as well. We have seen that researchers often adopt theoretical rationales from other studies into their context (also highlighted in section RQ5—Methods of validating trace data). Extending Cizek’s view, we propose that validation of trace libraries in SRL research needs to consider two distinct aspects regarding trace libraries: (a) whether the trace library truly indicates the SRL processes that it aims to operationalize and (b) whether the trace libraries can be reused as a proxy for the corresponding SRL processes in a new setting. Researchers not only need to validate their trace libraries with relevant techniques but also should offer adequate justification as to why the trace library created in a previous context can be validly used to identify self-regulation in a new setting. Efforts undertaken by Fan, Lim, et al. (2022a; Fin, van der Graaf, et al., 2022b; Fan et al., 2023) and Bernacki et al. (2025) to validate trace libraries by temporally aligning them to student verbalizations and attempts by Salehian Kia et al. (2021) to validate using students’ periodic real-time endorsements can serve as examples to address the first validity concern—that is, the intended meaning of trace libraries. The demonstration by Bernacki et al. (2025) that such validated trace libraries can possess substantial predictive validity in another comparable setting serves as a precedent for addressing the second validity concern, which is to prove whether trace libraries can be reused in analogous settings.

Validation is an urgent need in trace-based measurements

Our review acknowledges the importance of experts in interpreting learners’ actions, as the vast majority of the papers use researchers’ hypotheses and domain expertise to measure constructs of SRL models using trace data. Our results in response to RQ5 in section RQ5—Methods of validating trace data list studies that compared such expert hypothesis-driven trace libraries with other measures of self-regulation. All studies found considerable differences between expert-based trace interpretations and self-reports/think-aloud data. Fan, van der Graaf, et al. (2022b) attributed their mismatches to invalid interpretations of trace data, which were based solely on theoretical hypotheses. This reiterates the validity concerns echoed by Winne (2020) and possible fallacies of taking expert reasoning at face value. Trace data, which comes with its multitude of unmatched benefits and conveniences, requires careful attention and rigorous measures to ensure its validity. Such validation studies can improve the reusability of trace-based libraries in other relevant contexts, which can also improve the reliability of SRL studies that are informed by expert-opinionated trace operationalizations.

The validation efforts of Fan et al. using concurrent think-aloud protocols (Fan, Lim, et al., 2022a; Fan, van der Graaf, et al., 2022b; Fan et al., 2023) and Bernacki et al. (2025) and Salehian Kia et al. using periodic self-reports during the task (Salehian Kia et al., 2021) are particularly noteworthy. These authors attempted to temporally validate their trace-data-based SRL processes, keeping the temporal event view of SRL intact (Winne, 2010). There could be obvious concerns about whether these validation approaches may interrupt the natural flow of activity of the learners, and if such tedious studies are worth taking up. The recent multimodal study (Bernacki et al., 2025) has demonstrated that validated trace events (against think-aloud protocol) in controlled laboratory settings can be reapplied in classrooms for future learners and possess ample predictive validity. Researchers should thus consider these validation studies not as a burden but as an opportunity to strengthen their trace-based research. Even a small-scale but thorough validation study is likely to translate into improvement of inferences generated in naturalistic studies at scale, as identified by Bernacki et al. (2025). They also found that certain digital traces co-occurred with multiple verbalized SRL processes, providing empirical support for Winne (2020), who posited that a digital trace may be indicative of multiple SRL processes and should be associated with a probability for each possible SRL process, rather than a one-to-one mapping. This finding is interesting for future pursuits in validating trace-based SRL research.

Continuous measures, such as think-aloud, are a better means of validating trace data compared to global questionnaires

Trace-data-based SRL research, in its current state, relies heavily on experts who hypothesize trace data and map them to SRL processes. While attempts to compare trace data against other measures of self-regulation are commendable, it cannot be denied that each measure has its share of limitations. Think-aloud data are inherently reliant on participants’ verbalization abilities and have their share of validity concerns (Young, 2009). It has also been suggested that questionnaires are not suitable instruments for measuring specific local SRL strategies or tactics in a learning task (Bernacki, 2017; Jamieson-Noel & Winne, 2003; Rovers et al., 2019; Winne & Jamieson-Noel, 2002). All the studies in our review that looked for validation of their trace-data-based operationalizations using questionnaires found considerable mismatches. Apart from possible contextual reasons listed in each study (detailed in section RQ5—Methods of validating trace data), we should not forget a fundamental point that questionnaires measure SRL from the perspective of an aptitude, while trace data measures SRL from the event perspective (Winne, 2010). Experts have also emphasized the important point that aptitudes are malleable; they can change and develop throughout a learning episode (Azevedo et al., 2010; Winne, 2010). So, researchers should not presume that aptitudes measured before a learning episode will remain constant throughout an intervention. Questionnaires collected post-intervention may correlate better with trace data, as Ye and Pennisi (2022) found; however, the grain sizes measured by questionnaires and trace data are still not the same (Jamieson-Noel & Winne, 2003; Rovers et al., 2019). Salehian Kia et al. (2021) developed a workaround by collecting periodic self-reports throughout the learning task, but they too found inconsistencies between the trace data and self-reports. Keeping all these results and previous suggestions in view, we propose that measurements that can capture SRL and its changes continuously (Winne, 2010), such as think-aloud data, are likely to be more effective measures for investigating the validity of trace-data-based codings. Alternative measures, such as retrospective think-aloud protocols (Pathan et al., 2021; Prokop et al., 2020) and retrospective semistructured interviews (Ye & Pennisi, 2022), can also provide continuous measures of SRL, although not on-the-fly. These suggestions had already been somewhat indicated in the literature (Bernacki, 2017). It has also been discovered that while students’ self-reports can provide a better account of the global measure of self-regulation, when it comes to local and specific SRL strategies and tactics, behavioral indicators (such as traces) give a more accurate account (Jamieson-Noel & Winne, 2003; Rovers et al., 2019; Winne, & Jamieson-Noel, 2002). This can be an important aspect to consider before researchers decide to combine self-reports with trace-based measurements. SRL is inherently a multilevel construct (Howard-Rose & Winne, 1993), and both its higher and lower levels of conceptualization can contribute to the understanding of how people learn. It is hence worth exploring which measurements are more suitable for measuring lower-level tactics (such as scrolling through a chapter at the outset of a task or highlighting a paragraph during the task) vs those suitable for high-level phases (like planning or reviewing).

Limitations

Coding traces to constructs of two fundamentally distinct theoretical models, trying to make generalizations across them while keeping the context of the studies in view, is indeed a complex task and not without limitations. While coding into facets of Winne & Hadwin’s model in RQ4, we coded each trace into a single facet, which may not always be the case. From the perspective of COPES, a trace may be an indicator of change to more than a single facet. We do not make these distinctions in our current review. We developed a definition of theoretical relevance in the section Filtering for theoretical relevance, which is appropriate for this review but may not necessarily apply to all scenarios.

Conclusion

Trace-based measurements that account for the context of learning scenarios while contributing to general theoretical understandings represent a critical need in SRL research. We posit that understanding a context in terms of the learning environment and its features, the nature of the task and accompanying constraints, and the duration of the learning activity is crucial to generate adequately informed traces. Generalization of these traces should occur in two broad aspects—design and theory. Our review presents evidence that traces in SRL research can be aligned across even fundamentally different theoretical frameworks. However, that raises a different dilemma: Do some aspects of certain theoretical models remain underutilized? If so, how can we design learning systems that are suitable for generating more theoretically informed traces and map them meaningfully to specific SRL models? What should our learning systems log to better trace SRL processes? Findings from this review suggest practical approaches to these challenges. Attaining a validated balance of contextualization and generalization is essential for theory building and theory refinement in this field.

Supplemental Material

sj-docx-5-rer-10.3102_00346543251382238 – Supplemental material for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies

Supplemental material, sj-docx-5-rer-10.3102_00346543251382238 for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies by Debarshi Nath, Yizhou Fan, Dragan Gašević and Ramkumar Rajendran in Review of Educational Research

Supplemental Material

sj-xlsx-1-rer-10.3102_00346543251382238 – Supplemental material for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies

Supplemental material, sj-xlsx-1-rer-10.3102_00346543251382238 for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies by Debarshi Nath, Yizhou Fan, Dragan Gašević and Ramkumar Rajendran in Review of Educational Research

Supplemental Material

sj-xlsx-2-rer-10.3102_00346543251382238 – Supplemental material for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies

Supplemental material, sj-xlsx-2-rer-10.3102_00346543251382238 for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies by Debarshi Nath, Yizhou Fan, Dragan Gašević and Ramkumar Rajendran in Review of Educational Research

Supplemental Material

sj-xlsx-3-rer-10.3102_00346543251382238 – Supplemental material for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies

Supplemental material, sj-xlsx-3-rer-10.3102_00346543251382238 for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies by Debarshi Nath, Yizhou Fan, Dragan Gašević and Ramkumar Rajendran in Review of Educational Research

Supplemental Material

sj-xlsx-4-rer-10.3102_00346543251382238 – Supplemental material for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies

Supplemental material, sj-xlsx-4-rer-10.3102_00346543251382238 for Balancing Contextualization and Generalization in Trace-based Measurement of Self-Regulated Learning: A Systematic Review of Theoretically Grounded Studies by Debarshi Nath, Yizhou Fan, Dragan Gašević and Ramkumar Rajendran in Review of Educational Research

Footnotes

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work has in part been funded by the Australian government through the Australian Research Council (DP240100069 and DP220101209) and the Jacobs Foundation (CELLA 2 CERES).

ORCID iDs

Debarshi Nath

Yizhou Fan

Dragan Gašević

Ramkumar Rajendran

AUTHORS

DEBARSHI NATH is a doctoral researcher in the joint PhD program at IIT Bombay-Monash Research Academy. He is jointly affiliated with the Centre for Educational Technology at Indian Institute of Technology Bombay, Mumbai, India, and the Centre for Learning Analytics at Monash (CoLAM), Faculty of Information Technology at Monash University, Clayton, Australia. His research interests lie in the field of self-regulated learning, multimodal learning analytics, data science and artificial intelligence in education.

YIZHOU FAN is an assistant professor in the Graduate School of Education, Peking University. Yizhou considers himself a learning analyst using computational methods to advance the understanding of online learning strategies and self-regulated learning. His research interests are AI in education, self-regulated learning, learning design, learning tactics and strategies, and multimodal learning analytics.

DRAGAN GAŠEVIĆ is Distinguished Professor of Learning Analytics and the Director of the Centre for Learning Analytics at Monash University, 20 Exhibition Walk, Clayton, VIC 3800, Australia. His research interests center around data analytics, AI, and design methods that can advance understanding of self-regulated and collaborative learning. He served as the president (2015–2017) of the Society for Learning Analytics Research (SoLAR). He is a recipient of the Life-time Member Award (2022) from the Society for Learning Analytics Research (SoLAR) and Distinguished Member Award (2022) of the Association for Computing Machinery (ACM) and is recognized as the national field leader in educational technology (2019–2024) in The Australian’s Research Magazine, which is published annually.

RAMKUMAR RAJENDRAN is an associate professor at the Centre for Educational Technology at the Indian Institute of Technology Bombay, Mumbai, India. His research interests include learning analytics, affective computing, AI in education, personalized and adaptive learning environments, and self-regulated learning.

References

Aleven

McLaren

Roll

Koedinger

(2006). Toward meta-cognitive tutoring: A model of help seeking with a cognitive tutor. International Journal of Artificial Intelligence in Education, 16(2), 101–128. https://doi.org/10.3233/irg-2006-16(2)02

Ali

Hanna

(2021). Predicting students’ achievement in a hybrid environment through self-regulated learning, log data, and course engagement: A data mining approach. Journal of Educational Computing Research, 60, 960–985. https://doi.org/10.1177/07356331211056178

Azevedo

Moos

D. C.

Johnson

A. M.

Chauncey

A. D.

(2010). Measuring cognitive and metacognitive regulatory processes during hypermedia learning: Issues and challenges. Educational Psychologist, 45(4), 210–223. https://doi.org/10.1080/00461520.2010.515934

Ben-Eliyahu

Bernacki

M. L.

(2015). Addressing complexities in self-regulated learning: A focus on contextual factors, contingencies, and dynamic relations. Metacognition and Learning, 10, 1–13. https://doi.org/10.1007/s11409-015-9134-6

Bernacki

M. L.

(2017). Examining the cyclical, loosely sequenced, and contingent features of self-regulated learning: Trace data and their analysis. In Schunk

D. H.

Greene

J. A.

(Eds.), Handbook of self-regulation of learning and performance (pp. 370–387). Routledge/Taylor & Francis Group.

Bernacki

M. L.

Byrnes

Cromley

(2012). The effects of achievement goals and self-regulated learning behaviors on reading comprehension in technology-enhanced learning environments. Contemporary Educational Psychology, 37, 148–161. https://doi.org/10.1016/j.cedpsych.2011.12.001

Bernacki

M. L.

Kuhlmann

S. L.

Plumley

R. D.

Greene

J. A.

Duke

R. F.

Freed

Hollander-Blackmon

Hogan

K. A.

(2025). Using multimodal learning analytics to validate digital traces of self-regulated learning in a laboratory study and predict performance in undergraduate courses. Journal of Educational Psychology, 117(2), 176–205. https://doi.org/10.1037/edu0000890

Boekaerts

(2011). Emotions, emotion regulation, and self-regulation of learning. In Schunk

D. H.

Zimmerman

(Eds.), Handbook of self-regulation of learning and performance (pp. 408–425). Routledge.

Bouchet

Kinnebrew

Biswas

Azevedo

(2012). Identifying students’ characteristic learning behaviors in an intelligent tutoring system fostering self-regulated learning [Conference session]. Proceedings of the 5th International Conference on Educational Data Mining (pp. 65–72). International Educational Data Mining Society.

10.

Buckingham Shum

Deakin Crick

(2016). Learning analytics for 21st century competencies. Journal of Learning Analytics, 3, 6–21. https://doi.org/10.18608/jla.2016.32.2

11.

Cerezo

Bogarín

Esteban

Romero

(2020). Process mining for self-regulated learning assessment in e-learning. Journal of Computing in Higher Education, 32, 74–88. https://doi.org/10.1007/s12528-019-09225-y

12.

Chen

Knight

Wise

A. F.

(2018). Critical issues in designing and implementing temporal analytics. Journal of Learning Analytics, 5, 1–9. https://doi.org/10.18608/jla.2018.51.1

13.

Chen

K.-Z.

S.-C.

(2021). Sequential, typological, and academic dynamics of self-regulated learners: Learning analytics of an undergraduate chemistry online course. Computers and Education: Artificial Intelligence, 2, 100024. https://doi.org/10.1016/j.caeai.2021.100024

14.

Cicchinelli

Veas

Pardo

Pammer-Schindler

Fessl

Barreiros

Lindstädt

(2018). Finding traces of self-regulated learning in activity streams [Conference session]. Proceedings of the 8th International Conference on Learning Analytics and Knowledge (pp. 191–200). Association for Computing Machinery.

15.

Cizek

G. J.

(2020). Validity: An integrated approach to test score meaning and use (1st ed.). Routledge.

16.

Dawson

Joksimovic

Poquet

Siemens

(2019). Increasing the impact of learning analytics [Conference session]. Proceedings of the 9th International Conference on Learning Analytics & Knowledge (pp. 446–455). Association for Computing Machinery.

17.

Dignath

Büttner

(2008). Components of fostering self-regulated learning among students. A meta-analysis on intervention studies at primary and secondary school level. Metacognition and Learning, 3, 231–264. https://doi.org/10.1007/s11409-008-9029-x

18.

Hew

K. F.

Liu

(2023). What can online traces tell us about students’ self-regulated learning? A systematic review of online trace data analysis. Computers & Education, 201, 104828. https://doi.org/10.1016/j.compedu.2023.104828

19.

Efklides

(2011). Interactions of metacognition with motivation and affect in self-regulated learning: The MASRL model. Educational Psychologist, 46, 6–25. https://doi.org/10.1080/00461520.2011.538645

20.

Fan

Lim

van der Graaf

Kilgour

Rakovic

Moore

Molenaar

Bannert

Gasevic

(2022a). Improving the measurement of self-regulated learning using multi-channel data. Metacognition and Learning, 17, 1025–1055. https://doi.org/10.1007/s11409-022-09304-z

21.

Fan

Rakovic

van der Graaf

Lim

Singh

Moore

Molenaar

Bannert

Gašević

(2023). Towards a fuller picture: Triangulation and integration of the measurement of self-regulated learning based on trace and think aloud data. Journal of Computer Assisted Learning, 39, 1303–1324. https://doi.org/10.1111/jcal.12801

22.

Fan

Saint

Singh

Jovanovic

Gašević

(2021). A learning analytic approach to unveiling self-regulatory processes in learning tactics [Conference session]. Proceedings of the 11th International Learning Analytics and Knowledge Conference (pp. 184–195). Association for Computing Machinery.

23.

Fan

van der Graaf

Lim

Raković

Singh

Kilgour

Moore

Molenaar

Bannert

Gašević

(2022b). Towards investigating the validity of measurement of self-regulated learning based on trace data. Metacognition and Learning, 17, 949–987. https://doi.org/10.1007/s11409-022-09291-1

24.

Greene

J. A.

Azevedo

(2007). A theoretical review of Winne and Hadwin’s model of self-regulated learning: New perspectives and directions. Review of Educational Research, 77(3), 334–372. https://doi.org/10.3102/003465430303953

25.

Greene

J. A.

Azevedo

(2009). A macro-level analysis of SRL processes and their relations to the acquisition of a sophisticated mental model of a complex system. Contemporary Educational Psychology, 34, 18–29. https://doi.org/10.1016/j.cedpsych.2008.05.006

26.

Greene

J. A.

Azevedo

(2010). The measurement of learners’ self-regulated cognitive and metacognitive processes while using computer-based learning environments. Educational Psychologist, 45, 203–209. https://doi.org/10.1080/00461520.2010.515935

27.

Greene

J. A.

Bernacki

M. L.

Hadwin

A. F.

(2024). Self-regulation. In Schutz

P. A.

Muis

K. R.

(Eds.), Handbook of educational psychology (pp. 314–334). Routledge.

28.

Greene

J. A., D.

Plumley

Urban

Bernacki

Gates

Hogan

K. A.

Demetriou

Panter

(2021). Modeling temporal self-regulatory processing in a higher education biology course. Learning and Instruction, 72, 101201. https://doi.org/10.1016/j.learninstruc.2019.04.002

29.

Guo

Trainin

(2022). Measuring self-regulation: A learning analytics approach [Conference session]. Proceedings of the Finnish Learning Analytics and Artificial Intelligence in Education Conference (FLAIEC22), Joensuu, Finland. CEUR Workshop Proceedings.

30.

Hadwin

A. F.

(2008). Self-regulated learning. In Good

T. L.

(Ed.), 21st century education: A reference handbook (pp. 175–183). Sage Publications.

31.

Hadwin

A. F.

Järvelä

Miller

(2018). Self-regulation, co-regulation, and shared regulation in collaborative learning environments. In Schunk

D. H.

Greene

J. A.

(Eds.), Handbook of self-regulation of learning and performance (2nd ed., pp. 83–106). Routledge/Taylor & Francis Group.

32.

Hadwin

A. F.

Nesbit

Jamieson-Noel

Code

Winne

(2007). Examining trace data to explore self-regulated learning. Metacognition and Learning, 2, 107–124. https://doi.org/10.1007/s11409-007-9016-7

33.

Hatala

Nazeri

Kia

F. S.

(2023). Progression of students' SRL processes in subsequent programming problem-solving tasks and its association with tasks outcomes. The Internet and Higher Education, 56, 100881. https://doi.org/10.1016/j.iheduc.2022.100881

34.

Howard-Rose

Winne

P. H.

(1993). Measuring component and sets of cognitive processes in self-regulated learning. Journal of Educational Psychology, 85, 591–604. https://doi.org/10.1037//0022-0663.85.4.591

35.

Huang

Doleck

Chen

Huang

Tan

Lajoie

Wang

(2023). Multimodal learning analytics for assessing teachers’ self-regulated learning in planning technology-integrated lessons in a computer-based environment. Education and Information, 28, 15823–15843. https://doi.org/10.1007/s10639-023-11804-7

36.

Huang

Lajoie

(2021). Process analysis of teachers’ self-regulated learning patterns in technological pedagogical content knowledge development. Computers & Education, 166, 104169. https://doi.org/10.1016/j.compedu.2021.104169

37.

Jamieson-Noel

Winne

(2003). Comparing self-reports to traces of studying behavior as representations of students' studying and achievement. Zeitschrift Fur Padagogische Psychologie, 17, 159–171. https://doi.org/10.1024//1010-0652.17.3.159

38.

Jansen

R. S.

van Leeuwen

Janssen

Conijn

Kester

(2020). Supporting learners' self-regulated learning in Massive Open Online Courses. Computers & Education, 146, 103771. https://doi.org/10.1016/j.compedu.2019.103771

39.

Johnson

C. C.

Walton

J. B.

Strickler

Elliott

J. B.

(2023). Online teaching in K-12 education in the United States: A systematic review. Review of Educational Research, 93(3), 353–411. https://doi.org/10.3102/00346543221105550

40.

Kim

Yoon

I.-H.

Branch

(2018). Learning analytics to support self-regulated learning in asynchronous online courses: A case study at a women's university in South Korea. Computers & Education, 127, 233–251. https://doi.org/10.1016/j.compedu.2018.08.023

41.

Lan

Hou

Mattheos

(2019). Self-regulated learning strategies in world's first MOOC in implant dentistry. European Journal of Dental Education, 23, 278–285. https://doi.org/10.1111/eje.12428

42.

Leite

W. L.

Kuang

Jing

Xing

Cavanaugh

Huggins-Manley

A. C.

(2022). The relationship between self-regulated student use of a virtual learning environment for algebra and student achievement: An examination of the role of teacher orchestration. Computers & Education, 191, 104615. https://doi.org/10.1016/j.compedu.2022.104615

43.

Baker

Warschauer

(2020). Using clickstream data to measure, understand, and support self-regulated learning in online courses. The Internet and Higher Education, 45, 100727. https://doi.org/10.1016/j.iheduc.2020.100727

44.

Zheng

Lajoie

(2021). The frequency of emotions and emotion variability in self-regulated learning: What matters to task performance? Frontline Learning Research, 9, 76–91. https://doi.org/10.14786/flr.v9i4.901

45.

Zheng

Poitras

Lajoie

(2018). The allocation of time matters to students’ performance in clinical reasoning. In Nkambou

Azevedo

Vassileva

(Eds.), Intelligent tutoring systems. ITS 2018. Lecture notes in computer science (Vol. 10858). Springer.

46.

Lim

Bannert

van der Graaf

Singh

Fan

Surendrannair

Rakovic

Molenaar

Moore

Gašević

(2023). Effects of real-time analytics-based personalized scaffolds on students’ self-regulated learning. Computers in Human Behavior, 139, 107547. https://doi.org/10.1016/j.chb.2022.107547

47.

Maldonado-Mahauad

Pérez-Sanagustín

Kizilcec

R. F.

Morales

Munoz-Gama

(2018). Mining theory-based patterns from Big data: Identifying self-regulated learning strategies in Massive Open Online Courses. Computers in Human Behavior, 80, 179–196. https://doi.org/10.1016/j.chb.2017.11.011

48.

Matcha

Uzir

N. A.

Gašević

Pardo

(2020). A systematic review of empirical studies on learning analytics dashboards: A self-regulated learning perspective. IEEE Transactions on Learning Technologies, 13, 226–245. https://doi.org/10.1109/tlt.2019.2916802

49.

Messick

(1987). Validity. ETS Research Report Series, 1987, i–208.

50.

Min

Jingyan

(2017). Assessing the effectiveness of self-regulated learning in MOOCs using macro-level behavioural sequence data [Conference session]. Proceedings of European MOOCs Stakeholders Summit 2017: Work in Progress Papers of the Experience and Research Tracks and Position Papers of the Policy Track, Leganes (Madrid), Spain (pp. 1–9).

51.

Min

Nasir

M. K. M.

(2020). Self-regulated learning in a massive open online course: A review of literature. European Journal of Interactive Multimedia and Education, 1, e02007. https://doi.org/10.30935/ejimed/8403

52.

Nath

Gasevic

Fan

Rajendran

(2024). CTAM4SRL: A consolidated temporal analytic method for analysis of self-regulated learning [Conference session]. Proceedings of the 14th Learning Analytics and Knowledge Conference (LAK '24) (pp. 645–655). Association for Computing Machinery.

53.

Newell

(1994). Unified theories of cognition. Harvard University Press.

54.

J. T.

Liu

Chui

D. S.

Man

J. C.

(2023). Leveraging LMS logs to analyze self-regulated learning behaviors in a maker-based course [Conference session]. LAK23: 13th International Learning Analytics and Knowledge Conference (pp. 670–676). Association for Computing Machinery.

55.

Nguyen

L. T.

Ikeda

(2015). The effects of ePortfolio-based learning model on student self-regulated learning. Active Learning in Higher Education, 16, 197–209. https://doi.org/10.1177/1469787415589532

56.

Nitta

Baba

(2015). Self-regulation in the evolution of the ideal L2 self: A complex dynamic systems approach to the L2 motivational self system. In Dörnyei

MacIntyre

P. D.

Henry

(Eds.), Motivational dynamics in language learning (pp. 367–396). Multilingual Matters.

57.

Paans

Molenaar

Segers

Verhoeven

(2019). Temporal variation in children's self-regulated hypermedia learning. Computers in Human Behavior, 96, 246–258. https://doi.org/10.1016/j.chb.2018.04.002

58.

Panadero

(2017). A review of self-regulated learning: Six models and four directions for research. Frontiers in Psychology, 8, 422. https://doi.org/10.3389/fpsyg.2017.00422

59.

Panadero

Alonso-Tapia

(2014). How do students self-regulate? Review of Zimmerman’s cyclical model of self-regulated learning. Anales de Psicología, 30, 450–462.

60.

Panadero

Klug

Järvelä

(2016). Third wave of measurement in the self-regulated learning field: When measurement and intervention come hand in hand. Scandinavian Journal of Educational Research, 60, 723–735. https://doi.org/10.1080/00313831.2015.1066436

61.

Paquette

Grant

Zhang

Biswas

Baker

(2020). Using epistemic networks to analyze self-regulated learning in an open-ended problem-solving environment [Conference session]. Proceedings of the 2nd International Conference on Quantitative Ethnography (pp. 185–201).

62.

Pathan

Murthy

Rajendran

(2021). A coding mechanism for analysis of SRL processes in an open-ended learning environment [Conference session]. 29th International Conference on Computers in Education Conference, ICCE.

63.

Pintrich

P. R.

(2000). The role of goal orientation in self-regulated learning. In Boekaerts

Pintrich

P. R.

Zeidner

(Eds.), Handbook of self-regulation (pp. 451–502). Academic Press.

64.

Pintrich

P. R.

(2003). A motivational science perspective on the role of student motivation in learning and teaching contexts. Journal of Educational Psychology, 95, 667–686. https://doi.org/10.1037/0022-0663.95.4.667

65.

Poquet

Jovanovic

Pardo

(2023). Student profiles of change in a university course: A complex dynamical systems perspective [Conference session]. LAK23: 13th International Learning Analytics and Knowledge Conference (pp. 197–207). Association for Computing Machinery.

66.

Prokop

Pilař

Tichá

(2020). Impact of think-aloud on eye-tracking: A comparison of concurrent and retrospective think-aloud for research on decision-making in the game environment. Sensors, 20, 2750. https://doi.org/10.3390/s20102750

67.

Puustinen

Pulkkinen

(2001). Models of self-regulated learning: A review. Scandinavian Journal of Educational Research, 45, 269–286. https://doi.org/10.1080/00313830120074206

68.

Qiao

Zhao

(2021). Mining and analysis of self-regulated learning process model: Based on hidden Markov model [Conference session]. Proceedings of the 10th International Conference of Educational Innovation through Technology (pp. 276–281). IEEE.

69.

Quick

J. D.

Motz

Morrone

(2023). Lost in translation: Determining the generalizability of temporal models across course contexts [Conference session]. LAK23: 13th International Learning Analytics and Knowledge Conference (pp. 273–283). Association for Computing Machinery.

70.

Rakovic

Bernacki

Greene

J., D.

Plumley

Hogan

Gates

Panter

(2022). Examining the critical role of evaluation and adaptation in self-regulated learning. Contemporary Educational Psychology, 68, 102027. https://doi.org/10.1016/j.cedpsych.2021.102027

71.

Rizki

Purnama

Rustam

Handoko

(2022). Promoting self-regulated learning for students in underdeveloped areas: The case of Indonesia nationwide online-learning program. Sustainability, 14, 4075. https://doi.org/10.3390/su14074075

72.

Roscoe

Segedy

Sulcer

Biswas

(2013). Shallow strategy development in a teachable agent environment designed to support self-regulated learning. Computers & Education, 62, 286–297. https://doi.org/10.1016/j.compedu.2012.11.008

73.

Rovers

Clarebout

Savelberg

de Bruin

Van Merrienboer

J. J.

(2019). Granularity matters: Comparing different ways of measuring self-regulated learning. Metacognition and Learning, 14, 1–19. https://doi.org/10.1007/s11409-019-09188-6

74.

Saint

Fan

Gasevic

Pardo

(2022). Temporally-focused analytics of self-regulated learning: A systematic review of literature. Computers and Education: Artificial Intelligence, 3, 100060. https://doi.org/10.1016/j.caeai.2022.100060

75.

Saint

Gašević

Matcha

Uzir

N. A.

Pardo

(2020a). Combining analytic methods to unlock sequential and temporal patterns of self-regulated learning [Conference session]. Proceedings of the Tenth International Conference on Learning Analytics & Knowledge (pp. 402–411). Association for Computing Machinery.

76.

Saint

Whitelock-Wainwright

Gašević

Pardo

(2020b). Trace-SRL: A framework for analysis of microlevel processes of self-regulated learning from trace data. IEEE Transactions on Learning Technologies, 13, 861–877. https://doi.org/10.1109/tlt.2020.3027496

77.

Salehian Kia

Hatala

Baker

R. S.

Teasley

S. D.

(2021). Measuring students’ self-regulatory phases in LMS with behavior and real-time self report [Conference session]. Proceedings of the 11th International Learning Analytics and Knowledge Conference (pp. 259–268). Association for Computing Machinery.

78.

Schraw

(2010). Measuring self-regulation in computer-based learning environments. Educational Psychologist, 45, 258–266. https://doi.org/10.1080/00461520.2010.515936

79.

Siadaty

Gasevic

Hatala

(2016a). Trace-based micro-analytic measurement of self-regulated learning processes. Journal of Learning Analytics, 3, 183–214. https://doi.org/10.18608/jla.2016.31.11

80.

Siadaty

Gašević

Hatala

(2016b). Associations between technological scaffolding and micro-level processes of self-regulated learning: A workplace study. Computers in Human Behavior, 55, 1007–1019. https://doi.org/10.1016/j.chb.2015.10.035

81.

Siadaty

Gašević

Hatala

(2016c). Measuring the impact of technological scaffolding interventions on micro-level processes of self-regulated workplace learning. Computers in Human Behavior, 59, 469–482. https://doi.org/10.1016/j.chb.2016.02.025

82.

Srivastava

Fan

Rakovic

Singh

Jovanovic

van der Graaf

Lim

Surendrannair

Kilgour

Molenaar

Bannert

Moore

Gašević

(2022). Effects of internal and external conditions on strategies of self-regulated learning: A learning analytics study [Conference session]. LAK22: 12th International Learning Analytics and Knowledge Conference (pp. 392–403). Association for Computing Machinery.

83.

Sun

J. C.-Y.

Liu

Lin

(2023a). Temporal learning analytics to explore traces of self-regulated learning behaviors and their associations with learning performance, cognitive load, and student engagement in an asynchronous online course. Frontiers in Psychology, 13, 1096337. https://doi.org/10.3389/fpsyg.2022.1096337

84.

Sun

J. C.-Y.

Tsai

H.-E.

Cheng

W. K.

(2023b). Effects of integrating an open learner model with AI-enabled visualization on students' self-regulation strategies usage and behavioral patterns in an online research ethics course. Computers and Education: Artificial Intelligence, 4, 100120. https://doi.org/10.1016/j.caeai.2022.100120

85.

Tang

(2021). Person-centered analysis of self-regulated learner profiles in MOOCs: A cultural perspective. Educational Technology Research and Development, 69, 1247–1269. https://doi.org/10.1007/s11423-021-09939-w

86.

Taub

Azevedo

(2019). How does prior knowledge influence eye fixations and sequences of cognitive and metacognitive SRL processes during learning with an intelligent tutoring system? International Journal of Artificial Intelligence in Education, 29, 1–28. https://doi.org/10.1007/s40593-018-0165-4

87.

Taub

Banzon

Zhang

Chen

(2022). Tracking changes in students’ online self-regulated learning behaviors and achievement goals using trace clustering and process mining. Frontiers in Psychology, 13, 813514. https://doi.org/10.3389/fpsyg.2022.813514

88.

van der Graaf

Lim

Fan

Kilgour

Moore

Bannert

Gasevic

Molenaar

(2021). Do instrumentation tools capture self-regulated learning? [Conference session] Proceedings of the 11th International Learning Analytics and Knowledge Conference (pp. 438–448). Association for Computing Machinery.

89.

Viberg

Khalil

Baars

(2020). Self-regulated learning and learning analytics in online learning environments: A review of empirical research [Conference session]. Proceedings of the 10th International Conference on Learning Analytics & Knowledge (pp. 524–533). Association for Computing Machinery.

90.

Wang

Huang

Lajoie

(2023). Task complexity affects temporal characteristics of self-regulated learning behaviours in an intelligent tutoring system. Educational Technology Research and Development, 71(3), 991–1011. https://doi.org/10.1007/s11423-023-10222-3

91.

Wang

Lajoie

(2022). The interplay between cognitive load and self-regulated learning in a technology-rich learning environment. Educational Technology & Society, 26, 50–62.

92.

Warden

C. A.

Chang

C.-C.

Stanworth

J. O.

Caskey

Chen

J. F.

(2022). The impact of scripts on blended and online socially shared regulation of learning: A role-playing game theory perspective. International Journal of Computer-Supported Collaborative Learning, 17, 463–487. https://doi.org/10.1007/s11412-022-09381-x

93.

Winne

(2010). Improving measurements of self-regulated learning. Educational Psychologist, 45, 267–276. https://doi.org/10.1080/00461520.2010.517150

94.

Winne

P. H.

(2014). Issues in researching self-regulated learning as patterns of events. Metacognition and Learning, 9, 229–237. https://doi.org/10.1007/s11409-014-9113-3

95.

Winne

P. H.

(2017). Learning analytics for self-regulated learning. In Lang

Siemens

Wise

A. F.

Gaševic

(Eds.), The handbook of learning analytics (1st ed., pp. 241–249). Society for Learning Analytics Research.

96.

Winne

P. H.

(2020). Construct and consequential validity for learning analytics based on trace data. Computers in Human Behavior, 112, 106457. https://doi.org/10.1016/j.chb.2020.106457

97.

Winne

P. H.

Baker

R. S.

(2013). The potentials of educational data mining for researching metacognition, motivation and self-regulated learning. Journal of Educational Data Mining, 5(1), 1–8.

98.

Winne

P. H.

Hadwin

A. F.

(1998). Studying as self-regulated learning. In Hacker

Graesser

A. C.

(Eds.), Metacognition in educational theory and practice (pp. 277–304). Lawrence Erlbaum Associates Publishers.

99.

Winne

P. H.

Hadwin

A. F.

(2013). nStudy: Tracing and supporting self-regulated learning in the internet. In Azevedo

Aleven

(Eds.), International handbook of metacognition and learning technologies (pp. 293–308). Springer.

100.

Winne

P. H.

Jamieson-Noel

(2002). Exploring students’ calibration of self reports about study tactics and achievement. Contemporary Educational Psychology, 27, 551–572. https://doi.org/10.1016/s0361-476x(02)00006-1

101.

Winne

P. H.

Perry

N. E.

(2000). Measuring self-regulated learning. In Boekaerts

Pintrich

P. R.

Zeidner

(Eds.), Handbook of self-regulation (pp. 531–566). Academic Press.

102.

Wise

A. F.

Hsiao

Y.-T.

(2019). Self-regulation in online discussions: Aligning data streams to investigate relationships between speaking, listening, and task conditions. Computers in Human Behavior, 96, 273–284. https://doi.org/10.1016/j.chb.2018.01.034

103.

Wong

Baars

Davis

Zee

T. V.

Houben

G.-J.

Paas

(2019). Supporting self-regulated learning in online learning environments and MOOCs: A systematic review. International Journal of Human–Computer Interaction, 35, 356–373. https://doi.org/10.1080/10447318.2018.1543084

104.

Wong

Baars

de Koning

B. B.

Paas

(2021). Examining the use of prompts to facilitate self-regulated learning in Massive Open Online Courses. Computers in Human Behavior, 115, 106596. https://doi.org/10.1016/j.chb.2020.106596

105.

Yang

Song

(2022). Understanding primary students’ self-regulated vocabulary learning behaviours on a mobile app via learning analytics and their associated outcomes: a case study. Journal of Computers in Education, 10, 469–498. https://doi.org/10.1007/s40692-022-00251-x

106.

Pennisi

(2022). Using trace data to enhance Students' self-regulation: A learning analytics perspective. The Internet and Higher Education, 54, 100855. https://doi.org/10.1016/j.iheduc.2022.100855

107.

Young

(2009). Direct from the source: The value of 'think-aloud' data in understanding learning. Journal of Educational Enquiry, 6, 19–33.

108.

Zhang

Taub

Chen

(2021). Measuring the impact of COVID-19 induced campus closure on student self-regulated learning in physics online learning modules [Conference session]. Proceedings of the 11th International Learning Analytics and Knowledge Conference (pp. 110–120). Association for Computing Machinery.

109.

Zhang

Taub

Chen

(2022). A multi-level trace clustering analysis scheme for measuring students’ self-regulated learning behavior in a mastery-based online learning environment [Conference session]. Proceedings 12th International Learning Analytics and Knowledge Conference (pp. 197–207).

110.

Zheng

Lajoie

S. P.

(2020). The role of achievement goals and self-regulated learning behaviors in clinical reasoning. Technology, Knowledge and Learning, 25, 541–556. https://doi.org/10.1007/s10758-019-09420-x

111.

Zheng

Xing

Zhu

(2019). Examining sequential patterns of self- and socially shared regulation of STEM learning in a CSCL environment. Computers & Education, 136, 34–48. https://doi.org/10.1016/j.compedu.2019.03.005

112.

Zheng

(2016). The effectiveness of self-regulated learning scaffolds on academic performance in computer-based learning environments: a meta-analysis. Asia Pacific Education Review, 17, 187–202. https://doi.org/10.1007/s12564-016-9426-9

113.

Zimmerman

B. J.

(1986). Becoming a self-regulated learner: Which are the key subprocesses? Contemporary Educational Psychology, 11, 307–313. https://doi.org/10.1016/0361-476x(86)90027-5

114.

Zimmerman

B. J.

(2002). Becoming a self-regulated learner: An overview. Theory Into Practice, 41, 64–70. https://doi.org/10.1207/s15430421tip4102_2

115.

Zimmerman

B. J.

Moylan

(2009). Self-regulation: Where metacognition and motivation intersect. In Hacker

Graesser

A. C.

(Eds.), Handbook of metacognition in education (pp. 299–315). Routledge.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.02 MB

0.07 MB

0.13 MB

0.01 MB

0.14 MB