Abstract
Purpose
This integrative review mainly documents our team's two decades of research on Chinese integrated writing (IW) assessment based on funded projects. These projects related to IW, led by Xinhua Zhu as the principal investigator, were funded by the General Research Fund (GRF), the Quality Education Fund, the Education and Manpower Bureau/Education Bureau of the Government of the Hong Kong Special Administrative Region, as well as The Hong Kong Polytechnic University. The findings in approximately 40 papers on IW have been published in SSCI-indexed journals.
Design/Approach/Methods
We have employed various methods, including questionnaire surveys, interviews, and eye-tracking experiments, to investigate key issues concerning Chinese IW assessment: the measured construct, factors influencing Chinese IW performance, L1 and L2 IW associations, and related practices. Findings are thematically integrated to develop a systematic account of current perspectives in this field and suggest directions for various stakeholders to scaffold de facto IW learning, teaching, and assessment.
Findings
As a multifaceted construct, Chinese IW involves a dynamic integration of independent language skills and source use. Chinese IW task performance is linguistically-based and influenced by psychological and behavioral factors. The cross-linguistic research also revealed L1-L2 IW associations in general performance, subsumed skills, and related psychological factors. Findings greatly informed and improved related teaching, learning, and assessment practices, while significant room for improvement remains to appropriately align these practices with students’ writing development.
Originality/Value
Not only offering current perspectives, this review also suggests underexplored issues and methodological implications for researchers, guidance for educators and professionals to develop instructional activities and assessments, and insights for learners to find effective practices for honing their IW skills.
Introduction
Integrated writing (IW) has been widely acknowledged as a fundamental literacy skill in academic discourse communities, where learners are required to produce written responses after synthesizing and transforming information from multiple sources (Zhu et al., 2021a). Unlike independent tasks that elicit written responses solely based on writers’ prior resources and knowledge, integrated tasks require comprehending and using appropriate information from multimodal language resources such as texts, charts, audios, and videos to compose written responses. These tasks are believed to be more reflective of real-life writing practices, in which different language modalities are activated and employed in concert (Gebril & Plakans, 2009; Weigle, 2004). Furthermore, integrated tasks provide all the learners with the same source support, thus partly alleviating content bias and enhancing test fairness. The source materials possibly support learners in language, content, and structure when producing their written texts, thus gaining positive washback (Leki & Carson, 1997; Plakans, 2015; Shin & Ewert, 2015). Given the increased authenticity, fairness, and positive washback, IW tasks have gained increasing popularity during the past decades and have been included as a vital component of international and regional language assessment programs (Knoch & Sitajalabhorn, 2013; Zhu, 2005; Zhu et al., 2016).
To date, a diverse array of task types falls under the umbrella of IW tasks. Reading-to-write tasks emerge as the most commonly used ones in which learners are provided with reading materials to compose a text according to the task requirements (Knoch & Sitajalabhorn, 2013). Summary and response tasks are two representative variants of reading-to-write tasks. When completing a summary task, learners are asked to summarize the provided reading materials (Yu, 2010). They are also expected to express and argue for their views after comprehending and summarizing these reading materials in response tasks (Grabe & Zhang, 2013). The story continuation writing task, a new variant of reading-to-write tasks, requires students to read an incomplete story and logically and coherently continue writing the story (Wang, 2016). It has been used in college entrance examinations and subsequently adopted in other public and school-based writing assessments in Chinese mainland (Cheng & Qi, 2006; Zhao, 2016). Some IW tasks also involve listening input. This task type is adopted in the writing section of the TOEFL iBT. Test-takers respond to the TOEFL iBT IW task by composing an essay after reading a short passage and listening to a brief lecture (Alderson, 2009). Input materials can also be graphs, diagrams, and/or charts. A typical example of this task type is Writing Task 1 of the International English Language Testing System (IELTS), which requires producing a written response after decoding the graphic input.
Situated in Hong Kong, IW continues to play an essential part in language learning, teaching, and assessment. The traditional Chinese language curriculum and assessment primarily focused on reading, writing, and rote learning of prescribed texts to foster language proficiency in the 20th century (Hong Kong Curriculum Development Council [HKCDC], 2017). When facing the 21st century of booming information, HKCDC and Hong Kong Examinations and Assessment Authority (HKEAA) (2007) highlight the necessity for a much broader and more comprehensive view of Chinese language competence. Consequently, IW tasks were added to the Chinese language curriculum and assessment. Writing instruction in Hong Kong is implemented not merely to fulfil assessment requirements and enhance reasoning skills but also to foser the integrated use of language skills (Li et al., 2020). IW tasks are officially incorporated into the Chinese language assessment system as an essential part of assessing students’ source-based writing, authentic communication, and higher-order thinking skills.
In 2007, the HKEAA and the Education and Manpower Bureau (EMB, now the Education Bureau [EDB]) officially commissioned scholars in tertiary institutions to develop assessment standards for Chinese language competence. As a response to this call, Zhu (2005) undertook a project named “A Study of Setting Standards of Chinese Language Assessment in Reading, Writing, and Integrated Skills.” The standards produced in this project were adopted in the Hong Kong Certificate of Education Examination (HKCEE) Chinese Language Level Descriptors and Exemplars for Standards-Referenced Assessment (HKEAA, 2005) since 2007. They also served as the basis for designing school-based assessments of Chinese (Zhu, 2014; Zhu et al., 2016). The wide use of these standards has greatly promoted the transformation from a “norm-referenced approach” to a “criterion-referenced approach” in Hong Kong public examinations since 2007. Later, the HKCEE and Hong Kong Advanced Level Examination (HKALE) were replaced by the Hong Kong Diploma of Secondary Education Examination (HKDSE) in 2012. The fine-tuned standards are still used in HKDSE, with university admission being one of its purposes. The authors undertook an EDB research project titled “Research-Based Development on Merged Standard-Referenced Reporting Standards on Listening and Integrated Language Use in the Hong Kong Diploma of Secondary Education Examination for the Subject of Chinese Language” to address possible overlapping issues between independent listening and IW. The project findings were used to refine level descriptors and testing formats to ensure independent listening and IW tests tapped into different listening skills (Zhu, 2014).
Despite the prevalent use of IW tasks in the L2 testing context, research on some central issues, including construct conceptualization, factors contributing to IW performance, related teaching activities, and assessment task design and validation, is still in its infancy, especially in the L1 testing context. The associations between L1 and L2 IW skills are an even more underexplored area. Researchers and specialists have not reached a consensus on these crucial issues and are still working to seek further development (Barkaoui et al., 2013; Gebril & Plakans, 2009; Zhu et al., 2016). Our research team, led by Xinhua Zhu as the principal investigator, has conducted ongoing research in this field and disseminated results in a series of publications. It is time to reflect upon and synthesize these research results in an integrative and systematic manner, thus bringing collective insights for researchers, educators, professionals, and other practitioners. To guide this review, we limited our focus to the following research questions: (1) What construct is measured by the Chinese L1 IW task? (2) How do psychological and behavioral factors influence the Chinese L1 IW task performance? (3) Are there any L1-L2 associations underlying IW tasks? (4) How can we design and develop Chinese L1 IW assessment and pedagogical activities effectively? This review is more than a summary of our research findings regarding Chinese L1 IW. Rather, it critically examines these findings to stimulate further interest and advancement in this field.
Fundamental research on the Chinese IW construct
The core of IW tasks lies in the construct, which supports the interpretation and use of test scores. Our research seeks to analyze the IW construct in tripartite ways: examining the underlying factor structure, comparing IW and independent skill task performance, and exploring the use of source materials to clarify its distinctiveness.
L1 IW competence factors
Three important points warrant our attention when constructing the evaluative framework for Chinese IW assessment (Zhu, 2015a). Aligning with international IW assessment practices, it firstly concerns the integrative employment of multiple language skills, with writing at the core and listening and reading materials offering background information. Moreover, Chinese IW tasks do not simply measure subject-specific knowledge but also communication competence, especially students’ communicative function and contextualized language use. Finally, Chinese IW tasks involve multiple levels of information processing, including comprehension, comparison, summarization, inference, argumentation, evaluation, and use of source information to support students’ original and creative ideas. Drawing extensively on the above three points, Zhu (2015a) proposed a theoretical framework, namely the Four Traits of Integrated Writing Competence (or the four-pillar framework), to underpin the construct representation of Chinese IW assessment. It consists of four pillars: citation and synthesis, original opinion and argument, contextual awareness, and writing expression and organization. According to Canale and Swain (1980), communicative competence highlights the contextual appropriateness of language use in addition to grammatical and discoursal accuracy. It remains true in the writing domain. Chinese IW tasks are designed to measure students’ ability to communicate in real-life contexts with the use of linguistic expressions, which gives rise to the pillars of contextual awareness and writing expression, and organization. More specifically, the pillar of contextual awareness concerns contextual and cultural aspects of authorship, readership, and writing conventions, while the pillar of writing expression and organization involves the quality of language use, logic, and macrostructure. Real-life communication is context-dependent, which does not occur in a vacuum. The background context in Chinese IW tasks is built on the provided materials. Students are expected to compose a text after processing multiple sources of materials. Thus, we draw on Spivey's (1984, 1990) discourse synthesis to develop the pillar of citation and synthesis, which involves comprehending and using information from multiple sources. Students’ performance in L1 IW tasks is also assessed in terms of their higher-order thinking skills, including evaluation, argumentation, and creativity (Cheong et al., 2018). The last pillar of original opinion and argument, in this way, focuses on presenting and arguing for original and creative ideas. This framework has been implemented in Chinese public examinations and school-based assessments in Hong Kong secondary schools.
To empirically validate the four-pillar framework, Zhu et al. (2016) conducted a factor analysis on a group of secondary students’ listening-reading-writing task scores. They examined ten indicators closely relevant to Chinese IW performance: identification, tone, writing convention, interaction, synthesis, citation, original opinion, argument, language use, and organization. Consistent with the four-pillar framework, four common factors were extracted in the exploratory factor analysis, which collectively accounted for a large proportion of the variance of students’ task scores. Notably, the factors of citation and synthesis and original opinion and argument, assessing higher-order thinking and information-processing skills, contributed substantially to overall IW performance while they were regarded as the most challenging aspects (Zhu & Wu, 2014; Zhu et al., 2016).
Relationship between L1 IW and independent language skills
Chinese L1 IW assessment requires a hybrid of independent language skills, i.e., listening, reading, and writing. Our studies compared IW and independent skill tasks to identify variations in writing outcomes, thereby addressing the construct overlap and supporting the use of IW tasks as distinct language assessment measurements.
Zhu et al. (2016) compared students’ performance in the Independent Listening task and the Listening-Reading-Writing task in the Hong Kong Chinese Language Examinations. The joint factor analysis results indicated no common factors between the two types of tasks. Students’ performance in the two types of tasks was significantly correlated, while indicators of the Independent Listening task explained only 8.9% of the variance in Listening-Reading-Writing task scores. It indicates a slight overlap in the construct that the two task types assess. IW tasks require reading comprehension within a text and across texts (Karimi, 2017). Zhu et al. (2021a) compared the effects of students’ L1 single- and multiple-text comprehension on L1 IW performance simultaneously. The latent variable path analysis results showed that both single- and multiple-text comprehension positively predicted IW performance, thereby verifying the reading-writing connection (Ahmed et al., 2014; Graham et al., 2018; Kim & Crossley, 2018). Independent writing also promotes IW performance because it has certain composing processes, such as planning, translating, and reviewing, shared with IW (Cheong et al., 2022). Independent language skills not only contribute to IW performance independently but also interact or mediate to shape IW performance (Cheong et al., 2018). For example, the interaction between reading and listening significantly influences IW performance (Cheong et al., 2018), and writing mediates the relationship between reading and IW (Cheong et al., 2022). Comparatively, the impact of independent listening and reading on IW performance is overshadowed by independent writing (Liao et al., 2021).
Therefore, the Chinese IW assessment goes beyond a mere amalgamation of independent listening, reading, and writing skills. IW demands a certain level of knowledge and cognitive processing akin to those required in independent tasks, according to significant correlations and regression coefficients found in our previous studies (e.g., Cheong et al., 2018, 2022; Liao et al., 2021; Zhu et al., 2016, 2018, 2025). However, the substantial amount of unexplained variance in IW performance offers compelling evidence for its distinctiveness (Liao et al., 2021; Zhu et al., 2016). It notes the need for an inclusive approach to integrating multiple independent skills and unique processing, such as source use under the IW construct representation.
Source use
As Cumming et al. (2005) aptly pointed out, IW tasks require students to “produce writing compositions that display appropriate and meaningful uses of and orientations to source evidence, both conceptually and textually” (p. 34). As an inter-textual processing skill unique to IW, source use is a complex meaning-making process requiring comprehension, integration, and transformation of source information to create written products (Gebril & Plakans, 2009). Spivey (1984, 1990) conceptualized three types of cognitive processing involved in source use: organizing (i.e., comprehending and organizing source materials), connecting (i.e., connecting information across source materials and linking source information to the writer's ideas), and selecting (i.e., determining and choosing appropriate ideas for use in the essay). The role of source use in IW is multifaceted and includes acquiring background knowledge and viewpoints, gathering perspectives or evidence, selecting words and phrases from the materials, and emulating text structures to organize one's writing. Source use has been an active research area in EFL contexts. Previous studies have examined source use features and processes, source functions, how source use differs across score levels and task types, scoring rubrics, and the rating process and consistency (Kyle, 2020; Ohta et al., 2018; Shin & Ewert, 2015; Uludag et al., 2019; Xie, 2023). Noticeably, source use is also central to studying L1 IW construct and processes (Cheong et al., 2023).
We have extended the existing literature to Chinese L1 IW contexts. Zhu and colleagues (2021b) examined Hong Kong secondary students’ discourse synthesis skills in the Chinese L1 IW task and their effects on final task performance. Hierarchical regression analysis results revealed that students’ discourse synthesis skills, that is, quotation, summarization, and connection, explained 63.6% variance of L1 IW scores. Their study further examined discourse synthesis processes between high- and low-performing writers by combining eye-tracking and stimulated-recall interview data. High-performing students had a higher level of visual engagement in processing relevant information than their low-performing peers. Similarly, Cheong et al. (2021) observed that secondary students’ source use was closely connected to their written argumentation construction in L1 IW. The participants could comprehend and summarize competing information contained in source materials while they struggled to integrate source information to support their written arguments. Thus, student writers may not be competent enough to engage in higher complexity of source use processing even in L1 writing.
Our studies so far have revealed the determining role of source use in L1 IW performance. L1 writers possess the skills of comprehending, summarizing, and paraphrasing multiple sources of materials, while they are still facing difficulties in transforming and integrating source information into their writing. Consequently, care should be taken in cultivating higher-level source integration skills among these writers.
Further research on psychological and behavioral factors
IW, in its very nature, is a hybrid of multiple independent language skills and source use; it is particularly challenging and time-consuming for language learners due to the high demand for individual motivation, determination, and effort, even in the L1 context (Kormos, 2012; Pajares, 2003). Learners should be cognitively, psychologically, and behaviorally engaged to achieve their learning goals in IW. Thus, researchers and educators should pay special attention to individual psychological and behavioral factors that possibly cause differential IW performance.
Motivational and emotional factors related to Chinese IW performance
Future selves have been conceptualized as the primary driving force for language learning (Dörnyei, 2005, 2009). Influenced by positive learning psychology, we have empirically clarified future selves (i.e., ideal and ought-to L2 self) specifically regarding IW. The ideal self refers to the desired writer images that learners hope to pursue in the future. Situated in the story continuation task setting, Cheong et al. (2022) found that high school students’ ideal selves significantly and positively predicted final IW performance, despite the marginal magnitude. Students’ ideal IW self was in tandem with their imaginative capacity to facilitate writing performance. Zhu et al. (2022a) revealed a modest relationship strength between the ideal self and IW performance, which further supported the motivation-writing achievement associations. Similarly, Zhan et al. (2023) also unveiled the positive and significant predictive effects of the ideal self on IW performance. Students with more entertaining images of future writers were more likely to score higher in the IW assessment. However, as intrinsic motivation, the direct effect of the ideal self on IW performance tends to diminish, especially in an examination-oriented educational context (Li et al., 2018; Zabihi, 2018; Zhao et al., 2023).
Students’ current beliefs about writing abilities also play a role in their IW performance. A substantial body of research has focused on the impacts of self-efficacy on L2 writing performance (e.g., Bai et al., 2021; Hirvela et al., 2016; Teng et al., 2017) and reached an agreement concerning its importance. We extend this line of inquiry to L1 IW (Xu et al., 2023; Yao et al., 2023). Triangulating questionnaire and interview responses, Yao et al. (2023) empirically validated the five-dimensional framework of L1 IW self-efficacy: self-efficacy for ideation, source use, conventions, negative emotion control, and concentration. They categorized the participants into three groups of efficacious students via latent profile analysis (LPA). The three groups of participants reported above-medium levels of L1 IW self-efficacy, indicating relatively high confidence in performing L1 IW-related tasks. Unexpectedly, the three groups of efficacious participants had poor performance in the L1 IW task, and this study found no positive effects of self-efficacy on L1 IW performance. It could be partially attributed to the fact that the participants got slack with L1 IW tasks since they were not incorporated into the course grading system. Similarly, Xu et al. (2023) found no evidence for the predictive effects of self-efficacy on L1 IW performance, while it was identified as a significant predictor of L2 IW performance. Such contrasting results suggest the ceiling effect: Self-efficacy works prominently in tasks of higher complexity, that is, in the L2 IW context.
IW performance is also susceptible to the impact of writers’ positive and negative emotions. Zhu et al. (2022b) examined writers’ enjoyment and anxiety profiles in IW assessment and their relationship with IW performance. Students in the moderate-enjoyment/low-anxiety group scored highest in the IW assessment, while those in the low-enjoyment/high-anxiety group scored lowest in the IW assessment. There is no clear-cut conclusion concerning the effects of emotions. Positive emotions like enjoyment are better maintained at a moderate level since an excess of such emotions may inhibit language performance by occupying cognitive resources for purposive learning (Bielak, 2022). Negative emotions like anxiety are not necessarily associated with poor performance.
Mediating effects of behavioral factors on Chinese IW performance
Following the “motivation → behavior → outcome” chain (Dörnyei, 2005, p. 73), behavioral factors assume a mediating role in the relationship between psychological factors and IW performance. Students possibly capitalize on their expectations of future writer end-states to fully improve their writing performance with motivated behavior (Papi, 2010). Zhan et al. (2023) examined the mediating role of feedback-seeking behavior in the relationship between the ideal self and continuation story writing performance among a group of Chinese high school students. They revealed a mixed picture concerning the indirect path of “ideal-self → feedback-seeking behavior → story continuation performance”: It was statistically significant among the high- and mid-performing students, while not among the low-performing ones. The mediating role of feedback-seeking behavior, influenced by embedded cost and value, tended to be sophisticated in the process of reducing self-discrepancy to achieve future writing goals. Future images and joyful feelings potentially boost the use of various strategies to maintain the arousal of positive experiences in writing (Kormos, 2012; Papi & Khajavy, 2021). Zhao et al. (2023) empirically tested the mediator of strategy use in the “motivation → behavior → outcome” chain. They found that the ideal self and enjoyment only showed a significant indirect influence on IW performance via the mediation of self-regulatory strategies. These motivational variables can exert positive effects on IW performance by evoking more self-regulatory strategies.
To sum up, Chinese IW performance interacts with a diverse range of individual features. Our research has shed light on several motivational (i.e., ideal self and self-efficacy), emotional (i.e., enjoyment and anxiety), and behavioral (i.e., feedback-seeking behavior and strategy use) factors in this domain. Nevertheless, it seems to be challenging to reach a definitive consensus on how these individual factors impact IW performance due to their nuanced essence and intricate links. Researchers are expected to continue this line of research systematically, thus offering more profound insights.
Extended research on L1-L2 associations
Cummins’ (1979) Linguistic Interdependence Hypothesis (LIH) postulated that there is a common underlying proficiency (CUP) shared between L1 and L2 literacy skills, facilitating the cross-language transfer of knowledge, skills, and strategies. Some cross-language studies have found similarities in strategy use and general performance in L1 and L2 writing (e.g., Guo & Huang, 2020; Pae, 2019). Our research has extended the extant literature by conducting within-subject comparisons in L1 Chinese and L2 English IW, thus advancing Cummins’ (1979) theory of L1-L2 cross-language transfer with IW insights.
L1-L2 associations in general IW performance and subsumed skills
Administering L1 and L2 intertextual inference verification and IW tasks to 415 secondary students, Cheong et al. (2019) observed that multi-textual comprehension was consistently applied to reading-to-write tasks in both language contexts. A high level of similarity was found between L1 and L2 intertextual inference verification task performance and IW performance. Therefore, L2 learners are possibly advantaged to capitalize on multi-text comprehension and IW skills acquired in L1 to leverage their L2 IW performance. Similarly, Zhu et al. (2021a) found a significant predictive effect of L1 reading comprehension, including single-text and multiple-text comprehension, on L1 and L2 IW performance. The results indicated that L1 reading comprehension was positively associated with L1 and L2 IW performance, thus demonstrating the L1-L2 connections between reading and IW. Additionally, L1 discourse synthesis skills of quotation, summarization, and connection have been found to facilitate L2 IW performance cross-linguistically (Zhu et al., 2021b). Cheong et al. (2021) delved into the L1-L2 association of argumentation behaviors in IW. Although student writers scored higher in argumentative structure and reasoning quality in L1 IW than in L2 IW, their tendency to favor source information in support of personal views remained similar across both IW contexts. Thus, source-based argumentation skills seem to be a stable resource that can be invariantly accessed and utilized in both L1 and L2 IW. L2 IW performance is influenced not only by related skills in this language but also by parallel skills in L1 (Xu & Zhu, 2025).
L1-L2 associations in IW-related psychological factors
We have also examined psychological factors cross-linguistically to determine whether these factors contribute to the shared pool underlying L1 and L2 IW. L2 writing is an activity interacting with social, cultural, and educational contexts (Kormos, 2012). Correspondingly, learners’ ideal image of L2 writers that they aspire to pursue in the future is possibly influenced by their L1 sociocultural environment. In the context of tertiary education, Zhu et al. (2022c) concluded that Chinese EFL learners’ enthusiastic self-images of their L1 IW had a constructive effect on those in L2 IW based on the significant path coefficient from L1 to L2 IW ideal selves. These students shared a common future image between L1 and L2 IW and valued the learning of IW in both language contexts similarly. Learners’ current beliefs of IW ability, namely IW self-efficacy, are also strongly linked in L1 and L2 contexts. Learners who possessed a high level of confidence in their L1 IW were more likely to engage with their L2 IW, and vice versa (Xu et al., 2023). Learners also exhibit a stable and consistent level of favor or disfavor toward IW ability between L1 and L2 contexts (Zhu et al., 2024). In other words, they held equal attitudes toward IW across the two language contexts.
Our cross-linguistic studies collectively uncover the L1-L2 associations in general performance, subsumed skills, and psychological traits specifically regarding IW. In addition to empirically testing the LIH and CUP hypotheses, they revealed the nexus of L1 and L2 IW and noted the importance of collaborative language teaching to enhance bilingual development.
Applications concerning Chinese IW teaching and assessment
Knowing how to write in concert with other language modalities is challenging not only for students to learn but also for policymakers and instructors to assess and teach (Graham & Perin, 2007; Nelson, 2008; Pecorari & Petric, 2014). We have continually kept an eye on the perspectives of Chinese IW stakeholders and engaged in transferring empirical results into de facto pedagogical and assessment practices.
Student and teacher conceptions of Chinese L1 IW and its instruction
Zhu and Wu (2014) found that Hong Kong secondary students faced more difficulties in IW tasks than in independent ones according to their questionnaire responses. These students also expressed a need for IW guidance and instruction, especially in selecting and using source information to create valid arguments and opinions in written texts. However, there are still no unified conceptions of the IW nature and the purpose of IW instruction among teachers. Li et al. (2020) interviewed 25 Chinese language teachers and identified three main conceptions representing IW competence: a composite of disconnected parts, a logical inquiry, and a developmental process. Correspondingly, they held significantly different purposes for administering IW teaching practices: to fulfil examination requirements, to promote pre-writing activities enhancing students’ reasoning skills, and to develop holistic language abilities through the comprehensive employment of language skills. Unfortunately, the exam-oriented view of IW competence and corresponding teaching purposes, less aligned with curriculum objectives, prevails among these teachers. It may result from a lack of venues and resources for professional training, low receptivity to the legitimacy of IW assessment and curriculum, and fossilized format and content of IW tasks in public examinations. Therefore, more pedagogical and political effort is needed to help students go through the IW dilemma and to aid teachers in solving conceptualization and instruction puzzles.
Student self and peer feedback/assessment in Chinese L1 IW
Feedback involves “dialogic processes whereby learners make sense of information from various sources and use it to enhance their work or learning strategies” (Carless, 2016, p. 1). As an alternative formative assessment, feedback is gaining popularity in higher education to improve students’ writing performance (Adachi et al., 2018). To capitalize on the benefits of feedback, Lu et al. (2021) compared the differential characteristics of self-feedback and peer-feedback in an academic abstract writing task where students were required to compose an abstract after reading one research paper. Their results revealed that both feedback modes significantly contribute to students’ writing improvement despite the remarkable differences in the amount, type, and implementation. Self-assessment can complement peer assessment in guiding students to revise when the latter is lacking, forming a reference source to increase suggestion quality when peer assessment is at hand, evoking lasting and systematic reflections, relieving social-affective burdens, and mitigating the suggestion quality discrepancy from feedback givers of different proficiency (Cheong et al., 2023). Worth noting is that feedback quality matters in IW instead of feedback quantity. Excessive feedback added an extra burden on students’ cognition and memory resources, which in turn caused undesirable effects on writing performance (Patchan et al., 2016). Meanwhile, feedback and its effects on writing improvement are influenced by individual characteristics. For example, low self-efficacious students were more likely to face difficulties in conducting self-reflections and giving feedback, while high self-efficacious students were more active in seeking advice and giving feedback based on self-reflections and peers’ written texts (Wei et al., 2024). Lu et al. (2024) followed students’ writing revision amount and function after receiving self- and peer-assessment. Both the quantity and the type of revisions played a pivotal role in writing improvement. Meanwhile, students’ self-rating accuracy moderated the effects of revision. Revision amount benefited students with low self-rating accuracy more, while the revision function enhanced writing quality among students with high self-rating accuracy.
As a concluding note, self and peer feedback/assessment potentially boost IW improvement. Thus, their combined use is recommended in writing classrooms to optimize students’ IW performance. Also, teachers and instructors should attend to the specifics of the two formative assessment modes to maximize the positive effects and to learner characteristics in IW.
Teacher pedagogical and assessment practices of Chinese L1 IW
We conducted several research projects (e.g., the Hong Kong EDB project “Research and Development Project on Developing Students’ Language Competence in the Integrative Use of Language Skills and Strategies in Chinese Language Education at the Secondary Level” and the Hong Kong Quality Education Fund project “Enhancing Students’ Multi-Level Processing Competency of Multi-Source Information in Integrated Language Tasks in Secondary Schools”) to empower teachers in adapting to curriculum changes and designing school-based IW instruction activities and assessment tasks. After collecting teachers’ views toward Chinese IW assessment, Zhu (2015b) recommended four avenues for enhancing professional development: strengthening the understanding of IW assessment as a measurement of students’ problem-solving ability in real-life language use, promoting practical skills in designing IW assessment tasks and using scoring rubrics, guaranteeing students’ access to evaluating IW performance, and adjusting instructional activities aligned with students’ IW development. Specifically, he proposed several fundamental strategies for successfully implementing school-based Chinese IW assessment (Zhu, 2017). The ability-oriented nature, authenticity, fairness, and positive washback for teaching Chinese IW assessment should be considered with high regard. It is also necessary to calibrate IW indicators with information processing. Moreover, authenticity should be operationalized in theme determination, selection, and adaptation of source materials, and prompt writing. Our research team also developed a teaching kit to assist teachers in fostering students’ multi-level processing competence of multi-source information in integrated tasks (Zhu et al., 2018). We offered professional guidance and support for those teachers throughout the pilot process of the teaching kit, including conducting meetings to introduce the design rationale and application procedure before the pilot, inviting experts to address any problems those teachers encountered in the application, and providing teachers the access to our classroom observation and analysis reports to reflect on the teaching effects.
These scholarly and professional endeavors have consequently led to significant paradigm shifts from “norm-referenced tests” to “criterion-referenced tests,” from “teaching for examination” to “teaching to develop skills,” and from “rote learning” to “ability-oriented approach” in Chinese education of Hong Kong.
Conclusions, implications, and suggestions for future research
Chinese IW as a multi-faceted construct that involves intricate integrations between related independent skills and source use
By reviewing our research systematically, several statements concerning the Chinese IW construct can be made. At its very nature, the Chinese IW construct can be validly represented by “four pillars”: contextual awareness, citation and synthesis, opinion and argument, and written expression and organization (Zhu, 2005; Zhu et al., 2016). Besides appropriate language use and structure, Chinese IW tasks also assess the ability to present original and creative ideas after using the sources in line with cultural and contextual factors embedded in task requirements. Also, our studies have revealed strong associations between students’ independent and integrated task performance, while independent skills accounted for a relatively low proportion of the variance in IW performance. Thus, the second statement concerning the IW construct is that it is not simply a summation of writing and other independent language modalities but a reciprocal integration between them. Specifically, IW tasks require students to produce written texts after a copious array of source processing, that is, organizing, connecting, and selecting, which is not available in independent ones. Spivey's discourse synthesis theory, rooted in English contexts, is also evident for Chinese L1 IW. Chinese students are facing difficulties in transforming and using the source language, content, and organization in their writing. There is a threshold level of language proficiency for L2 learners to process source materials in IW tasks (Cumming, 2013); the threshold also exists for L1 learners to have higher-order source use in such tasks.
We seek to understand the Chinese L1 IW construct through three subfields of research: extracting competence factors underlying students’ performance, comparing students’ performance in independent and integrated tasks, and examining source use and its contribution to IW performance. The three subfields of research altogether answer the core question of the Chinese L1 IW construct. It is a multifaceted construct that involves the integration of related independent language skills as well as the ability to comprehend and use source materials for writing. We hope this focused definition could serve as a theoretical source addressing the core question of what has been measured by Chinese IW tasks and informing the design and development of related assessment tasks (Cumming, 2013; Plakans, 2015). However, the lack of process-oriented efforts in this vein of research should be noted. Evidence concerning Chinese L1 IW strategies and processes in both real-life and testing contexts is important, as it provides in-depth insights and specifications about the underlying construct. Thus, we recommend that future research, building on and extending what we have found, collect and analyze Chinese L1 IW process data from students of multiple education levels over time and across different task conditions to capture its complexity and nuance.
Chinese IW performance as a result of the complex interplay of individual difference factors
Writing is a self-initiated and self-sustained process in which students are motivationally, affectively, and behaviorally engaged to achieve desired goals (Schunk & Zimmerman, 2011). Writers are expected to exhibit variations in their current beliefs (i.e., self-efficacy), future images (i.e., ideal self), emotional profiles (e.g., enjoyment and anxiety), and behavioral engagement (e.g., strategy use and feedback-seeking behavior), ultimately causing differentiated IW performance. Our team has conducted ongoing research to uncover individual difference factors specifically related to IW tasks and how they influence students’ IW performance separately and jointly (Xu et al., 2023; Yao et al., 2023; Zhao et al., 2023). This line of research yields meaningful but mixed findings concerning the effects of these psychological and behavioral factors on IW performance. We acknowledge the importance of these factors in the IW domain, while it is impossible for us to make any concluding statements about the extent to which a specific individual difference factor under investigation facilitates or constrains final IW performance. It seems understandable as these factors are largely constrained by the external environment and continually fluctuate within individuals (Kormos, 2012).
Despite the absent consensus in the extant research, we can still suggest three key considerations for researchers in this field. Regarding methodology, our research has predominantly employed a quantitative design with statistical analyses of questionnaire responses and task scores. To provide robust interpretations and deep insights, it is crucial to incorporate multiple triangulated data sources. Also, the scope of our research in this area is limited to task-situated writing. Assessing students’ IW performance solely based on one IW task may lead to misleading characterizations of their writing abilities. Therefore, we may recommend prolonging student writers’ participation in IW tasks and documenting their IW performance over time to fully capture students’ writing abilities. Finally, future research is warranted to investigate how these individual difference factors influence specific indicators of IW performance, thus uncovering the rich nuance of their relationship. Individual difference factors are not shaped in isolation, and they are more likely to interact with each other to influence IW performance. It deserves scholarly attention with various psychological and behavioral factors involved in one study, thus further specifying the “motivation → behavior → outcome” chain proposed by Dörnyei (2005, p. 73).
L1-L2 associations empirically proved in IW general performance, subsumed skills, and psychological factors
Our research has also documented L1-L2 associations of general performance, subsumed skills, and psychological factors in the IW domain through within-subject comparisons. Chinese EFL learners had similar general performance, reading comprehension, discourse synthesis, and source-based argumentation with direct relevance to L1 and L2 IW performance. Their future images, current beliefs, and attitudes of IW are also highly connected in the two contexts. L1 IW skills and related mental states have the potential to support the development of L2 IW cross-linguistically. Therefore, this line of research empirically tests the common underlying proficiency shared between L1 and L2 IW to some degree, and Chinese EFL learners are possibly advantaged to perform L2 IW tasks due to their experience and resources already existing in the acquisition of L1 IW (Cummins, 1979). Given the cross-linguistic facilitation effect of L1 IW skills and influential factors on L2 IW counterparts, Chinese EFL learners are encouraged to draw upon their prior L1 IW experience and resources as a schema to activate L2 IW knowledge structure and cognitive framework (Pae, 2018). It underscores the importance of L1-L2 collaborative teaching to enhance teachers’ understanding of students’ IW competence and related variables. Joint talks and sharing workshops possibly help explore new avenues for cultivating successful writers with insights from the writing development of another language.
Meanwhile, we still observed the scarcity of cross-linguistic IW investigations. Three areas can be suggested for future research to move forward. To begin with, more attention should be paid to the variant L1-L2 associations in IW. L1 and L2 IW skills and factors may differ as a function of individual features and task characteristics (Jarvis & Pavlenko, 2008; Larsen-Freeman, 2013; Pae, 2018), indicating the possible existence of moderators that facilitate or constrain their relationship strength. Research on these moderators can help lift the veil of language learning mechanisms embedded in our brains (Sparks, 1995). Besides, previous studies have evidenced the bi-directional L1-L2 interactions (Su, 2010). The cross-linguistic influence occurs in both directions from L1 to L2 and from L2 to L1. Compared to the influence of L1 in L2, less research examines the backward influence of L2 in L1. Thus, future studies can extend prior ones by looking at L2 influence in different levels of L1 skills to mitigate possible conflicts between L1 and L2 learning and increase their reciprocal development. Another direction concerns the methodology. Our cross-linguistic research has relied on large-scale questionnaires and tasks to provide statistical evidence for the relationships between L1 and L2 IW-related skills and variables. Researchers are encouraged to adopt multiple and triangulated methods to collect data across different socio-cultural contexts and time points so as to add valid and dynamic insights into L1-L2 IW associations.
Challenges and prospects of developing standards and strategies for assessing and teaching Chinese IW
It suffices to assert that our research results have benefited Chinese IW teaching and assessment in Hong Kong. Research-informed policies of public examination and teacher training seminars continue to fuel related practices. However, we rarely gather feedback from teachers and students regarding the impact of these policy changes and training seminars. There is a long way to go for practitioners to develop IW assessment tasks and tailor instructions in a way that is compatible with students’ holistic writing development.
Our research in this vein is still in an early but burgeoning stage with a specific focus on the standardized testing context, that is, HKDSE. We hope future research can attend more to classroom-based IW tasks, thus shifting the foci away from writing test results to promoting learning-focused assessment (Zheng & Yu, 2019). IW tasks represent not purely a valid measurement tool but also a means to provide diagnostic information for students’ writing, which in turn informs related teaching and learning (Lee, 2023). Teachers and test boards can explore systematic ways to integrate the value of writing processes and the diagnostic potential of IW tasks in the future. Furthermore, technological advancements such as artificial intelligence (AI) tools have greatly impacted various professional and academic domains, including writing (Barrot, 2023). The emergence of ChatGPT has brought the impact to a peak. As one of the most influential AI tools, ChatGPT is capable of engaging users “in natural and human-like interactive experiences through audio or text.” Teachers and test professionals can be trained to use ChatGPT to document students’ IW processes, access and select source input and practices for IW tasks, and offer specialized feedback on IW texts.
In conclusion, this review offers collective insights into core issues concerning Chinese IW tasks. It sheds light on the underlying construct, influential factors, L1-L2 associations, and related assessment and pedagogy practices after critically synthesizing our research over the past two decades. This line of research makes a unique contribution to the existing IW literature by extensively examining the Chinese L1 context. Noticeably, we tap into Chinese IW mainly from a perspective of cognitive process, which is not constrained by language and cultural settings. Therefore, studies are hoped to inform multilingual instruction by adopting the proposed four traits, that is, contextual awareness, citation and synthesis, original opinion and argument, and writing expression and organization across different language contexts and cultures to see if they remain inherent dimensions of IW competence. Besides, we have employed multiple methods such as questionnaire surveys, interviews, and eye-tracking recordings in our studies. A more diverse range of methods and a longitudinal design are still necessary to collect rich sources of triangulated data for the construct representation, the relationship among those involved variables, and the changing patterns of the inquiry. Moreover, there is a greater need for researchers to transfer what they have found into actual practices that promote IW competence compatible with students’ language proficiency and target educational and cultural settings. Such academic knowledge transfer requires collaborations with various stakeholders, including policymakers, educators, and assessment boards.
Footnotes
Acknowledgments
The authors would like to thank Guoxing Yu, Choo Mui Cheong, Xian Liao, Yaping Liu, Pengfei Zhao, Qi Lu, Yi Guan, Wei Wei, Guanying Li, Jianlin Zhan, Hongbo Wen, and Na Luo for their collaboration with Xinhua Zhu (as PI)'s relevant research projects on integrated writing.
Contributorship
Xinhua Zhu was responsible for conceptualization, supervision, securing grants, and revision. Wandong Xu, Jialin Li, and Yiwen Sun contributed by conceptualizing, drafting, and revising the review drafts. Siyu Zhu and Wanru Pang identified, collected, and organized related empirical studies. Yuan Yao commented and revised the review drafts. All authors read and approved the final manuscript.
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Ethical statement
Consents from all the participants in our previous empirical studies have been collected before data collection, and ethics approval for this review has been obtained from the Institutional Review Board of The Hong Kong Polytechnic University (number: HSEARS20211102003).
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This paper was supported by the General Research Fund (GRF) of University Grants Committee (UGC) 2022/23 (15607722).
