Abstract
Although reading in the digital age looks different than in the past (e.g., on a screen, shorter texts spread across multiple sources), reading continues to be a central part of everyday life. Research in reading comprehension shows that this type of modern reading requires more complex knowledge building inference processes that are difficult for many adolescents and adults. Constructive and personalized learning activities can support readers’ knowledge building. Researchers have been using artificial intelligence (AI) to make these types of activities more effective and accessible. Emerging directions and considerations also result from the introduction of generative AI. Increased collaboration across researchers, developers, educators, and policymakers would afford empirically supported research, development, and implementation that can keep pace with the quickly evolving technological landscape.
Tweet
The digital age has changed how people read and learn. Comprehension theory and the science of learning suggest how to support today's readers and how researchers implement personalized, constructive learning activities using AI-based systems.
Key Points
Many adolescents and adults need support for the reading comprehension challenges that encounter in their everyday lives.
Readers are knowledge builders who need opportunities to engage in constructive activities that are personalized to their skills, knowledge, and interests.
AI-based tools can facilitate constructive and personalized learning activities in ways that help more people become better readers.
The rapid development of AI demands that policymakers, researchers, developers, and educators engage in collaborative and accelerated research and development toward theory-driven technologies that are effective and equitable.
Although the digital age has transformed the types of texts that people read and how they read them, reading remains central to how people acquire knowledge. Unfortunately, many adolescents and adults are not ready for the reading comprehension tasks that they will encounter in their classrooms, at work, and in their everyday lives (e.g., ACT, 2018). Researchers from across the brain and behavioral sciences alongside their colleagues in education have identified what challenges these struggling readers face and how to best support them to have better comprehension during individual reading tasks and to become better readers over time. In addition, the rapid development of artificial intelligence (AI) and AI-based technologies stand to greatly advance society's ability to support people's reading comprehension. On one hand, AI offers unprecedented opportunities to engage learners in theory-informed activities that can improve comprehension and to do so in ways that meet the learner where they are, so that they can build their skills. On the other hand, AI presents several considerations that ought to be studied before such technologies are indiscriminately implemented.
For many people, the idea of AI still feels like science fiction, yet AI is everywhere: smart home devices, predictive text in a Google search, or crafting an email all use AI. More recently, powerful generative AI tools have come online and tools such as ChatGPT serve as a way for anyone to be able to leverage this state-of-the-art technology with no programming experience necessary. Such ubiquity invites policymakers, educators, and the public to deepen their understanding of best practices uncovered by the science of learning and the ways in which AI is being used to support learners. After a brief overview of modern reading comprehension, the review then examines three broad ways in which researchers are leveraging AI in this space—examining the current state-of-the-art, ongoing challenges, and opportunities for future research.
Reading Comprehension in the Digital Era: The Modern Reader
Magliano and colleagues (2017) used the idea of the “The Modern Reader” to explore how the digital age has changed the way that people consume information. The modern reader uses reading as a knowledge building activity toward some goal: a college student attempting to complete coursework; an employee trying to learn a new procedure; a citizen who needs to make a high-stakes medical or political decision that will impact their family. As these examples highlight, reading comprehension is complex and purposeful (Britt et al., 2022). Readers need not only read a text and be able to answer simple questions about it, but they need to navigate across multiple, oftentimes conflicting sources in order to synthesize, evaluate, and communicate about complex ideas (Barzilai & Strømsø, 2018) and need to do so in specialized content domains (Goldman, 2012; see also Snow, 2002).
Unfortunately, reports show that adolescent and adult readers are underprepared to engage in these types of knowledge building tasks. PIAAC results suggest that nearly half of U.S. adults struggle to read texts that are “dense or lengthy and include continuous, noncontinuous, mixed, or multiple pages of text” nor can they complete tasks that require one “to identify, interpret, or evaluate one or more pieces of information” or “to construct meaning across larger chunks of text” (OECD, 2013). The majority of graduating high school seniors are unable to read at a proficient level (ACT, 2018) and 60%-75% of first-year college students are underprepared to read for their college courses (Perin, 2020).
Research in the fields of text and discourse processing has addressed these concerns by attempting to better understand what processes, skills, and knowledge are brought to bear as readers engage in complex comprehension tasks. Comprehension theories suggest that reading comprehension involves readers constructing a mental model of the topic at hand (McNamara & Magliano, 2009). Researchers conceptualize this mental model as a connectionist network of nodes and links. For better comprehension (i.e., a more elaborated and coherent mental model), readers need to not only process the explicit words and ideas, but to generate inferences that help to create the links across ideas in the text and to connect the text content with their existing prior knowledge (McNamara & Magliano, 2009). Broadly, the research suggests that (a) aspects of the text, task, and reader influence the quantity and quality of the inferences that a reader generates and that (b) instruction and scaffolding can help learners to construct more inferences that support a higher quality mental model.
Research further suggests that modern, purposeful reading is heavily dependent on a learner's prior knowledge (Cervetti et al., 2020; McCarthy & McNamara, 2021; Kendeou et al., 2023). Wang and colleagues (2021) had students complete a traditional multiple-choice reading comprehension test and a scenario-based assessment designed to reflect modern reading. Both tests used texts about American football. While the traditional test performance was strongly predicted by basic reading skills and general academic knowledge, performance on the scenario-based assessment was most strongly predicted by learners’ knowledge of football. Thus, there is renewed interest in how to measure and leverage what a learner already knows to help them build their knowledge (Hattan et al., 2023; McCarthy & McNamara, 2021; Simonsmeier et al., 2022).
Helping the Modern Reader: Exploring the Potential of AI
As the tasks for the modern reader have become more complex, researchers and educators have needed to develop methods of instruction, scaffolding, and evaluation to match that complexity. One emerging area is the use of AI to study and support comprehension. AI refers to a range of technologies that allow computers to perform tasks that would typically require human cognitive abilities. More simply, AI analyzes old data to identify patterns that allow it to make predictions about new data. AI allows computers to automate repetitive or complex tasks in ways that make complex activities more scalable. One prominent subset of AI is natural language processing (NLP), which focuses on developing methods for computers to understand, interpret, and generate human language. NLP tools can be used to extract various aspects of language, from individual words to the text as a whole. In the last decade, significant advancements have been made in NLP, particularly with the introduction of large language models (LLMs) such as BERT (Bidirectional Encoder Representations from Transformers) and GPT (Generative Pre-training Transformer). LLMs are trained on massive datasets, which allows them to learn statistical relationships between words and ideas. LLMs are a substantial advancement over previous AI models. They can learn from unlabeled (i.e., not pre-scored by humans) data, which allows them to train on massive datasets without manual labeling. They can also adapt to new tasks without explicit training, which also allows them to generate new content (Yan et al., 2023).
Although the field is in relative infancy, significant strides have been made in employing AI in reading comprehension research. The review highlights three areas where AI has been used to examine ways of studying and supporting the modern reader:
Implementing constructive learning activities at scale Providing tailored, personalized instruction and feedback, and Facilitating data collection and analysis that expands understanding of comprehension
Constructive Learning Activities
Much of what is known about support for reading comprehension comes from studies of skilled and less skilled readers. Less skilled readers tend to maintain a relatively narrow focus, largely restating information that is in the text. On the other hand, skilled readers engage in more inferencing as they read. Not only do they bridge ideas from sentence to sentence, but they also connect what they are currently reading to the larger global purpose of the text, and they are more likely to use their prior knowledge to elaborate upon what is in the text (e.g., Wolfe & Goldman, 2005). Critically, experimental studies show that encouraging less skilled readers to make inferences and to draw upon their prior knowledge improves their comprehension (McNamara, 2007). These findings further emphasize the role of knowledge building and are consistent with a broader base of research in the science of learning (see How People Learn II, NASEM, 2018), demonstrating that activities in support of active and constructive 1 processing are better for learning (Chi & Wylie, 2014; Dunlosky et al., 2013; Fiorella & Mayer, 2016).
Research has demonstrated the utility of open-ended construction of written content, which typically would require an instructor to interpret the writing and provide feedback. AI technologies allow these learning tasks to be completed more efficiently so that more students have more opportunities to engage in meaningful knowledge building.
Personalized Instruction
A learner approaches a given reading task with a unique blend of strengths and limitations and, with respect to prior knowledge, different experiences. Each of these influences how and what the person learns. Given this variability, attempting to reach the “average learner” often fails to meet any one individual (Connor & Morrison, 2016). Thus for individualization and personalization of instruction, these approaches go beyond asking “what works?” to asking, “what works, for whom, and under what conditions?” Personalized instruction emphasizes not only abilities, but also the need to consider students’ interests and the relevance of the content to their experiences (Walkington & Bernacki, 2018). These efforts also highlight a movement away from deficit-based remediation, toward asset-based instruction. Not only are asset-based approaches more equitable, but they also align with best practices in knowledge building as drawing from their own knowledge and lived experiences allows students to engage in the constructive processes that support learning (McCarthy & McNamara, 2022). Work in AI has been developing methods to better evaluate differences in learners and their learning process and to deliver a variety of supports (e.g., feedback) to address those differences.
Stealth assessment is a promising new approach to evaluating student learning that uses AI to analyze learners’ behavioral data (Shute & Rahimi, 2022). Stealth assessment offers a rich and nuanced understanding of students’ existing skills and knowledge, as well as their learning processes, without subjecting them to numerous standardized tests. Students are not aware that they are being assessed, which is less stressful and allows them to engage in an authentic learning task. Developing research demonstrates use of stealth assessment to evaluate students’ proficiencies, such as their general vocabulary knowledge or their performance on a general reading comprehension test (McNamara et al., 2023).
Critically, constructive learning activities not only beneficial learning, but they can be mined for insights into the learning process. For example, readers’ self-explanations can identify learners’ attentional focus, constructed inferences, and knowledge sources. As mentioned, comprehension strategies, such as creating bridging and elaborative inferences, indicate good mental model building. NLP algorithms can reliably detect these comprehension strategies (e.g., Magliano et al., 2016), and recent additions of LLMs increase the accuracy and generalizability of these detectors (Nicula et al., 2023). Leveraging technology also allows including student affect and motivation in the learner model. For example, researchers have used behavioral data (keystrokes, eye-tracking); NLP-based sentiment analysis of students’ responses can detect confusion and boredom (e.g., Allen et al., 2016). Modeling and detecting students’ learning helps provide increasingly tailored feedback.
Effective adaptivity requires understanding features of the task. NLP can evaluate texts in a variety of ways, such as identifying key content or evaluating text complexity. Indeed, AI has greatly improved the ability to identify what features of text are involved in comprehension. Although traditional readability metrics are commonly used in classrooms, they are not particularly good at predicting reading comprehension (Begeny & Greene, 2014; Shanahan, 2020). Early readability metrics like Flesch-Kincaid and Dale-Chall were designed to approximate reading fluency rather than comprehension and were constrained to low-level features of text (e.g., number of syllables, sentence length) that were relatively easy to compute. Although such features affect reading, these formulas do not take into account discourse-level features such as the connectedness between words and ideas (e.g., cohesion) that support mental model building. The advent of sophisticated NLP tools that can capture linguistic features such as semantic overlap has allowed for the creation of new readability metrics that can provide a more accurate and contextualized evaluation of text complexity that afford more effective adaptivity (Choi & Crossley, 2022; Crossley et al., 2017, 2023).
Insights Across Contexts and Learners
In addition to implementation, AI is also being used for discovery.
The utility of big data is perhaps most obvious in the case of individual differences. Small sample sizes mean that researchers are limited to examining a few individual differences at a time, which ignores critical interactions and intersectional identities (McNamara et al., 2022). Larger data sets treat an individual's skills and experiences as valuable parts of their learning, rather than noise, which affords the development of better algorithms that are less prone to biases and support more truly personalized learning.
AI can also help investigate more contexts. Research and development in reading comprehension and educational technologies have largely focused on K-12 and higher education learners. There is growing demand for instruction and tools to address adult education and re/upskilling (World Economic Forum, 2023). While there will likely be some generalizable factors and processes, it is critical to explore the unique strengths and needs of adult learners (McCarthy & McNamara, 2021). More research is needed to explore whether these approaches generalize across populations and domains and what modifications can be made to better align AI-supported research and implementation into different contexts.
Advances in LLMs have allowed for more efficient analysis of big data. Previously, if a researcher wanted to build an algorithm to predict a rubric score or detect a behavior, they would need to provide the computer with a large set of pre-labeled (i.e., human scored) data. By contrast, LLMs can teach themselves with relatively limited human involvement, making it feasible to analyze data of this scale.
Implications for Policy
The ongoing work described above highlights several policy implications. Policymakers, developers, and educators should recognize that AI is an emerging area, but there are ample research-backed approaches to reading comprehension that are showing promise. Educators and developers can learn more about how engaging readers in constructive activities that are personalized to their skills, knowledge, and interests can support learning and how AI can support these efforts. Researchers can leverage big data and AI approaches to analysis to better identify what personalizable factors are most important to evaluate and respond to. Policymakers should further incentivize co-design and collaboration across researchers, developers, and educators so that AI-based reading comprehension supports are designed in ways that meet the needs of the modern reader in authentic learning contexts.
All stakeholders should recognize that AI can support equity through greater access to personalized instruction, but that AI can also increase inequities (e.g., perpetuating biases). Thus, researchers, developers, educators, and policymakers all must continually critically evaluate AI-based tools to ensure help rather than harm.
Conclusions
Rapid advances in technology and AI are changing the ways people interact with information, but they also afford opportunities to address the challenges they present. Research shows that readers are knowledge builders who need opportunities to engage in constructive activities that are personalized to their skills, knowledge, and interests. AI offers opportunities to deliver these types of activities and to collect and analyze learner data that can enhance the efficacy and equity of such activities. Understanding and supporting the needs of the modern reader requires consideration of the modern context. AI is part of that context. Although generative AI technologies are only a few years old, they are already revolutionizing the way that people interact with information, and new conceptions of the modern reader will likely follow. This rapid shift highlights that the speed of this important work will require more collaborative and iterative research and development.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
