Abstract
Background: While oral reading fluency (ORF) is linked to reading comprehension (RC) in African languages, the role of vocabulary remains underexplored.
Objectives: This study investigated the relative contributions of ORF and vocabulary to RC in the first and second languages of three bilingual groups: Akan-English in Ghana, isiZulu-English in South Africa, and Shona-English in Zimbabwe and examined the ORF rate needed for 80% comprehension in each language.
Method: A quantitative correlational design was used. Learners in Ghana (n = 158), South Africa (n =104), and Zimbabwe (n = 122) were assessed through convenience sampling. Constructs measured included: ORF, RC and receptive vocabulary tests.
Results: Oral reading fluency was a strong predictor of RC in first and second languages. Learners needed to read between 89 and 109 words correct per minute (WCPM) in English to achieve 80% comprehension. For African languages ORF thresholds ranged from 47 WCPM (isiZulu) to 55 WCPM (Akan).
Conclusion: Oral reading fluency emerged as a critical prerequisite for RC across bilingual populations. Without adequate ORF skills, vocabulary knowledge cannot support comprehension.
Contribution: This study provides novel data on reading fluency and comprehension in under-researched African languages, particularly Akan and Shona. It proposes preliminary ORF benchmarks for Ghana and Zimbabwe and validates existing benchmarks for South Africa.
Keywords: oral reading fluency; vocabulary; RC; ORF benchmarks; bilingualism; Akan; isiZulu; Shona.
Introduction
Worldwide, the most used parameters to measure basic education outcomes are literacy and numeracy. The importance of these fundamental skills is globally acknowledged, as illustrated by their inclusion in the United Nations Sustainable Development Goals, target 4.6: ‘By 2030, ensure that all youth and a substantial proportion of adults, both men and women, achieve literacy and numeracy’ (UN 2015). Initiatives to improve literacy outcomes, particularly in primary education, have increased rapidly across the African continent since global programmes, such as Education for All (UNESCO 2005, 2014), the Sustainable Development Goals (UN 2018) and the Education 2030 agenda (UNESCO 2015) materialised. A popular measure in these programmes is Oral Reading Fluency (ORF), typically measured by having learners read grade-level text aloud, with the number of words read correctly per minute used as the fluency score. The words correct per minute (WCPM) that an individual reads is calculated and represents the ORF score. Text reading fluency, which can be defined as ‘the ability to read a text quickly, accurately and with proper expression’ (National Institute of Child Health and Human Development 2000:3–5), has been established as a strong predictor of RC in Indo-European languages (Bigozzi et al. 2017; Fuchs et al. 2001; Hudson, Lane & Pullen 2005; Lervåg & Melby-Lervåg 2022).
Low RC levels continue to be a problem in many sub-Saharan African countries (Piper & Dubeck 2024). Given the established link between ORF and RC, researchers have explored this relationship to some extent in African languages. Existing studies (Ardington et al. 2020; Mutema & Pretorius 2024; Pretorius & Spaull 2016; Spaull, Pretorius & Mohohlwane 2020; Wills et al. 2022) support the idea that fluency is important for RC in African languages, but research on ORF benchmarks (i.e. standardised WCPM scores that suggest the fluency rate at which a child should be reading in order to understand a text) for African languages remains scarce and inconclusive. Furthermore, while traditional English ORF benchmarks may assist teachers in identifying learners who struggle with English decoding and may help policymakers to determine when interventions are needed, their use in non-Western contexts poses some concerns. Although English is often used as a medium of instruction (MOI) in the intermediate and secondary education phases in African countries, it remains a second language (L2) for the majority of learners. Most English ORF benchmarks were established for first language (L1) readers in Western contexts. Since English L2 learners follow a different development pattern in language acquisition than L1 speakers (Cremer & Schoonen 2013; Goriot et al. 2021; Proctor et al. 2005), it is important to develop ORF benchmarks for African English L2 contexts, based on evidence from the same context in which they are used. This study contributes to the current debate by investigating to what extent L1 and L2 ORF and vocabulary knowledge predict L1 and L2 RC in three groups of bilingual children (Akan-English in Ghana, isiZulu-English in South Africa and Shona-English in Zimbabwe). Furthermore, the study estimates the fluency thresholds in L1 and L2 required to support comprehension in these children. The next sections outline relevant ORF benchmark research, the theoretical framework, and the linguistic contexts of the study.
Oral reading fluency benchmarks
In Indo-Euro languages, higher WCPM scores are generally associated with more fluent reading and better text comprehension (Daane et al. 2005; Jenkins et al. 2003; Klauda & Guthrie 2008). This correlation is not universal though, as higher ORF scores do not automatically ensure good comprehension among English learners in the United States (Schwanenflugel & Kuhn 2016), and lower ORF scores do not prohibit RC. Graham and Van Ginkel (2014), for instance, showed that readers could achieve 80% comprehension in the Kenyan languages Sabaot and Pokomo, when reading at 30 WCPM and at 24 WCPM. Dowd and Bartlett (2019) compared WCPM data from Grade 2 children in 11 developing countries and found that children reading ‘slower’ than 45–60 WCPM were able to achieve high comprehension levels. The mean WCMP in the 11 countries was 51 WCPM, but in terms of explaining comprehension across languages, this metric was not helpful: the fastest readers in Zimbabwe (who read with comprehension) read at this rate, but the average reader in Vietnam (who reads with comprehension) read at 95 WCPM. Dowd and Bartlett (2019) argued that WCPM:
[C]annot be delimited with the precision required of a global metric; language-specific ranges are necessary, … and targets may need to be differentiated for pupils learning to read in a second or third language. (p. 203)
Reading rates will vary cross-linguistically because of differences in morphological and orthographical structures. In an eye tracking study, Land (2016) found that L1 isiZulu readers fixated significantly longer on characters in isiZulu (an agglutinative language) than in English (a non-agglutinative language) and that refixations increased when readers had to process long words in isiZulu. Land’s data demonstrated that word recognition processes in isiZulu and English are different – less than 1% of isiZulu words were skipped, suggesting that fewer isiZulu words in the parafoveal view are predictable sight words.
Regarding fluency rates in English L2 in non-Western contexts, research shows that fast reading does not guarantee comprehension. Piper, Schroeder and Trudell (2016) found that Kenyan learners read significantly faster in English than in their L1, but often do not understand what they read. Comprehension may be hindered by poor vocabulary knowledge, poor linguistic knowledge, lack of background knowledge and poor inferencing skills. Factors such as text complexity and readability also need to be considered (Zainurrahman & Sukyadi 2024). Benchmarks focusing on speed rather than comprehension may thus disadvantage learners who read more slowly in order to construct meaning within the text (Goodman 2006).
In response to the need to establish ORF benchmarks for different African languages and for English L2 in African countries, the Department of Basic Education in South Africa led an extensive project between 2020 and 2023 to develop ORF benchmarks for the African languages used in education in South Africa. The project’s main goal was to create contextually relevant standards that would help the education system to monitor and improve reading skills in the early grades. Based on this project, ORF benchmarks for Grade 3 learners in South Africa have been set at 35 WCPM for three Nguni languages (isiZulu, isiXhosa and siSwati) (Ardington et al. 2020) and at 60 WCPM for Sotho languages (Sesotho and Setswana) (Mohohlwane et al. 2022). For English L1 and English L2 in the South African context, Wills et al. (2022) suggest a threshold of 50 WCPM, stating that few learners will manage to read for meaning in English if they read below this rate. This threshold applies to L1 English learners in Grade 2 (who should reach a benchmark of 90 WCPM in Grade 3 to read at 85% comprehension levels) and to L2 English learners in Grade 3 (who should reach a benchmark of 90 WCPM in Grade 5 to read at 85% comprehension levels).
Despite recent progress in some countries, many indigenous African languages still do not have reliable benchmarks or use benchmarks that were set without systematic assessment or considering the morphological complexity and word length of the language. For example, the 2013 EGRA (Early Grade Reading Assessment) in Ghana suggested a single benchmark of 40–60 WCPM based on working memory capacity research for 11 different Ghanaian languages (Ministry of Education 2014). Based on this benchmark, the study concluded that average fluency rates across all languages were insufficient for comprehension. No recent research on L1 ORF benchmarks in Akan or Shona, or on ORF benchmarks for L2 English in Ghana and Zimbabwe was identified by the authors, highlighting the need for the present study.
Theoretical framework
Although RC is a complex construct, remarkably strong evidence continues to support the notion that comprehension depends on decoding and linguistic comprehension, as described in the ‘Simple View of Reading’ (SVR) (Lervåg & Melby-Lervåg 2022). The SVR model (Hoover & Gough 1990) states that RC (RC) is the product of word-recognition abilities, also known as decoding (D) and linguistic comprehension (LC). Decoding is the process of translating written language into spoken language (Sadoski & Paivio 2007), while LC refers broadly to the ability to understand a language.
Critics of the SVR (e.g., Kirby & Savage 2008) pointed out that the model did not specify how decoding should be measured, even though it is generally interpreted as a measure of reading accuracy. Linguistic comprehension is a similarly broad concept, with Kirby and Savage (2008:76) arguing that it ‘represents all of verbal ability, including vocabulary, syntax, inferencing and the construction of mental schemas’. They further argued that reading fluency (i.e. automatic decoding) is equally important for comprehension and called on researchers to consider the role of fluency within the SVR model. Theories of automaticity in reading presume that more cognitive space becomes available for comprehension when underlying processes such as decoding take place automatically. Automatic decoding and lexical access connect lower-level processes such as phonological and orthographical processing with higher-level processes such as inferencing and background knowledge integration – essentially, fluent readers recognise words automatically, which allows them to reserve cognitive resources for meaning construction (Lervåg & Melby-Lervåg 2022). Many studies in the early 2000s (Daane et al. 2005; Jenkins et al. 2003; Klauda & Guthrie 2008) demonstrated the importance of rapid and automatic processing of words in a text and associated this skill with comprehension. Consequently, automatic word recognition became associated with fluent reading, which became a prerequisite for RC (Daane et al. 2005; Jenkins et al. 2003; Klauda & Guthrie 2008). The importance of fluency for long-term reading success has also been demonstrated by Wang et al. (2019), who followed 30 000 learners from Grade 5 to Grade 10. The decoding-RC relationship was only maintained for learners who achieved decoding at a certain threshold level in Grade 5. For those reading below this threshold, RC remained stagnant. Scholars now agree that fluency is as important for comprehension as word recognition and linguistic comprehension, and stress the importance of developing fluency during the early stages of reading development.
As previously discussed, LC, or the ability to understand a language, is an important component of the SVR. Vocabulary knowledge (one aspect of LC) is necessary for RC, but not sufficient. When we read to learn, information that needs to be processed tends to be new. Given this, it is essential that readers can interact with texts in ways that allow them to adapt their existing knowledge, based on what they have read. Reading fluency, vocabulary knowledge and comprehension are thus interconnected and reciprocal skills: as learners become more fluent readers, they are exposed to more words, which helps expand their vocabulary, and which supports comprehension. Conversely, a strong vocabulary base supports better fluency, as learners can recognise and understand words more quickly. The implications of the SVR model are different for L1 and L2 readers (Grabe 2009). First language readers typically begin with strong LC and must develop D, while L2 readers may struggle with both, resulting in more complex relationships between decoding, comprehension, and fluency. Achieving automatic decoding depends on many variables, including L2 LC, quality of instruction and access to materials. In some cases, learners may reach high ORF rates, without achieving RC, because of a lack of L2 LC. In other cases, learners may reach acceptable L2 LC levels, but fail to read with comprehension because of a lack of automatic D. Even so, there is evidence that improvements in ORF lead to increased RC in bi-/multilingual children in the early grades (Baker et al. 2012; Kim, Wagner & Foster 2010; Wawire, Liang & Piper 2023). In their longitudinal study, Wawire et al. (2023) found that text reading fluency mediates the relationship between D and RC in both English and Kiswahili. Wawire et al. (2023) stressed the need for instructional practices that enhance text reading fluency in multilingual contexts, given its critical role in RC in multilingual learners.
Linguistic and literacy contexts in the current study
African countries are characterised by multilingualism (Eberhard et al. 2024). To manage linguistic diversity in public institutions, many African countries, including the ones in this study, have language education policies that limit the use of multiple languages in government, business and education. Many countries have adopted former colonial languages (English, French or Portuguese) as official languages (UNICEF 2016), even where indigenous languages enjoy official language status. All three countries in this study use an ‘early exit model’ in education, where mother-tongue education is supported from Grades 1–3,1 followed by English instruction from Grade 4 onwards. This policy is implemented in South Africa, where 12 languages (Afrikaans, English, isiNdebele, isiXhosa, isiZulu, Sepedi, Sesotho, Setswana, siSwati, Tshivenda, Xitsonga, and South African Sign Language) are officially recognised and used as MOI in the Foundation Phase. In Zimbabwe and Ghana, this policy is not implemented consistently, and many children receive most of their literacy instruction in English, with the L1 being used as MOI to facilitate instruction (Chivhanga & Chimhenga 2013). Currently, the education systems of Ghana, South Africa and Zimbabwe continue to fail a large proportion of young bilingual learners. The 2013 EGRA study in Ghana reported that most learners in public schools could not read with comprehension by the end of Grade 2, with many children obtaining zero scores in oral passage reading (Ministry of Education 2014). In South Africa, the 2021 Progress in International Reading Literacy Study data show that 81% of Grade 4 learners cannot read for meaning (Van Staden, Roux & Tshele 2023). In Zimbabwe, the 2021 Southern and Eastern Africa Consortium for Monitoring Educational Quality report indicated that only 34% of Grade 6 learners achieved the minimum level of reading proficiency, and overall, 66% of learners did not reach the reading competency level expected for their grade (Awich 2021). Despite evidence that mother-tongue education benefits literacy development in both the L1 and the L2, and that it leads to better academic outcomes (Evans & Acosta 2020; Nag et al. 2019), early exit models persist in the current research settings and contribute to poor literacy outcomes.
The development of early reading skills is influenced by the linguistic and orthographic characteristics of a language (Seymour, Aro & Erskine 2003) and, although this is known, reading in African languages continues to be taught using methods developed for English. Akan, isiZulu and Shona all have transparent orthographies, which theoretically means they should be easy to decode (Pretorius & Mampuru 2007). However, all three languages add unique characteristics to letters of the Roman alphabet, including diacritics and tone markings (to preserve phonetic and tonal nuances needed for meaning), and special characters that compensate for sounds that are not in the standard Latin orthography (e.g. Ɛ, ɛ:, Ɔ and ɔ: in Akan). In addition, isiZulu and Shona are agglutinative languages: prefixes and suffixes are added to root words to indicate features such as noun class, tense, aspect, negation, mood and location. This means that isiZulu and Shona words are often long and complex and are visually similar, which makes decoding difficult (Ardington et al. 2020; Land 2016). The isiZulu example Abafana abawufundi (‘The boys are not reading it’) illustrates this complexity (researchers’ own example). In abawufundi (‘are not reading it’), a- is a negative prefix, marking the verb as negative present tense; -ba- is a subject concord prefix agreeing with the subject abafana ‘boys’ (noun class 2), -wu- acts as a concord prefix that substitutes the object (indicating ‘it’), -fund- (‘read’) is the root verb and -i is a negative final vowel indicating negative present tense. African languages also differ in that they are written disjunctively (morphemes represent separate words, e.g., ke a mo rata [Northern Sotho ‘I love him/her’]) or conjunctively (multiple morphemes are combined into a single word, e.g., ngiyamthanda [isiZulu ‘I love him/her’]). Finally, isiZulu and Shona have complex consonant systems and a large code set, with isiZulu, for instance, using 60 phonemes compared to English’s 44 (Land 2015a). Akan is not formally classified as an agglutinative language (Kambon 2012) but does exhibit some characteristics of agglutination. Akan has a consonant system that is not more complex than the English system, but that is distinct. For instance, the glottal stop in Akan is represented by an apostrophe (‘). Combined, these linguistic characteristics can slow down decoding, especially if reading is taught using methods developed for English. Against this background, this study was guided by the following research questions.
Research questions
- To what extent do ORF and vocabulary predict RC in the Intermediate Phase among bilingual Akan-English, isiZulu-English and Shona-English learners in both their L1 and L2?
- What average ORF rate is needed to achieve 80% comprehension in the L1 and the L2 of Akan-English, isiZulu-English and Shona-English bilingual learners?
Research methods and design
Research setting and sample
Because of constraints on time and money, convenience sampling was used to test bilingual Akan-English learners (Ghana), isiZulu-English learners (South Africa), and Shona-English learners (Zimbabwe) in Grade 5 and Grade 6. The total sample included 158 Ghanaian learners, 104 South African learners, and 122 Zimbabwean learners (see mean ages in Table 2). Three public schools in a rural town in Ghana’s Central Region participated in the study (…) in the study, where most inhabitants for in the fishing industry (…) most inhabitants work in the fishing industry, with 29.8% living below the national poverty line (Ghana Statistical Service 2015). In South Africa, two urban isiZulu schools in the City of Tshwane participated. The schools served mainly low-income communities. In Zimbabwe, two urban schools (also serving low-socio-economic communities) in the cities of Harare and Gweru participated. To be considered for inclusion, schools had to use Akan, isiZulu or Shona as the MOI, had to be no further than 20 km from the local research institutions, and had to serve low-income communities. Schools that met the MOI and low-income criteria, but that were further than 20 km from the researchers’ institutions, were not considered for inclusion.
English is the L2 of all the children in the sample, but the Zimbabwean children used English as MOI from Grade 1, whereas the South African children used English as MOI from Grade 4. In Ghana, English is the MOI from Grade 4, but L1 instruction in Ghanaian languages in the first three years of education depends on variables such as the availability of materials and teacher training (Appiah 2024). All learners participated in a series of L1 and L2 language tests, which formed part of a larger literacy project. Learners were tested outside their classroom during school hours. All research assistants were native speakers of the L1 being tested, and fluent in English.
Instruments
To measure ORF, two traditional short stories were selected per grade level, one in English and one in the L1 (Akan, Shona or isiZulu). The selected English ORF tests were validated for English L2 research in the African context for similar grades in previous studies, such as the 2013 NEEDU study (Draper & Spaull 2015). Since no standardised ORF and comprehension tests for Akan, isiZulu and Shona were available, the researchers used L1 passages from the learners’ mother-tongue course books, in line with similar studies (Graham & Van Ginkel 2014). The selection was done by local teachers and African language experts; consequently, all learners read passages that were thought suitable for their grade level. Learners were asked to read aloud the ORF story in their L1 for 1 min, during which errors were marked by the research assistant. Once the learner had finished, the WCPM score was calculated. Each passage was accompanied by five comprehension questions, making the minimum score 0 and the maximum score 5 per comprehension test. Learners who were unable to read the entire text in 1 minute were given an additional 2 minutes to finish reading the story (…) and then had to answer the five comprehension questions orally.
The English L2 ORF test was conducted in the same way. The Grade 5 English text had a Flesch Reading Ease (RE) score of 84.7 (easy), a Flesch-Kincaid Grade Level score of 3.47 (very easy), and an Automated Readability Index (ARI) score of 4.06 (easy), indicating that the text is suitable for learners aged 9–10 (Readability Formulas n.d.). The Grade 6 English text had a Flesch RE score of 83 (easy), a Flesch-Kincaid Grade Level score of 4.66 (fairly easy), and an ARI score of 4.88 (fairly easy), making this text suitable for learners aged 10–11 (Readability Formulas n.d.). An overview of the ORF texts is given in Table 1.
| TABLE 1: Description of oral reading fluency and comprehension texts per Grade, including Word count, minimum score, maximum score, reliability (Cronbach’s alpha) and question type. |
Comparability of language tests
When including several languages is the same study design, it is challenging to provide assessments of comparable levels of difficulty. Several steps were taken in the study to align the length and difficulty of the ORF tests as much as possible. Firstly, although the length of the ORF tests differed per language, the design was such that word count per test increased with grade level (see Table 1). Secondly, floor and ceiling effects were monitored for WCPM scores. None of the participants obtained the maximum WCPM score in their L1 or L2, showing that the texts had an appropriate length. Regarding the minimum WCPM scores, Table 1 shows that 24 of the 81 Grade 5 learners in Ghana (29.6%) obtained zero scores for Akan, which might be an indication that the text was somewhat difficult. Zero WCPM scores for Grade 6 Akan learners were also on the higher side (18.2%). In spite of these zero scores, the reliability of both tests was good. The high number of zero scores is in line with findings from other studies in Ghana (Ministry of Education 2014). Regarding the two other countries, Table 1 shows that zero scores were much lower in South Africa and non-existent in Zimbabwe. A third step taken to balance levels of difficulty was to include comparable RC questions. The majority of the questions for each test were easy questions for which learners had to retrieve straightforward information and key details. The other questions were either vocabulary questions, that is, working out the meaning of unknown words using contextual clues, or inferencing questions. As shown in Table 1, the Shona Grade 6 test for RC had the lowest number of retrieval questions.
Test reliability
Cronbach’s alpha reliability tests were performed to assess the reliability of the L1 and L2 RC tests. Results are presented in Table 1. The reliability of the tests ranged between 0.62 and 0.86 for the Ghanaian and South African samples. For the Grade 5 Zimbabwe sample, the tests were highly unreliable for both Shona (α = 0.388; n = 60) and L2 English (α = 0.479; n = 60). Further analysis of the Grade 5 items revealed that item-correlations ranged between –0.031 and 0.365, indicating a strong inconsistency within the scale. It was also investigated whether removing items with a negative inter-item correlation from the scale would improve the scale reliability, but this was not the case. Therefore, it was concluded that the Grade 5 Zimbabwe test data were not suitable for further use in the study. Rather than removing participant scores to manipulate results, all the Grade 5 data from the Zimbabwe were removed, leaving a sample of 62 Grade 6 learners in the Zimbabwe sample. The reliability of the Grade 6 Shona L1 RC test was also deemed low, hence items 1 and 2 were removed from the test, resulting in an alpha score of 0.612 (n = 62).
Receptive vocabulary in English was tested with the Peabody Picture Vocabulary Test (PPVT) Form B (Dunn & Dunn 2007). Receptive vocabulary in Akan, isiZulu and Shona was measured using a translation of the PPVT Form A. The isiZulu translation of Form A (Schaefer 2023) was used as basis for the translations into Akan and Shona, which were done by qualified language practitioners. Adaptations to the original English items (sets 1–11 of Form A), necessitated by linguistic or contextual limitations, are explained in Schaefer (2023). As the participants in the present samples were older than those in Schaefer’s study, a certified translator translated three additional sets of Form A (sets 12–14). The additional translated items were piloted with 28 learners in Ghana, 15 in South Africa, and 20 in Zimbabwe. Following the reliability analysis of the pilot data, the researchers deemed the Akan and isiZulu tests suitable for further use. The Shona version was slightly adjusted after the pilot, because of ambiguous meanings of translated words. For example, the word ‘dancing’ (Form A, set 3, item 25) was translated as kutamba; however, this means both playing and dancing, making the selection of one picture complicated.
Directions in the PPVT manual were followed for both forms: each learner was shown a page with four pictures and was asked to point to the picture that represented the word spoken by the administrator of the test.
Data analysis
Scores were captured manually and data were entered into IBM SPSS Statistics (Version 30). Raw scores were calculated for ORF, RC and vocabulary. Descriptive statistics were conducted to explore the data. Reading comprehension scores were transformed to percentages to allow a comparison between the three groups. Pearson correlations were performed to assess the relationship between all the L1 and L2 variables.
Six hierarchical linear regression analyses were performed (three L1 models and three L2 models). Grade 5 and Grade 6 data were collapsed for Ghana and South Africa. For each language group we evaluated the roles of gender, age, ORF and vocabulary (predictor variables) on both L1 and L2 RC (outcome variables). Gender was entered first (model 1), followed by age (model 2), ORF (model 3), and vocabulary (model 4). Assumptions of normality (Shapiro-Wilk test for normality), multicollinearity (variance inflation factors [VIF]) and homoscedasticity (Breusch-Pagan test for heteroscedasticity) were checked throughout. The outcome of the assumption checking is provided for model 4 of each regression in the next section.
For L1 Akan RC, multicollinearity was low (VIF Gender = 1.09, VIF Age = 1.01, VIF L1 ORF = 1.14, VIF L1 Vocabulary = 1.13). The residuals were not normal (W = 0.97, p = 0.001), but homoscedasticity could be assumed (η = 3.52, p = 0.48). For L1 isiZulu RC, multicollinearity was also low (VIF Gender = 1.08, VIF Age = 1.05, VIF L1 ORF = 1.11, VIF L1 Vocabulary = 1.11). The residuals were not normal (W = 0.96, p = 0.008), but homoscedasticity could be assumed (η = 2.74, p = 0.602). For Shona L1 RC, multicollinearity (VIF Gender = 1.14, VIF Age = 1.03, VIF L1 ORF = 1.05, VIF L1 Vocabulary = 1.15) was low, and while the residuals were not normal (W = 0.92, p < 0.001), there was no statistical evidence of heteroscedasticity (η = 2.19, p = 0.7).
For L2 RC (Ghana), multicollinearity was low (VIF Gender = 1.19, VIF Age = 1.11, VIF ORF = 1.47, VIF Vocabulary = 1.48) and residuals were normal (W = 0.98, p = 0.208). The assumption of homoscedasticity was met (η = 1.47, p = 0.83). For L2 RC (South Africa), multicollinearity was low (VIF Gender = 1.13, VIF Age = 1.01, VIF ORF = 1.28, VIF Vocabulary = 1.19), and residuals also appeared normal (Shapiro-Wilk W = 0.99, p = 0.775). The assumption of homoscedasticity was met (η = 8.63, p = 0.07). For L2 RC (Zimbabwe), multicollinearity was acceptable (VIF Gender = 1.03, VIF Age = 1.11, VIF ORF = 1.73, VIF Vocabulary = 1.81), while the residuals deviated slightly from normality (W = 0.95, p = 0.03). Despite this deviation, the assumption of homoscedasticity was met (η = 7.34, p = 0.12).
Although the assumption of normality of residuals was not met for the L1 models, most other assumptions were met and the researchers thus proceeded with the regression analyses, understanding that the L1 results would have to interpreted cautiously.
Ethical considerations
Ethical clearance for this study was obtained from Radboud University (2021-9071 Ref. 21U.003003), from Ghana Education Service in Central Region, from the Department of Basic Education in South Africa (8/4/4/1/2) and from the Ministry of Primary and Secondary Education in Zimbabwe (C/426/3). Additional permission for school visits was obtained from the school principals. Caregivers received written information to explain the study, including an informed consent form. Learners only participated if they had signed consent from their caregiver. Learners were asked for verbal assent to participate in the assessments. No learner was coerced to participate. To ensure anonymity and confidentiality, all participants were coded using a unique identifier in the data file.
Results
Descriptive statistics
The means, standard deviations, minimum and maximum scores for reading fluency, RC and vocabulary in both the L1 and L2 of the learners are presented in Table 2.
| TABLE 2: Descriptive statistics for reading fluency, RC and vocabulary knowledge in L1 Akan (Ghana), L1 isiZulu (South Africa), and L1 Shona (Zimbabwe), and L2 English (all three countries). |
To allow comparisons with previous research, the researchers also present the cumulative distribution function (CDF) of ORF in both the L1 and L2 in the three groups (Figure 1 and Figure 2). The CDF represents the probability that a learner’s ORF score is less than or equal to a particular value. Essentially, it shows the proportion of learners who read at or below a certain number of WCPM. Since the CDFs do not include the RC data, the original sample for Zimbabwe (n = 122) was included.
 |
FIGURE 1: Cumulative distribution function of oral reading fluency in Akan, Shona and isiZulu. |
|
 |
FIGURE 2: Cumulative distribution function of English L2 oral reding fluency. |
|
In Figure 1, the CDF indicates that the median WCPM is similar for the L1s (45 WCPM in Akan, 39 WCPM in Shona, and 49 WCPM in isiZulu). This means that 50% of the learners were reading at or below these ORF levels in their L1. At the 75th percentile, a different pattern was observed: 75% of Ghanaian learners read at or below 71 WCPM, while South African learners read at 52 WCPM and Zimbabwean learners read at 64 WCPM at this percentile. Figure 1 also shows zero scores for Akan and isiZulu learners, whereas in Zimbabwe no zero scores were recorded.
In Figure 2, the CDF indicates that the median WCPM for English L2 is similar for Ghana and South Africa (69 and 67 WCPM). Zimbabwean learners had a higher median (112 WCPM). This means that 50% of the learners were reading at or below these ORF levels in English. At the 75th percentile, 75% of Ghanaian learners read at or below 90 WCPM, while South African learners read at 95 WCPM and Zimbabwean learners read closer to 129 WCPM in English.
Words correct per minute and reading comprehension
In Akan and Shona, a higher WCPM was associated with higher RC, with readers achieving 100% comprehension at an average rate of 75 WCPM, and readers achieving 80% and 60% comprehension at WCPM rates of 55 and 45 (Figure 3). In Shona, children achieving 100% comprehension read at 56 WCPM, while children achieving 80% and 60% comprehension read at 48 WCPM on average. The pattern was less clear in isiZulu, where learners achieved 100% comprehension at an average of 44 WCPM, and 80% and 60% comprehension at an average of 46 WCPM.
 |
FIGURE 3: Mean words correct per minute and comprehension scores in L1 for (a) Ghana (Akan); (b) South Africa (isiZulu); and (c) Zimbabwe (Shona). |
|
For English L2, a higher WCPM was associated with higher RC in all three groups (Figure 4). Readers at the 100% comprehension level read at 90 WCPM (Ghana), 100 WCPM (South Africa), and 123 WCPM (Zimbabwe). Learners who achieved 80% comprehension read at 89 WCPM (Ghana), 94 WCPM (South Africa), and 108 WCPM (Zimbabwe). At lower comprehension levels, the pattern was less straightforward. In Ghana and South Africa, there was little variation between learners who understood between 20% and 40% of the texts (these children read between 67 WCPM and 74 WCPM). In Zimbabwe, children reading at 90 WCPM, on average, only scored 40% on comprehension, while those who read below 40 WCPM scored between 0 and 20% in comprehension.
 |
FIGURE 4: Mean words correct per minute and comprehension scores in L2 (English) for (a) Ghana; (b) South Africa; and (c) Zimbabwe. |
|
Correlation and regression analyses
Correlational statistics2 showed that L1 ORF and L2 ORF correlations ranged from moderate (Shona group, r = 0.45) to moderately strong (isiZulu group, r = 0.66) and strong (Akan, r = 0.75). L1 ORF strongly correlated with L1 RC in the Akan (r = 0.84) and isiZulu (r = 0.70) groups, and moderately in the Shona group (r = 0.42). In all three groups, L2 ORF and L2 vocabulary correlated moderately with L2 RC, with the exception of L2 vocabulary which correlated moderately strongly with L2 RC in the Shona group (r = 0.65).
Results of the hierarchical linear regression analyses are reported in Table 3. Given the number of regression models conducted, the researchers were mindful of potential Type I error inflation. The results were interpreted cautiously, emphasising consistent patterns across models and reporting effect sizes alongside p-values.
| TABLE 3: Hierarchical regression L1 reading comprehension. |
The first and second models (with gender and age as predictors) were significant for Akan (F [1, 156] = 5.60, p = 0.019, R2 = 0.04 and F [2, 155] = 3.38, p = 0.037, R2 = 0.04), but not for isiZulu or Shona. Adding ORF as predictor resulted in significant models for all three groups: Akan (F [3, 154] = 134.962, p < 0.000, R2 = 0.72); isiZulu (F [3, 100] = 32.125, p < 0.000, R2 = 0.49); and Shona (F [3, 58] = 4.479, p < 0.000, R2 = 0.19). Finally, adding L1 vocabulary in model 4 as predictor variable resulted in significant models in all three groups: Akan F [4, 153] = 100.807, p < 0.000, R2 = 0.73); isiZulu (F [4, 99] = 24.389, p < 0.000, R2 = 0.50); and Shona (F [4, 57] = 4.335, p = 0.004, R2 = 0.23).
ΔR2 from model 2 to model 3 was significant in all three L1 models (ΔR2 Akan L1 RC = 0.68, p < 0.000; ΔR2 isiZulu L1 RC = 0.47, p < 0.000; and ΔR2 Shona L1 RC = 0.16, p < 0.000). Although model 4 was significant for all three groups, adding the last predictor (vocabulary) did not cause an increase in the predictive power of the model in any of the groups, highlighting the robustness of ORF as predictor. Following Cohen’s (1988) standards,3 the effect size of the predictors in model 4 may be described as large in all three groups. The strongest predictor of L1 RC was L1 ORF – adding this predictor explained 68% of additional variance for the Akan group 47% for the isiZulu group and 16% for the Shona group.
The L1 ORF coefficients were statistically significant in all three L1 RC models. Specifically, the standardised coefficients and 95% confidence intervals were as follows in model 4: Akan, β = 0.85, 95%CI (0.753, 0.934); isiZulu, β = 0.67, 95%CI (0.523, 0.822); and Shona, β = 0.36, 95%CI (0.124, 0.602). Because none of these intervals includes zero, one can conclude with 95% confidence that there is a positive linear relationship between L1 ORF and L1 RC and that the true population coefficient falls within the respective intervals. In contrast, the L1 vocabulary coefficients were not statistically significant in any of the models. The 95% confidence intervals for Akan (β = 0.03, 95%CI [–0.614, 0.119]), isiZulu (β = 0.08, 95%CI [–0.068, 0.230]), and Shona (β = 0.23, 95%CI [–0.021, 0.477]) all included zero. Thus, one cannot conclude with 95% confidence that L1 vocabulary has a significant linear association with L1 RC.
The hierarchical regression analyses with English L2 RC as outcome variable was performed in the same manner as described above. The regression statistics for the L2 comprehension models are given in Table 4.
| TABLE 4: Hierarchical regression English L2 reading comprehension. |
The regression analyses show that the first L2 model (only gender added as predictor) was not significant in any of the groups groups (Ghana F [1,155] = 2.483, p = 0.177, R2 = 0.02; South Africa F [1,94] = 1.416, p = 0.237, R2 = 0.03; Zimbabwe F [1,60] = 1.32, p = 0.255, R2 = 0.02). Adding the predictor age resulted in statistically significant models for the Ghanaian learners (F [2,154] = 12.457, p < 0.000, R2 = 0.14), explaining 12% additional variance (R2 = 0.12, p < 0.000). Adding age as predictor did not yield a significant model for the Zimbabwean or South African L2 data. Adding L2 ORF to the model in the third step resulted in significant models for all three groups (Ghana F [3,153] = 41.013, p < 0.000, R2 = 0.45; South Africa F [3,92] = 13.328, p < 0.000, R2 = 0.30; Zimbabwe F [3,58] = 9.779, p < 0.000, R2 = 0.34). The models remained significant in all three groups when L2 vocabulary was added as predictor (Ghana F [4,152] = 39.013, p < 0.000, R2 = 0.51; South Africa F [4,91] = 14.242, p < 0.000, R2 = 0.39; Zimbabwe F [4,57] = 7.425, p < 0.000, R2 = 0.34). Adding vocabulary as predictor explained an additional 6% and 8% of the variance in the Ghanaian and South African models, but L2 vocabulary did not account for any additional variance in L2 RC in the Zimbabwean learners.
The L2 ORF coefficients were statistically significant in all three final models. In Ghana, the standardised coefficient was β = 0.44 (95%CI [0.305, 0.578]); in South Africa, β = 0.41 (95%CI [0.232, 0.603]); and in Zimbabwe, β = 0.50 (95%CI [0.215, 0.780]). None of these intervals includes zero, meaning that one can conclude with 95% confidence that there is a positive linear relationship between L2 ORF and English L2 RC, and that the true population coefficient falls within the respective intervals. For the L2 vocabulary coefficients, the results varied by country. The relationship was statistically significant for Ghana (β = 0.30, 95%CI [0.170, 0.444]) and South Africa (β = 0.31, 95%CI [0.126, 0.484]), as neither interval included zero. However, the coefficient for Zimbabwe was not statistically significant (β = 0.11, 95%CI [–0.179, 0.399]), as the confidence interval included zero.
ΔR2 from model 2 to model 3 was again much more pronounced than ΔR2 changes at any other stage. The effect size of model 4 was large for each group, with L2 ORF emerging as the strongest predictor of L2 English RC, explaining an additional 31% of variance in the Ghana model, an additional 26% of variance in the South Africa model, and an additional 30% of variance in the Zimbabwe model.
Discussion
The aim of this study was twofold. First, the researchers wanted to explore the contribution of reading fluency to RC in three different bilingual groups of African learners, using a conceptual model that also considered the contribution of vocabulary to RC. Second, the researchers wanted to contribute original ORF data from readers of African languages, and from English L2 readers in African contexts to further ORF benchmark research – specifically, we wanted to support the ORF benchmark estimates for Nguni languages as reported in Ardington et al. (2020), and estimate ORF benchmarks for Akan and Shona.
In all three L1 models, ORF was the strongest predictor of RC, accounting for 68% of the explained variance in Ghana, 47% in South Africa, and 17% in Zimbabwe. Gender and age were weak predictors of L1 RC; these variables explained 4% of variance in Ghana, 2% in South Africa, and 3% in Zimbabwe. Vocabulary contributed minimally once ORF was controlled, although in the L2 models for Ghana and South Africa, it explained an additional 6% – 8% of variance.
In the English Second language models, the analyses revealed again that L2 ORF was the best predictor of RC in all three countries. Gender was a poor predictor of L2 comprehension, but age had a significant negative impact in Ghana, indicating that older learners had lower RC scores. This may be attributed to older learners repeating grades or being absent from school. Most sub-Saharan African countries report high grade repetition rates (Taniguchi 2015). Driven by poverty, many Ghanaian children are absent from school because they have to help with farming, fishing, household chores, or other forms of labour (Ampiah & Adu-Yeboah 2009). This economic necessity often takes precedence over education, leading to irregular school attendance, poor academic performance, and high dropout. Second language ORF explained 31% of variance in reading L2 comprehension in Ghana, compared to 26% in South Africa and 30% in Zimbabwe. Second language vocabulary had a more pronounced effect on L2 comprehension in Ghana and South Africa, where L2 vocabulary uniquely contributed to L2 comprehension (with 6% and 8% additional variance explained). Even though Zimbabwean learners had significantly higher English vocabulary scores, adding L2 vocabulary did not significantly improve the model. Possibly, these learners found the English text comparatively easy (as English was their MOI from Grade 1), and a vocabulary-comprehension relationship might only appear when learners have to read a text that contains more challenging vocabulary.
The data presented support global results that have indicated the importance of developing fluency in both languages of a bilingual learner (Baker et al. 2012; Kim et al. 2010; Wawire et al. 2023), and confirm studies in African contexts that reported fluency as a stronger predictor of RC than vocabulary (Pretorius & Mampuru 2007). Furthermore, the findings support the work by scholars such as Leachman, Wolters and Kim (2025), who noted that the predictive power of models explaining comprehension varies as a function of developmental phases and orthographic depths, suggesting the need for tailored educational strategies in each language. Increased fluency also supports vocabulary expansion, which further supports other important literacy constructs such as RC, learner motivation and confidence, as readers experience more satisfaction from reading (Mason & Krashen 2017). Finally, fluency is linked to overall academic achievement because it enables learners to read to learn, accessing and understanding content across subjects (Paige & Smith 2018). Overall, the findings highlight the critical role of L1 and L2 reading fluency for L1 and L2 RC. English vocabulary contributed to English L2 comprehension, but its impact was secondary to that of fluency.
The second aim of the study was to contribute to the current debate on ORF benchmarks for African languages and L2 English readers in African contexts. Readers in European countries typically achieve both accuracy and fluency in L1 reading by the end of the first school year (Seymour et al. 2003). The developmental rate of English decoding and ORF skills in native English children is more than twice as slow as in children who are native speakers of other European languages with shallow orthographies; this difference has been attributed to differences in linguistic complexity and orthographic depth (Seymour et al. 2003). Southern African languages have transparent orthographies, which theoretically means that establishing phoneme – grapheme correspondences (which supports decoding) should be simple. However, mounting evidence shows that South African learners read faster in English than in their L1, and many learners are not decoding text in their L1 at expected rates at the end of Grade 3. Decoding in African languages might be harder than anticipated, and fluency rates in African languages might be significantly lower than in English because of the agglutinative nature of (some) African languages, and orthographic demands. Van Rooy and Pretorius (2013), for example, show that eye-fixations of isiZulu speakers are much longer when reading isiZulu compared to English, suggesting that reading isiZulu requires more cognitive effort and processing time. Secondly, the lack of tone markers in written forms of the three languages may cause confusion. Finally, isiZulu words have a low number of permissible letter combinations, making many words look alike. This possibly requires long duration of fixations noted among proficient readers of isiZulu (Land 2015b, 2016).
The data in this study suggest that fluency rates in all three African languages were indeed lower than in English, and that children who could achieve 100% comprehension levels in Akan, isiZulu and Shona read at an average of 75 WCPM, 44 WCPM and 55 WCPM. Children reading at 46 WCPM (isiZulu) and 48 WCPM (Shona) could still achieve a comprehension level above 60%. Reading rates in Akan were slightly higher than in isiZulu, and Shona, and indicate that the single benchmark of 40–60 WCPM suggested for Ghanaian languages (Ministry of Education 2014) does not adequately capture variation. Large-scale data are needed to estimate ORF benchmarks for different Ghanaian languages, as existing data (even if limited) suggest that better defined ranges are essential. The present sample is too small to generalise data, but comparing these data to previous South African studies is useful to provide supportive evidence for existing South African benchmarks. The Grade 5 and Grade 6 learners who achieved 80% or higher comprehension in English L2 read at 94 WCPM in South Africa and at 89 WCPM in Ghana, which aligns well with the 90 WCPM benchmark set for the end of Grade 5 in Wills et al. (2022). Learners in Zimbabwe read faster (108 WCPM) at the 80% comprehension level, which falls in between the L1 English and L2 English benchmarks for South African readers. This probably reflects their amount of exposure to English. Researchers thus need to establish unique English L2 ORF benchmarks for Zimbabwe. Learners who read below 40 WCPM could not answer any questions in all three groups. Importantly, some learners in all three groups achieved less than 40% comprehension, even though they read well above the 50 WCPM threshold suggested for Grade 3 learners in South Africa. This confirms that reaching the 90 WCPM is important to facilitate English comprehension in the intermediate grades.
Regarding L1 ORF benchmarks, the data show that L1 isiZulu and Shona children can reach 80% comprehension levels at rates of 47 WCPM and 49 WCPM, which is slightly below the 50 WCPM suggested in Wills et al. (2022) for Grade 5 readers. Readers in Akan L1 read at 55 WCPM at the 80% comprehension level, which aligns fairly well with the benchmarks suggested for conjunctive languages in South Africa at Grade 5–6 level. In all three African languages, readers reading below 40 WCPM struggled with understanding, achieving less than 40% comprehension.
Theoretical and pedagogical implications
The original SVR proposed that RC is the product of D (word recognition) and LC (Hoover & Gough 1990), but most scholars now agree that reading fluency needs to be considered in RC models. The presented data support the idea that text reading fluency, in particular, bridges the gap between decoding individual words and RC. Some scholars have suggested that it is the ability to read with appropriate prosody (a particular feature of fluent text reading) that enhances comprehension and retention of information, rather than the ability to decode words accurately and rapidly (Tong et al. 2023; Wolters, Kim & Szura 2020).
Linguistic comprehension remains essential, as it directly impacts the ability to understand and process text. However, the models in this study show that LC (operationalised as vocabulary knowledge) do not always add significant explanatory power for RC once decoding is accounted for. The presented data neither fully support nor refute the SVR, but rather highlight the complexity of RC in the African context. The strong predictive power of L1 and L2 ORF supports the SVR’s emphasis on decoding, but L1 vocabulary failed to explain any variance in L1 RC, and L2 vocabulary only marginally affected L2 RC. The SVR provides a useful framework, but does not capture the nuances of L1 RC in diverse multilingual African contexts, which also depends on knowledge of morphosyntactic structures, orthographic knowledge, quality of instruction, background knowledge and critical thinking skills.
The researchers recommend that interventions aimed at improving L1 and L2 comprehension should focus on enhancing reading fluency. Educational programmes should prioritise developing learners’ fluency in reading aloud to boost their overall RC skills. To support L1 fluency in agglutinative African languages, we concur with Land (2016), who suggested that young readers should be introduced to a core list of short words that do not agglutinate (e.g., conjunctives), which are easier to decode automatically. Readers should also practise identifying common word stems and common combinations of morphemes in complex agglutinating words.
Extensive reading programmes have been flagged as interventions that are especially helpful for improving reading fluency and comprehension in learners of all ages (Kuhn, Schwanenflugel & Meisinger 2010; Mason & Krashen 2017; Reynolds & Goodwin 2016). Extensive reading has also been tested in the South African context, where its implementation improved reading fluency and comprehension (Pretorius & Lephalala 2011; Pretorius & Mampuru 2007), and led to cognitive, affective, and social benefits as early as the Foundation Phase (Nkomo 2021). A key feature of extensive reading is that reading materials should be slightly above the learners’ current proficiency level, ensuring that they are challenged without being frustrated. Comprehensible input also supports vocabulary acquisition (Mason & Krashen 2017). All extensive reading interventionists highlight the importance of reading for pleasure rather than for assessment. This reduces anxiety, promotes self-selection of reading materials and increases motivation, autonomy and engagement with texts. Implementing sustained silent reading (Kuhn et al. 2010) as part of extensive reading helps to set aside regular, uninterrupted time for reading, and when implemented correctly, improves reading stamina and fluency.
Educators need to incorporate a variety of reading materials beyond graded readers and textbooks in their daily classroom activities (Reynolds & Goodwin 2016). Learners’ progress can be monitored and supported through reading logs and providing feedback and discussion about the books learners are reading, which will enhance comprehension and critical thinking (Reynolds & Goodwin 2016). Other practical classroom elements from extensive reading programmes are daily reading sessions, setting reading challenges, organising book clubs and group discussions, and encouraging the use of reading journals. Integrating extensive reading with other subjects by selecting books that complement the curriculum can also be beneficial.
Limitations
The present study has some limitations. First, the overall sample was small, and the removal of the Grade 5 Shona data meant that the researchers had limited Zimbabwean data. Although most of the assumptions of regression analyses were met, the residuals in the L1 models deviated from normality, which means that the reported confidence intervals may be too wide or too narrow. A larger sample would provide more robust results and support generalisations of the findings.
Second, gender and age were weak predictors of RC in this sample, indicating that other unmeasured variables were impacting RC. The predictive power of vocabulary was also smaller than anticipated – possibly because we measured only receptive vocabulary, because of limited time and resources. Expressive vocabulary showed stronger associations with RC than receptive vocabulary in languages such as English (Dixon et al. 2022) and Dutch (Van Viersen et al. 2022), suggesting that the relationship between different dimensions of vocabulary and RC in African languages needs to be studied more carefully. Given that the regression model (with gender, age, L1 and L2 ORF, and L1 and L2 vocabulary as predictors) is clearly not exhaustive, future models can include a broader range of cognitive and linguistic factors, as well as sociolinguistic and educational factors such as language exposure, learner motivation, the home literacy environment, teacher preparedness, and teaching materials. The focus here was only on within-language prediction, but an interactive framework that considers cross-linguistic prediction might prove more powerful overall.
Finally, in Ghana, L1 instruction in local indigenous languages in the first three years of education depends on variables such as the availability of materials and teacher training (Appiah 2024). The researchers were unable to control for these variables, which might have affected the amount of literacy instruction that children received in Akan.
Conclusion
This study highlights the important role of reading fluency in facilitating RC in both the L1 and L2 of bilingual learners in the intermediate education phase in Ghana, South Africa and Zimbabwe. First language reading fluency was the strongest predictor of L1 RC in Akan, isiZulu and Shona. First language vocabulary did not explain additional variance in this study, once reading fluency had been accounted for. Second language reading fluency also emerged as the strongest predictor of L2 English RC, but vocabulary did explain some of the variance in the Ghanaian and South African L2 models. Even so, the data show that reading fluency (in both the L1 and in English L2 reading) accounts for more variance in RC than vocabulary in these African learners, and support versions of the SVR that stress the importance of reading fluency for comprehension. It would seem that, at least up to the intermediate phase of education in African contexts, a strong focus on decoding, and specifically fluent decoding, is required. The role of vocabulary in comprehension can never be disregarded, but vocabulary knowledge cannot support RC without reading fluency.
This study also highlights the need for tailored educational strategies to support the development of reading fluency in the foundation and intermediate phases of formal education. The researchers argue that understanding the impact of ORF on RC will enable educators to better support learners’ literacy development in African contexts, and recommend further studies on the developmental trajectory of reading fluency in multilingual African learners in low-resource contexts.
Acknowledgements
The authors would also like to acknowledge the diligence of the field workers in Ghana, South Africa and Zimbabwe, whose work enabled the authors to collect the data within a limited time frame.
Competing interests
The first author reported that she received funding from the Dutch Research Council Talent Programme (VENI). The author has disclosed those interests fully.
Authors’ contributions
As project leader, Lieke Stoffelsma conceptualised the article and was responsible for instrumentation, data collection and supervision of field workers, data analysis, interpretation and visualisation, writing of the first draft, compiling references, data curation, reviewing and editing the final draft, project administration and funding acquisition. As South African collaborator, Carien Wilsenach was involved the development of isiZulu instruments, data collection and supervision of field workers, interpretation of data, writing of the first draft, compiling references, reviewing and editing the final draft, and project administration. As Zimbabwean collaborator, Fungai Mutema was involved in developing the Shona instruments, data collection, project administration, and producing contextual information and sources to be included in the draft version of the article.
Funding information
This work was supported by the Dutch Research Council Talent Programme (VENI) under Grant VI.Veni.201T.011.
Data availability
The data will be available at the Radboud University Repository: https://doi.org/10.34973/s0nb-em39.
Disclaimer
The views and opinions expressed in this article are those of the authors and are the product of professional research. It does not necessarily reflect the official policy or position of any affiliated institution, funder, agency, or that of the publisher. The authors are responsible for this article’s findings, and content.
References
Ampiah, J.G. & Adu-Yeboah, C., 2009, ‘Mapping the incidence of school dropouts: A case study of communities in Northern Ghana’, Comparative Education 45(2), 219–232. https://doi.org/10.1080/03050060902920625
Appiah, B., 2024, ‘The language of instruction in the basic schools: The impact of English as a medium of instruction in the basic schools at Kwabre-East Municipality of Ghana’, Thesis, University of Iceland.
Ardington, C., Wills, G., Pretorius, E., Deghaye, N., Menendez, A., Mohohlwane, N. et al., 2020, Technical Report: Benchmarking early grade reading skills in Nguni languages, viewed 03 March 2025, from https://resep.sun.ac.za/wp-content/uploads/2020/11/Nguni-Languages-Summary-Report-V06-3-1.pdf.
Awich, M., 2021, SACMEQ IV international report. A study of the conditions of schooling and the quality of education, viewed 05 May 2025, from http://www.seacmeq.org/?q=sacmeq-projects/sacmeq-iv/reports.
Baker, S.K., Smolkowski, K., Katz, R., Fien, H., Seeley, J.R., Kame’enui, E.J. et al., 2012, ‘The impact of reading fluency interventions on the RC of middle school learners with learning disabilities’, Learning Disabilities Research & Practice 27(1), 1–10. https://doi.org/10.1111/j.1540-5826.2011.00347.x
Bigozzi, L., Tarchi, C., Vagnoli, L., Valente, E. & Pinto, G., 2017, ‘Reading fluency as a predictor of school outcomes across grades 4–9’, Frontiers in Psychology 8, 00200. https://doi.org/10.3389/fpsyg.2017.00200
Chivhanga, E. & Chimhenga, S., 2013, ‘Language planning in Zimbabwe: The use of indigenous languages (Shona) as a medium of instruction in primary schools’, IOSR Journal of Humanities and Social Science 12, 58–65. https://doi.org/10.9790/0837-1255865
Cohen, J., 1988, Statistical Power Analysis for the Behavioral Sciences, 2nd edn., Lawrence Erlbaum Associates, Hillsdale NJ.
Cremer, M. & Schoonen, R., 2013, ‘The role of accessibility of semantic word knowledge in monolingual and bilingual fifth-grade reading’, Applied Psycholinguistics 34(6), 1195–1217. https://doi.org/10.1017/S0142716412000203
Daane, M.C., Campbell, J.R., Grigg, W.S., Goodman, M.J. & Oranje, A., 2005, Fourth-grade students reading aloud: NAEP 2002 special study of oral reading (NCES 2006 – 469), viewed 22 April 2025, from https://nces.ed.gov/nationsreportcard/pdf/studies/2006469.pdf.
Dixon, C., Hessel, A., Smith, N., Nielsen, D., Wesierska, M. & Oxley, E., 2022, ‘Receptive and expressive vocabulary development in children learning English as an additional language: Converging evidence from multiple datasets’, Journal of Child Language 50(3), 610–631. https://doi.org/10.1017/S0305000922000071
Dowd, A.J. & Bartlett, L., 2019, ‘The need for speed: interrogating the dominance of oral reading fluency in international reading efforts’, Comparative Education Review 63(2), 189–212. https://doi.org/10.1086/702612
Draper, K. & Spaull, N., 2015, ‘Examining oral reading fluency among Grade 5 rural ESL learners in South Africa: An analysis of NEEDU 2013’, South African Journal of Childhood Education 5(2), 44–77. https://doi.org/10.4102/sajce.v5i2.390
Dunn, L.M. & Dunn, D.M., 2007, Peabody picture vocabulary test, 4th edn., Pearson, Bloomington.
Eberhard, D.M., Simons, G.F. & Fennig, C.D., 2024, Ethnologue: Languages of the World, 27th edn., SIL International, Dallas, TX.
Evans, D.K. & Acosta, M.A., 2020, Education in Africa: What are we learning, Center for Global Development Working Paper, World Bank.
Fuchs, L.S., Fuchs, D., Hosp, M.K. & Jenkins, J.R., 2001, ‘Oral reading fluency as an indicator of reading competence: A theoretical, empirical, and historical analysis’, Scientific Studies of Reading 5(3), 239–256. https://doi.org/10.1207/S1532799XSSR0503_3
Ghana Statistical Service, 2015, Ghana poverty mapping report, viewed 11 February 2025, from https://statsghana.gov.gh/gssmain/fileUpload/pressrelease/POVERTY%20MAP%20FOR%20GHANA-05102015.pdf.
Goodman, K.S. (ed.), 2006, ‘A critical review of DIBELS’, in The truth about DIBELS: What it is, what it does, pp. 1–39, Heinemann, Portsmouth, NH.
Goriot, C., Van Hout, R., Broersma, M., Lobo, V., McQueen, J.M. & Unsworth, S., 2021, ‘Using the peabody picture vocabulary test in L2 children and adolescents: Effects of L1’, International Journal of Bilingual Education and Bilingualism 24(4), 546–568. https://doi.org/10.1080/13670050.2018.1494131
Grabe, W., 2009, Reading in a second language: Moving from theory to practice, Cambridge University Press, Cambridge.
Graham, J. & Van Ginkel, A., 2014, ‘Reading comprehension in Kenyan languages’, Journal of African Languages and Linguistics 35(2), 123–145.
Hoover, W. & Gough, P., 1990, ‘The simple view of reading’, Reading and Writing 2, 127–160. https://doi.org/10.1007/BF00401799
Hudson, R.F., Lane, H.B. & Pullen, P.C., 2005, ‘Reading fluency assessment and instruction: What, why, and how?’, Reading Teacher 58(8), 702–714. https://doi.org/10.1598/RT.58.8.1
IBM Corp, 2024, IBM SPSS Statistics, Version 30 (Computer software), IBM Corp, Armonk, NY.
Jenkins, J.R., Fuchs, L.S., van den Broek, P., Espin, C. & Deno, S.L., 2003, ‘Sources of Individual Differences in Reading Comprehension and Reading Fluency’, Journal of Educational Psychology 95(4), 719–729. https://doi.org/10.1037/0022-0663.95.4.719
Kambon, Ọ.B., 2012, ‘Serial Verb Nominalization in Akan’, Doctoral dissertation, University of Ghana, Legon.
Kim, Y.-S., Wagner, R.K. & Foster, E., 2010, ‘Relations among oral reading fluency, silent reading fluency, and RC: A latent variable study of first-grade readers’, Scientific Studies of Reading 15(4), 338–362. https://doi.org/10.1080/10888438.2010.493964
Kirby, J.R. & Savage, R.S., 2008, ‘Can the simple view deal with the complexities of reading?’, Literacy 42(2), 75–82. https://doi.org/10.1111/j.1741-4369.2008.00487.x
Klauda, S.L. & Guthrie, J., 2008, ‘Relationships of three components of reading fluency to RC’, Journal of Educational Psychology 100(2), 310–321. https://doi.org/10.1037/0022-0663.100.2.310
Kuhn, M.R., Schwanenflugel, P.J. & Meisinger, E.B., 2010, ‘Aligning theory and assessment of reading fluency: Automaticity, prosody, and the definitions of fluency’, Reading Research Quarterly 45(2), 230–251. https://doi.org/10.1598/RRQ.45.2.4
Land, S., 2015a, ‘Reading and the orthography of isiZulu’, South African Journal of African Languages 35(2), 163–175. https://doi.org/10.1080/02572117.2015.1113000
Land, S., 2015b, ‘Skilled reading in isiZulu: what can we learn from it?’ Journal of Education 63, 57–87. https://doi.org/10.17159/i63a03
Land, S., 2016, ‘Automaticity in reading isiZulu’, Reading & Writing 7(1), a90. https://doi.org/10.4102/rw.v7i1.90
Leachman, M., Wolters, A. & Kim, Y.-S.G., 2025, ‘The relation between text reading and RC varies as a function of developmental phase, orthographic depth, and measurement characteristics: Evidence from a meta-analysis’, Journal of Educational Psychology 117(3), 508–528. https://doi.org/10.1037/edu0000765
Lervåg, A. & Melby-Lervåg, M., 2022, ‘Modeling the development of RC: The simple view of reading in a transparent orthography’, in M.J. Snowling, C. Hulme & K. Nation (eds.), The science of reading: A handbook, pp. 280–297, John Wiley & Sons, West Sussex.
Mason, B. & Krashen, S., 2017, ‘Extensive reading in English as a foreign language’, System 66, 1–13. https://doi.org/10.1016/j.system.2017.04.003
Ministry of Education, 2014, Ghana 2013 early Grade reading assessment and early grade mathematics assessment: Reports of findings, GES, Accra.
Mohohlwane, N., Ardington, C., Wills, G., Sebaeng, L., et al., 2022, Benchmarks report. Sesotho-Setswana early grade reading, Department of Basic Education, Pretoria, viewed 22 April 2025, from https://www.education.gov.za/Portals/0/Documents/Reports/ReadingBenchmarks22/9.%20Summary%20Report-Setswana%20Early%20Grade%20Reading%20Benchmarks.pdf.
Mutema, F. & Pretorius, E.J., 2024, ‘Does it matter if some learners read slowly? Exploring relationships between RC and oral reading fluency’, South African Journal of Childhood Education 14(1), a1518. https://doi.org/10.4102/sajce.v14i1.1518
Nag, S., Vagh, S.B., Dulay, K.M. & Snowling, M.J., 2019, ‘Home literacy, school language, and children literacy attainments: A systematic review of evidence from low- and middle-income countries’, Review of Education 7, 91–150. https://doi.org/10.1002/rev3.3130
National Institute of Child Health and Human Development, 2000, Report of the national reading panel: Teaching children to read: An evidence-based assessment of the scientific research literature on reading and its implications for reading instruction, U.S. Dept. of Health and Human Services, Public Health Service, National Institutes of Health, National Institute of Child Health and Human Development, Washington DC.
Nkomo, S.A., 2021, ‘The benefits of an extensive reading programme implemented in two foundation phase classrooms in the Eastern Cape, South Africa’, Reading & Writing: Journal of the Reading Association of South Africa 12(1), a290. https://doi.org/10.4102/rw.v12i1.290
Paige, D.D. & Smith, G.S., 2018, ‘Academic vocabulary and reading fluency: Unlikely bedfellows in the quest for textual meaning’, Education Sciences 8(4), 165. https://doi.org/10.3390/educsci8040165
Piper, B. & Dubeck, M.M., 2024, ‘Responding to the learning crisis: Structured pedagogy in sub-Saharan Africa’, International Journal of Educational Development 109, 103095. https://doi.org/10.1016/j.ijedudev.2024.103095
Piper, B., Schroeder, L. & Trudell, B., 2016, ‘Oral reading fluency and comprehension in Kenya: Reading acquisition in a multilingual environment’, Journal of Research in Reading 39(2), 133–152. https://doi.org/10.1111/1467-9817.12052
Pretorius, E.J. & Lephalala, M., 2011, ‘Reading comprehension in high-poverty schools: How should it be taught and how well does it work?’, Per Linguam 27(2), 1–24. https://doi.org/10.5785/27-2-105
Pretorius, E.J. & Mampuru, D.M., 2007, ‘Playing football without a ball: Language, reading and academic performance in a high-poverty school’, Journal of Research in Reading 30(1), 38–58. https://doi.org/10.1111/j.1467-9817.2006.00333.x
Pretorius, E.J. & Spaull, N., 2016, ‘Exploring relationships between oral reading fluency and reading comprehension amongst English second language readers in South Africa’, Reading and Writing 29, 1449–1471. https://doi.org/10.1007/s11145-016-9645-9
Proctor, C.P., Carlo, M., August, D. & Snow, C.E., 2005, ‘Native Spanish-speaking children reading in English: Toward a model of comprehension’, Journal of Educational Psychology 97(2), 246–256. https://doi.org/10.1037/0022-0663.97.2.246
Readability Formulas, n.d., Free text readability consensus calculator, viewed 28 August 2025, from https://readabilityformulas.com/readability-scoring-system.php.
Reynolds, D. & Goodwin, A.P., 2016, ‘Supporting reading fluency in young readers: A review of the evidence and implications for classroom practice’, Language and Literacy 18(2), 1–22. https://doi.org/10.1177/2332858416680353
Sadoski, M. & Paivio, A., 2007, ‘Toward a unified theory of reading’, Scientific Studies of Reading 11(4), 337–356. https://doi.org/https://doi.org/10.1080/10888430701530714
Schaefer, M., 2023, ‘Phonological processing skills and their longitudinal relation to first and additional language literacy in isiXhosa and isiZulu speaking children’, PhD thesis, University of South Africa (UNISA), Pretoria.
Schwanenflugel, P. & Kuhn, M., 2016, ‘Reading fluency’, in P. Afflerbach (ed.), Handbook of individual differences in reading: Text and context, pp. 107–120, Routledge Publishing, New York, NY.
Seymour, P.H., Aro, M. & Erskine, J.M., 2003, ‘Foundation literacy acquisition in European orthographies’, British Journal of Psychology 94(pt 2), 143–174. https://doi.org/10.1348/000712603321661859
Spaull, N., Pretorius, E. & Mohohlwane, N., 2020, ‘Investigating the comprehension iceberg: Developing empirical benchmarks for early-grade reading in agglutinating African languages’, South African Journal of Childhood Education 10(1), a773. https://doi.org/10.4102/sajce.v10i1.773
Taniguchi, K., 2015, ‘Determinants of grade repetition in primary school in sub-Saharan Africa: An event history analysis for rural Malawi’, International Journal of Educational Development 45, 98–111. https://doi.org/10.1016/j.ijedudev.2015.09.014
Tong, S.X., Lentejas, K., Deng, Q., An, N. & Cui, Y., 2023, ‘How prosodic sensitivity contributes to RC: A meta-analysis’, Educational Psychology Review 35, 78. https://doi.org/10.1007/s10648-023-09792-8
UN, 2015, Transforming our world: The 2030 Agenda for Sustainable Development. United Nations, viewed 13 December 2025, from https://sdgs.un.org/publications/transforming-our-world-2030-agenda-sustainable-development-17981.
UN, 2018, The sustainable development goals report 2018, United Nations, New York, NY, viewed 11 February 2025, from https://unstats.un.org/sdgs/files/report/2018/thesustainabledevelopmentgoalsreport2018-en.pdf.
UNESCO, 2005, EFA Global Monitoring Report 2006: Literacy for life, UNESCO, Paris.
UNESCO, 2014, Teaching and learning: Achieving quality for all: Education for All Global Monitoring Report 2013/14, UNESCO Publishing. https://doi.org/10.54676/CMSE2898
UNESCO, 2015, Incheon declaration and framework for action for the implementation of sustainable development goal 4, viewed 11 February 2025, from https://unesdoc.unesco.org/ark:/48223/pf0000245656.
UNICEF, 2016, The impact of language policy and practice on children’s learning: Evidence from Eastern and Southern Africa, viewed 03 March 2025, from https://www.unicef.org/esa/sites/unicef.org.esa/files/2018-09/UNICEF-2016-Language-and-Learning-FullReport.pdf
Van Rooy, B. & Pretorius, E.J., 2013, ‘Is reading in an agglutinating language different from an analytic language? An analysis of IsiZulu and English reading based on eye movements’ Southern African Linguistics and Applied Language Studies 31(3), 281–297. https://doi.org/10.2989/16073614.2013.837603
Van Staden, S., Roux, K. & Tshele, M. (eds.), 2023, PIRLS 2021: South African children’s literacy achievement, Centre for Evaluation and Assessment, University of Pretoria, Pretoria.
Van Viersen, S., Altani, A., De Jong, P.F. & Protopapas, A., 2024, ‘Between-word processing and text-level skills contributing to fluent reading of (non)word lists and text’, Reading and Writing: An Interdisciplinary Journal 38, 671. https://doi.org/10.1007/s11145-024-10533-8
Wang, Z., Sabatini, J., O’Reilly, T. & Weeks, J., 2019, ‘Decoding and RC’, Journal of Educational Psychology 111(3), 387–401. https://doi.org/10.1037/edu0000302
Wawire, B.A., Liang, X. & Piper, B., 2023, ‘The mediating role of text reading fluency in RC in English and Kiswahili: Evidence from multilingual contexts in Kenya’, Reading & Writing Quarterly 39(3), 173–191. https://doi.org/10.1080/10573569.2022.2078754
Wills, G., Ardington, C., Pretorius, L. & Sebaeng, L., 2022, Benchmarking early grade reading skills: English First Additional Language. Summary report, Khulisa Management Services, Johannesburg, viewed 22 April 2025, from https://www.education.gov.za/Portals/0/Documents/Reports/ReadingBenchmarks22/5.%20Summary%20Report-%20English%20as%20a%20First%20Additional%20Language%20Benchmarking.pdf.
Wolters, A.P., Kim, Y.-S.G. & Szura, J.W., 2020, ‘Is reading prosody related to RC? A meta-analysis’, Scientific Studies of Reading 26(1), 1–20. https://doi.org/10.1080/10888438.2020.1850733
Zainurrahman, F.N. & Sukyadi, D., 2024, ‘Text readability and its impact on RC and reading time’, Journal of Education and Learning 18(4), 1422–1432. https://doi.org/10.11591/edulearn.v18i4.21724
Footnotes
1. The intention is that learners in the three countries receive formal instruction in the Foundation Phase in their L1. In reality, though, children in multilingual metropolitan areas might have an L1 at home that is different from the MOI that is used at the school that they attend. Also, the amount and quality of L1 instruction depends on the availability of teaching materials in the L1.
2. All Pearson correlations reported are significant (p < 0.01). Because of a lack of space, comprehensive correlation tables are not included.
3. Cohen (1988:413–414) proposed the following standards to interpret effect sizes of predictors and dependent variables using R-square: 0.02 (small effect); 0.13 (medium effect); 0.26 (large effect).
|