Top Banner
Presented at Society for Research in Child Development, April 2017 An evaluation of the quality of pre-primary schooling in East Africa and its association with early primary outcomes Frances E. Aboud a* , Amina Abubakar b , Elias Kumbakumba c , Shehe Abdalla Moh'd d , Melanie Dirks a a Department of Psychology, McGill University, Montreal, Canada b Pwani University and KEMRI, Kenya c Mbarara University of Science and Technology, Uganda d State University of Zanzibar, Zanzibar *Department of Psychology 2001 McGill College Ave., Montreal QC H3A 1G1 Telephone 514 398-6099; fax 514 398-4896 [email protected] Acknowledgements The research was funded by the SESEA project (Strengthening Education Systems in East Africa) with funds from Global Affairs Canada. The funders played no role in study design; in the collection, analysis and interpretation of data; in the writing of the report; and in the decision to submit the article for publication.
48

Abstract · Web view< .001, respectively), thus providing validity of the two achievement tests in relation to school performance. The two Executive Function tests were the Tower

Feb 03, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript

Evaluation of East Africa pre-primary schooling 2

Presented at Society for Research in Child Development, April 2017

An evaluation of the quality of pre-primary schooling in East Africa and its association with early primary outcomes

Frances E. Abouda*, Amina Abubakarb, Elias Kumbakumbac, Shehe Abdalla Moh'dd, Melanie Dirksa

aDepartment of Psychology, McGill University, Montreal, Canada

bPwani University and KEMRI, Kenya

cMbarara University of Science and Technology, Uganda

dState University of Zanzibar, Zanzibar

*Department of Psychology

2001 McGill College Ave., Montreal QC H3A 1G1

Telephone 514 398-6099; fax 514 398-4896

[email protected]

Acknowledgements

The research was funded by the SESEA project (Strengthening Education Systems in East Africa) with funds from Global Affairs Canada. The funders played no role in study design; in the collection, analysis and interpretation of data; in the writing of the report; and in the decision to submit the article for publication.

We would like to acknowledge the cooperation and support of personnel affiliated with the Education Ministries of Kenya, Uganda and Zanzibar, and the Madrasa Early Childhood Programme (MECP) of East Africa. Shekufeh Zonji, a program manager and researcher with wide experience, helped train assistants on the MELE measure of quality. Also we thank the staff and teachers of the pre-primary schools and primary schools, as well as the children and their parents, for providing important information.

Abstract

The purpose was to evaluate two common models of pre-primary schooling in East Africa, in terms of their quality and early primary academic benefits to children. Some 117 Madrasa pre-primaries were randomly selected from Kenya, Uganda, and Zanzibar, and 100 non-Madrasa pre-primaries feeding into the same government primary schools. Their quality was observed and rated using the 50-item Measure of Early Learning Environments (MELE), developed by an internationally sponsored team of experts and intended to be relevant for low- and middle-income countries. One year later, five randomly selected graduates from each observed pre-primary (N = 974, ages 70 to 117 months) were tested on the Early Grades Reading and Math tests, and on executive function and social problem-solving measures. The Madrasa pre-primaries were found to have higher overall quality, play, language, group activities, and program structure. There was no significant difference in first grade performance between graduates from the two programs. However, quality mattered. Based on a multilevel linear regression analysis, overall quality and some domain-specific qualities were associated with math and literacy performance. An increase in overall quality of one standard deviation led to an increased math score of 7.7% in Uganda and literacy score of 10.4% in Kenya.

Keywords: Measure of Early Learning Environments (MELE); early childhood development; Kenya; Uganda; Zanzibar

1. Introduction

Too many children around the world do not complete secondary or even primary schooling, and those who do often lack expected proficiencies in literacy and numeracy (UNESCO, 2017). Consequently, there have been concerted efforts by the international community to support improvements in education, especially access to pre-primary education and a smooth transition to primary school (e.g. Global Partnership for Education, 2016). It is expected that attending at least one year of pre-primary education will give children the kind of experience they need to succeed in primary school (Earle et al., 2018). For this reason, one of the Sustainable Development Goals (SDG) related to education is to have all children receive at least a year of free quality pre-primary education (UNESCO, 2015). Pre-primary education here refers to center-based organized instruction designed to introduce children to a school-type environment in the year ahead of primary schooling. In contrast, the commonly-used term preschool broadly refers to organized group care between the ages of 3 and 6 years. The current study compared two models of pre-primary education in East Africa by evaluating the quality of their teaching/learning program, using a quality measure newly developed for low- and middle-income countries (LMIC), and the academic, cognitive and social benefits to children one year later at the end of first grade.

Uganda, Zanzibar in Tanzania, and Kenya are low- and low-middle-income countries with Human Development Indices of 0.493, 0.531 and 0.555, respectively (United Nations Development Program, 2016). During the study, only Kenya had a widespread system of free pre-primary education where 77% of children 3-6 years participated; Zanzibar and Uganda had less than 20% of eligible children attending public pre-primaries with a similar number attending private faith-based programs (UNICEF, 2017). Yet, the three countries, along with others in East Africa, are working to align their education curricula at the pre-primary level (East African Community Secretariat 2014). The quality of these pre-primaries and the impact of quality on early grades literacy and math will help determine how close we are to meeting the SDG goals.

1.1 Background Literature

Until now, increasing access was the main goal of national and international organizations because of research showing that attending pre-primary led to cognitive gains with an effect size of 0.64 (Rao et al., 2017). Most of these studies assessed children at the end of a pre-primary program and found that they performed better on tests of school readiness than children who did not attend pre-primary (e.g., Aboud, 2006; Rao et al., 2012; Rao et al., 2012; see review by Nores & Barnett, 2010). There were also attempts to identify or directly implement improved pre-primary programs; as expected, improved or developmentally appropriate programs conferred greater benefits on children (Aboud et al., 2016; Moore et al., 2008; Mwaura et al., 2008; Opel et al., 2009; Opel et al., 2012). With attention now being paid to the concept of “quality pre-primary” we used Burchinal’s (2018) definition of early education quality, entailing sensitive and responsive interaction, setting reasonable limits to acceptable behavior, intentional teaching of age-appropriate skills and scaffolding of children’s learning, using a curriculum for instruction, and positive links with families.

1.2 Measuring Pre-primary Quality

In order to go beyond informed decisions about what makes a pre-primary have higher quality, a measure of pre-primary quality appropriate for low- and middle-income countries is needed. Our first question was: Would a newly developed measure of pre-primary quality be feasible in East Africa and reveal program differences? A metric of quality is important for a number of reasons. Although “developmentally appropriate” learning activities are generally known by educators and child developmentalists, a metric would permit a finer-grained analysis. A metric might also be used to describe features of a high quality program that in turn would be used to guide improvement. Finally, a measure of quality could be validated through associations with a direct assessment of children’s performance to confirm whether those qualities are indeed worth striving for.

Measures of quality developed for pre-primaries in LMIC have not been available until recently, so researchers have used ones developed in the United States and United Kingdom, sometimes with extensive modifications to accommodate a low-resource setting. When used in some Latin American countries such as Brazil, Chile, Colombia, Ecuador, and Peru, the Teacher Instructional Practices and Processes System (TIPPS; Seidman et al., 2014) and the Classroom Assessment Scoring System (CLASS; Pianta et al., 2008) provide information on the quality of teacher-child interaction. The TIPPS has also been used in Ghana (McCoy & Wolf, 2018; Wolfe et al., 2018) where, after being reduced to 14 items, it showed some significant but low correlations with preschool literacy and numeracy. However, the more general measure of quality, namely the seven-subscale Early Childhood Environment Rating Scale (ECERS-R; Harms et al., 1998) has been used in Latin America (Araujo and Schady, 2015) and extensively in Africa and Asia (Aboud, 2006; Brinkman et al., 2016) along with the ECERS extension to math, literacy, and science teaching (ECERS-E; Sylva et al, 2006). Yet researchers have acknowledged the unsatisfactory strategy of modifying measures in major and haphazard ways to suit low-resource contexts, or relying on the original quality measure without excluding context-unsuitable items. For example, Brinkman et al. (2016) found that 15 out of 43 items of the ECERS-R were not applicable to any of their rural Indonesian centers which lacked furnishings, computers, videos and televisions.

Consequently, along with four institutions supporting the MELQO (2017) initiative (Brookings Institution, UNESCO, UNICEF, and World Bank), we developed a measure of pre-primary process quality tailored for use in LMIC and based on the experience of international experts. This was the first effort to assess its use in East Africa and its value in predicting first grade literacy and math achievement.

1.3 Child Outcomes Associated with Quality

Our second question addressed the developmental benefits of attending high quality pre-primary education. Developmental benefits are usually measured in terms of academic achievement, or cognitive, language and social-emotional development. Using a school readiness test and other cognitive measures, several researchers have found a relation between ECERS-E and -R quality and pre-primary students’ performance in Bangladesh, Indonesia and East Africa (Aboud, 2006; Aboud et al., 2016; Brinkman et al., 2016; Malmberg et al., 2011). Studies in Latin America where an emergent approach to teaching/learning is more common found mixed results using the CLASS in Chile (Yoshikawa, et al., 2015). Similarly, in the United States, recent studies have found some evidence for preschool instructional quality predicting pre-primary (aka kindergarten) language and reading but not math achievement (Burchinal et al., 2008), though in most studies the associations were modest at best with effect sizes less than 0.20 (Burchinal, 2018). However, British children’s pre-academic and cognitive outcomes were more strongly associated with the ECERS-E literacy and math quality scores than with the ECERS-R total or subscale quality scores (Sylva et al., 2006). In sum, there is mixed evidence for the relation between measures of program quality and student learning in the pre-primary year. Less is known about how pre-primary quality relates to primary school learning, the focus of the current study.

The limitations of existing research on pre-primary quality and children’s learning concern first the measure of quality and second how it relates to school achievement beyond the pre-primary level. To address the first limitation, the Measuring Early Learning Quality and Outcomes (MELQO) Initiative (2016) held meetings with experts and set up a technical advisory group to develop a measure of quality appropriate for low- and middle-income countries. This measure, the MELE, was explored here for the first time to compare two well-known models of pre-primary education in East Africa. To address outcomes, which in past research were often not well-aligned with qualities or with future academic skills (Burchinal, 2018), we used a standard measure of reading and math for early primary grades, developed and used by governments in multiple countries with some cultural adaptation (e.g., Piper and Mugenda, 2012). Thus, previous limitations of quality and outcome measures were addressed in this study.

Another limitation of almost all the pre-primary research in LMIC, including this one, is the design, namely that children are not randomly assigned to pre-primaries of high or low quality, or to pre-primaries implementing different models. This happens because governments and organizations are already implementing pre-primaries; helping them to evaluate and improve existing programs takes precedence over researcher-controlled programs. Consequently, there may be selection bias if certain children attend certain pre-primaries. However, in LMIC children normally attend the pre-primary in their village with little choice. Only one published study included in a Cochrane Review (Brown et al., 2014) partly overcame this limitation by having pre-post assessments on the same children. This was the earlier study of our Madrasa and non-Madrasa preschools where children were pretested at entry when 3 or 4 years of age and retested 1 year and 2 years later (Mwaura et al., 2008). The study revealed no cognitive differences between the two groups of children at pretest, a significant difference one year later and a reduced difference two years later at the end of pre-primary. A problem associated with following up children even one year later in the same preschool is that of attrition which was 38% in the Mwaura study. This severely curtails statistical analyses. Considering this rate of attrition which one expects to be higher when following pre-primary children into primary, we used the post-only non-randomized design of other researchers in low-income countries, while minimizing and controlling for observed group differences, such as the mother’s education and family assets (e.g. Aboud & Hossain, 2011; Brinkman et al., 2016).

1.4 Aims of the Study

The present study evaluated the quality of two community-based models of pre-primary education in Kenya, Uganda and Zanzibar, namely the Madrasa program and other government or community programs. The term “Madrasa” in Arabic refers to school, though it has mistakenly come to be associated only with schools that teach Islam. In recent years, Madrasas in many countries have adopted the government curriculum or one that is even more progressive. The Madrasa preschool program in East Africa (also known as the Madrasa Early Childhood Program or MECP) integrated Islamic teaching into a secular curriculum. Though now managed and funded by the communities with parent contributions, the curriculum is developed by the MECP and teachers are trained and supervised by the MECP. They offer a 2-year certification course, sometimes open to government teachers as well, and a 6-month short course for those already certified. Their approach is based on a child-centered constructivist philosophy of active learning, where manipulation and exploration of materials and ideas are supported by high-quality teacher-child interaction (Mwaura & Marfo, 2011). These features of the program were to be observed as part of this research, and teachers’ experience and training were also assessed with an interview.

Graduating students from the Madrasa program feed into government primary schools, as do graduates from the other community and government pre-primaries. It was expected based on government documents and in-school observations (Plan International, 2014) that non-Madrasa pre-primaries would have a style of teaching-learning that would be teacher-led and where students would mostly respond to teacher requests. Tuition is free in these government schools though children are often expected to wear uniforms. Teachers were expected to have some secondary school education and at least one year in a certificate program, followed by regular supervision. At the time of the study, Kenya was working on a revised curriculum, Zanzibar was focused on expanding its offerings to rural regions and creating standards (World Bank, 2013), and Uganda relied mainly on local community groups (e.g. faith-based) to provide pre-primary education while the government sought to develop standards. Because practices often differ from policies, we aimed to evaluate what actually occurred in pre-primary settings and how the graduates of observed pre-primaries performed in first grade one year later.

Based on findings from past research, the hypotheses were:

Madrasa pre-primary programs would have a higher quality than non-Madrasa programs especially in the domains of free-choice play, pedagogical approach (e.g. program structure), and teacher-child interaction.

Graduates of the evaluated Madrasa pre-primaries, assessed at the end of primary 1, would have higher academic, cognitive and social outcomes than graduates of the evaluated non-Madrasa settings.

First grade literacy and math achievement would be associated with the quality of the pre-primaries attended by these students.

2. Method2.1 Setting and Participants

The three settings were Eastern Kenya (Mombasa and the coast), Central Uganda (north of Kampala), and Zanzibar, Tanzania, where the Madrasa Early Childhood Program supported Muslim communities from 1986, 1993, and 1990, respectively, In Uganda, non-Madrasas were non-profit community-managed pre-primaries. In Kenya and Zanzibar, non-Madrasas were government-managed, teachers were government-trained and supervised, and they used the government curriculum. The number of pre-primaries observed in October 2015 varied slightly by country from 78 in Kenya, 65 in Uganda, and 74 in Zanzibar. Approximately half were Madrasa pre-primaries, selected randomly as described below.

The Primary 1 graduates of the selected pre-primary schools came from a mix of disadvantaged urban and mainly rural communities. The education level of parents was generally low though most mothers had at least some primary school experience. Two-thirds of the families were Muslim and the rest were Christian. In October 2016 children were tested if they were between 6 and 9 years of age. Although this is a broad age range for the first grade of primary, it was representative of children who had graduated from the selected pre-primaries. Their caregivers (mothers) were interviewed after signing consent for their own and their child’s participation. Ethics approval was granted by McGill University, Canada, the Mbarara University of Science and Technology in Uganda, Pwani University in Kenya, and from the Ministry of Education in Zanzibar. The study was conducted independently of the funder and of the Aga Khan Foundation by university researchers with no connection to the education programs being evaluated.

2.2 Design

The design was a cluster nonrandomized two-group design (Madrasa; non-Madrasa) comparing the quality of Madrasa and non-Madrasa pre-primary settings (Phase 1) and a year later comparing randomly selected children, who had graduated from the observed pre-primaries, now at the end of their Primary 1 year (Phase 2). As mentioned previously, it was not feasible to get a baseline assessment of children, because they had started preschool at different ages, some at 3 years and some for pre-primary only, and because attrition was expected to be high.

Quality was assessed at the end of the school year, in the tenth month of 2015, and Primary 1 achievement in the tenth month of 2016 (see the flow graph in Figure 1). Pre-primary schools were selected as follows using a list of government primary schools with both Madrasa and non-Madrasa pre-primaries feeding into them. Of Madrasa pre-primaries feeding into government primary schools (80 in Kenya, 87 in Uganda, and 81 in Zanzibar), 36 were randomly selected from each of the three country sites; the non-Madrasa pre-primary that fed into the same government primary school used by these Madrasa graduates was then selected. So Madrasa pre-primaries were randomly selected and a yoked non-Madrasa was selected (in Uganda, comparison pre-primaries were community- or church-based because government-run ones were rare in this setting; in Kenya and Zanzibar comparison pre-primaries were government-run). One year later at the end of 2016, using the 2015 pre-primary lists of enrolled children, we randomly selected five primary school children who had graduated from the previously observed Madrasa and five from the yoked observed non-Madrasa. Thus, Madrasa pre-primaries were randomly selected, non-Madrasas were yoked, and their graduates were randomly selected. The cluster was the primary school. Reporting follows the TREND statement for Transparent Reporting of Evaluations with Nonrandomized Designs (Des Jarlais et al., 2004).

2.2 Randomization and Masking

Madrasa and non-Madrasa pre-primaries and students who graduated from these pre-primaries were not randomized; parents largely chose to which they sent their child. However, they attended the nearby government primary school for first grade. Selection bias due to non-randomization was minimized by including pre-primaries and children from similar and adjacent communities, feeding into the same primary school, and by statistically controlling for observed demographic differences, which had been few in the earlier studies (Malmberg et al., 2011).

It was not possible to mask data collectors to the pre-primary they were observing and rating for quality. Efforts were made to mask data collectors to the child’s pre-primary school when testing them in first grade. Local investigators gave data collectors lists of every fourth or fifth child plus replacements from the pre-primary rosters, without identifying whether it was Madrasa or non-Madrasa graduate. To minimize bias, we spoke to data collectors at length about the need to remain unbiased and shed any expectations about the pre-primary models. We avoided hiring data collectors who had prior experience and pre-conceptions about the education programs.

2.3 Phase 1. Pre-primary Quality Measure and Data Collection

A recently developed quality instrument, Measure of Early Learning Environments (MELE), was used to observe the pre-primary setting and to interview teachers (MELQO, 2016 www.ecdmeasure.org). To ensure content validity, conceptual domains were elicited from an international group of over 30 early education experts to cover constructs considered important in pre-primary settings. Domains often overlapped with those found in existing measures, such as Interactions and Language teaching/learning, so the following short headings were adopted: Physical environment, Interactions, Inclusiveness, Program structure, Language, Numeracy, Science, Games/art, and Free-choice Play activities. Fifty individual items were then formulated to fit these domains, selecting qualities from existing measures previously modified for use in LMIC, in particular, but not exclusively, play items from the ECERS-R (Harms et al., 1998), literacy, numeracy and science items from the ECERS-E (Sylva et al., 2003), teacher-child interaction items from the TIPPS (Seidman et al., 2014), and physical environment plus other items from the ECEQAS (Kaul et al., 2012) and TECERS (Isely, 2001). Program structure, similar to the ECERS-R with the same name, had three items but was kept as a separate domain; a group of items including songs, rhymes, art, and games when teacher-led and performed by children together was also kept as a domain separate from free-choice play activities. Items do not reflect any particular philosophy of early education but rather capture the definition of pre-primary education as the initial stage of organized instruction within a school-type environment, designed to meet the educational and developmental needs of young children (OECD, 2003; 2012). Likewise, domains were identified not as essential and distinct constructs, but rather to ensure inclusion of critical qualities.

Items were phrased in terms of observable and quantifiable events and resources that a pre-primary would be expected to provide for children of 4 to 6 years; they avoided phrases linked to child outcomes such as cognitive or social-emotional. Examples are: Scaffolding by teacher to help a child work through the steps to solve problems or errors (Interaction), Gender equality in class participation (Inclusiveness), Children are introduced to reading and/or writing letters (Language), Children’s new numeracy learning is connected to past learning and to its everyday application (Numeracy), and Children have access to different interest centers during indoor play, e.g., blocks, sand and water, books, art, games, dramatic, music (Free-Choice Play). Response options for each item are ordered from 1 to 4 to reflect low to high quality. Modifications of the measure underwent various iterations after sharing with stakeholders from around the world who evaluate or implement pre-primary programs, and again after trialing the measure in Tanzania where the initial 100 items were reduced to 50. Clarification of certain items again followed training before its use in this study. A manual is available with elaborations on the underlying features being measured and specific examples of each rating.

The full measure is freely accessible. A few examples of the 4-levels of quality indicate that quantity and frequency are included, as is the nature of the teaching method (e.g., group repetition vs individual work). Items from teacher-child interaction, program structure, language teaching, and numeracy teaching, respectively, are provided here.

Interaction. Behavioral indications of a positive environment between teacher and children.

1 Teacher rarely (<5 times) smiles, claps or verbally praises children’s efforts

2 Teacher smiles, claps or says “good” 5 or more times but it is automatic and routine, not heart-felt, and usually involves asking the whole class to clap

3 Teacher smiles, looks directly and has warm words of praise for 5 or more individual children’s efforts

4 Teacher’s smiles, looks directly and gives warm words of praise to 10 or more individual children

Program Structure. The daily routine, seen today, has a mix of activities including play (indoor, outdoor), arts & games (e.g. stories, songs, rhymes, art, games), and instructional (e.g. teacher-led child-directed language, numeracy).

1 Children receive teacher-led instruction most of the time (> 80%)

2 Teacher-led instruction occurs for approximately 2/3 of the time. The rest of the time is devoted to child-initiated play or arts & games but not both.

3 Teacher-led instruction occurs for approximately 2/3 of the time. The rest of the time is devoted to child-initiated play and arts & games activities (e.g. stories, songs, rhymes, games).

4 One-half or less of the time is spent in teacher-led instruction; the rest includes both child-initiated learning/play and arts& games activities

Language Teaching. Children are introduced to reading and/or writing letters.

1 Children's attention is not directed to written letters or written words in posters, books or blackboard

2 Children read and/or write letters or words by immediately repeating what the teacher says

3 Some children have the opportunity to read and/or write letters or words on their own (may copy from the board on to their paper), not simply repeating immediately what the teacher said

4 All children have the opportunity to read and/or write letters or words on their own, in at least one activity such as writing in an exercise book

Numeracy Teaching. Children use objects (e.g., blocks, sticks, bottle caps) for math concepts and patterns, and not simply for enumerating.

1 Objects are not used by individual children for this purpose

2 Objects used by less than half the children to reproduce what the teacher has shown

3 Objects used by more than half the children to reproduce what the teacher has shown

4 Objects used in an individual way by at least half the children to reflect math concepts without simply reproducing what the teacher has shown

On the MELE data form, there is space to enter information requested from the teacher and/or observed during the visit. This included: hours of operation and of observation; numbers of boys and girls enrolled and attending on the day of observation; number of teachers/aides; the weekly schedule of activities, and the duration of activities on the day of observation.

A short Teacher Interview using a structured questionnaire provided information on structural aspects of the pre-primary programs, specifically teachers’ education and training, mentoring and supervision, and parent engagement (MELQO, 2016). Again, most but not all responses were scored on a 1 to 4 scale while others required yes-no answers.

Training and data collection. Eight local, university graduates were recruited from each site based on their research experience, education, and communication skills. They were proficient in English and the local language.

Training on the MELE measure of quality was conducted by two researchers, one a developer of the measure and the second a local researcher who had obtained high reliability with the first. The items and the manual were in English given that all data collectors were proficient in this language. Over the course of a week, research assistants became familiar with the measure, with the manual explaining in detail what to observe for each item, and they practiced observing and rating pre-primary sessions using both videos and actual pre-primary settings. Inter-rater reliability with the measure developer was obtained at the end of each day and found to reach a minimum of 90% by the end of the third day. A fourth day was used to review additional videos and highlight some difficult items.

During data collection, local assistants visited one pre-primary a day, given that all operated in the mornings. They worked in pairs on the first day and individually thereafter. They were supervised by the local researcher, who was available particularly during the first week when logistic and other problems were solved. They were to observe a full morning session of activities, which normally lasted 5 hours. A subsample of 63 teachers (23 from Uganda, 16 from Kenya, 24 from Zanzibar), 35 from Madrasas and 28 from non-Madrasas, were interviewed after the session was finished for the day. The smaller sample was intended as the teacher measure had undergone less pretesting.

2.4 Phase 2. Primary School Measures and Data Collection

Caregivers (mothers) were interviewed to gather information on the child and the family, including the child’s birthdate, preventive health measures (e.g. immunizations, improved water, deworming), diet (recoded into seven food categories to calculate a dietary diversity score, Daelmans et al., 2009), past 2-week illnesses, along with family characteristics such as assets, parental education, religion, household size. Because the mother's education is often used as a covariate in analyses of children's achievement, we imputed missing scores for this variable based on the mean education level of other mothers in the cluster. Other covariates did not have missing data. Children’s height was taken twice and averaged to calculate a standardized height-for-age z score based on WHO growth references (World Health Organization, 2009).

Children were directly tested for math and literacy achievement, executive function, and social-emotional development. A fuller description now follows.

The Early Grades Math Assessment (EGMA) and Early Grades Reading Assessment (EGRA) have been translated into local languages and previously used with first, second, and third graders in East Africa (e.g., Piper, 2010; Piper and Mugenda, 2012). The tests were modified by our team of regional and international experts in early education and child development, to make them comparable across the three sites in East Africa, shorten them for first graders, and avoid floor effects with low variance. They were intended for research, not diagnosis. The Math test consists of 53 items in six subtests requiring children to read aloud numbers, identify the larger, complete a pattern, add, subtract, and solve word problems (alpha 0.70 Kenya, 0.76 Uganda, 0.78 Zanzibar). The Reading test, here called Literacy, consists of 155 items in eight subtests requiring children to read letters, syllables, familiar words, and novel words; read aloud a passage, comprehend words and a passage, and write words (alpha 0.84 Kenya, 0.81 Uganda, 0.89 Zanzibar). There were practice items and stopping rules but no time constraints within general limits. The Math score was the sum of correct answers across 53 items and the Literacy score the sum of correct answers across 155 items. Scores are presented out of 100 for comparability. High alpha coefficients and subtest-total correlations warranted combining subscores. Teachers' reports of the most recent Math and Language marks for a subsample of the students correlated significantly with their EGMA and EGRA scores ( r = 0.53 and 0.60 in Kenya; 0.37 and 0.40 in Uganda; 0.52 and 0.48 in Zanzibar, p < .001, respectively), thus providing validity of the two achievement tests in relation to school performance.

The two Executive Function tests were the Tower of London (Anderson, Anderson, & Lajoie, 1996) that requires planning and flexibility, and Digit Span that relies on working memory (Diamond, 2013). The Tower task requires children to move balls from one dowel to another to re-create the model in a minimum number of moves. There were 16 trials, each rated pass (1) or fail (0), summed for a total score. Digit span backwards (6 trials) and digit span sequences (6 trials) included listening to a sequence of 2, 3, and 4 numbers and repeating them backwards or re-arranging them in the proper forward sequence, respectively. Forward digit span sequences were given as practice before the test sequences. Scores were the number of trials performed correctly. The Digit Span and Social Problem Solving (to be described next) were the only measures newly used in East Africa and so translated into local languages before use. Research assistants during training discussed whether the instructions and scenarios were correctly communicated, and modifications were made when necessary.

Social problem solving consists of three vignettes describing a problem the child must solve (Shure & Spivack, 1982). They were modified to fit the context: how to reduce a mother’s anger after breaking her bowl, how to regain a teacher’s approval after performing poorly on a test, how to get access to a peer’s desirable toy. After each solution, the child was asked, “What else could you do?” Up to nine solutions, three for each problem, were recorded.

Training and data collection. Eight local university graduates were recruited for each site based on their research experience, education, and communication skills. They were proficient in English and the local language. Most had participated in the observation of pre-primaries.

Training on the Mother interview and direct individual child tests took one week of becoming familiar with the measures and practicing them on each other. Preliminary testing in Zanzibar, where we started, demonstrated high inter-rater reliability several days apart with high correlations and non-significant differences between means (e.g., Literacy r = .99, t = 0.95, p = 0.37; Math r = .90, t = 0.42, p = 0.69; Digit sequence r = .73, t = 0.60, p = 0.57; social problem solving r = .77, t = 0.36, p = 0.73). We used both correlation and difference tests to demonstrate the association between testers and the lack of a practice effect for children.

Data collectors were given names of primary schools and lists of children to test. During the previous year’s visit to pre-primary schools, names of all children in the pre-primary class were collected along with the corresponding catchment primaries they were likely to attend. From the 2015 list of pre-primary children, first grade students were systematically selected to be tested to make up the five needed for our sample (e.g., from a class of 40, every eighth child was listed for testing; from a class of 30, every sixth child was selected), along with replacements. Children were individually tested at school or at home after school. A quiet place was found for the testing; data collectors gave tests in the same order as described here to all children, and permitted rest periods if requested.

2.5 Sample Size Estimation

The sample size was based on child outcomes to be assessed when graduates of pre-primaries were in primary 1. A sample of 360 children per country was required, half from Madrasa pre-primaries and half from non-Madrasa pre-primaries. This was based on an expected difference of 0.40 standard deviations between group means, using a power of .80 and a significance level of .05 (based on previous research such as Aboud and Hossain, 2011; Brinkman et al., 2016; Malmberg et al., 2011; Nores and Barnett, 2010; Rao et al., 2017). The sample was then multiplied by 1.80 based on an intra-cluster correlation (ICC) of 0.20 because of clustering at the primary school level (based on previously obtained ICCs from Aboud et al., 2017). To obtain a sample of 180 children from each model of pre-primary in each country we selected 36 Madrasa and 36 non-Madrasa pre-primaries, taking 5 children from each a year later as they finished first grade. Because this was the first use of the full MELE in a LMIC, we did not intend to hypothesize or validate a factor structure, which would require a minimum of 250 pre-primaries (though available on request). Rather the analyses focused on a comparison of the two pre-primary models, and the construct and predictive validity of their setting quality.

3. Results3.1 Comparison of Madrasa and non-Madrasa Pre-Primary Quality

In order to compare the quality of the Madrasa and non-Madrasa pre-primary programs, analyses of covariance were conducted, using a 2 (Madrasa, non-Madrasa) X 3 (countries) design, covarying attendance numbers on the day of observation (which were significantly different by program and country). Table 1 presents means and standard deviations for attendance, overall quality and its domains for Madrasa and non-Madrasa pre-primaries. The overall quality of Madrasas was significantly higher than non-Madrasas (means were Madrasa M=2.66 and non-Madrasa M=2.46; effect size d=0.70). Despite this, ranges of quality ratings overlapped considerably especially at the low end, where Madrasa overall quality ranged from 1.88 to 3.40 and non-Madrasa quality from 1.92 to 3.00. As expected, Madrasas had significantly higher quality on items related to free-choice play activities (d=1.12) and language (d=0.60), and to a lesser extent on group activities (art, games) (d=0.40) and the program structure (d=0.25). There were no significant differences between Madrasas and non-Madrasas on the physical environment, teacher-child interaction, or math teaching.

There were significant country differences on the overall quality and on most domains, indicating that Kenyan pre-primaries scored significantly higher than Ugandan or Zanzibari ones except on the physical environment. Only Play showed a significant country x program interaction.

Regarding other data collected about the programs in the three sites, all had 5- or 6- hour daily morning sessions, with time reserved for language, math, and indoor or outdoor play generally 4 days a week. Nature/science teaching was typically given on only two days/week. Boys’ and girls’ attendance differed significantly but minimally across programs (Program x Gender, F(1,211)=4.39, p=.04), where girls slightly predominated in Madrasas and boys in non-Madrasas: Madrasa girls M=9.58, boys M=9.19; non-Madrasa girls M=12.28, boys M=13.51.

Insert Table 1 about here

Analyses of teachers’ responses to interview items revealed only a few differences: teachers in government pre-primaries were better educated (p = 0.02) whereas parents of Madrasas were more engaged in the program and its governance (p = 0.0002).

3.2 Comparison of Madrasa and non-Madrasa Graduates on Primary 1 Outcomes

Analyses of variance, using SAS 9.3 PROC MIXED to adjust for clusters, were first conducted on child and family variables derived from the mother’s interview to determine if Madrasa and non-Madrasa groups differed on child age or sex, and family variables. Group differences were then included as covariates in the analyses of child outcomes. Preliminary analyses of children showed large differences across the three countries, in literacy and math performance and other variables such as age (see Table 2). Concerning age, Kenyan children were 8.3 years of age on average with a range of 78 to 117 months, whereas Zanzibari children were 7.4 years on average (range 70 to 108). It was therefore decided to analyze children’s data separately by country.

Insert Table 2 about here

Table 3 provides information on children's first grade performance. Sex did not interact significantly with the type of pre-primary attended, so we combined boys and girls for these analyses (sex disaggregated data tables are available on request). Because of differences in children's ages and mother's education in some cases, these variables were covaried in analyses of achievement.

Child outcomes from primary 1 were analyzed to examine group and sex differences, using SAS 9.3 PROC MIXED analysis of covariance to adjust for covariates and clusters (ICCs are provided in the tables). Table 3 shows that on the Literacy and Math assessments, Madrasa and non-Madrasa graduates did not perform differently. Kenyan children attained higher levels of literacy and math than did Ugandan and Zanzibari children, though our intent was not statistically to test country differences. Although the age for entering primary school is similarly 6 years in each country, Kenyan children were on average older. However, their scores did not correlate with age; Kenyan children who were 8 or 9 years of age did not perform better than those who were 6 or 7 years. On the two tests of executive function, Tower Planning and Digit Span, again there were no group differences. There were also no group differences on the social problem test.

Insert Table 3 about here

3.3 Association of Primary Performance with Pre-primary Quality

Finally, children’s primary 1 literacy and math performance outcomes were correlated with ratings of overall quality and of four relevant quality ratings assigned to their former pre-primary while they were enrolled; namely interaction, language, numbers, and play quality. These domains are conceptually most meaningfully related to literacy and math performance, and have been empirically associated in past research (Aboud & Hossain, 2011; Aboud et al., 2016; Sylva et al., 2006). Again, because performance differed by county but not by Madrasa and non-Madrasa program, separate country analyses were conducted combining children who had graduated from both programs.

The multilevel linear regression program from MPlus 7 (Muthén and Muthén, 2015) was used. Adjustments were made for clusters, namely pre-primaries, and for covariates such as child’s age and mother’s education which were found to differ across pre-primaries. Adjusted standardized beta coefficient estimates, yielded by the analysis, served as effect sizes for the association between quality and child outcomes (Nieminen et al., 2013). Table 4 presents standardized coefficients and their p values by country for literacy and math. One measure of executive function, namely digit span, was strongly associated with overall quality, interaction and language (standardized coefficients ranged from 0.42 to 0.51, p < .01) in Uganda but not in Kenya or Zanzibar.

Insert Table 4 and 5 about here

The pattern for Kenyan data showed that students' Literacy scores were associated with the overall quality of their pre-primary, and Math scores were positively associated with the quality of math teaching. This would mean that if the overall quality rose one standard deviation from 2.68 to 2.98 on a 1 to 4 scale, students' literacy scores would be 10.4% higher; and if math quality rose from 2.43 to 2.98, students' math scores would be 6.44% higher. Ugandan students’ literacy scores were significantly related to Interaction, whereas math scores were associated with the overall quality. Consequently, literacy scores would rise by 9.36% and math by 7.7% if the corresponding quality was improved by one standard deviation. Zanzibari students’ scores were not associated with any quality ratings.

Two features of the Zanzibar data are worth noting: one is that their literacy and math scores were low, on average 50% or below for literacy and math. Another is that the internal consistency of their quality ratings was not as high as those for Uganda and Kenya. Across all countries, Madrasas showed more consistency than non-Madrasas (see Table 5), possibly reflecting a more cohesive program. Low consistency in a domain such as teacher-child interaction might be expected if teachers offered praise but no scaffolding. However, low consistency in literacy and math might reflect a fragmented program in these two areas of teaching/learning.

4. Discussion

The findings were that Madrasa pre-primary schools had higher quality than non-Madrasas on their overall rating and on the qualities of free-choice play activities, language teaching, games/art, and program structure. However, there were no differences between the two groups of graduates in their first grade literacy, math, executive function and social development. Finally, children’s first grade performance in literacy and math were associated with overall quality or with the quality of the corresponding domain in Kenya and Uganda, but not Zanzibar. We now interpret these findings in light of past research and implications for future improvements in pre-primary programs.

4.1 Quality of Pre-primary Schools

Overall, as expected, Madrasa pre-primary schools in the three countries had significantly higher quality scores than the non-Madrasa pre-primaries. Effect sizes were small to large, ranging from 0.25 in the domain of program structure to 1.12 in free-choice play. The program structure items included having a curriculum, using a mix of child- and teacher-led activities, and using a mix of small group and individual work in addition to whole group teaching. Madrasas were expected to have a better program structure and more free play compared to non-Madrasas, which were expected to have more whole-group teaching and less free play. Madrasas and non-Madrasas did not differ in adult-child interaction, inclusiveness, physical environment, numeracy or science teaching. Average ratings of Madrasas were just above the midpoint of the scale between 2.58 and 2.78, whereas non-Madrasa overall averages ranged from 2.35 to 2.55; ranges showed that quality between the two programs largely overlapped. In other respects, the programs were similar, especially regarding teacher experience and supervision, activities included in the daily routine, and numbers of children attending.

These results are comparable to others in the published literature in LMIC, which in most cases assessed quality with modifications of existing scales. For example, an earlier study on a smaller number of selected Madrasas and non-Madrasas in the same East African regions found that in the pre-primary year the former had higher quality with average scores for the 11 domains in the ECERS-R and –E of 4.11 and 2.93, respectively, on a 1 to 7 scale (Malmberg et al., 2011). At that time, the Madrasa pre-primaries averaged slightly above the midpoint as they did in our study; however, non-Madrasas performed better now compared to the earlier study. Non-Madrasas in Kenya and Zanzibar have benefited from collaboration with the MECP program in terms of teacher training specifically.

Despite its recent development, there are some unique strengths to the MELE in addition to its open access. Essentially the measure is a composite of items relevant to pre-primaries in LMIC, drawing from different measures with strengths in play, pedagogy, interaction, and physical environment. Each item describes four levels of quality with a short phrase of observable behaviors, thus providing clear guidance on how to improve. Items cover domains considered important, as judged by early education experts and as included in other measures, thus demonstrating content validity. Construct validity, in terms of known differences, was demonstrated here with significant differences between Madrasa and non-Madrasa pre-primaries on overall quality and several subscales. Certain clusters of items, such as those related to the physical environment, have been found to distinguish Indonesian preschools given assistance to strengthen this component (Proulx & Aboud, 2019). High correlations between the MELE and the ECERS-E in three China sites and also in Hong Kong support its convergent validity (Rao, personal communication). Other uses of the MELE, for example in Colombia, found associations with pre-primary math and literacy outcomes (Maldonado-Carreño et al., 2018). However, because many activities and materials were of low quality, they were rated simply as absent vs present. Such modifications might be necessary in order to conduct a statistical analysis, yet they do not provide guidance on improvement.

4.2 Early Primary Child Outcomes

Literacy and math skills at the end of first grade, along with executive function and social problem solving, showed no differences between graduates of Madrasa and non-Madrasa pre-primaries. The academic tests are ones commonly used by governments in the region to evaluate progress in early grades reading and math. With averages of 80%, Kenyan students achieved more literacy and math skills than those from Uganda and Zanzibar, whose scores averaged between 43% and 60%. However, in all three regions, subscale scores ranged from zero to 100%. Thus, teachers were trying to teach simultaneously children who do not know any letters or numbers and children who can read and comprehend passages. This is surprising in that all children in our samples had attended pre-primary programs where they received some language and numeracy instruction.

The lack of a significant difference between Madrasa and non-Madrasa students adds to a small but growing body of research examining whether pre-primary children maintain their advantage once they enter primary school. Most researchers find a difference at the end of pre-primary (e.g., Aboud, 2006; Brinkman et al., 2016; Martinez et al., 2012) but do not follow the children into primary school. For example, the earlier study comparing Madrasa and non-Madrasa children in the same East African sites found significant benefits for the former after the first two preschool years but less at the end of pre-primary (Mwaura et al., 2008; Malmberg et al., 2011). This suggests that the Madrasa curriculum may be more appropriate for 3- and 4-year-olds but lacks attention to early literacy and numeracy skills for 5- and 6-year-olds. Only a few LMIC studies have followed children into primary school. Aboud and Hossain (2011) found superior literacy and math performance in first and second graders who had participated in the pre-primary program only after the program raised its quality (see also Aboud et al., 2016; Montie, Xiang, & Schweinhart, 2006). Thus, for benefits to continue beyond pre-primary school, its quality must be more than adequate. This may explain the lack of significant differences in outcomes in first grade despite higher quality of Madrasa pre-primaries overall, and an association between quality and student outcome. Madrasa pre-primary quality was not sufficiently higher to impact first grade performance. This finding is important and draws attention to the need for researchers to follow children beyond pre-primary when examining relations between quality and academic performance.

4.3 Associations of Literacy and Math Performance with Pre-primary Quality

The importance of pre-primary quality in predicting early grades literacy and math received some support from the multilevel linear regression analyses. The overall quality regression coefficients for Uganda (0.35) and Kenya (0.50) were moderate, and comparable to Sylva et al.’s (2006) and Aboud et al.’s (2016) associations with the ECERS-E, and better than other findings using the ECERS-R (e.g. Abreu-Lima et al., 2013; Burchinal et al., 2008; Early et al., 2018; Gordon et al., 2013) or the TIPPS in Ghana (Wolf et al., 2018). This is the first such support for the MELE and promising evidence for an association between overall quality in LMIC and first grade achievement. The 50-item quality measure showed good internal consistency in all countries and so could be used as a measure of program evaluation. The associations found here and elsewhere emphasize the need to raise the quality of pre-primary education in order to have a more lasting effect on early grade literacy and math.

Beyond overall quality, associations with a few specific domains were interpretable. Kenyan math scores were predicted by math quality (std coeff 0.56), and Ugandan literacy and math were related to teacher-child interaction (std coeff 0.39 and 0.45). The former is to be expected; the latter might be explained in terms of the value of teacher-child open-ended and one-on-one conversations promoting language skills.

It is debatable whether one should expect discriminable and internally consistent subscales representing numeracy, literacy, interaction, and play quality. Regardless of the factor structure of the ECERS-R and -3, they continue to yield important information about preschool quality in high-income countries (e.g., Burchinal, 2018; Early, Sideris, Neitzel, LaForett, & Nehler, 2018). Analysis of the TIPPS in Accra, Ghana yielded a 3-factor structure of 14 items, but a set of 13 curricular content items showed stronger correlations with preschool literacy and numeracy outcomes than any factor (McCoy & Wolf, 2018). In these East African data, play and numeracy subscales showed good internal consistency; literacy and interaction items yielded only modest alphas, often because programs that included letter and word activities did not necessarily include story reading or discussion of new vocabulary. Likewise, teachers who praised good answers did not necessarily ask open-ended questions or scaffold new learning. Removing inconsistent items would raise the alpha level but leave an incomplete measure of literacy teaching, for example. Further use of the MELE in LMIC will help to clarify its value for program evaluation, as it did in Indonesia where attempts to improve physical infrastructure were reflected in higher physical environment scores but otherwise low scores pointed to missed opportunities to improve the pedagogical program (Proulx & Aboud, 2019).

It is not clear why significant associations were not found within the Zanzibar data. Their non-Madrasa program is somewhat new and there may have been less coherence in the program, as evidenced by lower alpha coefficients. Zanzibari children overall received lower math and literacy scores. Scores were similarly low among Madrasa and non-Madrasa graduates. Zanzibari families, however, had good diets, high maternal education and many family assets.

4.2 Strengths and Limitations

The research had a number of strengths, including the follow up of children into primary school to examine associations with pre-primary quality. Other design strengths were large samples of pre-primaries in each country, with Madrasa pre-primaries being randomly selected and matched with a non-Madrasa yoked to the same primary. The sample of children was randomly selected from within their respective pre-primary. The measures were appropriate, in particular the measures of literacy and math were previously used in the three countries and validated for our sample. The measure of quality was specifically developed for LMIC, yet tapped into domains that are considered important by experts in high-, middle-, and low-income countries. Built in is a strategy to raise the quality of one’s program based on the findings. Our analyses adjusted for covariates as well as clusters.

The design and procedures also had some noteworthy limitations, in particular selection bias due to non-random assignment, addressed here through matching pre-primaries and statistically controlling child differences. Pre-primaries were not randomly assigned to group. The children, likewise, were not randomly assigned to pre-primaries. The post-only design meant that children were not tested before their pre-primary program or at the end of pre-primary to see if they were comparable at baseline. Thus the lack of difference could be due to selection bias that put the Madrasa children behind non-Madrasa children at preschool or pre-primary entry, implying that they gained as a result of their pre-primary experience, though this is unlikely (Malmberg et al., 2011). We statistically controlled in analyses the few sociodemographic variables on which groups differed. However, groups may have differed on other unobserved variables that were not controlled Masking of data collectors was not possible when observing pre-primaries. Children covered a larger age range than expected, but we did not want to set an unrepresentative age limit.

Despite these limitations, the strengths give confidence to our conclusions that Madrasas were better in quality than non-Madrasas but that graduates from these two pre-primary programs performed similarly on measures of academic, cognitive and social development in primary school. There was some evidence that the quality of their pre-primary was significantly associated with children’s academic achievement at the end of first grade. This evidence supports the conclusion that program improvement and policy changes should be based on evidence of pre-primary quality.

References

Aboud, F. E. (2006). Evaluation of an early childhood preschool program in rural Bangladesh. Early Childhood Research Quarterly, 21, 46–60. doi:10.1016/j.ecresq.2006.01.008

Aboud, F. E., & Hossain, K. (2011). The impact of pre-primary school on primary school achievement in Bangladesh. Early Childhood Research Quarterly, 26, 237-246.

Aboud, F. E., Proulx, K., & Asrilla, Z. (2016). An impact evaluation of Plan Indonesia's early childhood program. Canadian Journal of Public Health, 107(4-5), e366–e372, doi: 10.17269/CJPH.107.5557

Abreu-Lima, I. M., Leal, T. B., Cadima, J., & Gamelas, A. M. (2013). Predicting child outcomes from preschool quality in Portugal. European Journal of Psychology of Education, 28, 399–420. doi.org/10.1007/s10212-012-0120-y

Anderson, P., Anderson, V., & Lajoie, G. (1996). The Tower of London test: Validation and standardization for pediatric populations. The Clinical Neuropsychologist, 10, 1, 54-65, doi: 10.1080/13854049608406663

Araujo, M.C., & Schady, N. (2015). Day care services: It’s all about quality. In S. Berlinksi & N. Schady (Eds.). The early years. Child well-being and the role of public policy. Washington, D.C.: Inter-American Development Bank.

Brinkman, S.A., Hasan A., Jung, H., Kinnell, A., Nakajima, N., & Pradhan, M. (2016). The role of preschool quality in promoting child development: Evidence from rural Indonesia. World Bank Group, Policy Research working Paper 7529, Washington DC.

Brown, T. W., van Urk, F. C., Waller , R., & Mayo-Wilson, E. (2014). Centre-based day care for children younger than five years of age in low- and middle-income countries. Cochrane Database of Systematic Reviews Issue 9. Art. No.: CD010543. doi: 10.1002/14651858.cd010543.pub2.

Burchinal, M. (2018). Measuring early care and education quality. Child Development Perspective, 12, 3-9. doi: 10.1111/cdep.12260

Burchinal, M., Howes, C., Pianta, R., Bryant, D., Early, D., Clifford, R., & Barbarin, O. (2008). Predicting child outcomes at the end of kindergarten from the quality of pre-kindergarten teacher–child interactions and instruction, Applied Development Science, 12, 140-153, doi: 10.1080/10888690802199418

Daelmans, B., Dewey, K., & Arimond, M. (2009). New and updated indicators for assessing infant and young child feeding. Food and Nutrition Bulletin, 30(2), S256-62.

Des Jarlais, D.C., Lyles, C., Crepaz, N. and the TREND Group (2004). Improving the reporting quality of nonrandomized evaluations of behavioral and public health interventions: The TREND Statement. American Journal of Public Health, 94, 361-366.

Diamond, A. (2013). Executive functions. Annual Review of Psychology, 64, 135–168. doi:10.1146/annurev-psych-113011-143750.

Earle, A., Milovantseva, N., & Heymann, J. (2018). Is free pre-primary education associated with increased primary school completion? A global study. International Journal of Child Care & Education Policy, 12, 13-33.

Early, D. M., Sideris, J., Neitzel, J., LaForett, D. R., & Nehler, C. G. (2018). Factor structure and validity of the Early Childhood Environment Rating Scale – Third Edition (ECERS-3). Early Childhood Research Quarterly, 44, 242-256.

East African Community Secretariat (2014). Draft harmonised curriculum structures and framework for pre primary education. Arusha, Tanzania.

Global Partnership for Education (2016). GPE’s work for early childhood care and education. Policy Brief . Washington DC. https://www.globalpartnership.org/content/policy-brief-gpes-work-early-childhood-care-and-education

Gordon, R. A., Fujimoto, K., Kaestner, R., Korenman, S., & Abner, K. (2004). An assessment of the validity of the ECERS–R with implications for measures of child care quality and relations to child development. Developmental Psychology, 49, 146-160.

Harms, T., Clifford, R. M., & Cryer, D. (1998). Early childhood environment rating scale, revised edition. New York: Teachers’ College, Colombia University.

Isely, E. (2001). Tamil Nadu early childhood rating scale. Chennai, India: M.S. Swaminathan Research Foundation.

Kaul, V. (2012). Early Childhood Education Quality Assessment scale (ECEQAS). Centre for Early Childhood Education and Development (CECED), Ambedkar University, Delhi.

Maldonado-Carreño, C., Rey, C., Cuartas, J., Plata-Caviedes, T., Rodriguez, J. J., Escallon, E., . . . Dirección de Primera Infancia-MEN. (2018). Quality measurement of the institutional modality of early childhood education in Colombia. [Education Policy Notes, 1]. Retrieved from https://educacion.uniandes.edu.co/images/Notaspoliticaeducacion/Notes-In-Education.pdf

Malmberg, L.-E., Mwaura, P., & Sylva, K. (2011). Effects of a preschool intervention on cognitive development among East-African preschool children: A flexibly time-coded growth model. Early Child Research Quarterly, 26(1), 124-33.

Martinez, S., Naudeau, S., & Pereira, V. (2012). The Promise of Preschool in Africa: A Randomized Impact Evaluation of Early Childhood Development in Rural Mozambique. Available at: http://siteresources.worldbank.org/INTAFRICA/Resources/ The_Promise_of_Preschool_in_Africa_ECD_REPORT.pdf (Accessed February 6, 2012).

MELQO (2016). Overview of the Measuring Early Learning Quality and Outcomes (MELQO) Initiative. UNESCO: Washington DC.

Montie, J. E., Xiang, Z., & Schweinhart, L. J. (2006). Preschool experience in 10 countries: Cognitive and language performance at age 7. Early Childhood Research Quarterly, 21, 313–331.

Moore, A. C., Akhter, S., & Aboud, F. E. (2008). Evaluating an improved quality preschool program in rural Bangladesh. International Journal of Educational Development, 28, 118–131. doi:10.1016/j.ijedudev.2007.05.003..

Muthén, L.K., & Muthén, B.O. (2015). Mplus User’s Guide. Seventh Edition. Los Angeles, CA: Muthén & Muthén.

Mwaura, P. A. M., & Marfo, K. (2011). Bridging culture, research, and practice in early childhood development: The Madrasa Resource Centers in East Africa. Child Development Perspectives, 5, 134-139.

Mwaura, P., Sylva, K., & Malmberg, L.-E. (2008). Evaluating the Madrasa pre-school programme in East Africa: A quasi experimental study. International Journal of Early Years Education, 16, 237–255.

Nieminen, P., Lehtiniemi, H., Vähäkangas, K., Huusko, A., & Rautio, A. (2013). Standardised regression coefficient as an effect size index in summarising findings in epidemiological studies. Epidemiology Biostatistics and Public Health, 10, e8854-1.

Nores, M., & Barnett, W.S. (2010). Benefits of early childhood interventions across the world: (Under) investing in the very young. Economics of Education Review, 29, 271-282.

Opel, A., Ameer, S. S., & Aboud, F. E. (2009). The effect of preschool dialogic reading on vocabulary among rural Bangladeshi children. International Journal of Educational Research, 48, 12-20.

Opel A., Zaman, S. S, Khanom F, & Aboud F. E. (2012). Evaluation of a mathematics program for pre-primary children in rural Bangladesh. International Journal of Educational Development, 32, 104-110. doi:10.1016/j.ijedudev.2011.01.013.

Organization for Economic Co-operation and Development (2003). Education at a glance. Paris. Retrieved 11 September 2017 https://stats.oecd.org/glossary/detail.asp?ID=5409.

Organization for Economic Co-operation and Development (2012). Starting Strong III. A Quality Toolbox for Early Childhood Education and Care, Starting Strong Paris: OECD Publishing.

Pianta, R. C., LaParo, K. M., & Harms, B. (2009). Classroom Assessment Scoring System manual pre-K. Baltimore, MD: Brookes.

Piper, B. (2010). Uganda Early Grade Reading Assessment Findings Report: Literacy Acquisition and Mother Tongue. Research Triangle Park, North Carolina, USA: RTI International.

Piper, B., & Mugenda, A. (2012). The Primary Math and Reading Initiative: Baseline report. Prepared for USAID/Kenya under the Education Data for Decision Making (EdData II) project, Task Order No. AID-623-M-11-00001. Research Triangle Park, North Carolina, USA: RTI International. http://pdf.usaid.gov/pdf_docs/pa00hx75.pdf (Accessed April 1, 2016)

Plan International (2014). Report on visit to observe and discuss ECD Centers, Preschools, and Parenting Groups in Kenya. Nairobi, Kenya.

Proulx, K., & Aboud, F. E. (2019). Disaster risk reduction in early childhood education: Effects on preschool quality and child outcomes. International Journal of Educational Development, 66, 1-7.

Rao, N., Sun, J., Chen, E. E., & Ip, P. (2017). Effectiveness of early childhood interventions in promoting cognitive development in developing countries: A systematic review and meta-analysis. Hong Kong Journal of Paediatrics, 22, 14-25.

Rao, N., Sun, J. Pearson, E., Liu, H., Constas, M. A., & Engle, P. L. (2012). Is something better than nothing? An evaluation of early childhood programs in Cambodia. Child Development, 83, 864-876.

Rao, N., Sun, J., Zhou, J., & Zhang, L. (2012). Early achievement in rural China: The role of preschool experience. Early Childhood Research Quarterly, 27, 66-76.

Seidman, E., Raza, M., Kim, S., & McCoy, J. M. (2014). Teacher Instructional Practices and Processes System (ECD) – TIPPS: Manual and scoring system. New York University

Shure, M.B., & Spivack, G. (1982). Interpersonal problem-solving in young children: a cognitive approach to prevention. American Journal of Community Psychology, 10, 341-356.

Sylva, K., Siraj-Blatchford, I., & Taggart, B. (2003). Assessing quality in the early years: Early Childhood Environment Rating Scales – Extension (ECERS-E): Four curricular subscales. London: Trentham.

Sylva, K., Siraj-Blatchford, I., Taggart, B., Sammons, P., Melhuish, E., Elliot, K., & Totsika, V. (2006). Capturing quality in early childhood through environmental rating scales. Early Childhood Research Quarterly, 21, 76–92. doi:10.1016/j.ecresq.2006.01.003

United Nations Children’s Fund (2017). State of the World’s Children. New York: UNICEF.

United Nations Development Program (2016). Human Development Report 2016. New York: United Nations. http://hdr.undp.org/en/composite/HDI (Accessed April 1, 2017)

UNESCO (2015). Sustainable development goal 4 and its targets. https://en.unesco.org/education2030-sdg4/targets. (Accessed February 18, 2019).

UNESCO Institute for Statistics (2017). More than one-half of children are not learning worldwide. http://uis.unesco.org/sites/default/files/documents/fs46-more-than-half-children-not-learning-en-2017.pdf. (Accessed February 18, 2019).

Wolf, S., Raza, M., Kim, S., Aber, J. L., Behrman, J., & Seidman, E. (2018). Measuring and predicting process quality in Ghanaian pre-primary classrooms using the Teacher Instructional Practices and Processes System (TIPPS). Early Childhood Research Quarterly, 45, 1-13. doi.org/10.1016/j.ecresq.2018.05.003

World Bank (2013). Zanzibar SABER Country Report. World Bank Group. 2013. Zanzibar Early Childhood Development: SABER Country Report 2013. Systems Approach for Better Education Results. World Bank, Washington, DC. https://openknowledge.worldbank.org/handle/10986/24485

World Health Organization (2009). WHO AnthroPlus for personal computers Manual: Software for assessing growth of the world's children and adolescents. Geneva: WHO, 2009.

Yoshikawa, H., Leyva, D., Snow, C.E., Treviño, E., Barata, M.C., Weiland, C., … Arbour, M.C. (2015). Experimental impacts of a teacher professional development program in Chile on preschool classroom quality and child outcomes. Developmental Psychology 51, 309-322. doi.org/10.1037/a0038785

Figure 1. TREND Flow diagram for Zanzibar; the flow is similar in Kenya and Uganda

Eligible Gov’t Primary Schools (n=81)

with Madrasa & non-Madrasa Pre-primaries feeding in to it

Select yoked non-Madrasas feeding into same primary n=36

Recruitment of Pre-Primaries

Randomly select Madrasas n=36

Phase 2 Assessment October 2016

Phase 1 Assessment October 2015

Observation and rating of quality

using the 50-item MELE

Early Grades Reading Assessment

Early Grades Math Assessment

Executive Function & Social tests

Family demographic information

Analysis

Completed (n=712; 91.5%)

Lost to follow-up

District (n=0)

Children (n=66; 8.5%): Absent (37), Death (0), Migration (14), Refusal (4), Missed (9), Not found (2)

Follow-up of Pre-primary Graduates 1 year later at the end of Grade 1

Randomly select graduates of observed non-Madrasas (5 from each of 36 primaries)

Randomly select graduates of observed Madrasas (5 from each of 36 primaries)

Quality data on pre-primaries (n =38 )

Grade 1 achievement of Madrasa graduates whose pre-primaries had quality data (n=185)

Quality data on pre-primaries (n =36 )

Grade 1 achievement of non-Madrasa graduates whose pre-primaries had quality data (n=185)

Table 1. Comparison of MELE quality by country and program model in three East African countries

MELE conceptual domains (number of items; alpha)

Prog

Model

Means (SD)

Statistical significance F (p)

Kenya

Uganda

Zanzibar

Country

Model

Country x

Model

Sample size (n)

M

nonM

41

37

38

27

38

36

Overall Average (50; .80)

Physical environment (10; .51)

M

nonM

M

nonM

2.78 (0.32)

2.56 (0.28)

2.75 (0.34)

2.77 (0.52)

2.59 (0.31)

2.35 (0.26)

3.00 (0.33)

3.04 (0.34)

2.58 (0.26)

2.45 (0.19)

3.17 (0.32)

3.06 (0.31)

10.57

(<.0001)

13.80

(<.0001)

28.04

(<.0001)

1.17

(.28)

1.11

(.33)

0.69

(.50)

Interaction (8; .49)

M

nonM

3.07 (0.39)

2.99 (0.45)

2.83 (0.41)

2.62 (0.48)

2.87 (0.35)

2.82 (0.33)

10.51

(<.0001)

3.52

(.06)

0.67

(.51)

Inclusiveness (6; .42)

M

nonM

2.74 (0.41)

2.77 (0.38)

2.40 (0.42)

2.40 (0.35)

2.26 (0.22)

2.28 (0.29)

38.05

(<.0001)

0.03

(.97)

0.01

(.96)

Program structure

(3; .37)

M

nonM

2.78 (0.75)

2.36 (0.73)

1.96 (0.66)

1.69 (0.55)

2.17 (0.63)

2.19 (0.52)

22.96

(<.0001)

4.35

(.03)

1.68

(.19)

Language and Literacy (5; .53)

M

nonM

2.80 (0.57)

2.29 (0.46)

2.52 (0.52)

2.48 (0.55)

2.57 (0.65)

2.21 (0.50)

1.25

(.29)

14.78

(.0002)

3.05

(.05)

Numbers and Numeracy (6; .68)

M

nonM

2.43 (0.58)

2.43 (0.52)

2.37 (0.66)

2.21 (0.55)

2.23 (0.73)

2.02 (0.71)

3.33

(.04)

1.09

(.30)

1.07

(.34)

Nature and Science

(2; .63)

M

nonM

2.51 (0.90)

2.45 (0.81)

2.16 (0.65)

1.94 (0.58)

2.30 (0.65)

2.37 (0.67)

5.97

(.003)

0.38

(.54)

0.66

(.52)

Group Activities: games, songs, art (4; .37)

M

nonM

2.95 (0.78)

2.53 (0.65)

2.71 (0.69)

2.41 (0.67)

2.55 (0.61)

2.46 (0.63)

2.88

(.06)

9.08

(.002)

1.38

(.25)

Free-choice Indoor Play (6; .82)

M

nonM

2.82 (0.84)

1.91 (0.82)

2.47 (0.89)

1.25 (0.34)

2.16 (0.56)

1.89 (0.54)

9.62

(.0001)

69.08

(<.0001)

8.71

(.0002)

Attendance

M

nonM

12.66 (8.4)

27.38 (11.7)

17.58 (11.0)

19.55 (15.4)

26.84 (10.5)

28.52 (10.6)

13.75

(<.0001)

21.50

(<.0001)

9.10

(.0002)

Note. Ranges of overall quality showed large overlaps: Kenya Madrasas 2.20 to 3.40, non-Madrasas 2.10 to 3.00; Uganda Madrasas 1.88 to 3.02, non-Madrasas 1.92 to 2.96; Zanzibar Madrasas 2.14 to 3.26, non-Madrasas 2.12 to 2.84.

Table 2. Country means (SD) and significance levels of family variables and primary school performance

Variable

(theoretic max score)

Kenya

Uganda

Zanzibar

ANOVA effects

Country Prog.Model

Number of children

Madrasa; nonMadrasa

n = 108; 173

n = 173; 150

n = 185; 185

F ( p )

F ( p )

Child’s age (6 – 9 yrs)

99.35 (9.44)

88.94 (9.51)

92.34 (7.29)

100.75 (<.001)

0.39 (.53)

Diet diversity (7)

2.95 (0.78)

2.90 (0.97)

3.67 (0.80)

79.94 (<.001)

1.44 (.23)

Preventive health (14)

11.08 (1.39)

9.97 (2.19)

11.32 (1.44)

8.82 (<.001)

2.73 (.10)

Family assets (12)

5.58 (2.20)

7.12 (2.65)

9.01 (2.55)

123.91 (<.001)

0.41 (.71)

Mother’s education

6.22 (3.31)

8.18 (3.62)

8.64 (3.79)

24.24 (<.001)

0.01 ( .92)

Father’s education

8.21 (3.34)

9.64 (4.55)

9.09 (4.16)

4.68 (.01)

0.11 (.74)

Primary student performance

Literacy (100)

81.56 (20.73)

46.07 (24.60)

49.31 (30.70)

115.14 (<.001)

2.44 (.12)

Math (100)

79.27 (11.44)

60.00 (22.02)

43.53 (25.14)

161.80 (<.001)

0.53 (.47)

Notes. Prog.Model= Madrasa; non-Madrasa pre-primary schools

Primary school performance of children was analyzed with covariates (age, sex, assets, mother’s education) and clusters.

Table 3. Mean (SD) performance scores of Primary 1 students graduating from Madrasa and non-Madrasa pre-primaries, adjusting for clusters and covariates (child’s age, mother’s education)

Measure (max score)

Program Model

Effect

Kenya primary analyses

Madrasa

(n = 108)

Non-Madrasa

(n = 173)

Group

Group x Sex

ICC

M (SD)

M (SD)

F (p)

F (p)

Literacy score (100)

0.19

82.07 (20.32)

81.24 (21.03)

0.00 (.99)

0.15 (.70)

Math score (100)

0.12

78.42 (13.03)

79.80 (10.33)

1.28 (.26)

3.13 (.08)

Tower planning (16)

0.006

7.93 (3.97)

8.51 (3.83)

0.09 (.76)

0.27 (.60)

Digit Span memory (12)

0.04

7.72 (2.98)

7.66 (2.73)

0.53 (.47)

0.35 (.55)

Social Problem solutions(9)

0.09

4.93 (1.96)

4.92 (1.91)

0.45 (.50)

2.07 (.15)

Uganda primary analyses

Madrasa

(n = 173)

Non-Madrasa

(n = 150)

Group

Group x Sex

ICC

M (SD)

M (SD)

F (p)

F (p)

Literacy score (100)

0.30

43.60 (22.96)

48.92 (26.15)

0.96 (.33)

1.98 (.16)

Math score (100)

0.13

58.80 (22.19)

61.37 (21.81)

0.30 (.59)

0.57 (.45)

Tower planning (16)

0.09

8.90 (4.81)

8.77 (5.01)

0.33 (.57)

0.82 (.37)

Digit Span memory (12)

0.16

6.81 (3.28)

7.03 (3.33)

0.48 (.49)

0.43 (.51)

Social Problem solutions(9)

0.08

3.73 (1.71)

3.81 (1.70)

0.00 (.99)

0.24 (.63)

Zanzibar primary analyses

Madrasa

(n = 185)

Non-Madrasa

(n = 185)

Group

Group x Sex

ICC

M (SD)

M (SD)

F (p)

F (p)

Literacy score (100)

.11

48.37 (30.90)

50.25 (30.56)

0.77 (.38)

0.03 (.32)

Math score (100)

.15

43.58 (25.05)

43.48 (24.48)

0.00 (.99)

0.85 (.35)

Tower planning (16)

.05

8.54 (4.70)

8.20 (4.63)

0.65 (.42)

1.52 (.22)

Digit Span memory (12)

.18

5.45 (3.67)

5.36 (3.67)

0.04 (.85)

0.18 (.67)

Social Problem solutions(9)

.07

3.89 (1.83)

4.37 (1.72)

5.02 (.03)

2.70 (.10)

Table 4. Multilevel linear regression predicting children’s primary 1 literacy and math from their pre-primary classroom quality, covarying child’s age and mother’s education, adjusted standardized estimates (2-tailed p value)

MELE Quality (items)

Primary 1 Students’ Outcome Scores

Kenya

Uganda

Zanzibar

Literacy

Math

Literacy

Math

Literacy

Math

Overall quality (50)

.50 (.004)

.23 (.27)

.20 (.18)

.35 (.05)

.16 (.36)

.10 (.57)

Interaction (9 items)

.08 (.70)

-.21 (.32)

.39 (.008)

.45 (.06)

.07 (.28)

.08 (.64)

Language (5 items)

.27 (.14)

.16 (.50)

.23 (.07)

.28 (.10)

.02 (.88)

-.02 (.90)

Numbers (6 items)

.32 (.16)

.56 (.02)

-.28 (.01)

-.22 (.28)

-.02 (.88)

.04 (.82)

Free-play (6 items)

.34 (.08)

.06 (.79)

-.12 (.34)

.16 (.42)

.14 (.36)

.21 (.19)

Note. Significant coefficients are bolded.

Kenya analyses are based on n = 55 pre-primary clusters and n = 259 children.

Uganda analyses are based on 52 pre-primary clusters and n = 289 children.

Zanzibar analyses are based on 69 pre-primary clusters and n = 355 children.

Table 5. Alpha coefficients for total scores and four domains

MELE Domain

Item number

All data

N = 217

Kenya data

n = 78

Uganda data

n = 65

Zanzibar data

n = 74

All items

1 - 50

.80

.82

.82

.73

Madrasas All items

.81

.83

.82

.79

Non-Madrasas All items

.72

.76

.77

.59

Interaction

11 - 18

.49

.56

.55

.41

Language

28 - 32

.53

.57

.57

.57

Numeracy

33 - 38

.68

.54

.64

.80

Free play

45 - 50

.82

.86

.86

.66