Abstract
We study the socioeconomic gradient of child development on a sample of low- and middle-income children aged 6–42 months in Bogota using the Bayley Scales of Infant and Toddler Development. We find an average difference of 0.53, 0.42, and 0.49 standard deviations (SD) in cognition, receptive, and expressive language respectively, between children in the top and bottom quartile of the wealth distribution. These gaps increase substantially to 0.81 SD (cognition), 0.76 SD (receptive language), and 0.68 SD (expressive language) for children aged 31–42 months. These robust findings can inform the design and targeting of interventions promoting early childhood development.
I. Introduction
In low-and middle-income countries, an estimated 219 million (39 percent) children younger than age five fail to reach their developmental potential due to exposure to risk factors such as illness, nutritional deficiencies, and less-responsive parents—all of which are associated to poverty (Grantham-McGregor et al. 2007). These factors affect cognitive abilities beyond the effect of genetics (Hackman and Farah 2009) and generate developmental delays that are difficult to compensate later on in life given the plasticity of the brain in early childhood (Shonkoff 2010). Lower school readiness and performance, lower employability and earnings, and worse adult health and well-being, are among the long-term consequences of exposure to these factors (see Almond and Currie 2010 and references therein).
Hence, low early childhood development levels may not only undermine the expected social and economic benefits of public (governmental) and private (parental) investments in children’s health and education later in life but also reduce the quality of the human resources available in the labor market, thus affecting the aggregate economy (Naudeau et al. 2011). Given the high private and public returns of improved outcomes in the early years, understanding when and why disadvantages in child development start is critical to the design of well-targeted, timely interventions.
The positive association between poverty and socioeconomic status (SES)—as measured by income, wealth, or parental education—on child health and development has been well documented in Western countries (Blau 1999; Bradley and Corwyn 2002; Aughinbaugh and Gittleman 2003), the gradient in health widening as children get older (Case, Lubotsky, and Paxson 2002).
An increasing number of studies are documenting similar patterns in developing countries. Paxson and Schady (2007) finds a difference close to two standard deviations (SD) on receptive language—measured on the Spanish version of the Peabody Picture Vocabulary Test (PPVT)—between children three to six years old in families in the 90th and 10th percentile of the wealth distribution in poor rural Ecuador. Macours, Schady, and Vakis (2012) finds qualitatively similar gaps on the PPVT for children three to seven years old in highly disadvantaged communities in rural Nicaragua as do Bernal and Van Der Werf (2011) when comparing children three to ten years old in the lowest and highest third of the wealth distribution for a Colombia-wide representative sample. This gap is largest among children in urban areas, and more so for children aged four and a half to eight and a half.
Outside Latin America, Ghuman et al. (2005) reports a negative association between household assets and receptive language for low-income children younger than age three in the Philippines. More recently, Fernald et al. (2011) assesses a nationally representative sample of children three to six years old in Madagascar on a range of cognitive and language measures and finds that children from families in the top wealth quintile or whose mothers had secondary education perform significantly better. These differences double between age three and six. In line with this evidence are the findings from two parallel World Bank studies on three-to five-year-olds in rural Mozambique (Bruns et al. 2010) and Cambodia (Filmer and Naudeau 2010).
However, with the exception of Bernal and Van Der Werf (2011) and Fernald et al. (2011), the studies above focus on disadvantaged children in rural areas. In addition, they often measure proxies for cognitive development—such as the PPVT—as opposed to a comprehensive range of developmental functions. Finally, little is known to date about the gap before three years of age, except for recent works by Hamadani et al. (2014) and Fernald et al. (2012). The first study collects high-quality measures of cognition—the Bayley Scales of Infant Development (BSID-II) and the Wechsler Preschool and Primary Scale of Intelligence (WPPSI-II)—for a panel of poor children in rural Bangladesh over a period of five years since birth, and shows that the cognitive gap associated with poverty starts at seven months. Fernald et al. (2012) documents SES gaps on child development for under two-year-olds in rural India, Indonesia, Peru, and Senegal. They, however, use a measure based on maternal reports—the Extended Ages and Stages Questionnaire (EASQ), which is thus likely to be less reliable.
In this paper, we study a representative sample of children aged 6–42 months in the lowest three (out of six) socioeconomic strata in the city of Bogota, Colombia. Eighty-five percent of the city’s population lives in these strata. We construct an index of household wealth and estimate the SES gradient in child development, measured using all scales of the latest version of the Bayley Scales of Infant and Toddler Development (Bayley-III): cognitive, receptive language, expressive language, fine motor, gross motor, and socioemotional. The Bayley-III offers a complete picture of a child’s developmental level by direct observation of their abilities and is currently considered the gold standard for the assessment of children up to 36 months by many (Fernald et al. 2009).
Hence, our study contributes to the existing literature by describing the developmental profile of children at very early ages in low-and middle-income groups in a typical Latin American urban environment, using one of the highest quality measures available. This allows studying any differences in the SES gap with age across developmental domains, which in turn can inform the design and timing of effective interventions.
We find sizeable average SES differences on cognition (0.53 SD), receptive language (0.42 SD), and expressive language (0.49 SD) between children in the top and bottom quartile of the household wealth distribution. The SES gap is about half the size for fine motor and socioemotional development, at 0.26 SD and 0.27 SD respectively, and not significant in gross motor skills. In line with the existing evidence, these gaps (except receptive language) are statistically significant even among the youngest in the sample (6–18 months) at conventional levels, and at the 10 percent for cognition. Similarly, as reported elsewhere, these differences increase with age, reaching levels of 0.81 SD (cognition), 0.76 SD (receptive language), 0.68 SD (expressive language), 0.40 SD (fine motor), and 0.38 SD (socioemotional) for children aged 31–42 months. Finally, we find that the SES gap persists after controlling for other factors likely correlated with wealth, including maternal education, and is robust to a number of alternative specifications.
The paper is organized as follows. The next section describes the study design, the sample, and the data. In Section III, we estimate the SES gap by age and developmental domain. Section IV presents a number of robustness tests and Section V concludes.
II. Study Design: Sample and Data
Our population of interest are children aged 6–42 months in Bogota. To estimate the SES gradient, it would be desirable to study a sample representative of the entire city’s population. Constructing such a sample should be relatively easy given the socioeconomic stratification first introduced in Bogota in the 1980s as a mechanism to target the subsidization of basic public services (such as drinking water, electricity, and gas). Following the 1994 law of Public Services, the Department for National Planning classified entire blocks into six strata—“estratos”—according to their location and quality of infrastructure and housing. This scheme is, in principle, revised every five years. For our sampling exercise, we combined data from the 2005 Census and the 2001 Cadastre, which classified 12.6 percent of all residential blocks in Bogota as estrato 1 (E1), 38.9 percent as estrato 2 (E2), 36.6 percent as estrato 3 (E3), 6.9 percent as estrato 4 (E4), 2.9 percent as estrato 5 (E5), and the remaining 2.2 percent residential blocks as estrato 6 (E6).1
In practice, however, it is extremely difficult to contact and obtain participation consent from the higher strata, who often live in restricted access apartment blocks and compounds. For this reason, at the beginning of the project, we decided to exclude the wealthiest two sectors (E5 and E6) and focus on the first four estratos. These represent about 95 percent of the population of the city. We designed the original study sample to be balanced across E1 to E4 and eight-month range age groups, with 90 children in each stratum-age cell, for a total of 1,440 children in 240 blocks.
Data were collected in three stages. First, once neighborhoods and blocks within them were selected, we visited all households in a block to identify those with children aged 6–42 months.2 Next, trained interviewers carried out a household survey on a random subsample of these children to collect: basic household socioeconomic information (such as demographic composition, education, and employment for all household members, dwelling characteristics, and assets); information on the child’s nutritional status (such as birth weight and gestational age); formal and informal childcare arrangements; and the number of play activities and play materials using UNICEF’s Family Care Indicator (Frongillo, Sywulka, and Kariger 2013).
In a third and final stage, the Bayley-III test was administered in the presence of the mother/main caregiver by trained psychologists (testers) in the library or public childcare center closest to the child’s home. Height and weight of both mother and child were also collected. The online Appendixes 1 and 2 (associated with this article at http://uwpress.wisc.edu/journals) provide more details on sampling, data collection procedures, and the instruments used.
A. Analysis Sample
Data collection took place between March and August 2011 on a sample of 1,533 children aged 6–42 months in 497 blocks mostly in estratos 1 to 3 (more than double the number of blocks originally planned). The Bayley-III test was administered to 1,330 (86.8 percent) of the children for whom we have a household survey. These children constitute our sample of analysis. Table A1 in the online appendix reports summary statistics for this sample by estrato and shows sample balance in age and gender, as well as a positive relationship between estrato and standard measures of household SES.
Geographically, the sample spans all Bogota and is, a priori, representative of approximately the 85 percent of the city population that lives in these estratos. A few considerations are, however, in order. First, we have very few children from E4. As we discuss in the online appendix, soon after the start of field activities, it was clear that E4 households were extremely reluctant to participate in the survey, mostly because of apparent mistrust. To compensate for the loss of children in E4, we increased the number of children in E1 and E2 by 90 each. In addition, mindful of the larger degree of heterogeneity in SES to be found in E3, we oversampled this estrato, adding 180 children. Second, refusals to participate in the study were more frequent in E1 than in E2 and E3, possibly also due to higher mistrust among those most vulnerable.
Hence, while we use estrato as a sampling device to ensure we obtain sufficient observations, we will consider household wealth as our variable of interest in the analysis. Findings are robust to using sampling weights, an issue we return to in Section IV.
B. Construction of a Household Wealth Index
Panel III in Table A1 reports household characteristics by estrato. While there is a strong correlation between estrato of residence and household economic well-being, there is also substantial within-estrato heterogeneity. This is illustrated by the ample geographical dispersion of E3 across the city, from more peripheral areas neighboring E2 to very central areas surrounding E4 and E5, with many more services and amenities (see Appendix Figure A1), and by the variance in the distribution of observables within estrato (Table A1). We formally assess the level of heterogeneity within estratos by constructing a household wealth index using principal component analysis on data on assets and dwelling characteristics (Filmer and Scott 2012).
To construct the wealth index, we first identified those variables that have enough variability within and across estratos and that are ex ante reasonable measures of wealth and standard of living at both ends of the distribution—for example, lack of external windows and ownership of a flat TV. Because all our indicators, except for the crowding index, are categorical, we performed a polychoric principal component analysis to allow comparability of the factor loadings. The final wealth index is the first principal component of the following variables: car, fridge, microwave, washing machine, boiler, computer, smartphone, flat TV, home theatre, DVD, stereo, games console, Internet, garage, whether the household shares the kitchen with other households, whether the household shares the bathroom, has more than one bathroom, has quality floors (tiles, carpet, or wood as opposed to gravel, cement, or dirt), has external windows, and the crowding index. The first principal component explains 43.09 percent of the total variance, the second component explaining an additional 8.03 percent.
Figure 1 shows the distribution of our constructed wealth index by estrato. As suspected, there is a remarkable range of variation in household wealth, especially within E3. This indicates that, even if the correlation between household wealth and estrato is moderate (r = 0.403), socioeconomic block stratification masks a lot of variability in household wealth within areas. Therefore, in order to obtain as precise a match as possible between household economic well-being and child development in what follows, we will use the household wealth index to investigate the SES gradients. In particular, we will compare developmental outcomes for children living in households in the bottom quartile of the wealth index distribution (Q1) with households living in subsequent quartiles (Q2, Q3, and Q4).
We investigate the wealth-representativeness of our sample relative to the overall population of Bogota by estimating a new wealth index using households in Bogota from a nationally representative sociodemographic survey, the 2010 Encuesta de Calidad de Vida (ECV), and the set of wealth-related variables available in both data sets.3 We then use the estimated factor loadings to compute a new wealth index for both the ECV and our sample. Figure 2 plots the density of this index in both samples. As our sample excludes households living in E4 to E6, the density of the wealth index is shifted to the left relative to that in the ECV. However, and perhaps surprisingly, the support of our sample is almost as large as that of the ECV: The largest value of the wealth index in our sample corresponds roughly to the 98th percentile of the ECV, indicating large wealth-representativeness.
Table 1 reports summary statistics for the study sample by quartile of the distribution of the household wealth index. As shown, the distribution of age and sex is well balanced across quartiles. However, not surprisingly, maternal age and employment and parental education increase with household SES. Panel III lists household size alongside the set of variables used in the construction of the wealth index. These offer a good characterization of economic well-being in the sample. Over 90 percent of dwellings in the top quartile have quality flooring, a fridge, a washing machine, a computer; 99.4 percent have external windows. The situation is very different at the opposite end of the distribution: Only 39.6 percent of the dwellings have quality flooring, 31.8 percent have a fridge, 11.7 percent a washing machine, and 4.2 percent and 3 percent have a computer and Internet access; 31.2 percent lack external windows. The level of stimulation in the home (FCI scores in Panel IV) and having a child minder also are correlated positively with the household SES.
C. Child Development Outcomes: Externally Versus Internally Standardized Scores
We assessed child development using the cognitive, receptive language, expressive language, fine motor, gross motor, and socioemotional scales of the third version of the Bayley Scales of Infant and Toddler Development (Bayley-III, Bayley 2006).
The first five scales consist of a series of tasks (items) of increasing difficulty that the child has to perform. The child scores one for each item correctly executed and zero otherwise. The socioemotional scale consists of a maximum (depending on the child’s age) of 35 five-point frequency rating questions responded by the mother/ main caregiver. The online Appendix 3 provides more details on the test, training, and administration.
For each scale, the raw score is the sum of all correct responses. Raw scores are adjusted by age and difficulty level in a nonlinear fashion to produce the composite scores, constructed to have a mean of 100 and a SD of 15 at every month-of-age. The norms (weights) used to construct composite scores are based on the reference population on which the test was standardized: a representative sample of 1,700 children in the United States. Also, the receptive and expressive language and the fine and gross motor subscales are combined to produce a single language and motor scale, respectively. Working with externally standardized scores is useful as it allows comparing scores within and across populations and across scales. However, some anomalies in the distribution of the composite scores by age suggest that the external norms used to standardize the Bayley-III may not be appropriate for our sample.
In Figure 3, we plot kernel estimates of average cognitive, language, motor, and socio-emotional composite scores against age (in months), along with 95 percent confidence intervals. Given that composite scores are constructed to have a mean of 100 points at every month-of-age, we would expect the nonparametric regression line to be flat around 100. Instead, we observe nonlinear relationships between the scores and age.
To an extent, the negative age gradients—this is to say, the presumed delay in development with respect to the population of reference as the child ages—in cognition and language, and to a lesser degree, for socioemotional development, are to be expected given that we are working with a more deprived sample (representative of low-and middle-income groups in Bogota) than the sample for which the test was standardized (representative of all socioeconomic groups in the United States). However, the 12-point increase in motor development over the age range and the four-point increase in language scores from 22 to 42 months are unusual. These unexpected relationships with age could be explained by cultural differences between the standardized and study populations, as well as by problems with translation.4 Similarly, they could also explain the early advantage children in the sample seem to have in cognition and language.
Table A2 in the online appendix reports means and SDs of the composite scores by quartile of the household wealth index and 12-month age groups. By age, the evolution of the scores for each wealth quartile displays similar patterns to those observed in Figure 3 for the entire sample. The SDs are smaller than the expected 15 points in the standardized population and decrease with age, particularly for cognition. The changing SD with age and the relation of the mean scores to age suggest the unsuitability of external norms and highlight the need to use internally standardized scores that adjust for age so as to allow comparisons.
Hence, as commonly done with developing country data, our analysis will focus on within-country comparisons between SES groups and use internally standardized scores. Often the standardization is done by dividing the sample into the smallest possible age groups—ideally monthly, given how sensitive developmental milestones are to age in the early years—but guaranteeing enough observations per group, and computing z-scores within age groups. (Subtract the months-of-age-specific mean of the raw score and divide by the months-of-age-specific SD.) (See Fernald et al. 2011, for example.)
We follow this approach but compute internal z-scores in a more flexible form in order to replicate as closely as possible the way in which the Bayley-III constructs composite scores while taking into account our limited sample size. In particular, instead of using months-of-age-specific means and SDs, we estimate age-conditional means and SDs using regression methods as described in the online Appendix 4.5 This procedure is less sensitive to outliers and small sample sizes within age category. As expected, the substantive results that compare across wealth groups are unaffected by this procedure or the use of composite scores. Section IV provides more details on this.
III. SES Gap by Developmental Domain
We start by exploring the timing of the SES gap by developmental domain and how it varies with age nonparametrically. The graphs in Figure 4 plot kernel-weighted local polynomial regressions of the cognitive, receptive and expressive language, fine and gross motor, and socioemotional internally standardized scores on age for the lowest (Q1) and highest (Q4) quartiles of the distribution of household wealth, along with 95 percent confidence intervals.
The top left graph in Figure 4 displays a remarkable gap in cognition, significant from at least as early as 12–13 months of age, and reaching levels of almost one SD by 42 months of age.6 The gap increases throughout the entire age range, given that children in the bottom quartile exhibit a negative age gradient (cumulative delay with age), whereas the age gradient for children in households in the top quartile is positive.
Similarly, the next two graphs show an age-increasing SES gap in receptive and expressive language that becomes significant earlier—at around ten and eight months of age, respectively. However, from about 18–24 months of age, the gap becomes statistically insignificant. Note that this age range coincides with the “dip” in language scores shown in Figure 3, which may partly be attributable to inaccuracy in the measurement of language skills at these early ages using the Bayley-III as translated. Note also that the scores and gap in receptive language vary more consistently with age than the expressive language ones. This pattern may be related to the fact that the development of receptive language skills precedes that of expressive language skills.
The bottom graphs show that, while fine motor scores for children in the richest 25 percent of the wealth distribution are always higher than for those in the poorest 25 percent, gross motor scores are higher for the poorest children until 20 months of age. These differences are not statistically significant. Finally, the gap in socioemotional development is considerably smaller than that in cognitive or language development over the age interval we consider, attaining a maximum value of 0.4 by 42 months and becoming statistically significant much later, at around 32 months of age.
Next, we use a parametric framework to quantify the size of the SES gap on child development. We compare mean developmental levels for children in households in different quartiles of the distribution of the household wealth index, by estimating, for each developmental domain j, the following equation by OLS:
(1)
where ZYij is the age-adjusted z-score, internally standardized as described in the online Appendix 4, obtained by child i on developmental domain j, agei is expressed in months, femalei is a dummy variable and testeri controls for any systematic unobserved differences in test administration and scoring across testers.7 Qki is the kth quartile of the household wealth distribution. The omitted category is the first wealth quartile, Q1, and hence is the estimated coefficient of the difference in the average z-score obtained by children in wealth quartile k with respect to the average z-score obtained by children in the first wealth quartile, on scale j. is the error term and includes other observable and unobservable child and household characteristics correlated with developmental outcomes. We cluster standard errors at the neighborhood level (primary sampling unit, n = 128 neighborhoods) to allow these characteristics to be spatially correlated within neighborhoods but not across. In order to establish the age range at which the SES gap first becomes statistically significant, we estimate Equation 1 by age groups—6–18, 19–30, and 31–42 months—separately.
Table 2 reports results. For each domain j, we report average estimates for the entire sample first, and disaggregated by age category, in subsequent columns. As the scores are internally standardized, estimates can be interpreted in terms of standard deviations within our sample. We find a significant difference between children in the top and bottom quartile of the wealth distribution in all developmental domains, except gross motor. The average SES gap for children aged 6–42 months is substantially large for cognition (0.53 SD), receptive language (0.42 SD), and expressive language (0.49 SD), and about half the size in fine motor (0.26 SD) and socioemotional development (0.27 SD). Note that the SES gradient is positive—this is, the size of the gap increases the higher the quartile—for all developmental domains and is already statistically significant between children in Q3 and Q1, and even in Q2 for cognition.
By age group, we observe that the gap generally increases with age for all wealth quartiles (when compared to Q1), and particularly for Q4. The difference in cognitive scores, for example, is 0.26 SD between children in Q4 and Q1 in the youngest age group (6–18 months, significant at the 10 percent level), it increases to 0.55 for children in the middle age group (19–30 months), and is as high as 0.81 SD for those in the oldest age group (31–42 months). Similarly, the gap in receptive language increases from a statistically nonsignificant 0.20 SD at 6–18 months to 0.30 SD at 19–30 months, reaching 0.76 SD at 31–42 months. The magnitude of the SES gap for expressive language is sizeable and statistically significant at conventional levels since very early ages, measuring 0.43 SD for children 6–18 months. However, it increases at a slower pace with age, reaching 0.68 SD at 31–42 months.
For fine motor skills, the gap is statistically significant for the youngest and the oldest in the sample, at 0.25 SD and 0.40 SD, respectively. For the socioemotional scale, we find a statistically significant gap between the fourth and first quartile of wealth, although it is much smaller than for other domains. Interestingly, the gap does not increase monotonically with age over the interval we consider: It starts at 0.30 SD for children aged 6–18 months, declines to 0.20 for children aged 19–30 months, and increases to 0.38 SD for children aged 31–42 months.
We have formally tested whether the gap widens as children get older by comparing the size of the average gap for children aged 31–42 months with that for children aged 6–18 months. Results show statistically significant differences for cognitive development (estimated mean gap difference = 0.55, p-value = 0.000) and receptive language (mean = 0.55, p-value = 0.002).8 A variant of Equation 1, where we include the wealth index as a continuous variable and the interaction of wealth with age, offers similar results (available upon request).
Lastly, we have also investigated whether the evolution of developmental differences by SES varies between boys and girls and found no major gender differences. However, the gap in cognition, expressive language, and fine motor skills becomes statistically significant at slightly earlier ages for girls (further details also available upon request).
IV. Robustness
An important concern discussed in Section IIB is the extent to which differential refusal rates to participate in either the household survey or the Bayley-III test by estrato can jeopardize the representativeness of the sample. So far, our strategy has relied on using the constructed household wealth index—as opposed to estrato—as the relevant SES variable. In this section, we specifically address this issue by reestimating Equation 1, weighting each observation by the inverse of the probability that the child is included in the estimation sample. Results, reported in Table A3 in the online appendix, are similar to those reported in Table 2, suggesting that our sample is very close to being representative, as already suggested by the comparison with the wealth distribution in the ECV sample in Figure 3.
In addition, we further investigate differential sample loss between the household survey and the Bayley-III. Although a chi-square test of goodness of fit cannot reject the null that the proportion of children not showing up for the Bayley-III is equally distributed across wealth quartiles (p-value = 0.341), this proportion does increase with household SES. More specifically, if we model this probability with a probit, we find that surveyed children with younger mothers, living in households with other children aged five to seven and/or no elders, and attending a childcare center, have a higher probability of not attending the test (see Table A4). These results possibly suggest that mothers without alternative forms of care faced difficulties affording the time to take the child to the test. Table A5 confirms that our results are not affected by the inclusion of these variables in the analysis.
Note that the interviewer dummies in Table A4 are jointly significant (p-value = 0.000). This indicates that the interviewer played a significant role in motivating the mother to take the child to the test—both through her ability administering the household survey and through direct encouragement. However, her identity is independent of the child’s skills and performance in the test for two reasons: Households are assigned to interviewers randomly, and the interviewer did not administer the test. Hence, we have used the identity of the interviewer to correct for potential selection bias into the test using a Heckman selection model. The estimated SES effects with the selection correction, shown in Table A6, are very similar to those reported in Table 2. The Wald test of independence indicates that we cannot reject the null of no selection bias in 16 out of the 24 cases—that is, selection into the Bayley-III is uncorrelated with unobserved child ability. Moreover, in those cases in which the correction is required (p-value of Wald Test < 0.05), the estimated gaps using the correction are larger than those reported previously. In particular, the gap on cognition for children aged 6–18 months is significant at the 5 percent at 0.28 SD and of 0.88 SD for children aged 31–42 months.
Tables A7 and A8 respectively assess the robustness of the estimated SES gap to constructing the dependent variables differently—namely, to internally standardizing z-scores using the traditional approach (substracting the 2-months-of-age mean of the raw score and dividing by the 2-months-of-age SD), and to working with the composite scores. In both cases, we find comparable size and gap patterns by age and wealth quartiles. In fact, a comparison of these estimates with those reported in Table 2 shows that our standardization procedure results in marginally lower gap estimates. It is worth noting the significant gap between the top and bottom wealth quartile in cognition and language composite scores for the youngest children in the sample, at 0.26 SD and 0.32 SD respectively (reported in row Gap Q4– Q1 and computed dividing the point estimate on Q4 by the SD of the estimation sample).
Omitted determinants of our outcome variables may also be correlated with those variables that compose the wealth index. We thus reestimate Equation 1 including parental factors (maternal education, presence of mother and father in the household, and first child), child biomedical factors (prematurity, prematurity interacted with age, birth weight, and stunting), factors related to the home environment (FCI scores for books, newspapers, and magazines; play activities; and play materials), and factors related to institutional childcare arrangements (attendance in public or private preschool, or in small nurseries run by community women), as additional controls.9 All these variables are strongly associated among each other and with child development. Results in Table A9 show that, while substantially reduced in size, the effects on the oldest age group remain for cognition as well as for receptive and expressive language. Similarly, the average effects on cognition (0.27 SD), expressive language (0.23 SD), and socioemotional development (0.17 SD) remain significant.
Finally, results are also robust to dropping the 11 children in E4 and outliers in the distribution of child development (results available upon request).
V. Discussion and Conclusion
In this paper, we examine the developmental levels by age, household wealth, and developmental domain on a sample of children aged 6–42 months from low-and middle-income households in Bogota and quantify the size of the SES gap using the Bayley-III—considered by many the best assessment for child development for children younger than age three.
We find evidence of a sizeable and statistically significant SES gap in child development from very early ages. Moreover, and in line with previous studies—most on older children—the gap increases substantially, monotonically, and significantly with age. The findings indicate that disadvantaged children going to preschool at three or four years of age are already likely to have marked cognitive and language deficits. There is little evidence at present indicating when the most effective age is to intervene. Although intervention trials are required to establish with any certainty, our findings suggest that intervening in the first year to prevent children from accumulating delays has the potential of high returns. Nonetheless, the increasing cognitive and language deficits after 24 months of age documented in this and other studies indicate that interventions need to last throughout that age period and up to school entry. This suggestion contrasts with early nutritional interventions that are recommended to concentrate on the first 1,000 days (Black et al. 2013). Our findings also suggest that interventions for disadvantaged children should particularly target cognitive and language skills, where the deficits are greatest.
For the older children in the sample, the estimated SES gap, while significantly reduced, persists after controlling for other factors, including maternal education, the quality of the home environment, and the type of institutional care. The latter two factors, for example, could be thought of as choice variables related to parental investments in their children—namely, the quantity and quality of childcare provided—and hence possibly modifiable through public policy. It should be stressed, however, that none of the evidence provided in this paper is causal. Indeed, parental investments are determined jointly with child development and may react to its evolution, either in a “compensatory” (parents invest more in children with lower perceived ability to close gaps) or a “complementary” (parents devote more resources to children with higher perceived ability because of the highest expected returns) fashion.
Despite these caveats, our findings are important for stimulating research in early interventions and evaluation of questions such as the most effective age to start investing, the most effective duration, and curriculum content. They may even contribute to guiding the eventual development and timing of supportive ECD interventions aimed at reducing unequal opportunities earlier in life and income disparities in the longer run.
Footnotes
Marta Rubio-Codina is a senior research economist at the Centre for the Evaluation of Development Policies (EDePo) at the Institute for Fiscal Studies and a senior economist at the Inter-American Development Bank.
Orazio Attanasio is a professor of economics at the Department of Economics at University College London and a research fellow and codirector of EDePo at the Institute for Fiscal Studies.
Costas Meghir is a professor of economics at the Department of Economics at Yale University and a research fellow at the Institute for Fiscal Studies.
Natalia Varela is a doctoral candidate at Université Laval.
Sally Grantham-McGregor is emerita professor of international child health at the Institute of Child Health at University College London. The opinions expressed herein are those of the authors and not necessarily of the institutions they represent. This research would not have been possible without the outstanding work of Pablo Muñoz, Mara Minski, Belén Gómez, Juan Fernando Trujillo and Hanner Sánchez in the field; and funding from the Inter-American Development Bank for data collection. The authors are grateful to the BibloRed network of libraries, Jardines Sociales del Distrito de Bogota, and aeioTu centers for lending us their facilities. The authors thank Camila Fernández, researchers at EDePo, and two anonymous referees for useful comments and suggestions; and José Guerra for research assistance. Rubio-Codina’s research time was financed by the Leverhulme Trust Early Career Fellowship ECF/2008/0170 on “CCTs, Child Care and Child Development.” Attanasio’s research time was partially financed by the European Research Council Advanced Grants 249612 on “Exiting Long Run Poverty: The Determinants of Asset Accumulation in Developing Countries” and by the ESRC Professorial Fellowship ES/K010700/1 on “The Accumulation of Human Capital in Developing Countries.” Rubio-Codina, Attanasio, and Meghir acknowledge financial support to research time from Grand Challenges Canada. All errors are the authors’ own. The data used in this article can be obtained beginning November 2015 through October 2018 from Marta Rubio-Codina.
↵1. See DANE (2011) for more details. More recently, the distribution seems to have shifted slightly to the right: 8.1 percent of all residential blocks are classified as E1, 37.3 percent as E2, 38.2 percent as E3, 11.3 percent as E4, 2.1 percent as E5, and the remaining 2.7 percent as E6.
↵2. Neighborhoods and blocks were selected using the proportion of women in fertile age as weights (probability design). Children were identified by door-to-door census. We obtained 403 children in 134 blocks in E1, 459 in 159 blocks in E2, 457 in 199 blocks in E3, and 12 children in five blocks in E4.
↵3. These are: car, fridge, microwave, washer, boiler, computer, DVD, stereo, console, Internet, shared kitchen, shared bathroom, more than one bathroom, quality floors, and crowding index.
↵4. We did not reorder the test items since we did not have the resources to request permission from the publisher and undertake this task.
↵5. Bayley-III composite scores are a nonlinear function of raw scores where: (i) items administered in younger ages are given more weight, and (ii) months-of-age are lumped together until the age of 36 months, and in larger intervals thereafter.
↵6. Note that the confidence interval of the difference in cognitive scores between Q4 and Q1 would be tighter than the difference in confidence intervals shown in the graph (the variance of a difference is smaller than the difference of variances), meaning that the “true” SES gap in cognition in this sample is in fact significant at an earlier age.
↵7. Recall that the testers were distributed randomly across estratos and age categories.
↵8. The standard errors (SEs) for these tests are computed by simulating the distribution of the difference in the gap using 500 random draws of the sample (sampling with replacement) and clustering at the neighborhood level.
↵9. We present this more parsimonious model after experimenting with the inclusion of other relevant variables.
- Received January 2013.
- Accepted March 2014.
This open access article is distributed under the terms of the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0) and is freely available online at: http://jhr.uwpress.org.