Abstract
We use large-scale register data on 450,000 Swedish males who underwent mandatory military enlistment at age 18, and a subsample of 150,000 siblings, to examine why tall people earn more. We show the importance of both cognitive and noncognitive skills, as well as family background and muscular strength for the height-earnings relationship. In addition, we show that a substantial height premium remains after these factors have been accounted for, which originates from very short people having low earnings. This is mostly explained by the sorting of short people into low-paid occupations, which may indicate discrimination by stature.
I. Introduction
The positive association between height and social status is one of the most consistent findings in the social sciences. It has been obtained using both 18-century data from Germany and using recent U.S. and U.K. data (Komlos 1990; Persico, Postlewaite, and Silverman 2004; Case and Paxson 2008a; Steckel 2009). In developing countries, where work tasks are more manual and physically demanding, this association has been attributed to greater physical strength accompanying height (for example, Steckel 1995; Thomas and Strauss 1997). For the Western world, where fewer jobs are physically demanding, it has been suggested that the height-earnings association arises because short people are being discriminated against but two recent studies, Persico, Postlewaite, and Silverman (2004) and Case and Paxson (2008a), attribute the association between height and earnings to mental skills associated with height.1
Despite using the same data source, the two studies reach different conclusions concerning the nature of the underlying mechanisms at work, however. Case and Paxson (2008a) emphasize the role of cognitive skills whereas Persico, Postlewaite, and Silverman (2004) stress the importance of noncognitive (or social) skills. In their analysis of data from the United Kingdom and United States (National Child Development Study, NCDS, and National Longitudinal Survey of Youth, NLSY, respectively), Persico, Postlewaite, and Silverman (2004) find that teen height essentially explains the association between adult height and hourly wages among white men. In addition, they show that even if cognitive test scores in childhood are correlated with adult earnings, these scores do not affect the magnitude of the association between height and earnings. Instead, they argue that the link between teen height and later life earnings runs via participation in activities that build social skills during adolescence. From this perspective, noncognitive skill can be viewed as a mediating factor in the height-earnings relationship as it results from social processes in which height in youth influences its development.
Case and Paxson (2008a) use wider inclusion criteria and argue that height growth and cognitive development during childhood are subject to common environmental and nutritional circumstances. Accordingly, the association between height and earnings does not reflect a causal effect of height, but rather that height is correlated with cognitive ability via some underlying third factor affecting both height and earnings. They find that neglect of controlling for cognitive ability substantially biases the estimate for height in the earnings regression upwards.2
In this paper, we provide new evidence on why tall people earn more. We contribute to the literature by jointly analyzing the hypotheses offered by Case and Paxson (2008a) and Persico, Postlewaite, and Silverman (2004) on the role of cognitive and noncognitive skills. For this purpose, we exploit data on an unusually large sample of 450,000 Swedish males enlisting for the military between 1984 and 1997. Besides including high-quality composite measures of both cognitive and noncognitive skills, the data include measured height at age 18 and register-based data on adult earnings, which should be more reliable than information obtained from surveys. Because enlistment was mandatory for all males at age 18 during the study period, our data also have an extraordinary degree of representativeness.
In addition, we examine the role that both observed and unobserved family background plays for the relation between height and earnings. Part of the association between height and earnings may simply reflect that tall people come from advantaged backgrounds but the previous literature has not been able to fully address this. We therefore exploit data on a large sample of 145,000 sibling brothers who enlisted for the military. As a further contribution, we investigate the importance of muscular strength. While this factor has not been considered in the economics of height literature to date, we argue there are reasons to believe it could still be of importance for the height-earnings relationship also in the western world.
Finally, we relax the assumption of linearity in the relationship between height and earnings. By estimating models with centimeter fixed-effects and piecewise-linear spline models, we are able to analyze whether any height-earnings relationship that remains after accounting for control variables and mediating factors could reflect discrimination against short people or, correspondingly, preferential treatment of tall people. We also investigate the extent to which any such remaining height premium reflects sorting of short people into low-paid occupations.
Our results demonstrate a substantial association between height and earnings also in a country such as Sweden, with universal healthcare coverage and small health and income inequalities. We then find that half of this raw association is explained by tall people coming from advantaged family backgrounds and having higher cognitive test scores. While this provides some support for the findings of Case and Paxson (2008a), we then also show the importance of noncognitive skills. Arguing that noncognitive skills should be thought of as a mediating factor in the height-earnings relationship, we find this type of skill to explain one fifth of the height premium, lending some support also to the hypothesis put forward by Persico, Postlewaite, and Silverman (2004). When we also add muscle strength to our analysis, the magnitude of the height-earnings relationship is further weakened.
Although inclusion of our control and mediating variables reduces the raw association between height and earnings by as much as three-quarters, we find that a significant relationship between height and earnings remains. Relaxing the assumption of linearity in the height-earnings relationship shows that this remaining height-earnings association originates from very short people having low earnings. This could be indicative of discrimination of short people on the labor market, although we cannot rule out that it instead is a result of some, by us unobserved, productivity enhancing trait being associated with height. Irrespective of the mechanism, we show that this remaining ”shortness penalty” is explained by short people sorting into low-paid occupations.
This paper is organized as follows. Section II outlines a basic conceptual and empirical framework and discusses how cognitive and noncognitive skills as well as muscular strength may be linked to height in late adolescence, thereby potentially influencing the association between height and earnings. In Section III we describe our data and provide some descriptive statistics. Our results are presented in Section IV, whereas Section V concludes.
II. Conceptual Framework
In order to conceptualize how the “returns” to height could be interpreted, consider the following counterfactual framework:
(1)
which illustrates potential earnings, yih, under different heights, h, for an individual i. In this framework, the causal return to height, or the “height premium,” can be thought of as the additional earnings that each additional unit of height generates.3 There are mainly two sorts of mechanisms that would generate such returns. First, the height premium may result from preferential treatment of tall people. In this case, the returns to height reflect a taste for stature per se on the labor market. Second, the returns may result from any positive effect height brings on productivity, for instance through its effect on the formation of noncognitive skills. Here, it is not height per se that is valued on the labor market but rather the skills that result from greater height.
In order to empirically investigate the returns to height, the previous literature has adopted empirical specifications of the type:
(2)
where Xi′ denotes a vector of all relevant control and mediating variables and γ denotes the corresponding regression coefficients. Given that no mediating factors are included in Xi′ and under the assumption that Cov(hi, εi) = 0, β would measure the causal return to height. To the extent that the returns to height reflect the impact of height on productivity-enhancing factors, including these mediating factors in X would reduce the magnitude of β. What remains of β then capture the returns to height as measured through preferential or discriminatory treatment.
As discussed in the introduction, Persico, Postlewaite, and Silverman (2004) argued that noncognitive skills are mediating factors in the relationship between height and earnings. Case and Paxson (2008a) instead argued that the height coefficient picks up that tall people earn more because of their greater cognitive skills. If this latter argument is valid, a simple regression of earnings on height would suffer from a standard omitted variables problem, where the estimated β would be biased upward due to the omission of cognitive skills in the regression.
Our discussion so far thus shows that there is a sharp distinction between the two hypotheses when it comes to the underlying mechanisms at work. In the following three sections we deepen our discussion on the role of cognitive, noncognitive skill and muscular strength for the height-earnings association in order to clarify our choice of empirical strategy.
A. Cognitive Skills
A positive association between height and cognitive ability, throughout the human lifespan from early childhood to old age, has been documented in numerous studies. (See, for example, Richards et al. 2002; Case and Paxson 2008a, 2008b.) This association was exploited by Case and Paxson (2008a) and formed the basis for their argument that the association between height and earnings largely reflects the fact that taller people on average have greater cognitive abilities.4 The authors also provide a survey of some of the potential explanations for the positive association between height and cognitive ability. Summarizing, it is postulated that certain growth factors are related to both height and cognitive ability, although there is still substantial uncertainty regarding the exact underlying mechanisms. For instance, insulin-like growth factors are believed to influence body growth at the same time as influencing neural growth. It is also assumed that environmental factors, such as maternal smoking during pregnancy, may also explain part of the association between height and cognition. In comparing twins, Black, Devereux, and Salvanes (2007) found a connection between birthweight on the one hand, and IQ test scores and height in late adolescence on the other, again underscoring the importance of the intrauterine environment for adult height as well as cognitive ability.
Based on the basic presumption that an individual’s endowment affects both cognitive and height development, Case and Paxson (2008a) postulate that cognitive ability and the onset of the growth spurt in adolescence should be correlated, which is indeed what they find in their data.5 For boys, cognitive test scores at age 11 are positively associated with height gain in the subsequent five years and negatively associated with height gain from age 16 onwards. Though boys entering their growth spurt late may catch up in height, they will still lag behind cognitively as the relative position, when it comes to intelligence, is rather firmly established early in life. This fact may explain why height in adolescence is a stronger predictor of adult earnings than adult height.
Hence, the association between height and cognition has primarily been attributed to common underlying factors, and not to any causal relation between the two. From this respect, cognitive ability should be regarded as a control variable when estimating the height premium.6
B. Noncognitive Skills
Noncognitive skills are commonly referred to in the literature as personal character traits such as motivation, persistence, charm, and leadership abilities. In the labor economics literature, the term essentially describes a complexity of personal characteristics connected to the aspects of one’s persona (apart from pure cognition), potentially affecting productivity.
It seems uncontroversial to presume that noncognitive abilities are valued by employers, co-workers and potential customers in almost any kind of occupation. Indeed, a recent body of research has shown that noncognitive skills are important predictors of various adult socioeconomic outcomes. (See, for example, Heckman and Rubinstein 2001; Heckman, Stixrud, and Urzua 2006; Borghans et al. 2008; Lindqvist and Vestman 2011.) If height in adolescence is related to the development of noncognitive skills, it is therefore straightforward to formulate a hypothesis where part of the height premium reflects the returns to noncognitive skills, as in Persico, Postlewaite, and Silverman (2004).
Whereas the linkage between cognition and height has been attributed mainly to biological processes early in life, the connection between noncognitive skills and height has commonly been attributed to social processes, in which taller individuals are treated differently than shorter ones. For instance, a common theory in social psychology postulates that taller people are more successful for reasons of interpersonal dominance. (See, for example, Frieze, Olson, and Good 1990.) It then follows that if being tall is an advantage in different competitive situations, this may facilitate the building of self-esteem and social skills in relatively taller individuals. Moreover, children may face different expectations from parents, teachers, other adults, and peers depending on their height. If children react to expectations by complying, such expectations may become self-fulfilling. For instance, if taller children are perceived and treated as being more mature, when it comes to temperance, responsibility, and planning, they may well be fostered into developing such traits.
As discussed by Persico, Postlewaite, and Silverman (2004), another explanation emphasizes the role of self-esteem, but from a different perspective. Here, the height premium stems from the greater self-image that is achieved through a comparison with a socially determined notion of ideal height. It is then argued that a greater self-image leads to higher achievement through noncognitive factors such as perseverance and social skills. This is related to the findings of Persico, Postlewaite, and Silverman (2004), where teen height is found to explain a large part of the relationship between height and earnings, and where the premium is reduced when controlling for participation in high school sports and clubs. The latter finding causes the authors to conclude that such participation shapes noncognitive skills. This is well in line with Kuhn and Weinberger (2005) who, controlling for cognitive ability, found that self-perceived leadership skills as well as exposure to leadership positions in high school, was positively related to earnings later in life. Moreover, Lindqvist (2012) finds that height is positively related to leadership skills, as measured at the Swedish enlistment.
In sum, existing explanations for the association between height and noncognitive skills in general focus on how height may affect the development of noncognitive skills, rather than both being determined by some third, underlying factor. From these respects, noncognitive skill is a mediating factor in explaining the height premium.7
C. Muscular Strength
It has long been known that anthropometric measures such as body height are strongly correlated with muscular strength. (See Everett and Sills 1952 for an early example.) If muscle strength would also have an independent influence on earnings, part of the height-earnings relationship would reflect the association between height and muscle strength. Though admittedly, the direct importance of muscle strength for labor market success has likely declined with the development of society and modern production processes, there are still a number of reasons why it may be correlated with labor market success in the western world.
First, the development of the muscular system may be related to the same genetic predispositions and susceptible to the same environmental conditions as height growth, thereby yielding a positive correlation between the two (Silventoinen et al. 2008).8 To the extent that the same genetic and environment factors also influence a person’s labor market success, muscle strength would be an important control variable when estimating the height premium.
A second possibility is that tall people to a larger extent participate in activities that not only builds noncognitive skills, as suggested by Persico, Postlewaite, and Silverman (2004), but also muscle strength, such as sports, and that these same characteristics have a labor market return. (See Rooth 2011.) In fact, to the extent that being tall constitutes an advantage in certain sports activities, one would expect tall people to participate to a greater extent. From this perspective, muscle strength would instead be a mediating factor in the height-earnings relationship, because it may partly be caused by being tall.
Finally, it is also possible that male muscle strength signals certain personality traits. Some studies have for instance found that handgrip strength predicts not only physical fitness but also aggressiveness and dominance (Gallup, White, and Gallup 2007). These traits can be recognized as dimensions of noncognitive skills, which would mean that muscle strength would partly pick up the same factors as direct measures of noncognitive skills.
Even though there are good reasons to believe that muscle strength may play a role in the height-earnings relationship, our discussion also shows there is great uncertainty about the mechanisms at work. In our empirical analysis, we will therefore be somewhat agnostic about whether muscle strength mainly plays the role of a control variable or a mediating variable and leave that for future research to investigate.
D. Empirical Framework
Our baseline empirical model relates adult earnings to height at age 18 for the full population and controls only for age:
(3)
where i is the index for the individual, h is a measure of individual height in centimeters, and where a indicates age. In this model, β will pick up both the causal return to height, that run through preferential treatment of tall people or through mediating factors, and the influence of any omitted control variables that are correlated with height and earnings. We will then alter this baseline model by including different control variables and mediating variables into the regression. In particular, the previous section introduced three sets of variables that are of importance in understanding the relationship between height and earnings; cognitive skills, noncognitive skills, and muscular strength. We will also introduce variables indicating family background; that is, parental education and parental earnings, which will be thought of as control variables, as they may relate to both an individual’s height and earnings. By entering the explanatory variables one by one, starting with the controls, and then together, we can analyze to what extent the relationship between height and earnings in Equation 3 above is affected when accounting for control variables and mediating factors. Our full model can thus be written:
(4)
where X now is the vector of variables measuring age, parental background, cognitive skills, noncognitive skills, and muscle strength, and where γ is the associated vector of regression coefficients. In this model, the estimate of β can be interpreted as the returns to height that remains after accounting for the effect of height that runs through noncognitive skills and after controlling for factors associated with, but not caused by, height—that is, cognitive skills and parental background.9 This remaining height premium can then be interpreted as the part of the height premium that runs through preferential treatment of tall people on the labor market, given that all mediating factors and confounding factors have been accounted for.
Note that in our main empirical analysis, we only include variables that are determined before labor market entry. As argued by Case and Paxson (2008a), controlling for variables such as occupation and postsecondary education would hide part of the height premium if taller individuals sort themselves into certain educations or jobs. However, at the end of the analysis found in Tables 4.1 we will examine the extent to which the remaining height premium works through sorting into occupations and educations.
Our second main specification exploits our data on siblings and can be written as:
(5)
where ij is an index for an individual i in family j and μ represents a family fixed effect. The fixed effect captures family characteristics common to all siblings within the same family. Here, identification of the height coefficient relies upon sibling variation in height at age 18. In this specification, our estimate of β will not be biased due to any confounding influence from unobserved family level factors that are also associated with earnings.
The availability of large-scale sibling data constitutes an important advantage in our analyses. First, height is known to have a very high genetic heritability explaining up to 80 percent of its variation (Visscher, Hill, and Wray 2008).10 If some of the genes that affect height, via separate channels also affect earnings, this will bias the height coefficient, because genes are not observed. However, since biological siblings share on average 50 percent of their genes, our fixed effects approach therefore ”washes” out some of the genetic influence, which may reduce the bias in the height coefficient. A similar reasoning applies to the coefficients for cognitive skills, noncognitive skills, and muscle strength.
Even though the siblings fixed effects approach does not fully cancel out the influence of genetics, the sibling data give us another advantage: Mendellian randomization, or the “genetic lottery,” implies that within a family, it is random which child inherits a particular gene.11 Thus, if the variation in height across siblings that remains after accounting for sibling fixed effects would be mainly genetically determined, we can be confident that height variation across siblings is exogenous.12
Second, the sibling approach allows us to difference out confounding environmental factors operating at the family level. Sibling teenagers are likely to share important unobserved factors such as food and nutrition supply in the home, parental practices, and preferences. They are also likely to attend the same school and thus face the same school environment and neighborhood characteristics.13 However, parents may also reinforce or compensate for differences in endowments between siblings by directing various amounts of skill building resources to each child. Empirical evidence from Western countries seems to mainly support the hypothesis that parents try to compensate for ability differences between children. (See Almond and Currie 2011 for a recent literature overview.) To the extent that skills and strength are associated with labor market success as well as height, such compensatory practices would dampen the association between height and earnings.
III. Data and Descriptive Statistics
Our empirical analysis is based on a data set constructed by integrating registers from Statistics Sweden (SCB) and the Swedish National Service Administration. The latter contains information on every individual living in Sweden in the year 1999 who enlisted in the military between 1984 and 1997.14 Our study population consists of all males who were 28–38 years old in 2003, who enlisted in the military, and for whom there is full information on relevant variables. Enlisting for the military is carried out during a two-day procedure and was mandatory for all male Swedish citizens the year they turn 18. Only those with severe handicaps, or institutionalized (due to mental disorders or being in prison), or living abroad, are exempted from enlisting.15 It should also be noted that a refusal to enlist results in fines, and eventually in imprisonment. In order to avoid any confounding influence of ethnic discrimination, we restrict our analyses to native Swedish males—that is, those born in Sweden to Swedish-born parents.16 Given these restrictions, our study population covers about 92 percent of the total native male Swedish population in the relevant cohorts.
Our base sample consists of 468,312 individuals. Out of these, 96 percent had positive annual earnings in 2003—that is, 448,702 individuals, which is the sample that we use in our analyses. Hence, there is very little attrition in the data and it more or less covers the entire native born male Swedish population. However, in the main part of the analysis we instead focus on variation between siblings, which reduces the sample to 145,210 individuals. Because the enlistment variables are measured by military personnel, and earnings by tax authorities, our results are not likely to be influenced by any reporting bias, which often plagues survey data.
Our measure of annual earnings takes into account income from work, self-employed income, and social insurance benefits such as sickness benefits and parental benefits for the year 2003 and is taken from the tax records. A sensitivity analysis presented in the online appendix, where only incomes from work and self-employed income constitute the measure of earnings, shows that the inclusion of social insurance benefits does not affect our results.
Cognitive skill is measured using Enlistment Battery 80, which is a test similar in style to the Armed Forces Qualification Test (AFQT) in the United States and comprises four separate tests, Instructions, Synonyms, Metal Folding, and Technical Comprehension. The cognitive skill variable used in the analysis is a standardized version of the composite cognitive measure calculated by the military enlistment service, which ranges from one to nine.17
Noncognitive skills are measured through interviews carried out by certified psychologists.18 The interview contains assessments of psychological stability and endurance, capability of taking initiative, responsibility, and social competence. This evaluation results in a composite enlistment score of noncognitive skills, ranging from one to nine, which we then standardize in our analyses. Though the ultimate purpose of the interview is to evaluate the conscript’s ability to perform military service and to function in a war situation, it seems reasonable to assume that the character traits valued by the military psychologists (mentioned above), ought to be appreciated and rewarded also in the labor market.
Handgrip strength is measured as the maximum pressure exerted by the strongest hand squeezing a dynamometer. This variable is standardized when used in our analyses. Measuring handgrip strength is a rather cheap and simple procedure that correlates with the muscle strength of other parts of the body; it is therefore a common method for assessing overall muscle strength (for example, Gale et al. 2007).19 20
In our regressions, we control for age fixed effects. Because 99 percent of the conscripts enlisted at age 18 or 19 (86 and 13 percent, respectively), the age fixed effects picks up any nonlinearity in the age profile for earnings as well as anything specific for the enlistment year such as changes in the measurement of the enlistment variables. It is therefore reassuring that the results are insensitive to how we handle the age and age-when-enlisting variables; that is, including additional controls for age (fixed effects) when enlisting, or only including a control for linear age, does not change the results at all.
In a separate analysis, we are also interested in whether the height premium results from tall and short males being sorted into certain occupations or levels of education. For this purpose, we categorize the variable measuring occupation into 115 different occupational groups according to SSYK (Standard for Swedish Occupational Classification), a three-digit occupational classification code similar to the international classification (ISCO). These indicators are then entered as fixed effects in the regressions. A similar procedure is done for education levels according to SUN (Standard for Swedish Educational Classification).
It should be noted that almost everyone who enlisted during our study period also completed military service. For the vast majority of the study sample, the enlistment test results did not affect the decision on whether they would have to do military service or not. Instead, the tests merely influence the individual’s placement in the army, meaning that poorer results will typically lead to a less qualified and meriting placement. In this respect the incentives to deliberately underperform in the tests are limited. Still, in the online appendix (http://jhr.uwpress.org/) we perform some sensitivity tests in order to examine the role of “enlistment fakers” in our results.
Our data also contain information on parental years of schooling and earnings in 1980. For about 12–13 percent of the sample, however, information is missing on these variables. Moreover, for some of the other explanatory variables, information is missing for certain individuals.21 When there is missing information in a cell, we use the variable mean and create an additional binary variable indicator taking on the value one when information is missing and zero otherwise.
Before moving on to the empirical part of our work, we also need to address the age at which height is measured. Both Persico, Postlewaite, and Silverman (2004) and Case and Paxson (2008a) use height measures at different ages in their earnings regressions. From this perspective, a potential limitation of our study is that it only captures height at age 18. Hence, we are not able to assess the relative importance of height status in different childhood ages for earnings in adulthood. However, it should be noted that the association between height at a very young age and adulthood is in the order of 0.7. (See Case and Paxson 2008b.) In addition, if the association between height and earnings only reflects unobserved factors related to labor market success, inclusion of these factors in the regression should cause the coefficient of height to approach zero (Case and Paxson 2008a). This will then be the case irrespective of whether single or multiple measures of height are used. Hence, having only a single measure of height is less problematic, because our aim is to disentangle the importance of family environment, cognitive skills, noncognitive skills and handgrip strength, respectively, for the height premium.
The distributions for our key variables (log) earnings and height (at age 18) for the total population are illustrated in Figure 1. The figure shows that earnings rise with height across the entire height distribution and that the earnings per height rise is rather uniformly distributed in the midsection of the height distribution, but is more pronounced below than above mean height.
Logarithm of average annual earnings by height, cognitive skill, noncognitive skill, and handgrip strength. Total population. 28–38 years old in 2003
Notes: The upper-left graph shows average earnings for each centimeter in height. Cognitive skill is the enlistment test score in cognitive ability and the upper-right graph shows average earnings for every value of the cognitive test score (1–9). Noncognitive skill is the psychological evaluation when enlisting and the lower-left graph shows average earnings for every value of the noncognitive measure (1–9). The lower-right graph shows average earnings for every integer of the handgrip strength score.
We next address the raw associations between our main explanatory variables and earnings. Figure 1 shows that cognitive and noncognitive skills are strongly, positively, and rather linearly associated with earnings. The relationship between strength and earnings is more concave implying that the positive association weakens in the upper part of the strength distribution. Also, cognitive and noncognitive skills as well as strength correlate positively with height, as shown in Figure 2. Hence, the two measures of mental capacity as well as strength are associated with both height and earnings in a bivariate analysis. Descriptive statistics of the key variables used in the empirical analysis are given in Table 1 for those below and above the median height of 179 cm (5 feet 10 inches).
Average cognitive skill, noncognitive skill and handgrip strength by height. Total population. 28–38 years old in 2003
Notes: The height variable has been trimmed at the 1st and 99th percentile. Cognitive skill is the enlistment test score in cognitive ability, Noncognitive skill is the psychological evaluation when enlisting, and Handgrip strength is the enlistment test result when squeezing a dynamometer with the strongest hand. The graphs show average skills/strength for each centimeter in height.
Descriptive Statistics of the Population Divided at Median Height. Men 28–38 Years Old, 2003.
In the empirical analysis we control for family fixed effects, hence, using only variation between brothers to identify the earnings-height relationship. As shown in Figure 3, there is a fairly large variation in height, cognitive, and noncognitive skill between brothers.22
The distribution of the within-family difference in height, cognitive, and noncognitive skills
Notes: The within-family difference is calculated as the difference between the maximum and minimum value for each variable—that is, between brothers. 92 percent of the data consists of just two brothers within a family.
IV. Main Results
We start out with the full set of 448,702 observations, including only height and age in the earnings regression; see Table 2. This specification (Model A) serves as our benchmark (henceforth referred to as the “crude” association between height and earnings) and shows that a 10 centimeter increase in height is associated with 6 percent higher earnings.23 In Columns B through H, various variables that may act as either controls or mediating factors in the height-earnings relationship are added. The row in Table 2 denoted “Reduction (%) in the Model A height-earnings association” contains information about how much the magnitude of this crude estimate is reduced in each subsequent model.
The Height-Earnings Association per 10 cm Height Increase. Men 28–38 Years Old, 2003. Logarithm of Annual Earnings.
The height-earnings association is slightly reduced—by about one-sixth (to 0.052)—when adding parental socioeconomic characteristics to Model A; see Column B. Parental background, in terms of income and schooling, thus seem to play some role for the height-earnings relationship of their sons.
There may also be unobserved factors operating at the family level that plays an even greater role for the height-earnings relationship (for example, genetics, family practices, values, and norms). In order to further investigate the role of family background, we next turn to our sibling sample. We start by running OLS regressions, without fixed effects, in order to check if we obtain the same crude height-earnings relationship in this sample as for the full sample; see Column C. The estimated height coefficient is actually identical to the one obtained for the full sample. This is reassuring and shows that our sibling sample is likely representative of the greater sample.24
When we introduce sibling fixed effects (Model D), and thus account for unobserved factors at the family level, the magnitude of the height coefficient is reduced from 0.062 to 0.042. This suggests that factors operating at the family level explain about one-third of the crude relationship between height and earnings or, correspondently, that the height coefficient is upward biased by almost 50 percent when not accounting for family background. Moreover, these results suggest that it is both parental socioeconomic factors as well as other circumstances at the family level that matters for the height-earnings relationship.
Next we turn to one of the main themes of the paper: the role played by cognitive and noncognitive skills. Starting with cognitive ability, and holding family background constant, the results from Model E clearly show that this is an important determinant of earnings, since a one standard deviation increase in cognitive ability is associated with 10 percent greater earnings. Controlling for cognitive skill also further reduces the magnitude of the height coefficient by more than one-third, from 4.2 to 2.8 percent. This also implies that even when controlling for family background, neglect of incorporating this type of skill in the earnings equation still biases the estimated height premium upward by 50 percent. Under the assumption that this type of skill, and family background, are the only relevant control variables, we would thus be left with a statistically significant height premium of 2.8 percent. This also means that the estimate of the height premium in Columns A and C was upward biased by more than 100 percent when not accounting for family background and cognitive skills.25, 26
In order to understand the mechanisms behind the height premium that remains after including our controls, we next add noncognitive skill as a mediating factor between height and earnings to the regression (Model F), which further reduces the magnitude of the height coefficient. Assuming that the height coefficient of 2.8 in Model E reflects the causal effect of height on earnings, our results imply that 20 percent of this effect runs via formation of noncognitive skills.
To what extent does also muscle strength, as measured through handgrip strength, matter for the height-earnings relationship? Though it is uncertain whether handgrip strength should mainly be regarded as a control or mediating variable, adding it to the regression should at least indicate if it matters at all for the height-earnings relationship. Doing so (see Column G), we first note that it is a less important predictor of earnings than our skills measures (an increase in handgrip strength by one standard deviation increases earnings by 1.3 percent). Moreover, the coefficients of cognitive and noncognitive skills almost remain unchanged by its inclusion. Including muscle strength does, however, result in an additional decline in the magnitude of the height coefficient from 2.2 to 1.5, or by about 30 percent. If one believes that muscle strength is a mediating factor between height and earnings, this would imply that about half of the height premium in Model E is mediated through noncognitive skills and strength. Alternatively, if one views handgrip strength as mainly a control variable, the estimated height premium in Model F would be upward biased by almost 50 percent.27 Irrespective of the more precise pathways by which strength is linked to the height-earnings relationship, it does seem to be an important trait, also in the Western world. We leave the correct interpretation to future research.28
Summing up, our results so far shed new light on why tall persons earn more. More than half of the crude relationship is explained by factors that are related to both height and earnings, but not caused by height, such as family background and cognitive skills. This latter result thus supports the hypothesis of Case and Paxson (2008a) that tall people earn more partly because they are smarter. Our results also support those of Persico, Postlewaite, and Silverman (2004), however, since we find that non-cognitive skills as a mediating factor explains an important part of the estimated height premium. Adding muscle strength to the equation, our control and mediating variables explains about three-quarters of the crude relationship between height and earnings.
Although we are able to explain a large part of the crude height-earnings relationship, there still remains a significant height coefficient of 1.5 percent. As we soon will show in Section IV.A below, the remaining height premium seems to be caused by very short males earning less than others. One mechanism through which this result could arise would be if short people sort into lower paid occupations. To shed some light on this possibility, Model H adds occupation fixed effects to Model G, which results in a further reduction of the height coefficient by more than 40 percent to 0.009, and the coefficient is now statistically insignificant. The crude relationship between height and earnings in Model A is now reduced by more than 85 percent and we are thus able to explain almost the entire crude relationship between height and earnings.29 The sorting of short men into lower-paid occupations may reflect discrimination, where short men shy away from occupations where employers are perceived to discriminate against them. If this is the case, an important part of the height premium would reflect tastes for tall people, where sorting into occupations based on one’s height is an important mediating mechanism.30
A. A Nonlinear Height Premium
In this section, we relax the restriction of a linear height-earnings relationship by first estimating the relationship between height and earnings using centimeter fixed effects. Figure 4, which corresponds to the Model A specification of Table 2, shows that the relationship between height and earnings is slightly concave and reveals that what appears as a height premium may rather be interpreted as a penalty for being short because most of the action occurs at the lower part of the height distribution. As more explanatory variables are introduced into the model, the height-earnings relationship weakens in magnitude, but the concavity remains, as shown in Figure 5.31
The height-earnings association. Total population data. Model A
Notes: Earning differences relative to height = 180 cm (reference line). Data end coded at bottom/top 1 percent of the height distribution. This regression model corresponds to Model A in Table 2, exchanging the continuous height variable with centimeter fixed effects.
The height-earnings association. Total population data. Model G
Notes: This regression model uses the full set of variables (corresponds to Model G in Table 2), exchanging the continuous height variable with centimeter fixed effects.
Estimating the model with centimeter fixed effects also for the sibling sample is not informative because the confidence intervals become too wide. Instead, we estimate piecewise linear models using the visible breaks in the height premium at 172/173, 180/181, 188/189 revealed in Figure 4.32 This analysis confirms that the height-earnings association is more pronounced the lower the stature, being almost twice as large at the 165–172 height interval compared to the 173–180 height interval, while there is no significant height-earnings association above 188 centimeters (Table 3, Model B). As we introduce the skill and strength variables, the relationship between height and earnings weakens somewhat at all intervals, but the main picture remains. For instance, the estimates for Model E shows that earnings is only sensitive to variations in height below mean stature (180cm), and the height-earnings association is still the highest for the shortest category, implying that earnings rise with 0.5 percent per centimeter increase in height. However, as for the main analysis in Table 2, all height interval estimates become statistically insignificant when we introduce occupation fixed effects. See Model F, which indicates that an important mechanism of the remaining height premium is to be found in the sorting into different occupations by height.
The Height-Earnings Association at Different Height Intervals. Men 28–38 Years Old, 2003. Logarithm of Annual Earnings. Sibling Sample.
To conclude, the remaining height premium is mainly due to very short people having low earnings. Hence, in the search for what the remaining height premium constitutes, one has to look for theories explaining the earnings-height association at the bottom of the height distribution. We will return to this issue in our discussion.
In order to check the robustness of our results we have conducted a range of sensitivity analyses (the implications of which are thoroughly presented in the online appendix available at http://jhr.uwpress.org/). Summarizing, the results do not change to any important extent when we exclude a) individuals with low earnings (below 100’ SEK), b) sickness benefits from the income measure, c) potential enlistment “fakers” (that is, people scoring very low on any of the analyzed enlistment tests), d) people who were under the age of 30 in 2003 (when income was measured), or e) subdivide the data according to sector of employment (public or private).
V. Summary and Discussion
We show that an additional 10 centimeters in height, which is approximately 4 inches,33 is associated with 6 percent higher annual earnings among Swedish males. This is somewhat lower than the 10 percent height premium in wages found by Persico, Postlewaite, and Silverman (2004) and Case and Paxson (2008a), but could probably be explained by the institutional setting in Sweden, with the existence of “solidarity” wages and a rather narrow wage distribution.34 It should be noted that the association between height and paycheck in the three countries (wages in the United States, United Kingdom and earnings in Sweden) corresponds to the return to one additional year of schooling in the respective countries—about 10 percent in the United States and United Kingdom (Card 1999) and 6 percent in Sweden (using the data explored in this study). Hence, the height-earnings association in Sweden, though somewhat lower than the wage premiums in the United States and United Kingdom, is still substantial.35
We then turned to the reasons why tall people earn more. Based on the argumentation and evidence presented in the literature we divided the mechanisms into three distinct categories; control factors (family background and cognition), mediating factors (non-cognitive skill) and preferential treatment (discrimination). In addition, we brought in an additional factor, muscle strength, where there is greater uncertainty about its role.
Our results show that there is not one story as to why tall people earn more. Instead, our results support both the findings of Persico, Postlewaite, and Silverman (2004) and Case and Paxson (2008a). As in the latter study, we find that neglect of controlling for cognitive skills severely biases the estimate of the height premium upward. We also show that this result holds even when accounting for family background, which in itself reduces the magnitude of the height coefficient by one-third. But also non-cognitive skills matters, as was hypothesized by Persico, Postlewaite, and Silverman (2004). Our results suggest that noncognitive skill mediates a substantial part (about one-fifth) of the estimated height premium once cognitive skill and family background is controlled for.
As far as we know, the only other paper that has assessed both the role of cognitive and noncognitive skills is Schick and Steckel (2010). Whereas their data is richer in terms of the number of variables collected and the number of tests administrated at different ages, our data has clear advantages in terms of sample size, access to sibling data, and measurement of height and earnings. It is therefore interesting to note the qualitatively similar results obtained in both studies regarding the importance of both types of skills for understanding the height-earnings relation.
Somewhat surprisingly, we also find that muscular strength, also measured at enlistment, matters for the estimated height-earnings association. While we do not know whether muscle strength should be regarded as a control variable or mediating variable, including it into the earnings equation further reduces the magnitude of the height premium. With our control variables and mediating variables, we are able to account for three quarters of the crude age-adjusted relationship between height and earnings.
Incorporating all our control variables and mediators into the earnings equation, we are still left with a height premium of about 1.5 percent. Closer inspection reveals that this is due to very short people being penalized rather than tall people being rewarded. This may reflect discrimination against short people in the labor market and we show that this remaining penalty of being short is explained by short people sorting into low-paid occupations. Such behavior may arise if employers discriminate against short people or if short people shy away from perceived discriminatory employers. While discrimination is possible,36 we cannot rule out the remaining “penalty” instead is a result of some unobservable, productivity-enhancing factor being correlated with stature. For instance, if cognition and height are correlated via underlying nutritional and genetic factors, statistical discrimination against short people may arise, and as noted by Case and Paxson (2008a) such discrimination ought to be stronger early in the labor market career.
On a general level, the results presented underline the importance of cognitive and noncognitive skills for understanding how stature relates to labor market success. From this perspective, our results emphasize the importance of beneficial nutritional and social conditions during early stages of life. If inherently short or physically weak children are discriminated against by being excluded from taking on responsibilities or participating in activities that builds leadership and other types of noncognitive skill, an implication of our results is that school policies, aiming at giving children equal opportunities, should be considered. Moreover, early life interventions that targets nutrition and other factors that are inputs in the production of both cognitive skills and height could lessen the association between a person’s stature and his labor market success. This conclusion is further strengthened by our result suggesting that family background explain a large part of the height-earnings association. Clearly, there is something in the family background of certain children related to both short stature and low earnings potential that does not seem to run through skills, strength, or parental income and schooling.
Footnotes
Petter Lundborg is a professor of economics at Lund University in Sweden. Petter.Lundborg{at}nek.lu.se.
Paul Nystedt is an associate professor of economics at Linköping University in Sweden. Paul.Nystedt{at}liu.se.
Dan-Olof Rooth is a professor of economics at Linnaeus University in Sweden. Dan-Olof.Rooth{at}lnu.se. The authors thank two anonymous referees and seminar participants at Linnaeus University, Lund University, IFAU/Uppsala University, at the ESPE conference in Seville and at the EALE meeting in London for useful comments and suggestions. Research grants from the Centre for Economic Demography at Lund University and the Swedish Council for Working Life and Social Research are gratefully acknowledged. The data in this article belong to Statistics Sweden. Scholars seeking to replicate this work may consult Dan-Olof Rooth, Linnaeus University, 39182 Kalmar, Sweden, beginning August 2014 through July 2017.
↵1. Other studies confirm the positive association between height and labor market success, but commonly group height into a few distinct categories and/or are based on rather few observations, which make comparisons difficult (Loh 1993; Hamermesh and Biddle 1994; Harper 2000; Behrman and Rosenzweig 2004).
↵2. In recent work, Schick and Steckel (2010) use the NCDS and show that, taken separately, the incorporation of cognitive and noncognitive skills reduces the unconditional height-earnings association by about 40 percent each whereas, taken together, they reduce it by about two-thirds.
↵3. Note that the concept of the “height premium” refers to an individual and not a population return. That is, we study the effect of a single person being taller (but the population distribution staying the same) rather than a cohort getting taller.
↵4. The importance of cognitive skills for successful labor market outcomes is well established. (See for example Cawley, Heckman, and Vytlacil 2001.)
↵5. Endowments are defined as environmental conditions (such as health and nutrition), biological factors, genetic factors, and gene-environment interactions.
↵6. Though we will adopt this consensus view in our analyses, we note that there may be alternative mechanisms operating as well. For instance, one cannot completely rule out the possibility that the development of cognitive skill may also be influenced by a person’s height if taller children are subjected to greater intellectual challenges, expectations and opportunities. Moreover, Cunha and Heckman (2008) argue that cognitive and noncognitive skills may well be complementary in the sense that they promote the acquisition of each other, yielding interdependence between the two.
↵7. While we believe that available evidence points to the view that height does affect the development of noncognitive skills, we note that there may be important underlying factors governing both as well. As argued by Schick and Steckel (2010) important dimensions of noncognitive skill concern the ability to make judgment based on experience in favor of pure instinct (for example, to forego immediate returns in favor of future ones). Similarly to cognitive ability, this kind of ability increases with age in childhood and youth as the brain develops. Given that both cognitive and noncognitive skills are based on mental processes, it seems logical that nutritional influences may affect not only height and cognitive ability but via neural growth also the development of noncognitive skill. Insofar the association between height and noncognitive ability also is due to a third underlying factor (that is, nutrition), one cannot rule out that parts of this type of skill should be regarded as a control variable when estimating the correlation between height and earnings.
↵8. Related to this are findings indicating an important role of early life conditions for the development of both muscle strength and height. For the former outcome, studies have shown a positive association between birthweight and adult muscle strength (for example, Gale et al. 2001).
↵9. This also assumes that all relevant control variables have been included in the regression.
↵10. It should be noted, however, that the genetic loci identified by the largest study to date only explain about 10 percent of the phenotypic variation in height (Allen et al. 2010).
↵11. To be specific, a gene consists of two alleles, where a child randomly inherits one of the two alleles from each parent at the time of conception. (See, for instance, Fletcher and Lehrer 2011.)
↵12. There still remains the possibility, however, that the genes that affect height also directly affect earnings by some mechanism.
↵13. Height may also partly be determined by interactions between genes and environment, however. If there are important gene-environment interactions for height within families, the sibling approach would be less helpful in reducing bias.
↵14. The individuals had to live in Sweden during 1999 because many important variables, such as the enlistment information and the family information, are collected from the 1999 population data.
↵15. Because those in our sample enlisted during the years 1984–97, and earnings are followed up in 2003, we lose a small number of people due to death and emigration. There is no information available on why a particular individual did not enlist.
↵16. Moreover, nonnative ethnic groups have a much lower participation rate for enlisting since only about 50 percent (or less) are Swedish citizens, making selective participation an issue for these groups.
↵17. The general intelligence factor, G, is the variable used in this study. For more information about the G factor, see Carroll (1993). The separate tests and the composite measure have a correlation above 0.9.
↵18. Exact details of this part of the enlistment process are not publically revealed and we are merely referred to information on the intentions of the assessment procedure. It should be noted that our measure of noncognitive ability is a strong predictor of earnings, independently of cognitive skills. Lindqvist and Vestman (2011) provide a thorough discussion on the measure of noncognitive skills used in the enlistment process.
↵19. In the online appendix (at http://jhr.uwpress.org/) we also present the distributions for cognitive and noncognitive skills, as well as for muscle strength.
↵20. Height and muscle strength are not the only bodily dimensions to explore in relation to earnings. There is a large body of research on the impact of being overweight or obese on labor market outcomes. However, including BMI in the earnings equation has no impact on the height premium, see Lundborg, Nystedt, and Rooth (2009).
↵21. For handgrip strength and noncognitive skills less than 0.1 and 3 percent of the population, respectively, have missing information. For height and cognitive skills no information is missing (by data design).
↵22. Descriptive statistics for the sample of brothers is found in Table A1 in the online appendix.
↵23. In Section IV.A we show that the height premium is somewhat concave. Hence, this suggests the inclusion of height squared in the models being estimated. However, since this nonlinearity is not very strong and we are able to explain a large part of the height premium, this section follows the empirical specification used in Persico, Postlewaite, and Silverman (2004) and Case and Paxson (2008a) to facilitate the comparison of results.
↵24. Further, the reductions in the height premium that follows when introducing the explanatory variables— that is, cognitive and noncognitive skills and strength—are similar when analyzing the total data and the sibling data. (See Lundborg, Nystedt, and Rooth 2009.)
↵25. We also examined if the returns to height are smaller or larger at higher levels of cognitive skills by interacting height with cognitive ability. The point estimate of the interaction term suggested that the returns were greater at higher ability levels, but the estimate was rather small and statistically insignificant.
↵26. For a subsample of the data we have information on the four subtests of the composite cognitive test measure. Using this data we study whether the impact on the height premium varies depending on if more fluid-type cognitive tests (spatial ability and logic), which may be less affected by schooling, or less fluid tests (synonyms and technical comprehension) are controlled for. We find this not to be the case.
↵27. One could be worried that the noncognitive skill and handgrip strength variables are measuring the same underlying construct since both are expected to capture aspects of noncognitive skills at age 18. Their correlation is only 0.16, however.
↵28. We have also included month of birth as an additional regressor. Birth month could have an impact on the height-earnings relationship if children born in the winter months receive more negative health shocks, for instance, flu is more common in the winter, and therefore becomes shorter and that at the same time individuals of varying socioeconomic backgrounds have children at different periods of the year. However, the inclusion of birth month had no impact on the estimated relationship between height and earnings.
↵29. It is also possible that there is sorting into different education levels by height. The height coefficient is only reduced from 0.015 to 0.014, however, when adding education dummies to Model G. Moreover, adding education dummies to Model H does not affect the estimated height coefficient.
↵30. In order to illustrate which occupations short people tend to sort into, we have computed the fractions of people below 170 cm of height per occupational group (results not shown). Short stature is most frequent (11 percent) in the low-skill service sector consisting of cleaners, restaurant assistants, newspaper carriers, etc., whereas it is most uncommon (5 percent) in managerial positions in mid-and large-scale enterprises (such as CEOs, executive managers and directors).
↵31. This specification corresponds to Model G in Table 2 but uses the full sample.
↵32. We owe thanks to one of the referees for suggesting using the piecewise linear regression model.
↵33. Corresponding to the interquartile difference in height in the United Kingdom, United States, and Sweden.
↵34. Sweden has a much more compressed wage structure than the United States. In Sweden, the 90th percentile earns approximately twice as much as the 10th percentile, whereas in the United States they earn five times as much.
↵35. Its magnitude could also be related to the ethnic and immigrant earnings gaps in Sweden. The annual earnings of men born in Sweden with a non-European parental background is sixteen percent lower compared to the earnings of native Swedish men (Nordin and Rooth 2009), which, based on the raw results of this study, is equivalent to the difference in earnings between men who differ by about 27 centimeters in height. The observed Swedish gender earnings gap is 16 percent (Kumlin 2007).
↵36. See Rooth (2009) showing that discriminatory treatment of being obese and / or unattractive exists in the Swedish labor market using a field experimental approach sending fictitious job applications to real job openings.
- Received October 2011.
- Accepted December 2012.











