Returns to Skills and the College Premium
-
Upload
independent -
Category
Documents
-
view
1 -
download
0
Transcript of Returns to Skills and the College Premium
Returns to Skills and the College Premium
Flávio Cunha, Fatih Karahan, and Ilton Soares
The University of Pennsylvania
April 22, 2010
Abstract
A substantial literature documents the evolution of the college premium in the U.S. labor
market over the last 40 years or so. There are at least three different interpretations of this
fact: (1) Shifts in the relative supply of and demand for college versus high school labor;
(2) Shifts in the relative supply of and demand for skills in the college versus high school
sector; (3) composition effects. We investigate how each of these components contribute to
the dynamics of the college premium and find that all three play a role, but the increase in
the college premium is primarily driven by the first component. We also find that during the
1980s the college premium for high school workers diverged from the college premium for the
college workers and a substantial fraction of gap that opens up is primarily due to the increase
in the returns to cognitive skills.
1 Introduction
A substantial literature documents the evolution of the college premium in the U.S. labor market
over the last 40 years or so (e.g., Katz and Autor, 1999; Autor, Katz, and Kearney, 2008). Figure 1,
produced from the pooled dataset that we describe below, is roughly consistent with the evidence
from many studies based on the Current Population Survey (CPS) data. The college premium
starts at around 40 percent in the late 1960’s, goes down to about 30 percent in the late 1970’s,
sharply increases to 50 percent in the late 1980’s and finishes the 1990s at around 60 percent. Such
movements led economists to search for explanations. In order to illustrate several possible stories
it is helpful to consider the following potential outcome model:
ys,t = μs,t + αs,tθ + εs,t
where ys,t is the natural logarithm of labor income in schooling level s and calendar year t, μs,t is the
payoff associated with schooling s and year t, αs,t is the return to pre-college ability θ in schooling
1
level s and year t, and εs,t are the shocks in ys,t that are unanticipated by the individuals.12 Let
s = 0 and s = 1 denote "high school" and "college", respectively, and rt ≡ E (y1,t| t, s = 1) −E (y0,t| t, s = 0) denote the observed college premium. Its change from period τ to period t can be
decomposed into the following components:
rt − rτ =¡μ1,t − μ0,t
¢−¡μ1,τ − μ0,τ
¢+ [(α1,t − α0,t)− (α1,τ − α0,τ)]E (θ| τ , s = 1) (1)
+α0,t {[E (θ| t, s = 1)−E (θ| t, s = 0)]− [E (θ| τ , s = 1)−E (θ| τ , s = 0)]}+other terms
The expression above makes clear that there are at least three non-excludent stories that are consis-
tent with the movements in the college premium.3 The first and most common interpretation is that
the premium dynamics documented above are caused by shifts in the supply and demand for college
and high-school labor as presented in Katz and Murphy (1992) and Card and Lemieux (2001). In
equation (1), this is captured by changes in μ1,τ−μ0,τ over time. These shifts in supply and demandconditions raise or lower the college premium for all workers regardless of their pre-college ability
level θ.
The second explanation links the increase in the college premium to increases in the returns to
skills (Lee and Chay, 2000; Taber, 2001). In expression (1), this is captured by variation over time
of the term (α1,τ − α0,τ) evaluated at the mean ability in the college sector at the baseline period τ .
Murnane, Willett, and Levy (1995) present evidence that the returns to cognitive ability increased
from the 1970s to 1980s and that an important fraction of the rise in the college premium between
1978 and 1986 for young workers can be attributed to a rise in the return to ability. Blackburn and
Neumark (1993) reported that the rise in the economic return to education is concentrated among
those with high ability. Under this explanation, movements in the college premium reflect the shifts
in the supply and demand in the college sector for skills that are produced prior college.
From the viewpoint of a policymaker who is willing to intervene to prevent an increase in
inequality across schooling groups, the first two interpretations of the data suggest different policies.
In the first case, the returns to college can be manipulated by increasing college graduation regardless
of the ability of the individuals. In the second case, the policymaker should subsidize the formation
of skills made by families and individuals before they arrive at their college ages.4
The third explanation is that the sorting between schooling and ability has changed over time,
a point investigated in Carneiro and Lee (2008). In expression (1), this is represented by variation1In this paper we use the terms "ability" and "skills" interchangeably.2Note that shocks that are unanticipated are not known at the time that schooling choices are made, so it is not
possible for agents to select on them (Cunha, Heckman, and Navarro, 2005). As a result, E (εs,t|S = s) = 0.3The other terms in equation (1) refer to the change in mean ability in college multiplied by the differences
in returns to ability in period t, (α1,t − α0,t) [E (θ| t, s = 1)−E (θ| τ , s = 1)] , and the differences in mean abilityin the high school sector in period τ multiplied by the change in returns to ability in the high school sector.(α0,t − α0,τ ) [E (θ| τ , s = 1)−E (θ| τ , s = 0)]
4The latter policy could also increase college completion because the pre-college skills affect the college enrollmentand completion choices (e.g., Heckman, Stixrud, and Urzua, 2006).
2
in E (θ| τ , s = 1)−E (θ| τ , s = 0) evaluated at the returns to skills in the high school sector in thebaseline period, α0,τ . For example, consider the case in which α0,τ > 0 and that the pattern of ability
sorting is such that the individuals who are prone to move from high school to the college sector are
the ones who occupy the lowest quantiles of the college distribution of θ and the top quantiles of
the high school distribution of θ. Then an expansion of the supply of college workers would shrink
E (θ| t, s = 1) − E (θ| t, s = 0) relative to E (θ| τ , s = 1) − E (θ| τ , s = 0) and cause a reductionof the observed college premium even if there were no movements in μs,τ or αs,τ . Econometric
procedures that ignore such compositional effects may yield biased estimates of the economy-wide
elasticity of substitution between college and high-school labor because they attribute changes in
the composition to changes in μ1,τ − μ0,τ or α1,τ − α0,τ .
Our goal is to investigate how the returns to cognitive and noncognitive skills evolved from
the late 1960s to the early 1990s and how these movements in the returns to ability affected the
college premium. In order to do so, it is necessary to have good measures of skills and to collect
labor income (or wage) data over many cohorts and periods to separate year from age effects. To
the best of our knowledge, we don’t know a single dataset that contains this information. While
the National Longitudinal Survey (NLS/66) and the National Longitudinal Survey of the Youth
(NLSY/79) have relatively good measures of skills, their respondents were born only a few years
apart. This situation contrasts with that of the Panel Study of Income Dynamics (PSID): although
its respondents were born over many different decades and are followed over many different years,
their skills were assessed with a limited set of instruments in which measurement error is a more
serious concern. Our solution is to build on the econometric framework in Cunha, Heckman, and
Navarro (2005) so we can pool these three different datasets together to explore their relative
strengths. Specifically, we use the quality of the multiple measures of skills in the NLS/66 and
NLSY/79 to address the problem of measurement error in the measures of skills in the PSID. We
use error-corrected ability measures to estimate the returns to skills from the labor market data
over many different years and cohorts in the PSID, NLS/66 and NLSY/79. Once the returns to
skills are estimated, we can quantify their contribution to the evolution of the college premium.
Our work is related to that of Carneiro and Lee (2008), Heckman, Stixrud, and Urzua (2006),
Heckman and Vytlacil (2001), and Taber (2001). Building on Carneiro and Lee (2008), we model
schooling explicitly and we use variation in tuition within states and across years as an exclusion
restriction that induces variation in schooling choices. Unlike these authors, we directly measure
different dimensions of abilities so we can link changes in composition to changes in specific skills.
We build on Taber (2001) by analyzing the roles of cognitive and noncognitive skills. Our dataset
covers a longer horizon in terms of calendar years and individuals ages. We also build on Heckman,
Stixrud, and Urzua (2006) in estimating the economic relevance of cognitive and noncognitive
skills. While they investigated a large number of outcomes and wages at age 30, we investigate how
cognitive and noncognitive skills affect income and the college premium over many different years
and cohorts. We address the identification of time and age effects raised by Heckman and Vytlacil
3
(2001) by pooling together three different datasets that combine labor market information of many
cohorts from late 1960s to early 2000s. Finally, we build on the identification analysis of Carneiro,
Hansen, and Heckman (2003) by allowing for an ARMA representation of the serial correlation of
the residuals.
We find that the dynamics of the returns to cognitive skills in the college sector are remarkably
similar to that of the college premium: the returns decrease during the 1970s, suddenly increase
in the 1980s, and continue to grow at a lower rate during the 1990s. We don’t find evidence of
trends in returns to cognitive skills in the high school sector or to noncognitive skills in high school
or college sectors. Composition changes also contribute to the dynamics of the college premium
and its role is particularly important in the 1990s. Nothwithstanding these findings, our results
indicate that most of the increase in the college premium is driven by the increase in μ1,τ −μ0,τ . Wealso estimate the factual college premium for actual college workers and the counterfactual college
premium for the high school workers. We find that they were about the same in the early 1970s
and they diverge in the early 1980s: the college premium goes up for college workers and remains
constant for the high school workers and only starts to go up for this group in the late 1980s. This
fact is substantially driven by the increase in the returns to cognitive skills.
The paper is organized as follows. In Section 2, we present our data and show that it contains
measures of cognitive and noncognitive skills that are comparable over the different datasets that
we combine for the purposes of this study. In section 3, we present our econometric framework and
discuss our modeling assumptions in detail. In section 4, we show that the model is identified. In
Section 5, we describe our empirical findings. Section 6 concludes.
2 The Data
The econometric framework we describe below is tailored to the characteristics of the datasets that
we combine. For this reason, we start by describing our data.
2.1 Schooling and Labor Income Data
We focus on white males from three different data sets. We restrict the sample to individuals
who are in one of two schooling groups. The first group is formed by the individuals that have
a high-school diploma and exactly twelve years of education (s = 0). The second group is formed
by the individuals who graduated from college. Table 1 shows the fraction of individuals with a
college education for different cohorts of birth and survey membership. In our dataset, the fraction
of college graduates follows an inverted U-shape pattern. Thirty-seven percent of the individuals in
the first cohort are college educated, while about 50 percent did so in the third cohort, and only 44
percent in the sixth cohort. Importantly, an F test does not reject the null hypothesis that there
are no differences in the schooling data from PSID, NLS/66, and NLSY/79.
4
We analyze labor income from 1968 to 2001 for individuals who are between ages 23 and 56.
The Lexis Diagrams 1A-1D show one of the reasons that make the pooling of the datasets necessary.
Lexis Diagram 1A shows that the NLS/66 has labor income data for individuals between ages 23
and 39 over some of the years between 1968 and 1981. Lexis Diagram 1B displays the age-year
combinations in the NLSY/79. We see that it has almost no overlap with the NLS/66. Lexis
Diagram 1C, which is based on the labor income availability data from the PSID, shows that there
is considerable variation of age-year pairs. The problem with the PSID dataset is that it has only
one measure of cognitive ability that is very noisy in comparison to the measures of ability available
in the other two datasets. Lexis Diagram 1D shows the combination of age-year pairs in the pooled
data that resembles that of the PSID but also incorporates the precise ability measures that were
collected in the National Longitudinal Surveys.
Figures 2A and 2B displays the mean log earnings by age of high school and college graduates,
respectively. More precisely, we plot the profiles by age for each survey as well as a one-standard
deviation bar in each direction.5 To construct these figures, we regressed log income against a
quadratic term of age, an indicator for south and urban location at age 14, cohort dummies, year
dummies, survey membership dummies, and the interaction of survey membership dummies with
age.6 In the three different datasets, the log income-age profiles are similar, except that the NLS/66
high-school individuals tend to have higher levels of income. The null hypothesis that the age profiles
of log labor income are equal across the different surveys was rejected by a standard F test. However,
we could not reject the null hypothesis that the age profiles of log labor income are equal within a
cohort across the different surveys. This indicates that the differences in age profiles results from
changing profiles across cohorts, as documented in Card and Lemieux (2001).
Table 2 presents descriptive statistics on the NLSY/79, NLS/66, and PSID samples, respectively.
In the three samples, college graduates have higher test scores, fewer siblings, parents with higher
levels of education, and are more likely to live in locations where the tuition for four-year college is
lower. 7
2.2 Measures of Skills
We next describe the measures of cognitive and noncognitive skills obtained from each sample.
There are several measures of cognitive and noncognitive ability in the NLSY/79 that were directly
obtained from the respondent. Some of the tests were administered in the first rounds of the study.
The NLS/66 obtained one measure of cognitive skills from the high school records of the individuals
in their sample, but administered the noncognitive test in the 1966 round. The PSID collected
information on skills in the rounds between (and including) 1968 to 1972.
5Earnings figures are adjusted for inflation using the CPI and we take the year 2000 as the base year.6By survey membership dummy we mean a variable that takes the value one if the individual’s response comes
from the NLS/66 and zero otherwise. We defined a similar variable for membership in the NLSY/79.7See Cameron and Heckman (2001) for details on the construction of the tuition variables used in this paper.
5
2.2.1 Cognitive Skills
According to the American Psychological Association, cognitive ability is the “ability to understand
complex ideas, to adapt effectively to the environment, to learn from experience, to engage in
various forms of reasoning, to overcome obstacles by taking thought” (Neisser et al. 1996, p. 77).
In practical terms, psychologists measure cognitive skills by IQ tests. There is ample evidence
that different IQ tests are highly intercorrelated (see the discussion and the evidence in Borghans,
Duckworth, Heckman, and ter Weel, 2008). The following different tests designed to measure IQ
are present in at least one of the three different data sets that we combine: (1) The components
of the Armed Services Vocational Aptitude Battery (ASVAB); (2) The Scholastic Aptitude Test
(SAT); (3) The American College Test (ACT); (4) The Otis-Lennon Test; (5) The Lorge-Thorndike
Test; (6) The California Test of Mental Maturity; (7) The Sentence Completion test in the PSID. A
different type of test that we use are the Components of the Knowledge of World of Work, routinely
used as vocational tests. As we show in our results, they are correlated with the typical measures
of IQ above. We briefly describe these measures next.
The ASVAB is a multiple choice test used to determine qualification for enlistment in the United
States armed forces. During the summer and fall of 1980, 94 percent of NLSY79 respondents took
the ASVAB. It contains ten components (Miller, 2004): (GS) The General Science Knowledge mea-
sures knowledge of the physical and biological sciences; (AR) The Arithmetic Reasoning tests the
ability to solve arithmetic word problems; (WK) The Word Knowledge is designed to evaluate the
capacity to select the correct meaning of a word that is presented in a context; (PC) The Paragraph
Comprehension quantifies the capacity to understand written passages; (NO) The Numerical Oper-
ations assesses the ability to perform arithmetic computations within a time limit; (CS) The Coding
Speed estimates the capacity to assign code numbers to words; (AI) Auto and Shop Information
appraises the knowledge about automobiles and tools; (MK) Mathematics Knowledge evaluates the
knowledge of high-school math; (MC) Mechanical Comprehension determines the knowledge about
mechanical and physical principles; (EI) Electronics Information conveys knowledge about electric-
ity and electronic. We use six components of the ASVAB test battery: AR, WK, PC, MK, NO,
CS. The first four are used to compute the Armed Forces Qualification Test, which loads heavily
on IQ. Researchers who base their studies on the NLSY/79 sample have used these scores routinely
(e.g., Bishop, 1991; Blackburn and Neumark, 1993; Taber, 2001; Cunha, Heckman, and Navarro,
2005; Hansen, Heckman, and Mullen, 2004; Heckman and Vytlacil, 2001; and Neal and Johnson,
1996). Unfortunately, the ASVAB components are available neither in the NLS/66 nor in the PSID
surveys.
Scores in the following tests were collected by both the NLS/66 and NLSY/79 surveys. The
SAT and the ACT are standardized tests used in college admissions in the United States. The SAT
consists of three major sections: Critical Reading, Mathematics, and Writing. The ACT covers
four skill areas: English, Mathematics, Reading, and Science. According to Vane and Motta (1984),
both the SAT and ACT load heavily on IQ. The Otis-Lennon School Ability Test (Otis and Lennon,
6
1977) is a test intended to provide a measure of IQ. It is organized in the following five areas: verbal
comprehension, verbal reasoning, pictorial reasoning, figural reasoning, and quantitative reasoning.
The Lorge-Thorndike Intelligence Test (Lorge and Thorndike, 1966) is a test that measures abstract
intelligence and contains both verbal and nonverbal items. The verbal battery is made up of five
subtests that include vocabulary, verbal classification, sentence completion, arithmetic reasoning,
and verbal analogies. The nonverbal battery contains subtests of pictorial classifications, pictorial
analogies, and numerical relationships. The conglomerate of different tasks is an attempt to measure
IQ, which is assumed to be the only common component across all intellectual processes measured
by the test (Goldstein and Hersen, 2000). The California Test of Mental Maturity (Traxler, 1939)
is designed to measure memory, spatial relationships, reasoning, and vocabulary. The test yields
three intelligence quotients: an IQ for language factors and an IQ for non-language factors, as well
as the usual type of IQ based on total scores. The IQ for language factors is based on tests of
delayed recall, number quantity, inference, and vocabulary. The IQ for non-language factors results
from tests of immediate recall, sensing right and left, manipulation of areas, foresight in spatial
relations, opposites, similarities, analogies, number series, and number quantity.
Finally, for the PSID survey, we use the sentence completion test, which is based on the verbal
component of the Lorge-Thorndike Intelligence Test (Veroff, McClelland and Marquis, 1971).
A different set of tests that we also use as proxy for cognitive skills are the "Knowledge of
the World of Work" (KWW) test. These tests are designed to measure vocational aptitude of
individuals, but as we show below they correlate with measures of cognitive skills. The KWW
test consists of three components. The first requires a respondent to identify, in a multiple-choice
format, the duties of ten occupations: (A) hospital orderly, (B) machinist, (C) acetylene welder,
(D) stationary engineer, (E) statistical clerk, (F) fork lift operator, (G) economist, (H) medical
illustrator, (I) draftsman, and (J) social worker. The second component involves identification of
the typical educational attainment of men in each of these same ten occupations. Four choices were
listed: “less than a high school diploma, a high school diploma, some college, a college degree.”
The third component is based on a respondent’s judgment, for each of eight pairs of occupations, as
to which one yields the higher average annual earnings. The eight occupation pairs are: (A) auto
mechanic/electrician, (B) medical doctor/lawyer, (C) aeronautical engineer/medical doctor, (D)
truck driver/grocery store clerk, (E) unskilled laborer in steel mill/in shoe factory, (F) lawyer/high
school teacher, (G) high school teacher/janitor, and (H) janitor/policeman.
Table 3 shows the distribution of individuals’ cognitive test scores across the different quartiles,
both for high school and college graduates. From this table it is clear that the lower quartiles
are composed mostly by high school graduates and the upper quartiles by college graduates. The
largest differences are observed in the first and fourth quartiles. If we average over the tests, around
70 percent of the high school graduates are below the median. This number is 34 percent if we
consider the college graduates. Even more remarkable is the difference when we consider only the
first quartile. On average, more than 40 percent of the high school graduates and only 12 percent
7
of the college graduates scored below the first quartile. Moving to the top two quartiles the results
are inverted. On average, more than 66 percent of the college graduates are above the median and
only 30 percent of the high school graduates are in this category. If we focus on the top quartile, we
see that on average 9 percent of the high school graduates and 35 percent of the college graduates
are in this group.
2.2.2 Noncognitive Skills
Many different personality traits are lumped into the category of noncognitive skills. Psychologists
have developed batteries of tests to measure these skills (e.g., Sternberg, 1985). Companies use
personality tests to screen workers, but they are not yet widely used to ascertain college readiness
or to evaluate the effectiveness of schools or reforms of schools. The literature on cognitive tests
shows that one dominant factor (“IQ”) summarizes cognitive tests and their effects on outcomes.
No single factor has emerged as dominant in the literature on noncognitive skills and it is unlikely
that one will ever be found, given the diversity of traits subsumed under the category of noncognitive
skills.
Studies by Bowles and Gintis (1976) , Edwards (1976), and Klein et al (1991) demonstrate that
job stability and dependability are the traits most valued by employers as ascertained by supervisor
ratings and questions of employers, although they present no direct evidence of the effects of these
traits on wages and educational attainment. Perseverance, dependability and consistency are the
most important predictors of grades in school (Bowles and Gintis, 1976).
Self-reported measures of persistence, self-esteem, optimism, future orientedness, and the like
are collected in major data sets, and some recent papers discuss estimates of the effects of these
measures on earnings and schooling outcomes (Bowles and Gintis,1976; Claessens et al, 2009).
Heckman, Stixrud, and Urzua (2006) present evidence that both cognitive and noncognitive skills
affect schooling and the returns to schooling. We use one of the measures of noncognitive skills used
in their study: the Rotter Locus of Control scales. The Rotter scale is based on questions about the
extent to which respondents believe themselves to have control over the various domains of their
lives. A higher score indicates more control over one’s life. Comparable measures of Rotter Locus
of control are collected in the three different surveys that we pool together.
In the formulation proposed in Rotter (1966), individuals with a high internal locus of control
believe that events are the result of their own behavior and actions. Those with a high external
locus of control believe that powerful others, fate, or chance primarily determine events. Reflecting
the notion, Rotter showed that individuals with a high internal locus of control have better control
of their behavior and are more likely to assume that their efforts will be successful. They are more
active in seeking information and knowledge concerning their situation. Caliendo et al (2010) find
evidence that individuals with an internal locus of control search more and that individuals who
believe that their future outcomes are determined by external factors have lower reservation wages.
The NLS/66 has 10 components of the Rotter scale, the NLSY/79 has only four, and the PSID
8
has three but measured on a slightly different scale.8 The three common components in all the
surveys are the instruments referring to an individual’s beliefs about the results of planning (‘When
I make plans, I am almost certain that I can make them work’); the importance of luck on success
(‘In my case getting what I want has little or nothing to do with luck’); and an individual’s control
of his life (‘What happens to me is my own doing’). The fourth component of the Rotter scale
that is commonly asked in NLSY/79 and NLS/66 is also related to how shocks determine positive
or negative outcomes (‘Most people don’t realize the extent to which their lives are controlled by
accidental happenings’).
Locus of Control is a subset of the Motivational Theory developed by Atkinson (1964), who
saw motivation as the combination of motives and expectations. Atkinson’s motives coincides with
Murray (1938) and McClelland et al (1953) theory of "Need for Achievement", "Need for Power",
and "Need for Affiliation". Expectancies, on the other hand, consist of an individual’s beliefs about
his capacity to influence the positive realization of a desired outcome. In the labor market context, it
refers to the beliefs a particular individual has that his efforts will lead to a promotion (as advanced
in Rotter, 1975). Because motivational theory and Rotter’s Locus are linked, we supplement the
Rotter scale in the PSID survey with one item that measures the need for achievement versus
need for affiliation (‘do you prefer a job in which you have to think for yourself or a job with nice
colleagues?’)
Additionally, Bouchard and Loehlin (2001) present evidence that individuals who score high on
Locus of Control are also more likely to score high on Conscientiousness, one of the five dimensions
of noncognitive skills measured by the Big Five.9 Because of this relationship, we use two measures
of conscientiousness from the PSID survey. The first measure concerns the amount of planning
an individual tends to make to pursue the goals (‘Are you the kind of person that plans his life
ahead all the time, or do you live more from day to day?’); and the second measures an individual’s
commitment on finish tasks that he or she starts (‘Would you say you nearly always finish things
once you start them, or do you sometimes have to give up before they are finished?’).
Table 4 presents the proportion of people in each alternative of each noncognitive skill test, for
both high school and college graduates. Higher values in each one of these tests indicate stronger
internal locus of control, higher likelihood of planning and carrying out planning to the end, or
preference for a job that indicates lower level of affiliation. This feature implies that our analysis
can be simplified if we focus on the extremes. When considering only the highest value for each
test, we see that the proportion of college graduates is higher than that of high school graduates
for all but one test (Rotter K). On the other hand, if one considers the minimum value of each
8The PSID scales were used by Dunifon and Duncan (1998).9The Big Five are the five dimensions routinely used to summarize noncognitive skills. The definition and
description of following dimensions of personality competencies follow Borghans et al (2008). Openness to Experiencemeasures the need for intellectual stimulation, change, and variety. Conscientiousness captures the compliance withrules, norms, and standards. Extroversion quantifies the capacity to interact socially. Agreeableness rates the needfor pleasant and harmonious relations with others. Finally, Neuroticism assesses how much the person experiencesthe world as threatening and beyond control.
9
alternative than for all tests the proportion of high school graduates is bigger than that of college
graduates. The general conclusion one can get from this table is that, as for the case of cognitive
skill tests, the performance in noncognitive skill tests of the college graduates is superior to the one
of high school graduates.
2.2.3 The Relationship Between Measures of Cognitive and Noncognitive Skills andObservable Individual Characteristics
Figures 3A to 3D present some features of the Cognitive Test Scores regarding their relation with
parental education, number of siblings and schooling choice. Figures 3A to 3D show the same
analysis for the Noncognitive Test Scores.
In Figure 3A we have the partial correlation between each one of the cognitive test scores
and maternal education, controlling for urban and south residence, paternal education, number of
siblings, and cohort dummies. All but one of these tests exhibit positive partial correlation with
maternal education and this single negative correlation has a considerable part of its confidence
interval assuming positive values. It is worth to note the overlap of the confidence intervals of the
partial correlations for each of these tests with maternal education. If one draw a horizontal line
at 0.10 (as shown in the figure), this will cross all but one of the confidence intervals for the partial
correlation. A similar result is found in Figure 3B, now for the partial correlation between each
one of the cognitive test scores and paternal education, controlling for the same variables as before
and changing paternal for maternal education. Figures 3A and 3B together can be interpreted as
parental education having a positive relation with the individual performance in cognitive tests.
Figure 3C displays the partial correlations between each cognitive test score and the number of
siblings. The results point that, controlling for urban residence, south residence, parental education
and cohort dummies, individuals with higher cognitive test scores have, in general, fewer siblings. In
this case, there is a sub-interval common to all confidence intervals, so if one does the same exercise
of drawing a horizontal line, say at -0.04, this will cross all confidence intervals. The relation between
the cognitive test scores and the schooling choice appears in Figure 3D. These values correspond
to the coefficient of the specific cognitive test score in a probit regression, controlling for urban
and south residence, parental education, tuition costs and cohort dummies. This figure shows
that, in general, higher cognitive test scores are more likely to be associated to college graduates,
independent on the measure used. Even though the intersection of the confidence intervals for these
coefficients is less pronounced than in the past figures, we still observe some relation among the
different estimates.
Figures 4A to 4D are the analogous of 3A to 3D for the noncognitive test scores. In Figures 4A
and 4B we can see that as in the cognitive test scores case, the noncognitive tests also present a
consistent positive partial correlation with parental education, controlling for the same variables as
before. The similarities also hold for the intersections of the confidence intervals for the different
measures of noncognitive skills. As opposed to the case of cognitive tests, Figure 4C shows that,
10
if any, the relation between better performance in the noncognitive tests and number of siblings is
positive. Eight out of the ten measures studied exhibit a considerable intersection in the positive
axis of their confidence intervals for the partial correlation between noncognitive test scores and
number of siblings. Figure 4D reproduces the exercise done in Figure 3D, but now relating schooling
choice and noncognitive test scores. As in the former case, we can conclude that for all the analyzed
measures of cognitive skills, higher scores are more likely to be associated to college graduates than
to high school graduates.
3 The Econometric Framework
We now present a framework to analyze the pooled data described above. Specifically, our econo-
metric model addresses the issues that (i) skills are measured with error and in the PSID there
is only one (potentially very noisy) measure of cognitive skills; (ii) that individuals choose their
education levels and (iii) that their choice may be driven by skills as well as other observable and
unobservable characteristics of the individuals.
3.1 The Model for Labor Income
Let Ys,a,t denote the labor income of an individual who chooses schooling level s and is a years-old at
the calendar year t. Log earnings ys,a,t = lnYs,a,t are assumed to depend on observable characteristics
xs,a,t, cognitive and noncognitive skills (θc, θn), and unobservable components vs,a,t in the following
way:
ys,a,t = xs,a,tβs + αc,s,tθc + αn,s,tθn + vs,a,t (2)
The parameters βs depend on schooling choice, but we assume that they are constant over age
a and year t. The terms αc,s,t and αn,s,t are the returns to cognitive and noncognitive skills at
schooling level s and year t, respectively. The error terms vs,a,t are assumed to satisfy the condition
E (vs,a,t|xs,a,t, θc, θn) = 0, s = 0, 1; a = 1, ..., A; and t = 1, ..., T.We decompose the error terms in theearnings equations into a heterogeneity factor θh and an error term εs,a,t. The heterogeneity factor
may capture other dimensions of noncognitive skills that are orthogonal to "Locus of Control". We
denote by αh,s,t its factor loading. We assume that v0,a,t and v1,a,t can be written in factor-structure
form
vs,a,t = αh,s,tθh + εs,a,t s = 0, 1, t = 1, . . . , T. (3)
The idiosyncratic error term εs,a,t follows a structure with transition laws that are specific to school-
ing level s at period t: εs,a,t = πs,a,t + s,a,t, where πs,a,t follows an AR(1) process with persistence
ρs:
πs,a+1,t+1 = ρsπs,a,t + ηπs,a+1,t+1 (4)
11
and s,a,t follows an MA(1) process with coefficient φs:
s,a+1,t+1 = φsηs,a,t + ηs,a+1,t+1. (5)
3.2 The Schooling Choice Model
Agents make schooling choices based on expected present value utility maximization given infor-
mation set I. Let u (c) denote the agents utility when consumption is c. We assume that agents
cannot borrow or lend, so at each point in time consumption equals income:
cs,a,t = Ys,a,t = eys,a,t (6)
This assumption implies that consumption tracks income, which is at least partially consistent with
the data. For example, Carrol and Summers (1991) find that the growth rate of consumption is
larger for individuals that choose education levels also associated with higher income growth. The
same is true when they compare the life-cycle consumption profile of occupation groups that differ
in income growth. More recently, Fernandez-Villaverde and Krueger (2007) show that consumption
expenditure is hump shaped, especially when one does not drop the expenditure on durables. Fur-
thermore, consistent with the evidence from Carrol and Summers (1991), the hump shape profile is
more pronounced for individuals with higher levels of education. Fernandez-Villaverde and Krueger
(2010) offer an interpretation that the hump-shape profile of durable and nondurable consumption
expenditures could be explained by the lack of intertemporal markets for durable goods. This leads
households to build up their durable stocks little by little. Because durable and nondurable expen-
ditures interact in the budget constraint for a given time period, the way households finance the
expenditure on durables is by postponing some of the expenditure on nondurables. In this paper,
we do not model the consumption decision, primarily because both NLS/66 and NLSY/79 have no
information on consumption, while the PSID reports expenditures on food. In any case, we write
the index V as net present value of lifetime utility of graduating from college. Then:
V = E
"AXa=1
µ1
1 + ρ
¶a−1(u (c1,a,t)− u (c0,a,t))− C
¯¯I#, (7)
where C denotes the costs regarding a college education, which we decompose as
C = zγ + αc,Cθc + αn,Cθn + αh,Cθh + εC . (8)
Let θ = (θc, θn, θh) and αs,t = (αc,s,t, αn,s,t, αh,s,t) . In equation (8) z corresponds to observable
characteristics and the error term εC is independent of θ, x, z, εs,a,t, s = 0, 1, a = 1, ..., A, t =
1, . . . , T . We assume that x, z, θ, εC ∈ I, but that εs,a,t /∈ I, with E [εs,a,t| I] = 0 for all s, a, t. Let
12
x = {x0,a,t, x1,a,t}Aa=1 . Under these assumptions, it is easy to show that we can rewrite V as:
V = μ (x, z) + θα0V − εC (9)
where μ (x, z) =PA
a=1
³11+ρ
´a−1(x1,a,tβ1 − x0,a,tβ0) − zγ and α0V =
PAa=1
³11+ρ
´a−1 ¡α01,t − α00,t
¢−
α0C .
3.3 The Model for Cognitive and Noncognitive Test Scores
In addition to data on earnings and choices, we also have access to data on a set of cognitive
and noncognitive test scores. Let Mk,j denote the agent’s score on the kth test on skill j, where
j ∈ {c, n}. Assume that the Mk,j have finite means and can be expressed in terms of conditioning
variables XMk,j. Write
Mk,j = XMk,jβ
Mk,j + θjα
Mk,j + εMk,j and E
¡εMk,j¯θj,X
Mk,j
¢= 0. (10)
where the αMk,j are “factor loadings”, i.e. coefficients that map θj into Mk,j. Both the decision
to attend college and realized earnings likely depend on the cognitive and noncognitive skills that
agents have at the time their schooling choices are made. Following the psychometric literature, the
factors (θc, θn)might be interpreted as a latent cognitive and noncognitive abilities which potentially
affect all test scores. We assume that (θc, θn) is independent of XM =©XM
k,j
ªKj
k=1,j∈{c,n} and εMk,j,
withKj indicating the number of tests of skill j. The εMk,j are mutually independent and independent
of (θc, θn). Modeling test scores in this fashion allows them to be noisy measures of cognitive and
noncognitive abilities.10
4 Identification of Parameters
In this section we discuss identification of the model. We focus on parametric identification because
this is the estimation strategy that we pursue in the paper. We then discuss how the proof could be
extended to a semiparametric setting. More specifically, we assume that the factors are normally
distributed, so θ ∼ N (0,Σθ) . In what follows, we assume that the diagonal elements of Σθ are equal
to one. This is essentially a necessary normalization imposed on factor models because the scales
of the factors are never identified.
4.1 Identification of the Distribution of Skills
We start discussing identification of the test score equations, which are considerably easier. The
reason this is so is because we do not worry about selection issues with regard to the scores that are10The identification of the model can be secured under much weaker conditions by applying the analysis of Schen-
nach (2004).
13
observed in the data. As we show below, it is necessary to have at least three cognitive and three
noncognitive tests available in the dataset. We assume that the error terms εMk,j are independent
random variables that are normally distributed with mean zero and variance¡σMk,j
¢2. Under our
assumptions an OLS regression of the test score Mk,j against XMk,j identifies the parameters β
Mk,j.
Let Mk,j =Mk,j −XMk,jβ
Mk,j. We can compute the conditional covariances:
Cov³M1,c, M2,c
´= αM
1,cαM2,cV ar (θc) ,
Cov³M1,c, M3,c
´= αM
1,cαM3,cV ar (θc) ,
Cov³M2,c, M3,c
´= αM
2,cαM3,cV ar (θc) .
We imposed V ar (θc) = 1. We pick
αM1,c =
vuuutCov³M1,c, M2,c
´Cov
³M1,c, M3,c
´Cov
³M2,c, M3,c
´ .
We can recover αM2,c from Cov
³M1,c, M2,c
´and αM
3,c from Cov³M1,c, M3,c
´. The variances of the
error terms,¡σMk,c
¢2, are identified from V ar
³Mk,c
´. We can repeat the analysis for noncognitive
tests and identify the factor loadings.
The following moment identifies the covariance between cognitive and noncognitive skills:
Cov³M1,c, M1,n
´= αM
1,nαM1,nCov (θc, θn) . (11)
The identification of this covariance is interesting per se, but it is also important in the current
context because this is the only type of covariance that can be used to identify the factor loading
and variance of measurement error in the cognitive test that was given to the PSID respondents.
Obviously, identification requires Cov (θc, θn) 6= 0. Identification of these parameters can be war-
ranted even if such condition fails and we return to this issue when we discuss the identification of
the cognitive and noncognitive factor loadings in the schooling choice equation. If the cognitive and
noncognitive skill factors were the only factors in the model, then we would have already identified
the joint distribution of the factors exclusively from the information contained in the test score
equations. We will have more factors, so we will need to explore information from schooling choice
and log earnings to identify the distribution of the remaining factors. Although the approach de-
scribed here explores the parametric structure, it is possible to extend it to a nonparametric setting
by using the argument in Kotlarski (1967).
14
4.2 Identification of the Parameters in Earnings and Schooling Choice
Equations
The identification of the parameters in the log earnings and choice equations requires more work
because college log earnings are observed only for those individuals that chose to go to college. The
same problem is true for the high-school log earnings. Therefore, it is necessary to address the issue
of selection to obtain identification. To see how this arises in our model note that the mean college
earnings in the data are computed only using the subsample of those who chose to go college, which
is the sample equivalent of E (y1,a,t|x1,a,t, S = 1), but not E (y1,a,t|x1,a,t) . This implies that:
E (y1,a,t|x, z, S = 1) = x1,a,tβ1 +E¡θα01,t
¯x, z, S = 1
¢+E (ε1,a,t|x, z, S = 1) .
By assumption, ε1,a,t /∈ I with E (ε1,a,t| I) = 0. As a consequence, ε1,a,t does not affect schoolingchoices because it is not known at the time these choices are made. From (9) we know that
S = 1⇐⇒ εC − θα0V ≤ μ(x, z). Generally, this information is not very helpful because if we don’t
know the distribution of θ and εC , then we don’t obtain closed-form solutions for the conditional
moment E¡θα01,t
¯x, z, εC − θα0V ≤ μ(x, z)
¢. However, if the factors θ and the error term εC are
normally distributed it is possible to show that:
E¡θα01,t
¯x, z, εC − θα0V ≤ μ(x, z)
¢= −γ1,a,tκ (μ(x, z)) ,
where γ1,a,t =Cov(θα01,t,εC−θα0V )
V ar(εC−θα0V ), μ(x, z) = μ(x,z)
V ar(εC−θα0V ), κ (μ(x, z)) = φ(μ(x,z))
Φ(μ(x,z)), and the functions φ
and Φ are the density and distribution of standardized normal random variables. The mean college
log earnings equation satisfies:
E (y1,a,t|x, z, S = 1) = x1,a,tβ1 − γ1,a,tκ (μ(x, z)) .
If we knew the parameters that determine μ(x, z), then we could recover the parameters β1 and γ1,a,tfrom an OLS regression of y1,a,t against x1,a,t and κ (μ(x, z)) . Luckily, we can explore the information
from the schooling choice. Note that under normality assumptions Pr (S = 1|x, z) = Φ (μ(x, z))
and Pr (S = 0|x, z) = 1 − Φ (μ(x, z)) . By estimating this probit we can identify the parameters
that characterize μ(x, z). We use that information to recover β1 and γ1,a,t. This is the two-stage
estimation procedure developed in Heckman (1976). A similar analysis for the high-school log
earnings allows us to recover β0 and γ0,a,t.
15
4.2.1 Recovering the Loadings on Cognitive and Noncognitive Skills in Log EarningsEquations
Once identification of β0 and β1 have been secured, we can compute the following moments:
Cov³y1,a,t − x1,a,tβ1, M1,c
¯S = 1
´= αc,1,tα
M1,cV ar (θc|S = 1) + αn,1,tα
M1,cCov (θc, θn|S = 1)
Cov³y1,a,t − x1,a,tβ1, M1,n
¯S = 1
´= αc,1,tα
M1,nCov (θc, θn|S = 1) + αn,1,tα
M1,nV ar (θn|S = 1) .
There are two equations. The loadings αM1,c and αM
1,n have already been identified. Remember
that V ar (θc|S = 1) and V ar (θn|S = 1) are conditional variances of normally distributed randomvariables, so they can be easily estimated given our framework. If
αM1,cα
M1,n
¡V ar (θc|S = 1)V ar (θn|S = 1)− Cov (θc, θn|S = 1)2
¢6= 0,
then there are two linearly independent equations that generate a unique solution for the two un-
knowns, αc,1,t and αn,1,t. This argument identifies the factor loadings on cognitive and noncognitive
factors for all s, t pairs.
4.2.2 Recovering the Loadings on Cognitive and Noncognitive Skills in the SchoolingChoice Equation
A similar argument can be applied to identify the factor loadings on cognitive and noncognitive
skills on the schooling choice equation. As shown in Matzkin (1992), we can assume that we observe
net utility indexes V = V−μ(x,z)V ar(εC−θα0V )
and compute:
Cov³V , M1,c
´= αM
1,c
Ãαc,Vp
V ar (εC − θα0V )V ar (θc) +
αn,VpV ar (εC − θα0V )
Cov (θc, θn)
!
Cov³V , M1,n
´= αM
1,n
Ãαc,Vp
V ar (εC − θα0V )Cov (θc, θn) +
αn,VpV ar (εC − θα0V )
V ar (θn)
!
where αc,V and αn,V are the first two coordinates of the vector αV = (αc,V , αn,V , αh,V ). Given that
the loadings αM1,c and αM
1,n have already been identified, these covariances and the nonsingularity of
the covariance matrix of (θc, θn) allow us to identify the scaled factor loadingsαc,V
V ar(εC−θα0V )and
αn,V
V ar(εC−θα0V ).
At this point, we return to the issue regarding the identification of the factor loading and
variance of measurement error in the PSID cognitive test. If Cov (θc, θn) = 0, so that the moment
(11) cannot be used, then it is possible to identify the factor loading of the PSID test from the
covariance between schooling choice and the test score in PSID. Without loss of generality, let j = 4
16
denote the PSID cognitive test. Then, from the moment:
Cov³V , M4,c
´=
αM4,cαc,Vp
V ar (εC − θα0V )V ar (θc)
we can easily recover αM4,c.We can then use V ar
³M4,c
´to identify the variance of the measurement
error in the PSID cognitive test. Note that the existence of the schooling choice information,
together with a set of reliable estimates of skills from a different source, allows us to recover the
distribution of skills in the PSID free of measurement error even though we don’t have repeated
measures of skills in the PSID.
4.2.3 Recovering the Distribution of the Remaining Residual Components
In what follows, it will be convenient to define the residualized natural log earnings ys,a,t as:
ys,a,t = ys,a,t − xs,a,tβs − αc,s,tθc − αn,s,tθn = αh,s,tθh + πs,a,t + s,a,t
where πs,a,t and s,a,t follow the AR(1) and MA(1) processes defined in (4) and (5), respectively.
The identification of these components explore the time-series information within a schooling group.
Set αh,1,1 = 1 and define:
rs,a,l,k = Cov¡y1,a,l, y1,a+(k−l),k
¯S = 1
¢and note that the following equality holds:
r1,a+1,2,5 − αh,1,2r1,a,1,5r1,a+1,2,4 − αh,1,2r1,a,1,4
= ρ1 (12)
If we normalize αh,1,2 = 1, we could identify ρ1. However, such normalization may not be necessary
because of the following equality for every t = 5, 6, ..., T and k = 2, 3, ..., T also holds:
r1,a+1,k,t − αh,1,kr1,a,k,tr1,a+1,k,t−1 − αh,1,kr1,a,k,t−1
= ρ1 (13)
Focus on k = 2. We have the following polynomials in αh,1,2 :
b1,k,lα2h,1,2 − b2,k,lαh,1,2 + b3,k,l = 0
where
b1,k,l = (r1,a,1,kr1,a,1,l−1 − r1,a,1,lr1,a,1,k−1) ,
b2,k,l = (r1,a+1,2,kr1,a,1,l−1 + r1,a,1,kr1,a+1,2,l−1 − r1,a,1,k−1r1,a+1,2,l − r1,a+1,2,k−1r1,a,1,l) ,
b3,k,l = (r1,a+1,2,kr1,a+1,2,l−1 − r1,a+1,2,k−1r1,a+1,2,l) .
17
Clearly, if b1,k,l = 0, then there is only one solution for αh,1,2 =b3,k,lb2,k,l
. If b1,k,l 6= 0, then we have asecond-degree polynomial that may not have a real solution or may have at most two solutions that
are given by:
αh,1,2 =b2,k,l ±
qb22,k,l − 4b1,k,lb3,k,l2b1,k,l
If b22,k,l−4b1,k,lb3,k,l < 0, there are no real solutions. Note that this case happens if b22,k,l−4b1,k,lb3,k,l <0 for all possible combinations of k and l, an unlikely case. If b22,k,l − 4b1,k,lb3,k,l = 0, then there isone solution given by αh,1,2 =
b2,k,l2b1,k,l
. Finally, if b22,k,l − 4b1,k,lb3,k,l > 0, there are two solutions. We
pick the largest solution αh,1,2 =b2,k,l+
√b22,k,l−4b1,k,lb3,k,l2b1,k,l
. Once αh,1,2 is identified, we can identify ρ1
from (12). More importantly, we can use the relationship (13) to identify αh,1,k for k = 3, 4, ..., T.
Define σ2πt = V ar (πt) , σ2h,1 = V ar (θh|S = 1) . We can recover σ2h,1 from:
σ2h,1 =rs,a+1,1,4 − ρ1rs,a,1,3α1,4,h − ρ1α1,3,h
.
We identify the variance of the AR(1) component σ2πl , l = 1, 2, ..., T from:
rs,a,l,l+2 = α1,l,hαh,1,l+2σ2h,1 + ρ21σ
2πl.
We can use the fact that ρ1 and σ2πlare known, as well as the parametric formulation of the AR(1)
process, to identify the variance of shocks in the AR(1) component:
σ2πl+1 = ρ21σ2πl+ V ar
¡ηπs,a+1,t+1
¢.
We have to identify the variance of the MA(1) component as well as the persistence. From
V ar (ys,a,1) we identify V ar¡ηs,a,1
¢. Then, from rs,a,1,2, we identify φ1. This shows that all the
parameters for the labor income equation in the schooling group S = 1 are identified.
To guarantee identification of the parameters for schooling group S = 0, note that γ1,a,t =Cov(θα01,t,εC−θα0V )
V ar(εC−θα0V )is identified as well as each of the factor loadings in the vector α01,t. Normalizing
V ar (εC) = 1, we can then recover V ar (εC − θα0V ) =
µCov(θα01,t,εC−θα0V )
γ1,a,t
¶2. Given that information,
we can recover α0V without the scalepV ar (εC − θα0V ). We explore the covariance between labor
income in schooling S = 0 and the indexes from the schooling choice to identify αh,0,1. Once this
parameter is identified, we can repeat the argument made for S = 1 to identify the remaining
parameters for S = 0.
18
5 Empirical Results
5.1 Measurement Error in Ability Tests
One of the advantages of our approach is that we allow for the different tests that appraise abilities
to be measured with error. Note that there are two components in the residuals of the test score
equations (10). The first component is the cognitive or noncognitive factor pre-multiplied by its
factor loading. The second component is the measurement error from the tests. Under our assump-
tions, these two components are independent so we can assign the share of the residual variance
that is due to ability (i.e., due to θc or θn) and the one that is due to noise (i.e., due to εMk,j in
equation 10). Conditional on observed characteristics XMk,j, the total variance in test score k is:
V ar¡Mk,j|XM
k,j
¢=¡αMk,j
¢2V ar (θj) + V ar
¡εMk,j
¢The share of the variance that is due to true ability is V ar(θj)
V ar(Mk,j|XMk,j)
while the share that is due to
noise isV ar(εMk,j )
V ar(Mk,j|XMk,j)
. Figure 5 plots these quantities for each test used in our estimation. In terms
of cognitive skills, it is easy to see that the ASVAB components have the highest signal to noise
ratio. In fact, the Arithmetic Reasoning (ASVAB-AR) and Mathematical Knowledge (ASVAB-
MK) components have a signal ratio around 80 percent. The ASVAB measure that has the lowest
signal ratio is the Coding Speed (ASVAB-CS), around 40 percent, which is above the signal ratio
of the PSID sentence completion test (around 35 percent). The ACT, collected for the NLSY/79
and NLS/66 also exhibits high signal ratio (above 60 percent). The Occupation component in the
Knowledge of theWorld of Work, also collected for the NLSY/79 and NLS/66, presents a signal ratio
above 40 percent, well above that of the SAT (signal ratio of 10 percent) or the Lorge-Thorndike
test (8 percent). Our analysis shows a clear picture: the measures of ability from the NLSY/79
have the best signal to noise ratio, followed by the ones in the NLS/66, and lastly, by the PSID
measure.
None of the tests designed to measure noncognitive skills have signal ratios that are as high as
the components of the ASVAB. In fact, the components of the Rotter Locus of Control have signal
ratios at most around 30 percent (the PSID Rotter F component), and sometimes as low as 10
percent (the PSID Job Preference component).
5.2 The Estimates of the Schooling Choice Equation
These measures of skills are used to estimate the distributions of the cognitive and noncognitive
skill factors, which we then use to see how they affect schooling choices and labor income. Table 5
shows the estimates of the schooling cost equation (8). As commonly found in the literature (e.g.,
Cunha, Heckman, and Navarro 2005), the higher the education of the mother, the lower the psychic
costs of attending college. Similarly, higher stocks of cognitive and noncognitive skills also tend to
19
decrease the psychic costs of college. Interestingly, the unobserved heterogeneity component is such
that it increases the psychic costs of college. This finding means that individuals that have high
amounts of θh are more likely to choose s = 0.
The pattern of selection by skills changes in a non-monotonic way across cohorts. For example,
in Table 6A we see that in the first cohort (individuals born between 1920 and 1929) 40 percent of
the individuals in college were at the highest quartile of cognitive skills. That figure is 35 percent
for the third cohort (the individuals born between 1940 and 1949) and 38 percent for the last cohort
(individuals born between 1970 and 1978). These movements are reproduced for noncognitive skills
(Table 6B), but the variation is much smaller. The opposite movement is seen in terms of unobserved
heterogeneity (Table 6C): there is first an increase and then a decrease in the proportion of college
graduates that come from the top quartile of θh. All in all, the evidence from Tables 6A, 6B, and
6C show is that the individuals at the margin of going to college were not at the top quartile of the
joint distribution of (θc, θn, θh).
5.3 Estimates of the Log Earnings Equation
In Table 7 we show the point estimates and the standard errors of coefficients on the control variables
in the log income equations in high school and college, respectively. The potential experience profile
is more concave in college than in high school. We also find evidence that the income profiles are
becoming flatter for more recent cohorts. We are particularly interested in the year dummies
because they are the empirical counterpart to the μs,t in equation (1). In Figure 6 we plot the point
estimates of the coefficients on year dummies, as well as bars indicating one standard-deviation in
each direction. To help see their evolution over time, we regress the coefficients on a quartic in
periods, where a period is defined to be calendar year minus 1968. From this figure we can see
that the year dummies in the two different sectors have followed roughly a parallel pattern, except
during the late 1960s and early 1970s when they follow the opposite directions, increasing in the
college sector and declining in the high school sector.
In Table 8 we display the point estimates and standard errors of the returns to ability and
the factor loadings associated with the unobserved heterogeneity factor. Because the unobserved
heterogeneity is a combination of many different characteristics, we do not discuss the estimates of
the factor loadings, but we register that they exhibit trends in both schooling levels.
Figure 7A plots the estimated returns to cognitive ability in high school and college over the
period between 1968 and 2001. In the high school sector, one standard deviation in cognitive skills
is associated with an increase in income of 4.6 percent and there is little variation over the 33 year
window that our data covers. This contrasts starkly with the returns to cognitive skills. In late
1960s, one standard deviation in cognitive skills raised labor income by almost 9 percent. By 1980,
that return had already dropped to 1.5 percent. Ten years later, the return to cognitive skills had
jumped to 13.7 percent and it maintains around this plateau until the end of that decade. It is
20
very hard to miss the similarity of the dynamics of the college premium and that of the returns to
cognitive ability in the college sector.
Figure 7B plots the dynamics of the returns to noncognitive abilities in the high school and
college sector. We find that the returns are flat over the entire period covered by our data in
both college and high school. In any case, one standard deviation in noncognitive ability in the
high-school sector raises labor income by 7.7 percent. This is 68 percent higher than the impact
an equivalent change in cognitive skills. In the college sector, one standard deviation increase in
noncognitive skills causes income to increase by 9.3 percent.
5.4 The Evolution of College Premium and the Importance of Compo-
sitional Changes
In this section we quantify the importance of compositional changes in measuring the college pre-
mium. The literature studying the evolution of college premium has focussed on the observed wage
differentials between college and high school graduates. However, such differences are confounded
by the fact that college graduates are a selected sample of the population. Depending on the mar-
ket structure, one would expect individuals with higher expected returns to end up with college
degrees. This creates a wedge between the return of obtaining a college degree and the observed
wage differences among these groups. Moreover, as the importance of skills change over time, the
composition of college graduates may change, too. Therefore, the analysis would not only be biased
in the level of the returns to college but also in its evolution over time. The benefit of our approach
is the ability to correct for such compositional changes.
In order to evaluate how much selection matters for the assessment of college premium, we
simulate data from our model. The estimation of the model discussed in the previous sections
makes it possible to simulate earnings of an individual, yis,a,t, for both schooling levels. Using the
simulated data, we calculate the returns to schooling over time as rt = E£log yi1,a,t − log yi0,a,t | t
¤.
This represents college premium after collecting for selection.
Figure 8 plots rt and compares its evolution to the Mincerian returns reported before in Figure
1. We find differences along several dimensions. First, ignoring selection introduces, as expected,
an upward bias for the estimate of college premium: The Mincerian return is consistently larger
than the actual returns throughout the sample period. The difference gets smaller in the late 70’s,
but then widens up later on.
Second, the time-series pattern of the college premium also differs from the Mincerian counter-
part. Unlike the Mincerian returns, the corrected returns show an increase during 1970’s. Moreover,
while the Mincerian returns point to a sharply increasing profile during 1980’s, we find a less steeper
rise during that period. Both of the returns flatten out toward the end of the sample period.
Another interesting result of the estimation is the heterogeneity in the returns to college. Our
model allows the returns to schooling to be different across different people. This is possible because
21
of the observed heterogeneity in cognitive and noncognitive skills and the unobserved heterogene-
ity. Different prices of these components across education groups introduces heterogeneity in the
premium that one would obtain from obtaining a college degree. In particular, the sample of col-
lege graduates face different returns from college education than do the high school graduates. To
quantify how different the returns are, we compute expected returns for college and high school
graduates separately. More specifically, we compute r1,t = E£log yi1,a,t − log yi0,a,t | t, s = 1
¤and
r0,t = E£log yi1,a,t − log yi0,a,t | t, s = 0
¤. Figure 9 plots these returns.
The results show that the returns have been higher for college graduates for the entire sample
period. This happens because, on average, individuals with higher cognitive and noncognitive
abilities end up with a college degree and because the skill prices are higher for college graduates,
as shown in Figure 7A and 7B. The difference between the two schooling groups is smaller in the
early 70’s. This is consistent with the fact that the discrepancy between the Mincerian and corrected
returns in Figure 8 is smaller in the same period. We also find that the average return for college
graduates has diverged from that of the high school graduates, especially in the 1980’s.
5.5 The Returns to Cognitive and Noncognitive Skills and the Rise in
the College Premium
It has been suggested that the rise in the college premium can be due to the changes in the prices of
skills (Juhn, Murphy and Pierce, 1993). The model presented in this paper together with the data
we construct allows us to evaluate this hypothesis. To do so, we ask how much the college premium
would have changed if the skill prices had remained constant throughout the sample period. To
isolate the role of cognitive and noncognitive skills separately, we run 3 counterfactual experiments.
First, we constrain the price of cognitive skills for both schooling groups to remain constant at its
value in 1968 but leave everything else unaltered. Then, we repeat this for the price of noncognitive
skills. Finally, we ask how much of the change in college premium can be accounted for by changes
that are the same for both schooling groups. To answer this question, we set the coefficients on
time dummies to their values in 1968. In each of these counterfactuals, we simulate artificial data
from the model and compute average college premium as explained in the previous section.
Figure 10 plots the results. It is evident that the changes in the prices of cognitive skills have
had a significant effect on college premium. In particular, the college premium would have been
higher up until mid 1980’s and lower in the 1990’s if skill prices did not change at all. Changes in
noncognitive skill prices, on the other hand, cannot account for an important part of the college
premium. It is interesting that time dummies have a substantial effect on college premium. Our
analysis reveals that the college premium would have been mostly flat, if not decreasing, up until
1980’s if there was no change in the coefficients of time dummies. This implies that a large portion
of the changes in college premium since 1970’s are due to changes that are common to everyone in
the economy, regardless of their abilities.
22
5.6 The Returns to Cognitive and Noncognitive Skills and the Hetero-
geneity in College Premium
In this section we investigate the role of skill prices in the heterogeneity of returns to schooling. As
Figure 9 has shown, the returns can be quite different for college and high school graduates. We
ask how the heterogeneity would have evolved had the skill prices remained constant for cognitive
and noncognitive abilities and the factor loadings on unobserved heterogeneity. More precisely,
we repeat the same experiments as in the previous section, however we now compute the college
premium for high school and college graduates separately. This time we also investigate the role of
factor loadings on unobserved heterogeneity. It should be noted that we allow individuals to change
their schooling choice in the experiments. Figures 11A and 11B report the results.We find that the
returns faced by a college graduate as well as high school graduate would have been higher up until
mid 1980’s had the cognitive skill prices remained constant. Movements of noncognitive skill prices
have no effect on heterogeneity.
5.7 Monetary Value of Skills
Tables 9A and 9B compute the dollar increase in lifetime earnings as a result of a 0.1 standard
deviation increase in cognitive and noncognitive skills, respectively. We constrain our analysis to
the set of people who do not change their schooling level and report the results for 2 different
cohorts; the cohort of people born between 1935-1945 and those born between 1955-1965. For the
older cohort, a small increase in cognitive ability raises lifetime earnings by $2, 705 for high school
graduates and $9, 411 for college graduates. The value of cognitive skills is higher for both schooling
groups in the younger cohort, but the difference is more pronounced for college graduates.
Noncognitive skills also have similar impacts. The benefit of a 0.1 standard deviation in noncog-
nitive abilities for the young cohort is larger than for the old cohort at both schooling levels.
Finally, in Table 9C we report the probability of switching from a high school degree to a college
degree as a result of an increase in skills. The probability as a result of an increase in cognitive
ability is 2.94% and 2.53% for the old and young cohorts, respectively. The corresponding figures
for noncognitive ability is smaller, but the probability of switching is larger for the old cohort.
6 Conclusion
This paper estimates the contribution of the returns to skill to the dynamics of the college premium
from late 1960s to late 1990s. In order to do so, we pool three different datasets: the NLS/66,
the NLSY/79, and the PSID. We develop an econometric framework that allows us to explore the
relative strengths of these datasets: the skill data in the NLS/66 and NLSY/79 and the long horizon
of labor market data in the PSID. The National Longitudinal Surveys allow us to estimate error-
corrected ability measures that we use to estimate the returns to skills from the labor market data
23
over many different years and cohorts in the PSID, NLS/66 and NLSY/79.
We find that the dynamics of the returns to cognitive skills in the college sector are remarkably
similar to that of the college premium: the returns decrease during the 1970s, suddenly increase
in the 1980s, and continue to grow at a lower rate during the 1990s. We don’t find evidence of
trends in returns to cognitive skills in the high school sector or to noncognitive skills in high school
or college sectors. Composition changes also contribute to the dynamics of the college premium
and its role is particularly important in the 1990s. Nothwithstanding these findings, our results
indicate that most of the increase in the college premium is driven by the increase in μ1,τ −μ0,τ . Wealso estimate the factual college premium for actual college workers and the counterfactual college
premium for the high school workers. We find that they were about the same in the early 1970s
and they diverge in the early 1980s: the college premium goes up for college workers and remains
constant for the high school workers and only starts to go up for this group in the late 1980s. This
fact is substantially driven by the increase in the returns to cognitive skills.
24
7 References
Atkinson, John W. (1964) An Introduction to Motivation. Princeton, NJ.: Van Nostrand.
Autor, David H, Lawrence F. Katz, andMelissa S. Kearney. (2008) “Trends in U.S. Wage Inequality:
Re-Assessing the Revisionists,” Review of Economics and Statistics 90(2), 300-323.
Bishop, John H. (1991) "On-the-Job Training of New Hires." In Market Failure in Training, edited
by David Stern and Jozef M. M. Ritzen, New York, Springer Verlag, 61-96.
Blackburn, McKinley, and David Neumark. (1993) "Omitted-Ability Bias and the Increase in the
Return to Schooling." Journal of Labor Economics, July, 11:3, 521-44.
Borghans, Lex, Angela Lee Duckworth, James J. Heckman, and Bas ter Weel. (2008) "The eco-
nomics and psychology of personality traits." Journal of Human Resources, 43(4), 972-1059.
Bouchard, Thomas, and John Loehlin. (2001) "Genes, evolution, and personality." Behavior Ge-
netics, 31, 243-273.
Bowles, Samuel, and Herbert Gintis. (1976) Schooling in capitalist America: Educational reform
and contradictions of economic life. New York, NY: Basic Books.
Caliendo, Marco, Deborah Cobb-Clark, and Arne Uhlendorff. (2010) "Locus of Control and Job
Search Strategies." IZA Discussion Paper 4750.
Cameron, Stephen and James J. Heckman. (2001) "The Dynamics of Educational Attainment for
Black, Hispanic, and White Males." Journal of Political Economy, vol. 109(3), 455-459.
Card, David and Thomas Lemieux. (2001) "Can Falling Supply Explain The Rising Return To
College For Younger Men? A Cohort-Based Analysis." The Quarterly Journal of Economics, vol.
116(2), 705-746.
Carneiro, Pedro, Karsten T. Hansen, and James J. Heckman. (2003) "Estimating Distribution of
Counterfactuals with an Application to the Returns to Schooling and Measurement of the Effect of
Uncertainty on Schooling Choice." International Economic Review, Vol 44(2), 361-422.
Carneiro, Pedro, and Sokbae Lee. (2008) "Trends in Quality Adjusted Skill Premia in the US:
1960-2000." Working paper, UCL.
Carroll, Christopher, and Lawrence Summers. (1991) "Consumption Growth Parallels Income
Growth: SomeNew Evidence." In National saving and Economic Performance, edited by Douglas
Bernheim and John B. Shoven, Chicago University Press, 305—43.
Claessens, Amy, Greg Duncan, and Mimi Engel. (2009) "Kindergarten Skills and Fifth Grade
Achievement: Evidence from the ECLS-K." Economics of Education Review, 28(4): 415-427.
Chay, Kenneth Y., and David S. Lee. (2000) "Changes in relative wages in the 1980s Returns to
observed and unobserved skills and black-white wage differentials," Journal of Econometrics, 99(1):
1-38.
Cunha, Flavio, James J. Heckman, and Salvador Navarro. (2005) "Separating uncertainty from
heterogeneity in life cycle earnings." Oxford Economic Papers, Oxford University Press, vol. 57(2),
191-261.
25
Dunifon, Rachel, and Greg Duncan (1998) "Long-Run Effects of Motivation on Labor-Market Suc-
cess." Social Psychology Quarterly, 61(1): 33-48.
Edwards, Rick. (1976) "Individual Traits and Organizational Incentives: What makes a ’Good’
Worker?" Journal of Human Resources, 11(1), 51-68.
Fernandez-Villaverde, Jesus and Dirk Krueger. (2007) "Consumption over the Life Cycle: Some
Facts from Consumer Expenditure Survey Data." The Review of Economics and Statistics, 89(3):
552—565.
Fernandez-Villaverde, Jesus and Dirk Krueger. (2010) "Consumption and Saving over the Life
Cycle: How Important are Consumer Durables." Macroeconomic Dynamics, forthcoming.
Goldstein, Gerald, andMichel Hersen. (2000)Handbook of Psychological Assessment, Third Edition.
Oxford, UK: Elsevier.
Hansen, Karsten, James J. Heckman, and Kathleen Mullen. (2004) "The effect of schooling and
ability on achievement test scores." Journal of Econometrics, vol. 121(1-2), pages 39-98.
Heckman, James J. (1976) "The Common Structure of Statistical Models of Truncation, Sample
Selection and Limited Dependent Variables and a Simple Estimator for Such Models." The Annals
of Economic and Social Measurement, 5, 475-492.
Heckman, James J., and Edward Vytlacil. (2001) "Identifying the Role of Cognitive Ability in
Explaining the Level of and Change in the Return to Schooling."Review of Economics and Statistics,
Vol 83(1), 1-12.
Heckman, James, Jora Stixrud, and Sergio Urzua. (2006) "The Effects of Cognitive and Noncogni-
tive Abilities on Labor Market Outcomes and Social Behavior." Journal of Labor Economics, vol.
24(3), 411-482.
Juhn, Chinhui, Kevin Murphy, and Brooks Pierce. (1993) "Wage Inequality and the Rise in Returns
to Skill." Journal of Political Economy, 101(3), 410-42.
Katz, Lawrence and David Autor. (1999) "Changes in theWage Structure and Earnings Inequality."
In Handbook of Labor Economics, edited by Orley Ashenfelter and David Card, vol. 3A, North-
Holland, 1463-1555.
Katz, Lawrence and Kevin Murphy. (1992) "Changes in Relative Wages, 1963-1987: Supply and
Demand Factors." Quarterly Journal of Economics, 107, 35-78.
Klein, Roger, Richard Spady, and Andrew Weiss. (1991) "Factors Affecting the Output and Quit
Propensities of Production Workers." Review of Economic Studies, 58(2), 929-54.
Kotlarski, Ignacy. (1967) "On characterizing the gamma and normal distribution." Pacific Journal
of Mathematics, 20, 729-38.
Lorge, Irving and Robert Thorndike. (1966) Lorge-Thorndike Intelligence Tests. Chicago, IL:
Riverside Publishing Co.
Matzkin, Roza. (1992) "Nonparametric and distribution-free estimation of the binary threshold
crossing and the binary choice models." Econometrica, 60, 239-70.
McClelland, David, John Atkinson, Russell Clark, and Edgar Lowell. (1953) The achievement mo-
26
tive. New York, NY: Appleton-Century-Crofts.
Miller, Shawn. (2004) National Longitudinal Surveys: NLSY79 User’s Guide 1979-2002. Center for
Human Resource Research, The Ohio State University.
Mincer, Jacob. (1974) Schooling, Experience, and Earnings.New York, NY: Columbia University
Press.
Murnane, Richard J., John B. Willett, and Frank Levy. (1995) "The Growing Importance of
Cognitive Skills in Wage Determination." The Review of Economics and Statistics, vol. 77(2),
251-66.
Murray, Henry. (1938) Explorations in Personality. New York, NY: Oxford University Press.
Neal, Derek A., and William R. Johnson. (1996) "The Role of Premarket Factors in Black-White
Wage Differences." Journal of Political Economy, 104(5), 869-95.
Neisser, Ulric, Gwyneth Boodoo, Thomas J. Bouchard, A. Wade Boykin, Nathan Brody, Stephen J.
Ceci, Diane F. Halpern, John C. Loehlin, Robert Perloff, Robert J. Sternberg, and Susana Urbina.
(1996) "Intelligence: Knowns and Unknowns." American Psychologist 51(2):77—101.
Otis, Arthur S., and Roger T. Lennon. (1977)Otis-Lennon School Ability Test. New York: Harcourt
Brace Jovanovich.
Rotter, Julian B. (1966) "Generalized expectancies of internal versus external control of reinforce-
ments." Psychological Monographs, 80 (whole no. 609).
Rotter, Julian B. (1975) "Some problems and misconceptions related to the construct of internal
versus external control of reinforcement." Journal of Consulting and Clinical Psychology, 43, 56-67.
Schennach, Susanne. (2004) "Estimation of Nonlinear Models with Measurement Error." Econo-
metrica. 72, 33-75.
Sternberg, Robert. (1985) "Beyond IQ: A triarchic theory of human intelligence." New York, NY:
Cambridge University Press.
Taber, Christopher R. (2001) "The Rising College Premium in the Eighties: Return to College or
Return to Unobserved Ability." Review of Economic Studies, Vol 68, 665 - 691.
Traxler, Arthur. (1939) "A Study of the California Test of Mental Maturity: Advanced Battery."
The Journal of Educational Research, 32(5), 329-335.
Vane, Julia and Robert Motta. (1984) "Group Intelligence Tests." In Handbook of Psychological As-
sessment, edited by Gerald Goldstein and Michel Hersen (eds.), 100-116, Elmsford, NY: Pergamon
Press.
Veroff, Joseph, Lor McClelland and Kent Marquis. (1971) "Measuring intelligence and achievement
motivation in surveys." In Final Report to the U.S. Department of Health, Education, and Welfare,
Office of Economic Opportunity, Survey Research Center, Institute for Social Research, University
of Michigan, Ann Arbor Contract OEO-4180 .
27
0.35
0.40
0.45
0.50
0.55
0.60
0.65
Figure 1College Premium by Year
White Males for PSID, NLSY/79, and NLS/66
0.25
0.30
1968 1971 1974 1977 1980 1983 1986 1989 1992 1995 1998 2001
Specification 1 Specification 2 Quartic Approximation (Specification 1) Quartic Approximation (Specification 2)
This figure plots the estimated college premium by year according to two different specifications. In the first specification (gray diamonds),we regress log income year by year against a dummy for a college education and add potential experience, potential experience squared,dummies for south, urban residence at age 14, cohort dummies, and interactions of potential experience with cohort dummies. In the secondspecification (black squares), we regress log income in all years against a dummy for college education interacted with year dummies and addpotential experience, potential experience squared, dummies for south, urban residence at age 14, cohort dummies, and interactions of potential experience with cohort dummies. The solid curves are the quartic interpolation of the estimated college premium.
11.00
12.00
Figure 2ALog Labor Income Age Profile, High School
Comparison Across Different SurveysBars Indicate a One-Standard Deviation Above and Below
9.00
10.00
23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55
NLS/66 NLSY/79 PSID
Figure 2BLog Labor Income Age Profile, CollegeComparison Across Different Surveys
12.00
Bars Indicate a One-Standard Deviation Above and Below
11.00.00
10.00
9.00
23 25 27 29 31 33 35 37 39 41 43 45 47 49 51 53 55
NLS/66 NLSY/79 PSID
0 05
0.10
0.15
0.20
0.25
0.30
0.35
Figure 3APartial Correlation: Cognitive Test Scores and Maternal Education
White Males from PSID, NLS/66 and NLSY/79We control for urban and south residence, paternal education, number of siblings, and cohort
-0.05
0.00
0.05
Otis
-Len
non
Calif
orni
a Te
st o
f Men
tal M
atur
ity
Lorg
e-Th
ornd
ike
SAT
ACT
ASV
AB-
Arit
hmet
ic R
easo
ning
ASV
AB-
Wor
d K
now
ledg
e
ASV
AB-
Para
grap
h Co
mpo
sitio
n
ASV
AB-
Num
eric
al O
pera
tions
ASV
AB-
Codi
ng S
peed
ASV
AB-
Mat
h K
now
ledg
e
Wor
k K
now
ledg
e -O
ccup
atio
ns
Wor
k K
now
ledg
e -E
duca
tion
Wor
k K
now
ledg
e -E
arni
ngs
PSID
Sen
tenc
e Co
mpl
etio
n
Figure 3BPartial Correlation: Cognitive Test Scores and Paternal Education
White Males from PSID, NLS/66 and NLSY/79W ntr l f r rb n nd th r id n m t rn l d ti n n mb r f iblin nd h rt
0.30
0.35
We control for urban and south residence, maternal education, number of siblings, and cohort
0.20
0.25
0.05
0.10
0.15
-0.05
0.00
0.05
n y e T T g e n s d e s n s n
Otis
-Len
non
of M
enta
l Mat
urity
Lorg
e-Th
ornd
ike
SAT
ACT
ithm
etic
Reas
onin
g
B-W
ord
Kno
wle
dge
grap
h Co
mpo
sitio
n
mer
ical
Ope
ratio
ns
VA
B-Co
ding
Spe
ed
B-M
ath
Kno
wle
dge
edge
-O
ccup
atio
ns
wle
dge
-Edu
catio
n
owle
dge
-Ear
ning
s
nten
ce C
ompl
etio
n
Calif
orni
a Tes
t
ASV
AB-
Ari
ASV
AB
ASV
AB-
Para
g
ASV
AB-
Num
ASV
ASV
AB
Wor
k K
now
le
Wor
k K
now
Wor
k K
no
PSID
Sen
Figure 3CPartial Correlation: Cognitive Test Scores and Number of Siblings
White Males from PSID, NLS/66 and NLSY/79W ntr l f r rb n r id n th r id n m t rn l nd p t rn l d ti n nd h rt
0.03
0.05
We control for urban residence, south residence, maternal and paternal education, and cohort
0 03
0.00
-0.05
-0.03
-0.10
-0.08
n ty ke T T g ge n ns d ge ns n gs n
Otis
-Len
non
t of M
enta
l Mat
urit
Lorg
e-Th
ornd
ik
SAT
ACT
rithm
etic
Reas
onin
g
B-W
ord
Kno
wle
dg
agra
ph C
ompo
sitio
n
umer
ical O
pera
tion
VA
B-Co
ding
Spe
ed
B-M
ath
Kno
wled
g
ledge
-O
ccup
atio
n
owle
dge
-Edu
catio
n
now
ledg
e -E
arni
ng
ente
nce
Com
plet
ion
Calif
orni
a Tes
t
ASV
AB-
Ar
ASV
AB
ASV
AB-
Para
ASV
AB-
Nu
ASV
ASV
AB
Wor
k K
now
l
Wor
k K
no
Wor
k K
n
PSID
Se
Figure 3DSchooling Choice and Cognitive Test Scores
White Males from PSID, NLS/66 and NLSY/79W ntr l f r rb n nd th r id n m t rn l nd p t rn l d ti n t iti n nd h rt
1.20
1.40
We control for urban and south residence, maternal and paternal education, tuition, and cohort
0.80
1.00
0.40
0.60
0.00
0.20
n y e T T g e n s d e s n s n
Otis
-Len
non
of M
enta
l Mat
urity
Lorg
e-Th
ornd
ike
SAT
ACT
ithm
etic
Reas
onin
g
B-W
ord
Kno
wle
dge
grap
h Co
mpo
sitio
n
mer
ical
Ope
ratio
ns
VA
B-Co
ding
Spe
ed
B-M
ath
Kno
wle
dge
edge
-O
ccup
atio
ns
wle
dge
-Edu
catio
n
owle
dge
-Ear
ning
s
nten
ce C
ompl
etio
n
Calif
orni
a Tes
t
ASV
AB-
Ar i
ASV
AB
ASV
AB-
Para
g
ASV
AB-
Num
ASV
ASV
AB
Wor
k K
now
le
Wor
k K
now
Wor
k K
no
PSID
Sen
Figure 4APartial Correlation: Noncognitive Test Scores and Maternal Education
White Males from PSID, NLS/66 and NLSY/79W l f b d h id l d i b f ibli d h
0.15
0.20
We control for urban and south residence, paternal education, number of siblings, and cohort
0.10
0.00
0.05
-0.05
-0.10
Rotte
r E
Rotte
r F
Rotte
r G
Rotte
r K
D R
otte
r E
D R
otte
r G
D P
lan L
ife
D R
otte
r F
ish T
hing
s
Pref
eren
ce
PSID
PSID
PSID
PSID
PSID
Fin
i
PSID
Job
P
0.10
0.15
0.20
0.25
0.30
0.35
Figure 4BPartial Correlation: Noncognitive Test Scores and Paternal Education
White Males from PSID, NLS/66 and NLSY/79We control for urban residence, south residence, maternal education, number of siblings, and cohort
-0.05
0.00
0.05
Rotte
r E
Rotte
r F
Rotte
r G
Rotte
r K
PSID
Rot
ter E
PSID
Rot
ter G
PSID
Plan
Life
PSID
Rot
ter F
PSID
Fin
ish T
hing
s
PSID
Job
Pref
eren
ce
Figure 4CPartial Correlation: Noncognitive Test Scores and Number of Siblings
White Males from PSID, NLS/66 and NLSY/79W ntr l f r rb n nd th r id n m t rn l nd p t rn l d ti n nd h rt
0.06
0.08
We control for urban and south residence, maternal and paternal education, and cohort
0.02
0.04
-0 02
0.00
-0.04
0.02
-0.06
Rotte
r E
Rotte
r F
Rotte
r G
Rotte
r K
D R
otte
r E
D R
otte
r G
D P
lan L
ife
D R
otte
r F
ish T
hing
s
Pref
eren
ce
PSID
PSID
PSID
PSID
PSID
Fin
i
PSID
Job
P
Figure 4DSchooling Choice and Noncognitive Test Scores
White Males from PSID, NLS/66 and NLSY/79W t l f b d th id t l d t l d ti t iti d h t
0.20We control for urban and south residence, maternal and paternal education, tuition, and cohort
0.10
0.15
0.05
0.00
-0.05
Rotte
r E
Rotte
r F
Rotte
r G
Rotte
r K
Rotte
r E
Rotte
r G
Plan
Life
Rotte
r F
h Th
ings
efer
ence
R R R R
PSID
R
PSID
R
PSID
P
PSID
R
PSID
Fin
ish
PSID
Job
Pre
Work Knowledge - OccupationsWork Knowledge - Education
Work Knowledge - EarningsPSID Sentence Completion
Rotter ERotter FRotter GRotter K
PSID Rotter EPSID Rotter GPSID Plan LifePSID Rotter F
PSID Finish ThingsPSID Job Preference
Figure 5Residual Variance in Test Scores: True Ability versus Noise
Cognitive and Noncognitive Skill Factors
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
Otis-LennonLorge-Thorndike
California Test of Mental MaturitySATACT
ASVAB-ARASVAB-WKASVAB-PC
ASVAB-NOASVAB-CS
ASVAB-MK
True Ability Noise
-0.1
0.0
0.1
0.2
0.3
0.4
Figure 6Year Dummies in High School and College Sectors
-0.5
-0.4
-0.3
-0.2
1969 1972 1975 1978 1981 1984 1987 1990 1993 1996 1999
High School College Quartic Approximation (High School) Quartic Approximation (College)
0.10
0.14
0.18
Figure 7AReturns to Cognitive Ability
High School and College Sectors
‐0.02
0.02
0.06
1968 1972 1976 1980 1984 1988 1992 1996 2000
High School College Cubic Approximation (High School) Cubic Approximation (College)
0.10
0.14
0.18
Figure 7BReturns to Noncognitive AbilityHigh School and College Sectors
‐0.02
0.02
0.06
1968 1972 1976 1980 1984 1988 1992 1996 2000
High School College Cubic Approximation (High School) Cubic Approximation (College)
0.30
0.40
0.50
0.60
0.70
Figure 8Accounting for Compositional Changes in the College Premium
0.00
0.10
0.20
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
Mincer Corrected
Quartic Approximation (Mincer) Quartic Approximation (Corrected)
0.30
0.40
0.50
0.60
0.70
Figure 9The Dynamics of the College Premium
College Workers Versus High School Workers
0.00
0.10
0.20
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
Average College High School
Quartic Approximation (Average) Quartic Approximation (College) Quartic Approximation (High School)
0.30
0.40
0.50
0.60
0.70
Figure 10The College Premium, Returns to Skills, and Year Dummies
0.00
0.10
0.20
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1994
1995
1996
1997
1998
1999
2000
2001
Returns Cognitive Time Dummies
Quartic Approximation (Returns) Quartic Approximation (Cognitive) Quartic Approximation (Dummies)
0.30
0.40
0.50
0.60
0.70
Figure 11AThe Dynamics of the College Premiumfor High School and College Workers
Factual versus Counterfactual Returns to Cognitive Skills
0.00
0.10
0.20
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1995
1997
1999
2001
College High School
College (experiment) High School (experiment)
0.30
0.40
0.50
0.60
0.70
Figure 11BThe Dynamics of the College Premiumfor High School and College Workers
Factual versus Counterfactual Returns to Noncognitive Skills
0.00
0.10
0.20
1968
1969
1970
1971
1972
1973
1974
1975
1976
1977
1978
1979
1980
1981
1982
1983
1984
1985
1986
1987
1988
1989
1990
1991
1992
1993
1995
1997
1999
2001
College High School
College (experiment) High School (experiment)
age/year 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 200123 73 77 136 145 155 3124 82 73 91 150 162 145 3325 141 94 89 93 167 164 15626 105 139 93 90 160 168 169 3527 102 146 91 96 170 160 16928 101 141 87 151 171 159 3529 100 87 91 146 155 154 3630 133 85 93 162 158 14131 94 81 87 139 148 14732 127 82 90 154 14733 86 119 83 138 14434 74 82 89 12635 111 76 8336 70 71 8137 106 7338 70 9839 684041424344454647484950515253545556
Lexis Diagram 1AData availability for Earnings (NLS/66)
Blue boxes mean available data for the corresponding age/year
age/year 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 200123 113 120 132 160 134 167 131 12224 115 122 132 171 141 179 151 13525 121 126 137 168 147 179 159 13926 124 122 139 170 146 184 153 13827 124 125 142 182 150 183 156 13328 128 125 144 172 155 187 152 14029 127 126 145 178 153 181 153 14330 124 120 140 174 154 183 16131 128 129 138 168 150 184 14232 133 124 133 170 153 15833 132 121 132 170 180 12534 127 131 135 149 15235 124 125 164 169 13536 126 139 141 15537 130 166 158 12638 129 125 132 14239 119 167 15640 130 121 14041 110 14642 128 11443 11044 117454647484950515253545556
Lexis Diagram 1BData availability for Earnings (NLSY/79)
Blue boxes mean available data for the corresponding age/year
age/year 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 200123 17 23 46 35 30 36 44 37 44 38 39 45 37 26 31 37 32 29 18 21 19 21 24 16 18 21 18 15 18 18 16 1624 21 33 32 62 52 44 48 56 43 58 40 48 50 44 29 38 41 42 38 27 29 26 24 29 21 29 22 24 24 23 22 2625 22 27 35 36 73 57 57 46 66 54 67 46 57 56 48 31 48 43 45 43 33 39 31 38 35 24 41 25 30 24 28 3626 28 24 33 38 45 79 63 59 47 71 56 70 51 64 62 59 34 55 44 53 48 40 47 35 38 38 26 44 33 29 41 3827 26 29 30 35 39 50 81 65 56 46 70 57 73 52 70 66 60 41 59 49 53 52 41 53 36 40 42 29 44 27 31 3828 19 29 34 31 35 42 54 89 65 63 48 69 59 76 53 76 65 63 40 61 54 56 57 42 55 39 41 44 32 42 31 4429 15 21 33 33 35 38 39 49 88 67 61 46 71 60 70 55 78 71 70 40 62 55 56 59 43 56 38 41 47 28 30 3430 20 17 24 34 32 37 41 41 49 86 67 64 48 70 59 74 52 81 74 68 42 64 55 53 59 47 52 36 43 40 40 3531 17 23 19 24 34 35 36 44 44 50 81 64 59 51 67 52 77 58 79 68 73 43 70 52 62 60 50 52 35 34 30 3732 17 22 23 19 25 34 36 36 44 45 49 81 65 54 49 64 49 75 59 76 71 73 49 70 55 58 60 47 53 24 41 4833 13 19 22 24 19 26 35 34 37 47 43 48 79 65 54 51 66 50 76 59 75 68 77 49 73 57 59 61 47 47 32 3334 19 13 20 22 24 19 25 32 36 38 46 43 49 75 69 57 53 61 50 76 59 73 69 72 50 67 57 58 56 40 28 4135 17 20 13 21 22 24 18 27 32 33 39 43 43 49 78 66 54 52 61 50 76 62 69 66 73 54 67 60 60 49 48 3436 18 16 20 13 21 22 24 16 28 35 35 39 43 41 44 75 67 50 54 63 51 78 64 71 68 73 55 70 60 50 37 2637 25 19 16 20 13 24 21 23 16 26 37 36 40 43 41 47 70 65 51 55 63 49 84 65 66 68 67 56 69 52 49 4838 28 26 20 17 21 14 24 20 21 16 26 37 34 40 42 38 46 72 64 53 54 60 51 79 67 68 62 69 56 58 48 3439 26 27 26 19 17 22 14 23 20 24 16 26 34 34 38 43 38 44 67 65 53 53 62 51 76 61 70 65 72 42 51 4440 29 26 26 26 20 17 22 14 22 20 20 16 24 32 34 34 41 37 44 69 62 49 54 58 50 72 59 68 63 54 59 4841 27 31 27 26 27 20 18 23 15 22 20 20 16 24 31 35 36 41 35 43 70 61 55 50 57 53 72 59 65 42 44 5242 21 25 30 28 26 27 21 18 23 15 23 21 20 14 24 29 38 39 41 38 42 63 65 55 45 54 51 71 59 59 56 5843 21 21 26 31 28 26 27 21 17 23 15 24 21 19 17 23 29 35 39 42 37 41 63 65 58 47 55 50 72 46 43 4444 17 22 22 26 30 28 26 23 21 18 24 15 23 21 16 19 22 30 31 38 42 35 44 62 63 53 51 51 53 53 54 5445 15 18 22 22 26 30 28 26 23 21 17 23 14 23 23 17 19 19 30 31 36 39 37 44 63 67 52 51 48 43 45 4246 22 16 18 23 22 26 31 27 25 21 22 16 23 14 22 22 17 18 22 30 31 34 37 39 44 65 63 50 47 44 48 5147 20 21 16 19 23 24 26 29 28 26 20 21 19 22 12 21 20 15 18 25 30 30 32 34 35 38 62 57 48 43 44 4348 15 19 21 16 19 24 23 23 27 27 25 22 22 19 21 11 21 19 16 17 25 31 30 31 34 34 41 63 57 39 41 5349 14 16 18 21 16 19 24 23 23 27 26 25 22 24 17 21 9 20 18 15 17 24 30 32 30 33 39 38 69 53 39 4550 14 14 16 19 20 15 19 23 22 23 26 26 25 22 24 16 21 8 20 20 14 17 27 30 33 28 27 34 36 56 43 4251 15 15 14 16 18 20 15 20 20 22 21 26 26 25 21 24 15 21 9 20 18 14 18 25 31 33 31 28 35 33 54 3652 9 16 15 14 17 19 20 15 18 19 21 22 26 26 24 20 22 14 19 9 20 18 13 18 24 31 28 25 27 31 52 3753 8 9 17 15 14 17 19 20 15 18 20 21 21 26 26 22 19 22 14 18 9 20 18 14 18 24 27 30 29 21 25 5254 11 8 10 17 14 14 17 20 19 14 18 20 19 21 26 24 21 18 20 14 20 9 20 18 13 16 21 26 31 23 33 5155 10 11 9 10 17 15 14 15 17 18 15 18 19 17 21 21 23 19 18 18 14 19 9 19 16 13 18 19 25 24 23 2356 11 9 11 9 10 17 15 14 13 16 18 15 19 18 17 21 22 21 21 17 19 13 20 7 18 17 11 17 20 21 24 31
Lexis Diagram 1CData availability for Earnings (PSID)
Blue boxes mean available data for the corresponding age/year
age/year 1968 1969 1970 1971 1972 1973 1974 1975 1976 1977 1978 1979 1980 1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 200123 90 100 182 180 30 191 44 68 44 38 39 45 150 146 163 197 166 196 149 143 19 21 24 16 18 21 18 15 18 18 16 1624 103 106 123 212 52 206 48 201 76 58 40 48 50 159 151 170 212 183 217 178 164 26 24 29 21 29 22 24 24 23 22 2625 163 121 124 129 73 224 57 210 222 54 67 46 57 56 169 157 185 211 192 222 192 178 31 38 35 24 41 25 30 24 28 3626 133 163 126 128 45 239 63 227 216 71 91 70 51 64 62 183 156 194 214 199 232 193 185 35 38 38 26 44 33 29 41 3827 26 131 176 126 39 146 81 235 216 46 239 57 73 52 70 66 184 166 201 231 203 235 197 186 36 40 42 29 44 27 31 3828 19 29 135 172 35 129 54 240 236 63 207 69 94 76 53 76 65 191 165 205 226 211 244 194 195 39 41 44 32 42 31 4429 15 21 33 133 35 125 39 140 234 67 216 46 225 96 70 55 78 71 197 166 207 233 209 240 196 199 38 41 47 28 30 3430 20 17 24 34 32 170 41 126 142 86 229 64 206 211 59 74 52 81 74 192 162 204 229 207 242 208 52 36 43 40 40 3531 17 23 19 24 34 129 36 125 131 50 220 64 207 198 67 52 77 58 79 68 201 172 208 220 212 244 50 194 35 34 30 3732 17 22 23 19 25 34 36 163 126 45 139 81 219 201 49 64 49 75 59 76 71 206 173 203 225 211 60 205 53 24 41 4833 13 19 22 24 19 26 35 120 156 47 126 48 217 209 54 51 66 50 76 59 75 68 209 170 205 227 59 241 47 172 32 3334 19 13 20 22 24 19 25 32 110 38 128 43 138 201 69 57 53 61 50 76 59 73 69 199 181 202 57 207 56 192 28 4135 17 20 13 21 22 24 18 27 32 33 150 43 119 132 78 66 54 52 61 50 76 62 69 66 197 179 67 224 60 218 183 3436 18 16 20 13 21 22 24 16 28 35 105 39 114 122 44 75 67 50 54 63 51 78 64 71 68 199 55 209 60 191 192 2637 25 19 16 20 13 24 21 23 16 26 37 36 146 116 41 47 70 65 51 55 63 49 84 65 66 68 67 186 69 218 207 17438 28 26 20 17 21 14 24 20 21 16 26 37 104 138 42 38 46 72 64 53 54 60 51 79 67 68 62 198 56 183 180 17639 26 27 26 19 17 22 14 23 20 24 16 26 34 102 38 43 38 44 67 65 53 53 62 51 76 61 70 65 72 161 218 20040 29 26 26 26 20 17 22 14 22 20 20 16 24 32 34 34 41 37 44 69 62 49 54 58 50 72 59 68 63 184 180 18841 27 31 27 26 27 20 18 23 15 22 20 20 16 24 31 35 36 41 35 43 70 61 55 50 57 53 72 59 65 42 154 19842 21 25 30 28 26 27 21 18 23 15 23 21 20 14 24 29 38 39 41 38 42 63 65 55 45 54 51 71 59 59 184 17243 21 21 26 31 28 26 27 21 17 23 15 24 21 19 17 23 29 35 39 42 37 41 63 65 58 47 55 50 72 46 43 15444 17 22 22 26 30 28 26 23 21 18 24 15 23 21 16 19 22 30 31 38 42 35 44 62 63 53 51 51 53 53 54 17145 15 18 22 22 26 30 28 26 23 21 17 23 14 23 23 17 19 19 30 31 36 39 37 44 63 67 52 51 48 43 45 4246 22 16 18 23 22 26 31 27 25 21 22 16 23 14 22 22 17 18 22 30 31 34 37 39 44 65 63 50 47 44 48 5147 20 21 16 19 23 24 26 29 28 26 20 21 19 22 12 21 20 15 18 25 30 30 32 34 35 38 62 57 48 43 44 4348 15 19 21 16 19 24 23 23 27 27 25 22 22 19 21 11 21 19 16 17 25 31 30 31 34 34 41 63 57 39 41 5349 14 16 18 21 16 19 24 23 23 27 26 25 22 24 17 21 9 20 18 15 17 24 30 32 30 33 39 38 69 53 39 4550 14 14 16 19 20 15 19 23 22 23 26 26 25 22 24 16 21 8 20 20 14 17 27 30 33 28 27 34 36 56 43 4251 15 15 14 16 18 20 15 20 20 22 21 26 26 25 21 24 15 21 9 20 18 14 18 25 31 33 31 28 35 33 54 3652 9 16 15 14 17 19 20 15 18 19 21 22 26 26 24 20 22 14 19 9 20 18 13 18 24 31 28 25 27 31 52 3753 8 9 17 15 14 17 19 20 15 18 20 21 21 26 26 22 19 22 14 18 9 20 18 14 18 24 27 30 29 21 25 5254 11 8 10 17 14 14 17 20 19 14 18 20 19 21 26 24 21 18 20 14 20 9 20 18 13 16 21 26 31 23 33 5155 10 11 9 10 17 15 14 15 17 18 15 18 19 17 21 21 23 19 18 18 14 19 9 19 16 13 18 19 25 24 23 2356 11 9 11 9 10 17 15 14 13 16 18 15 19 18 17 21 22 21 21 17 19 13 20 7 18 17 11 17 20 21 24 31
Lexis Diagram 1DData availability for Earnings (Pooled Data)
Blue boxes mean available data for the corresponding age/year
Estimate Std. Error t-statisticCohort 01 (born before 1929) 0.374 0.028 13.57Cohort 02 (born between 1930 and 1939) 0.405 0.030 13.51Cohort 03 (born between 1940 and 1949) 0.508 0.016 32.1Cohort 04 (born between 1950 and 1959) 0.400 0.013 31.31Cohort 05 (born between 1960 and 1969) 0.396 0.016 24.98Cohort 06 (born between 1970 and 1978) 0.445 0.027 16.68Dummy for NLS/66 -0.039 0.018 -2.15Dummy for NLSY/79 -0.011 0.018 -0.63
Observations 6396R2 0.423F-test (Dummy for NLS/66 = 0 and Dummy for NLSY/79 = 0) 2.37P-Value of F-test 0.094
Table 1Fraction of Individuals with a College Degree
by Cohort and Survey*
*Note: To construct this table, we regress schooling attainment (s = 0 for high school graduates and s = 1 for college graduates) against cohort dummies and survey membership dummies.
High School College High School College High School CollegeNumber of Observations 857 677 873 549 1518 1158South Residence at age 14 0.286 0.254 0.226 0.231 0.367 0.313
(0.452) (0.435) (0.418) (0.422) (0.482) (0.463)Urban Residence at age 14 0.674 0.818 0.683 0.816 0.764 0.864
(0.469) (0.386) (0.466) (0.388) (0.425) (0.342)Cohort 01 (born before 1929) - - - - 0.054 0.041
- - - - (0.226) (0.197)Cohort 02 (born between 1930 and 1939) - - - - 0.093 0.082
- - - - (0.290) (0.273)Cohort 03 (born between 1940 and 1949) 0.694 0.755 - - 0.272 0.383
(0.461) (0.430) - - (0.444) (0.486)Cohort 04 (born between 1950 and 1959) 0.306 0.245 0.352 0.322 0.383 0.333
(0.461) (0.430) (0.477) (0.467) (0.486) (0.471)Cohort 05 (born between 1960 and 1969) - - 0.648 0.678 0.164 0.128
- - (0.477) (0.467) (0.370) (0.333)Cohort 06 (born between 1970 and 1978) - - - - 0.026 0.028
- - - - (0.159) (0.165)Maternal Education 3.198 4.109 3.757 4.685 3.527 4.328
(1.010) (1.165) (0.966) (0.984) (1.078) (1.138)Paternal Education 2.917 4.085 3.715 4.964 3.277 4.347
(1.101) (1.328) (1.177) (1.183) (1.238) (1.417)Log Tuition 7.345 7.336 7.630 7.562 7.470 7.459
(0.121) (0.117) (0.483) (0.480) (0.344) (0.311)Log Earnings 10.526 10.643 10.400 10.722 10.508 10.891
(2.082) (1.735) (1.976) (1.673) (2.125) (1.756)Number of Siblings 2.992 2.249 3.140 2.446 3.236 2.455
(2.082) (1.735) (1.976) (1.673) (2.125) (1.756)Note: Standard errors in parenthesis
Mean Value (Control Variables)White Males from NLS/66, NLSY/79 and PSID
NLS/66 NLSY/79 PSID
Table 2
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4
Otis-Lennon Test1 38.41% 34.77% 20.53% 6.29% 11.55% 10.76% 31.47% 46.22%California Mental Maturity Test2 37.70% 28.96% 19.67% 13.66% 10.08% 15.97% 32.77% 41.18%Lorge-Thorndike Test3 32.41% 34.26% 22.22% 11.11% 14.29% 12.99% 28.57% 44.16%Scholastic Aptitude Test (SAT)4 55.56% 17.28% 16.05% 11.11% 18.42% 27.11% 26.58% 27.89%American College Test (ACT)4 53.85% 28.21% 17.95% 0.00% 21.13% 24.74% 25.26% 28.87%ASVAB - Arithmetic Reasoning5 42.12% 27.11% 24.66% 6.11% 8.24% 17.42% 29.03% 45.32%ASVAB - Word Knowledge5 41.27% 29.18% 22.71% 6.84% 7.49% 15.36% 41.39% 35.77%ASVAB - Paragraph Composition5 41.15% 27.23% 27.59% 4.03% 6.18% 20.79% 57.68% 15.36%ASVAB - Numerical Operation5 36.26% 30.40% 20.39% 12.94% 8.05% 25.66% 26.78% 39.51%ASVAB - Coding Speed5 36.14% 28.08% 21.86% 13.92% 13.11% 21.54% 24.53% 40.82%ASVAB - Math Knowledge5 39.80% 38.10% 19.41% 2.69% 3.18% 13.11% 35.77% 47.94%Work knowledge, occupations knowledge6 34.90% 33.45% 21.86% 9.80% 13.96% 36.49% 30.12% 19.43%Work knowledge, education required for occupation6 34.99% 31.44% 20.21% 13.36% 13.82% 30.76% 22.73% 32.69%Work knowledge, earnings comparison6 51.06% 33.33% 0.00% 15.61% 32.59% 32.59% 0.00% 34.81%PSID Sentence Completion7 35.79% 26.55% 33.93% 3.73% 16.92% 20.12% 51.48% 11.48%
6 The Knowledge of World of Work test consists of three components. The first requires a respondent to identify, in a multiple-choice format, the duties of ten occupations. Thesecond component involves identification of the typical educational attainment of men in each of these same ten occupations. The third component is based on a respondent'sjudgment, for each of eight pairs of occupations, as to which one yields the higher average annual earnings. 7 PSID sentence completion test is an adaptation of the verbal component of the Lorge-Thorndike Intelligence Test.
1 The Otis-Lennon test Intends to provide a measure of IQ and is organized in five areas: verbal comprehension, verbal reasoning, pictorial reasoning, figural reasoning, andquantitative reasoning. 2 The California Test of Mental Maturity is designed to measure memory, spatial relationships, reasoning, and vocabulary. 3 The Lorge-Thorndike Intelligence Test measures abstract intelligence and contains both verbal and nonverbal items. 4 The Scholastic Aptitude Test (SAT) and the American College Test (ACT) are standardized tests used in college admissions in the United States. The SAT consists of threemajor sections: Critical Reading, Mathematics, and Writing. The American College Test (ACT) covers four skill areas: English, Mathematics, Reading, and Science.5 The ASVAB is a multiple choice test used to determine qualification for enlistment in the United States armed forces. The Arithmetic Reasoning Test measures an individual'sability to solve arithmetic word problems. The Word Knowledge component intends to measure the individual's ability to provide the correct meaning to a word that is presented in a context. The Paragraph Comprehension evaluates the capacity to understand written passages. The Numerical Operation test consists of a series of arithmetic computationsthat must be carried out correctly within a time limit. The Coding Speed component measures the capacity to assign code numbers to words. The Math Knowledge test measuresthe knowledge of high-school math principles.
White Males from NLS/66, NLSY/79 and PSIDHigh School College
Table 3Proportion of People in Each Quartile (Cognitive Skill Tests)
1 2 3 4 5 6 1 2 3 4 5 6
Rotter E1 13.63% 10.96% 27.64% 47.77% - - 10.48% 8.94% 28.79% 51.79% - -Rotter F2 24.46% 25.48% 20.08% 29.99% - - 15.60% 14.75% 30.26% 39.39% - -Rotter G3 9.13% 6.83% 38.29% 45.76% - - 5.83% 5.66% 39.42% 49.10% - -Rotter K4 22.19% 18.65% 33.61% 25.55% - - 18.27% 19.31% 39.13% 23.29% - -Rotter E PSID1 18.10% - - 81.90% - - 15.31% - - 84.69% - -Rotter G PSID3 29.88% 0.18% 0.62% 0.80% 68.51% - 17.64% - 0.95% 1.55% 79.86% -PSID Planning5 47.55% 0.45% 1.78% 1.16% 49.06% - 31.94% 1.19% 2.98% 2.74% 61.14% -Rotter F PSID2 0.09% 30.20% 0.09% 2.96% 0.99% 65.68% - 19.67% 0.59% 2.25% 1.54% 75.95%PSID Carrying-out Plans6 17.31% 0.09% 0.89% 1.07% 80.64% - 13.73% 0.12% 0.71% 0.83% 84.62% -PSID Affiliation vs Achievement7 35.57% - - 64.43% - - 19.15% - - 80.85% - -
6 "When you make plans ahead, do you usually get to carry out things the way you expected, or do things usually come up to make you change your plans?" There are five possibleanswers and higher values indicate individuals who report to be more likely to carry out plans to the end.7 "What kind of job would you want the most, a job where you had to think for yourself, or a job where the people you work with are a nice group?". There are two possible answers andthe highest value reflects individuals who are more likely to prefer a job that he can think for himself (i.e., lower need for affiliation).
Notes: 1 "What Happens to me is my own doing". There are four possible answers in the NLS/66 and NLSY/79 surveys and two possible answers in the PSID survey. Higher valuesindicate stronger internal locus of control.2 "I can make plans work". There are four possible answers in the NLS/66 and NLSY/79 surveys and five possible answers in the PSID survey. Higher values indicate stronger internallocus of control.3 "What I want has nothing do to with luck". There are four possible answers in the NLS/66 and NLSY/79 surveys and two possible answers in the PSID survey. Higher values indicatestronger internal locus of control.4 "It is impossible to believe that luck plays role". There are four possible answers in the NLS/66 and NLSY/79 surveys and is not available in the PSID survey. Higher values indicatestronger internal locus of control.5 "Are you the kind of person that plans his life ahead all the time, or do you live more from day to day?" There are five possible answers and higher values indicate individuals who reportare more likely to plan.
Table 4Proportion of People in Each Alternative (Noncognitive Skill Tests)
White Males from NLS/66, NLSY/79 and PSIDHigh School College
Variable Point Estimate Standard Error t-statisticIntercept 0.120 0.571 0.210Cohort 02 (born between 1930 and 1939) 0.167 0.144 1.157Cohort 03 (born between 1940 and 1949) 0.113 0.123 0.919Cohort 04 (born between 1950 and 1959) 0.502 0.132 3.793Cohort 05 (born between 1960 and 1969) 0.702 0.137 5.127Cohort 06 (born between 1970 and 1978) 0.854 0.174 4.898Maternal Education -0.312 0.026 -11.926Paternal Education -0.363 0.025 -14.300Number of Siblings 0.078 0.012 6.545Log Tuition 0.405 0.081 4.987Dummy for NLS/66 -0.243 0.096 -2.537Dummy for PSID -0.183 0.075 -2.443Cognitive Skill Factor -0.620 0.052 -11.936Noncognitive Skill Factor -0.094 0.044 -2.148Unobserved Heterogeneity Factor 0.516 0.112 4.617
Parameters of the Schooling Cost EquationTable 5
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4Cohort 01 (born before 1929) 33.05% 27.52% 23.10% 16.33% 12.38% 20.49% 27.72% 39.41%Cohort 02 (born between 1930 and 1939) 33.47% 27.71% 22.82% 16.00% 12.90% 20.98% 28.21% 37.92%Cohort 03 (born between 1940 and 1949) 35.41% 27.97% 22.23% 14.38% 14.12% 21.69% 27.98% 36.21%Cohort 04 (born between 1950 and 1959) 33.26% 27.75% 23.08% 15.91% 12.39% 20.68% 28.03% 38.91%Cohort 05 (born between 1960 and 1969) 34.00% 27.97% 22.45% 15.57% 12.22% 20.99% 27.99% 38.80%Cohort 06 (born between 1970 and 1978) 33.98% 28.67% 22.65% 14.70% 12.79% 20.30% 28.68% 38.23%
Table 6AProportion of People in Each Quartile (Cognitive Skills)
High School College
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4Cohort 01 (born before 1929) 28.02% 25.93% 24.39% 21.65% 19.95% 23.25% 26.65% 30.25%Cohort 02 (born between 1930 and 1939) 28.39% 25.70% 24.25% 21.67% 19.82% 23.46% 26.43% 30.29%Cohort 03 (born between 1940 and 1949) 29.36% 26.33% 23.80% 20.51% 20.48% 23.80% 25.97% 29.75%Cohort 04 (born between 1950 and 1959) 28.52% 26.11% 24.06% 21.31% 19.29% 23.46% 26.23% 31.02%Cohort 05 (born between 1960 and 1969) 28.83% 25.90% 24.04% 21.24% 19.24% 23.29% 26.53% 30.93%Cohort 06 (born between 1970 and 1978) 29.42% 25.87% 24.53% 20.18% 19.93% 23.24% 26.85% 29.98%
Table 6BProportion of People in Each Quartile (Noncognitive Skills)
High School College
Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4Cohort 01 (born before 1929) 18.02% 23.66% 26.95% 31.38% 36.85% 27.06% 21.35% 14.74%Cohort 02 (born between 1930 and 1939) 17.85% 23.53% 26.93% 31.70% 35.24% 27.41% 21.95% 15.40%Cohort 03 (born between 1940 and 1949) 16.79% 22.64% 27.08% 33.49% 33.98% 27.12% 22.81% 16.09%Cohort 04 (born between 1950 and 1959) 18.02% 23.09% 26.81% 32.08% 36.32% 27.16% 21.96% 14.56%Cohort 05 (born between 1960 and 1969) 17.89% 23.34% 27.08% 31.06% 36.23% 27.67% 21.36% 14.74%Cohort 06 (born between 1970 and 1978) 16.72% 22.83% 26.97% 33.48% 35.56% 27.35% 22.39% 14.71%
CollegeHigh School
Table 6CProportion of People in Each Quartile (Unobserved Heterogeneity)
Estimate Std. Error Estimate Std. ErrorIntercept 9.713 0.169 8.992 0.280Potential Experience 0.066 0.009 0.151 0.014Potential Experience Squared (Divided by 100) -0.113 0.012 -0.266 0.017South -0.031 0.014 0.000 0.020Urban 0.046 0.014 0.048 0.025Dummy for Year 1969 0.005 0.019 0.037 0.034Dummy for Year 1970 -0.018 0.025 0.101 0.079Dummy for Year 1971 -0.027 0.025 0.116 0.079Dummy for Year 1972 -0.016 0.028 0.183 0.076Dummy for Year 1973 -0.003 0.029 0.204 0.077Dummy for Year 1974 -0.054 0.033 0.098 0.079Dummy for Year 1975 -0.106 0.034 0.097 0.079Dummy for Year 1976 -0.131 0.036 0.178 0.085Dummy for Year 1977 -0.119 0.039 0.177 0.087Dummy for Year 1978 -0.075 0.039 0.102 0.088Dummy for Year 1979 -0.112 0.042 0.050 0.091Dummy for Year 1980 -0.188 0.043 0.071 0.096Dummy for Year 1981 -0.225 0.046 0.046 0.098Dummy for Year 1982 -0.301 0.051 0.041 0.105Dummy for Year 1983 -0.328 0.052 0.036 0.107Dummy for Year 1984 -0.311 0.054 -0.102 0.110Dummy for Year 1985 -0.288 0.056 -0.057 0.112Dummy for Year 1986 -0.272 0.057 -0.066 0.111Dummy for Year 1987 -0.270 0.059 -0.030 0.113Dummy for Year 1988 -0.244 0.060 -0.007 0.118Dummy for Year 1989 -0.263 0.062 0.000 0.119Dummy for Year 1990 -0.293 0.064 -0.032 0.124Dummy for Year 1991 -0.324 0.066 -0.065 0.125Dummy for Year 1992 -0.316 0.068 -0.055 0.126Dummy for Year 1993 -0.313 0.070 -0.046 0.127Dummy for Year 1995 -0.259 0.074 0.061 0.133Dummy for Year 1997 -0.268 0.079 0.045 0.137Dummy for Year 1999 -0.210 0.083 0.074 0.141Dummy for Year 2001 -0.224 0.088 0.118 0.146Dummy for Cohort 2 0.137 0.149 0.516 0.258Dummy for Cohort 3 0.259 0.161 0.687 0.276Dummy for Cohort 4 0.324 0.178 0.700 0.294Dummy for Cohort 5 0.274 0.187 0.747 0.305Dummy for NLS/66 Membership 0.308 0.186 0.718 0.309Dummy for PSID Membership 0.019 0.028 0.096 0.040Potential Experience Times Dummy for Cohort 2 -0.056 0.021 -0.023 0.030Potential Experience Times Dummy for Cohort 3 -0.004 0.005 -0.019 0.009Potential Experience Times Dummy for Cohort 4 -0.006 0.006 -0.028 0.010Potential Experience Times Dummy for Cohort 5 -0.012 0.007 -0.033 0.011Potential Experience Times Dummy for Cohort 6 -0.011 0.008 -0.031 0.013
Table 7Estimates of Coefficients in the Log Income Equations in High School and College
High School College
Year Estimate Std Error Estimate Std Error Estimate Std Error1968 0.041 0.021 0.066 0.023 0.000 0.0281970 0.031 0.015 0.084 0.018 0.025 0.0241972 0.063 0.015 0.075 0.018 0.020 0.0231974 0.056 0.016 0.073 0.019 0.117 0.0231976 0.025 0.016 0.085 0.019 0.174 0.0231978 0.039 0.017 0.062 0.019 0.135 0.0211980 0.034 0.015 0.058 0.018 0.197 0.0191982 0.045 0.023 0.069 0.022 0.311 0.0211984 0.048 0.022 0.078 0.023 0.284 0.0201986 0.039 0.020 0.100 0.023 0.267 0.0201988 0.040 0.020 0.088 0.023 0.217 0.0201990 0.056 0.022 0.070 0.026 0.238 0.0211992 0.076 0.023 0.059 0.027 0.216 0.0221996 0.047 0.024 0.094 0.028 0.210 0.0222000 0.045 0.024 0.092 0.028 0.229 0.022
0.046
Year Estimate Std Error Estimate Std Error Estimate Std Error1968 0.090 0.037 0.086 0.044 -0.130 0.0511970 0.062 0.026 0.095 0.031 -0.095 0.0381972 0.028 0.024 0.094 0.025 -0.017 0.0341974 0.052 0.024 0.102 0.025 -0.032 0.0311976 0.032 0.024 0.063 0.026 0.041 0.0321978 0.067 0.025 0.050 0.025 -0.060 0.0301980 0.015 0.024 0.069 0.024 0.031 0.0281982 0.016 0.026 0.073 0.024 -0.019 0.0271984 0.053 0.031 0.112 0.028 -0.128 0.0301986 0.082 0.029 0.142 0.025 -0.059 0.0321988 0.131 0.029 0.110 0.027 0.126 0.0321990 0.137 0.033 0.115 0.031 0.092 0.0351992 0.142 0.035 0.086 0.033 0.063 0.0351996 0.111 0.039 0.083 0.035 0.074 0.0382000 0.129 0.039 0.115 0.035 0.061 0.038
High School
Estimates of Returns to Cognitive Ability, Noncognitive Ability, and Factor Loadings of Unobserved Heterogeneity in High School and College
Table 8
CollegeCognitive Ability Noncognitive Ability Unobs. Heterogeneity
Cognitive Ability Noncognitive Ability Unobs. Heterogeneity
Cohort 1935 - 1945 Cohort 1955 - 1965High School $2,705 $3,421College $9,411 $13,723
Table 9AMonetary Value of Cognitive Skills
Cohort 1935 - 1945 Cohort 1955 - 1965High School $5,604 $5,887College $9,532 $13,332
Table 9BMonetary Value of Noncognitive Skills