BLUP for genetic evaluation of plants in non-inbred families of annual crops

10
This article was published in the above mentioned Springer issue. The material, including all portions thereof, is protected by copyright; all rights are held exclusively by Springer Science + Business Media. The material is for personal use only; commercial use is not permitted. Unauthorized reproduction, transfer and/or use may be a violation of criminal as well as civil law. ISSN 0014-2336, Volume 174, Number 1

Transcript of BLUP for genetic evaluation of plants in non-inbred families of annual crops

This article was published in the above mentioned Springer issue.The material, including all portions thereof, is protected by copyright;all rights are held exclusively by Springer Science + Business Media.

The material is for personal use only;commercial use is not permitted.

Unauthorized reproduction, transfer and/or usemay be a violation of criminal as well as civil law.

ISSN 0014-2336, Volume 174, Number 1

BLUP for genetic evaluation of plants in non-inbred familiesof annual crops

Jose Marcelo Soriano Viana • Isis Fernanda de Almeida •

Marcos Deon Vilela de Resende • Vinıcius Ribeiro Faria •

Fabyano Fonseca e Silva

Received: 30 September 2009 / Accepted: 23 November 2009 / Published online: 5 December 2009

� Springer Science+Business Media B.V. 2009

Abstract Best Linear Unbiased Prediction (BLUP)

has become the most widely used method for genetic

assessment of animal and perennial species, and it is

potentially relevant for annual crops. The objective of

this study was to assess this method for selection

within non-inbred families in recurrent breeding

programs. The ‘animal model’ was fitted. The data

were expansion volume (EV) and grain yield of

plants in recombination plots of two to three selection

cycles in the popcorn population Vicosa, with half-

and full-sib progenies. The ASReml program was

used to perform the analyses. For both EV and yield

the breeding values predicted from the additive and

additive-dominant models were highly correlated.

Multi-generation BLUP was, in general, more accu-

rate than single-generation analysis. These two

methods resulted in highly correlated predicted

breeding values. The dominance genetic values

predicted from the single- and multi-generation

analysis were also highly correlated. The pedigree

information reduced the percentage of coincidences

among the selected individuals relative to phenotypic

selection mainly in the population structured in half-

sib families. Based on breeding values predicted by

BLUP analysis, the most efficient selection procedure

was mass selection.

Keywords Mixed model � Animal model �Recurrent selection

Introduction

Recurrent selection aims for a continuous increase in

favorable gene frequency. Regardless of the selection

unit, its efficiency depends on accurate identification

of the individuals with the best additive genetic value.

A commonly used methodology in animal and

perennial species breeding for predicting additive

genetic values of individuals under selection is the

Best Linear Unbiased Prediction, which was pro-

posed by Henderson (1974). Ideally, predictions are

made taking into account the relationship among

individuals, using estimates of the variance compo-

nents of the model (genetic and non-genetic), with

correction for fixed effects.

Regarding the use of BLUP in annual crops, the

most relevant application until recently was prediction

of the performance of non-assessed hybrids, proposed

J. M. S. Viana (&) � I. F. de Almeida � V. R. Faria

Departamento de Biologia Geral, Universidade Federal de

Vicosa, Vicosa, MG 36570-000, Brazil

e-mail: [email protected]

M. D. V. de Resende

Departamento de Engenharia Florestal, Embrapa CNPF;

Universidade Federal de Vicosa, Vicosa, MG 36570-000,

Brazil

F. Fonseca e Silva

Departamento de Informatica, Universidade Federal de

Vicosa, Vicosa, MG 36570-000, Brazil

123

Euphytica (2010) 174:31–39

DOI 10.1007/s10681-009-0094-2

Author's personal copy

by Bernardo (1996), which is possible only when

sufficient relationship information among inbred lines

is available. In his study with 4,099 hybrids and 16

combinations among nine heterotic corn groups

assessed in up to 15 locations, Bernardo found

estimates of the correlation between predicted value

and phenotypic performance for grain yield ranging

from 0.49 to 0.70, with a tendency toward higher

estimates for groups with a greater number of assessed

hybrids. According to this author, these values can be

considered moderately high, and it was suggested that

BLUP might be efficient for identifying non-per-

formed crosses with superior performance.

Other relevant applications common in animal and

perennial species breeding have been incorporated

during the last 5 years in the breeding programs of

annual allogamous and autogamous crops. Piepho

et al. (2008) reviewed the use of the BLUP method-

ology in plant breeding, showing its relevance even

without pedigree information. The applications

included genetic assessment of individuals (additive

genetic value prediction) and of pure and inbred lines

(prediction of genotypic values and the effects of

general combining ability) with and without pedigree

information in single and multi-trait analyses. Flache-

necker (2006a, b) used BLUP to predict the mean

additive genetic value of parents of full-sib families of

a corn population. This permitted satisfactory predic-

tions of the performance of the progeny in the

subsequent cycle for grain moisture and/or the

selection index trait, calculated as a function of grain

yield and dry matter content. From an analysis of

wheat genotypes in several environments, Crossa

et al. (2006) concluded that the prediction of breeding

value by BLUP with pedigree information was more

precise than traditional analyses assuming fixed and

random effects. Reis et al. (2005) used BLUP in

combination with the bootstrap technique in a diallelic

analysis of inbred corn lines. They reported that the

correlation between the predicted and observed mean

yields of the hybrids decreased with a reduction in the

number of parents, and they emphasized that the

decrease was less pronounced with the use of BLUP

compared to the BLUE (fixed model) method.

The value of pedigree information was investi-

gated by Nunes et al. (2008) using simulation. They

concluded that inclusion of relationship information

resulted in higher predicted gains. Bauer et al. (2006)

used molecular marker information to calculate the

relationship matrix and concluded that the BLUP

methodology could be used in inbred line selection.

The same conclusion was reported by Sebolai et al.

(2005) from a simulation study of selection of pure

lines. In another simulation study, Bauer and Leon

(2008) analyzed the efficiency of superior line

selection by multi-trait BLUP. The results showed

that this analysis was superior to single-trait analysis

for identifying superior lines and prediction accuracy.

Kruuk (2004) used the ‘‘animal model’’ to estimate

genetic parameters of natural populations and con-

cluded that the BLUP methodology permitted the

most efficient use of the available data in addition to

giving greater statistical power to the tests.

In order to show that this method can be used regu-

larly by plant breeders of annual crops, the objective of

this study was to assess its use in the selection of plants

in non-inbred families in recurrent breeding programs.

Materials and methods

Theoretical aspects

The selection efficiency in recurrent intrapopulation

breeding programs is proportional to the efficiency of

identifying the parents with superior additive genetic

values, because the other components of the genetic

value are not correlated with the number of favorable

genes present. Although the genotypic value (G) and

the additive genetic value (A) are, in general, both

positively correlated with the number of favorable

genes (N), the correlation of A with N is usually

greater or equal (in the absence of dominance) to the

correlation of G with N (Viana 2005).

Assuming an absence of epistasis we have

G ¼ Mþ Aþ D

where M is the population mean and D is the

dominance genetic value.

Thus, the phenotypic value of an individual can be

defined as

P ¼ Mþ Aþ DþX

k

ðfixed effectÞk

þX

l

ðnon-genetic random effectÞl þ error

In matrix notation, assuming no other non-genetic

random effects,

32 Euphytica (2010) 174:31–39

123

Author's personal copy

y ¼ Xbþ Zu1 þ Zu2 þ e

where y is the vector of phenotypic values, X is the

incidence matrix of fixed effects, b is the vector of

the effects of the levels of each fixed factor (including

the population mean), Z is the incidence matrix of

random effects, u1 is the vector of the additive genetic

values, u2 is the vector of the dominance genetic

values and e is the residuals vector. This model,

called by animal breeders as the ‘animal model’, is

appropriate for prediction of genetic values of the

measured individuals, which are the selection units.

In plant breeding the appropriate term is ‘individual

model’. The criterion for obtaining the best linear

unbiased predictor of a random vector is the maxi-

mization of the joint probability density function of y

and the random vectors (Henderson 1974), obtaining

(under normality) the mixed model equations.

Assuming a non-inbred population (Cov(u1, u2)

equal to a null vector) and that there is no correlation

among the errors (Var(e) = R = Ir2), the mixed

model equations are

X0X X0Z X0ZZ0X Z0Zþ r2G�1

1 Z0ZZ0X Z0Z Z0Zþ r2G�1

2

2

4

3

5bo

~u1

~u2

2

4

3

5

¼X0yZ0yZ0y

24

35

where G1 ¼ r2AAðnÞ and G2 ¼ r2

DDðnÞ are the variance

matrices of the vectors of additive and dominance

genetic values. A(n) and D(n) are the additive and

dominance relationship matrices, given by

AðnÞ ¼

2r11 2r12 � � � 2r1n

2r12 2r22 � � � 2r2n

� � � � � � . ..

� � �2r1n 2r2n � � � 2rnn

266664

377775

and

DðnÞ ¼

u11 u12 � � � u1n

u12 u22 � � � u2n

� � � � � � . ..

� � �u1n u2n � � � unn

266664

377775

where rij is the coefficient of coancestry between

individuals i and j, and uij ¼ uðmfÞðm0f 0Þ ¼ rmm0 rff0 þrmf0 rfm0 is the probability of individuals i and j having

genotypes identical by descent, where m and f are the

parents of i, and m0 and f0 are the parents of j (Mrode

2005).

The fit of the model by an iterative process gives

estimates of the variance components, a prediction of

each random vector and a solution for the fixed

effects vector. The process most used to estimate the

components of variance is restricted/residual maxi-

mum likelihood (REML), which was developed by

Patterson and Thompson (1971). Alternatively, the

maximum likelihood (ML) method, the minimum

variance quadratic unbiased estimation (MIVQUE)

method and Henderson’s method III (Littell et al.

2006) can be used.

Although the model described suggests that it is

always possible to predict the additive and domi-

nance genetic values, this unfortunately is not always

true. For example, in the case of annual allogamous

species such as corn, if the population in the

recombination plot is structured in half-sib progenies,

the additive-dominant model cannot be fitted because

the dominance relationship matrix is an identity in

each cycle. Thus, as G2 ¼ r2DI and R ¼ r2I there is

no convergence in the REML process. This is not

necessarily true in the case of animal and perennial

species. If the population is structured in full-sib

progenies, even if the dominance relationship matrix

is not an identity, there is no guarantee that the

additive-dominant model can be fitted. In general, for

the additive-dominant model to be fitted there must

be at least two different covariances among relatives

that are functions of the additive variance and of

dominance variance.

The prediction accuracy of the additive genetic

values is

rA~A ¼ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi

1� PEV

~r2A

� �s

where PEV is the prediction error variance and ~r2A is

the restricted maximum likelihood estimate of the

additive variance (Mrode 2005, pp. 50–51).

Two selection strategies based on EV were

considered in each cycle: selection of 9–10 plants

in each one of the 20 families selected based on the

progeny test (totaling 196 individuals) and selection

of 196 plants (mass selection). The direct and indirect

genetic gains were calculated based on the parental

control (c = 1/2 with selection within half-sib family

and c = 1 with selection within full-sib family) and

Euphytica (2010) 174:31–39 33

123

Author's personal copy

the additive genetic values of the selected individuals

(~As) by the function

DMp ¼ c1

ns

X

i

~Ais

!

where ns is the number of selected parents (196

females with selection within half-sib family, and 196

females and up to 196 distinct males with selection

within full-sib progeny). If the population is struc-

tured in full-sib families, this function is biased when

there are allelic and non-allelic gene interaction

(dominance and epistasis). Assuming Hardy–Wein-

berg equilibrium and absence of epistasis, the mean

of the population obtained after selection can be

expressed as (Viana et al. 2009)

M1 ¼Mþ ð1=2ÞEðAsÞ; if the population is

structured in half-sib progenies, or

M1 ¼Mþ Eð�AsÞ þ Eð�DosÞ; if the population is

structured in full-sib families

where M is the mean of the base population, EðAsÞ is

the expectation of the additive genetic values of the

selected individuals, Eð�AsÞ is the expectation of the

average additive genetic values of the selected full-

sib family parents, and Eð�DosÞ is the expectation of

the average dominance genetic values of the individ-

uals in selected full-sib families. The offspring

average dominance genetic value is a function of

the allelic frequencies and d, the dominance deviation

(difference between the genotypic value of the

heterozygote and m, the mean of the genotypic

values of the homozygotes), for each gene.

The prediction function is also obtained by an

equivalent derivation of that used by Eberhart (1970),

assuming that the predicted gain is proportional to a

selection differential based on predicted breeding

values (DMp ¼ bð�AS � �AÞ). Considering that the

coefficient of proportionality is the regression coef-

ficient of the genotypic value of half-sib/full-sib

family in the improved population as a function of the

additive genetic value of its common parent (selected

individual)/average additive genetic value of the

parents, we have

b ¼ CovðAs or �As; �GoÞVðAs or �AsÞ

Assuming irrelevant changes in gene frequen-

cies, b = 1/2 in the population structured in half-

sib families or b = 1 in the case of full-sib

progenies.

For comparison the genetic gains were also com-

puted from the function proposed by Cullis et al.

(2006), including the parental control. The indirect

expected genetic gain in relation to trait Y with

selection based on trait X is

EGGYðXÞ ¼ c � i � qgX;YrgY

hX

where i is the selection intensity, qgX;Yis the genetic

correlation, r2gY

is the genotypic variance and h2X is

the generalized measure of heritability (Cullis et al.

2006).

Experimental data

Were analyzed the phenotypic values of expansion

volume (EV; ml/g) and grain yield (g/0.18 m2) of

plants in the recombination plots (areas of crosses

between selected families) of cycles 1, 2 and 3 of the

selection program with half-sib families, and of the

cycles 1 and 2 of the selection program with full-sib

families, both with the popcorn population Vicosa.

The two breeding programs aim the development of

superior open pollinated populations. Each cycle

consisted of evaluation of 196 families in a repli-

cated trial, selection of the best 20 progenies,

recombination of these families in isolated area

(recombination plot), using remnant seeds, and

selection of 196 superior plants for the next cycle.

The tests were designed as a 14 9 14 simple lattice

and carried out in the experimental station of the

Federal University of Vicosa (UFV) in Coimbra,

Minas Gerais (MG), Brazil, in the 1999/2000, 2001/

2002, 2002/2003, 2003/2004 and 2004/2005 growing

seasons. They are simultaneously conducted to offer

experimental data appropriate to studies on Quanti-

tative Genetics and Statistical Genomics (compara-

tive analysis of genetic parameters and selection

efficiency, influence of the effective population size

on genetic variability, genetic evaluation based on

BLUP, marker assisted selection based on linkage

disequilibrium (LD) between molecular markers and

QTL, among others).

The recombination plots with half-sib progenies

were conducted at the experimental station of the

UFV in Ponte Nova-MG, Brazil, in the 2000/2001,

2002/2003 and 2004/2005 growing seasons. The

34 Euphytica (2010) 174:31–39

123

Author's personal copy

twenty selected families were randomly planted inter-

calating one row of male plants to every four rows of

female plants. Two rows were also planted with the

single hybrid IAC 112 (control), developed by the

Agronomic Institute of Campinas (IAC). Each row

was 20 m long with up to 100 plants. The rows of

females and the control were detasseled at flowering.

The recombination plots with full-sib progenies were

conducted in an experimental field at UFV in Vicosa-

MG in the 2003/2004 and 2005/2006 growing seasons.

The 20 selected families were randomly planted in

rows with 10 m long with up to 50 plants. Crosses were

controlled at flowering so that at least 19 full-sib

families were obtained for each progeny (one male

from each of the other 19 families). The plants used as

males were not measured. In the analyses they were

inserted in the pedigree to permit prediction of their

breeding values. The prediction is possible only for

those with pedigree information.

In the half-sib family recombination plots, 20

plants were measured for each progeny and 5–10

plants in each row of the control. In the full-sib

family recombination plots, 8–31 plants were mea-

sured for each family, totaling 324 and 390 plants.

The EV was assessed in a hot air popcorn machine

(1,200 w) using 10 g samples.

Statistical analysis

The software ASReml v2.00 (Gilmour et al. 2006)

was used for the analyses. In the analyses with data

from the half-sib family recombination plots the IAC

112 was inserted in the pedigree as an individual with

unknown parents. In the multi-generation BLUP

model (see the ASReml code in the Appendix), since

the single hybrid IAC 112 is not a genotype from the

base population, we inserted a fixed effect of

population. Once we grouped populations structured

in half- and full-sib families, from two different

selection cycles, conducted in two locals and in

4 years, to be parsimonious we define a second fixed

factor named environment (the environment effect

includes confounded effects of population structure,

cycle, local and year). It is important to note that the

estimation of additive and residual variance in the

single generation analysis of the population struc-

tured in full-sib families was possible since G1 ¼r2

AA 6¼ R ¼ r2I in each cycle.

Results

The most probable application of the BLUP method-

ology, by breeders of annual allogamous plants,

would be estimation of genetic parameters and

prediction of the breeding values of the individuals

under selection for each cycle. The tests showed that

there is genetic variability for EV and grain yield in

the populations regardless of their family structure

(Tables 1, 2). The test relative to EV in cycle 1 of the

full-sib family population is associated with a prob-

ability value of 17%. For both traits, the estimates of

Table 1 Parameters estimated by BLUP/REML, population and IAC 112 means, direct gain in EV (ml/g) and indirect gain in yield

(g/0.18 m2) in the half-sib family population

Parameters Cycle 1 Cycle 2 Cycle 3

EV Yield EV Yield EV Yield

Additive var.

(conf. interval)

19.12570.0000

(11.76–26.49)

646.190.0060

(253.87–1038.52)

12.29400.0423

(0–26.27)

519.600.0001

(247.41–791.78)

13.08070.0000

(8.21–17.95)

388.600.0495

(0–850.43)

Residual var. 8.1768 327.57 14.8737 281.51 9.9519 668.15

Heritabilitya 0.7239 0.6932 0.2881 0.3717 0.2015 0.1525

Genetic corr. -0.1485 -0.0779 0.7972

Pop. mean 29.52 89.66 34.78 84.30 34.42 91.07

Gain w/mass sel. 1.40 0.48 0.98 0.86 1.20 0.29

Gain w/within sel. 1.15 0.29 0.77 1.31 0.78 -0.15

Gain w/sel.a 2.73 -2.4 1.50 -0.76 1.43 6.19

Mean of IAC 112 38.00 113.68 37.90 89.60 41.90 66.17

a From Cullis et al. (2006)

Euphytica (2010) 174:31–39 35

123

Author's personal copy

the additive genetic variance were greater in the half-

sib progeny population. The estimates of genetic

correlation between EV and yield were consistent in

sign in the two populations for cycles 1 and 2,

although the magnitudes were greater in the full-sib

family population (Tables 1, 2). However, selection

based on quality should not imply a significant

change in grain yield. The first cycle of among and

within selection based on EV reduced the magnitude

of the genetic correlation in the two populations. In

the third cycle in the population structured in half-sib

families the estimate was closer to 1.

An alternative approach is the multi-generation

BLUP. The main advantage is the increment in the

prediction accuracy. For both populations in general

an increase in accuracy compared to the single

generation analysis for EV and yield was observed

with the BLUP multi-generation method (Table 3).

The increase in accuracy ranges from 8.9 to 42.6%.

For both traits, in cycles 2 and 3 of the half-sib

progeny population, the correlations among the

additive genetic values predicted by single and

multi-generation BLUP were high, with estimates

close to 1 (0.985 and 0.935, and 0.996 and 0.963,

respectively). In the full-sib family population the

correlations were also high (0.975 and 0.984).

A general apparent limitation of these approaches,

relative to annual crops, is the impossibility of fitting

the additive-dominant model in a population struc-

tured in half-sib families. In relation to a population

structured in full-sib families it is possible to fit this

model, depending on the cycle (in the first cycle this

is not possible) or depending on the number of cycles

considered in the multi-generation analysis. Fortu-

nately this is not a limitation. As there is one base

population and given the equivalence (in number of

selection cycles) among the cycles 1 and 2 of the

half- and full-sib populations, it was possible to fit the

complete model, combining the populations

(Table 4). For yield, the additive-dominant model

was fitted only by multi-generation analysis. In the

analyses by cycle the REML process produced

negative estimates of the dominance variance. The

estimates of genetic parameters are in general com-

parable with the values obtained from the additive

model. The most important results, however, are the

correlations between breeding values predicted from

the additive model and the values predicted based on

Table 2 Parameters

estimated by BLUP/REML,

population mean, direct

gain in EV (ml/g) and

indirect gain in yield

(g/0.18 m2) in the full-sib

family population

a From Cullis et al. (2006)

Parameters Cycle 1 Cycle 2

EV Yield EV Yield

Additive var.

(conf. interval)

2.81360.1703

(0–8.60)

92.59540.0152

(8.78–176.41)

2.40720.0650

(0–5.52)

67.49010.0175

(4.71–130.27)

Residual var. 50.0469 266.5100 22.5853 197.9900

Heritabilitya 0.2008 0.4604 0.2259 0.3317

Genetic corr. -0.2098 -0.1457

Pop. mean 30.31 41.22 31.94 35.94

Gain w/mass sel. 0.31 0.16 0.34 0.20

Gain w/within sel. 0.19 0.24 0.17 0.60

Gain w/sel.a 1.05 -1.27 1.06 -0.80

Table 3 Accuracy of the BLUP/REML analyses

Analysis Half-sib families Full-sib families

EV Yield EV Yield

Cycle 1 Cycle 2 Cycle 3 Cycle 1 Cycle 2 Cycle 3 Cycle 1 Cycle 2 Cycle 1 Cycle 2

Single-generation BLUP 0.82 0.54 0.76 0.77 0.78 0.42 0.42 0.45 0.63 0.63

Multi-generation BLUP (c1 ? c2) 0.77 0.56 0.49 0.69

Multi-generation BLUP

(c1 ? c2 ? c3)

0.71 0.52

36 Euphytica (2010) 174:31–39

123

Author's personal copy

the complete model (Table 5). For both EV and yield

the predicted breeding values were highly correlated.

The dominance genetic values predicted from the

single- and multi-generation analysis were also

highly correlated. For EV the correlations between

values predicted from the analysis based on the two

cycles and those predicted from the analysis from

cycles 1 and 2 were 0.9980 and 0.9982, respectively.

When selecting for EV in each generation based

on the additive genetic values from the BLUP single

generation analysis, greater predicted gains were

observed in both population with mass selection,

relative to within progeny selection (Tables 1, 2). In

the population structured in half-sib progenies the

increase in predicted gain for EV with mass selection

relative to within family selection ranged from 22 to

61%. In that structured in full-sib families, the

increments were 63 and 100%. In both populations

the predicted gains for EV with mass selection were

small, ranging from 1.0 to 4.7%. The indirect

predicted gains for yield ranged from 0.3 to 1%.

Equivalent results were obtained by Matta and Viana

(2003) in cycle 1 of the popcorn population Beija-

Flor, using BLUE/Henderson’s method III. The

predicted gains for EV were 0.53 and 0.35 ml/g with

mass selection and within family selection, respec-

tively. The limitation of mass selection is a reduction

in the effective population size. In general, in three to

six families none to five plants were selected and in

four to five progenies more than 14 plants were

selected. In the case of within family selection 9–10

plants would be selected in each progeny. The

expected genetic gains in EV computed as proposed

by Cullis et al. (2006) were greater than those based

Table 4 Parameters estimated by BLUP/REML in the population structured in half- and full-sib families, relative to EV (ml/g) and

yield (g/0.18 m2)

Parameters Cycle 1 Cycle 2 Cycles 1 and 2

EV Yield EV Yield EV Yield

Additive var.

(conf. interval)

5.30480.0089

(0.92–9.69)

589.45209.5E-13

(425.39–753.51)

4.59450.0086

(0.81–8.38)

489.04800.0000

(370.18–607.91)

4.91400.0006

(1.93–7.90)

281.81704.7E-7

(169.13–394.51)

Dominance var.

(conf. interval)

25.11931.2E-12

(18.09–32.14)

– 13.30180.0008

(5.01–21.59)

– 19.10831.4E-13

(13.98–24.23)

93.56460.1232

(0–251.75)

Residual var. 7.8947 190.6960 8.1000 124.1920 7.9607 261.0730

Accuracy 0.51 0.87 0.56 0.89 0.56 0.73

Heritabilitya 0.2742 0.7551 0.2114 0.4681 0.2799 0.5434

Additive corr. -0.0924

Dominance corr. -0.0980

Gain w/sel.a 1.06 0.86 1.03 -0.72b

a From Cullis et al. (2006)b Indirect gain

Table 5 Correlations

between breeding values

predicted by the additive

and additive-dominant

models, relative to EV

(ml/g) and yield (g/0.18 m2)

a Additive model for yield

Pop. structure Half- and full-sib

fam., cycle 1, add.

-dom. modela

Half- and full-sib

fam., cycle 2, add.

-dom. modela

Half- and full-sib fam.,

two cycles, add.-dom.

model

Half-sib fam., cycle 1,

additive model

EV 0.8676 0.8585

Yield 0.9986 0.9397

Half-sib fam., cycle 2,

additive model

EV 0.9556 0.9268

Yield 0.9951 0.9813

Full-sib fam., cycle 1,

additive model

EV 0.9905 0.9630

Yield 0.8600 0.9636

Full-sib fam., cycle 2,

additive model

EV 0.9888 0.9747

Yield 0.8281 0.9595

Euphytica (2010) 174:31–39 37

123

Author's personal copy

on the predicted breeding values. The predicted

indirect gains in yield were not concordant in

magnitude and/or sign.

In the half-sib progeny population, coincidence

involving the 196 individuals selected by mass selec-

tion using BLUP single generation analysis and

phenotypic selection decreased over the cycles. In

the BLUP analyses, 85.7, 77.5 and 53.1% of the

individuals selected in the first, second and third cycles

were also indicated as superior by phenotypic selec-

tion. Therefore, as the pedigree information was added

to the BLUP individual analyses, the coincidence

decreased between the methods. When selection was

based on predictions from the multi-generation anal-

ysis, there were slightly higher coincidence values

with phenotypic selection, of 83.7 and 58.2% for the

second and third cycles, respectively. In the population

structured in full-sib families, the coincidence among

the individuals selected by mass selection using

individual BLUP analyses and phenotypic selection

also decreased from the first to the second cycle, from

85.7 to 82.6%. In contrast, an increase in coincidence

between phenotypic selection and selection based on

multi-generation BLUP analysis was observed, from

70.0 to 98.0% in the two selection cycles.

Discussion

The most important aspect that should be discussed

when using the BLUP method through the animal

model is the fit of the additive model, even when it is

known that the expression of the trait depends on

interactions between alleles (dominance). This, how-

ever, is never taken into consideration in many

applied and simulated studies, probably because there

is a consensus among breeders that the methodology

adequately predicts the true additive genetic values

that are generally more accurate than those obtained

by other methods (Kumar et al. 2008). This was

demonstrated in our study. Simulation studies by

Leclerc et al. (2008) and Stock et al. (2007) involving

continuous and categorical traits and molecular data

in QTL analyses have shown the great efficiency and

usefulness of the BLUP methodology, even with

multi-trait and random regression analyses.

There is also a consensus that the technique offers

adequate prediction of additive genetic values when

there is generation overlay and/or partial or total lack

of pedigree information and selection effect, whether

by man or natural. Kruuk (2004) indicated that the

BLUP/REML methodology is a powerful way to

estimate the components of variance and to predict

additive genetic values in natural populations. But

Cantet et al. (2000), in a simulation study, reported

underestimation of heritability in the population

under selection, because the relationship information

was only partially available and the existence of

selection makes heritability biased.

There is no doubt, however, that the fit of the

additive-dominance model is important in combining

ability studies, especially when there is interest in

predicting non-performed crosses (Bernardo 1996).

In the study by Flacheneckar et al. (2006a, b), where

the additive model was fitted, the prediction of the

performance of the full-sib families by the mean

additive genetic value of the parents was not satis-

factory for yield, but it was satisfactory for a less

important characteristic (grain moisture) and for a

second characteristic called the selection index

(which is a function of the yield and dry matter

content), probably because it not does not depend

greatly on dominance effects.

Although the results obtained in many theoretical

and applied studies have systematically shown the

superiority of the BLUP method compared to BLUE,

because of the greater accuracy of the predicted

values compared to the phenotypic values, the

difference between the different selection methods

is not always emphasized (Kumar et al. 2008),

especially when the data are balanced and from only

one generation (Knott et al. 1995).

Acknowledgements We thank the Foundation for Research

Support of Minas Gerais State (Fapemig), the Brazilian Federal

Agency for Support and Evaluation of Graduate Education

(Capes) and the National Council for Scientific and

Technological Development (CNPq) for financial support.

Appendix

ASReml code for the additive-dominant model

!Workspace 200

BLUP analysis, population structured in half- and

full-sib families, two cycles

38 Euphytica (2010) 174:31–39

123

Author's personal copy

a 1535 !p # additive value

d 1535 # dominance value

pop 2 # population

env 4 # environment

ev # expansion volume

y # grain yield

z ! = ev ! ? y # to permit estimation of gen./

non gen. corr./covariance

ped.txt # pedigree file

Amatrix.grm # additive relationship matrix

Dmatrix.grm # dominance relationship matrix

data.asd # data file

!ddf # correction of df by the Kenward-Roger

method

!continue # to continue REML process

!maxit 30 # maximum number of interactions

ev or y or z * mu pop env pop.env !r a d

predict a !vpv # necessary to compute generalized

measure of heritability

The additive and dominance relationship matrices

files were generated by a program developed in

REALbasic 5.5. To read the prediction error variance

matrix in the file .pvs another REALbasic 5.5 program

was developed.

References

Bauer AM, Leon J (2008) Multiple-trait breeding values for

parental selection in self-pollinating crops. Theor Appl

Genet 116:235–242

Bauer AM, Reetz TC, Leon J (2006) Estimation of breeding

values of inbred lines using best linear unbiased predic-

tion (BLUP) and genetic similarities. Crop Sci 46:2685–

2691

Bernardo R (1996) Best linear unbiased prediction maize sin-

gle-cross performance. Crop Sci 36:50–56

Cantet RJC, Birchmeier AN, Santos-Cristal MG, de Avila VS

(2000) Comparison of restricted maximum likelihood and

Method R for estimating heritability and predicting

breeding value under selection. J Anim Sci 78:2554–2560

Crossa J, Burgueno J, Cornelius PL, McLaren G, Trethowan R,

Krishnamachari A (2006) Modeling genotype x environ-

ment interaction using additive genetic covariances of

relatives for predicting breeding values of wheat geno-

types. Crop Sci 46:1722–1733

Cullis BR, Smith AB, Coombes NE (2006) On the design of

early generation variety trials with correlated data. J Agric

Biol Environ Stat 11:381–393

Eberhart SA (1970) Factors effecting efficiencies of breeding

methods. Afr Soils 15:669–680

Flachenecker C, Frisch M, Falke KC, Melchinger AE (2006a)

Modified full-sib selection and best linear unbiased

prediction of progeny performance in a European F2

maize population. Plant Breed 125:248–253

Flachenecker C, Frisch M, Falke KC, Melchinger AE (2006b)

Trends in population parameters and best linear unbiased

prediction of progeny performance in a European F2

maize population under modified recurrent full-sib selec-

tion. Theor Appl Genet 112:483–491

Gilmour AR, Gogel BJ, Cullis BR, Thompson R (2006)

ASReml user guide release 2.0. VSN International Ltd,

Hemel Hempstead

Henderson CR (1974) General flexibility of linear model

techniques for sire evaluation. J Dairy Sci 57:963–972

Knott SA, Sibly RM, Smith RH, Moller H (1995) Maximum

likelihood estimation of genetic parameters in life-history

studies using the ‘‘animal model’’. Funct Ecol 9:122–126

Kruuk LEB (2004) Estimating genetic parameters in natural

populations using the ‘‘animal model’’. Philos Trans R

Soc Lond B Biol Sci 359:873–890

Kumar A, Gandh RS, Singh A, Haile A (2008) Comparison of

animal model with other conventional methods of sire

evaluation for milk production in Karan Fries cattle.

Indian J Anim Sci 78:1393–1396

Leclerc H, Wensch-Dorendorf M, Wensch J, Ducrocq V,

Swalve HH (2008) A general method to validate breeding

value prediction software. J Dairy Sci 91:3179–3183

Littell RC, Milliken GA, Stroup WW, Wolfinger RD, Scha-

benberger O (2006) SAS for mixed models, 2nd edn. SAS

Institute Inc., Cary

Matta FP, Viana JMS (2003) Eficiencias relativas dos processos de

selecao entre e dentro de famılias de meiosirmaos em popu-

lacao de milho-pipoca. Ciencia e Agrotecnologia 27:548–556

Mrode RA (2005) Linear models for the prediction of animal

breeding values, 2nd edn. CABI Publishing, Oxfordshire

Nunes JAR, Ramalho MAP, Ferreira DF (2008) Inclusion of

genetic relationship information in the pedigree selection

method using mixed models. Genet Mol Biol 31:73–78

Patterson HD, Thompson R (1971) Recovery of inter-block

information when block sizes are unequal. Biometrika

58:545–554

Piepho HP, Mohring J, Melchinger AE, Buchse A (2008)

BLUP for phenotypic selection in plant breeding and

variety testing. Euphytica 161:209–228

Reis AJD, Chaves LJ, Duarte JB, Brasil EM (2005) Prediction

of hybrid means from a partial circulant diallel table using

the ordinary least square and the mixed model methods.

Genet Mol Biol 28:314–320

Sebolai B, Pedersen JF, Marx DB, Boykin DL (2005) Effect of

control plot density, control plot arrangement, and

assumption of random or fixed effects on nonreplicated

experiments for germplasm screening using spatial mod-

els. Crop Sci 45:1978–1984

Stock KF, Hoeschele I, Distl O (2007) Estimation of genetic

parameters and prediction of breeding values for multi-

variate threshold and continuous data in a simulated horse

population using Gibbs sampling and residual maximum

likelihood. J Anim Breed Genet 124:308–319

Viana JMS (2005) Dominance, epistasis, heritabilities and

expected genetic gains. Genet Mol Biol 28:67–74

Viana JMS, Faria VR, Silva AC (2009) Bias in the prediction

of genetic gain due to mass and half-sib selection in

random mating populations. Genet Mol Biol 32:497–506

Euphytica (2010) 174:31–39 39

123

Author's personal copy