Variation of Female and Male Lineages in Sub-Saharan Populations: the Importance of Sociocultural...

10
Variation of Female and Male Lineages in Sub-Saharan Populations: the Importance of Sociocultural Factors Giovanni Destro-Bisol,* Francesco Donati,* Valentina Coia,* Ilaria Boschi, Fabio Verginelli,à Alessandra Caglia `, Sergio Tofanelli,§ Gabriella Spedini,* and Cristian Capelli *Department of Animal and Human Biology, University ‘‘La Sapienza,’’ Rome, Italy;  Institute of Legal Medicine, Catholic University, Rome, Italy; àDepartment of Oncology and Neurosciences and Center for Research and Training on Cancer in Sub-Saharan Africa, University G.d’Annunzio, Chieti, Italy; §Department of Ethology, Ecology and Evolution, University of Pisa, Italy In this paper, we present a study of genetic variation in sub-Saharan Africa, which is based on published and unpublished data on fast-evolving (hypervariable region 1 of mitochondrial DNA and six microsatellites of Y chromosome) and slow- evolving (haplogroup frequencies) polymorphisms of mtDNA and Y chromosome. Our study reveals a striking difference in the genetic structure of food-producer (Bantu and Sudanic speakers) and hunter-gatherer populations (Pygmies, !Kung, and Hadza). In fact, the ratio of mtDNA to Y-chromosome Nt is substantially higher in food producers than in hunter-gatherers as determined by fast-evolving polymorphisms (1.76 versus 0.11). This finding indicates that the two population groups differ substantially in female and male migration rate and/or effective size. The difference also persists when linguistically homogeneous populations are used and outlier populations are eliminated (1.78 vs 0.19) or when the jacknife procedure is applied to a paired population data set (1.32 to 7.84 versus 0.14 to 0.66). The higher ratio of mtDNA to Y-chromosome Nm in food producers than in hunter-gatherers is further confirmed by the use of slow- evolving polymorphisms (1.59 to 7.91 versus 0.12 to 0.35). To explain these results, we propose a model that integrates demographic and genetic aspects and incorporates ethnographic knowledge. In such a model, the asymmetric gene flow, polyginy, and patrilocality play an important role in differentiating the genetic structure of sub-Saharan populations. The existence of an asymmetric gene flow is supported by the phylogeographic features of mtDNA and Y-chromosome haplogroups found in the two population groups. The role of polyginy and patrilocality is sustained by the evidence of a differential pressure of genetic drift and gene flow on maternal and paternal lineages of food producers and hunter- gatherers that is revealed through the analysis of mitochondrial and Y-chromosomal intrapopulational variation. Introduction The polymorphisms of mitochondrial DNA (mtDNA) and Y chromosome have opened up new perspectives in the study of genetic variation. This finding is made possible through three key features shared by the two genetic systems. First, they allow us to study maternal and paternal lineages separately because of their unilinear transmission. Second, the substantial lack of recombination allows mutations to accumulate along lines of descent. This feature makes mtDNA and Y chromosome much more powerful tools for phylogeographical analyses than autosomes. Third, polymorphisms can be used with comparable modes and rates of evolution. The fast evolving sites of the hypervariable region 1 (HVR-1) of mtDNA and the Y-chromosomal microsatellites on one side and the slow-evolving mtDNA and Y-chromosome sites that define haplogroups on the other side, indeed have comparable evolutionary rates. This condition makes minimizing the confounding effect of a differential muta- tional dynamic between the two genetic systems possible. The study by Seielstad, Minch, and Cavalli-Sforza (1998) may be regarded as the pioneer study on the relations between genetic variation of unilinearly transmitted poly- morphisms and sociocultural factors. In this worldwide analysis of mtDNA and Y-chromosome polymorphisms, the authors observed that the interpopulation differentiation was much higher for the Y chromosome than for mtDNA. Applying an island migration model (Cavalli-Sforza and Bodmer 1971), they obtained a ratio between female and male Nt—which approximates to the product of the effective population size per migration rate—close to 8 (7.96). According to Seielstad, Minch, and Cavalli-Sforza (1998), the higher female to male migration rate associated with the widespread habit of patrilocality may account for most of this difference. The study paved the way for further investigations that substantially confirmed the importance of the social structure in shaping the genetic variation of human populations (e.g. Oota et al. 2001). In a study recently published in Molecular Biology and Evolution, Hammer et al. (2001) analyzed 43 biallelic polymorphisms of the nonrecombining portion of the Y chromosome in a total of 50 human populations that included five populations from sub-Saharan Africa (Bag- andans, East Bantus, Gambians, Khoisan, and Pygmies). These authors obtained a st of 0.251 for sub-Saharan Africans, a value that is sligthly smaller than that obtained for Asians (0.271). They also noted that this finding contrasts with previous estimates for mtDNA (Melton et al. 1997), in which the /st value for sub-Saharan Africans (0.339) largely exceeds those of Europeans (0.045) and Asians (0.009). Hammer et al. (2001) suggested that the relatively small interpopulational variation of Y chromo- some among sub-Saharans could be caused by a male- biased gene flow during the expansion of populations speaking Bantu languages. This interpretation is supported by the widespread distribution of haplotype 15 in Bantu- speaking populations (Hg E3a according to the nomen- clature suggested by the Y Chromosome Consortium Key words: sub-Saharan Africa, food producers, hunter-gatherers, mtDNA, Y chromosome, sociocultural factors. E-mail: [email protected]. Mol. Biol. Evol. 21(9):1673–1682. 2004 doi:10.1093/molbev/msh186 Advance Access publication June 9, 2004 Molecular Biology and Evolution vol. 21 no. 9 Ó Society for Molecular Biology and Evolution 2004; all rights reserved. by guest on May 19, 2014 http://mbe.oxfordjournals.org/ Downloaded from

Transcript of Variation of Female and Male Lineages in Sub-Saharan Populations: the Importance of Sociocultural...

Variation of Female and Male Lineages in Sub-Saharan Populations:the Importance of Sociocultural Factors

Giovanni Destro-Bisol,* Francesco Donati,* Valentina Coia,* Ilaria Boschi,�Fabio Verginelli,� Alessandra Caglia,� Sergio Tofanelli,§ Gabriella Spedini,*and Cristian Capelli�*Department of Animal and Human Biology, University ‘‘La Sapienza,’’ Rome, Italy; �Institute of Legal Medicine,Catholic University, Rome, Italy; �Department of Oncology and Neurosciences and Center for Research andTraining on Cancer in Sub-Saharan Africa, University G.d’Annunzio, Chieti, Italy; §Department of Ethology,Ecology and Evolution, University of Pisa, Italy

In this paper, we present a study of genetic variation in sub-Saharan Africa, which is based on published and unpublisheddata on fast-evolving (hypervariable region 1 of mitochondrial DNA and six microsatellites of Y chromosome) and slow-evolving (haplogroup frequencies) polymorphisms of mtDNA and Y chromosome. Our study reveals a strikingdifference in the genetic structure of food-producer (Bantu and Sudanic speakers) and hunter-gatherer populations(Pygmies, !Kung, and Hadza). In fact, the ratio of mtDNA to Y-chromosome Nt is substantially higher in food producersthan in hunter-gatherers as determined by fast-evolving polymorphisms (1.76 versus 0.11). This finding indicates that thetwo population groups differ substantially in female and male migration rate and/or effective size. The difference alsopersists when linguistically homogeneous populations are used and outlier populations are eliminated (1.78 vs 0.19) orwhen the jacknife procedure is applied to a paired population data set (1.32 to 7.84 versus 0.14 to 0.66). The higher ratioof mtDNA to Y-chromosome Nm in food producers than in hunter-gatherers is further confirmed by the use of slow-evolving polymorphisms (1.59 to 7.91 versus 0.12 to 0.35). To explain these results, we propose a model that integratesdemographic and genetic aspects and incorporates ethnographic knowledge. In such a model, the asymmetric gene flow,polyginy, and patrilocality play an important role in differentiating the genetic structure of sub-Saharan populations. Theexistence of an asymmetric gene flow is supported by the phylogeographic features of mtDNA and Y-chromosomehaplogroups found in the two population groups. The role of polyginy and patrilocality is sustained by the evidence ofa differential pressure of genetic drift and gene flow on maternal and paternal lineages of food producers and hunter-gatherers that is revealed through the analysis of mitochondrial and Y-chromosomal intrapopulational variation.

Introduction

The polymorphisms of mitochondrial DNA (mtDNA)and Y chromosome have opened up new perspectives in thestudy of genetic variation. This finding is made possiblethrough three key features shared by the two geneticsystems. First, they allow us to study maternal and paternallineages separately because of their unilinear transmission.Second, the substantial lack of recombination allowsmutations to accumulate along lines of descent. Thisfeature makes mtDNA and Y chromosome much morepowerful tools for phylogeographical analyses thanautosomes. Third, polymorphisms can be used withcomparable modes and rates of evolution. The fastevolving sites of the hypervariable region 1 (HVR-1) ofmtDNA and the Y-chromosomal microsatellites on oneside and the slow-evolving mtDNA and Y-chromosomesites that define haplogroups on the other side, indeed havecomparable evolutionary rates. This condition makesminimizing the confounding effect of a differential muta-tional dynamic between the two genetic systems possible.

The study by Seielstad, Minch, and Cavalli-Sforza(1998) may be regarded as the pioneer study on the relationsbetween genetic variation of unilinearly transmitted poly-morphisms and sociocultural factors. In this worldwideanalysis of mtDNA and Y-chromosome polymorphisms,the authors observed that the interpopulation differentiation

was much higher for the Y chromosome than for mtDNA.Applying an island migration model (Cavalli-Sforza andBodmer 1971), they obtained a ratio between female andmale Nt—which approximates to the product of theeffective population size per migration rate—close to 8(7.96). According to Seielstad, Minch, and Cavalli-Sforza(1998), the higher female to male migration rate associatedwith the widespread habit of patrilocality may account formost of this difference. The study paved the way for furtherinvestigations that substantially confirmed the importanceof the social structure in shaping the genetic variation ofhuman populations (e.g. Oota et al. 2001).

In a study recently published in Molecular Biologyand Evolution, Hammer et al. (2001) analyzed 43 biallelicpolymorphisms of the nonrecombining portion of the Ychromosome in a total of 50 human populations thatincluded five populations from sub-Saharan Africa (Bag-andans, East Bantus, Gambians, Khoisan, and Pygmies).These authors obtained a �st of 0.251 for sub-SaharanAfricans, a value that is sligthly smaller than that obtainedfor Asians (0.271). They also noted that this findingcontrasts with previous estimates for mtDNA (Melton etal. 1997), in which the /st value for sub-Saharan Africans(0.339) largely exceeds those of Europeans (0.045) andAsians (0.009). Hammer et al. (2001) suggested that therelatively small interpopulational variation of Y chromo-some among sub-Saharans could be caused by a male-biased gene flow during the expansion of populationsspeaking Bantu languages. This interpretation is supportedby the widespread distribution of haplotype 15 in Bantu-speaking populations (Hg E3a according to the nomen-clature suggested by the Y Chromosome Consortium

Key words: sub-Saharan Africa, food producers, hunter-gatherers,mtDNA, Y chromosome, sociocultural factors.

E-mail: [email protected].

Mol. Biol. Evol. 21(9):1673–1682. 2004doi:10.1093/molbev/msh186Advance Access publication June 9, 2004

Molecular Biology and Evolution vol. 21 no. 9 � Society for Molecular Biology and Evolution 2004; all rights reserved.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

[2002]) and by nested cladistic analysis. Thus, accordingto Hammer et al. (2001), sub-Saharan Africa mightrepresent another case in which the genetic structure ofhuman populations has been shaped by the greatermobility of males (Mesa et al. 2000; Carvajal-Carmonaet al. 2000; Oota et al. 2001), in contrast with what wasobserved at the worldwide level (Seielstad, Minch, andCavalli-Sforza 1998; Stoneking 1998). Interestingly, theconclusion drawn by Hammer et al. (2001) also differsfrom what was observed by Seielstad, Minch, and Cavalli-Sforza. (1998) in another group of African populations.These authors analyzed 14 populations from Eastern andCentral Africa and observed that within-populationvariance for autosomal microsatellite was higher thanvariance for Y-chromosomal microsatellites. The data setincludes populations from Sudan (Beja), Ethiopia (Konso,Tsamako, Ongota, Hamar, Dasenech, Surma, Nyangatom,Bench, and Majangir), and Mali (Dogon, Peulh, Tuareg,and Songhai). According to Seielstad, Minch, and Cavalli-Sforza (1998), this evidence fits the expectation of a higherfemale than male migration rate and is consistent with thepatrilocal habit of most sub-Saharan populations.

The discrepancy between the conclusions drawn bySeielstad, Minch, and Cavalli-Sforza (1998) and Hammeret al. (2001) suggests that sub-Saharan populationsanalyzed in these two studies differ for variation ofmaternal and paternal lineages and/or for sex-linkedmigration rates. A substantial difference between the datasets used by the two studies lies in the fact that Seielstad,Minch, and Cavalli-Sforza (1998) analyzed only popula-tions that can be considered food producers (food-producerpopulations [FPPs]), that is, agriculturalists and pastor-alists, whereas Hammer et al. (2001) also surveyedpopulations with a traditional economy based on huntingand gathering (hunter-gatherer populations [HGPs], Pyg-mies and !Kung). Although based primarily on the type ofsubsistence economy, the distinction between FPPs andHGPs is important from a genetic point of view. In fact,FPPs and HGPs differ in their social structure in threespecific aspects that may have important effects onvariation of male and female lineages. First, the level ofpolyginy is higher among FPPs than among HGPs(Cavalli-Sforza 1986a). Second, exceptions to patrilocalityhave been described for HGPs but not for FPPs (Bahuchet1999; Biesele and Royal 1999). Third, socioculturalbarriers strongly influence the way in which unionsbetween individuals of the two groups take place (seebelow). Therefore, an investigation of the relationsbetween sociocultural factors and genetic variation maybe very useful to better understand the mechanisms drivingthe genetic structure of sub-Saharan populations and todefine more precisely the role of culture in determining thediversity within and between groups.

Materials and MethodsThe Database

A total of 40 populations from sub-Saharan Africawas used in this study (see tables S1–S3 in SupplementaryMaterial online). Our database also comprises unpublishedresults from three populations from Cameroon (Bakaka,

Bassa, and Fulbe; data available at http://www.scienzemfn.uniroma1.it/labantro/index.html). FPPs include a total of32 populations, mainly speakers of languages of the Niger-Kordofanian phylum (Greenberg 1963), from central andsouthern Africa (see tables S1–S3 in SupplementaryMaterialonline, for more detailed information on popula-tion location and sample size). HGPs (eight populations)include pygmies from western and eastern Africa, Hadzaand Hadzabe from Tanzania, and !Kung from southernAfrica.

Laboratory and Data Analyses

The genetic systems considered include the hyper-variable region-1 (HVR-1, from np 16024 to 16384) ofmtDNA, six Y-chromosomal microsatellites (DYS19,389I, 390, 391, 392, and 393), and haplogroups of themtDNA and Y chromosome built by use of uniqueevolutionary events. Sequencing of the HVR-1 (betweenpositions 16024 and 16384) and Y-chromosomal micro-satellite typing for Bakaka, Bassa, and Fulbe were carriedout as previously described (Caglia et al. 2003; Destro-Bisol et al. 2004). MtDNA sequences were assigned to thephylogenetic tree of Salas et al. (2002).

Parameters of within-population (haplotype diversityand mean number of pairwise comparisons), between-population, and among-population diversity (Fst and /st)were calculated by the Arlequin software (Schneider et al.1997). The Nm parameter (which incorporates effectivesize, migration, and mutation) was calculated by applica-tion of the formula Nt ¼ (1/Fst) 2 1, according to theisland model of migration for haploid systems (Cavalli-Sforza and Bodmer 1971). Because the contributionof mutation rate to the Nt parameter in our geneticsystems may be considered negligible, the fluctuations ofNt values have been assumed to be the result ofdifferences in migration rate and/or effective populationsize among populations (see also Wjisman [1984] andSeielstad, Minch, and Cavalli-Sforza [1998]). The Kimuratwo-parameter (Kimura 1980) and Rst (Slatkin 1995)methods were used to calculate the genetic distances forHVR-1 of mtDNA and Y-chromosome microsatellites,respectively. To visualize the genetic relationships amongthe groups examined, we analyzed the genetic distancematrices by the nonmetric multidimensional scalingmethod (MDS [Kruskal 1964]) in the Statistica version5.0 software.

Results

From all the data available for the hypervariableregion-1 of mtDNA (28 populations) and Y-chromosomalmicrosatellites (22 populations), we obtained a /st of0.172 for mtDNA and of 0.100 for Y chromosome (table1). When an island model of migration is applied (Cavalli-Sforza and Bodmer 1971), these values produce a ratio ofmtDNA to Y-chromosome Nt of 0.54, which may beconsequence of a migration rate and/or an effectivepopulation size that is nearly double for males. When thepopulations are divided into FPPs and HGPs, a strikingdifference emerges. In fact, among FPPs the Nt estimated

1674 Destro-Bisol et al.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

for mtDNA is 1.76 times that obtained for Y chromosome,whereas the ratio of mtDNA to Y-chromosome Nt is only0.11 among HGPs. Before attempting any interpretation ofthese initial results, we tested their robustness by threefurther analyses.

First, we analyzed pairwise genetic distances toestablish whether the divergent ratio of mtDNA to Y-chromosome Nt between HGPs and FPPs reflects trendsshared by populations within each group or, alternatively,whether it is the result of any detectable confoundingfactor. We focused on two possible confounding factors:the heterogeneity among linguistically different popula-tions and the presence of outlier populations. Previousstudies of African populations have revealed importantcorrelations between genetic and linguistic distances(Cavalli-Sforza, Menozzi, Piazza 1994; Poloni et al.1997; Lane et al. 2002; Sanchez-Mazas 2001). Thus, thereliability of estimates obtained from large African datasets by use of linguistically homogeneous populationsmust be tested. Poloni et al. (1997) compared Fst valuesfor Y-chomosome p49,f/Taq1 haplotypes and mtDNAlow-resolution Restriction Fragment Length Polymor-phisms (RFLPs) from populations that belonged to theNiger-Congo branch of the Niger-Kordofanian phylum(Greenberg 1963) and obtained sligthly higher Fst formtDNA than for Y chromosome. Because the Niger-Congo branch include populations heterogeneous bothlinguistically and historically, we preferred to use a morenarrow linguistic group and selected the FPPs who speaklanguages of the Benue-Congo subfamily of the Niger-Congo branch (see tables S1–S3 in SupplementaryMaterial online). These populations, also referred to asBantu-speaking populations, are thought to be in geneticcontinuity with the farmers who migrated from southwest-ern Cameroon to all of Central and South Africa some3,000 years ago (Vansina 1984). The MDS plot of geneticdistances calculated for the HVR-1 of mtDNA shows thatFPPs (22 populations) tend to cluster together, whereas theHGPs (six populations) are widely dispersed throughoutthe plot (fig. 1A). This difference between the two groupswas already predicted by their substantially different /stvalues (table 1). The Bantu-speaking populations are lesswidely dispersed than FPPs on the whole. In the MDS plotbased on Rst genetic distances (Slatkin 1995) for Ychromosome, no appreciable difference exists between thedispersion of FPPs (17 populations) and HGPs (fivepopulations) (fig. 1C), and Bantu-speaking FPPs againseem to be less heterogeneous than FPPs as a whole.Further important information is provided by the MDSplots concerning the Mbenzele pygmies and the Bakaka,who cluster separately from HGPs (in the mtDNA plot),and Bantu-speaking FPPs (in the Y-chromosome plot),respectively. The genetic distinctiveness of these twopopulations is confirmed by the inspection of thedistribution of genetic distances. In fact, the Mbenzeleare responsible for all four mtDNA genetic distance valuesthat are greater than 0.5 (figure 1B and genetic distancematrix in table S4 in Supplementary Material online).Considering the Y-chromosomal genetic distances be-tween FPPs, the only genetic distance greater than 0.4 andfour out of the six values greater than 0.3 are the results of

comparisons between the Bakaka and other populations(figure 1D and the genetic distance matrix in table S5 inSupplementary Material online). The /st values for theHVR-1 of mtDNA and the six microsatellites of the Ychromosome recalculated for the Bantu-speaking FPPs andafter the elimination of the Mbenzele and Bakaka from thedata set are reported in table 1. The most importantchanges concern the mitochondrial /st of FPPs, whichdecreases by 40% (from 0.063 to 0.038) in the subset ofBantu-speakers, and the Y-chromosomal /st of FPPs,which decreases by 30% (from 0.106 to 0.074) in thesubset of Bantu-speakers without the Bakaka. However,the newly estimated ratio of mtDNA to Y-chromosome Nmfor Bantu-speaking populations with (2.50) and without(1.78) Bakaka remains greater than 1 and much larger thanfor HGPs after the elimination of the Mbenzele (0.19).

Second we calculated /st values for only thepopulations analyzed for both mtDNA and Y chromosome(table 2). Apart from the greater reliability of estimatesobtained from paired mtDNA and Y-chromosomal pop-ulation data sets, this step has another less evident butequally important advantage. The island model assumes anequilibrium between migration and drift. This condition isprobably not met by some of the populations included inour data set, because past demographic events haveprobably left an important signature in their presentgenetic variation (e.g., the neolithic and Bantu expansionin most FPPs and bottleneck events in HGPs [Excoffierand Schneider 1999; Salas et al. 2002]). However,departures from equilibrium conditions can reasonablybe expected to affect estimates for mtDNA and Ychromosome in a comparable way if paired data sets areused. Consequently, substantial bias should occur in theresulting ratio of mtDNA to Y-chromosome Nm. The useof paired data sets reconfirmed the difference between thetwo population groups observed when the complete dataset is used. In fact, among FPPs, the Nm estimated formtDNA is 3.83 times that obtained for Y chromosome,whereas the ratio of mtDNA to Y-chromosome Nt is ofonly 0.10 among HGPs. To test the robustness of this

Table 1Estimates of �st and Nm of Sub-Saharan PopulationsObtained from the Hypervariable Region-1 ofMitochondrial DNA and Microsatellite Haplotypes ofthe Y Chromosome

�st

Nt

mtDNA Y ChromosomemtDNA/Y

Chromosome

All populations 0.172 (28) 0.100 (22) 0.54FPPs 0.063 (22) 0.106 (17) 1.76FPPs (only Bantu

speakers) 0.038 (9) 0.090 (9) 2.50FPPs (Bantu speakers,

Bakaka excluded) 0.043 (8) 0.074 (6) 1.78HGPs 0.428 (6) 0.074 (5) 0.11HGPs (Mbenzele

excluded) 0.302 (5) 0.076 (4) 0.19

NOTE.—The number of populations analyzed is in parentheses. The

abbreviations are explained in the text. The Nt values were calculated by

application of an island model of migration (Cavalli-Sforza and Bodmer 1971).

FPPs ¼ food-producer populations. HGPs ¼ hunter-gatherer populations.

Variation of Female and Male Lineages in Sub-Saharan Africa 1675

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

difference, we applied a jacknife procedure by excludingone population at a time from the mtDNA and Y-chromosome data sets. This method accounts for thevariance caused by interpopulation differentiation, al-though we cannot assume this procedure has the sameeffect in FPPs and HGPs, because the former are probablymore historically correlated than the latter as a result oftheir more recent expansion events. No overlap occurredbetween the Nm interval estimates obtained for the twogroups when a jacknife procedure is used, even taking intoconsideration the values obtained for Bantu-speakers onlyand after the elimination of the Bakaka Bantus andMbenzele pygmies (table 2). This finding indicates that the

discrepancy between the two population groups is notcaused by an unbalanced composition of the database, theeffect of single populations, or the small size of somepopulation samples (as in the case of the eastern pygmieswith only five individuals examined for Y-chromosomalmicrosatellites).

Third, we reestimated Nm from frequencies ofhaplogroups. Haplogroups of both mtDNA and Ychromosome are mostly the result of unique evolutionaryevents created by mutations that occur at a relatively lowrate (on the order of 1029 for Y-chromosome singlenucleotide polymorphisms [SNPs] [Thomson et al. 2000]).This rate contrasts with those of the HVR-1 and Y-chromosome microsatellites whose mutation rates aremuch higher (on the order of 1026 per site per generationfor the HVR-1 of mtDNA and 1023 for Y-chromosomemicrosatellites [Vigilant et al. 1989; Kayser et al. 2002])and part of their variation may be caused by recurrentmutation. Therefore, haplogroups may provide informa-tion different from HVR-1 and Y-chromosome micro-satellites. The values obtained by use of haplogroupfrequencies (table 3) confirm and even enlarge thepreviously observed differences between FPPs and HGPs.In fact, the ratio of mtDNA to Y-chromosome Nt largelyincreases in both FPPs and HGPs, but the value obtainedfor the former group (8.57) largely exceeds that for thelatter (0.26). Also, in this case, the interval estimates forFPPs and HGPs obtained by use of the jacknife proceduredo not overlap (see table 3).

Discussion

The results obtained in the course of this study pointsout to the existence of a clearcut difference between thegenetic structure of FPPs and HGPs. In fact, the ratios of

FIG. 1.—Multidimensional scaling plots and distributions of genetic distances among sub-Saharan populations for mitochondrial DNAhypervariable region-1 (A and B) and Y-chromosome microsatellite haplotypes built from loci DYS19, 389I, 390, 391, 392, and 393 (C and D). Geneticdistances between FPPs are indicated by black histograms, and those between HGPs are indicated by gray histograms. HGPs are indicated by arrows.Bantu-speaking populations are in italic. Population abbreviations are reported in Table S1 in Supplementary Material online. The stress index value isof 0.114 and 0.130 for the mtDNA and the Y-chromosome plots, respectively.

Table 2Estimates of �st and Nm of Sub-Saharan PopulationsObtained from the Subset of Populations Analyzed forboth the Hypervariable Region-1 of Mitochondrial DNAand the Microsatellite Haplotypes of the Y Chromosome

�st

Nm

mtDNA Y ChromosomemtDNA/Y

Chromosome

All populations (11) 0.211 0.106 0.44FPPs (7) 0.038 0.130 3.83 [3.41–4.85]FPPs (only Bantu

speakers) (5) 0.036 0.114 3.45 [2.17–11.89]FPPs (only Bantu

speakers, Bakakaexcluded) (4) 0.040 0.083 2.17 [1.32–7.84]

HGPs (4) 0.431 0.072 0.10 [0.09–0.15]HGPs (Mbenzele

excluded) (3) 0.278 0.071 0.20 [0.14–0.66]

NOTE.—The number of populations analyzed is in parentheses. The Nm values

were calculated by application of an island model of migration (Cavalli-Sforza and

Bodmer 1971). Interval estimates of Nm parameter (in brackets) were obtained by

use of the jacknife procedure. FPPs ¼ food-producer populations. HGPs ¼ hunter-

gatherer populations.

1676 Destro-Bisol et al.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

mtDNA to Y-chromosome Nm of FPPs and HGPsconstantly remain above and below 1, respectively, whenthe complete data set for fast-evolving polymorphisms isused (table 1). Furthermore, no overlap between the twogroups exists, even when the selected data sets (obtainedby using linguistically homogeneous populations andeliminating outlier populations) and the interval estimatesproduced by the jacknife procedure are considered (tables1 and 2). A further confirmation of the difference in Nmbetween FPPs and HGPs is provided by the use of slow-evolving polymorphisms (table 3). At the same time,a certain difference between the estimates obtained by useof fast-evolving (HVR-1 and microsatellites) and slow-evolving (mtDNA and Y-chromosome haplogroups)polymorphisms can be noted. In fact, the ratio of mtDNAto Y-chromosome Nm for FPPs ranges from 1.76 to 4.85for fast-evolving polymorphisms and from 6.24 to 14.79for slow-evolving polymorphisms. Also, in the case ofHGPs, the estimates based on slow-evolving polymor-phisms (0.12 to 0.35) are higher than those based on fast-evolving polymorphisms (0.09 to 0.19). Interestingly, thehaplogroup-based estimate for Bantu-speaking FPPs(6.24) is not far from the estimate obtained by Seielstadet al. (1998) for world populations by means of SNP data.Because the estimates for the two kinds of polymorphismswere obtained by use of different populations, differentialsampling could explain at least part of the discrepancy.This possibility is clearly indicated by the markedvariation that can also be observed among the estimatesobtained by use of either fast-evolving (see and comparethe values reported in tables 1 and 2) or slow-evolvingpolymorphisms (see table 3). The comparison between theresults of fast-evolving and slow-evolving polymorphismsis further complicated by the fact that mtDNA haplogroupswere defined on the basis of a subset of variablenucleotides of HVR-1, whereas Y-chromosomal hap-logroups were assigned on the basis of loci different fromthose used to build microsatellite haplotypes. Comparisonsbased on data from homogeneous data sets are required toshed more light on the important issue of the comparisonof Fst estimates based on slow and fast-evolving poly-morphisms.

Once the robustness of the initial results had beentested, our next logical step was to discuss the microevo-lutionary processes underlying the different patterns ofmitochondrial and Y-chromosomal variation of FPPs andHGPs. To this end, we built a model (fig. 2) that integratesdemographic and genetic aspects and incorporates ethno-graphic knowledge, especially knowledge of Africanpygmies.

The possibility has been suggested that HGPs couldhave maintained relatively large effective population sizesthrough high migration rates before the arrival of FPPs,namely Bantu-speaking farmers. In fact, until that timeHGPs had no competitors and could occupy a widerterritory (Cavalli-Sforza 1986b). The arrival of farmers,who penetrated the equatorial belt in the course of theexpansion of Bantu-speakers peoples, around 2,000 to3,000 years ago (Vansina 1980) caused an importantchange in the demography of HGPs. Because of theirhigher growth rate, the Bantu-speaking farmers progres-

sively occupied most of the habitat previously available toHGPs and pushed the HGPs into less favorable areas. Thefragmentation of HGPs inevitably reduced the gene flowbetween subpopulations and, consequently, their effectivesize decreased. This scenario of competition betweenpreexisting HGPs and expanding Neolithic populationshas been proposed by Excoffier and Schneider (1999) toexplain the lack of signs of Pleistocene expansion in bothAfrican and non-African HGPs. In the case of AfricanHGPs, the explanation receives support from the study byWeiss and Von Haeseler (1998, who detected traces ofa recent decrease of population size in Biaka pygmies fromthe Central African Republic by use of mtDNA variation.However, with this model, we do not want to imply thatthe marginalization of HGPs by Bantu-speakers was theonly cause underlying their peculiar genetic structure. Infact, the tendency of HGPs to conform to a stabledemographic model must be taken into account. In thiscircumstance, genetic drift may be more effective than inagriculturalists, who tend to be more fertile than non-agriculturalists (Sellen and Mace 1997; von Haeseler,Sajantila, and Paabo 1996).

Another important consequence of the meetingbetween HGPs and FPPs is their reciprocal gene flow.No substantial taboos or social barriers existed betweenHGPs and FPPs during the early stages of contact. The twogroups to probably exchanged genes symmetrically, anassumption supported by the similarity of anthropometriccharacters between the Twa Konda (pygmies) and OtoKonda (farmers) of Central Africa (Cavalli-Sforza 1986b).However, the present-day situation indicates that thisinitial condition changed considerably, and an asymmetricgene flow progressively developed between HGPs andFPPs because of the establishment of socioculturalinequalities (Cavalli-Sforza 1986b). In fact, pygmy womenare accepted as wives by Bantu communities, first, becausethey are famed for their great fertility, and, secondly,because the future husband must pay a relatively low‘‘bride price’’ to the wife’s family to gain the right to

Table 3Estimates of �st and Nm of Sub-Saharan PopulationsObtained from Haplogroups of the mtDNA andY Chromosome

�st

Nm

mtDNA Y ChromosomemtDNA/Y

Chromosome

All populations (11) 0.173 0.277 1.81FPPs (7) 0.070 0.392 8.57 [6.91–14.79]FPPs (only Bantu

speakers) (4) 0.021 0.118 6.24 [1.59–7.51]HGPs (4) 0.373 0.134 0.26 [0.12–0.35]

NOTE.—The number of populations analyzed is in parentheses. The Nm values

were calculated by application of an island model of migration (Cavalli-Sforza and

Bodmer 1971). Interval estimates of Nm parameter (in brackets) were obtained by

use of the jacknife procedure. For mtDNA we considered frequencies of

haplogroups L1a, L1b, L1c, L1d, L1e, L1f, L1k, L2a, L2b, L2c, L2d, L3b, L3d,

L3e, L3f, and L3g (see table S1 in Supplementary Material online for further

details). For Y chromosome we considered frequencies of haplogroups A, B2a,

B2b, B1/B, E3a, E/D, and C/F (YCC 2002; Knight et al. 2003). FPPs ¼ food-

producer populations. HGPs ¼ hunter-gatherer populations.

Variation of Female and Male Lineages in Sub-Saharan Africa 1677

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

marriage. These conditions make a pygmy-to-Bantu flowof maternal lineages possible. A Bantu-to-pygmy flow ofpaternal lineages is also expected through three mecha-nisms: first, extramarital unions between pygmy femalesand Bantu-speaking males, second, adoption of orphansborn from mixed unions, and, third, a return to the HGPsof Pygmy women and of their children after the divorcefrom Bantu-speaking FPP males. On the other hand, boththe pygmy-to-Bantu flow of paternal lineages and theBantu-to-pygmy flow of maternal lineages are inhibited bysociocultural taboos against unions between Bantu-speak-ing females and Pygmy males (Cavalli-Sforza 1986b). Theresulting asymmetric gene flow has left a signature in bothpygmies and Bantu speakers, but it must have had a deeperimpact on the genetic structure of the latter because of theirsmaller population size. This asymmetric gene flowbetween FPPs and HGPs probably affected the gene flowbetween HGP subpopulations as well. In fact, the femalesof pygmy groups in which mixed unions with Bantuspeakers occur more frequently are less available formarriages with males from other pygmy subgroups(Biasutti 1967). This circumstance is another factor thatcould have contributed to the high level of geneticdifferentiation observed for mtDNA among HGPs.

Different levels of poligyny and patrilocality in FPPsand HGPs are other factors probably involved in thedifferences observed between the two groups. By de-creasing the male effective size, polyginy decreasesdiversity of paternal lineages within populations andincreases that among populations. Polyginy is known tobe substantially higher among FPPs than among HGPs,both in terms of proportion of polygamists and in terms ofaverage number of wives per polygamist (Cavalli-Sforza1986a; Biesele and Royal 1999). Furthermore, whereas all

FPPs practice a rigid patrilocality, some HGPs do notexclusively follow such social behavior. Among the Akapygmies, a grouping that includes the Biaka and Mbenzelepygmies analyzed here, the young couple generally settlesin the husband’s camp after the birth of the first child.However, the husband may remain in the wife’scommunity, where he may be joined by one of hisbrothers or sisters (Bahuchet 1999). Among some !Kunggroups of Botswana, males often join the wife’s family(Biesele and Royal 1999). Interestingly, a high level ofpolygyny and extreme patrilocality have been proposed asprobable causes of the low Y-chromosome and highmtDNA diversity observed in West New Guinea popula-tions (Kayser et al. 2003).

Apart from being consistent with the results describedabove and providing an explanation for the discrepancybetween the conclusions of Seielstad, Minch, and Cavalli-Sforza (1998) and Hammer et al. (2001), this modelpredicts the existence of phylogeographic traces of a sex-biased gene flow between HGPs and FPPs. The existenceof a sex-biased gene flow is supported by the distributionof the most-common haplogroups in FPPs and HGPs.Haplogroup E3a is the modal Y-chromosome type in FPPneighbors of HGPs; frequencies range from 42% (Wairakfrom Tanzania [Luis et al. 2004]) to 96% (Bamileke fromCameroon [Cruciani et al. 2002]). E3a has been indicatedas a signature of Bantu expansion (Underhill et al. 2001).It is present in all HGPs, where it ranges from 5% amongthe Ju/’hoansi !Kung (recalculated from Underhill et al.[2001] by Knight et al. [2003]), who are known to haveintermarried with Bantu-speakers to a low degree, to 65%among the Biaka pygmies (Cruciani et al. 2002). Thehaplogroup E2b1x (E2b1a) is also probably associatedwith the Bantu expansion (Cruciani et al. 2002). It reaches

FIG. 2.—Model describing the role of asymmetric gene flow, polygyny, and patrilocality in determining the different pattern of mtDNA and Y-chromosome variation between FPPs and HGPs.

1678 Destro-Bisol et al.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

a frequency of 15% among southern African Bantu and6% in the !Kung (Underhill et al. 2001; Cruciani et al.2002). Furthermore, most of the other Y-chromosometypes observed among HGPs are absent among FPPs([A3b1, A2, B2a*, B2b2, B2b4b, B2b*x(B2b3*), B2b3a][Cruciani et al. 2002]), with two exceptions. The firstexception is represented by haplogroups B2a1 andB2b3*x(B2b3a) (both occur at a frequency of 5% amongBiaka pygmies [Cruciani et al. 2002]), which, togetherwith the other haplogroups that belong to the group II ofUnderhill et al. (2001), is thought to be the remnant ofearly diversification and dispersal processes within Africa(Underhill et al. 2001; Cruciani et al. 2002). The secondexception is provided by the haplogroup E3b*x(E3b1,E3b2, and E3b3), which is found only among !Kung fromsouthern Africa (with a frequency of 11%) and ingeographically distant populations such as Ethiopian Jewsand Mossi from Burkina Faso (Cruciani et al. 2002).Therefore, the instances of haplogroup sharing mentionedabove seem to be the result of the maintenance of ancestralcharacteristics diluted elsewhere by more recent demo-graphic events rather than reverse gene flow (from HGPsto FPPs). On the other hand, the analysis of mtDNAhaplogroup distribution show quite a different pattern.Western pygmy and !Kung have two different mtDNAmodal types, L1c1a1 (60% among the Mbenzele pygmiesand 30% among the Biaka pygmies [Destro-Bisol et al.2004]) and L1d (frequency of 51% to 96% among !Kung:recalculated by Destro-Bisol et al. [2004] from Chen et al.[2000]). This substantial heterogeneity between pygmiesand !Kung illustrates their long reciprocal isolation. Thesetwo mtDNA types have probably originated amongpygmies and !Kung (Destro-Bisol et al. 2004; Salas etal. 2002) and are also present in some FPP neighbors ofHGPs. The L1c1a1 is found among Ewondo (frequency of13% [Destro-Bisol et al. 2004]), whereas the L1d ispresent among Mozambicans (frequency of 5% [Salas etal. 2002]). Some signatures of reverse gene flow (fromFPPs to HGPs for maternal lineages) are also detectable inthe mtDNA haplogroups of probable Bantu origin presentboth in the HGPs and in their FPP neighbors (L1a2, L1a1a,L3d3, and L3e2b [Pereira et al. 2001; Salas et al. 2002]).This finding is indicated by the cumulative frequency ofthe haplogroups mentioned above, which ranges from 0%(Botswana !Kung [Vigilant et al. 1991]) to 19% (SouthAfrican !Kung [Chen et al. 2000]), with intermediatevalues of 4% among the Mbenzele (Destro-Bisol et al.2004) and 18% among the Biaka (Vigilant et al. 1991).This indication of introgression of FPP maternal lineagesinto HGPs is not in contrast with the asymmetric gene flowpredicted by our model. In fact, the cumulative frequencyof the Bantu mtDNA haplogroups is substantially lowerthan the frequency of the E3a haplogroups in the sameHGPs (from 39% in the !Kung to 65% in the Biaka[Cruciani et al. 2002]).

The evidence that the differential gene flow ofpaternal lineages has left a stronger signature than thedifferential gene flow of maternal lineages merits somefurther considerations. This difference may be seen on twodifferent levels. First, haplogroups bearing the M2mutation have been observed among all the HGPs

analyzed so far, whereas the L1c1a1 and L1d haplogroupshave only been found in some FPPs. Second, at present, noclear signs of reverse gene flow exists for Y chromosome(from HGPs to FPPs), whereas such signs do exist formtDNA (from FPPs to HGPs). This discrepancy can beexplained by the substantial difference in size betweenFPPs and HGPs. In the case of paternal lineages, the signsof the FPP-to-HGP gene flow are more evident andpersistent because of the smaller size of recipientpopulations. Furthermore, the smaller size of HGPs couldhave facilitated the retention of FPP maternal lineagesacquired during the initial period of symmetric gene flow.On the other hand, the larger size of the FPPs has probablydiluted the signs of the HGP-to-FPP gene flow of Ychromosomes that probably occurred in the initial phase ofcontact between the two population groups.

Another important implication of our model is in thedifferential pressure of microevolutionary forces onmaternal and paternal lineages of HGPs and FPPs. In fact,because of the combined effect of asymmetric gene flowand different levels of polyginy and patrilocality, themodel predicts that genetic drift had been more effectiveon maternal than on paternal lineages of HGPs, whereasgene flow is expected to be the prevailing microevolu-tionary force on their paternal lineages. The oppositesituation is expected for FPPs. Consequently, HGPsshould show a higher intrapopulational diversity forpaternal than for maternal lineages, whereas the oppositeshould be valid for FPPs. To test these expectations, wecompared HVR-1 mtDNA and Y-chromosome micro-satellite haplotype diversity in the same populations forboth genetic systems. From this comparison, we found thatHGPs show the lowest level of haplotype diversity formtDNA but nearly the highest for the Y chromosome (fig.3). Furthermore, the difference in the ratio of mtDNAHVR-1 to Y-chromosome microsatellite haplotype di-versity between HGPs and FPPs is statistically significantby the Mann-Whitney U test (P ¼ 0.011). Therefore, ourresults seem to reflect a substantial difference betweenFPPs and HGPs concerning the degree of patrilocality andpolyginy, which is so far suggested by only a few,nonsystematic anthropological studies (Cavalli-Sforza1986a; Bahuchet 1999; Biesele and Royal 1999).

Conclusion

In this study, we have analyzed variation ofmaternally and paternally transmitted polymorphisms byuse of loci with different evolutionary rates and a largedata set of sub-Saharan populations. A final picture ofgenetic variation in this area requires ad hoc studies on thepopulations of the northern fringes of sub-Saharan Africa(e.g., Sudan, Niger, Chad, Somalia, and Ethiopia).Nonetheless, the results obtained so far allow us to drawfour main conclusions. First, our study provides evidencefor a marked heterogeneity between FPPs and HGPs interms of distribution of unilinearly transmitted markers.This result demonstrates the danger of making excessivegeneralizations regarding the genetics of the populationsliving south of the Sahara desert (see also Tishkoff andWilliams [2002]). Second, the results obtained in the

Variation of Female and Male Lineages in Sub-Saharan Africa 1679

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

course of our study suggest that the inferences drawn byboth Seielstad, Minch, and Cavalli-Sforza (1998) andHammer et al. (2001) are both valid. In fact, the geneticstructure of present-day HGPs seems to maintain a strongsignature of the male-biased gene flow of Bantu originproposed by Hammer et al. (2001). Furthermore, thehigher female migration rate suggested by Seielstad,Minch, and Cavalli-Sforza (1998) for human populationson a global scale and for East and Central Africa issupported for FPPs. Third, our study introduces oneimportant novelty relative to the two studies mentionedabove. We claim that the values of Fst obtained formitochondrial and Y-chromosomal polymorphisms reflectnot only the mobility of males and females but also thepressure of genetic drift on maternal and paternal lineages.Fourth, our research reconfirms the importance of themultidisciplinary approach for studies on human geneticvariation. In our model, the asymmetric gene flow,polyginy, and patrilocality, and, hence, the socioculturalfactors underlying them, have an important role indetermining and differentiating the genetic structure ofsub-Saharan populations.

Supplementary Material

The following tables are available online: table S1(mtDNA hypervariable region-1 data used in this study),table S2 (Y-chromosome microsatellite data used in thisstudy), table S3 (Y-chromosome haplogroup data used inthis study), table S4 (genetic distance matrix obtainedusing mtDNA hypervariable region-1 data), and table S5(Genetic distance matrix obtained using Y-chromosomalmicrosatellite data).

Acknowledgments

This study develops the work initiated with ourunforgotten friend and colleague Michele Belledi, to whomthe paper is dedicated. We thank the Equipe du Dispensairede Sante Integree, Mission Catholique du Belemboke(Central African Republic) and the blood donors, whoseavailability made this study possible. This research wassupported by grants from the M.U.R.S.T. (Cofin Projects2003054059 ‘‘Dna e Biodemografia: approccio integratoallo studio della mobilita umana,’’ and 2002063871 ‘‘Lastruttura genetica del cromosoma Y in Italia’’) and theUniversity of Rome ‘‘La Sapienza’’ (Ateneo projectC26A034352 ‘‘Analisi della variabilita in geni sottopostia selezione in popolazioni dell’Africa sub-Sahariana’’). Wethank Alec Knight and Elizabeth Wood for their usefulcomments on an earlier version of this paper.

Literature Cited

Bahuchet, S. 1999. Aka pygmies. Pp. 190–194. in B. Richard andR. Daly, eds. The Cambridge encyclopedia of hunters andgatherers. Cambridge University Press, Cambridge UK.

Biasutti, R., 1967. Le razze e i popoli della Terra, Vol. 3. Utet,Torino, Italy.

Biesele, M., and K. Royal. 1999. Africa; Mbuti. Pp 210–214 inB. Richard and R. Daly, eds. The Cambridge encyclopedia ofhunters and gatherers. Cambridge University Press, Cam-bridge UK.

Caglia, A., S. Tofanelli, V. Coia, I. Boschi, M. Pescarmona, G.Spedini, V. Pascali, G. Paoli, and G. Destro-Bisol. 2003. Astudy of Y-chromosome microsatellite variation in sub-Saharan Africa: a comparison between Fst and Rst geneticdistances. Hum. Biol. 75:313–330.

Carvajal-Carmona, L. G., I. D. Soto, N. Pineda et al. (11 co-authors). 2000. A strong Amerind/white sex bias and a possibleSephardic contribution among the founders of a population innorthwest Colombia. Am. J. Hum. Genet. 67:1287–1295.

Cavalli-Sforza, L. L. 1986a. Demographic data. Pp. 23–44 in:L. L. Cavalli-Sforza, ed. African pygmies. Academic Press,Orlando, Fla.

———. 1986b. African pygmies: an evaluation of the state ofresearch. Pp. 361–426 in L. L. Cavalli-Sforza, ed. Africanpygmies. Academic Press, Orlando, Fla.

Cavalli-Sforza, L. L., and W. Bodmer. 1971. The genetics ofhuman populations. Freeman, San Francisco, Calif.

Cavalli-Sforza L., P. Menozzi, and A. Piazza. 1994. The historyand geography of human genes. Princeton University Press,Princeton, NJ.

Chen Y. S., A. Olckers, T. G. Schurr, A. M. Kogelnik, K.Huoponen, and D. C. Wallace. 2000. mtDNA variation inthe South African !Kung and Khwe and their genetic relation-ships to other African populations. Am. J. Hum. Genet. 66:1362–1383.

Coia, V., A. Caglia, B. Arredi et al. (11 co-authors). 2003. Binaryand microsatellite polymorphisms of the Y-chromosome inthe Mbenzele pygmies from the Central African Republic.Am. J. Hum. Biol. in press.

Corte-Real, F., M. Carvalho, L. Andrade, M. J. Anjos, C. Pestoni,M. V. Lareu, A. Carracedo, D. N. Vieita, and M. C. Vide.2000. Chromosome Y STRs analysis and evolutionary aspectsfor Portuguese spoken countries. Pp. 272–274 in G. F.Sensabaugh, P. J. Lincoln, and B. Olaisen, eds. Progress inforensic genetics, Vol. 8. Elsevier Sciences, Amsterdam.

Cruciani, F., P. Santolamazza, P. Shen et al. (16 co-authors)2002. An Asia to sub-Saharan Africa back migration is

FIG. 3.—Haplotype diversity of sub-Saharan populations formitochondrial DNA hypervariable region-1 (A) and Y-chromosomemicrosatellite haplotypes built from loci DYS19, 389I, 390, 391, 392, and393 (B). FPPs and HGPs are represented by black and gray histograms,respectively. For population abbreviations see tables S1 in SupplementaryMaterial online.

1680 Destro-Bisol et al.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

supported by high-resolution analysis of human Y chromo-somes. Am. J. Hum. Genet. 70:1197–1214.

Destro-Bisol, G., V. Coia, I. Boschi, F. Verginelli, A. Caglia, V.Pascali, G. Spedini, and F. Calafell. 2004. The analysis ofvariation of mtDNA hypervariable region-1 suggests thateastern and western pygmies diverged before the Bantuexpansion. Am. Nat. 163:212–226.

Excoffier, L., and S. Schneider. 1999. Why hunter-gathererpopulations do not show signs of Pleistocene demographicexpansions. Proc. Natl. Acad. Sci. USA 96:10597–10602.

Graven, L., G. Passarino, O. Semino, P. Boursot, S. Santachiara-Benerecetti, A. Langaney, and L. Excoffier. 1995. Evolution-ary correlation between control region sequence and re-striction polymorphisms in the mitochondrial genome ofa large Senegalese Mandenka sample. Mol. Biol. Evol.12:334–345.

Greenberg J. 1963. The languages of Africa. Mouton, The Hague.Hammer, M. F., T. A. Karafet, A. J. Redd, H. Jarjanazi, S.

Santachiara-Benerecetti, H. Soodyall, and S. L. Zegura. 2001.Hierarchical patterns of global human Y-chromosome di-versity. Mol. Biol. Evol. 18:1189–1203.

Kayser M., S. Brauer, G. Weiss, W. Schiefenhovel, P. Underhill,P. Shen, P. Oefner, M. Tommaseo-Ponzetta, and M.Stoneking. 2003. Reduced Y-chromosome, but not mitochon-drial DNA, diversity in human populations from West NewGuinea. Am. J. Hum. Genet. 72:281–302.

Kayser, M., M. Krawczak, L. Excoffier et al. (12 co-authors).2001. An extensive analysis of Y-chromosomal microsatellitehaplotypes in globally dispersed human populations. Am. J.Hum. Genet. 68:990–1018.

Kayser M., L. Roewer, M. Hedman et al. (14 co-authors). 2002.Characteristics and frequency of germline mutations atmicrosatellite loci from the human Y chromosome, as re-vealed by direct observation in father/son pairs. Am. J. Hum.Genet. 66:1580–1588.

Kimura, M. 1980. A simple method for estimating evolutionaryrates of base substitutions through comparative studies ofnucleotide sequences. J. Mol. Evol. 16:111–120.

Knight, A., P. A. Underhill, H. M. Mortensen, L. A.Zhivotovsky, A. A. Lin, B. M. Henn, D. Louis, M. Ruhlen,and J. L. Mountain. 2003. African Y dhromosome andmtDNA divergence provides insight into the history of Clicklanguages. Curr. Biol. 13:464–473.

Kruskal, J. B. 1964. Multidimensional scaling by optimizinga goodness of fit test to a nonmetric hypothesis. Psychome-trika 19:1–27.

Lane, A. B., H. Soodyall, S. Arndt, M. E. Ratshikhopha, E.Jonker, C. Freeman, L. Young, B. Morar, and L. Toffie. 2002.Genetic substructure in South African Bantu-speakers:evidence from autosomal DNA and Y-chromosome studies.Am. J. Phys. Anthropol. 119:175–185.

Luis, J. R., D. J. Rowold, M. Regueiro, B. Caeiro, C. Cinniog^lu,

C. Roseman, P. A. Underhill, L. L. Cavalli-Sforza, and R. J.Herrera. 2004. The Levant versus the Horn of Africa:evidence for bidirectional corridors of human migrations.Am. J. Hum. Genet. 74:532–544.

Mateu, E., D. Comas, F. Calafell, A. Perez-Lezaun, A. Abade, andJ. Bertranpetit. 1997. A tale of two islands: population historyand mitochondrial DNA sequence variation of Bioko and SaoTome, Gulf of Guinea. Ann. Hum. Genet. 61:507–518.

Melton, T., C. Ginther, G. Sensabaugh, H. Soodyall, and M.Stoneking. 1997. Extent of heterogeneity in mitochondrialDNA of sub-Saharan African populations. J. Forensic Sci.42:582–589.

Mesa, N. R., M. C. Mondragon, I. D. Soto, et al. (14 co-authors).2000. Autosomal, mtDNA, and Y-chromosome diversity in

Amerinds: pre- and post-Columbian patterns of gene flow inSouth America. Am. J. Hum. Genet. 67:1277–1286.

Oota, H., W. Settheetham-Ishida, D. Tiwawech, T. Ishida, and M.Stoneking. 2001. Human mtDNA and Y-chromosomevariation is correlated with matrilocal versus patrilocalresidence. Nat. Genet. 29:20–21.

Pereira, L., V. Macaulay, A. Torroni, R. Scozzari, M. J. Prata,and A. Amorim. 2001. Prehistoric and historic traces in themtDNA of Mozambique: insights into the Bantu expansionsand the slave trade. Ann. Hum. Genet. 65:439–458.

Poloni, E. S., O. Semino, G. Passarino, A. S. Santachiara-Benerecetti, I. Dupanloup, A. Langaney, and L. Excoffier.1997. Human genetic affinities for Y-chromosome P49a,f/TaqI haplotypes show strong correspondence with linguistics.Am. J. Hum. Genet. 61:1015–1035.

Rando, J. C., V. M. Cabrera, J. M. Larruga, M. Hernandez, A. M.Gonzalez, F. Pinto, and H. J. Bandelt. 1999. Phylogeographicpatterns of mtDNA reflecting the colonization of the CanaryIslands. Ann. Hum. Genet. 63:413–428.

Rando, J. C., F. Pinto, A. M. Gonzalez, M. Hernandez, J. M.Larruga, V. M. Cabrera, and H. J. Bandelt. 1998. Mitochon-drial DNA analysis of northwest African populations revealsgenetic exchanges with European, near-eastern, and sub-Saharan populations. Ann. Hum. Genet. 62:531–550.

Salas, A., M. Richards, T. De la Fe, M.-V. Lareu, B. Sobrino, P.Sanchez-Diz, V. Macaulay, and A. Carracedo. 2002. Themaking of the African mtDNA landscape. Am. J. HumanGenet. 71:1082–1111.

Sanchez-Mazas, A. 2001. African diversity from the HLA pointof view: influence of genetic drift, geography, linguistics, andnatural selection. Hum. Immunol. 62:937–948.

Schneider, S., J. M. Kueffer, D. Roessli, and L. Excoffier. 1997.Arlequin ver 1.1: a software for population genetic dataanalysis. Genetics and Biometry Laboratory, University ofGeneva, Switzerland.

Seielstad, M. T., E. Minch, and L. L. Cavalli-Sforza. 1998.Genetic evidence for a higher female migration rate inhumans. Nat. Genet. 20:278–280.

Sellen, D. W, and R. Mace. 1997. Fertility and model ofsubsistence: a phylogenetic analysis. Curr. Anthropol.38:878–889.

Slatkin, M. 1995. A measure of population subdivision based onmicrosatellite allele frequencies. Genetics 139:457–462.

Stoneking, M. 1998. Women on the move. Nat. Genet. 20:219–220.

Thomson, R., J. K. Pritchard, P. Shen, P. J. Oefner, and M. W.Feldman. 2000. Recent common ancestry of human Ychromosomes: evidence from DNA sequence data. Proc.Natl. Acad. Sci. USA 97:7360–7365.

Tishkoff, S. A., and S. M. Williams. 2002. Genetic analysis ofAfrican populations: human evolution and complex disease.Nat. Rev. Genet. 3:611–621.

Trovoada, M. J., C. Alves, L. Gusmao, A. Abade, A. Amorim,and M. J. Prata. 2001. Evidence for population sub-structuringin Sao Tome e Principe as inferred from Y-chromosome STRanalysis. Ann. Hum. Genet. 65:271–283.

Underhill, P. A., G. Passarino, A. A. Lin, P. Shen, M. MirazonLahr, R. A. Foley, P. J. Oefner, and L. L. Cavalli-Sforza.2001. The phylogeography of Y chromosome binaryhaplotypes and the origins of modern human populations.Ann. Hum. Genet. 65:43–62.

Underhill, P. A., P. Shen, A. A. Lin et al. (21 co-authors). 2000.Y chromosome sequence variation and the history of humanpopulations. Nat. Genet. 26:358–361.

Vansina, J. 1984. Western Bantu expansion. J. Afr. Hist. 25:129–145.

Variation of Female and Male Lineages in Sub-Saharan Africa 1681

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from

Vigilant, L., R. Pennington, H. Harpending, T. D. Kocher, andA. C. Wilson. 1989. Mitochondrial DNA sequences in singlehairs from a southern African population. Proc. Natl. Acad.Sci. USA 86:9350–9354.

Vigilant, L., M. Stoneking, H. Harpending, K. Hawkes, andA. C. Wilson. 1991. African populations and the evolution ofhuman mitochondrial DNA. Science 253:1503–1507.

von Haeseler, A., A. Sajantila, and A. Paabo. 1996. The geneticalarchaeology of the human genome. Nat. Genet. 14:135–140.

Watson, E., K. Bauer, R. Aman, G. Weiss, A. von Haeseler, andS. Paabo. 1996. mtDNA sequence diversity in Africa. Am. J.Hum. Genet. 59:437–444.

Weiss, G., and A. von Haeseler. 1998. Inference of populationhistory using a likelihood approach. Genetics 149:1539–1546.

Wjisman, E. M. 1984. Estimation of genetic admixture inPygmies. Pp. 349–358 in L. L. Cavalli-Sforza, ed. Africanpygmies. Academic Press, Orlando, Fla.

Y Chromosome Consortium (YCC). 2002. A nomenclaturesystem for the tree of human Y-chromosomal binaryhaplogroups. Genome Res. 12:339–348.

David Goldstein, Associate Editor

Accepted May 17, 2004

1682 Destro-Bisol et al.

by guest on May 19, 2014

http://mbe.oxfordjournals.org/

Dow

nloaded from