Single linkage group per chromosome genetic linkage map for the horse, based on two...

29
Single linkage group per chromosome genetic linkage map for the horse, based on two three-generation, full-sibling, crossbred horse reference families June E. Swinburne a,1 , Mike Boursnell a, * , Gemma Hill a , Louise Pettitt a , Twink Allen b,c , Bhanu Chowdhary d , Telhisa Hasegawa e , Masahiko Kurosawa f , Tosso Leeb g , Suguru Mashima f , James R. Mickelson h , Terje Raudsepp d , Teruaki Tozaki f , Matthew Binns a,2 a Animal Health Trust, Newmarket, Suffolk CB8 7UU, UK b T.B.A. Equine Fertility Unit, Newmarket, Suffolk CB8 9BH, UK c Department of Clinical Veterinary Medicine, University of Cambridge, Cambridge CB3 0ES, UK d Department of Veterinary Integrative BioScience, College of Veterinary Medicine, Texas A and M University, College Station, TX 77843, USA e Equine Research Institute, Japan Racing Association, Utsunomiya, Tochigi, Japan f Department of Molecular Genetics, Laboratory of Racing Chemistry, Utsunomiya, Tochigi, Japan g Institute of Animal Breeding and Genetics, School of Veterinary Medicine Hannover, Bunteweg 17p, 30559 Hannover, Germany h College of Veterinary Medicine, 1988 Fitch Avenue, University of Minnesota, St. Paul, MN 55108, USA Received 14 February 2005; accepted 3 September 2005 Available online 28 November 2005 Abstract A genetic linkage map of the horse consisting of 742 markers, which comprises a single linkage group for each of the autosomes and the X chromosome, is presented. The map has been generated from two three-generation full-sibling reference families, sired by the same stallion, in which there are 61 individuals in the F2 generation. Each linkage group has been assigned to a chromosome and oriented with reference to markers mapped by fluorescence in situ hybridization. The average interval between markers is 3.7 cM and the linkage groups collectively span 2772 cM. The 742 markers comprise 734 microsatellite and 8 gene-based markers. The utility of the microsatellite markers for comparative mapping has been significantly enhanced by comparing their flanking sequences with the human genome sequence; this enabled conserved segments between human and horse to be identified. The new map provides a valuable resource for genetically mapping traits of interest in the horse. D 2005 Elsevier Inc. All rights reserved. Keywords: Horse; Linkage map; Genetic map; Comparative map; Radiation hybrid map Comprehensive genetic linkage maps consisting of 1000+ microsatellites are now available for species such as cattle [1], pig [2], sheep [3], and chicken [4]. These typically cover 2286 (pig) to 3800 cM (chicken). Molecular genetic investigations of the horse have lagged behind these agricultural animals. One reason for this is that the horse is not primarily considered either a significant food production animal or a pet. The horse does, however, occupy a unique position in the history of civilized man, having played a vital role in many spheres of human endeavor from transportation to warfare and from communication to colonization. During the 20th century the status of the horse largely changed from a beast of burden to a competitor in leisure and sporting activities and, in this role, it has become a prized possession. Many of the most valuable individual animals in the world are Thoroughbred racehorses and, as such, the health and welfare of the Thoroughbred horse is of major economic concern. There are several common conditions affecting horses that have an inherited component, including rhabdomyolysis 0888-7543/$ - see front matter D 2005 Elsevier Inc. All rights reserved. doi:10.1016/j.ygeno.2005.09.001 * Corresponding author. E-mail addresses: [email protected] (J.E. Swinburne), [email protected] (M. Boursnell). 1 Fax: +44 1638 750794. 2 Present address: The Royal Veterinary College, Royal College Road, London NW1 0TU, UK. Genomics 87 (2006) 1 – 29 www.elsevier.com/locate/ygeno

Transcript of Single linkage group per chromosome genetic linkage map for the horse, based on two...

sevier.com/locate/ygeno

Genomics 87 (2

Single linkage group per chromosome genetic linkage map for the horse,

based on two three-generation, full-sibling,

crossbred horse reference families

June E. Swinburne a,1, Mike Boursnell a,*, Gemma Hill a, Louise Pettitt a, Twink Allen b,c,

Bhanu Chowdhary d, Telhisa Hasegawa e, Masahiko Kurosawa f, Tosso Leeb g, Suguru Mashima f,

James R. Mickelson h, Terje Raudsepp d, Teruaki Tozaki f, Matthew Binns a,2

a Animal Health Trust, Newmarket, Suffolk CB8 7UU, UKb T.B.A. Equine Fertility Unit, Newmarket, Suffolk CB8 9BH, UK

c Department of Clinical Veterinary Medicine, University of Cambridge, Cambridge CB3 0ES, UKd Department of Veterinary Integrative BioScience, College of Veterinary Medicine, Texas A and M University, College Station, TX 77843, USA

e Equine Research Institute, Japan Racing Association, Utsunomiya, Tochigi, Japanf Department of Molecular Genetics, Laboratory of Racing Chemistry, Utsunomiya, Tochigi, Japan

g Institute of Animal Breeding and Genetics, School of Veterinary Medicine Hannover, Bunteweg 17p, 30559 Hannover, Germanyh College of Veterinary Medicine, 1988 Fitch Avenue, University of Minnesota, St. Paul, MN 55108, USA

Received 14 February 2005; accepted 3 September 2005

Available online 28 November 2005

Abstract

A genetic linkage map of the horse consisting of 742 markers, which comprises a single linkage group for each of the autosomes and the X

chromosome, is presented. The map has been generated from two three-generation full-sibling reference families, sired by the same stallion, in

which there are 61 individuals in the F2 generation. Each linkage group has been assigned to a chromosome and oriented with reference to

markers mapped by fluorescence in situ hybridization. The average interval between markers is 3.7 cM and the linkage groups collectively span

2772 cM. The 742 markers comprise 734 microsatellite and 8 gene-based markers. The utility of the microsatellite markers for comparative

mapping has been significantly enhanced by comparing their flanking sequences with the human genome sequence; this enabled conserved

segments between human and horse to be identified. The new map provides a valuable resource for genetically mapping traits of interest in the

horse.

D 2005 Elsevier Inc. All rights reserved.

Keywords: Horse; Linkage map; Genetic map; Comparative map; Radiation hybrid map

Comprehensive genetic linkage maps consisting of 1000+

microsatellites are now available for species such as cattle [1],

pig [2], sheep [3], and chicken [4]. These typically cover 2286

(pig) to 3800 cM (chicken). Molecular genetic investigations of

the horse have lagged behind these agricultural animals. One

reason for this is that the horse is not primarily considered

0888-7543/$ - see front matter D 2005 Elsevier Inc. All rights reserved.

doi:10.1016/j.ygeno.2005.09.001

* Corresponding author.

E-mail addresses: [email protected] (J.E. Swinburne),

[email protected] (M. Boursnell).1 Fax: +44 1638 750794.2 Present address: The Royal Veterinary College, Royal College Road,

London NW1 0TU, UK.

either a significant food production animal or a pet. The horse

does, however, occupy a unique position in the history of

civilized man, having played a vital role in many spheres of

human endeavor from transportation to warfare and from

communication to colonization. During the 20th century the

status of the horse largely changed from a beast of burden to a

competitor in leisure and sporting activities and, in this role, it

has become a prized possession. Many of the most valuable

individual animals in the world are Thoroughbred racehorses

and, as such, the health and welfare of the Thoroughbred horse

is of major economic concern.

There are several common conditions affecting horses that

have an inherited component, including rhabdomyolysis

006) 1 – 29

www.el

J.E. Swinburne et al. / Genomics 87 (2006) 1–292

(‘‘tying-up’’), laryngeal hemiplegia (‘‘roaring’’), osteochondro-

sis dissecans, and recurrent airway obstruction. These diseases

provide an incentive to study the genetics of the horse

generally and the Thoroughbred in particular. The identifica-

tion of the inherited components involved in such conditions

could lead to improvements in health and welfare, through

selective breeding of individuals not carrying disease muta-

tions. Studbook records should provide a valuable reference

guide for the assembling of pedigrees for mapping purposes.

For example, the General Studbook provides an account of the

pedigrees of today_s Thoroughbred horses that stretches back

over more than 30 generations, to the very foundation of the

breed. In addition, every Thoroughbred horse registered in the

General Studbook over the past 30 years has been parentage

tested, thereby providing a verified pedigree with which to

work. These two factors should facilitate genetic studies of

complex genetic diseases in the Thoroughbred breed, when

accompanied by appropriate DNA collections from accurately

phenotyped horses.

Scientific interest in the genetics of the horse has increased

within the past decade. An International Equine Gene Mapping

Workshop was established in Lexington in October 1995 and

since that time substantial advances toward an integrated

genetic map suitable for the mapping of mutations and traits of

interest have been made. Specifically, the development of a cell

hybrid panel for synteny analysis [5], genetic linkage maps

[6–9], physical mapping resources [10–15], an equine BAC

library (http://bacpac.chori.org/equine241.htm), and radiation

hybrid (RH) maps [16–19] has allowed the localization of

>1000 Type I and Type II markers. In addition, ZOO

fluorescence in situ hybridization (FISH) comparison of horse

and human chromosomes has provided information on chromo-

some segments that are evolutionarily conserved between these

two species [11,20,21]. More recently, RH mapping has been

used to produce comparative maps of horse with human

Fig. 1. Pedigree structure of the horse reference family used in this study. Two pairs o

used together with a single stallion (Thinur) to generate two families of full-sibling ho

All twin embryos retrieved were found to be dizygotic in origin. The breeds used in

Queenslander, Polish Warmblood; 5, Reynir fra Holum and Thorf, Icelandic Horse

[15,17–19,22] and mouse [19,22] genomes, thereby facilitat-

ing the exploitation of the genome sequence data available for

these species for the benefit of the horse. These various resources

were utilized during the present study to develop targeted

strategies to close gaps in the last three linkage groups, resulting

in single linkage groups for each autosome and the X

chromosome.

The efficient generation of the map was dependent upon the

use of a unique mapping reference family (Fig. 1). A detailed

description of the construction of the reference pedigree, whose

effectiveness centered upon the utilization of two pairs of

identical twin mares to produce large numbers of full siblings

in the form of preimplantation embryos, can be found in the

first version of the map [9]. The production of such a pedigree

would not have been possible using traditional breeding

methods. The distinctive structure of the pedigree has meant

that a relatively small number of individuals are required to

detect linkage, compared to the large half-sibling family groups

traditionally used to map the genomes of large uniparous

mammals. From the 742 markers now mapped, panels of

markers that are polymorphic in particular breeds can be

selected for whole genome scanning.

Historically, genetic linkage maps have been composed

largely of Type II microsatellite markers. The difficulties in

generating and scoring polymorphic markers within genes,

compared to the ease with which microsatellite markers can be

isolated, has hindered the placement of genes (Type I) onto

genetic linkage maps. In contrast, both Type I and Type II

markers can be easily placed onto RH maps, where there is no

requirement for polymorphism. The lack of genes on linkage

maps, coupled with the previous lack of identifiable relation-

ships between microsatellite markers of different species, has

made it difficult to align directly the genetic linkage maps of

different species. In this paper, sequences flanking almost half

of the equine microsatellites have been found to show high

f monozygotic twin mares (Cordelia and Regan, Ophelia and Desdemona) were

rse embryos. Females are depicted by circles and males are depicted by squares.

this pedigree are 1, Arab; 2, Concha, Thoroughbred; 3, Othello, Welsh Cob; 4,

.

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 3

levels of identity to unique locations on the human genome.

When the locations of these markers were compared with

existing data describing horse–human conserved segments

[19,23], the map positions were found to be consistent in the

majority of cases. We have illustrated the linkage map with the

most current RH map [19] and comparative data to draw

together these three sets of information. Using this alignment,

useful comparisons between the horse linkage map, horse RH

map, and the human genome can be made.

This study uses a unique mapping reference pedigree to add

markers efficiently to the current equine linkage map. This

paper greatly extends previous genetic linkage maps of the

horse by adding almost 400 Type II markers, to bring the total

number of markers mapped to 742, and providing a total map

length of 2772 cM.

Results

Genotyping

New markers (n = 601) were tested on the stallion and the

twin mares to determine which were informative in the

reference family; 554 (92.2%) of them produced comprehen-

sible data. Ninety-five markers (17.1% of those scored) were

not polymorphic in the pedigree, consistent with previous

proportions of informative markers found in this crossbred

family. Nine markers were identified as duplicates of others

and were discarded. The remaining markers (n = 450) were

genotyped across the family and all the markers for which the

data were unambiguous were scored. However, a number of

markers were discarded, because the PCR amplification was

weak, the products were difficult to score, or the inheritance

was apparently non-Mendelian. Very rarely there were cases of

sporadic mutation (<10 from ¨100,000 datapoints), consistent

with an average mutation frequency for dinucleotide micro-

satellite repeats in the horse of approximately 1 � 10�4, and

there were no markers for which the identical twins differed in

genotype. Several markers segregated a null allele in this

pedigree; these were scored whenever the interpretation was

considered unambiguous. Reliable results were added to those

described by Swinburne and colleagues [9], bringing the

number of markers genotyped to 801.

Two-point analysis

Details of all the markers genotyped during this study are

listed in Supplementary Table 1, and they are described fully at

http://www.thearkdb.org/browser?species=horse and http://

locus.jouy.inra.fr/cgi-bin/horsemap/Horsemap/main.pl.

Of the 801 markers in the combined database, 794 (99.1%)

showed linkage with a LOD >3 to another marker. Linkage

groups were formed by the grouping of markers displaying

linkage greater than LOD 3 to others in the group. All bar 3 of

the 794 (i.e., 0.4%) were unambiguously assigned to a specific

linkage group. These 3 markers showed linkage to two

different groups and could not be definitively assigned to

either (Supplementary Table 1). Seven markers that showed no

linkage to another marker at LOD >3 were classed as

‘‘unassigned’’ (Supplementary Table 1). Thirty-two linkage

groups were generated, corresponding to each of the 31

autosomes and the X chromosome.

A number of spurious linkages of LOD 3–5.5 were

generated, which contradicted the overwhelming majority of

data. This is to be expected in a dataset of this size and, in

general, did not confuse the assignment of markers to linkage

groups, as these markers had much higher lod scores to other

markers. In one instance, however, the assignment of markers

to two chromosomes was not completely evident—this

involved several markers assigned to ECA13 and several

assigned to ECA22. After careful examination of the linkage

data, markers were assigned to specific chromosomes as

illustrated (Fig. 2), and this assignment agrees with the RH

map [19] (illustrated) with the exception of AHT030 and

AHT095. However, a more recent publication of a high-density

map of ECA22 [22] has repositioned these two misplaced

markers from ECA22 to ECA13, such that linkage, FISH, and

RH data now agree.

Assignment of linkage groups to chromosomes

All the linkage groups were assigned to chromosomes by

means of 159 FISH-mapped markers (not illustrated in Fig. 2).

Some of these markers were included in the previous mapping

paper [9] and, in addition, recent data were obtained from other

publications and personal communications (see Supplementary

Table 1). There were eight examples noted in the literature in

which the FISH positions contradicted the results of the linkage

analysis; these are highlighted with the word ‘‘inconsistency’’

in Supplementary Table 1. On closer examination it was clear

that some of these were mistakes made at the time of

chromosome identification, while the remainder were probably

due to chimerism in the FISH-mapped clone. Some of these

results have been double-checked and corrected in previous

publications (listed in Supplementary Table 1) and they are

described by Penedo and colleagues [24].

The linkage groups have been oriented with reference to

these FISH-mapped markers and are drawn in the same

orientation as other maps published recently [15,19,24]. For

several chromosomes there are few FISH-mapped microsa-

tellite markers, but the publication of numerous FISH-mapped

genes has allowed the unambiguous orientation of all linkage

groups, except for the small chromosomes ECA26, 27, 29,

and 30.

Targeted generation of microsatellites to close gaps in the map

At the stage in map building when there was a total of 36

linkage groups assigned to the 32 recombining chromosomes, a

targeted approach to closing the gaps was employed. At that

time ECA8 and ECA9 were each assigned two linkage groups

and ECA10 was assigned three. The strategies used the equine

BAC library (http://bacpac.chori.org/equine241.htm) to isolate

large insert clones from selected regions from which new

microsatellite markers could be isolated.

J.E. Swinburne et al. / Genomics 87 (2006) 1–294

BAC clones were isolated using overgo probes derived from

the following sequences: (1) microsatellite markers that were

placed in the regions on other maps, but were uninformative in

our family; (2) microsatellite markers at the ends of the linkage

groups on both sides of the gaps that were informative on only

one-half of the family, with a view to developing new markers

Fig. 2. Sex-averaged genetic linkage maps of the horse autosomes and female-specif

The positions of markers along the chromosome are shown in centimorgans (cM) to

ordered with a threshold of LOD >1 are shown to the right in vertical text. The adjac

indicated by a short horizontal line. The RH map [19] is shown to the far right. Color

the horse linkage map and the human genome sequence. The identity of the human

the human chromosome is shown in megabases. The maps were oriented by referen

from these BACs that were informative on both sides of the

family; (3) genes that were in gap regions based on the RH

map; and (4) equine BAC end sequences that were identified

by comparison with the human genome sequence as being

positioned in the gap regions. In addition, new equine

microsatellite sequences submitted to the sequence databases

ic map of the X chromosome are shown as gray bars in the center of each map.

the left. Framework markers are shown in bold italics. Markers that cannot be

ent vertical line describes their probable location with their most likely position

ed bars to the far left indicate proposed segments of conserved synteny between

chromosome is shown adjacent to the colored bars. The position of match with

ce to FISH-mapped markers (refer to Supplementary Table 1) and genes [19].

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 5

were screened by comparing them with the human genome

sequence to identify those markers likely to be situated on

ECA8, 9, and 10. Each of these strategies contributed new

microsatellite markers that led to the closure of all four gap

regions.

Construction of the map

Sex-averaged maps were constructed for each of the 31

autosomes and a female-specific map was made for the X

chromosome. Forty-nine markers were discarded during the

process of map construction due to them not being placed

unequivocally in one region of a linkage group. The maps

constructed from the remaining 742 markers are depicted in

Fig. 2. The linkage groups vary in length from 24 cM for

ECA26 to 194 cM for ECA1. The whole map measures

2772 cM.

The framework map, the order of which is significant at

LOD 3 (i.e., it is 1000 times more likely than any other

order), consists of 169 markers (22.8% of those assigned to

linkage groups) and is illustrated in bold italics in Fig. 2. The

comprehensive map, which is built around the framework

map and has an order significant at LOD 1, (i.e., it is 10

times more likely than any other order), consists of 507

markers (68.3% of those assigned to linkage groups). The

remaining markers, numbering 235 (31.7% of those assigned

to linkage groups), have been assigned their most probable

location.

There are 174 (23.4%) markers that share a map position

with one or more other markers. These markers were examined

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–296

in detail and eight pairs were found to be identical (UMNe448 =

HP027, UM002 = TKY297, LEX015 = LEX038, UMNe259 =

AHT076, AHT063 = AHT081, TKY328 = AHT102, AHT018 =

AHT100, TKY288 = LEX005). These are in addition to markers

that were already known to be identical and for which only one

of each pair was used in developing the present map (COR041 =

COR065, LEX024 = UM001, LEX062 = LEX066, COR055 =

COR060, COR83 = COR102, UM038 = SGCV031).

Comparative mapping

Of the 742 microsatellites included in this map, the

sequences of 610 (82.2%) had been submitted to GenBank

and were therefore available for comparative mapping using

the method described (see Materials and methods). The

BLAST comparison of their flanking sequences with the

human genome yielded 312 (51.1%) markers with significant

and unique matches and these are listed in Supplementary

Table 2.

When the conserved segments generated with these data

were compared with the most recent comparative maps (see

Materials and methods) it was found that 288 (92.3%) of the

markers described here fell into the same human segments as

those described in the previous publications. The remaining 24

markers (7.7%) were all single markers that did not agree with

adjacent markers. It may be that these are small regions that

have not previously been identified, or they could be spurious

results suggesting an incorrect match. Because these 24

markers did not have additional data to support their human

match, they have been included only as an Appendix to

Supplementary Table 2 and their matches have been omitted

from Fig. 2.

We did not observe several small segments identified by

Milenkovic and colleagues [15] and Chowdhary and collea-

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 7

gues [19], (i.e., HSA22 on ECA1, HSA2 and HSA8 on ECA2,

HSA3 on ECA3, HSA22 on ECA5, HSA19 on ECA7, HSA12

on ECA9, HSA9 on ECA19, HSA22 on ECA28), although all

but two of these segments were defined by only one marker.

These may be small conserved segments that were not detected

by the markers in this study. Similarly we did not observe four

segments of homology identified by ZOO-FISH [21], namely

HSA8 on ECA2, HSA19 on ECA7 and ECA21, and HSA22 on

ECA28.

Discussion

This genetic linkage map has been constructed using a

unique reference pedigree of two related full-sibling families,

the structure and composition of which have been refined in a

number of ways: (1) the founding horses represent a number of

disparate breeds in an attempt to maximize the polymorphism

present in the pedigree (Fig. 1); in particular the Icelandic

Horse, which has been geographically separated from other

European breeds for 900 years, should demonstrate significant

genetic divergence; (2) the use of full siblings required that

many fewer animals needed to be genotyped (n = 71) to detect

linkage, compared to previously published maps for the horse

that involved 271 and 458 half-sibling individuals [6–8]; (3)

the three-generation structure allowed phase to be determined;

(4) the genotyping of the dams in the pedigree allowed the

production of an X-chromosome map.

The resulting map improves upon the previous iteration [9]

by adding an additional 383 markers to bring the total number

mapped to 742. The markers comprise 717 that are autosomal

in origin and 25 that are X-linked (Fig. 2). Only 7 markers

(0.9%) did not display linkage with another marker at LOD >3.

These may lie within the existing linkage groups in regions

where the map is based on information from one side of the

reference family only, such that the recombination distance to

the next adjacent marker sharing information is too great to

produce a significant lod score. This situation was seen during

the cloning of markers in the last few gaps between linkage

groups on ECA8, 9, and 10, where two linkage groups

overlapped but did not join due to the lack of shared

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–298

information at the ends of the groups. In fact, 6 of these 7

markers are informative on one side of the family only and, in

addition, have an average number of informative meioses of

only 33.4 (range 18–61), compared to an average of 71.8

(range 16–144) among the whole dataset.

The generation of a single linkage group for each

chromosome is a great improvement on previously published

linkage maps and suggests that map coverage is extensive. The

average distance between markers on the map is 3.7 cM and the

total length of the linkage groups is 2772 cM. This is a

significantly shorter map length than the recent compilation

map described by Penedo and colleagues [24] of 3826 cM.

Possible reasons for the difference are discussed by Penedo and

colleagues [24] and principally include genotyping errors in the

data. Our map length of 2772 cM is, however, very close to the

total map length of 2720 cM estimated from chiasma counts in

horses [25].

As the number of typed markers increases, marker order can

be determined less accurately. This is due to fewer recombina-

tions occurring in the reference family between adjacent

markers. In some instances there will be no recombinations

between close markers, and this will result in these markers

being assigned the same map position. Indeed the stage will be

reached at which the number of recombinations present in this

pedigree is a limiting factor and the reward for adding

additional markers does not justify the effort. It may be that

this stage is approaching, though there remains value in rapidly

and accurately mapping markers to defined regions on the map,

to be used for fine mapping in other populations.

Marker placement was compared with the most recent

published RH map of the horse genome [19], and two

contradictions were observed. The first is COR006, which is

assigned to ECA20 here, but ECA1 on the RH map. The

linkage data placing COR006 on ECA20 is not unquestionable

(COR006�LEX071 Q = 0.12, LOD = 3.25; COR006 –UM031

Q = 0.12, LOD = 3.25), and the BLAST match of this marker to

HSA14 does suggest that it should be assigned to the p arm of

ECA1, which is where it is located on the RH map. The second

is TKY275, which is assigned to ECA8 here, but ECA26 on the

RH map. The linkage assignment of TKY275 to ECA8 is

strong, however, suggesting mislabeling is to blame. Compar-

ison with a recent compilation of linkage data from several

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 9

sources [24] shows only four discrepancies. These include our

assignment of TKY281 to ECA1 rather than ECA28, of UM031

to ECA20 rather than ECA26, of SGCV003 to ECA22 rather

than ECA13, and of TKY320 to ECA23 rather than ECA28.

Further examination has shown our linkage data to be solid for

the first two of these markers, though weaker for the second

two. It may well be that SGCV003 and TKY320 are incorrectly

assigned on this map. The discrepancies with TKY281 and

UM031 may lie with reagent or data mislabeling, but these

minor discrepancies are likely to be resolved with further

refinements of the maps.

In Fig. 2 the most recently published RH map [19] has been

cross-referenced with the linkage map. On the whole the

marker orders agree, and although there are localized regions

where marker order disagrees, this may merely be due to

anomalies in the maps or markers not being placed in an order

exceeding LOD 1. There are other occasions, however, in

which one RH group is flipped with respect to the linkage map;

obvious examples are the RH groups delineated by UMNe054

and COL1A2 on ECA4 and by AHT063 and COR039 on

ECA16. Occasionally, the linkage map suggests that the order

of RH groups along a chromosome should be changed, e.g.,

ECA15 and 16.

The use of BLAST comparisons to identify comparative

positions between Type II markers and the human genome

sequence has added greatly to the comparative map available

for the horse and complements the comparative map illustrated

by Chowdhary and colleagues [19], in which Type I markers

were used to provide points of homology between the two

species. The position has now been reached at which accurate

predictions can often be made as to the location of candidate

genes in the horse genome. Markers that are likely to be in the

correct region can be chosen for linkage mapping from the

2000+ Type II markers now available for the horse genome. In

addition, the comparative map will have great value in

suggesting candidate genes from genetic mapping projects

for which a critical interval has been narrowed to a manageable

size. The comparative map will therefore play a valuable role in

both candidate gene and positional cloning approaches to

disease gene identification, until a genome sequence is

available for the horse.

The conserved segments identified in the present study have

been drawn as contiguous segments (Fig. 2) wherever the data

suggest that this may be the case. It is premature to begin

estimating how many evolutionary break-points have occurred

between the horse and the human, as the maps are not yet

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2910

detailed enough for this. However, 59 segments have been

drawn using this approach, with the reservation that 9 segments

are represented by only one marker.

Ideally those groups now actively involved in mapping in

the horse need to strive to integrate the data that are currently

available from linkage, RH, cytogenetic, and comparative

sources. The recent publications of an expansive RH map,

which integrates linkage, RH, and cytogenetic data [19], and an

integration of several linkage maps [24] have begun this

process. The amalgamation of mapping efforts across the

community is necessary to bring together the varied strands of

information to one centralized location for easy viewing and

interrogation. A possible vehicle for this may be the recently

developed viewing tool for Web-based access to published

maps (http://www.vgl.ucdavis.edu/equine/caballus/), which

provides a means for cross-referencing mapping data.

The maps generated for the horse genome are now reaching

a stage at which they can be utilized for the mapping of genes

involved in inherited diseases and traits. Panels of markers are

being developed for use in whole genome scanning projects

and, in addition, the emerging comparative map between horse

and human in particular provides access to suggestive

candidate genes in positional cloning experiments.

Materials and methods

Production of reference family DNA

The generation of the reference family material and the isolation of DNA

from it are described by Swinburne and colleagues [9]. The pedigree comprises

71 individuals consisting of 10 adult horses and 61 F2 offspring. The structure

of the family is shown in Fig. 1, and the name, breed, and sex of each

individual are listed.

Genotyping of microsatellites

Microsatellite markers were developed for the horse genome in a number of

laboratories worldwide and the primer pairs or primer sequences were sent to

the Animal Health Trust in Newmarket, England, for genotyping. A list of the

markers used in this study can be found in Supplementary Table 1 and they are

described in detail at http://www.thearkdb.org/browser?species=horse and

http://locus.jouy.inra.fr/cgi-bin/horsemap/Horsemap/main.pl.

Initially, markers were tested on the reference family stallion and mares

to confirm that they were informative in the pedigree. They were then

pooled into groups of either two or three pairs of primers prior to PCR or

were combined post-PCR, to maximize efficiency and reduce costs.

Fluorescence labeling of the PCR amplicon was achieved, by labeling

one primer either with 6-FAM, TET, HEX, TAMRA, JOE, or NED (for use

with the ABI3700 automated sequencer) or with 6-FAM, VIC, NED, or

PET (for use with the ABI3100 automated sequencer). Recently, the three-

primer methodology described by Schuelke [26] has been employed as an

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 11

extremely cost-effective and efficient means of fluorescently labeling PCR

fragments.

PCRs were performed in 0.2-ml thin-walled PCR plates (Alpha

Laboratories Ltd., Hampshire, UK), using 12-Al reaction volumes and the

MgCl2 concentrations suggested in the original marker report. All PCRs

were performed using 1.5 units AmpliTaq Gold (Applied Biosystems, Foster

City, CA, USA), 1� GeneAmp PCR buffer II (Applied Biosystems), and

200 AM each dNTP. For the majority of markers 10 pmol of each primer

was added to the reaction, except when the methodology of Schuelke [26]

was employed, in which case 5 pmol of reverse, 2 pmol of tailed forward,

and 10 pmol of the labeled third primer were added. An MJ Tetrad PCR

cycler (Bio-Rad Laboratories, Hercules, CA, USA) and a general PCR

program with variable annealing temperature (Ta) were used for all PCR

amplifications: 95-C for 10 min, followed by 30 cycles of 95-C for 30 s,

Ta for 30 s, and 72-C for 1 min and then 72-C for 10 min. However, when

the methodology of Schuelke [26] was employed the PCR program that

they specified was used (94-C for 4 min; followed by 30 cycles of 94-Cfor 1 min, 55-C for 1 min, and 72-C for 1 min; followed by 8 cycles of

94-C for 1 min, 50-C for 1 min, and 72-C for 1 min and then 72-C for

10 min).

PCRs were stored at �20-C before being analyzed on an ABI3700

(Applied Biosystems) or ABI3100 (Applied Biosystems) according to the

manufacturer_s instructions. Filter sets were chosen according to the dyes used

in the reaction. Dilution of the sample prior to loading on the ABI3700 was

determined empirically when required. Occasionally, the injection time was

altered when the ABI3100 was used to optimize the signal strength.

Genotyping data were collected and analyzed initially with GeneScan

(Applied Biosystems) and then with either Genotyper (Version 3.6 NT;

Applied Biosystems) or, more recently, GeneMapper (Version 3.0; Applied

Biosystems).

Biallelic marker typing

An SNP was identified in the equine IL4 gene and this was mapped onto

the reference family by sequencing PCR-amplified products. The PCR program

involved 94-C for 2 min; then 30 cycles of 94-C for 1 min, 55-C for 1 min, and

72-C for 4 min; and then 72-C for 10 min, which generated a product of about

2.5 kb. The G/A polymorphism was identified at a position about 300 bp into

the sequence generated using the reverse primer for sequencing

(TCTCTGGGGCG/AGTGTGTCA).

A randomly generated SNP named S22 was characterized by sequencing a

PCR product. The PCR program involved 95-C for 10 min; then 30 cycles of

95-C for 30 s, 55-C for 1 min, and 72-C for 1 min; and then 72-C for 5 min,

which generated a product of about 300 bp. The SNP produces a restriction site

for NlaIII (CATG) that can be assayed by restriction digest (CATCTTTAA-

TAACA/TTGTGGACCAATTGT).

Selection of BAC clones, subcloning, and the characterization of

microsatellites

Overgo probes [27] were designed from sequences flanking microsatellite

repeats and from equine BAC end sequences placed in gap regions by

comparative mapping. The two probes were combined in a stock solution at 10

pmol/Al and heated at 90-C for 2 min and then placed at room temperature for

10 min before being stored on ice. The labeling reaction was performed in a

10-Al volume with 1 Al oligonucleotide mix, 0.5 Al BSA (2 mg/ml), 4 Al ofoligo ligation buffer [27], 5 ACi [a-32P]dATP (110 TBq/mmol) (Amersham), 5

ACi [a-32P]dCTP (110 TBq/mmol) (Amersham), 2 units of Klenow fragment

DNA polymerase (Roche), and deionized water. The reaction was allowed to

proceed for 30 min at 37-C. After the addition of 2 Al cold dATP/dCTP mix

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2912

(100 AM) the reaction was incubated for a further 30 min at 37-C. The probe

was purified by passing it through a G50 Sephadex column (Amersham) and

eluting it in 400 Al 1� SSC.

Equine BAC library filters (BACPAC Resources, Children_s Hospital,

Oakland, CA, USA; http://bacpac.chori.org/equine241.htm) were prehybri-

dized for at least 1 h at 60-C in 40 ml prewarmed Church buffer [28]. Overgo

probes were denatured for 10 min at 95-C and cooled on ice for 1 min before

being added to the prehybridization solution. Hybridization continued overnight

at 60-C with constant rotation. Filters were rinsed with 50 ml 2� SSC, 0.1% (w/

v) SDS; washed twice in 50 ml 2� SSC, 0.1% (w/v) SDS for 15 min at 60-C;

wrapped in Saran wrap; and exposed to X-ray film for 24 h at room temperature.

After processing, positive clone IDs were ascertained from the filters using the

grid position on the filter. Multiple clones were selected for each probe.

BAC DNAwas prepared using a PSIC Clone, Big BAC DNA kit (Princeton

Separations, Adelphia, NJ, USA) following the manufacturer_s instructions.

BAC DNA was digested with AluI and size fractionated by electrophoresis to

select fragments of 600–1000 bp, and the isolated fragments were ligated into

pUC18 that had been digested with SmaI. Following transformation into

Escherichia coli TG1 [29], recombinant clones were gridded in duplicate and

grown overnight at 37-C. One set of clones was then transferred to Hybond

N+filters (Amersham Biosciences, Buckinghamshire, UK), denatured, neutral-

ized, and fixed by UV irradiation. The filters were prehybridized and hybridized

with radioactively labeled (dA–dC)n I (dG–dT)n polynucleotide probe (Pfizer,

New York, NY, USA) and washed as described above for overgo probing of the

BAC library. Plasmid DNA was isolated from positive clones using the REAL

Prep 96 Plasmid Kit (Qiagen, Valencia, CA, USA) and the insert DNA was

sequenced in both directions using universal M13 forward and reverse primers.

Microsatellites were identified in the sequence traces and oligonucleotide

primers designed to amplify the microsatellites using PRIMER (Version 0.5;

Whitehead Institute for Biomedical Research).

Data handling and two-point linkage analysis

Pedigree information and genotypes were entered into CYRILLIC 2.1.3

(Family Genetix Ltd., Wallingford, Oxford, UK) and exported to CRI-MAP

(Phil Green, Version 2.4 (3/26/1990)) for linkage analysis. Initially, two-point

analysis, with a cut-off threshold of LOD 3, was performed to identify

linkage groups. At this point, any cases of non-Mendelian inheritance were

highlighted. Markers isolated from large, FISH-mapped clones such as

cosmids or BACs were used to identify to which chromosome each linkage

group belonged. These locations were all obtained from published sources

and details can be found in Supplementary Table 1.

Construction of the map

MULTIMAP [30] was used to generate a linkage map for each chromosome

by identifying the most likely marker order and estimating the distance between

each marker. Initially framework maps of at least four markers were generated

that identified markers the order of which was significant at LOD 3. For some

chromosomes (ECA12, 26, 27, and X) the stringency was relaxed to LOD 2 to

generate a framework, and for others, particularly the smaller ones, fewer than

four markers were accepted as the framework (ECA7, 12, 25, 26, 29, and 31).

Comprehensive maps were then generated, based on the frameworks in which

the order of the markers was significant at LOD >1. This process resulted in

chromosome maps with marker orders 10-fold more likely than any other. The

most likely positions of markers that could not be inserted into the map in an

order significant at LOD 1 were identified by MULTIMAP, and these have been

depicted separately in Fig. 2 (see figure legend for details). Sex-averaged map

distances were generated for all autosomes and female-specific map distances

for the X chromosome.

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 13

Comparative mapping

GenBank was searched for all entries with ‘‘microsatellite’’ and ‘‘horse’’ and

1708 sequences were identified. These were compared with the human genome

sequence by using BLAST in a batch format. The top five matches for each were

listed and examined in turn and the results were split into two groups—those for

which a significant match to a unique position on the human genome was found

and those for which such a match was not found. The criteria used were not rigid;

high BLAST matches were not included if there were also high matches to other

genomic locations, and moderately high BLAST matches were considered

significant if there were no other moderately high matches, i.e., it was the

difference between the top scoring location and the second scoring location that

was important, rather than the absolute score. No matches less than e�2 were

considered. Markers that were linkage mapped and had significant unique

matches were picked out and ordered as they appeared along the horse

chromosome. Conserved segments in which adjacent markers on the horse

map corresponded with contiguous stretches of the human genome were

identified. These data are given in Supplementary Table 2 and illustrated in

Fig. 2.

The conserved segments generated with these data were compared with the

most recent comparative maps [15,19,21].

Acknowledgments

We thank the Horserace Betting Levy Board and the

Childwick Trust for generous financial support. We also

acknowledge the Dorothy Russell Havemeyer Foundation for

their continuing support of collaboration in this field by

financing International Workshops. We are grateful to the

laboratories that contributed to this effort by providing primers

for microsatellite markers, including INRA Centre de

Recherches de Jouy, France; the College of Veterinary

Medicine, University of Minnesota; the Equine Research

Institute, Japan Racing Association, Japan; Uppsala University,

Sweden; and the Norwegian College of Veterinary Medicine,

Oslo, Norway.

Appendix A. Supplementary data

Supplementary data associated with this article can be found

in the online version at doi:10.1016/j.ygeno.2005.09.001.

References

[1] N. Ihara, et al., A comprehensive genetic map of the cattle genome based

on 3802 microsatellites, Genome Res. 14 (2004) 1987–1998.

[2] G.A. Rohrer, et al., A comprehensive map of the porcine genome,

Genome Res. 6 (1996) 371–391.

[3] J.F. Maddox, et al., An enhanced linkage map of the sheep genome

comprising more than 1000 loci, Genome Res. 11 (2001) 1275–1289.

[4] M.A. Groenen, et al., A consensus linkage map of the chicken genome,

Genome Res. 10 (2000) 137–147.

[5] Y.-L. Shiue, et al., A synteny map of the horse genome comprised of 240

microsatellite and RAPD markers, Anim. Genet. 30 (1999) 1–9.

[6] G. Lindgren, et al., A primary male autosomal linkage map of the horse

genome, Genome Res. 8 (1998) 951–966.

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2914

[7] G. Guerin, et al., Report of the International Equine Gene Mapping

Workshop: male linkage map, Anim. Genet. 30 (1999) 341–354.

[8] G. Guerin, et al., The second generation of the International Equine Gene

Mapping Workshop half-sibling linkage map, Anim. Genet. 34 (2003)

161–168.

[9] J. Swinburne, et al., First comprehensive low-density horse linkage map

based on two, three-generation, full-sibling, cross-bred horse reference

families, Genomics 66 (2000) 123–134.

[10] M. Breen, et al., Genetical and physical assignments of equine

microsatellites—first integration of anchored markers in horse genome

mapping, Mamm. Genome 8 (1997) 267–273.

[11] T. Raudsepp, et al., Comparison of horse chromosome 3 with donkey and

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 15

human chromosomes by cross-species painting and heterologous FISH

mapping, Mamm. Genome 10 (1999) 277–282.

[12] S. Godard, et al., Cytogenetic localization of 44 new coding sequences in

the horse, Mamm. Genome 11 (2000) 1093–1097.

[13] T.L. Lear, et al., Mapping of 31 horse genes in BACs by FISH,

Chromosome Res. 9 (2001) 261–262.

[14] D. Mariat, et al., Isolation, characterization and FISH assignments of horse

BAC clones containing type I and II markers, Cytogenet. Genome Res. 92

(2001) 144–148.

[15] D. Milenkovic, Cytogenetic localization of 136 genes in the horse: com-

parative mapping with the human genome, Mamm. Genome 13 (2002)

524–534.

[16] S.L. Kiguwa, et al., A horse whole-genome-radiation hybrid panel:

chromosome 1 and 10 preliminary maps, Mamm. Genome 11 (2000)

803–805.

[17] T. Raudsepp, et al., Conservation of gene order between horse and human

X chromosomes as evidenced through radiation hybrid mapping,

Genomics 79 (2002) 451–457.

[18] B.P. Chowdhary, et al., Construction of a 5000 (rad) whole-genome

radiation hybrid panel in the horse and generation of a comprehensive and

comparative map of ECA11, Mamm. Genome 13 (2002) 89–94.

[19] B.P. Chowdhary, et al., The first generation whole-genome radiation

hybrid map in the horse identifies conserved segments in human and

mouse genomes, Genome Res. 13 (2003) 742–751.

[20] T. Raudsepp, L. Fronicke, H. Scherthan, I. Gustavsson, B.P. Chowdhary,

Zoo-FISH delineates conserved chromosomal segments in horse and man,

Chromosome Res. 4 (1996) 218–225.

[21] F. Yang, et al., Refined genome-wide comparative map of the domestic

horse, donkey and human based on cross-species chromosome painting:

insight into the occasional fertility of mules, Chromosome Res. 12 (2004)

65–76.

[22] A. Gustafson-Seabury, et al., High-resolution RH map of horse chromo-

some 22 reveals a putative ancestral vertebrate chromosome, Genomics 85

(2005) 188–200.

[23] C.R. Farber, J.F. Medrano, Identification of putative homology between

horse microsatellite flanking sequences and cross-species ESTs, mRNAs

and genomic sequences, Anim. Genet. 35 (2004) 28–33.

[24] M.C.T. Penedo, et al., International Equine Gene Mapping Workshop

Report: a consensus linkage map constructed with data from new markers

and by merging four mapping resources, Cytogenet. Genome Res. in

press.

[25] I.S. Scott, S.E. Long, An examination of chromosomes in the stallion

(Equus caballus) during meiosis, Cytogenet. Cell Genet. 26 (1980) 7–13.

[26] M. Schuelke, An economic method for the fluorescent labelling of PCR

fragments, Nat. Biotechnol. 18 (2000) 233–234.

[27] C.S. Han, et al., Construction of a BAC contig map of chromosome 16q by

two-dimensional overgo hybridization, Genome Res. 10 (2000) 714–721.

[28] G.M. Church, W. Gilbert, Genomic sequencing, Proc. Natl. Acad. of Sci.

USA 81 (1984) 1991–1995.

[29] D. Hanahan, Studies on transformation of Escherichia coli with plasmids,

J. Mol. Biol. 166 (1983) 557–580.

[30] T.C. Matise, M. Perlin, A. Chakravarti, Automated construction of genetic

linkage maps using an expert system (MultiMap): a human genome

linkage map, Nat. Genet. 6 (1994) 384–390.

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2916

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 17

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2918

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 19

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2920

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 21

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2922

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 23

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2924

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 25

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2926

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 27

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–2928

Fig. 2 (continued).

J.E. Swinburne et al. / Genomics 87 (2006) 1–29 29