Structure and expression of dog apolipoprotein C-II and C-III mRNAs. Implications for the evolution...

12
Structure and expression of dog apolipoprotein A-I, E, and C-I mRNAs: implications for the evolution and functional constraints of apolipoprotein structure Chi-Cheng Luo, * Wen-Hsiung Li, * and Lawrence Chant The Center for Demographic and Population Genetics,* University of Texas, Houston, TX 77030, and Depart- ments of Cell Biology and Medicine,t Baylor College of Medicine, Houston, TX 77030 Abstract Dog apolipoprotein (apo) C-I, A-I, and E cDNA clones were identified in a dog iiver cDNA library in XgtlO by hybridization to synthetic oligonucleotide probes with the corre- sponding human DNA sequences. The longest clone for each apolipoprotein was completely sequenced. The apoC-I cDNA sequence predicts a protein of 62 residue mature peptide preceded by a 26 amino acid signal peptide. The a p d - I cDNA sequence predicts a 242 residue mature peptide, a 6 residue pro-segment, and an 18 residue signal peptide. The apoE cDNA, which lacks the signal peptide region, predicts a mature peptide of 291 amino acid residues. Slot blot hybridization of total RNA iso- lated from various dog tissues to dog apoC-I, A-I, and E cDNA probes indicates that apoC-I mRNA is detectable in liver only, apoA-I mRNA is present in liver and small intestine, though the concentration in the latter tissue is only - 15% of that in the liv- er, and apoE mRNA is present in multiple tissues including liv- er, jejunum, urinary bladder, ileum, colon, brain, kidney, spleen, pancreas, and testis with relative concentrations (%) of 100, 17.5, 7.5, 6.9,5.9, 5.5, 5.0, 3.3, 1.0, and 1.0, respectively. These tissue distributions indicate that nascent lipoprotein par- ticles produced in the dog small intestine would contain apoA-I and apoE but not apoC-I. The widespread tissue distribution of apoE mRNA indicates that like other mammals, peripheral syn- thesis of apoE contributes significantly to the total apoE pool in dog. We next compared the cDNA sequences among different vertebrate species for apoC-I (human and dog), A-I (human, rat, dog, rabbit and chicken), and E (human, rat, dog and rab- bit) and calculated the rate of nucleotide substitution for each gene. M Our results indicate that apoC-I has evolved rather rapidly and that on the whole, apoA-I is more conservative than apoE, contradictory to an earlier suggestion. ApoA-I is also more conservative than a region (residues 4204-4536) at the car- boxyl-terminal portion, but less conservative than a region (resi- dues 595-979) at the amino-terminal portion of apoB-100. Some regions in each of the apolipoproteins studied are better con- served than others and the rate of evolution of individual regions seems to be related to the stringency of functional requirements. Finally, we estimate that the human apoC-I pseudogene arose more than 35 million years ago, becoming nonfunctional soon after its formation. -Luq C-C., W-H. Li, and L. Cham Struc- ture and expression of dog apolipoprotein A-I, E, and C-I mRNAs: implications for the evolution and functional con- straints of apolipoprotein structure. J Lipid &. 1989. 30: 1735-1746. Supplementary key words I pseudogene cDNA clones tissue distribution apoC- The plasma lipoproteins are macromolecular complexes of lipids (triacylglycerols, cholesterol, and phospholipids) and protein. They are the vehicles for the transport of the hydrophobic lipid moieties from one tissue to another for metabolism. The protein components of plasma lipopro- teins are known as apolipoproteins. All apolipoproteins share the ability to spontaneously bind lipid. In addition, many of them have acquired highly specialized functions (for review, see refs. 1, 2). In this communication, we examined the structure and expression of the mRNA for three apolipoproteins in the dog, an animal used extensively as a model for cardiovascu- lar and lipoprotein research (e.g., 3-5). The three canine apolipoproteins we examined here are apolipoprotein (apo) A-I, apoE, and a@-I. A@-I is the major protein in high density lipoproteins, whose concentration is inversely related to the propensity for development of atherosclerosis (6-9). It is the major activator for the enzyme 1ecithin:cholesterol acyltransferase (LCAT) (10-12). ApoE is a constituent of chylomicrons, chylomicron remnants, very low density lipo- proteins, and special dasses of high density lipoproteins with apoE (HDL,, HDL). It is an interesting protein in that it confers many unique functions to the lipoprotein par- ticle, e.g., high affinity binding to the LDL receptor and a specific apoE receptor (13-15). The protein is synthesized in numerous tissues (16-20) and may be involved in such di- verse functions as reverse cholesterol transport (16), neuro- nal regeneration (20) and immunomodulation (21) (for review, see ref. 22). ApoC-I is the smallest of the apolipopro- teins. It also has the ability to activate LCAT in vitro (23). Abbreviations: apq apolipoprotein(s); LCAT, 1ecithin:cholesterol acyl- transferase; HDL, high density lipoproteins. Journal of Lipid Resea~~h Volume 30, 1989 1735 by guest, on January 6, 2016 www.jlr.org Downloaded from

Transcript of Structure and expression of dog apolipoprotein C-II and C-III mRNAs. Implications for the evolution...

Structure and expression of dog apolipoprotein A-I, E, and C-I mRNAs: implications for the evolution and functional constraints of apolipoprotein structure

Chi-Cheng Luo, * Wen-Hsiung Li, * and Lawrence Chant

The Center for Demographic and Population Genetics,* University of Texas, Houston, TX 77030, and Depart- ments of Cell Biology and Medicine,t Baylor College of Medicine, Houston, TX 77030

Abstract Dog apolipoprotein (apo) C-I, A-I, and E cDNA clones were identified in a dog iiver cDNA library in XgtlO by hybridization to synthetic oligonucleotide probes with the corre- sponding human DNA sequences. The longest clone for each apolipoprotein was completely sequenced. The apoC-I cDNA sequence predicts a protein of 62 residue mature peptide preceded by a 26 amino acid signal peptide. The a p d - I cDNA sequence predicts a 242 residue mature peptide, a 6 residue pro-segment, and an 18 residue signal peptide. The apoE cDNA, which lacks the signal peptide region, predicts a mature peptide of 291 amino acid residues. Slot blot hybridization of total RNA iso- lated from various dog tissues to dog apoC-I, A-I, and E cDNA probes indicates that apoC-I mRNA is detectable in liver only, apoA-I mRNA is present in liver and small intestine, though the concentration in the latter tissue is only - 15% of that in the liv- er, and apoE mRNA is present in multiple tissues including liv- er, jejunum, urinary bladder, ileum, colon, brain, kidney, spleen, pancreas, and testis with relative concentrations (%) of 100, 17.5, 7.5, 6.9, 5.9, 5.5, 5.0, 3.3, 1.0, and 1.0, respectively. These tissue distributions indicate that nascent lipoprotein par- ticles produced in the dog small intestine would contain apoA-I and apoE but not apoC-I. The widespread tissue distribution of apoE mRNA indicates that like other mammals, peripheral syn- thesis of apoE contributes significantly to the total apoE pool in dog. We next compared the cDNA sequences among different vertebrate species for apoC-I (human and dog), A-I (human, rat, dog, rabbit and chicken), and E (human, rat, dog and rab- bit) and calculated the rate of nucleotide substitution for each gene. M Our results indicate that apoC-I has evolved rather rapidly and that on the whole, apoA-I is more conservative than apoE, contradictory to an earlier suggestion. ApoA-I is also more conservative than a region (residues 4204-4536) at the car- boxyl-terminal portion, but less conservative than a region (resi- dues 595-979) at the amino-terminal portion of apoB-100. Some regions in each of the apolipoproteins studied are better con- served than others and the rate of evolution of individual regions seems to be related to the stringency of functional requirements. Finally, we estimate that the human apoC-I pseudogene arose more than 35 million years ago, becoming nonfunctional soon after its formation. -Luq C-C., W-H. Li, and L. Cham Struc- ture and expression of dog apolipoprotein A-I, E, and C-I mRNAs: implications for the evolution and functional con- straints of apolipoprotein structure. J Lipid &. 1989. 30: 1735-1746.

Supplementary key words I pseudogene

cDNA clones tissue distribution apoC-

The plasma lipoproteins are macromolecular complexes of lipids (triacylglycerols, cholesterol, and phospholipids) and protein. They are the vehicles for the transport of the hydrophobic lipid moieties from one tissue to another for metabolism. The protein components of plasma lipopro- teins are known as apolipoproteins. All apolipoproteins share the ability to spontaneously bind lipid. In addition, many of them have acquired highly specialized functions (for review, see refs. 1, 2).

In this communication, we examined the structure and expression of the mRNA for three apolipoproteins in the dog, an animal used extensively as a model for cardiovascu- lar and lipoprotein research (e.g., 3-5). The three canine apolipoproteins we examined here are apolipoprotein (apo) A-I, apoE, and a@-I. A@-I is the major protein in high density lipoproteins, whose concentration is inversely related to the propensity for development of atherosclerosis (6-9). It is the major activator for the enzyme 1ecithin:cholesterol acyltransferase (LCAT) (10-12). ApoE is a constituent of chylomicrons, chylomicron remnants, very low density lipo- proteins, and special dasses of high density lipoproteins with apoE (HDL,, HDL). It is an interesting protein in that it confers many unique functions to the lipoprotein par- ticle, e.g., high affinity binding to the LDL receptor and a specific apoE receptor (13-15). The protein is synthesized in numerous tissues (16-20) and may be involved in such di- verse functions as reverse cholesterol transport (16), neuro- nal regeneration (20) and immunomodulation (21) (for review, see ref. 22). ApoC-I is the smallest of the apolipopro- teins. It also has the ability to activate LCAT in vitro (23).

Abbreviations: apq apolipoprotein(s); LCAT, 1ecithin:cholesterol acyl- transferase; HDL, high density lipoproteins.

Journal of Lipid Resea~~h Volume 30, 1989 1735

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

While the structures of apoA-I and apoE have been deter- mined in multiple species (human, rat, rabbit, and chicken for apoA-I, and human, rat, and rabbit for apoE) (see ref. 2 for references therein), the structure of apoC-I is known for humans only. The availability of the apolipoprotein mRNA sequences for apoA-I, C-I, and E from another species has allowed us to closely examine the evolution of these interesting proteins, and to infer the structure-func- tion relationship in each of them. It also enables us to esti- mate the time of appearance of the human apoC-I pseudo- gene.

Although a number of laboratories have studied lipopro- tein metabolism in the dog (3-5), the contribution of various tissues to the total apolipoprotein production in this animal is unknown. In the present communication, we pre- sent the distribution of apoA-I, E, and C-I mRNA among different dog tissues. The information is important to our understanding of lipoprotein metabolism in the canine model.

MATERIALS AND METHODS

Restriction enzymes were from BRL (Bethesda Research Laboratories), IBI (International Biotechnologies, Inc.), and BM (Boehringer Mannheim). DNA polymerase I and the KIenow fragment of polymerase I were from BM. Avian myeblastosis virus DNA polymerase (reverse transcriptase), T4 DNA polymerase, and T4 DNA ligase were from BRL. Proteinase K was from Merck. Deoxyribonucleotides and dideoxyribonucleotides were from Amersham. The ”P- and 35S-iabeled mononucleotide triphosphates were from ICN or Amersham.

Total and polyA RNA isolation from dog tissues Total RNA was prepared from various dog tissues, i”e-

diately after killing by the guanidinium thiocyanate method (24). Total RNA was purified from the intial crude RNA ex- tract by cesium chloride step gradient (24). Quality of the RNA was checked by agarose gel electrophoresis. The total RNA was passed over an oligo-dT cellulose column twice to obtain polyA mRNA (25).

Construction of cDNA library, identification and sequence analysis of cDNA clones

A dog hepatic cDNA library was constructed in the phage vector XgtlO by the method of Gubler and Hoffman (26) using oligodeoxythymidylate primers and EcoRI linker ligation for insertion of the cDNA into the EcoRI site of XgtlO. The cDNA library was screened by plaque hybri- dization by using standard procedures (27). Oligonucleotide probea (two 21-mers, with sequences TTI=TGGcQGcAA-

GAZAACCC and GAGAAGGCCAAACCCGCG, corresponding to nucleotide positions 61-81 and 685-705, for human apoA-I; two 21-mers, with sequences CGCTJTIG-

CAG, corresponding to nucleotide positions 148-168 and 859-879 for human apoE; and one 37-mer with the sequence Z m m A A A A C - C C - G corresponding to nucleotide positions 221-185 in the antisense strand for human apoc-I) were synthesized on an Applied Biosystems Model 380A DNA synthesizer. These were used to identify the corresponding dog cDNA clones by cross-hy- bridization. The temperature of hybridization was 50OC. For apoA-I and apoE, the primary screening was each per- formed with two 21-base oligonucleotides. During secondary screening, duplicate filters were screened with individual oli- gonucleotides. Plaques were purified by secondary and terti- ary screening. cDNA inserts were recovered from the clones by digestion with EcoRI. They were subcloned in both pGEM-Blue and M13 vectors.

The nucleotide sequences of the cloned dog apoA-I, apoC-I, and apoE cDNAs were determined by the dideoxy nucleotide chain termination method (28). The cDNA in- serts were subcloned into the EcoRI sites of the M13 phage vectors mp18 or mp19 before sequencing. Sequencing was carried out on both strands using the M13 universal primer. Synthetic oligonucleotide primers were used for sequencing internal regions of each clone.

Northern blot and slot blot analysis of a+-I, apoC-I and apoE “As

For Northern blot analysis, 20 pg of total RNA was dena- tured by heating at 7OoC in 50% formaldehyde, then sub- jected to electrophoresis for 3 h at 70 volts on a 1% agarose gel in 6% formaldehyde, 50 mM HEPES, pH 7.8, 1 mM EDTA. After electrophoresis, the gels were rinsed twice in water for 15 min each, then washed in 2 x SSC. The RNA was transferred to a Gene Screen membrane (NEN, Du Pont Company) in 10 x SSC overnight. For slot blot analy- sis, varying amounts (1-20 pg) of total RNA from various canine organs were directly blotted onto nitrocellulose paper using a slot blot apparatus (Schleicher and Schuell). The double-stranded cloned canine a@-I, E, and C-I cDNA inserts (purified from their respective pGEM-blue vectors) were labeled with [32P]dNTP by nick-translation or random oligonucleotide priming. Prehybridization, hybridization to 32P-labeled nick-translated probes, and washing were as de- scribed (19). The Northern blots and slot blots were exposed to Kodak X-ray film, XAR-5, for 18-30 h. Autoradiograms were scanned with a MacBeth TD932 densitometer. Rela- tive concentrations of the respective mRNAs were calcu- lated from the linear regression coefficients (slopes) deduced from the signals obtained with different RNA concentra- tions applied to the blot.

GGAl7ACCTGCGC and GTGGAAGACA’iGCAGCGC-

1736 Journal of Lipid Research Volume 30, 1989

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

Statistical analysis of nucleotide substitution rates In estimating the number of nucleotide substitutions be-

tween two genes, we have used the method of Li, Wu, and Luo (29). In this method, nucleotide sites and substitutions are classified as synonymous (causing no amino acid change) and nonsynonymous. For example, the first two positions of the codon UUU are nonsynonymous, while the third position is counted as one-third synonymous and two-thirds nonsynonymous. The method gives the num- ber (&) of (synonymous) substitutions per synonymous site and the number (KA) of (nonsynonymous) substitu- tions per nonsynonymous site.

RESULTS

cDNA cloning and deduced amino acid sequence of dog apoA-I, apoC-I, and apoE

Using oligonucleotide probes from the corresponding hu- man cDNA sequences for cross hybridization, we identified 23, 14, and 12 clones of dog apoA-I, apoC-I, and apoE cDNAs, respectively, in a dog liver library in XgtlO. The longest cDNA clones were completely sequenced. A partial restriction map of the canine cDNA clones and the sequenc- ing strategy are shown in Fig. 1.

ApaA-I. An 883 base pair a+-I clone (XAI-11) was iso- lated from the dog liver library and its nucleotide sequence was determined (Fig. 2). Analysis of this sequence revealed an open reading frame of 798 nucleotides, flanked by 5'- and 3'-untranslated regions of 12 and 73 nucleotides, respec- tively. Fourteen bases upstream of the polyA tail is a putative polyadenylation signal, AACAAA, a variant of the canoni- cal signal sequence, AATAAA.

The first 24 residues of the derived amino acid sequence contain the signal peptide (18 residues) and prosegment (6 residues). They show a high degree of homology to the cor- responding human, chicken, rat, and rabbit sequences. Like both the human and rat apoA-I, the prosegment in dog also contains a Gln-Gln dipeptide, unusual amino acids for pro- tein precursors that are processed proteolytically (30, 31). It is interesting that the rabbit prosequence ends with Gln- Arg, and the chicken with Gln-His. This indicates that the requirement for Gln-Gln next to cleavage site is not abso- lute. The mature peptide of dog a@-I contains 242 resi- dues. It is one residue shorter than the corresponding human sequence. Codon 3 that encodes proline is dupli- cated in human a@-I. Only a single proline residue is pre- sent in this position in all other known vertebrate a@-Is (see below). The piedicted amino acid sequence of canine a d - I differs from a previously reported sequence deter- mined on the purified protein in that amino acid 211 is Glu instead of Gln in the latter sequence (32).

ApoC-2. The dog a@-I clone (XCI-7) is 427 bp in length plus a polyA tail. The DNA sequence includes 23 bp in the 5'-untranslated region, 264 bp in the coding region, a termi- nation codon ( E A ) , and a 3'-untranslated region of 137 bp (Fig. 3). A polyadenylation signal sequence AATAAA pre- cedes the polyA tail by 12 bases.

The DNA-deduced amino acid sequence contains 62 resi- dues of dog a@-I mature peptide region, preceded by a 26 amino acid signal peptide. Dog apoC-I is thus longer than the corresponding human protein by 5 amino acid resi- dues. The additional residues in the dog sequence occur at positions 9-12 (4 residues) and at position 61 (see below).

ApoE. On an initial screening, the longest canine apoE cDNA clone that was identified in the library was 353 bp in length. It spans the 3' region of the molecule including all sequences 5' upstream to the codon that encodes amino acid Ala-174. On subsequent screening, we identified several additional apoE cDNA clones, the longest of which, desig- nated XE-12, encodes the entire mature peptide region but still misses the signal peptide region. The nucleotide and de- duced amino acid sequence of this clone is shown in Fig. 4. hE-12 contains 1,019 nucleotides. It includes an open read- ing frame of 873 bp and a 3'-untranslated region of 146 bp plus the polyA tail. It predicts a mature polypeptide of 291 amino acid residues, 8 residues shorter than the human se- quence. Residues 121-150 of the sequence show a com- plete match to a partial sequence of purified canine a p E reported by Weisgraber et al. (33).

ApoC-I, A-I and E mRNA expression in various dog tissues

The dog has been used as an experimental animal for lipoprotein metabolism. However, little is known concerning the site of synthesis of some of these proteins. The isolation of cDNAs of dog apoC-I, A-I, and E mRNA allowed us to quantlfy the individual mRNAs from different tissues. Analysis of total RNAs extracted from the liver, small intes- tine, pancreas, brain, lung, spleen, kidney, urinary bladder, and t'estis blotted on nitrocellulose paper and hybridized to the respective 32P-labeled cDNAs revealed that apoC-I mRNA is detected only in the liver (data not shown). Therefore, the apoC-I that is found in mammalian chylomi- crons (34) must be acquired by these particles after they are secreted by the small intestine in the dog. In contrast, apoA- I mRNA is detected in both the liver and the small intestine, even though the concentration in the small intestine is only 15% of that in the liver (data not shown). As expected from its distribution in other mammals, apoE mRNA is present in a wide variety of tissues. By applying different amounts of RNA from these tissues and determining the linear re- gression coefficients (slopes) of each set of slot-blots, we cal- culated that the relative concentrations of apoE mRNA in

Luq Li, and Chan Dog apd-I, E, and C-I sequences 1737

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

(b) XCI-7

h 5 ' 3' -

( c ) ,I E-12

0 Q5 1kb

Fig. 1. Partial restriction maps and sequencing strategy of cloned dog apolipoprutein cDNAs. a) A@-I cDNA, b) apoC-I cDNA; and c) apoE cDNA. Sequencing was performed by the dideoxynudeotide chain termination technique of Sanger et al. (28). The direction and extent of each sequencing reaction are repmented by the snows. Synthetic oligo- nucleotides were used as secluencina primers. Restriction enzymes: b, BglI; h, Had; n, NarI; s, SmaI; t, PstI; v, PuvII; x, X h O I .

- -

the tissues are 100, 17.5, 7.5, 6.9, 5.9, 5.5, 5.0, 3.3, 1.1, and 1.0, respectively, in the following organs: liver, jejunum, uri- nary bladder, ileum, colon, brain, kidney, spleen, pancreas, and testis (Fig. 5).

Rates of nucleotide substitution in the a@-I, C-I, and E genes

First, we consider the rate of synonymous substitution, which can be computed from the number of substitutions per synonymous site (Ks) between genes (Table 1); a@-I will not be considered because its coding region is short so that the estimate of Ks has a large standard error. The dog species is thought to have branched off slightly earlier than the divergence among the human, rat, and rabbit species (35) and may therefore be used as a reference to infer the

Ks values in the latter lineages. Let a and b be the lengths from the ancestral node of the human and rat lineages to human and rat, and c be the length from the same node to dog (we consider the rabbit species below because its evolu- tionary position is uncertain). As calculated in able 2, a = 0.20, b = 0.53, and c = 0.13 for a@-I and a =

0.12, b = 0.47, and c = 0.17 for apoE, the averages being a = 0.16, b = 0.50, and c = 0.15. Therefore, the synony- mous rate in the rat lineage is approximately three times higher than those in the human and dog lineages. The K, values in the dog, human, and rabbit lineages (denoted by a, c, and d, respectively) are estimated to be a = 0.17, c = 0.14, and d = 0.15, suggesting that the synonymous rates in these three species are similar.

Next, we consider the rate of nonsynonymous substitu-

1738 Journal of Lipid Reaearrh Volume 30, 1989

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

- 20 -10 -1 cgteecttcagg ATC MA GCC CCA CTG CTC ACC TTC GCC GTC CTC TK C K ACC GGG AGC CAG GCT CGG CAC TTC TGG Cffi E M

H o t Lys Ala A l a Lou Lou Thr Lou Ala Val Lou P h o k u Thr Cly Sor Gin A la Arg Hi. PRO Trp C l n G l n

-1 10 20 30 SAT SM ccc EM; TU ccc TGC CAT ccc GTG MC CAT TTA ccc ACC ctc TAT CTC CAC CICA GK AM t ~ c MK c a M;A GAC TAT CTG GCC Asp clu Pro Cln Jor Pro T r p Asp Arg Val Lys Asp LOU A l a Thr Val Tyr Val Asp A l a Val Lys Asp Ser Cly Am Asp Tyt Val Ai.

40 50 63 CAC TTT GM ccc +cc GCC CTG CGA MA cffi CTC MC CTC AM crc CTC CAC MC TCG GAC AGC ctc AGC &a ACG GTC ACE AM ctc ccc Sln Pho G:u la 5.1 Ala Lou Gly Lys C l n LOU A m Lou Lyr Lou Lou Asp A s n Trp Asp So? Lou Sor Sor Thr Val Thr Lys Lou ArJ

SM cffi ATC CGC ccc CK ACC cffi cffi TTC TGC GAT AAC CTC GAG AM GAG A a CAC cte CIG ctc CAG GAG ATG A a MG GAC CTG SAC

100 110 123 GAG GTG MG cffi AM CTC CAG ccc TAC ctc CAC GAC TTC CAG MG AM TCC CAG CAC GIG CTG GIG CIC T K c a CAG MG GTG GCC CCG 5:u v a l Lys c ln Lyr Val C l n Pro Tyr Lou Asp Asp Pho G l n LYI Lyr Tip C l n Glu clu Val Giu LOU Tyr UQ G l n Lyo Val Ala Pro

crc CGC TCG GAG CTG CGC GAG CGC GCG CGC CAC AM CTC CAG GAG CIG CAC GAG 1 u ~ cte of CCG ctc GCG CAG GAG ctt coo GAC ca LOU cly Sor c l u LOU Arg C l u Cly Ala Arg Cln Lys k u G l n Clu Lou Cln Clu Lys LOU Sor pro LOU la ciu clu ku A r g Aop Arg

70 EO 90

G l u S l n 11. Cly Pro Val T h t G l n Clu PhO T r g Asp A s n LOU Clu LyB Clu Thr Glu Val k u A r g C l n Glu Nee Sot Lys Asp LOU Clu

130 140 150

160 170 1 a0

A l a Arg Thr Hi. Val Asp A l a LOU A r g Ala C l n LOU Ala Pro Tyr SOr A8@ Asp k U A r g Glu A r g IrU A h A l a A r g k u ClU Ala LOU CCG CGC ACC CAC m GAC CCC CTC CGC GCC CAG CTG CCC c a TAC AGC GAC GAC CTC CGC GAG CGC CTG.GCC wc CGG c'zc GAG GCG CTC

190 200 210

Lyr Glu Cly Cly Gly A l a Sac Lou A l a GIu Tyr H l s A la A r g A l a SO? Glu C l n Lou SOr A l a LOU Cly C l u Lys Ala A r g P r o A l a Lou

220 2 30 2 43

Glu Asp Lou Are G l n Gly Lou Lou Pro Val Lou Clu Sor Pho Lys Val tor Lou Lou Ala Ala 11. Aap C:u A l a Thr Lyr Lyr Lou A m

AAG GAG GGC GGC WK ccc AGC cte GCC GAG TAC CAC GCC AGG ~ t c AGC GAG cffi c t e h a GCG CTC GGC GAG AAC ccc A G ~ CEC GCG CTC

GAG GAC CTC CGC CAC CGC ctc CTC ccc CTG CTC GAG AGC TTC MG GTC A a ctc CTC GCI GCC AX CAC GAG GCC ACC MG AM CTS AAC

SCC :ffi TGA ggcgcccoccgecccgccccgcccgcccgtccgcc~ceccggscn~~aac~aacgcttccceac~a - poly(A) A:. G l n

Fig. 2. and variant polyadenylation sequences are underlined

- Dog a@-I cDNA and deduced amino acid sequence. The signal peptide sequence is identified by the negatiw numbering. The putative pmpeptide

tion in the signal peptide region. in a@-I, the KA values for the human, rat, rabbit and dog lineages are 0.00, 0.16, 0.03, and 0.03, and in apoE, the KA values for the human, rat, and rabbit lineages are 0.04, 0.18, and 0.22. These re- sults suggest that ve'y few nonsynonymous substitutions have occurred in the signal peptide region of the human a@-I and apoE genes and in that of the dog a@-I gene since the time of mammalian radiation. The same appears

-20

to be true for human and dog apoC-I because the KA value in this region is only 0.06 between human and dog (Table 1). It is not clear why the signal peptides of these pro- teins in the human and dog lineages have been so well con- semd. In the rabbit lineage, the signal peptide in apaA-I has evolved at a low rate (0.38 x lo-), whereas that in the apoE has evolved at a very high rate (2.75 x 10-9. The sig- nal peptide in rat a@-I and apoE have evolved at a high

-- :cccttcggccccOcc ATC AU; CTC fin CTC TCG CTC ccc CTT TTC CTC CTG c n CTC TCG AIG CTT m CM 001 c u GCC CCG KC CAG ccs

*1 10 20 33

A l a cly Glu 11. tor Sor Thr Lou Clu Ara 110 Pro AID LYB Lou Lys Clu Pk. Cly Asn Thr LOU Clu Aop LYB Ala A r g Ala Ala :io

O W AGC A X AM MG AGC GAC ATT CCP GCA AAC ACC CGA TCG TTT Tff GAG GCT TTT Mc MA CtG M Gllc CAT CTC AhR IIR CCC G l u Sor 11. Lys Lys Sor Asp 110 Pro A l a Lys Thr Arg A m T i p Ph. So? Clu Ala P h o Lyo Lyo V.1 Lyo Glu Hlo Lou y o Thr Ala

not Arg ~ o u 11o Lou Sor LOU Pro Val Lou Val Val Val Lou Sor wlt Val h u Clu Val Pro A l a P r o A l a C l n Ala

GCC GGA CM ATC TCC MK ACT T R GAG COC ATC CCC CAT AM CTC AM GAG TTT CCT AM ACC CtG CM WIC W GCC CGG GCA GEC A r

40 50 60

Fig. 3. sequence is underlined.

Dog a+-I cDNA and deduced amino acid sequence. The signal peptide sequence is identified by the negative numbering. The polyadenylation

L q Li, and Chan Dog a@-I, E, and C-I aequences 1739

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

-1 10 20 30 M G GTC CAG CAG GAG CTG GAG CUL GAG CCC Gffi TGG CAG ACT GGC CAG CCC TGG clv; GCG CCG CTC GCC CU: TTC TOG CAT TAC CTC CCC ~ y s v.1 Gln d l n G l u ~ o u G l u Pro G l u A h Cly T r p G l n Thr Gly C l n Pro Tm Glu Ala Ala Lou A h Arg Pho Trp A- tyr L.U Arg

40 50 60 TGC G I G CAG ACG CTG TCT SAC CAG GTC C M CAG GCC CTG CTC MC ACC CAG GTC ACC CAG CM CTG ACG GEC CTG A% GILT GAG ACC ATG rrp V I 1 cln Thr ~ o u sor ASP C l n V a l C l n Glu Gly V a l Lou A m Thr Gin V a l Thr Gln G l u k u t h r A l a fnu nee A . p Glu Thr mt

70 a0 90 M G SAG GTG MG CCC TAC M C CCG GAG CTG CAC GAG CAG CTG GGC CCC ATG ACC TCG GAC ACG CAC GCC CGC G I G GCC ME GAG CTG CAC Lys Giu V a l L ~ S A l a Tyr Lya A l a G l u Lou A S 0 G l u G l n Lou Gly Pro Mot Thr Sor clu Thr C l n A l a Arg V a l Ala Lys Glu Lou G i n

100 110 i 20 GCG GCC CAG CCC CSC CTG CGT GCG GAC ATG GAG ‘SAC GTG CGC MC CCC CTG ACG CAG TAC CGC GGC GAG Cr(; CAG GM AN; CTG GGC CAG Ala A l a C l n A h Arg Lou A r g Ala As0 Not G l u Asp Val Arg Asn Arg Lou Thr Cln Tyr A r g Gly Clu k u Gln Ala Ikt Lou Gly G l n

130 140 :so AGC AGC EM; EM; CTC CGG GCG CGC T I C GCC TCC C M ATG CGC MG TTG CCT MG CGG GTC CTG CGG W GCE EM; GM CTG CAG ACG CSC 3.1 S ~ C c lu ciu tau Are Ala A r g Ph. A l a Sor His Hot Aru Lya Lou Aru Lys A r g Val L.u Arg A- A l a Glu A a p Lou Gln A r g Arg

1 6 0 170 180 CTG ccc GTC TAC ME GCC GGC GIG ccc GAG GCT GCC CAI; c a AGC GTG AGC AGC AK c a GAG CGC CK TGG CCG cm m GAG cffi GCC k u Ala V a l Tyr ~ y s la Gly V a l Arg C l u Gly Ala Glu Arg Sor V a l Sor lor Ilo Arg C l u A r g IAU T r p P r o fnu Lou G1u Gln Ala

190 200 210 CGC GAG c a MC GCC MG CTG GGC GCC CTG ccc ACE CAG CCG ctc CK GAG CGG GCC GM GCC roc G# CILG CAG CZG CGC QGC CAG chi Arg Glu Arg A8n Ala Lys V a l Gly A l a Lou A l a Thr Gin P r o Lou k u Glu A r g Ala A- A h T r p Gly Gin Gln k u Acg Gly Gln Lou

220 230 2 40 GAG G f f i ATG AGC AGC CGG GCC CCC SGC CAC CTG GAC GAC ATG CGE GAG CAG ATA CAG GAG GtG CCC CtG AkG A t 0 GAG GI0 CAG GCC C&C Glu G l u r4oc Sor Sor Arg A l a Arg Gly H i s Lou G l u Glu Mot Arg Glu C l n 110 Gln Glu V a l A r g Va l Lys Glu Glu C l n Ala Asp

250 2 60 2 70 CAG ATA CGC C M ME GCC EM; GCC T I C CAG CCG CGC C K MG AGC TGG T K OAC CCC C t G CIO CAA GM ATG CAG CCC CAG TGC W Gw C l n 110 A r g Cln Lys Ala Glu A l s Pho Gln Ala Arg Lou Lys Sor T r p P k . C l u P r o Lou Lou Glu Asp Mot Gln Arg Gln T r g Asp Cly

2 BO 2 9 0 CTG GTG GAG MG GTG CAG GCG GCC GTC GCC ACC ATC CCC ACC TCT ME CCT CTG GAG CM CCA TCh ~ ~ C C C C ~ C J C ~ C u C C t ~ c t ~ ~ ~ C C C C C C C Lou Val Glu Lys V a l Gln A h A h V a l A l a Thr 11. Pro Thr Sor Lys Pro Val G l u G l u Pro

C C J C t C C t C C C C ~ C ~ C C C C C t ~ C C t g C t C C C J C g C C t C C J ~ ~ J ~ ~ c t g C C C C t g C C C C C J g ~ q t C C t C c t g J J J W g C C C C J 9 C t C ~ g J C C C J C C J J g C t C ~ C C C WlylAl

Fig. 4. Dog apoE cDNA and deduced amino acid sequence. The polyadenylation sequence is underlined.

rate, approximately 2.0 x IO-’ substitutions per site per year, which is two times the average rate (0.9 x lo-’) for human and rat genes (29).

Finally, we consider the nonsynonymous rate in the ma- ture peptide region. For the divergence between dog and human apoC-I, the nonsynonymous rate is 1.4 x lo-’, which is considerably higher than the average rate for hu- man and rat genes (29). In apd-I , the KA values are ap- proximately 0.055, 0.195, 0.065, and 0.042 for the human, rat, rabbit, and dog lineages and the corresponding nonsyn- onymous rates are 0.66 x lo-’, 2.44 x lo-’, 0.81 x lo-’, and 0.53 x lo-’. In apoE, the KA values are approximate- ly 0.055, 0.13, 0.09, and 0.11 for the human, rat, rabbit, and dog lineages, the corresponding rates being 0.69 x lo-’, 1.63 x lo-’, 1.13 x lo-’, and 1.38 x lo-’. In both a p d - I and apoE, the rate in the rat’lineage is three to four times the rate in the human lineage. Previously, from a compari- son of human and rat a p d - I and apoE genes, we concluded that a@-I has evolved considerably faster than apoE (36). However, it is now clear from the above computation that this is true only for the rat lineage, though it is not clear why rat a+-I has evolved exceptionally fast. In the human lin- eage, a p d - I and apoE have evolved at the same rate,

100

.- 4- ‘O la

h

Fig. 5. Relative concentration of apoE mRNA in dog tissues. The con- centrations are deduced from the regnssion coefficients (slop) of densito- metric measurements of autoradiographs of slot blots obtained from graded amounts of dog total RNA hybridized to %’-labeled dog apoE cDNA clone, E-12.

1740 Journal of Lipid Research Volume 30, 1989

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

TABLE 1. Number of nucleotide substitutions per synonymous site (Ks) and per nonsynonymous site (KA) between mammalian apolipoprotein genes

K A

Gene Species Pair KS Signal Peptide Mature Peptide

0.53 f 0.13 0.06 f 0.03 0.23 f 0.05 c - I Human vs dog A-I Human vs dog A-I Human vs rat 0.73 f 0.10 0.15 * 0.07 0.26 * 0.02

A-I Dog vs rat 0.66 f 0.09 0.19 + 0.08 0.24 + 0.02 A-I Dog vs rabbit A-I Rat vs rabbit 0.68 f 0.09 0.19’ f 0.08 0.25 * 0.02

NA 0.16 f 0.02 E Human vs dog 0.28 f 0.04 E Human vs rat 0.59 f 0.07 0.22 f 0.09 0.18 f 0.02 E Human vs rabbit 0.34 f 0.05 0.26 * 0.10 0.14 f 0.02 E Dog vs rat 0.64 f 0.08 NA 0.23 f 0.02

0.19 * 0.02 E Dog vs rabbit 0.29 f 0.04 NA E Rat vs. rabbit 0.69 f 0.09 0.40 f 0.13 0.22 * 0.02

0.33 0.05 0.03 * 0.03 0.09 f 0.01

A-I Human vs rabbit 0.34 f 0.05 % 0.03 * 0.03 0.12 f 0.02

0.32 f 0.05 0.06 f 0.04 0.10 f 0.01

whereas in the rabbit and dog lineages, apoE has evolved faster than a@-I. Therefore, on the average, a@-I seems to be more conservative than apoE.

From a comparison of a partial rat apoB cDNA sequence (37) corresponding to residues 595 to 979 in human apoB with the human apoB cDNA sequence (38), we obtain KA = 0.09 + 0.01 and a nonsynonymous rate of 0.56 x lo-. As this rate is considerably lower than the average rates for human and rat a@-I and apoE, this part of apoB is probably more conservative than a@-I and apoE. A par- tial sequence for the 3‘ end of the chicken apoB cDNA is now available (39) and we have estimated that the KA value between this sequence and the corresponding human se- quence (codons for residues 4204-4536) is 0.59 * 0.04. As- suming 300 million years for the divergence between mammals and birds, we obtain a rate of 0.98 x lo-’. On the other hand, the KA value between chicken and human a@-I (40,41) is 0.41 + 0.03, corresponding to a rate of 0.68 x lo-. Thus, this part of the apoB appears to be less conservative than apoA-I.

Comparison of protein sequences

We have aligned, for each protein, the sequences from different species (Fig. 6). In each of these three proteins, as well as the other soluble apolipoproteins, there is a common block of 33 residues at the end of exon 3 (42), and the regon encoded by exon 4 contains repeats of 11 or 22 residues, which are labeled as repeats A-1-4, A-1-5, etc.

Although apoA-I has evolved rather rapidly, many resi- dues have been conserved among all the five vertebrate species (indicated by #), or among all the four mammalian species (indicated by + ) (Fig. 6). Further, a close examina- tion rweals that most of the amino acid substitutions occur between residues of similar biochemical properties, e.g., hy- drophobicity. We noted above that rat apoA-I has evolved exceptionally fast in terms of nucleotide substitutions. Fig. 6A shows that this is also true in terms of deletions: rat apoA-I contains four deletions in the boxed region, whereas among the other four sequences, only rabbit apd - I con- tains a deletion in this region.

TABLE 2. Rate of substitution per site per year

From KA Values

Branch From Ks Signal Mature Gene Length Values Rate Peptide Rate Peptide Rate

A-I a 0.20 2.5 10-9 0.00 0.00 x lo-’ 0.055 0.69 10-9 A-I b 0.53 6.6 x lo-’ 0.16 2.00 x 10‘~ o 195 2.44 10-9 A-I C 0.13 1.6 1 0 - ~ 0.03 0.38 x lo-’ 0.042 0.53 10-9

E b 0.50 6.3 10-9 NA NA 0.130 1.63 x 10-9 E C 0.15 1.9 10-9 NA NA 0.110 1.38 10-9

E a 0.16 2.0 10-9 NA NA 0.055 0.69

The parameters a, b, and c are the lengths from the node connecting the human, rat, and dog lineages to human (H), rat (R), and dog (D). They are given by a - (DHR + DHD - DRD/~), b = (DHR + D ~ D - DH&’) and c = (DHD + DRD - DHR/~) , where Dxy is the distance (Ks or KA) between species X and Y; for the method, see Fitch and Margoliash (60). The substitution rates are calculated under the assumption that the three species diverged 80 million years ago (61). NA: The nucleotide sequence for the signal peptide of dog apoE is not available.

Luq Li, and Chm Dog a@-I, E, and C-I sequences 1741

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

6867 ‘OE aurnloA TJeassd Prd3 P Panof ZPLT

-u! 30 uo!muasuor, 30 suo!leqdur! ayA ,suo!uasu! pue SUO!~ s! l! ‘.xaAaMoH .suo@aJ pamasuox~ou X~quap! 01 qnqpp -a[ap Su!pnpu! saSueyr, ~N.I ywuo:, suo@a.~ paxoqun aq~ s! l! os alqepw MOU am samanbas OM^ Xpo ‘I-Dode JOJ

soa1ayM ‘paMasuor, IIaM uaaq sey sanp!saJ EE 30 y3oLq UOLU ‘(2.b) gode u! ueyl I-vode u! pamasuo3 Jauaq SI u~al -wo:, ayl ‘alddurexa JOJ suo@a.~ .Suodure saum uo!leMasuo:, -led leadal ayl ley 3x3 aql Xq pue sapads ue!purureur ~no3 amanbas 30 aafiap aqJ ‘u!alold yxa u! ‘ley1 alou aM a91 Buoure pamasuo:, uaaq amy gode u! sanp!sal .I~M~J 1eyl

.gode pue I-vode ueyl pamas 8 pue vg ..%A ~0.13 uo!leMasqo ay Xq pauoddns s! I-pde -uo3 ssa~ an u!aloJd s!y130 suo@al ~p lev g %J ~0.13 Jeap ueyl aqeuasuo:, ssa~ SF gode ley1 uo!snpuoo aAoqe ayL

,(sc ‘pq) uoga. Bu!pu!q Jolda3al aqeznd acde murnq ‘gq !uay-J!q:, ‘~3 !,!qqel ‘88 !)el ‘~8 !ueurnq ‘nH :Bop ‘x :uo!$elou sapadg ‘x kq pareznpu! are panduo3 sapads m~purmur OM) aq3 u! pamasuo:, sanp!s;u aqL .I.Dodv (3) ’ + .4q palvxptq am sapads rrepumur .moJ aq) Buom pamasuo3 an leq sanp!sal aqL .gcdv (a) ’ + Lq palexpu! am uay3yJ u! IOU lnq sapads ue!purmur moJ aq Buom pauasuo:, an leq) asoq pm # .4q pa,expu! ale sapads almqauah ~AIJ aq p Buom pamasuo3 am ~eq sanp!saJ an ~1”qodv (v) .paxoq an suo+aI asaq, p !31a ‘~-1-v ‘p-1-v sleadaJ ST pa~aqe~ an q~!tp ‘sanp!sar ZZ 10 11 JO sleadar suyuo3 p uoxa .4q papma UO+J aq pue ‘(zp) E uoxa JO pua atp )E sanp!saJ EE JO yqq UOUI~O~ E s! aJatp ‘u!alold q3ea UI .saxanbas p!3e ou!m JO uosvduro3 ‘9 *%a

~s~~so~IYsI~JIM~~~~LN~~~~~~ LXY~I~S~YIS~I~~~?JNE)J~~~~

---sshaa nw x xx xxxx x xxxx x xx x xx x xxxxxxxxxxxxxxx

b-1-3 1JOTq UOPOJ-EC xx

I-30dV (3)

O?X3Al3VMbYOWa3Ald3 OIY3WlNYNoYbWa3Ald3 OAX3A?3VNbYbWa3Ald3 OA~~AT~~NOYOW~~T~~

++ ++ + +++++++ +++++ + ++++ ++ + + +++ bl-3 El-3

Y%VSAE)Y3V33W3V3 ~mrvs~3nno30~o~n YmIVs?3~3v3~3Vo Y3MISSASY3V33?3A3VY

6-3 + +++ ++ + + + ++ + t ++ +

Zl-3 11-3 01-3 + + + + ++++ + +++++ ++

~vLvmmYnIyzwum3m no)la~v~~~~n~?n~?~~~~~~~~sO~?mOv~o~h

WoWol3XVAMVoL3SLW ~bla3v~~~n~1ln~~~v~~?33~~031m0133~h

W~LV~h3XVl~L33Vh ~~~~OII~IW?Y~~Y~HLS’IYSYI~~LSO~~WLN~~~~ -V~VAOT~~ST~H~OW

~bl~~~llY~l~lHSV’I~Yl33LSb3lWIlbh33Yh ~OwO?~ns?m~33v~

+++ + +++ +++ +++ ++ + ++ +++++ + ++t ++ +++++ + + + ++ ++ + + + 8-3 L-3 9-3 5-3

B3~~~n~vn~3nw~33w-1 13bLAbSS1133~A~~S1S~ILMYlhaMMMLlf)l ~dOf)~-----W~3d-A3A3~3L3 033?3nn~vnh3~w~a~ 13~Lh~SS~l33~Ab~SlLbILMMYlhaMMMNl ~dOa~~-------d?Oa~-~31333 033msnxvn~3nw~~aw~ 13bLAOSS?133bhb3SlL~ILMYlX~MM~? ~Y~SbM3L.WY?3d3dlL3AV03An 03amvnxvn~3nwmaw~ 13~LAbLNlA33~A~aSlL~ILMYlX~~l ~Mdb3LbM------3Y3d--3?3.WAn ++ ++ ++++ + ++ ++ ++++++ ++++ ++ ++++++++++ ++ ++ +

b-3 XaoTq UOPO~-fE

BY

nw LY

ga

BY

nw LY

sa

au BY LY nw sa

BY LY nw oo

30dV (81

- 0 Btl

H3

0 LY

0 nw

-

# #++X # + ++ ++# +## # # + ## + +++# # #+ +# ## ET-I-v Zl-I-v 11-I-V 01-I-v

3a

#* + # ### ## # # + # + X# +# +### # #X ## +# ## #+ # # ### +# ## 6-I-V 8-I-v L-I-v 9-I-v

a3?mwvvvs~Lamaw 5n3AVSS3JbVIVm3S~L3lM~ 03b?nSALSS?swa?? On3JVSV3JbVM3Yt)S~L~LVJ O~O~YMLS~TL~~?? bnslLSS3aOSM~3S~V~LVJ 03YlnS~LSLASaMNall br3lVS33abSAh~3SalA~L~ 03Y?nLALsS?swa~? ~~3lYSV3PdVM~3S~AV~L~

#+*+++ ++#+++#+ +# # ++ #+##+# #+#### # ### ++ +##+# ++##++ #+ #+ + ### 5-I-v b-I-v 1JOTq UOPO3-EE

1-vodv (VI

by

gues

t, on

Jan

uary

6, 2

016

ww

w.jl

r.or

gD

ownl

oade

d fr

om

dividual regions in each protein for structure-function will be discussed later.

Origin of the human apoC-I pseudogene

We have compared the human a@-I pseudogene (43) with its functional counterpart. In the coding region, the yS and KA values are, respectively, 0.07 0.04 and 0.13 * 0.03, the latter being almost twice the former. This is sur- prising because, in a functional gene, Ks is usually much larger than KA (see Tables 1 and 2). Although the low Ks value is probably an extreme random deviate, because the region compared is short, this observation suggests that the pseudogene became nonfbnctional soon after its formation; afler that nonsynonymous substitutions were not subject to functional constraints and could occur at a high rate. This suggestion is supported by the fact that the two genes have diverged almost as fast as Alu sequences, which are now commonly thought to be pseudogenes (44); for the five Alu sequences at corresponding positions in the two genes (43), the numbers of nucleotide substitutions per site are 0.13, 0.15, 0.13, 0.16, and 0.16, which are only slightly higher than the KA value given above. The average for the five num- bers is 0.15, which is higher than the number (0.11) of substi- tutions per site between the human and owl monkey 7 glo- bin pseudogene (45). Thus, the apoC-I pseudogene probab- ly arose earlier than the divergence between the human and owl (New World) monkey lineages, i.e., about 35 million years ago. It will be interesting to see whether this pseudo- gene is in fact also present in New World monkeys.

DISCUSSION

Tissue expression of dog apolipoprotein "As A@-I is expressed only in the liver, whereas a@-I is

expressed both in liver and in the intestine, even though the a+-I mRNA concentration in the latter tissue is only - 15 % of that in the liver. In contrast, apOE mRNA is present in many tissues including liver, small intestine, urinary blad- der, colon, brain, kidney, spleen, pancreas, and testis. The widespread distribution of apoE synthesis is consistent with the experiments in vivo showing that newly synthesized apoE present in interstitial fluids contributes substantially to the plasma apoE pool in dogs (46). The almost ubiquitous presence of apoE mRNA in the dog is reminiscent of similar findings in other mammals, including humans, nonhuman primates, and rodents (16-20). Like other mammals, canine brain tissue contains a substantial amount of apoE mRNA. It is the source of the fairly high concentration of cere- brospinal fluid apoE in the dog (38-60% of the plasma con- centration compared to - 5.4% in man) (47). In other mammals, the small intestine expresses little, if any, apoE mRNA. In contrast, dog small intestine seems to contain a

substantial amount of the mRNA, suggesting that in this species, the small intestine is potentially a sjurce of an ap- preciable amount of circulating apoE. Since the RNA was prepared from total jejunum, we cannot be certain whether the mRNA is derived from the mucosa or submucosal tis- sues or both.

To date, we have examined the tissue distribution of a number of dog apolipoprotein mRNAs, including those of apoC-11, C-I11(36), (2-1, A-I, and E (present study). These apolipoproteins showed substantial differences in their site of synthesis: one of them is expressed exclusively in the liver (a@-I), one in both liver and intestine (a@-I), and three others (a@-11, C-111, and E) are expressed in a wide varie- ty of tissues. Future studies using the dog as an experimen- tal model for lipoprotein metabolism should take into consideration the relative tissue distribution of these mRNAs. The apolipoproteins synthesized in such tissues may perform some specific bc t ion locally. They also contribute to the cir- culating pool of apolipoproteins in this animal.

Relative rates of evolution and functional aspects of apolipopmtein structure

In a@-I, the 33 residue common block is well conserved among species and so are repeats 4, 5, 6, and 7 (in each re- gion, over 55% of the residues are conserved among the four mammalian species) (Fig. 6A), suggesting that the stringency of structural requirements in these regions is fair- ly strong. A major function of apoA-I is the activation of LCAT (10-12). Soutar et al. (23) have shown that both the amino- and carboxyl-terminal cyanogen bromide fragments (residues 1-85 and 147-243) activate LCAT; in the latter fragment, residues 145-182 seem to be involved in the acti- vation process. Surprisingly, this part of a@-I sequence, which corresponds to repeats A-1-9 and A-1-10, is less con- served than the other repeats (Fig. 6A). Thus, the structural requirements for LCAT activation may not be stringent. This conclusion is supported by the fact that a@-I, which can also activate LCAT, has evolved rapidly (see a h ) . Moreover, synthetic model peptides that mimic apoA-I sur- face properties but differ from a@-I in primary sequences are effective in LCAT activation (48). While LCAT activa- tion is a major function of a@-I, a high rate of evolution may still occur since the protein can undergo considerable change in its primary structure without impairment of its function.

All regions of apoC-I are less conserved than a@-I and apoE. Thus, the functions of apoC-I, which have been sug- gested to be phospholipid binding and LCAT activation, do not have very stringent structural requirements.

In apoE, comparison of the internally repeated regions (the common block, and repeats E-4 to E-14, boxed in Fig. 6B), and the flanking nonrepeated sequences in the mature peptide (unboxed in Fig. 6B) indicates that the former are much better conserved than the latter. It suggests that the

Lua, Li, and Chan Dog apoA-I, E, and C-I sequences 1743

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

repeats serve functions that have a more stringent structural requirement. In particular, the 33-codon common block is especially well conserved, whereas the region immediately preceding it has evolved much faster. Repeats 4, 6, 8, 9, 13, and 14 are better conserved than the other repeats.

In aqueous solution, there is evidence that apoE contains two independently folded structural domains: a relatively unstable self-associating carboxyl-terminal domain (residues 225-299) rich in amphipathic helices, and a more stable amino-terminal domain (residues 20-165) that resembles a soluble globular protein in structure (49, 50). The two do- mains are connected by an exposed peptide segment or hinge region that appears to have random coil structure and is highly susceptible to proteolysis. Inspection of Fig. 6B in- dicates that the amino- and carboxyl-terminal domains are considerably better conserved than the connecting hinge re- gion. This variation in interspecies homology over the entire length of apoE supports the thesis that the two structural do- mains require much more stringent sequence conservation for their functions than does the hinge region.

ApoE is an important determinant in the interaction be- tween apoE-containing lipoproteins and cell-surface recep- tors (51). Studies using monoclonal antibodies, natural mutants, and site-specific mutants produced in vitro localized the receptor-binding region to the vicinity of residues 140-150 and have thus far identified at least eight specific residues (#136, 140, 142, 143, 145, 146, 150, and 158) as cru- cial residues involved in receptor binding (52, 53). Further, the a-helical conformation' in this region also appears essen- tial, since substitution of Pro for either Leu'44 or Ala'" will interfere with binding activity. Examination of the degree of conservation of the various repeats in apoE (Fig. 6B) indi- cates that E-8 (residues 130- 166), which encompasses the re- ceptor-binding region, is indeed one of the most highly conserved regions of apoE.

When we specifically compared the eight residues that have been directly implicated in receptor binding, only one substitution (an Ala for AI-^'^^, human equivalent) in the rabbit sequence was seen among the four mammalian se- quences. In this case, the Ala was immediately preceded by an Arg not present in the other sequences. It is noteworthy that a subject heterozygous for apoE-2-Christchurch (result- ing from an Arg136+Ser mutation) and classical a p E 2 (Le., A~-g'~~+Cys) presented with Type 111 hyperlipoproteinemia (53). A geneticdy engineered A~-g '~~+Ser mutant also showed only 41% of the normal apoE receptor binding to the LDL receptor (54). The relative receptor binding activi- ty of rabbit apoE, which has an ArglS6+Ala substitution, has not been determined. When we aligned the putative re- ceptor binding sequence for human apoB-100 (residues 3352-3372) (dksignated hB in Fig. 6B, regs. 54, 55) with the corresponding apoE sequences, additional differences involv- ing residues 136 (Arg'Lys), 140 (His+Thr), 143 (Lys-+Leu), 150 (Arg+Lys), and 158 (Arg+Ser) are evi- dent. This analysis suggests that though apoB-100 and apoE

appear to bind to the same receptor, they might do so by interactions thay may not be identical, perhaps accounting for the significant differences in affinity between them (56). Furthermore, structure-function torrelation studies on the human low density receptor support this interpretation. A mutant low density lipoprotein receptor from a patient with familial hypercholesterolemia has been, described that has lost its binding affinity for low density lipoproteins, but has retained its ability to bind apoE (57). Finally, when different regions in the ligand binding domain of a cloned low density lipoprotein receptor are deleted, binding of the receptor 'to low density lipoproteins or to &migrating very low density lipoproteins (containing apoE) is affected in a dissimilar manner (58). Therefore, apoB-100 and apoE interact with the low density lipoprotein receptor via distinct mecha- nisms. We had difficulty aligning the apoE sequences with the other putative receptor-binding region in apoB-100 (residues 3147-3157) proposed by Knott et al. (59). TO date, experimental support for receptor-binding activity is not available for this domain. Our comparison suggests that if this domain is a bona fide receptor-binding sequence, its mechanism of interaction must differ even more in detail from the apoE-receptor interaction than does the domain lo- cated closer to the carboxyl terminus (residues 3352-3372). Blll

This work was supported by National Institutes of Health Grants GM-39927 (to W. H. L.) and HL-16512 (to L. C.). Manum@ rzceiued 20 Mmh 1989 and in miud fm 30 May 1989.

REFERENCES

1.

2.

3.

4.

5.

6.

7.

Mahley, R.W., T. L. Innerarity, S. C. Rall, Jr., and K. H. Weisgraber. 1984. Plasma lipoprotein: apolipoprotein struc- ture and function. J. Lipid Res. 25: 1277-1294. Li, W-H., M. Tanimura, C-C. Luo, S. Datta, and L. Chan. 1988. The apolipoprotein multigene family: biosynthesis, structure, structure-function relationships, and evolution. J.

Mahley, R. W. 1978. Alterations in plasma l i p o p 6 induced by cholesterol feeding in animals including man. In Distur- bances in Lipid and Lipoprotein Metabolism. J. M. Dietschy, A. M. Gotto, Jr., and J. A. Ontko, editors. American Phy- siological Society, Bethesda, MD. 181-197. Chuang, M, Y., L. Wong, W. R. Gallaher, J. J. Thompson, and P. S. Roheim. 1985. Production and characterization of a monoclonal antibody to dog hepatic lipase. Bkhim Bw&~s.

Dory, L., L. M. Boquet, R. L. Hamiltoi, C. H. Sloop, and P. S. Roheim. 1985. Heterogeneity of hcg interstitial fluid (peripheral lymph) high density lipoproteins: implications for a role in reverse cholesterol transport. J. Lipid Res. 2 6

Ban; D. I?, E. M. Russ, and H. A. mer. 1951. Protein-lipid relationship in human plasma. 11. In arteriosclerosis and re- lated conditions. Am J. Mcd 11: 480-493. Gofman, J. W., 0. DeLaUa, E Glazier, N. K. Freeman, E T. Lindgren, A. V. Nichols, E. H. Strisower, and A. R. Tamplin. 1954. The serum lipoprotein transport system in health, meta-

Lipid Res. 2 9 245-271.

A c ~ 833: 69-81.

519-527.

1744 Journal of Lipid Reseaxh Volume 30, 1989

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

8.

9.

10.

11.

12.

13.

14.

15.

16.

17.

18.

19.

20.

21.

22.

23.

24.

bolic disorders, atherosdemis and coronary heart disease. Plana 2 413-4434, Miller, G. J., and N. E. Miller. 1975. Plasma high density lipoprotein concentration and devleopment of ischaemic heart disease. Lancet 1: 16-19. Gordon, T., W. P. Castelli, M. C. Hortland, W. B. Kannel, and T. R. Dawber. 1977. High density lipoproteins as a pro- tective factor against coronary heart disease. Am. J Md. 62:

Glomset, J. A. 1968. The plasma 1ecithin:cholesteml acyltrans- ferase reaction. J Lipid h. 9: 155-167. Fielding, C. J., V. G. Shore, and P. E. Fielding. 1972. A pro- tein cofactor of 1ecithin:cholesterol acyltransferase. Biochem.

Fielding, C. J., V. G. Shore and €! E. Fielding. 1972. Leci- thinxholesterol acyltransferase: effects of substrate compasi- tion upon enzyme activity. Bziuhim BI$hys. Acta 270: 513- 518. Brown, M. S., P. T. Kovanen, and J. L. Goldstein. 1981. Reg- ulation of plasma cholesterol by lipoprotein receptors. Saiwe .

S h e d , B. C., T. L. Innerarity, and R. W. Mahley. 1980. Rapid hepatic clearance of the canine lipoproteins containing only the E apoprotein by high affinity receptor. J? BWl. C k

Mahley, R. W., T. L. Innerarity, andK. H. Weisgraber. 1980. Alterations in metabolic activity of plasma lipoproteins follow- ing selective chemical modification of the apoproteins. Ann. N

Blue, M-L., D. L. Williams, S. Zucker, S. A. Khan, and C. B. Blum. 1983. Apolipoprotein E synthesis in human kidney, adrenal gland, and liver. ADC. N d . had Sci. USA. 80:

Hshourbagy, N. S., W. S. Liao, R. W. Mahley, and J. M. Tay- lor. 1984. Apolipoprotein E mRNA is abundant in the brain and adrenals, as well as in the liver, and is present in other peripheral tissues of rats and marmosets. Aoc. NdZ. had Sci.

Lin-Lee, Y.C., E T. Kao, P. Cheung, and L. Chan. 1985. Apolipoprotein E gene mapping and expression: localization of the mRNA in lipoprotein and non-lipoprotein producing tissues. Bziuhemkq. 2 4 3751-3756. Lin, C-T., Y. Xu, JY. Wu, and L. Chan. 1986. Immunoreac- tive apolipoprotein E is a widely distributed cellular protein. Immunohistochemical localization of apolipoprotein E in ba- boon tissues. J Clin. Invest. 7 8 947-958. Ignatius, M. J., E. M. Shooter, R. E. Pitas, and R. W. Mah- ley. 1987. Lipoprotein uptake by neuronal growth cones in vi- tro. & k e . 236: 959-962. Ada, E. M., G. Holdsworth, M. Sasaki, R. L. Jackson, and J. A. Harmony. 1982. Apoprotein E suppresses phytohemag- glutinin-activated phospholipid turnover in peripheral blood mononudear cells. J Biol. Cii.em. 257: 5900-5909. Mahley, R. W. 1988. Apolipoprotein E: cholesterol transport protein with expanding role in cell biology. S k e . 240:

Soutar, A. K., C. W. Garner, H. N. Baker, J. T Sparrow, R. L. Jackson, A, M. Gotto, Jr., and L. C. Smith. 1975. Ef- fect of the human apolipoproteins and phosphatidylcholine acyl donor on the activity of 1ecithin:cholesterol acyltrans- ferase. BiochrmLitry. 14: 3057-3064. Chirgwin, J. M., A. E. Pnybyla, R. J. MacDonald, and W. J. Rutter. 1979. Isolation of biologically active ribonudeic acid from sources enriched in ribonuclease. Bziuhcmistv. 18:

707-714.

BI$&s. &. CO"W 4 6 1493-1498.

212: 628-635.

255: 1804-ia07.

k: A d &. 348: 265-280.

283-287.

USA. USA. 82: 203-207.

522-530.

5294-5299.

25.

26.

27.

28.

29.

30.

31.

32.

33.

34.

35.

36.

37.

38.

39.

40.

41.

42.

Aviv, H., and P. M e r . 1972. Purification of biologically active globin messenger RNA by chromatography on olipthynndy- lic acid-cellulose. h. Natl Atad Ski USA. 69 1408-1412. Gubler, U., and B. J. Hoffman. 1983. A simple and very &- cient method for generating cDNA libraries. Cmc. 25:

Maniatis, T., Z. E Frits&, and J. Sambrook. 1982. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Labo- ratory, Cold Spring Harbor, W. Sanger, F., A. Coulson, B. Barrell, A. Smith, and €3. Roe. 1980. Cloning in single-stranded bacteriophagr as an aid to rapid DNA sequencing. J Mol BWl. 143: 161-178. Li, W-H., C-I. Wu, and C-C. Luo. A new method for esti- mating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. Mol BWl. E d . 2 150-174. Gordon, J. I., H. E Sims, S. R. Lentz, C. Edelstein, A. M. Scanu, and A. W. Strauss. 1983. Proteolytic processing of hu- man preproapolipoprotein A-I a pqposed defect in the con- version of pmA-I to A-I in Tangier disease. J BWZ. Chem 258 4037-4044. Zannis, V. I., S. K. Karathanasis, H. T. Keutmann, G. Gold- berger, and J. L. Breslow. 1983. Intracellular and extracellular processing of human apolipoprotein A-I: secreted apolipopro- tein A-I isoprotein 2 is a proprotein. Pm. NdZ. had Sci. USA.

Chung, H., A. Randolph, I. Reardon, and R. L. Heinrickson. 1982. The covalent structure of apolipoprotein A-I from ca- nine high density lipoproteins. J BioZ. C h . 25f: 2961-2967. Weisgraber, K. H., R. E Troxler, S. C. Rall, andR. W. Mah- lqr 1980. Comparison of the human, canine and swine E apo- proteins. Biochem BWp5. Res. Commun 95: 374-380. Wu, A-L., and H. G. Windmueller. 1979. Relative contribu- tions by liver and intestine to individual plasma -apolipopro- teins.J BWZ. C h 254 7316-7322. Dayhoff, M. 0. 1972. Atlas of Protein Sequences and Stxuc- ture. Vol. 5. National Biomedical Research Foundation,cSilver Spring, MD. Datta, S., W-H Li, I. Ghosh, C-C. Luq and L. Chan. 1987. Structure and expression of dog apolipoprotein C-I1 and C- I11 mRNAs: implications for the evolution and functional constraints of apolipoprotein structure. J. BWl Chcm. 262:

Matsumoto, A., H. Aburatani, Y. Shibasaki, T. Kodamn, E Takake, and H. Itakura. 1987. Cloning and regulation of rat apolipoprotein B mRNA. Bziuhem Bwphy. Res. Co"m 142:

Chen, S-H., CY. Yang, P-E Chen, D. Setzer, M. Tanimura, W-H. Li, A. M. Gotto, Jr., and L. Chan. 1986. The complete cDNA and amino acid sequence of human apolipoprotein B-

Kirchgessner, T. G., C. Heinzmann, K. L. Svenson, D. A. Gordon, M. Nicosia, H. G. Lebherz, A. J. Lusis, and D. L. Williams. 1987. Regulation of chicken apolipoprotein B: don- ing, tissue distribution, and estrogen induction of mRNA.

Cheung, €!, and L. Chan. 1983. Nucleotide sequence of cloned cDNA of human apolipoprotein A-I. Nucleic Act& Res. 11:

Bymes, L., C-C. Luo, W-H. Li, C-Y. Yang, and L. Chan. 1987. Chicken apolipoprotein A-I: cDNA sequence, expres- sion and evolution. Bkhem Biof&s. Res. Commun. 148 485-492. Luo, C-C., W-H. Li, M. N. Moore, and L. Chan. 1986. Structure and evolution of the apolipoprotein multigene

263-269.

8 0 2574-2478.

10588-10593.

92-99.

100. J BWl. Chem 261: 12918-12921.

Cmc. 5 9 241-251.

3703-3715.

Luo, Li, and Chan Dog a@-I, E, and C-I sequences 1745

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from

f d y . J . M d Bid 111: 325-340. 43. h e r , S. J., D. Walker, N. A. Elshourbagy, C. A. Reardon,

B. Levy-Wilson, and J. M. Taylor. 1988. Two copies of the human apdipoprotein C-I gene are l i e d closely to the apolipoprotein E gene. J. BWl. Chmr 263: 7277-7286.

44. Britten, R. J., W. E Baron, D. B. Stout, and E. H. Davidson. 1988. Sources and evolution of human Alu repeated se- quences. Pnu. Natl Acad Sci USA. 85: 4770-4774.

45. Li, W-H., M. Tanimura, and P. M. Sharp. 1987. An evalua- tion of the decular dock hypothesis using mammalian

%. Dory, L., L. M. Boquet, C. R. Tate, and C. H. Sloop. 1986. Peripheral synthesis and isoform distribution of dog apopro- tein E: an in vivo approach. J. BWI. Chrm 261: 811-816.

47. Pitas, R. E., J. K. Boyles, S. H. Lee, D. Hui, and K. H. Weis- graber. 1987. Lipoproteins and their receptors in the central nervous system. Characterization of the lipoproteins in cere- brospinal fluid and identification of apolipoprotein B, E (LDL) receptors in the brain. J. Bwl. Chmr 262: 14352- 14360.

48. h a l l , H. J., A. Hu, A. M. Gotto, Jr., J. J. Albers, and J. T. Sp-. 1980. Activation of lecithinxholesterol acyl- transferase by a synthetic model lipid-associating peptide. h c . N d Acad &. USA. 7 7 3154-3158.

49. Wetterau, J. R., L. P. Aggerbeck, S. C. Rall, Jr., and K. H. Weisgraber. 1988. Human apolipoprotein E3 in aqueous solu- tion. I. Guidance for two structural domains. J Bwl. C h . 263: 6240-6248.

50. Awrbeck, L. P., J. R. Wetterau, K. H. Weisgraber, C-S. C. Wu, and E T. Lindgren. 1988. Human apolipoprotein E3 in aqueous solution. 11. Properties of the amino- and carboxyl-terminal domains. J. Biol. Chon 263: 6249-6258. Mahley, R. W., and T. L. Innerarity. 1983. Lipoprotein recep- tors and cholesterol homeostasis. Biochim. Bwptvs. Ada. 737

52. Lalazar, A., K.. H. Weisgraber, S. C. Rall, Jr., H. Giladi, T L. Innerarity, A. Z. Levanon, J. K. Boyles, B. 4mit, M. Gorecki, R. W. Mahlq, and T Vogel. 1988. Site specific mutagenesis of human apolipoprotein E. Receptor binding activity of variants with single amino acid substitutions. J.

53. Wdel l , M. R., S. 0. B E M ~ ~ , E. D. James, R. Fraser, and

DNA sequences.J. MOL Ed. 25: 330-342.

51.

197-222.

Bid CAmr 263: 3542-3545.

54.

55.

56.

57.

58.

59.

60.

61.

R. W. Carrell. 1987. Apolipoprotein E2-Christchurch (136 Arg + Ser). New variant of human apolipoprotein E in a pa- tient with Type 111 hyperlipoproteinemia. J. Clin. Znmt. 80: 483-490. Knott, T. J., S. C. Rall, Jr., T. L. Innerarity, S. E Jacobson, M. S. Urdea, B. Levy-Wilson, L. M. Powell, R. J. Pease, R. Eddy, H. Nakai, M. Beyers, L. M. Priestley, E. Robertson, L. B. Rall, C. Betsholz, T. B. Shows, R. W. Mahley, and J. Scott. 1985. Human apolipoprotein B: structure of carboxyl- terminal domains, sites of gene expression, and chromosomal localization. Science. 2 3 0 37-43. Yang, C-Y., S-H. Chen, S. H. Gianturco, W. A. Bradley, J. T. Sparrow, M. Tanimura, W-H. Li, D. A. Sparrow, H. D e h f , M. Rosseneu, F-S. Lee, Z-W. Gu, A. M. Gotto, Jr., and L. Chan. 1986. Sequence, structure, receptor binding domains and internal repeats of human apolipoprotein B-100. Natun.

Innerarity, T L., and R. W. Mahley. 1978. Enhanced binding by cultured human fibroblasts of apoE-containing lipoproteins as compared with low density lipoproteins. Biochzitry. 17:

H o b , H. H., M. S. Brown, J. L. Goldstein, and D. W. Rus- sell. 1986. Deletion of exon encoding cysteine-rich repeat of low density lipoprotein receptor alters its binding specificity in a subject with familial hypercholesterolemia. J. Biol. C h .

Esser, V., L. E. Limbird, M. S. Brown, J. L. Goldstein and D. W. Russell. 1988. Mutational analysis of the ligand binding domain of low density lipoprotein receptor. J. Biol C h . 263:

Knott, T. J., R. J. Pease, L. M. Powell, S. C. Wallis, S. C. Rall, T. L. Innerarity, B. Blackhart, W. H. Taylor, Y. Marcel, R. Milne, D. Johnson, M. Fuller, A. J. Lusis, B. J. McCarthy, R. W, Mahley, B. Levy-Wilson and J. Scott. 1986. Complete protein sequence and identification of structural domains of human apolipoprotein B. N d w . 323: 734-738. Fit& W. M., and E. Margoliash. 1967. Construction of phy- logenetic trees. A method based on mutation distances as esti- mated from cytochrome C sequences is of general applicabili- ty. Sckwe. 155: 279-294. Romer, A. S. 1966. Vertebrate Paleontology, University of Chicago Press, Chicago.

323: 738-742.

1440-1447.

261: 13114- 13120.

13282-13290.

1746 Journal of Lipid Research Volume 30, 1989

by guest, on January 6, 2016w

ww

.jlr.orgD

ownloaded from