WO 2009/143603 Al
-
Upload
khangminh22 -
Category
Documents
-
view
2 -
download
0
Transcript of WO 2009/143603 Al
(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)
(19) World Intellectual Property OrganizationInternational Bureau
(10) International Publication Number(43) International Publication Date3 December 2009 (03.12.2009) WO 2009/143603 Al
(51) International Patent Classification: (74) Agent: MBM INTELLECTUAL PROPERTY LAWC40B 40/06 (2006.01) C40B 30/04 (2006.01) LLP; P . O . Box 809 Stn. B, Ottawa, Ontario KlP 5P9C12Q 1/68 (2006.01) C07H 21/00 (2006.01) (CA).
(21) International Application Number: (81) Designated States (unless otherwise indicated, for everyPCT/CA2009/000694 kind of national protection available): AE, AG, AL, AM,
AO, AT, AU, AZ, BA, BB, BG, BH, BR, BW, BY, BZ,(22) International Filing Date: CA, CH, CN, CO, CR, CU, CZ, DE, DK, DM, DO, DZ,
28 May 2009 (28.05.2009) EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN,(25) Filing Language: English HR, HU, ID, IL, IN, IS, JP, KE, KG, KM, KN, KP, KR,
KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, ME,(26) Publication Language: English MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO,
(30) Priority Data: NZ, OM, PG, PH, PL, PT, RO, RS, RU, SC, SD, SE, SG,
61/056,827 28 May 2008 (28.05.2008) US SK, SL, SM, ST, SV, SY, TJ, TM, TN, TR, TT, TZ, UA,UG, US, UZ, VC, VN, ZA, ZM, ZW.
(71) Applicant (for all designated States except US):GENOMEDX BIOSCIENCES, INC. [CA/CA]; (84) Designated States (unless otherwise indicated, for every
460-2609 Granville Street, Vancouver, British Columbia kind of regional protection available): ARIPO (BW, GH,
V6H 3H3 (CA). GM, KE, LS, MW, MZ, NA, SD, SL, SZ, TZ, UG, ZM,ZW), Eurasian (AM, AZ, BY, KG, KZ, MD, RU, TJ,
(72) Inventor; and TM), European (AT, BE, BG, CH, CY, CZ, DE, DK, EE,(75) Inventor/Applicant (for US only): DAVICIONI, Elai ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV,
[CA/CA]; Vancouver, British Columbia (CA). MC, MK, MT, NL, NO, PL, PT, RO, SE, SI, SK, TR),
[Continued on next page]
(54) Title: SYSTEMS AND METHODS FOR EXPRESSION-BASED DISCRIMINATION OF DISTINCT CLINICAL DISEASE STATES IN PROSTATE CANCER
(57) Abstract: A system for expression-based discrimination of distinct clinical disease states in prostate cancer is provided that isbased on the identification of sets of gene transcripts, which arecharacterized in that changes in expression of each gene transcriptwithin a set of gene transcripts can be correlated with recurrent ornon-recurrent prostate cancer. The Prostate Cancer Prognostic sy stem provides for sets of "prostate cancer prognostic" target se
quences and further provides for combinations of polynucleotideprobes and primers derived there from. These combinations ofpolynucleotide probes can be provided in solution or as an array.The combination of probes and the arrays can be used for diagnosis.The invention further provides further methods of classifyingprostate cancer tissue.
Figure 1Λ
OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, before the expiration of the time limit for amending theMR, NE, SN, TD, TG). claims and to be republished in the event of receipt of
Published: amendments (Rule 48.2(h))
SYSTEMS AND METHODS FOR EXPRESSION-BASED DISCRIMINATION
OF DISTINCT CLINICAL DISEASE STATESIN PROSTATE CANCER
FIELD OF THE INVENTION
This invention relates to the field of diagnostics and in particular to systems and methods for
classifying prostate cancer into distinct clinical disease states.
BACKGROUND
Prostate cancer is the most common malignancy affecting U.S. men, with approximately 240,
000 new cases diagnosed each year. The incidence of prostate cancer is increasing, in part due to
increased surveillance efforts from the application of routine molecular testing such as prostate-
specific antigen (PSA). For most men, prostate cancer is a slow-growing, organ-confined or
localized malignancy that poses little risk of death. The most common treatments for prostate
cancer in the U.S. are surgical procedures such as radical prostatectomy, where the entire
prostate is removed from the patient. This procedure on its own is highly curative for most but
not all men.
The vast majority of deaths from prostate cancer occur in patients with metastasis, believed to be
present already at the time of diagnosis in the form of clinically undetectable micro-metastases.
In these patients, it is clear that prostatectomy alone is not curative and additional therapies such
as anti-androgen or radiation therapy are required to control the spread of disease and extend the
life of the patient.
Most prostatectomy patients however face uncertainty with respect to their prognosis after
surgery: whether or not the initial surgery will be curative several years from the initial
treatment because the current methods for assessment of the clinical risk such as the various
pathological (e.g., tumor stage), histological (e.g., Gleason's), clinical (e.g., Kattan nomogram)
and molecular biomarkers (e.g., PSA) are not reliable predictors of prognosis, specifically
disease progression. Routine PSA testing has certainly increased surveillance and early-detection
rates of prostate cancer and this has resulted in an increased number of patients being treated but
not significantly decreased the mortality rate.
Despite the controversies surrounding PSA testing as a screening tool, most physicians
confidently rely on PSA testing to assess pre-treatment prognosis and to monitor disease
progression after initial therapy.Successive increases in PSA levels above a defined threshold
value or variations thereof (i.e. 'Rising-PSA'), also known as biochemical recurrence has been
shown to be correlated to disease progression after first-line therapy (e.g., prostatectomy,
radiation and brachytherapy). However, less than a 1/3 of patients with 'rising-PSA' will
eventually be diagnosed with systemic or metastatic disease and several studies have shown that
after long-term follow up, the majority will never show any symptoms of disease progression
aside from increases in PSA measurement. The limitations of using the PSA biomarker and the
absence of additional biomarkers for predicting disease recurrence have led to the development
of statistical models combining several clinical and pathological features including PSA results.
Several of these 'nomograms' have been shown to improve the predictive power for disease
recurrence in individual patients over any single independent variable. These models (see
Citation #14) are used routinely in the clinic and are currently the best available tools for
prediction of outcomes, although they do not provide high levels of accuracy for groups of
patients with highly similar histological/pathological features or those at 'intermediate' risk of
disease recurrence after prostatectomy.
The use of quantitative molecular analyses has the potential to increase the sensitivity, specificity
and/or overall accuracy of disease prognosis and provide a more objective basis for
determination of risk stratification as compared to conventional clinical-pathological risk models
(see Citation #13). The PSA test demonstrates the deficiencies of relying on the measurement of
any single biomarker in clinically heterogeneous and complex prostate cancer genomes.
Therefore, genomic-based approaches measuring combinations of biomarkers or a signature of
disease recurrence are currently being investigated as better surrogates for predicting disease
outcome (see Citations # 1-13). For prostate cancer patients these efforts are aimed at reducing
the number of unnecessary surgeries for patients without progressive disease and avoid
inadvertent under-treatment for higher risk patients. To date, genomic profiling efforts to
identify DNA-based (e.g., copy-number alterations, methylation changes), RNA-based (e.g.,
gene or non-coding RNA expression) or protein-based (e.g., protein expression or modification)
signatures, useful for disease prognosis have not however resulted in widespread clinical use.
There are several key reasons explaining why prior genomic profiling methods for prostate
cancer have not yet been incorporated in the clinic. These include the small sample sizes typical
of individual studies, coupled with variations due to differences in study protocols, clinical
heterogeneity of patients and lack of external validation data, which combined have made
identifying a robust and reproducible disease signature elusive. Specifically for gene or RNA
expression based prognostic models; the mitigating technological limitations include the quality
and quantity of RNA that can be isolated from routine clinical samples. Routine clinical samples
of prostate cancer include needle-biopsies and surgical resections that have been fixed in
formalin and embedded in paraffin wax (FFPE). FFPE-derived RNA is typically degraded and
fragmented to between 100-300 bp in size and without poly-A tails making it of little use for
traditional 3'-biased gene expression profiling, which requires larger microgram quantities of
RNA with intact poly-A tails to prime cDNA synthesis.
Furthermore, as <2% of the genome encodes for protein, traditional gene expression profiling in
fact captures only a small fraction of the transcriptome and variation in expression as most RNA
molecules that are transcribed are not translated into protein but serve other functional roles and
non-coding RNAs are the most abundant transcript species in the genome.
This background information is provided for the purpose of making known information believed
by the applicant to be of possible relevance to the present invention. No admission is necessarily
intended, nor should be construed, that any of the preceding information constitutes prior art
against the present invention.
SUMMARY OF THE INVENTION
An object of the present invention is to provide systems and methods for expression-based
discrimination of distinct clinical disease states in prostate cancer. In accordance with one aspect
of the present invention, there is provided a system for expression-based assessment of risk of
prostate cancer recurrence after prostatectomy, said system comprising one or more
polynucleotides, each of said polynucleotides capable of specifically hybridizing to a RNA
transcript of a gene selected from the group of genes set forth in Table 3 and/or 6 .
In accordance with another aspect of the present invention, there is provided a nucleic acid array
for expression-based assessment of prostate cancer recurrence risk, said array comprising at least
ten probes immobilized on a solid support, each of said probes being between about 15 and about
500 nucleotides in length, each of said probes being derived from a sequence corresponding to,
or complementary to, a transcript of a gene selected from the group of genes set forth in Table 3
and/or 6, or a portion of said transcript.
In accordance with another aspect of the present invention, there is provided a method for
expression-based assessment of prostate cancer recurrence, said method comprising: (a)
determining the expression level of one or more transcripts of one or more genes in a test sample
obtained from said subject to provide an expression pattern profile, said one or more genes
selected from the group of genes set forth in Table 3 and/or 6, and (c) comparing said expression
pattern profile with a reference expression pattern profile.
In accordance with another aspect of the present invention, there is provided a kit for
characterizing the expression of one or more nucleic acid sequences depicted in SEQ ID NOs: 1-
2 114 comprising one or more nucleic acids selected from (a) a nucleic acid depicted in any of
SEQ ID NOs: 1-21 14; (b) an RNA form of any of the nucleic acids depicted in SEQ ID NOs: 1-
2 114; (c) a peptide nucleic acid form of any of the nucleic acids depicted in SEQ ID NOs: 1-
2 114; (d) a nucleic acid comprising at least 20 consecutive bases of any of (a-c); (e) a nucleic
acid comprising at least 25 consecutive bases having at least 90% sequence identity to any of (a-
c); or (f) a complement to any of (a-e); and optionally instructions for correlating the expression
level of said one or more nucleic acid sequences with the disease state of prostate cancer tissue.
In accordance with another aspect of the present invention, there is provided an array of probe
nucleic acids certified for use in expression-based assessment of prostate cancer recurrence risk,
wherein said array comprises at least two different probe nucleic acids that specifically hybridize
to corresponding different target nucleic acids depicted in one of SEQ ID NOs: 1-21 14, an RNA
form thereof, or a complement to either thereof.
In accordance with another aspect of the present invention, there is provided a device for
classifying a biological sample from a prostate cancer as recurrent or non-recurrent, the device
comprising means for measuring the expression level of one or more transcripts of one or more
genes selected from the group of genes set forth in Table 3 and/or 6; means for correlating the
expression level with a classification of prostate cancer status; and means for outputting the
prostate cancer status.
In accordance with another aspect of the present invention, there is provided a computer-readable
medium comprising one or more digitally-encoded expression pattern profiles representative of
the level of expression of one or more transcripts of one or more genes selected from the group
of genes set forth in Table 3 and/or 6, each of said one or more expression pattern profiles being
associated with a value wherein each of said values is correlated with the presence of recurrent or
non-recurrent prostate cancer.
BRIEF DESCRIPTION OF THE DRAWINGS
These and other features of the invention will become more apparent in the following detailed
description in which reference is made to the appended drawings.
Figure 1. A) Principle components analysis (PCA) of 2,1 14 RNAs identified to be differentially
expressed between tumors from patients with differing clinical outcome (see Table 2 for
comparisons evaluated), PCA plot of 22 prostate cancer tumors shows tight clustering of samples
by clinical outcome of patients (circles, NED; diamonds, PSA; squares, SYS). B) Two-way
hierarchical clustering dendrogram and expression matrix of 526 target sequences (Table 4)
RNAs filtered using linear regression (p<0.01) to identify RNAs that followed either
SYS>PSA>NED or NED>PSA>SYS trend in differential expression. C) Two-way hierarchical
clustering dendrogram and expression matrix of 148 target sequences (Table 5), a subset of the
most differentially expressed transcripts between patients with clinically significant 'recurrent'
(i.e., 'SYS') and 'non-recurrent' (i.e., 'PSA' and 'NED') disease as filtered using a t-test
(p<0.001). For B) and C), sample and RNAs were optimally ordered using Pearson's correlation
distance metric with complete-linkage cluster distances and the expression of each RNA in each
sample was normalized in the heatmap by the number of standard deviations above (blacker) and
below (whiter) the median expression value (grey) across all samples.
Figure 2 . Histograms showing distribution patient's tumor expression levels of a 'metagene'
generated from a linear combination of the 526 RNAs for each clinical group. The histograms
bin samples with similar metagene expression values and significantly separate three modes of
patient metagene scores (ANOVA, p<().()00001) corresponding to the three clinical status groups
evaluated.
Figure 3 . Scatter plots summarizing the mean (± standard deviation) of metagene expression
values for tumor samples from patients in the three clinical status groups (NED; PSA; SYS).
Metagenes were generated from a linear combinations of 6 (A), 18 (♦) or 20 (■) RNAs and
demonstrate highly significant differential expression between clinical groups (ANOVA,
p<().()()0001).
Figure 4 . Box plots showing interquartile range and distribution of 'POP' scores for each
clinical group using an 18-target sequence metagene (Table 7) to derive patient outcome
predictor scores scaled and normalized on a data range of 0-100 points. T-tests were used to
evaluate the statistical significance of differences in POP scores between NED and PSA ( ) as
well as between PSA and SYS ( ) clinical groups (p<7 X l O 7 and p<l X 10 6, respectively).
Figure 5 . Box plots showing interquartile range and distribution of 'POP' scores for each
clinical group using a 10-target sequence metagene (Table 9) to derive patient outcome predictor
scores scaled and normalized on a data range of 0-100 points. T-tests were used to evaluate the
statistical significance of differences in POP scores between 'recurrent' (i.e., 'SYS') and non-
recurrent (i.e., 'PSA' and 'NED') patient groups ( , p<4 X 10 KK
Figure 6 . Box plots showing interquartile range and distribution of 'POP' scores for each
clinical group using a 4 1-target sequence metagene (Table 10) to derive patient outcome
predictor scores scaled and normalized on a data range of 0-100 points. T-tests were used to
evaluate the statistical significance of differences in POP scores between 'recurrent' (i.e., 'SYS')
and non-recurrent (i.e., 'PSA' and 'NED') patient groups ( , p<2 X 10 ").
Figure 7 . Box plots showing interquartile range and distribution of 'POP' scores for each
clinical group using a 148-target sequence metagene to derive patient outcome predictor scores
scaled and normalized on a data range of 0-100 points. T-tests were used to evaluate the
statistical significance of differences in POP scores between 'recurrent' (i.e., 'SYS') and non
recurrent (i.e., 'PSA' and 'NED') patient groups ( , p<9 X 10 12 ) .
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides a system and method for assessing prostate cancer recurrence risk
by distinguishing clinically distinct disease states in men with prostate cancer at the time of
initial diagnosis or surgery. The system and methods are based on the identification of gene
transcripts following a retrospective analysis of tumor samples that are differentially expressed in
prostate cancer in a manner dependent on prostate cancer aggressiveness as indicated by long-
term post-prostatectomy clinical outcome. These gene transcripts can be considered as a library
which can be used as a resource for the identification of sets of specific target sequences
("prostate cancer prognostic sets"), which may represent the entire library of gene transcripts or a
subset of the library and the detection of which is indicative of prostate cancer recurrence risk.
The invention further provides for probes capable of detecting these target sequences and
primers that are capable of amplifying the target sequences.
In accordance with one embodiment of the invention, the system and method for assessing
prostate cancer recurrence risk are prognostic for a post surgery clinical outcome selected from
no evidence of disease ('NED'), biochemical relapse (two successive increases in prostate-
specific antigen levels; ('PSA') and systemic prostate cancer systemic metastases ('SYS').
In accordance with one embodiment of the invention, the target sequences comprised by the
prostate cancer prognostic set are sequences based on or derived from the gene transcripts from
the library, or a subset thereof. Such sequences are occasionally referred to herein as "probe
selection regions" or "PSRs." In another embodiment of the invention, the target sequences
comprised by the prostate classification set are sequences based on the gene transcripts from the
library, or a subset thereof, and include both coding and non-coding sequences.
In one embodiment, the systems and methods provide for the molecular analysis of the
expression levels of one or more of the target sequences as set forth in SEQ ID NOs: 1-21 14
(Table 4). Increased relative expression of one or more target sequences in a 'NED' Group
corresponding to the sequences as set forth in SEQ ID NOs: 1-913 is indicative of or predictive
of a non-recurrent form of prostate cancer and can be correlated with an increased likelihood of a
long-term NED prognosis or low risk of prostate cancer recurrence. Increased relative
expression of one or more target sequences in a 'SYS' Group corresponding to the sequences as
set forth in SEQ ID NOs: 914-21 14 is indicative of or predictive of an aggressive form of
prostate cancer and can be correlated with an increased likelihood of a long-term SYS prognosis
or high risk of prostate cancer recurrence. Optionally, intermediate relative levels of one or more
target sequences in a 'PSA' Group corresponding to target sequences set forth in Table 7 is
indicative of or predictive of biochemical recurrence. Subsets and combinations of these target
sequences or probes complementary thereto may be used as described herein.
Before the present invention is described in further detail, it is to be understood that this
invention is not limited to the particular methodology, compositions, articles or machines
described, as such methods, compositions, articles or machines can, of course, vary. It is also to
be understood that the terminology used herein is for the purpose of describing particular
embodiments only, and is not intended to limit the scope of the present invention.
DEFINITIONS
Unless defined otherwise or the context clearly dictates otherwise, all technical and scientific
terms used herein have the same meaning as commonly understood by one of ordinary skill in
the art to which this invention belongs. In describing the present invention, the following terms
will be employed, and are intended to be defined as indicated below.
The term "polynucleotide" as used herein refers to a polymer of greater than one nucleotide in
length of ribonucleic acid (RNA), deoxyribonucleic acid (DNA), hybrid RNA/DNA, modified
RNA or DNA, or RNA or DNA mimetics, including peptide nucleic acids (PNAs). The
polynucleotides may be single- or double- stranded. The term includes polynucleotides composed
of naturally-occurring nucleobases, sugars and covalent internucleoside (backbone) linkages as
well as polynucleotides having non-naturally-occurring portions which function similarly. Such
modified or substituted polynucleotides are well-known in the art and for the purposes of the
present invention, are referred to as "analogues."
"Complementary" or "substantially complementary" refers to the ability to hybridize or base pair
between nucleotides or nucleic acids, such as, for instance, between a sensor peptide nucleic acid
or polynucleotide and a target polynucleotide. Complementary nucleotides are, generally, A and
T (or A and U), or C and G . Two single-stranded polynucleotides or PNAs are said to be
substantially complementary when the bases of one strand, optimally aligned and compared and
with appropriate insertions or deletions, pair with at least about 80% of the bases of the other
strand, usually at least about 90% to 95%, and more preferably from about 98 to 100%.
Alternatively, substantial complementarity exists when a polynucleotide will hybridize under
selective hybridization conditions to its complement. Typically, selective hybridization will
occur when there is at least about 65% complementarity over a stretch of at least 14 to 25 bases,
for example at least about 75%, or at least about 90% complementarity. See, M . Kanehisa
Nucleic Acids Res. 12:203 ( 1984).
"Preferential binding" or "preferential hybridization" refers to the increased propensity of one
polynucleotide to bind to its complement in a sample as compared to a noncomplementary
polymer in the sample.
Hybridization conditions will typically include salt concentrations of less than about IM, more
usually less than about 500 mM, for example less than about 200 mM. In the case of
hybridization between a peptide nucleic acid and a polynucleotide, the hybridization can be done
in solutions containing little or no salt. Hybridization temperatures can be as low as 50C, but are
typically greater than 220C, and more typically greater than about 3O0C, for example in excess of
about 370C . Longer fragments may require higher hybridization temperatures for specific
hybridization as is known in the art. Other factors may affect the stringency of hybridization,
including base composition and length of the complementary strands, presence of organic
solvents and extent of base mismatching, and the combination of parameters used is more
important than the absolute measure of any one alone. Other hybridization conditions which
may be controlled include buffer type and concentration, solution pH, presence and
concentration of blocking reagents to decrease background binding such as repeat sequences or
blocking protein solutions, detergent type(s) and concentrations, molecules such as polymers
which increase the relative concentration of the polynucleotides, metal ion(s) and their
concentration(s), chelator(s) and their concentrations, and other conditions known in the art.
"Multiplexing" herein refers to an assay or other analytical method in which multiple analytes
can be assayed simultaneously.
A "target sequence" as used herein (also occasionally referred to as a "PSR" or "probe selection
region") refers to a region of the genome against which one or more probes can be designed. As
used herein, a probe is any polynucleotide capable of selectively hybridizing to a target sequence
or its complement, or to an RNA version of either. A probe may comprise ribonucleotides,
deoxyribonucleotides, peptide nucleic acids, and combinations thereof. A probe may optionally
comprise one or more labels. In some embodiments, a probe may be used to amplify one or both
strands of a target sequence or an RNA form thereof, acting as a sole primer in an amplification
reaction or as a member of a set of primers.
"Having" is an open ended phrase like "comprising" and "including," and includes
circumstances where additional elements are included and circumstances where they are not.
"Optional" or "optionally" means that the subsequently described event or circumstance may or
may not occur, and that the description includes instances where the event or circumstance
occurs and instances in which it does not.
As used herein 'NED' describes a clinically distinct disease state in which patients show no
evidence of disease ('NED') at least 5 years after surgery, 'PSA' describes a clinically distinct
disease state in which patients show biochemical relapse only (two successive increases in
prostate-specific antigen levels but no other symptoms of disease with at least 5 years follow up
after surgery; 'PSA') and 'SYS' describes a clinically distinct disease state in which patients
develop biochemical relapse and present with systemic prostate cancer disease or metastases
('SYS') within five years after the initial treatment with radical prostatectomy.
As used herein, the term "about" refers to approximately a +/-10% variation from a given value.
It is to be understood that such a variation is always included in any given value provided herein,
whether or not it is specifically referred to.
Use of the singular forms "a," "an," and "the" include plural references unless the context clearly
dictates otherwise. Thus, for example, reference to "a polynucleotide" includes a plurality of
polynucleotides, reference to "a target" includes a plurality of such targets, reference to "a
normalization method" includes a plurality of such methods, and the like. Additionally, use of
specific plural references, such as "two," "three," etc., read on larger numbers of the same
subject, unless the context clearly dictates otherwise.
Terms such as "connected," "attached," "linked" and "conjugated" are used interchangeably
herein and encompass direct as well as indirect connection, attachment, linkage or conjugation
unless the context clearly dictates otherwise.
Where a range of values is recited, it is to be understood that each intervening integer value, and
each fraction thereof, between the recited upper and lower limits of that range is also specifically
disclosed, along with each subrange between such values. The upper and lower limits of any
range can independently be included in or excluded from the range, and each range where either,
neither or both limits are included is also encompassed within the invention. Where a value
being discussed has inherent limits, for example where a component can be present at a
concentration of from 0 to 100%, or where the pH of an aqueous solution can range from 1 to 14,
those inherent limits are specifically disclosed. Where a value is explicitly recited, it is to be
understood that values which are about the same quantity or amount as the recited value are also
within the scope of the invention, as are ranges based thereon. Where a combination is
disclosed, each subcombination of the elements of that combination is also specifically disclosed
and is within the scope of the invention. Conversely, where different elements or groups of
elements are disclosed, combinations thereof are also disclosed. Where any element of an
5 invention is disclosed as having a plurality of alternatives, examples of that invention in which
each alternative is excluded singly or in any combination with the other alternatives are also
hereby disclosed; more than one element of an invention can have such exclusions, and all
combinations of elements having such exclusions are hereby disclosed.
io PROSTATE CANCER PROGNOSTIC SYSTEM
The system of the present invention is based on the identification of a library of gene and RNA
transcripts that are differentially expressed in prostate cancer in a manner dependent on prostate
cancer aggressiveness as indicated by the post-prostatectomy clinical outcome of the patient. For
15 example, relative over expression of one or more of the gene transcripts in a prostate cancer
sample compared to a reference sample or expression profile or signature there from may be
prognostic of a clinically distinct disease outcome post-prostatectomy selected from no evidence
of disease ('NED'), biochemical relapse ('PSA') and prostate cancer disease systemic recurrence
or metastases ('SYS') . The reference sample can be, for example, from prostate cancer
0 sample(s) of one or more references subject(s) with a known post-prostatectomy clinical
outcomes. The reference expression profile or signature may optionally be normalized to one or
more appropriate reference gene transcripts. Alternatively or in addition to, expression of one or
more of the gene transcripts in a prostate cancer sample may be compared to an expression
profile or signature from normal prostate tissue.
5
Expression profiles or signatures from prostate cancer samples may be normalized to one or
more house keeping gene transcripts such that normalized over and/or under expression of one or
more of the gene transcripts in a sample may be indicative of a clinically distinct disease state or
prognosis.
0
PROSTATE PROGNOSTIC LIBRARY
The Prostate Prognostic Library in accordance with the present invention comprises one or more
gene or RNA transcripts whose relative and/or normalized expression is indicative of prostate
cancer recurrence and which may be prognostic for post-prostatectomy clinical outcome of a
patient. Exemplary RNA transcripts that showed differential expression in prostate cancer
samples from patients with clinically distinct disease outcomes after initial treatment with radical
prostatectomy are shown in Table 3 . In one embodiment of the invention, the library comprises
one or more of the gene transcripts of the genes listed in Table 3 .
In one embodiment, the library comprises at least one transcript from at least one gene selected
from those listed in Table 3 . In one embodiment, the library comprises at least one transcript
from each of at least 5 genes selected from those listed in Table 3 . In another embodiment, the
library comprises at least one transcript from each of at least 10 genes selected from those listed
in Table 3 . In a further embodiment, the library comprises at least one transcript from each of at
least 15 genes selected from those listed in Table 1. In other embodiments, the library comprises
at least one transcript from each of at least 20, at least 25, at least 30, at least 35, at least 40, at
least 45, at least 50, at least 55, at least 60 and at least 65 genes selected from those listed in
Table 3 . In a further embodiment, the library comprises at least one transcript from all of the
genes listed in Table 3 . In a further embodiment, the library comprises at all transcripts from all
of the genes listed in Table 3 .
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of [NMJ)0 10047221 ; [NMJ)0 10055221; [NMJ)OlO 1367 11;
[NMJ)0 10335 171; [NM_ 1830491; [NM_2 125591; 5'-3' exoribonuclease 1; A kinase (PRKA)
anchor protein (yotiao) 9; AarF domain containing kinase 4; Abhydrolase domain containing 3;
Aconitase 1, soluble; Actinin, alpha 1; ADAM metallopeptidase domain 19 (meltrin beta);
Adaptor-related protein complex 1, gamma 2 subunit; Adenosine deaminase, RNA-specific, B2
(RED2 homolog rat); Adenylate cyclase 3; ADP-ribosylation factor GTPase activating protein 3;
ADP-ribosylation factor guanine nucleotide-exchange factor 2 (brefeldin A-inhibited); ADP-
ribosylation factor-like 4D; Adrenergic, beta, receptor kinase 2; AF4/FMR2 family, member 3;
Amyloid beta (A4) precursor protein-binding, family B, member 1 (Fe65); Anaphase promoting
complex subunit 1; Ankyrin 3, node of Ranvier (ankyrin G); Ankyrin repeat domain 15; Ankyrin
repeat domain 28; Annexin Al; Annexin A2; Anterior pharynx defective 1 homolog B (C.
elegans);Anthrax toxin receptor 1; Antizyme inhibitor 1; Arachidonate 12-lipoxygenase, 12R
type; Arginine vasopressin receptor IA; Arginine-glutamic acid dipeptide (RE) repeats; ARP3
actin-related protein 3 homolog (yeast); Arrestin 3, retinal (X-arrestin); Arrestin domain
containing 1; Aryl hydrocarbon receptor interacting protein-like 1; Aryl hydrocarbon receptor
nuclear translocator; Ataxin 1; ATM/ATR-Substrate Chk2-Interacting Zn2+-finger protein;
ATPase, Class I, type 8B, member 1; ATPase, Na+/K+ transporting, alpha 1 polypeptide; ATP-
binding cassette, sub-family F (GCN20), member 1; Autism susceptibility candidate 2;
Baculoviral IAP repeat-containing 6 (apollon); Basonuclin 2; Brain-specific angiogenesis
inhibitor 3; Bromodomain containing 7 ; Bromodomain containing 8; Bromodomain PHD finger
transcription factor; BTB (POZ) domain containing 16; BTB (POZ) domain containing 7 ;
Calcium activated nucleotidase 1; Calcium binding protein P22; Calcium channel, voltage-
dependent, beta 4 subunit; Calcium channel, voltage-dependent, L type, alpha 1C subunit;
Calcium channel, voltage-dependent, L type, alpha ID subunit; Calcyclin binding protein;
Calmodulin 1 (phosphorylase kinase, delta); Calsyntenin 1; Carbonyl reductase 3; Cardiolipin
synthase 1; Carnitine palmitoyltransferase IA (liver); Casein kinase 1, delta; Casein kinase 1,
gamma 1; Casein kinase 1, gamma 3; Caspase 1, apoptosis-related cysteine peptidase
(interleukin 1, beta, convertase); CD 109 molecule; CD99 molecule-like 2; CDK5 regulatory
subunit associated protein 2; CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 2;
Cell adhesion molecule 1; Cell division cycle and apoptosis regulator 1; Centrosomal protein
7OkDa; Chloride channel 3; Chromodomain helicase DNA binding protein 2; Chromodomain
helicase DNA binding protein 6; Chromodomain protein, Y-like 2; Chromosome 1 ORF 116;
Chromosome 1 ORF 52; Chromosome 10 ORF 118; Chromosome 12 ORF 30; Chromosome 13
ORF 23; Chromosome 16 ORF 45; Chromosome 18 ORF 1; Chromosome 18 ORF 1;
Chromosome 18 ORF 1; Chromosome 18 ORF 1; Chromosome 18 ORF 17; Chromosome 2
ORF 3; Chromosome 20 ORF 133; Chromosome 2 1 ORF 25; Chromosome 2 1 ORF 34;
Chromosome 22 ORF 13; Chromosome 3 ORF 26; Chromosome 5 ORF 3; Chromosome 5 ORF
33; Chromosome 5 ORF 35; Chromosome 5 ORF 39; Chromosome 7 ORF 13; Chromosome 7
ORF 42; Chromosome 9 ORF 3; Chromosome 9 ORF 94; Chromosome Y ORF 15B; Chymase
1, mast cell; Citrate lyase beta like; Class II, major histocompatibility complex, transactivator; C-
Maf-inducing protein; Coatomer protein complex, subunit alpha; Cofilin 2 (muscle); Coiled-coil
domain containing 50; Coiled-coil domain containing 7 ; Coiled-coil-helix-coiled-coil-helix
domain containing 4 ; Cold shock domain containing El, RNA-binding; Collagen, type XII,
alpha 1; Complement component 1, r subcomponent-like; Core-binding factor, runt domain,
alpha subunit 2; translocated to, 2; CREB binding protein (Rubinstein-Taybi syndrome); CTD
(carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase 2; CTD
(carboxy-terminal domain, RNA polymerase II, polypeptide A) small phosphatase-like; CUG
triplet repeat, RNA binding protein 2; Cullin 3; Cut-like 2; Cyclin F; Cyclin Y; Cysteine-rich
with EGF-like domains 1; Cytochrome P450, family 4, subfamily F, polypeptide 11;
Cytoplasmic FMRl interacting protein 2; DAZ interacting protein 1-like; DCP2 decapping
enzyme homolog (S. cerevisiae); DEAD box polypeptide 47; DEAD box polypeptide 5; DEAD
box polypeptide 52; DEAD box polypeptide 56; Death inducer-obliterator 1; Dedicator of
cytokinesis 2; DEP domain containing IB; DEP domain containing 2; DEP domain containing 6;
Development and differentiation enhancing factor 1; Diacylglycerol lipase, alpha; Diaphanous
homolog 2 (Drosophila); Dickkopf homolog 3; Dihydropyrimidine dehydrogenase; Dipeptidyl-
peptidase 10; Discs, large homolog 2, chapsyn-1 10; Dishevelled, dsh homolog 2; DnaJ (Hsp40)
homolog, subfamily C, member 6; Dpy-19-like 3; Dual specificity phosphatase 5; Ectodysplasin
A receptor; Ectonucleoside triphosphate diphosphohydrolase 7 ; EGFR-coamplified and
overexpressed protein; ELL associated factor 1; Emopamil binding protein (sterol isomerase);
Enabled homolog; Ephrin-A5; ER lipid raft associated 1; Erythroblast membrane-associated
protein (Scianna blood group); Erythrocyte membrane protein band 4 .1 like 4A; Etoposide
induced 2.4 mRNA; Eukaryotic translation initiation factor 4E family member 3; FADl flavin
adenine dinucleotide synthetase homolog; Family with sequence similarity 110, member A;
Family with sequence similarity 114, member A l ; Family with sequence similarity 135, member
A; Family with sequence similarity 40, member A; Family with sequence similarity 80, member
B; F-box and leucine-rich repeat protein 11; F-box and leucine-rich repeat protein 7 ; F-box
protein 2; Ferritin, heavy polypeptide 1; Fibronectin type III domain containing 3A; Fibronectin
type III domain containing 3B; Fibulin 1; FLJ25476 protein; FLJ41603 protein; Forkhead box
J3; Forkhead box J3; Forkhead box Kl; Forkhead box P l ; Frizzled homolog 3; Frizzled homolog
5; G protein-coupled receptor kinase interactor 2; GABA A receptor, delta; GATA binding
protein 2; GDNF family receptor alpha 2; Gelsolin (amyloidosis, Finnish type); Genethonin 1;
Glucose phosphate isomerase; Glucose-fructose oxidoreductase domain containing 1;
Glucosidase, beta (bile acid) 2; Glutamate dehydrogenase 1; Glutaminase; Glutamyl
aminopeptidase (aminopeptidase A); Glutathione reductase
Glycogen synthase kinase 3 beta; Grainyhead-like 2; Gremlin 1, cysteine knot superfamily,
homolog; GTPase activating protein (SH3 domain) binding protein 1; Hairy/enhancer-of-split
related with YRPW motif 2; Heparan sulfate 6-O-sulfotransferase 3; Hermansky-Pudlak
syndrome 5; Heterogeneous nuclear ribonucleoprotein C (C1/C2)
Hippocalcin-like 1; Histocompatibility (minor) 13; Histone cluster 1, H3d; Histone deacetylase
6; Homeobox A l ; Homeobox and leucine zipper encoding; Host cell factor C l (VP16-accessory
protein); Hyaluronan binding protein 4 ; Hyperpolarization activated cyclic nucleotide-gated
potassium channel 3; Hypothetical gene supported by AK128346; Hypothetical LOC5 1149;
Hypothetical protein FLJ 12949; Hypothetical protein FLJ20035; Hypothetical protein FLJ20309;
Hypothetical protein FLJ38482; Hypothetical protein HSPC 148; Hypothetical protein
LOC 130576; Hypothetical protein LOC285908; Hypothetical protein LOC643 155; Iduronidase,
alpha-L- IKAROS family zinc finger 1 (Ikaros); HvB (bacterial acetolactate synthase)-like;
Inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase beta; Inositol
polyphosphate-4-phosphatase, type II; Integrin, alpha 4 (antigen CD49D, alpha 4 subunit of
VLA-4 receptor); Integrin, alpha 6; Integrin, alpha 9; Integrin, alpha V (vitronectin receptor,
alpha polypeptide, antigen CD51); Integrin, beta-like 1 (with EGF-like repeat domains); Inter-
alpha (globulin) inhibitor H3; Interleukin enhancer binding factor 3, 9OkDa; Intestine- specific
homeobox
Intraflagellar transport 172 homolog; Janus kinase 1; Jumonji domain containing IB; Jumonji
domain containing 2B; Jumonji domain containing 2C; Kalirin, RhoGEF kinase; Kallikrein-
related peptidase 2; Karyopherin alpha 3 (importin alpha 4); Keratinocyte associated protein 2;
KIAAO 152; KIAA0241 ; KIAA03 19-like; KIAA0495; KIAA0562; KIAA0564 protein;
KIAAl 217; KIAA 1244; KIAA 1244; La ribonucleoprotein domain family, member 1; Lamin
A/C; LATS, large tumor suppressor, homolog 2; Leiomodin 3 (fetal); Leptin receptor
overlapping transcript-like 1; Leucine rich repeat containing 16; Leucine-rich repeat kinase 1;
Leucine-rich repeat-containing G protein-coupled receptor 4 ; LIM domain 7; Major
histocompatibility complex, class II, DR beta 1; Malignant fibrous histiocytoma amplified
sequence 1; Maltase-glucoamylase (alpha-glucosidase); Mannosidase, alpha, class 2A, member
1; Mannosyl (alpha- 1,6-)-glycoprotein beta-l,6-N-acetyl-glucosaminyltransferase; MBD2-
interacting zinc finger; Melanin-concentrating hormone receptor 1; Methionyl-tRNA synthetase;
Methyl CpG binding protein 2; Methyl-CpG binding domain protein 5; Microcephaly, primary
autosomal recessive 1; Microseminoprotein, beta-; Microtubule-associated protein IB;
Microtubule-associated protein 2; Minichromosome maintenance complex component 3
associated protein; Mitochondrial ribosomal protein S 15; Mohawk homeobox; Monooxygenase,
DBH-like 1; MORN repeat containing 1; Muscle RAS oncogene homolog; Muscleblind-like;
Myelin protein zero-like 1; Myeloid/lymphoid or mixed-lineage leukemia (trithorax homolog,
Drosophila); translocated to, 4; Myocyte enhancer factor 2B; Myosin IF; Myosin, heavy chain 3,
skeletal muscle, embryonic; N-acetylgalactosaminidase, alpha-; N-acetylglucosamine-1-
phosphate transferase, alpha and beta subunits; Nascent polypeptide-associated complex alpha
subunit; NECAP endocytosis associated 2; Necdin homolog; Neural precursor cell expressed,
developmentally down-regulated 9; Neuregulin 1 ; Neuron navigator 1 ; Nibrin ; Nicotinamide
N-methyltransferase ; NIMA (never in mitosis gene a)-related kinase 6; NLR family, CARD
domain containing 5; NOLl/NOP2/Sun domain family, member 3; NOLl/NOP2/Sun domain
family, member 3; NOLl/NOP2/Sun domain family, member 6; Nuclear receptor coactivator 2;
Nuclear receptor coactivator 6;Nuclear receptor subfamily 2, group F, member 2; Nuclear
receptor subfamily 3, group C, member 2; Nuclear receptor subfamily 4, group A, member 2;
Nuclear transcription factor, X-box binding-like 1; Nucleolar and coiled-body phosphoprotein 1;
Overexpressed in colon carcinoma- 1; PAN3 polyA specific ribonuclease subunit homolog; PAP
associated domain containing 1; Paraoxonase 2; Paraspeckle component 1 ; PCTAIRE protein
kinase 2; Peptidase D; Pericentrin (kendrin); Peroxisomal biogenesis factor 19; PHD finger
protein 8; Phosphatidic acid phosphatase type 2 domain containing 3; Phosphatidylinositol 4-
kinase, catalytic, alpha polypeptide; Phosphatidylinositol glycan anchor biosynthesis, class O;
Phosphatidylinositol transfer protein, beta; Phosphodiesterase 4D, cAMP-specific;
Phosphoglucomutase 5; Phosphoglycerate mutase family member 5; Phosphoinositide-3-kinase,
class 2, beta polypeptide; Phospholipase A2, group IVB (cytosolic); Phospholipase C, beta 1
(phosphoinositide-specific); Phospholipase C, gamma 2 (phosphatidylinositol-
specific);Phosphorylase kinase, beta; Plasminogen activator, tissue; Platelet-activating factor
acetylhydrolase, isoform Ib, alpha subunit 45kDa; Pleckstrin homology domain containing,
family A (phosphoinositide binding specific) member 3; Pleckstrin homology domain
containing, family A member 7 ; Pleckstrin homology domain containing, family G (with RhoGef
domain) member 3; Pleckstrin homology domain containing, family H (with MyTH4 domain)
member 1; Poly (ADP-ribose) polymerase family, member 16; Poly (ADP-ribose) polymerase
family, member 2; PoIy(A) polymerase alpha; Poly(A)-specific ribonuclease (deadenylation
nuclease); Polymerase (DNA directed) nu; Polymerase (DNA directed), gamma 2, accessory
subunit; Polymerase (RNA) II (DNA directed) polypeptide L; Polymerase (RNA) III (DNA
directed) polypeptide E; Polymerase I and transcript release factor; Potassium channel
tetramerisation domain containing 1; Potassium channel tetramerisation domain containing 2;
Potassium channel tetramerisation domain containing 7 ; Potassium channel, subfamily K,
member 1;Presenilin 1; PRKR interacting protein 1; Procollagen-lysine, 2-oxoglutarate 5-
dioxygenase 2; ProSAPiPl protein; Prostaglandin E synthase 3 (cytosolic); Protease, serine, 2
(trypsin 2); Protein kinase, Y-linked; Protein phosphatase 1, regulatory (inhibitor) subunit 9A;
Protein phosphatase 3 (formerly 2B), catalytic subunit, alpha isoform; Protein phosphatase 4,
regulatory subunit 1-like; Protein tyrosine phosphatase, non-receptor type 18 (brain-derived);
Protein tyrosine phosphatase, non-receptor type 3; Protein tyrosine phosphatase, receptor type,
D; Protein-O-mannosyltransferase 1; Proteolipid protein 2 (colonic epithelium-enriched);
Protocadherin 7 ; Protocadherin gamma subfamily A, 1; PRP6 pre-mRNA processing factor 6
homolog; Putative homeodomain transcription factor 1; RAB GTPase activating protein 1-like;
RAB lO; RAB30; Rabaptin, RAB GTPase binding effector protein 1; RAD51-like 1; RALBPl
associated Eps domain containing 2; Rap guanine nucleotide exchange factor (GEF) 1;
Rapamycin-insensitive companion of mTOR; Ras and Rab interactor 2; Receptor accessory
protein 3; Reelin; Replication factor C (activator 1) 3; Replication protein A3; Rho GTPase
activating protein 18; Rho guanine nucleotide exchange factor (GEF) 10-like; Rhophilin, Rho
GTPase binding protein 1; Ribonuclease H2, subunit B; Ribonuclease P 14kDa subunit; Ring
finger protein 10; Ring finger protein 144; Ring finger protein 44; RNA binding motif protein
16; Roundabout, axon guidance receptor, homolog 1; Roundabout, axon guidance receptor,
homolog 2; RUN domain containing 2A; Scinderin; SEC23 interacting protein; Secό l alpha 2
subunit; Septin 11; Serine/threonine kinase 32A; Serine/threonine kinase 32C; SGTl, suppressor
of G2 allele of SKPl; SH3 and PX domains 2A; Signal peptide peptidase 3; Signal transducer
and activator of transcription 1, 9IkDa; Single- stranded DNA binding protein 2; Small nuclear
ribonucleoprotein polypeptide N; SNF8, ESCRT-II complex subunit, homolog; Sodium channel,
voltage- gated, type III, alpha subunit; Solute carrier family 1 (neutral amino acid transporter),
member 5; Solute carrier family 16, member 7 (monocarboxylic acid transporter 2); Solute
carrier family 2 (facilitated glucose transporter), member 11; Solute carrier family 2 (facilitated
glucose transporter), member 11; Solute carrier family 24 (sodium/potassium/calcium
exchanger), member 3; Solute carrier family 3 (activators of dibasic and neutral amino acid
transport), member 2; Solute carrier family 30 (zinc transporter), member 6; Solute carrier family
39 (zinc transporter), member 10; Solute carrier family 43, member 1; Solute carrier family 9
(sodium/hydrogen exchanger), member 3 regulator 2; SON DNA binding protein; Sortilin-
related VPSlO domain containing receptor 3; Sparc/osteonectin, cwcv and kazal-like domains
proteoglycan (testican) 2; Spectrin repeat containing, nuclear envelope 1; Sperm associated
antigen 9; Splicing factor 3a, subunit 2, 66kDa; Splicing factor 3b, subunit 1, 155kDa;
Staphylococcal nuclease and tudor domain containing 1; Staufen, RNA binding protein, homolog
1; Suppression of tumorigenicity 7; Suppressor of variegation 4-20 homolog 1; Synapsin III;
Syntaxin 3; Syntaxin 5; Tachykinin receptor 1; TAO kinase 3; TBCl domain family, member
16; TBCl domain family, member 19; Testis specific, 10; Tetraspanin 6; Tetratricopeptide repeat
domain 23; Thioredoxin-like 2; THUMP domain containing 3; TIMELESS interacting protein;
TOX high mobility group box family member 4; Trafficking protein, kinesin binding 1;
Transcription factor 7-like 1 (T-cell specific, HMG-box); Transcription factor 7-like 2 (T-cell
specific, HMG-box); Translocase of inner mitochondrial membrane 13 homolog; Translocated
promoter region (to activated MET oncogene); Translocation associated membrane protein 2;
Transmembrane 9 superfamily member 2; Transmembrane emp24 protein transport domain
containing 8; Transmembrane emp24-like trafficking protein 10; Transmembrane protein 134;
Transmembrane protein 18; Transmembrane protein 18; Transmembrane protein 29; Triadin;
Tribbles homolog 1; Trinucleotide repeat containing 6C; Tripartite motif-containing 36;
Tripartite motif-containing 6 1; TRNA methyltransferase 11 homolog; TruB pseudouridine (psi)
synthase homolog 2; Tubby like protein 4; Tuftelin 1; Tumor necrosis factor receptor
superfamily, member l ib (osteoprotegerin); Tumor necrosis factor receptor superfamily,
member 25; Tumor necrosis factor, alpha-induced protein 8; Tyrosine kinase 2; Ubiquinol-
cytochrome c reductase core protein I ; Ubiquitin specific peptidase 47; Ubiquitin specific
peptidase 5 (isopeptidase T); Ubiquitin specific peptidase 8; Ubiquitin-like 7 (bone marrow
stromal cell-derived); UBX domain containing 6; UDP-glucose ceramide glucosyltransferase-
like 2; UDP-N-acetyl-alpha-D-galactosamineipolypeptide N-acetylgalactosaminyltransferase 2
(GalNAc-T2); Unc-93 homolog B 1;UTP6, small subunit (SSU) processome component,
homolog; Vacuolar protein sorting 8 homolog; V-akt murine thymoma viral oncogene homolog
1; V-ets erythroblastosis virus E26 oncogene homolog; Viral DNA polymerase-transactivated
protein 6; WD repeat domain 33; WD repeat domain 90; Wingless-type MMTV integration site
family, member 2B; WW and C2 domain containing 1; Yipl domain family, member 3; YTH
domain family, member 3; Zinc finger and BTB domain containing 10; Zinc finger and BTB
domain containing 20; Zinc finger and SCAN domain containing 18; Zinc finger homeodomain
4 ; Zinc finger protein 14 homolog; Zinc finger protein 335; Zinc finger protein 394; Zinc finger
protein 407 ; Zinc finger protein 608; Zinc finger protein 667; Zinc finger protein 692; Zinc
finger protein 718; Zinc finger protein 72 1; Zinc finger, CCHC domain containing 9; Zinc finger,
matrin type 1; Zinc finger, MYND domain containing 11; Zinc finger, ZZ-type containing 3;
Zinc fingers and homeoboxes 2; and Zwilch, kinetochore associated, homolog.
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Replication factor C (activator 1) 3; Tripartite motif-containing 61;
Citrate lyase beta like; Ankyrin repeat domain 15; UDP-glucose ceramide glucosyltransferase-
like 2; Hypothetical protein FLJ 12949; Chromosome 22 open reading frame 13;
Phosphatidylinositol glycan anchor biosynthesis, class O; Solute carrier family 43, member 1;
Rabaptin, RAB GTPase binding effector protein 1; Zinc finger protein 14 homolog; Hypothetical
gene supported by AK 128346; Adenylate cyclase 3; Phosphatidylinositol transfer protein, beta;
Zinc finger protein 667; Gremlin 1, cysteine knot superfamily, homolog; Ankyrin 3, node of
Ranvier (ankyrin G) and Maltase-glucoamylase (alpha-glucosidase).
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Replication factor C (activator 1) 3; Ankyrin repeat domain 15;
Hypothetical protein FLJ 12949; Solute carrier family 43, member 1; Thioredoxin-like 2;
Polymerase (RNA) II (DNA directed) polypeptide L; Syntaxin 5; Leucine rich repeat containing
16; Calcium channel, voltage-dependent, beta 4 subunit; [NM_00 1005522]; G protein-coupled
receptor kinase interactor 2; Ankyrin 3, node of Ranvier (ankyrin G); Gremlin 1, cysteine knot
superfamily, homolog; Zinc finger protein 667; Hypothetical gene supported by AKl 28346;
Transmembrane 9 superfamily member 2; Potassium channel, subfamily K, member 1;
Chromodomain helicase DNA binding protein 2; Microcephaly, primary autosomal recessive 1;
Chromosome 2 1 open reading frame 34 and Dual specificity phosphatase 5 .
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Replication factor C (activator 1) 3; Tripartite motif-containing 61;
Citrate lyase beta like; Ankyrin repeat domain 15; Ankyrin 3, node of Ranvier (ankyrin G) and
Maltase-glucoamylase (alpha-glucosidase).
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Replication factor C (activator 1) 3; Ankyrin repeat domain 15;
Hypothetical protein FLJ 12949; Solute carrier family 43, member 1; Thioredoxin-like 2;
Polymerase (RNA) II (DNA directed) polypeptide L; Syntaxin 5; Leucine rich repeat containing
16; Calcium channel, voltage-dependent, beta 4 subunit; [NM_00 1005522]; G protein-coupled
receptor kinase interactor 2; Ankyrin 3, node of Ranvier (ankyrin G); Gremlin 1, cysteine knot
superfamily, homolog; Zinc finger protein 667; Hypothetical gene supported by AK 128346;
Transmembrane 9 superfamily member 2; Potassium channel, subfamily K, member 1;
Chromodomain helicase DNA binding protein 2; Chromosome 9 open reading frame 94;
Chromosome 2 1 open reading frame 34; and Dual specificity phosphatase 5 .
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Citrate lyase beta like; Phosphodiesterase 4D, cAMP-specific;
Ectodysplasin A receptor; DEP domain containing 6; Basonuclin 2; Chromosome 2 open reading
frame 3; FLJ25476 protein; Staphylococcal nuclease and tudor domain containing 1;
Hermansky-Pudlak syndrome 5 and Chromosome 12 open reading frame 30.
In one embodiment, the library comprises at least one transcript from at least one gene selected
from the group consisting of Replication factor C (activator 1) 3; Tripartite motif-containing 61;
Citrate lyase beta like; Adaptor-related protein complex 1, gamma 2 subunit; Kallikrein-related
peptidase 2; Phosphodiesterase 4D, cAMP-specific; Cytochrome P450, family 4, subfamily F,
polypeptide 11; Ectodysplasin A receptor
Phospholipase C, beta 1; KIAA 1244; Paraoxonase 2; Arachidonate 12-lipoxygenase, 12R type;
Cut-like 2; Chemokine (C-X-C motif) ligand 12; Rho guanine nucleotide exchange factor (GEF)
5; Olfactory receptor, family 2, subfamily A, member 4; Chromosome 19 open reading frame 42;
Hypothetical gene supported by AK128346; Phosphoglucomutase 5; Hyaluronan binding protein
4 ; NECAP endocytosis associated 2
Myeloid/lymphoid or mixed-lineage leukemia; translocated to, 4 ; Signal transducer and activator
of transcription 1; Chromosome 2 open reading frame 3; FLJ25476 protein; Staphylococcal
nuclease and tudor domain containing 1; Transmembrane protein 18; Hermansky-Pudlak
syndrome 5; Chromosome 12 open reading frame 30; Splicing factor 3b, subunit 1; Cofilin 2;
Heparan sulfate 6-O-sulfotransferase 3; Enabled homolog; Mannosyl (alpha- 1,6-)-glycoprotein
beta- 1,6-N-acetyl-glucosaminyltransferase; Solute carrier family 24 (sodium/potassium/calcium
exchanger), member 3; Inositol 1,4,5-triphosphate receptor, type 1; CAP-GLY domain
containing linker protein; Transglutaminase 4 ; MOCO sulphurase C-terminal domain containing
2; 4-hydroxyphenylpyruvate dioxygenase-like; and R3H domain containing 1.
The invention also contemplates that alternative libraries may be designed that include
transcripts of one or more of the genes in Table 3, together with additional gene transcripts that
have previously been shown to be associated with prostate cancer systemic progression. As is
known in the art, the publication and sequence databases can be mined using a variety of search
strategies to identify appropriate additional candidates for inclusion in the library. For example,
currently available scientific and medical publication databases such as Medline, Current
Contents, OMIM (online Mendelian inheritance in man), various Biological and Chemical
Abstracts, Journal indexes, and the like can be searched using term or key-word searches, or by
author, title, or other relevant search parameters. Many such databases are publicly available, and
strategies and procedures for identifying publications and their contents, for example, genes,
other nucleotide sequences, descriptions, indications, expression pattern, etc, are well known to
those skilled in the art. Numerous databases are available through the internet for free or by
subscription, see, for example, the National Center Biotechnology Information (NCBI),
Infotrieve, Thomson ISI, and Science Magazine (published by the AAAS) websites. Additional
or alternative publication or citation databases are also available that provide identical or similar
types of information, any of which can be employed in the context of the invention. These
databases can be searched for publications describing altered gene expression between recurrent
and non-recurrent prostate cancer. Additional potential candidate genes may be identified by
searching the above described databases for differentially expressed proteins and by identifying
the nucleotide sequence encoding the differentially expressed proteins. A list of genes whose
altered expression is between patients with recurrent disease and non-recurrent prostate cancer is
presented in Table 6 .
PROSTATE CANCER PROGNOSTIC SETS
A Prostate Prognostic Set comprises one or more target sequences identified within the gene
transcripts in the prostate prognostic library, or a subset of these gene transcripts. The target
sequences may be within the coding and/or non-coding regions of the gene transcripts. The set
can comprise one or a plurality of target sequences from each gene transcript in the library, or
subset thereof. The relative and/or normalized level of these target sequences in a sample is
indicative of the level of expression of the particular gene transcript and thus of prostate cancer
recurrence risk. For example, the relative and/or normalized expression level of one or more of
the target sequences may be indicative of an recurrent form of prostate cancer and therefore
prognostic for prostate cancer systemic progression while the relative and/or normalized
expression level of one or more other target sequences may be indicative of a non-recurrent form
of prostate cancer and therefore prognostic for a NED clinical outcome.
Accordingly, one embodiment of the present invention provides for a library or catalog of
candidate target sequences derived from the transcripts (both coding and non-coding regions) of
at least one gene suitable for classifying prostate cancer recurrence risk. In a further embodiment,
the present invention provides for a library or catalog of candidate target sequences derived from
the non-coding regions of transcripts of at least one gene suitable for classifying prostate cancer
recurrence risk. In still a further embodiment, the library or catalog of candidate target
sequences comprises target sequences derived from the transcripts of one or more of the genes
set forth in Table 3 and/or Table 6 . The library or catalog in affect provides a resource list of
transcripts from which target sequences appropriate for inclusion in a Prostate Cancer Prognostic
set can be derived. In one embodiment, an individual Prostate Cancer Prognostic set may
comprise target sequences derived from the transcripts of one or more genes exhibiting a positive
correlation with recurrent prostate cancer. In one embodiment, an individual Prostate Cancer
Prognostic Set may comprise target sequences derived from the transcripts of one or more genes
exhibiting a negative correlation with recurrent prostate cancer. In one embodiment, an
individual Prostate Cancer Prognostic Set may comprise target sequences derived from the
transcripts of two or more genes, wherein at least one gene has a transcript that exhibits a
positive correlation with recurrent prostate cancer and at least one gene has a transcript that
exhibits negative correlation with recurrent prostate cancer.
In one embodiment, the Prostate Cancer Prognostic Set comprises target sequences derived from
the transcripts of at least one gene. In one embodiment, the Prostate Cancer Prognostic Set
comprises target sequences derived from the transcripts of at least 5 genes. In another
embodiment, the Prostate Cancer Prognostic set comprises target sequences derived from the
transcripts of at least 10 genes. In a further embodiment, the Prostate Cancer Prognostic set
comprises target sequences derived from the transcripts of at least 15 genes. In other
embodiments, the Prostate Cancer Prognostic set comprises target sequences derived from the
transcripts of at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at
least 55, at least 60 and at least 65 genes.
Following the identification of candidate gene transcripts, appropriate target sequences can be
identified by screening for target sequences that have been annotated to be associated with each
specific gene locus from a number of annotation sources including GenBank, RefSeq, Ensembl,
dbEST, GENSCAN, TWINSCAN, Exoniphy, Vega, microRNAs registry and others (see
Affymetrix Exon Array design note).
As part of the target sequence selection process, target sequences can be further evaluated for
potential cross-hybridization against other putative transcribed sequences in the design (but not
the entire genome) to identify only those target sequences that are predicted to uniquely
hybridize to a single target.
Optionally, the set of target sequences that are predicted to uniquely hybridize to a single target
can be further filtered using a variety of criteria including, for example, sequence length, for their
mean expression levels across a wide selection of human tissues, as being representive of
transcripts expressed either as novel alternative (i.e., non-consensus) exons, alternative retained
introns, novel exons 5' or 3' of the gene's transcriptional start site or representing transcripts
expressed in a manner antisense to the gene, amongst others.
In one embodiment, the Prostate Classification Set comprises target sequences derived from
382,253 base pair 3' of Replication factor C (activator 1) 3, 38kDa; 58,123 base pair 3' of
Tripartite motif-containing 6 1; in intron #3 of Citrate lyase beta like; in intron #2 of Ankyrin
repeat domain 15; in exon # 1 of UDP-glucose ceramide glucosyltransferase-like 2; in exon of
#19 of Hypothetical protein FLJ 12949; in intron #4 of Chromosome 22 open reading frame 13;
in exon #2 of phatidylinositol glycan anchor biosynthesis, class O; in exon #15 of Solute carrier
family 43, member 1; in exon # 1 of Rabaptin, RAB GTPase binding effector protein 1; in intron
#38 of Maltase-glucoamylase (alpha-glucosidase); in intron #23 of Ankyrin 3, node of Ranvier
(ankyrin G); 71,333 base pair 3' of Gremlin 1, cysteine knot superfamily, homolog; 1,561 base
pair of 3' Zinc finger protein 667; in exon #4 of Phosphatidylinositol transfer protein, beta;
in intron #18 of Adenylate cyclase 3; and in exon #2 of Hypothetical gene supported by
AK 128346.
In one embodiment, the Prostate Classification Set comprises target sequences derived from
382,253 base pair 3' of Replication factor C (activator 1) 3; in intron #2 of Ankyrin repeat
domain 15; in exon #19 of Hypothetical protein FLJ 12949; in exon #15 of Solute carrier family
43, member 1; 313,721 base pair 3' of Thioredoxin-like 2; in exon #2 of Polymerase
(RNA) II (DNA directed) polypeptide L, 7.6kDa; in intron #10 of Syntaxin 5; 141,389 base pail¬
' of Leucine rich repeat containing 16; in intron #2 of Calcium channel, voltage-dependent, beta
4 subunit; 5,474 base pair 5' of [NM_00 1005522]; in intron #14 of G protein-coupled receptor
kinase interactor 2; in intron #23 of Ankyrin 3, node of Ranvier (ankyrin G); 71,333 base pair 3'
of Gremlin 1, cysteine knot superfamily, homolog; 1,561 base pair of 3' of Zinc finger protein
667; in exon #2 of Hypothetical gene supported by AKl 28346; in intron # 11 of Transmembrane
9 superfamily member 2; in intron # 1 of Potassium channel, subfamily K, member 1; in intron #2
of Chromodomain helicase DNA binding protein 2; 22,184 base pair 5' of Microcephaly,
primary autosomal recessive 1; in intron #4 of Chromosome 2 1 open reading frame 34; and in
intron #2 of Dual specificity phosphatase 5 .
In one embodiment, the Prostate Classification Set comprises target sequences derived from
382,253 base pair 3' of Replication factor C (activator 1) 3, 38kDa; 58,123 base pair 3' of
Tripartite motif-containing 6 1; in intron #3 of Citrate lyase beta like; in intron #2 of Ankyrin
repeat domain 15; in intron #38 of Maltase-glucoamylase (alpha-glucosidase); and in intron #23
of Ankyrin 3, node of Ranvier (ankyrin G).
In one embodiment, the Prostate Classification Set comprises target sequences derived from
382,253 base pair 3' of Replication factor C (activator 1) 3, 38kDa; in intron #2 of Ankyrin
repeat domain 15; in exon #19 of Hypothetical protein FLJ12949; in exon #15 of Solute carrier
family 43, member 1; 3 13,721 base pair 3' of Thioredoxin-like 2; in exon #2 of Polymerase
(RNA) II (DNA directed) polypeptide L, 7.6kDa; in intron #10 of Syntaxin 5; 141,389 base pair
5' of Leucine rich repeat containing 16; in intron #2 of Calcium channel, voltage-dependent, beta
4 subunit; 5,474 base pair 5' of [NM_00 1005522]; in intron #14 of G protein-coupled receptor
kinase interactor 2; in intron #2 of Ankyrin 3, node of Ranvier (ankyrin G); 71,333 base pair of
3' Gremlin 1, cysteine knot superfamily; 1,561 base pair 3' of Zinc finger protein 667; in exon
#2 of Hypothetical gene supported by AKl 28346; in intron # 11 of Transmembrane 9
superfamily member 2; in intron # 1 of Potassium channel, subfamily K, member 1; in intron #2
of Chromodomain helicase DNA binding protein 2; in exon #8 of Chromosome 9 open reading
frame 94; in intron #4 of Chromosome 2 1 open reading frame 34; and
in intron #2 of Dual specificity phosphatase 5 .
In one embodiment, the Prostate Classification Set comprises target sequences derived from in
intron #3 of Citrate lyase beta like; 210,560 base pair 5' of Phosphodiesterase 4D; 189,722 base
pair 5' of Ectodysplasin A receptor; 3,5 10 base pair 3' of DEP domain containing 6; in exon #6
of Basonuclin 2; in intron # 1 of Chromosome 2 open reading frame 3; in intron # 1 of FLJ25476
protein; in intron #10 of Staphylococcal nuclease and tudor domain containing 1; in exon
#22 of Hermansky-Pudlak syndrome 5; and in exon #24 of Chromosome 12 open reading frame
30.
In one embodiment, the Prostate Classification Set comprises target sequences derived from
382,253 base pair 3' of Replication factor C (activator 1) 3, 38kDa; 58,123 base pair 3' of
Tripartite motif-containing 61; in intron #3 of Citrate lyase beta like; in intron # 1 of Adaptor-
related protein complex 1, gamma 2 subunit; in intron #2 of Kallikrein-related peptidase 2;
210,560 base pair 5' of Phosphodiesterase 4D; 3,508 base pair 3' of Cytochrome P450, family 4,
subfamily F, polypeptide 11; 189,722 base pair 5' of Ectodysplasin A receptor; in intron #2 of
Phospholipase C, beta 1; in intron #10 of KIAAl 244; in intron #2 of Paraoxonase 2; 11,235 base
pair 3' of Arachidonate 12-lipoxygenase, 12R type; in exon #22 of Cut-like 2; 143,098 base pah
' of Chemokine (C-X-C motif) ligand 12; in intron #6 of Rho guanine nucleotide exchange
factor (GEF) 5; 15,057 base pair 5' of Olfactory receptor, family 2, subfamily A, member 4;
6,025 base pair 3' of Chromosome 19 open reading frame 42; in exon #2 of Hypothetical gene
supported by AK128346; in exon # 11 of Phosphoglucomutase 5; in exon #9 of Hyaluronan
binding protein 4; in exon #8 of NECAP endocytosis associated 2; in intron #20 of
Myeloid/lymphoid or mixed-lineage leukemia; 1,558 base pair 3' of Signal transducer and
activator of transcription; in intron # 1 of Chromosome 2 open reading frame 3; in intron # 1 of
FLJ25476 protein; in intron #10 of Staphylococcal nuclease and tudor domain containing 1;
84,468 base pair 3' of Transmembrane protein 18; in exon #22 of Hermansky-Pudlak syndrome
5; in exon #24 of Chromosome 12 open reading frame 30; 95,745 base pair of 3' Splicing factor
3b, subunit 1; in exon #4 of Cofilin 2; in intron # 1 of Heparan sulfate 6-O-sulfotransferase 3; in
intron # 1 of Enabled homolog ; in intron #2 of Mannosyl (alpha- 1,6-)-glycoprotein beta-l,6-N-
acetyl-glucosaminyltransferase; in intron #8 of Solute carrier family 24, member 3; 32,382 base
pair 3' of Inositol 1,4,5-triphosphate receptor, type 1; in intron #9 of CAP-GLY domain
containing linker protein 1; in exon #14 of Transglutaminase 4 ; in intron #4 of MOCO
sulphurase C-terminal domain containing 2; 21,555 base pair 5' of 4 hydroxyphenylpyruvate
dioxygenase-like; and in exon #26 of R3H domain containing 1.
In one embodiment, the potential set of target sequences can be filtered for their expression
levels using the multi-tissue expression data made publicly available by Affymetrix at
probes with, for example, elevated expression across numerous tissues (non-specific) or no
expression in prostate tissue can be excluded.
VALIDATION OF TARGET SEQUENCES
Following in silico selection of target sequences, each target sequence suitable for use in the
Prostate Cancer Prognostic Set may be validated to confirm differential relative or normalized
expression in recurrent prostate cancer or non-recurrent prostate cancer. Validation methods are
known in the art and include hybridization techniques such as microarray analysis or Northern
blotting using appropriate controls, and may include one or more additional steps, such as
reverse transcription, transcription, PCR, RT-PCR and the like. The validation of the target
sequences using these methods is well within the abilities of a worker skilled in the art.
MINIMAL EXPRESSION SIGNATURE
In one embodiment, individual Prostate Cancer Prognostic Sets provide for at least a
determination of a minimal expression signature, capable of distinguishing recurrent from non
recurrent forms of prostate cancer. Means for determining the appropriate number of target
sequences necessary to obtain a minimal expression signature are known in the art and include
the Nearest Shrunken Centroids (NSC) method.
In this method (see US 2007003 1873), a standardized centroid is computed for each class. This
is the average gene expression for each gene in each class divided by the within-class standard
deviation for that gene. Nearest centroid classification takes the gene expression profile of a new
sample, and compares it to each of these class centroids. The class whose centroid that it is
closest to, in squared distance, is the predicted class for that new sample. Nearest shrunken
centroid classification "shrinks" each of the class centroids toward the overall centroid for all
classes by an amount called the threshold. This shrinkage consists of moving the centroid
towards zero by threshold, setting it equal to zero if it hits zero. For example if threshold was 2.0,
a centroid of 3.2 would be shrunk to 1.2, a centroid of -3.4 would be shrunk to -1.4, and a
centroid of 1.2 would be shrunk to zero. After shrinking the centroids, the new sample is
classified by the usual nearest centroid rule, but using the shrunken class centroids. This
shrinkage can make the classifier more accurate by reducing the effect of noisy genes and
provides an automatic gene selection. In particular, if a gene is shrunk to zero for all classes, then
it is eliminated from the prediction rule. Alternatively, it may be set to zero for all classes except
one, and it can be learned that the high or low expression for that gene characterizes that class.
The user decides on the value to use for threshold. Typically one examines a number of different
choices. To guide in this choice, PAM does K-fold cross-validation for a range of threshold
values. The samples are divided up at random into K roughly equally sized parts. For each part in
turn, the classifier is built on the other K-I parts then tested on the remaining part. This is done
for a range of threshold values, and the cross-validated misclassification error rate is reported for
each threshold value. Typically, the user would choose the threshold value giving the minimum
cross-validated misclassification error rate.
Alternatively, minimal expression signatures can be established through the use of optimization
algorithms such as the mean variance algorithm widely used in establishing stock portfolios. This
method is described in detail in US patent publication number 20030194734. Essentially, the
method calls for the establishment of a set of inputs (stocks in financial applications, expression
as measured by intensity here) that will optimize the return (e.g., signal that is generated) one
receives for using it while minimizing the variability of the return. In other words, the method
calls for the establishment of a set of inputs (e.g., expression as measured by intensity) that will
optimize the signal while minimizing variability. Many commercial software programs are
available to conduct such operations. "Wagner Associates Mean-Variance Optimization
Application," referred to as "Wagner Software" throughout this specification, is preferred. This
software uses functions from the "Wagner Associates Mean-Variance Optimization Library" to
determine an efficient frontier and optimal portfolios in the Markowitz sense is preferred. Use of
this type of software requires that microarray data be transformed so that it can be treated as an
input in the way stock return and risk measurements are used when the software is used for its
intended financial analysis purposes.
The process of selecting a minimal expression signature can also include the application of
heuristic rules. Preferably, such rules are formulated based on biology and an understanding of
the technology used to produce clinical results. More preferably, they are applied to output from
the optimization method. For example, the mean variance method of portfolio selection can be
applied to microarray data for a number of genes differentially expressed in subjects with cancer.
Output from the method would be an optimized set of genes that could include some genes that
are expressed in peripheral blood as well as in diseased tissue.
Other heuristic rules can be applied that are not necessarily related to the biology in question. For
example, one can apply a rule that only a prescribed percentage of the portfolio can be
represented by a particular gene or group of genes. Commercially available software such as the
Wagner Software readily accommodates these types of heuristics. This can be useful, for
example, when factors other than accuracy and precision (e.g., anticipated licensing fees) have
an impact on the desirability of including one or more genes.
In one embodiment, the Prostate Cancer Prognostic Set for obtaining a minimal expression
signature comprises at least one, two, three, four, five, six, eight, 10, 15, 20, 25 or more of target
sequences shown to have a positive correlation with non-recurrent prostate disease, for example
those depicted in SEQ ID NOs: 1-913 or a subset thereof. In another embodiment, the Prostate
Cancer Prognostic Set for obtaining a minimal expression signature comprises at least one, two,
three, four, five, six, eight, 10, 15, 20, 25 or more of those target sequences shown to have a
positive correlation with recurrent prostate cancer, for example those depicted in of SEQ ID
NOs: 914-21 14, or a subset therof. In yet another embodiment, the Prostate Cancer Prognostic
Set for obtaining a minimal expression signature comprises at least one, two, three, four, five,
six, eight, 10, 15, 20, 25 or more of target sequences shown to have a correlation with no n
recurrent or recurrent prostate cancer, for example those depicted in SEQ ID NOs: 1-21 14 or a
subset thereof.
In some embodiments, the Prostate Cancer Prognostic Set comprises target sequences for
detecting expression products of SEQ IDs: 1-21 14. In some embodiments, the Prostate Cancer
Prognostic Set comprises probes for detecting expression levels of sequences exhibiting positive
and negative correlation with a disease status of interest are employed. For example, a
combination target sequences useful in these methods were found to include those encoding
RNAs corresponding to SEQ ID NOs: 1-913 (found at increased expression in prostate cancer
samples from NED patients) and/or corresponding to SEQ ID NOs: 914-21 14 (found at
increased expression levels in prostate cancer samples from SYS patients), where intermediate
levels of certain target sequences (Table 7) are observed in prostate cancer samples from PSA
patients with biochemical recurrence, where the RNA expression levels are indicative of a
disease state or outcome. Subgroups of these target sequences, as well as individual target
sequences, have been found useful in such methods.
In some embodiments, an RNA signature corresponding to SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21
915-917, 920, 922, 928, 929, 93 1, 935 and 936 (the 2 1-RNA' signature) and/or SEQ ID NOs: 1-
11, 914-920 (the ' 18-RNA' signature) and/or SEQ ID NOs: 1-4, 914,915) (the '6-RNA'
signature) and/or SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21, 915-917, 920, 922, 928, 929, 931, 935
and 936 (the '20-RNA' signature) and/or SEQ ID NOs 3, 36, 60, 63, 926, 971, 978, 999, 1014
and 1022 (the ' K)-RNA' signature) and/or SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88,
100, 241, 265, 334, 437, 920, 925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022, 1023,
1032, 1080, 1093, 1101, 1164, 1248, 1304, 131 1, 1330, 1402 and 1425 (the '41 -RNA' signature)
are formulated into a linear combination of their respective expression values for each patient
generating a patient outcome predictor ('POP') score and indicative of the disease status of the
patient after prostatectomy.
Exemplary subsets and combinations of interest also include at least five, six, 10, 15, 18, 20, 23,
25, 27, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225, 250, 275, 300, 350,
400, 450, 500, 750, 1000, 1200, 1400, 1600, 1800, 2000, or all 2 114 target sequences in Table 4;
at least five, six, 10, 15, 18, 20, 23, 25, 27, 30, 35, 40, 45, 50, 55, 60, 70, 80, 90, 100, 125, 150,
175, 200, 225, 250, 275, 300, 350, 400, 450, 500, or all 526 target sequences in Table 7 ; SEQ ID
NOs: l , 4, 915, 6, 916, 9, 917, 920, 922, 14, 15, 16, 928, 929, 18, 19, 931, 20, 21, 935, 936, or
combinations thereof; SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 914, 915, 916, 917, 918, 919,
920, or combinations thereof; SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21, 915-917, 920, 922, 928,
929, 93 1, 935, 936 or combinations thereof; SEQ ID NOs 3, 36, 60, 63, 926, 971, 978, 999,
1014, 1022 or combinations thereof; SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88, 100,
241, 265, 334, 437, 920, 925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022, 1023, 1032,
1080, 1093, 1101, 1164, 1248, 1304, 131 1, 1330, 1402, 1425 or combinations thereof; at least
one, two, three, four, five or six of SEQ ID NOs: l , 4, 6, 9, 14, 15, 16, 18, 19, 20, and 2 1 and at
least one, two, three, four, five or six of SEQ ID NOs:915, 916, 917, 920, 922, 928, 929, 931,
935, and 936; and at least one, two, three, four, five or six of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9,
10, and 11 at least one, two, three, four, five or six of at least one, two, three, four, five or six of
SEQ ID NOs:914, 915, 916, 917, 918, 919, and 920 and at least one, two, three, four, five or six
of SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21, 915-917, 920, 922, 928, 929, 93 1, 935, 936; and at least
one, two, three, four, five or six of SEQ ID NOs 3, 36, 60, 63, 926, 971, 978, 999, 1014, 1022;
and at least one, two, three, four, five or six of SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69,
88, 100, 241, 265, 334, 437, 920, 925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022,
1023, 1032, 1080, 1093, 1101, 1164, 1248, 1304, 13 11, 1330, 1402, 1425.
Exemplary subsets of interest include those described herein, including in the examples.
Exemplary combinations of interest include those utilizing one or more of the sequences listed in
Tables 5, 7, 8, 9 or 10. Of particular interest are those combinations utilizing at least one
sequence exhibiting positive correlation with the trait of interest, as well as those combinations
utilizing at least one sequence exhibiting negative correlation with the trait of interest. Also of
interest are those combinations utilizing at least two, at least three, at least four, at least five or at
least six of those sequences exhibiting such a positive correlation, in combination with at least
two, at least three, at least four, at least five, or at least six of those sequences exhibiting such a
negative correlation. Exemplary combinations include those utilizing at least one, two, three,
four, five or six of the target sequences depicted in Tables 5 and 6 .
In some embodiments, increased relative expression of one or more of SEQ IDs: 1-913,
decreased relative expression of one or more of SEQ ID NOs:9 14-21 14 or a combination of any
thereof is indicative/predictive of the patient exhibiting no evidence of disease for at least seven
years or more after surgery. In some embodiments, increased relative expression of SEQ
IDs:9 14-21 14, decreased relative expression of one or more of SEQ ID NOs: 1-9 13 or a
combination of any thereof is indicative/predictive of the patient exhibiting systemic prostate
cancer. Increased or decreased expression of target sequences represented in these sequence
listings, or of the target sequences described in the examples, may be utilized in the methods of
the invention.
The Prostate Cancer Prognostic Set can optionally include one or more target sequences
specifically derived from the transcripts of one or more housekeeping genes and/or one or more
internal control target sequences and/or one or more negative control target sequences. In one
embodiment, these target sequences can, for example, be used to normalize expression data.
Housekeeping genes from which target sequences for inclusion in a Prostate Cancer Prognostic
Set can be derived from are known in the art and include those genes in which are expressed at a
constant level in normal and prostate cancer tissue.
The target sequences described herein may be used alone or in combination with each other or
with other known or later identified disease markers.
PROSTATE CANCER PROGNOSTIC PROBES / PRIMERS
The system of the present invention provides for combinations of polynucleotide probes that are
capable of detecting the target sequences of the Prostate Cancer Prognostic Sets. Individual
polynucleotide probes comprise a nucleotide sequence derived from the nucleotide sequence of
the target sequences or complementary sequences thereof. The nucleotide sequence of the
polynucleotide probe is designed such that it corresponds to, or is complementary to the target
sequences. The polynucleotide probe can specifically hybridize under either stringent or lowered
stringency hybridization conditions to a region of the target sequences, to the complement
thereof, or to a nucleic acid sequence (such as a cDNA) derived therefrom.
The selection of the polynucleotide probe sequences and determination of their uniqueness may
be carried out in silico using techniques known in the art, for example, based on a BLASTN
search of the polynucleotide sequence in question against gene sequence databases, such as the
Human Genome Sequence, UniGene, dbEST or the non-redundant database at NCBI. In one
embodiment of the invention, the polynucleotide probe is complementary to a region of a target
mRNA derived from a target sequence in the Prostate Cancer Prognostic Set. Computer
programs can also be employed to select probe sequences that will not cross hybridize or will not
hybridize non-specifically.
One skilled in the art will understand that the nucleotide sequence of the polynucleotide probe
need not be identical to its target sequence in order to specifically hybridize thereto. The
polynucleotide probes of the present invention, therefore, comprise a nucleotide sequence that is
at least about 75% identical to a region of the target gene or mRNA. In another embodiment, the
nucleotide sequence of the polynucleotide probe is at least about 90% identical a region of the
target gene or mRNA. In a further embodiment, the nucleotide sequence of the polynucleotide
probe is at least about 95% identical to a region of the target gene or mRNA. Methods of
determining sequence identity are known in the art and can be determined, for example, by using
the BLASTN program of the University of Wisconsin Computer Group (GCG) software or
provided on the NCBI website. The nucleotide sequence of the polynucleotide probes of the
present invention may exhibit variability by differing (e.g. by nucleotide substitution, including
transition or transversion) at one, two, three, four or more nucleotides from the sequence of the
target gene.
Other criteria known in the art may be employed in the design of the polynucleotide probes of
the present invention. For example, the probes can be designed to have <50% G content and/or
between about 25% and about 70% G+C content. Strategies to optimize probe hybridization to
the target nucleic acid sequence can also be included in the process of probe selection.
Hybridization under particular pH, salt, and temperature conditions can be optimized by taking
into account melting temperatures and by using empirical rules that correlate with desired
hybridization behaviours. Computer models may be used for predicting the intensity and
concentration-dependence of probe hybridization.
As is known in the art, in order to represent a unique sequence in the human genome, a probe
should be at least 15 nucleotides in length. Accordingly, the polynucleotide probes of the present
invention range in length from about 15 nucleotides to the full length of the target sequence or
target mRNA. In one embodiment of the invention, the polynucleotide probes are at least about
15 nucleotides in length. In another embodiment, the polynucleotide probes are at least about 20
nucleotides in length. In a further embodiment, the polynucleotide probes are at least about 25
nucleotides in length. In another embodiment, the polynucleotide probes are between about 15
nucleotides and about 500 nucleotides in length. In other embodiments, the polynucleotide
probes are between about 15 nucleotides and about 450 nucleotides, about 15 nucleotides and
about 400 nucleotides, about 15 nucleotides and about 350 nucleotides, about 15 nucleotides and
about 300 nucleotides in length.
The polynucleotide probes of a Prostate Cancer Prognostic Set can comprise RNA, DNA, RNA
or DNA mimetics, or combinations thereof, and can be single-stranded or double- stranded. Thus
the polynucleotide probes can be composed of naturally-occurring nucleobases, sugars and
covalent internucleoside (backbone) linkages as well as polynucleotide probes having non-
naturally-occurring portions which function similarly. Such modified or substituted
polynucleotide probes may provide desirable properties such as, for example, enhanced affinity
for a target gene and increased stability.
The system of the present invention further provides for primers and primer pairs capable of
amplifying target sequences defined by the Prostate Cancer Prognostic Set, or fragments or
subsequences or complements thereof. The nucleotide sequences of the Prostate Cancer
Prognostic set may be provided in computer-readable media for in silico applications and as a
basis for the design of appropriate primers for amplification of one or more target sequences of
the Prostate Cancer Prognostic Set.
Primers based on the nucleotide sequences of target sequences can be designed for use in
amplification of the target sequences. For use in amplification reactions such as PCR, a pair of
primers will be used. The exact composition of the primer sequences is not critical to the
invention, but for most applications the primers will hybridize to specific sequences of the
Prostate Cancer Prognostic Set under stringent conditions, particularly under conditions of high
stringency, as known in the art. The pairs of primers are usually chosen so as to generate an
amplification product of at least about 50 nucleotides, more usually at least about 100
nucleotides. Algorithms for the selection of primer sequences are generally known, and are
available in commercial software packages. These primers may be used in standard quantitative
or qualitative PCR-based assays to assess transcript expression levels of RNAs defined by the
Prostate Cancer Prognostic Set. Alternatively, these primers may be used in combination with
probes, such as molecular beacons in amplifications using real-time PCR.
In one embodiment, the primers or primer pairs, when used in an amplification reaction,
specifically amplify at least a portion of a nucleic acid depicted in one of SEQ ID NOs: 1-21 14
(or subgroups thereof as set forth herein), an RNA form thereof, or a complement to either
thereof.
As is known in the art, a nucleoside is a base-sugar combination and a nucleotide is a nucleoside
that further includes a phosphate group covalently linked to the sugar portion of the nucleoside.
In forming oligonucleotides, the phosphate groups covalently link adjacent nucleosides to one
another to form a linear polymeric compound, with the normal linkage or backbone of RNA and
DNA being a 3' to 5' phosphodiester linkage. Specific examples of polynucleotide probes or
primers useful in this invention include oligonucleotides containing modified backbones or non-
natural internucleoside linkages. As defined in this specification, oligonucleotides having
modified backbones include both those that retain a phosphorus atom in the backbone and those
that lack a phosphorus atom in the backbone. For the purposes of the present invention, and as
sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom
in their internucleoside backbone can also be considered to be oligonucleotides.
Exemplary polynucleotide probes or primers having modified oligonucleotide backbones
include, for example, those with one or more modified internucleotide linkages that are
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters,
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene
phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 3'amino
phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, thionoalkyl-
phosphonates, thionoalkylphosphotriesters, and boranophosphates having normal 3'-5' linkages,
2'-5' linked analogs of these, and those having inverted polarity wherein the adjacent pairs of
nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'. Various salts, mixed salts and free acid
forms are also included.
Exemplary modified oligonucleotide backbones that do not include a phosphorus atom are
formed by short chain alkyl or cycloalkyl internucleoside linkages, mixed heteroatom and alkyl
or cycloalkyl internucleoside linkages, or one or more short chain heteroatomic or heterocyclic
internucleoside linkages. Such backbones include morpholino linkages (formed in part from the
sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide and sulphone backbones;
formacetyl and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones;
alkene containing backbones; sulphamate backbones; methyleneimino and methylenehydrazino
backbones; sulphonate and sulfonamide backbones; amide backbones; and others having mixed
N, O, S and CH2 component parts.
The present invention also contemplates oligonucleotide mimetics in which both the sugar and
the internucleoside linkage of the nucleotide units are replaced with novel groups. The base units
are maintained for hybridization with an appropriate nucleic acid target compound. An example
of such an oligonucleotide mimetic, which has been shown to have excellent hybridization
properties, is a peptide nucleic acid (PNA) [Nielsen et ah, Science, 254: 1497-1500 (1991)1. In
PNA compounds, the sugar-backbone of an oligonucleotide is replaced with an amide containing
backbone, in particular an aminoethylglycine backbone. The nucleobases are retained and are
bound directly or indirectly to aza-nitrogen atoms of the amide portion of the backbone.
The present invention also contemplates polynucleotide probes or primers comprising "locked
nucleic acids" (LNAs), which are novel conformationally restricted oligonucleotide analogues
containing a methylene bridge that connects the 2'-0 of ribose with the 4'-C (see, Singh et al,
Chem. Commun., 1998, 4:455-456). LNA and LNA analogues display very high duplex thermal
stabilities with complementary DNA and RNA, stability towards 3'-exonuclease degradation,
and good solubility properties. Synthesis of the LNA analogues of adenine, cytosine, guanine, 5-
methylcytosine, thymine and uracil, their oligomerization, and nucleic acid recognition
properties have been described (see Koshkin et al., Tetrahedron, 1998, 54:3607-3630). Studies
of mis-matched sequences show that LNA obey the Watson-Crick base pairing rules with
generally improved selectivity compared to the corresponding unmodified reference strands.
LNAs form duplexes with complementary DNA or RNA or with complementary LNA, with high
thermal affinities. The universality of LNA-mediated hybridization has been emphasized by the
formation of exceedingly stable LNA:LNA duplexes (Koshkin et al, J. Am. Chem. Soc, 1998,
120: 13252-13253). LNA:LNA hybridization was shown to be the most thermally stable nucleic
acid type duplex system, and the RNA-mimicking character of LNA was established at the
duplex level. Introduction of three LNA monomers (T or A) resulted in significantly increased
melting points toward DNA complements.
Synthesis of 2'-amino-LNA (Singh et al., J . Org. Chem., 1998, 63, 10035-10039) and T -
methylamino-LNA has been described and thermal stability of their duplexes with
complementary RNA and DNA strands reported. Preparation of phosphorothioate-LNA and T -
thio-LNA have also been described (Kumar et al, Bioorg. Med. Chem. Lett., 1998, 8:2219-
2222).
Modified polynucleotide probes or primers may also contain one or more substituted sugar
moieties. For example, oligonucleotides may comprise sugars with one of the following
substituents at the 2' position: OH; F; O-, S-, or N-alkyl; O-, S-, or N-alkenyl; O-, S- or N-
alkynyl; or O-alkyl-0-alkyl, wherein the alkyl, alkenyl and alkynyl may be substituted or
unsubstituted C i to C i0 alkyl or C2 to C i0 alkenyl and alkynyl. Examples of such groups are:
O[(CH 2)n 0 I111
CH , 0(CH 2)n OCH , 0(CH 2)n NH2, 0(CH 2)n CH , 0(CH 2)n ONH2, and 0(CH 2)n
ONf(CH 2)n CH ) 2, where n and m are from 1 to about 10. Alternatively, the oligonucleotides
may comprise one of the following substituents at the 2' position: C i to C i0 lower alkyl,
substituted lower alkyl, alkaryl, aralkyl, O-alkaryl or O-aralkyl, SH, SCH 1, OCN, Cl, Br, CN,
CF1, OCF 1, SOCH 1, SO2 CH1, ONO2, NO2, N1, NH2, heterocycloalkyl, heterocycloalkaryl,
aminoalkylamino, polyalkylamino, substituted silyl, an RNA cleaving group, a reporter group, an
intercalator, a group for improving the pharmacokinetic properties of an oligonucleotide, or a
group for improving the pharmacodynamic properties of an oligonucleotide, and other
substituents having similar properties. Specific examples include 2'-methoxyethoxy (2'-0-CH 2
CH2 OCH 1, also known as 2'-O-(2-methoxyethyl) or 2'-MOE) [Martin et al, HeIv. Chim. Acta,
78:486-504(1995)1, 2'-dimethylaminooxyethoxy (O(CH 2)2 ON(CH 1 2 group, also known as T -
DMAOE), 2'-methoxy (2'-0-CH 1), 2'-aminopropoxy (2'-OCH 2 CH2 CH2 NH2) and 2'-fluoro (T-
F).
Similar modifications may also be made at other positions on the polynucleotide probes or
primers, particularly the 3' position of the sugar on the 3' terminal nucleotide or in 2'-5' linked
oligonucleotides and the 5' position of 5' terminal nucleotide. Polynucleotide probes or primers
may also have sugar mimetics such as cyclobutyl moieties in place of the pentofuranosyl sugar.
Polynucleotide probes or primers may also include modifications or substitutions to the
nucleobase. As used herein, "unmodified" or "natural" nucleobases include the purine bases
adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) and uracil (U).
Modified nucleobases include other synthetic and natural nucleobases such as 5-methylcytosine
(5-me-C), 5- hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and
other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine
and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-halouracil and cytosine, 5-
propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-
thiouracil, 8-halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and
guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted uracils and
cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8-azaadenine, 7-
deazaguanine and 7-deazaadenine and 3-deazaguanine and 3-deazaadenine. Further nucleobases
include those disclosed in U.S. Pat. No. 3,687,808; The Concise Encyclopedia Of Polymer
Science And Engineering, (1990) pp 858-859, Kroschwitz, J . L, ed. John Wiley & Sons;
Englisch et ah, Angewandte Chemie, Int. Ed., 30:613 (1991); and Sanghvi, Y. S., (1993)
Antisense Research and Applications, pp 289-302, Crooke, S . T. and Lebleu, B., ed., CRC Press.
Certain of these nucleobases are particularly useful for increasing the binding affinity of the
polynucleotide probes of the invention. These include 5-substituted pyrimidines, 6-
azapyrimidines and N-2, N-6 and O-6 substituted purines, including 2-aminopropyladenine, 5-
propynyluracil and 5-propynylcytosine. 5-methylcytosine substitutions have been shown to
increase nucleic acid duplex stability by 0.6-1 .20C [Sanghvi, Y. S., (1993) Antisense Research
and Applications, pp 276-278, Crooke, S . T. and Lebleu, B., ed., CRC Press, Boca Raton].
One skilled in the art will recognize that it is not necessary for all positions in a given
polynucleotide probe or primer to be uniformly modified. The present invention, therefore,
contemplates the incorporation of more than one of the aforementioned modifications into a
single polynucleotide probe or even at a single nucleoside within the probe or primer.
One skilled in the art will also appreciate that the nucleotide sequence of the entire length of the
polynucleotide probe or primer does not need to be derived from the target sequence. Thus, for
example, the polynucleotide probe may comprise nucleotide sequences at the 5' and/or 3' termini
that are not derived from the target sequences. Nucleotide sequences which are not derived from
the nucleotide sequence of the target sequence may provide additional functionality to the
polynucleotide probe. For example, they may provide a restriction enzyme recognition sequence
or a "tag" that facilitates detection, isolation, purification or immobilisation onto a solid support.
Alternatively, the additional nucleotides may provide a self-complementary sequence that allows
the primer/probe to adopt a hairpin configuration. Such configurations are necessary for certain
probes, for example, molecular beacon and Scorpion probes, which can be used in solution
hybridization techniques.
The polynucleotide probes or primers can incorporate moieties useful in detection, isolation,
purification, or immobilisation, if desired. Such moieties are well-known in the art (see, for
example, Ausubel et ah, (1997 & updates) Current Protocols in Molecular Biology, Wiley &
Sons, New York) and are chosen such that the ability of the probe to hybridize with its target
sequence is not affected.
Examples of suitable moieties are detectable labels, such as radioisotopes, fluorophores,
chemiluminophores, enzymes, colloidal particles, and fluorescent microparticles, as well as
antigens, antibodies, haptens, avidin/streptavidin, biotin, haptens, enzyme cofactors / substrates,
enzymes, and the like.
A label can optionally be attached to or incorporated into a probe or primer polynucleotide to
allow detection and/or quantitation of a target polynucleotide representing the target sequence of
interest. The target polynucleotide may be the expressed target sequence RNA itself, a cDNA
copy thereof, or an amplification product derived therefrom, and may be the positive or negative
strand, so long as it can be specifically detected in the assay being used. Similarly, an antibody
may be labeled.
In certain multiplex formats, labels used for detecting different targets may be distinguishable.
The label can be attached directly (e.g., via covalent linkage) or indirectly, e.g., via a bridging
molecule or series of molecules (e.g., a molecule or complex that can bind to an assay
component, or via members of a binding pair that can be incorporated into assay components,
e.g. biotin-avidin or streptavidin). Many labels are commercially available in activated forms
which can readily be used for such conjugation (for example through amine acylation), or labels
may be attached through known or determinable conjugation schemes, many of which are known
in the art.
Labels useful in the invention described herein include any substance which can be detected
when bound to or incorporated into the biomolecule of interest. Any effective detection method
can be used, including optical, spectroscopic, electrical, piezoelectrical, magnetic, Raman
scattering, surface plasmon resonance, colorimetric, calorimetric, etc. A label is typically
selected from a chromophore, a lumiphore, a fluorophore, one member of a quenching system, a
chromogen, a hapten, an antigen, a magnetic particle, a material exhibiting nonlinear optics, a
semiconductor nanocrystal, a metal nanopaiticle, an enzyme, an antibody or binding portion or
equivalent thereof, an aptamer, and one member of a binding pair, and combinations thereof.
Quenching schemes may be used, wherein a quencher and a fluorophore as members of a
quenching pair may be used on a probe, such that a change in optical parameters occurs upon
binding to the target introduce or quench the signal from the fluorophore. One example of such a
system is a molecular beacon. Suitable quencher/fluorophore systems are known in the art. The
label may be bound through a variety of intermediate linkages. For example, a polynucleotide
may comprise a biotin-binding species, and an optically detectable label may be conjugated to
biotin and then bound to the labeled polynucleotide. Similarly, a polynucleotide sensor may
comprise an immunological species such as an antibody or fragment, and a secondary antibody
containing an optically detectable label may be added.
Chromophores useful in the methods described herein include any substance which can absorb
energy and emit light. For multiplexed assays, a plurality of different signaling chromophores
can be used with detectably different emission spectra. The chromophore can be a lumophore or
a fluorophore. Typical fluorophores include fluorescent dyes, semiconductor nanocrystals,
lanthanide chelates, polynucleotide-specific dyes and green fluorescent protein.
Coding schemes may optionally be used, comprising encoded particles and/or encoded tags
associated with different polynucleotides of the invention. A variety of different coding schemes
are known in the art, including fluorophores, including SCNCs, deposited metals, and RF tags.
Polynucleotides from the described target sequences may be employed as probes for detecting
target sequences expression, for ligation amplification schemes, or may be used as primers for
amplification schemes of all or a portion of a target sequences. When amplified, either strand
produced by amplification may be provided in purified and/or isolated form.
In one embodiment, polynucleotides of the invention include a nucleic acid depicted in(a) any
one of SEQ ID NOs: 1-21 14, or a subgroup thereof as set forth herein; (b) an RNA form of any
one of the nucleic acids depicted in SEQ ID NOs: 1-21 14, or a subgroup thereof as set forth
herein; (c) a peptide nucleic acid form of any of the nucleic acids depicted in SEQ ID NOs: 1-
2 114, or a subgroup thereof as set forth herein; (d) a nucleic acid comprising at least 20
consecutive bases of any of (a-c); (e) a nucleic acid comprising at least 25 bases having at least
90% sequenced identity to any of (a-c); and (f) a complement to any of (a-e).
Complements may take any polymeric form capable of base pairing to the species recited in (a)-
(e), including nucleic acid such as RNA or DNA, or may be a neutral polymer such as a peptide
nucleic acid. Polynucleotides of the invention can be selected from the subsets of the recited
nucleic acids described herein, as well as their complements.
In some embodiments, polynucleotides of the invention comprise at least 20 consecutive bases as
depicted in SEQ ID NOs: 1-21 14, or a complement thereto. The polynucleotides may comprise
at least 21, 22, 23, 24, 25, 27, 30, 32, 35 or more consecutive bases as depicted in SEQ ID
NOs: 1-21 14, as applicable.
In some embodiments, the nucleic acid in (a) can be selected from those in Table 3, and from
SEQ ID NOs: l , 4 , 915, 6, 916, 9, 917, 920, 922, 14, 15, 16, 928, 929, 18, 19, 931, 20, 21, 935,
and 936; or from SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 914, 915, 916, 917, 918, 919, and
920; or from SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21, 915-917, 920, 922, 928, 929, 931, 935 and
936; or from SEQ ID NOs 3, 36, 60, 63, 926, 971, 978, 999, 1014 and 1022; or from SEQ ID
NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88, 100, 241, 265, 334, 437, 920, 925, 934, 945, 947,
954, 971, 978, 999, 1004, 1014, 1022, 1023, 1032, 1080, 1093, 1101, 1164, 1248, 1304, 13 11,
1330, 1402 and 1425.
The polynucleotides may be provided in a variety of formats, including as solids, in solution, or
in an array. The polynucleotides may optionally comprise one or more labels, which may be
chemically and/or enzymatically incorporated into the polynucleotide.
In one embodiment, solutions comprising polynucleotide and a solvent are also provided. In
some embodiments, the solvent may be water or may be predominantly aqueous. In some
embodiments, the solution may comprise at least two, three, four, five, six, seven, eight, nine,
ten, twelve, fifteen, seventeen, twenty or more different polynucleotides, including primers and
primer pairs, of the invention. Additional substances may be included in the solution, alone or in
combination, including one or more labels, additional solvents, buffers, biomolecules,
polynucleotides, and one or more enzymes useful for performing methods described herein,
including polymerases and ligases. The solution may further comprise a primer or primer pair
capable of amplifying a polynucleotide of the invention present in the solution.
In some embodiments, one or more polynucleotides provided herein can be provided on a
substrate. The substrate can comprise a wide range of material, either biological, nonbiological,
organic, inorganic, or a combination of any of these. For example, the substrate may be a
polymerized Langmuir Blodgett film, functionalized glass, Si, Ge, GaAs, GaP, SiCh, SiN4,
modified silicon, or any one of a wide variety of gels or polymers such as
(poly)tetrafluoroethylene, (poly)vinylidenedifluoride, polystyrene, cross-linked polystyrene,
polyacrylic, polylactic acid, polyglycolic acid, poly(lactide coglycolide), polyanhydrides,
poly(methyl methacrylate), poly(ethylene-co-vinyl acetate), polysiloxanes, polymeric silica,
latexes, dextran polymers, epoxies, polycarbonates, or combinations thereof. Conducting
polymers and photoconductive materials can be used.
Substrates can be planar crystalline substrates such as silica based substrates (e.g. glass, quartz,
or the like), or crystalline substrates used in, e.g., the semiconductor and microprocessor
industries, such as silicon, gallium arsenide, indium doped GaN and the like, and includes
semiconductor nanocrystals.
The substrate can take the form of an array, a photodiode, an optoelectronic sensor such as an
optoelectronic semiconductor chip or optoelectronic thin-film semiconductor, or a biochip. The
location(s) of probe(s) on the substrate can be addressable; this can be done in highly dense
formats, and the location(s) can be microaddressable or nanoaddressable.
Silica aerogels can also be used as substrates, and can be prepared by methods known in the art.
Aerogel substrates may be used as free standing substrates or as a surface coating for another
substrate material.
The substrate can take any form and typically is a plate, slide, bead, pellet, disk, particle,
microparticle, nanoparticle, strand, precipitate, optionally porous gel, sheets, tube, sphere,
container, capillary, pad, slice, film, chip, multiwell plate or dish, optical fiber, etc. The
substrate can be any form that is rigid or semi-rigid. The substrate may contain raised or
depressed regions on which an assay component is located. The surface of the substrate can be
etched using known techniques to provide for desired surface features, for example trenches, v-
grooves, mesa structures, or the like.
Surfaces on the substrate can be composed of the same material as the substrate or can be made
from a different material, and can be coupled to the substrate by chemical or physical means.
Such coupled surfaces may be composed of any of a wide variety of materials, for example,
polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals,
inorganic glasses, membranes, or any of the above-listed substrate materials. The surface can be
optically transparent and can have surface Si-OH functionalities, such as those found on silica
surfaces.
The substrate and/or its optional surface can be chosen to provide appropriate characteristics for
the synthetic and/or detection methods used. The substrate and/or surface can be transparent to
allow the exposure of the substrate by light applied from multiple directions. The substrate
and/or surface may be provided with reflective "mirror" structures to increase the recovery of
light.
The substrate and/or its surface is generally resistant to, or is treated to resist, the conditions to
which it is to be exposed in use, and can be optionally treated to remove any resistant material
after exposure to such conditions.
The substrate or a region thereof may be encoded so that the identity of the sensor located in the
substrate or region being queried may be determined. Any suitable coding scheme can be used,
for example optical codes, RFID tags, magnetic codes, physical codes, fluorescent codes, and
combinations of codes.
PREPARATION OF PROBES AND PRIMERS
The polynucleotide probes or primers of the present invention can be prepared by conventional
techniques well-known to those skilled in the art. For example, the polynucleotide probes can be
prepared using solid-phase synthesis using commercially available equipment. As is well-known
in the art, modified oligonucleotides can also be readily prepared by similar methods. The
polynucleotide probes can also be synthesized directly on a solid support according to methods
standard in the art. This method of synthesizing polynucleotides is particularly useful when the
polynucleotide probes are part of a nucleic acid array.
Polynucleotide probes or primers can be fabricated on or attached to the substrate by any suitable
method, for example the methods described in U.S. Pat. No. 5,143,854, PCT Publ. No. WO
92/10092, U.S. Patent Application Ser. No. 07/624,120, filed Dec. 6, 1990 (now abandoned),
Fodor et al., Science, 25 1: 767-777 (1991), and PCT Publ. No. WO 90/15070). Techniques for
the synthesis of these arrays using mechanical synthesis strategies are described in, e.g., PCT
Publication No. WO 93/09668 and U.S. Pat. No. 5,384,261. Still further techniques include bead
based techniques such as those described in PCT Appl. No. PCT/US93/04145 and pin based
methods such as those described in U.S. Pat. No. 5,288,5 14. Additional flow channel or spotting
methods applicable to attachment of sensor polynucleotides to a substrate are described in U . S .
Patent Application Ser. No. 07/980,523, filed Nov. 20, 1992, and U.S. Pat. No. 5,384,26 1.
Alternatively, the polynucleotide probes of the present invention can be prepared by enzymatic
digestion of the naturally occurring target gene, or mRNA or cDNA derived therefrom, by
methods known in the art.
PROSTATECANCER PROGNOSTIC METHODS
The present invention further provides methods for characterizing prostate cancer sample for
recurrence risk. The methods use the Prostate Cancer Prognostic Sets, probes and primers
described herein to provide expression signatures or profiles from a test sample derived from a
subject having or suspected of having prostate cancer. In some embodiments, such methods
involve contacting a test sample with Prostate Cancer Prognostic probes (either in solution or
immobilized) under conditions that permit hybridization of the probe(s) to any target nucleic
acid(s) present in the test sample and then detecting any probe: target duplexes formed as an
indication of the presence of the target nucleic acid in the sample. Expression patterns thus
determined are then compared to one or more reference profiles or signatures. Optionally, the
expression pattern can be normalized. The methods use the Prostate Cancer Prognostic Sets,
probes and primers described herein to provide expression signatures or profiles from a test
sample derived from a subject to classify the prostate cancer as recurrent or non-recurrent.
In some embodiments, such methods involve the specific amplification of target sequences
nucleic acid(s) present in the test sample using methods known in the art to generate an
expression profile or signature which is then compared to a reference profile or signature.
In some embodiments, the invention further provides for prognosing patient outcome, predicting
likelihood of recurrence after prostatectomy and/or for designating treatment modalities.
In one embodiment, the methods generate expression profiles or signatures detailing the
expression of the 2 114 target sequences having altered relative expression with different prostate
cancer outcomes. In one embodiment, the methods generate expression profiles or signatures
detailing the expression of the subsets of these target sequences having 526 or 18 target
sequences as described in the examples.
In some embodiments, increased relative expression of one or more of SEQ IDs: 1-913,
decreased relative expression of one or more of SEQ ID NOs:9 14-21 14 or a combination of any
thereof is indicative of a non-recurrent form of prostate cancer and may be predictive a NED
clinical outcome after surgery. In some embodiments, increased relative expression of SEQ
IDs:9 14-21 14, decreased relative expression of one or more of SEQ ID NOs: 1-9 13 or a
combination of any thereof is indicative of a recurrent form of prostate cancer and may be
predictive of a SYS clinical outcome after surgery. Increased or decreased expression of target
sequences represented in these sequence listings, or of the target sequences described in the
examples, may be utilized in the methods of the invention.
In one embodiment, intermediate levels of expression of one or more target sequences depicted
in Table 7 indicate a probability of future biochemical recurrence.
In some embodiments, the methods detect combinations of expression levels of sequences
exhibiting positive and negative correlation with a disease status. In one embodiment, the
methods detect a minimal expression signature.
Any method of detecting and/or quantitating the expression of the encoded target sequences can
in principle be used in the invention. Such methods can include Northern blotting, array or
microarray hybridization, by enzymatic cleavage of specific structures (e.g., an Invader® assay,
Third Wave Technologies, e.g. as described in U.S. Pat. Nos. 5,846,717, 6,090,543; 6,001,567;
5,985,557; and 5,994,069) and amplification methods, e.g. RT-PCR, including in a TaqMan®
assay (PE Biosystems, Foster City, Calif., e.g. as described in U.S. Pat. Nos. 5,962,233 and
5,538,848), and may be quantitative or semi-quantitative, and may vary depending on the origin,
amount and condition of the available biological sample. Combinations of these methods may
also be used. For example, nucleic acids may be amplified, labeled and subjected to microarray
analysis. Single-molecule sequencing (e.g., Illumina, Helicos, PacBio, ABI SOLID), in situ
hybridization, bead-array technologies (e.g., Luminex xMAP, Illumina BeadChips), branched
DNA technology (e.g., Panomics, Genisphere).
The expressed target sequences can be directly detected and/or quantitated, or may be copied
and/or amplified to allow detection of amplified copies of the expressed target sequences or its
complement. In some embodiments, degraded and/or fragmented RNA can be usefully analyzed
for expression levels of target sequences, for example RNA having an RNA integrity number of
less than 8 .
In some embodiments, quantitative RT-PCR assays are used to measure the expression level of
target sequences depicted in SEQ IDs: 1-21 14. In other embodiments, a GeneChip or microarray
can be used to measure the expression of one or more of the target sequences.
Molecular assays measure the relative expression levels of the target sequences, which can be
normalized to the expression levels of one or more control sequences, for example array control
sequences and/or one or more housekeeping genes, for example GAPDH. Increased (or
decreased) relative expression of the target sequences as described herein, including any of SEQ
ID NOs: 1-21 14, may thus be used alone or in any combination with each other in the methods
described herein. In addition, negative control probes may be included.
DIAGNOSTIC SAMPLES
Diagnostic samples for use with the systems and in the methods of the present invention
comprise nucleic acids suitable for providing RNAs expression information. In principle, the
biological sample from which the expressed RNA is obtained and analyzed for target sequence
expression can be any material suspected of comprising prostate cancer tissue or cells. The
diagnostic sample can be a biological sample used directly in a method of the invention.
Alternatively, the diagnostic sample can be a sample prepared from a biological sample.
In one embodiments, the sample or portion of the sample comprising or suspected of comprising
prostate cancer tissue or cells can be any source of biological material, including cells, tissue or
fluid, including bodily fluids. Non-limiting examples of the source of the sample include an
aspirate, a needle biopsy, a cytology pellet, a bulk tissue preparation or a section thereof obtained
for example by surgery or autopsy, lymph fluid, blood, plasma, serum, tumors, and organs.
The samples may be archival samples, having a known and documented medical outcome, or
may be samples from current patients whose ultimate medical outcome is not yet known.
In some embodiments, the sample may be dissected prior to molecular analysis. The sample
may be prepared via macrodissection of a bulk tumor specimen or portion thereof, or may be
treated via microdissection, for example via Laser Capture Microdissection (LCM).
The sample may initially be provided in a variety of states, as fresh tissue, fresh frozen tissue,
fine needle aspirates, and may be fixed or unfixed. Frequently, medical laboratories routinely
prepare medical samples in a fixed state, which facilitates tissue storage. A variety of fixatives
can be used to fix tissue to stabilize the morphology of cells, and may be used alone or in
combination with other agents. Exemplary fixatives include crosslinking agents, alcohols,
acetone, Bouin's solution, Zenker solution, HeIy solution, osmic acid solution and Carnoy
solution.
Crosslinking fixatives can comprise any agent suitable for forming two or more covalent bonds,
for example an aldehyde. Sources of aldehydes typically used for fixation include formaldehyde,
paraformaldehyde, glutaraldehyde or formalin. Preferably, the crosslinking agent comprises
formaldehyde, which may be included in its native form or in the form of paraformaldehyde or
formalin. One of skill in the art would appreciate that for samples in which crosslinking
fixatives have been used special preparatory steps may be necessary including for example
heating steps and proteinase-k digestion; see methods
One or more alcohols may be used to fix tissue, alone or in combination with other fixatives.
Exemplary alcohols used for fixation include methanol, ethanol and isopropanol.
Formalin fixation is frequently used in medical laboratories. Formalin comprises both an
alcohol, typically methanol, and formaldehyde, both of which can act to fix a biological sample.
Whether fixed or unfixed, the biological sample may optionally be embedded in an embedding
medium. Exemplary embedding media used in histology including paraffin, Tissue-Tek®
V.I.P. IM, Paramat, Paramat Extra, Paraplast, Paraplast X-tra, Paraplast Plus, Peel Away Paraffin
Embedding Wax, Polyester Wax, Carbowax Polyethylene Glycol, Polyfin IM, Tissue Freezing
Medium TFM IM, Cryo-Gel IM, and OCT Compound (Electron Microscopy Sciences, Hatfield,
PA). Prior to molecular analysis, the embedding material may be removed via any suitable
techniques, as known in the art. For example, where the sample is embedded in wax, the
embedding material may be removed by extraction with organic solvent(s), for example xylenes.
Kits are commercially available for removing embedding media from tissues. Samples or
sections thereof may be subjected to further processing steps as needed, for example serial
hydration or dehydration steps.
In some embodiments, the sample is a fixed, wax-embedded biological sample. Frequently,
samples from medical laboratories are provided as fixed, wax-embedded samples, most
commonly as formalin-fixed, paraffin embedded (FFPE) tissues.
Whatever the source of the biological sample, the target polynucleotide that is ultimately assayed
can be prepared synthetically (in the case of control sequences), but typically is purified from the
biological source and subjected to one or more preparative steps. The RNA may be purified to
remove or diminish one or more undesired components from the biological sample or to
concentrate it. Conversely, where the RNA is too concentrated for the particular assay, it may be
diluted.
RNA EXTRACTION
RNA can be extracted and purified from biological samples using any suitable technique. A
number of techniques are known in the art, and several are commercially available (e.g.,
FormaPure 1M nucleic acid extraction kit, Agencouit Biosciences, Beverly MA, High Pure FFPE
RNA Micro Kit IM, Roche Applied Science, Indianapolis, IN). RNA can be extracted from
frozen tissue sections using TRIzol (Invitrogen, Carlsbad, CA) and purified using RNeasy
Protect kit (Qiagen, Valencia, CA). RNA can be further purified using DNAse I treatment
(Ambion, Austin, TX) to eliminate any contaminating DNA. RNA concentrations can be made
using a Nanodrop ND-K)OO spectrophotometer (Nanodrop Technologies, Rockland, DE). RNA
integrity can be evaluated by running electropherograms, and RNA integrity number (RIN, a
correlative measure that indicates intactness of mRNA) can be determined using the RNA 6000
PicoAssay for the Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA).
REVERSE TRANSCRIPTION FOR QRT-PCR ANALYSIS
Reverse transcription can be performed using the Omniscript kit (Qiagen, Valencia, CA),
Superscript III kit (Invitrogen, Carlsbad, CA), for RT-PCR. Target- specific priming can be
performed in order to increase the sensitivity of detection of target sequences and generate
target-specific cDNA.
TAQM ΛN® GENE EXPRESSION ANALYSIS
TaqMan ® RT-PCR can be performed using Applied Biosystems Prism (ABI) 7900 HT
instruments in a 5 µl volume with target sequence-specific cDNA equivalent to 1 ng total RNA.
Primers and probes concentrations for TaqMan analysis are added to amplify fluorescent
amplicons using PCR cycling conditions such as 950C for 10 minutes for one cycle, 950C for 20
seconds, and 6O0C for 45 seconds for 40 cycles. A reference sample can be assayed to ensure
reagent and process stability. Negative controls (i.e., no template) should be assayed to monitor
any exogenous nucleic acid contamination.
AMPLIFICATION AND HYBRIDIZATION
Following sample collection and nucleic acid extraction, the nucleic acid portion of the sample
comprising RNA that is or can be used to prepare the target polynucleotide(s) of interest can be
subjected to one or more preparative reactions. These preparative reactions can include in vitro
transcription (IVT), labeling, fragmentation, amplification and other reactions. mRNA can first
be treated with reverse transcriptase and a primer to create cDNA prior to detection, quantitation
and/or amplification; this can be done in vitro with purified mRNA or in situ, e.g., in cells or
tissues affixed to a slide.
By "amplification" is meant any process of producing at least one copy of a nucleic acid, in this
case an expressed RNA, and in many cases produces multiple copies. An amplification product
can be RNA or DNA, and may include a complementary strand to the expressed target sequence.
DNA amplification products can be produced initially through reverse translation and then
optionally from further amplification reactions. The amplification product may include all or a
portion of a target sequence, and may optionally be labeled. A variety of amplification methods
are suitable for use, including polymerase-based methods and ligation-based methods.
Exemplary amplification techniques include the polymerase chain reaction method (PCR), the
ligase chain reaction (LCR), ribozyme-based methods, self sustained sequence replication (3SR),
nucleic acid sequence-based amplification (NASBA), the use of Q Beta replicase, reverse
transcription, nick translation, and the like.
Asymmetric amplification reactions may be used to preferentially amplify one strand
representing the target sequence that is used for detection as the target polynucleotide. In some
cases, the presence and/or amount of the amplification product itself may be used to determine
the expression level of a given target sequence. In other instances, the amplification product
may be used to hybridize to an array or other substrate comprising sensor polynucleotides which
are used to detect and/or quantitate target sequence expression.
The first cycle of amplification in polymerase-based methods typically forms a primer extension
product complementary to the template strand. If the template is single-stranded RNA, a
polymerase with reverse transcriptase activity is used in the first amplification to reverse
transcribe the RNA to DNA, and additional amplification cycles can be performed to copy the
primer extension products. The primers for a PCR must, of course, be designed to hybridize to
regions in their corresponding template that will produce an amplifiable segment; thus, each
primer must hybridize so that its 3' nucleotide is paired to a nucleotide in its complementary
template strand that is located 3' from the 3' nucleotide of the primer used to replicate that
complementary template strand in the PCR.
The target polynucleotide can be amplified by contacting one or more strands of the target
polynucleotide with a primer and a polymerase having suitable activity to extend the primer and
copy the target polynucleotide to produce a full-length complementary polynucleotide or a
smaller portion thereof. Any enzyme having a polymerase activity that can copy the target
polynucleotide can be used, including DNA polymerases, RNA polymerases, reverse
transcriptases, enzymes having more than one type of polymerase or enzyme activity. The
enzyme can be thermolabile or thermostable. Mixtures of enzymes can also be used. Exemplary
enzymes include: DNA polymerases such as DNA Polymerase I ("Pol I"), the Klenow fragment
of Pol I, T4, T7, Sequenase® T7, Sequenase® Version 2.0 T7, Tub, Taq, Tth, Pfx, Pfii, Tsp, TfI,
TH and Pyrococcus sp GB-D DNA polymerases; RNA polymerases such as E . coll, SP6, T3
and T7 RNA polymerases; and reverse transcriptases such as AMV, M-MuLV, MMLV, RNAse
H MMLV (Superscript®), Superscript® II, ThermoScript®, HIV-I, and RAV2 reverse
transcriptases. All of these enzymes are commercially available. Exemplary polymerases with
multiple specificities include RAV2 and TH (exo-) polymerases. Exemplary thermostable
polymerases include Tub, Taq, Tth, Pfx, Pfu, Tsp, TfI, TH and Pyrococcus sp. GB-D DNA
polymerases.
Suitable reaction conditions are chosen to permit amplification of the target polynucleotide,
including pH, buffer, ionic strength, presence and concentration of one or more salts, presence
and concentration of reactants and cofactors such as nucleotides and magnesium and/or other
metal ions (e.g., manganese), optional cosolvents, temperature, thermal cycling profile for
amplification schemes comprising a polymerase chain reaction, and may depend in part on the
polymerase being used as well as the nature of the sample. Cosolvents include formamide
(typically at from about 2 to about 10 %), glycerol (typically at from about 5 to about 10 %), and
DMSO (typically at from about 0.9 to about 10 %). Techniques may be used in the amplification
scheme in order to minimize the production of false positives or artifacts produced during
amplification. These include "touchdown" PCR, hot-start techniques, use of nested primers, or
designing PCR primers so that they form stem-loop structures in the event of primer-dimer
formation and thus are not amplified. Techniques to accelerate PCR can be used, for example
centrifugal PCR, which allows for greater convection within the sample, and comprising infrared
heating steps for rapid heating and cooling of the sample. One or more cycles of amplification
can be performed. An excess of one primer can be used to produce an excess of one primer
extension product during PCR; preferably, the primer extension product produced in excess is
the amplification product to be detected. A plurality of different primers may be used to amplify
different target polynucleotides or different regions of a particular target polynucleotide within
the sample.
An amplification reaction can be performed under conditions which allow an optionally labeled
sensor polynucleotide to hybridize to the amplification product during at least part of an
amplification cycle. When the assay is performed in this manner, real-time detection of this
hybridization event can take place by monitoring for light emission or fluorescence during
amplification, as known in the art.
Where the amplification product is to be used for hybridization to an array or microarray, a
number of suitable commercially available amplification products are available. These include
amplification kits available from NuGEN, Inc. (San Carlos, CA), including the WT-Ovation IM
System, WT-Ovation IM System v2, WT-Ovation IM Pico System, WT-Ovation IM FFPE Exon
Module, WT-Ovation IM FFPE Exon Module RiboAmp and RiboAmp Plus RNA Amplification
Kits (MDS Analytical Technologies (formerly Arcturus) (Mountain View, CA), Genisphere, Inc.
(Hatfield, PA), including the RampUp Plus IM and SenseAmp IM RNA Amplification kits, alone
or in combination. Amplified nucleic acids may be subjected to one or more purification
reactions after amplification and labeling, for example using magnetic beads (e.g., RNAClean
magnetic beads, Agencourt Biosciences).
Multiple RNA biomarkers can be analyzed using real-time quantitative multiplex RT-PCR
platforms and other multiplexing technologies such as GenomeLab GeXP Genetic Analysis
System (Beckman Coulter, Foster City, CA), SmartCycler® 9600 or GeneXpert(R) Systems
(Cepheid, Sunnyvale, CA), ABI 7900 HT Fast Real Time PCR system (Applied Biosystems,
Foster City, CA), LightCycler® 480 System (Roche Molecular Systems, Pleasanton, CA),
xMAP 100 System (Luminex, Austin, TX) Solexa Genome Analysis System (Illumina,
Hayward, CA), OpenArray Real Time qPCR (BioTrove, Woburn, MA) and BeadXpress System
(Illumina, Hayward, CA).
PROSTATE CLASSIFICATION ARRAYS
The present invention contemplates that a Prostate Cancer Prognostic Set or probes derived
therefrom may be provided in an array format. In the context of the present invention, an "array"
is a spatially or logically organized collection of polynucleotide probes. Any array comprising
sensor probes specific for two or more of SEQ ID NOs: 1-21 14 or a product derived therefrom
can be used. Desirably, an array will be specific for 5, 10, 15, 20, 25, 30, 50, 75, 100, 150, 200,
250, 300, 350, 400, 450, 500, 600, 700, 1000, 1200, 1400, 1600, 1800, 2000 or more of SEQ ID
NOs: 1-21 14. Expression of these sequences may be detected alone or in combination with other
transcripts. In some embodiments, an array is used which comprises a wide range of sensor
probes for pro state- specific expression products, along with appropriate control sequences. An
array of interest is the Human Exon 1.0 ST Array (HuEx 1.0 ST, Affymetrix, Inc., Santa Clara,
CA.).
Typically the polynucleotide probes are attached to a solid substrate and are ordered so that the
location (on the substrate) and the identity of each are known. The polynucleotide probes can be
attached to one of a variety of solid substrates capable of withstanding the reagents and
conditions necessary for use of the array. Examples include, but are not limited to, polymers,
such as (poly)tetrafluoroethylene, (poly)vinylidenedifluoride, polystyrene, polycarbonate,
polypropylene and polystyrene; ceramic; silicon; silicon dioxide; modified silicon; (fused) silica,
quartz or glass; functionalized glass; paper, such as filter paper; diazotized cellulose;
nitrocellulose filter; nylon membrane; and polyacrylamide gel pad. Substrates that are
transparent to light are useful for arrays that will be used in an assay that involves optical
detection.
Examples of array formats include membrane or filter arrays (for example, nitrocellulose, nylon
arrays), plate arrays (for example, multiwell, such as a 24-, 96-, 256-, 384-, 864- or 1536-well,
microtitre plate arrays), pin arrays, and bead arrays (for example, in a liquid "slurry"). Arrays on
substrates such as glass or ceramic slides are often referred to as chip arrays or "chips." Such
arrays are well known in the art. In one embodiment of the present invention, the Prostate Cancer
Prognosticarray is a chip.
DATA ANALYSIS
Array data can be managed and analyzed using techniques known in the art. The Genetrix suite
of tools can be used for microarray analysis (Epicenter Software, Pasadena, CA). Probe set
modeling and data pre-processing can be derived using the Robust Multi-Array (RMA)
algorithm or variant GC-RMA, Probe Logarithmic Intensity Error (PLIER) algorithm or variant
iterPLIER. Variance or intensity filters can be applied to pre-process data using the RMA
algorithm, for example by removing target sequences with a standard deviation of < 10 or a
mean intensity of < 100 intensity units of a normalized data range, respectively.
In some embodiments, one or more pattern recognition methods can be used in analyzing the
expression level of target sequences. The pattern recognition method can comprise a linear
combination of expression levels, or a nonlinear combination of expression levels. In some
embodiments, expression measurements for RNA transcripts or combinations of RNA transcript
levels are formulated into linear or non-linear models or algorithms (i.e., an 'expression
signature') and converted into a likelihood score. This likelihood score indicates the probability
that a biological sample is from a patient who will exhibit no evidence of disease, who will
exhibit systemic cancer, or who will exhibit biochemical recurrence. The likelihood score can be
used to distinguish these disease states. The models and/or algorithms can be provided in
machine readable format, and may be used to correlate expression levels or an expression profile
with a disease state, and/or to designate a treatment modality for a patient or class of patients.
Thus, results of the expression level analysis can be used to correlate increased expression of
RNAs corresponding to SEQ ID NOs: 1-21 14, or subgroups thereof as described herein, with
prostate cancer outcome, and to designate a treatment modality.
Factors known in the art for diagnosing and/or suggesting, selecting, designating, recommending
or otherwise determining a course of treatment for a patient or class of patients suspected of
having prostate cancer can be employed in combination with measurements of the target
sequence expression. These techniques include cytology, histology, ultrasound analysis, MRI
results, CT scan results, and measurements of PSA levels.
Certified tests for classifying prostate disease status and/or designating treatment modalities are
also provided. A certified test comprises a means for characterizing the expression levels of one
or more of the target sequences of interest, and a certification from a government regulatory
agency endorsing use of the test for classifying the prostate disease status of a biological sample.
In some embodiments, the certified test may comprise reagents for amplification reactions used
to detect and/or quantitate expression of the target sequences to be characterized in the test. An
array of probe nucleic acids can be used, with or without prior target amplification, for use in
measuring target sequence expression.
The test is submitted to an agency having authority to certify the test for use in distinguishing
prostate disease status and/or outcome. Results of detection of expression levels of the target
sequences used in the test and correlation with disease status and/or outcome are submitted to the
agency. A certification authorizing the diagnostic and/or prognostic use of the test is obtained.
Also provided are portfolios of expression levels comprising a plurality of normalized expression
levels of the target sequences described herein, including SEQ ID NOs: 1-21 14. Such portfolios
may be provided by performing the methods described herein to obtain expression levels from an
individual patient or from a group of patients. The expression levels can be normalized by any
method known in the art; exemplary normalization methods that can be used in various
embodiments include Robust Multichip Average (RMA), probe logarithmic intensity error
estimation (PLIER), non-linear fit (NLFIT) quantile-based and nonlinear normalization, and
combinations thereof. Background correction can also be performed on the expression data;
exemplary techniques useful for background correction include mode of intensities, normalized
using median polish probe modeling and sketch-normalization.
In some embodiments, portfolios are established such that the combination of genes in the
portfolio exhibit improved sensitivity and specificity relative to known methods. In considering
a group of genes for inclusion in a portfolio, a small standard deviation in expression
measurements correlates with greater specificity. Other measurements of variation such as
correlation coefficients can also be used in this capacity. The invention also encompasses the
above methods where the specificity is at least about 50% or at least about 60%. The invention
also encompasses the above methods where the sensitivity is at least about 90%.
The gene expression profiles of each of the target sequences comprising the portfolio can fixed
in a medium such as a computer readable medium. This can take a number of forms. For
example, a table can be established into which the range of signals (e.g., intensity measurements)
indicative of disease or outcome is input. Actual patient data can then be compared to the values
in the table to determine the patient samples diagnosis or prognosis. In a more sophisticated
embodiment, patterns of the expression signals (e.g., fluorescent intensity) are recorded digitally
or graphically.
The expression profiles of the samples can be compared to a control portfolio. If the sample
expression patterns are consistent with the expression pattern for a known disease or disease
outcome, the expression patterns can be used to designate one or more treatment modalities. For
patients with test scores consistent with systemic disease outcome after prostatectomy, additional
treatment modalities such as adjuvant chemotherapy (e.g., docetaxel, mitoxantrone and
prednisone), systemic radiation therapy (e.g., samarium or strontium) and/or anti-androgen
therapy (e.g., surgical castration, finasteride, dutasteride) can be designated. Such patients would
likely be treated immediately with anti-androgen therapy alone or in combination with radiation
therapy in order to eliminate presumed micro-metastatic disease, which cannot be detected
clinically but can be revealed by the target sequence expression signature. Such patients can also
be more closely monitored for signs of disease progression. For patients with test scores
consistent with PSA or NED, adjuvant therapy would not likely be recommended by their
physicians in order to avoid treatment-related side effects such as metabolic syndrome (e.g.,
hypertension, diabetes and/or weight gain) or osteoporosis. Patients with samples consistent
with NED could be designated for watchful waiting, or for no treatment. Patients with test
scores that do not correlate with systemic disease but who have successive PSA increases could
be designated for watchful waiting, increased monitoring, or lower dose or shorter duration anti-
androgen therapy.
Target sequences can be grouped so that information obtained about the set of target sequences
in the group can be used to make or assist in making a clinically relevant judgment such as a
diagnosis, prognosis, or treatment choice.
A patient report is also provided comprising a representation of measured expression levels of a
plurality of target sequences in a biological sample from the patient, wherein the representation
comprises expression levels of target sequences corresponding to any one, two, three, four, five,
six, eight, ten, twenty, thirty, fifty or more of the target sequences depicted in SEQ ID NOs: 1-
2 114, or of the subsets described herein, or of a combination thereof. In some embodiments, the
representation of the measured expression level(s) may take the form of a linear or nonlinear
combination of expression levels of the target sequences of interest. The patient report may be
provided in a machine (e.g., a computer) readable format and/or in a hard (paper) copy. The
report can also include standard measurements of expression levels of said plurality of target
sequences from one or more sets of patients with known disease status and/or outcome. The
report can be used to inform the patient and/or treating physician of the expression levels of the
expressed target sequences, the likely medical diagnosis and/or implications, and optionally may
recommend a treatment modality for the patient.
Also provided are representations of the gene expression profiles useful for treating, diagnosing,
prognosticating, and otherwise assessing disease. In some embodiments, these profile
representations are reduced to a medium that can be automatically read by a machine such as
computer readable media (magnetic, optical, and the like). The articles can also include
instructions for assessing the gene expression profiles in such media. For example, the articles
may comprise a readable storage form having computer instructions for comparing gene
expression profiles of the portfolios of genes described above. The articles may also have gene
expression profiles digitally recorded therein so that they may be compared with gene expression
data from patient samples. Alternatively, the profiles can be recorded in different
representational format. A graphical recordation is one such format. Clustering algorithms can
assist in the visualization of such data.
KITS
Kits for performing the desired method(s) are also provided, and comprise a container or housing
for holding the components of the kit, one or more vessels containing one or more nucleic
acid(s), and optionally one or more vessels containing one or more reagents. The reagents
include those described in the composition of matter section above, and those reagents useful for
performing the methods described, including amplification reagents, and may include one or
more probes, primers or primer pairs, enzymes (including polymerases and ligases), intercalating
dyes, labeled probes, and labels that can be incorporated into amplification products.
In some embodiments, the kit comprises primers or primer pairs specific for those subsets and
combinations of target sequences described herein. At least two, three, four or five primers or
pairs of primers suitable for selectively amplifying the same number of target sequence- specific
polynucleotides can be provided in kit form. In some embodiments, the kit comprises from five
to fifty primers or pairs of primers suitable for amplifying the same number of target sequence-
representative polynucleotides of interest.
The primers or primer pairs of the kit, when used in an amplification reaction, specifically
amplify at least a portion of a nucleic acid depicted in one of SEQ ID NOs: 1-21 14 (or subgroups
thereof as set forth herein), an RNA form thereof, or a complement to either thereof. The kit
may include a plurality of such primers or primer pairs which can specifically amplify a
corresponding plurality of different nucleic acids depicted in one of SEQ ID NOs: 1-21 14 (or
subgroups thereof as set forth herein), RNA forms thereof, or complements thereto. At least two,
three, four or five primers or pairs of primers suitable for selectively amplifying the same
number of target sequence- specific polynucleotides can be provided in kit form. In some
embodiments, the kit comprises from five to fifty primers or pairs of primers suitable for
amplifying the same number of target sequence-representative polynucleotides of interest.
The reagents may independently be in liquid or solid form. The reagents may be provided in
mixtures. Control samples and/or nucleic acids may optionally be provided in the kit. Control
samples may include tissue and/or nucleic acids obtained from or representative of prostate
tumor samples from patients showing no evidence of disease, as well as tissue and/or nucleic
acids obtained from or representative of prostate tumor samples from patients that develop
systemic prostate cancer.
The nucleic acids may be provided in an array format, and thus an array or microarray may be
included in the kit. The kit optionally may be certified by a government agency for use in
prognosing the disease outcome of prostate cancer patients and/or for designating a treatment
modality.
Instructions for using the kit to perform one or more methods of the invention can be provided
with the container, and can be provided in any fixed medium. The instructions may be located
inside or outside the container or housing, and/or may be printed on the interior or exterior of any
surface thereof. A kit may be in multiplex form for concurrently detecting and/or quantitating
one or more different target polynucleotides representing the expressed target sequnces.
DEVICES
Devices useful for performing methods of the invention are also provided. The devices can
comprise means for characterizing the expression level of a target sequence of the invention, for
example components for performing one or more methods of nucleic acid extraction,
amplification, and/or detection. Such components may include one or more of an amplification
chamber (for example a thermal cycler), a plate reader, a spectrophotometer, capillary
electrophoresis apparatus, a chip reader, and or robotic sample handling components. These
components ultimately can obtain data that reflects the expression level of the target sequences
used in the assay being employed.
The devices may include an excitation and/or a detection means. Any instrument that provides a
wavelength that can excite a species of interest and is shorter than the emission wavelength(s) to
be detected can be used for excitation. Commercially available devices can provide suitable
excitation wavelengths as well as suitable detection components.
Exemplary excitation sources include a broadband UV light source such as a deuterium lamp
with an appropriate filter, the output of a white light source such as a xenon lamp or a deuterium
lamp after passing through a monochromator to extract out the desired wavelength(s), a
continuous wave (cw) gas laser, a solid state diode laser, or any of the pulsed lasers. Emitted
light can be detected through any suitable device or technique; many suitable approaches are
known in the art. For example, a fluorimeter or spectrophotometer may be used to detect
whether the test sample emits light of a wavelength characteristic of a label used in an assay.
The devices typically comprise a means for identifying a given sample, and of linking the results
obtained to that sample. Such means can include manual labels, barcodes, and other indicators
which can be linked to a sample vessel, and/or may optionally be included in the sample itself,
for example where an encoded particle is added to the sample. The results may be linked to the
sample, for example in a computer memory that contains a sample designation and a record of
expression levels obtained from the sample. Linkage of the results to the sample can also
include a linkage to a particular sample receptacle in the device, which is also linked to the
sample identity.
The devices also comprise a means for correlating the expression levels of the target sequences
being studied with a prognosis of disease outcome. Such means may comprise one or more of a
variety of correlative techniques, including lookup tables, algorithms, multivariate models, and
linear or nonlinear combinations of expression models or algorithms. The expression levels may
be converted to one or more likelihood scores, reflecting a likelihood that the patient providing
the sample will exhibit a particular disease outcome. The models and/or algorithms can be
provided in machine readable format, and can optionally further designate a treatment modality
for a patient or class of patients.
The device also comprises output means for outputting the disease status, prognosis and/or a
treatment modality. Such output means can take any form which transmits the results to a
patient and/or a healthcare provider, and may include a monitor, a printed format, or both. The
device may use a computer system for performing one or more of the steps provided.
CITATIONS
Patents and Published Applications
1. US 2003/0224399 A l patent application Methods for determining the prognosis for patients
with a prostate neoplastic condition 2003- 12-04
2 . US 2007/0048738 A l patent application Methods and compositions for diagnosis, staging
and prognosis of prostate cancer 2007-03-0 1
3 . US 2007/0099197 A l patent application Methods of prognosis of prostate cancer 2007-05-
03
4 . US 2007/0259352 A l patent application Prostate cancer-related nucleic acids 2007-1 1-08
5 . US 2008/0009001 A l patent application Method for Identification of Neoplastic
Transformation with Particular Reference to Prostate Cancer 2008-0 1-10
Publications
1: Cooper CS, Campbell C, Jhavar S . Mechanisms of Disease: biomarkers and molecular targets
from microarray gene expression studies in prostate cancer.
Nat Clin Pract Urol. 2007 Dec;4( 12):677-87. Review.
2 : Reddy GK, Balk SP. Clinical utility of microarray-derived genetic signatures in predicting
outcomes in prostate cancer. Clin Genitourin Cancer. 2006 Dec;5(3): 187-9. Review.
3 : Nelson PS. Predicting prostate cancer behavior using transcript profiles. J Urol. 2004
Nov; 172(5 Pt 2):S28-32; discussion S33. Review.
4 : Bibikova M, Chudin E, Arsanjani A, Zhou L, Garcia EW, Modder J, Kostelec M, Barker D,
Downs T, Fan JB, Wang-Rodriguez J . Expression signatures that correlated with Gleason score
and relapse in prostate cancer. Genomics. 2007 Jun;89(6):666-72. Epub 2007 Apr 24.
5 : Schlomm T, Erbersdobler A, Mirlacher M, Sauter G . Molecular staging of prostate cancer in
the year 2007. World J Urol. 2007 Mar;25( 1): 19-30. Epub 2007 Mar 2 . Review.
6 : Mendiratta P, Febbo PG. Genomic signatures associated with the development, progression,
and outcome of prostate cancer. MoI Diagn Ther. 2007; 11(6):345-54.
7 : Reddy GK, Balk SP. Clinical utility of microarray-derived genetic signatures in predicting
outcomes in prostate cancer. Clin Genitourin Cancer. 2006 Dec;5(3): 187-9. Review.
8 : True L, Coleman I, Hawley S, Huang CY, Gifford D, Coleman R, Beer TM, Gelmann E,
Datta M, Mostaghel E, Knudsen B, Lange P, Vessella R, Lin D, Hood L, Nelson PS. A
molecular correlate to the Gleason grading system for prostate adenocarcinoma. Proc Natl Acad
Sci U S A . 2006 JuI 18; 103(29): 1099 1-6. Epub 2006 JuI 7 .
9 : Stephenson AJ, Smith A, Kattan MW, Satagopan J, Reuter VE, Scardino PT, Gerald
WL. Integration of gene expression profiling and clinical variables to predict
prostate carcinoma recurrence after radical prostatectomy.
Cancer. 2005 JuI 15; 104(2):290-8.
10: Bueno R, Loughlin KR, Powell MH, Gordon GJ. A diagnostic test for prostate cancer from
gene expression profiling data.J Urol. 2004 Feb;171(2 Pt l):903-6.
11: Yu YP, Landsittel D, Jing L, Nelson J, Ren B, Liu L, McDonald C, Thomas R, Dhir R,
Finkelstein S, Michalopoulos G, Becich M, Luo JH. Gene expression alterations in prostate
cancer predicting tumor aggression and preceding development of malignancy. J Clin Oncol.
2004 JuI 15;22(14):2790-9.
12: Feroze-Merzoug F, Schober MS, Chen YQ. Molecular profiling in prostate cancer. Cancer
Metastasis Rev. 200 1;20(3-4): 165-7 1. Review.
13: Nakagawa T, Kollmeyer TM, Morlan BW, Anderson SK, Bergstralh EJ, Davis BJ, Asmann
YW, Klee GG, Ballman KV, Jenkins RB. A tissue biomarker panel predicting systemic
progression after PSA recurrence post-definitive prostate cancer therapy. PLoS ONE. 2008;
3(5):e2318.
14: Shariat SF, Karakiewicz PI, Roehrborn CG, Kattan MW. An updated catalog of prostate
cancer predictive tools. Cancer 2008; 113(1 1): 3062-6.
EXAMPLES
To gain a better understanding of the invention described herein, the following examples are set
forth. It will be understood that these examples are intended to describe illustrative embodiments
of the invention and are not intended to limit the scope of the invention in any way. Efforts have
been made to ensure accuracy with respect to numbers used (e.g., amounts, temperature, etc.) but
some experimental error and deviation should be accounted for. Unless otherwise indicated,
parts are parts by weight, temperature is degree centigrade and pressure is at or near atmospheric,
and all materials are commercially available.
EXAMPLE 1. IDENTIFICATION O F TARGET SEQUENCES DIFFERENTIALLY EXPRESSED IN PROSTATE
DISEASE STATES
Tissue Samples. Formalin-fixed paraffin embedded (FFPE) samples of human prostate
adenocarcinoma prostatectomies were collected from patients at the Mayo Clinic Comprehensive
Cancer Center according to an institutional review board-approved protocol and stored in the
Department of Pathology for up to 20 years. For each patient sample four 4 micron sections were
cut from formalin-fixed paraffin embedded blocks. Pathological review of FFPE tissue sections
was used to guide macrodissection of tumor and surrounding normal tissue. Patients were
classified into one of three clinical disease states; no evidence of disease (NED, n=10) for those
patients with no biochemical or other clinical signs of disease progression (at least 10 years
follow-up); prostate-specific antigen biochemical recurrence (PSA, n=10) for those patients with
two successive increases in PSA measurements above an established cut-point of >4 ng/mL
('rising PSA'); and systemic disease (SYS, n=10) for those patients that had 'rising PSA' and
developed metastases or clinically detectable disease progression within five years after initial
prostatectomy. Clinical disease was confirmed using bone or CT scans for prostate cancer
metastases.
RNA Extraction. RNA was extracted and purified from FFPE tissue sections using a modified
protocol for the commercially available High Pure FFPE RNA Micro nucleic acid extraction kit
(Roche Applied Sciences, Indianapolis, IN). RNA concentrations were calculated using a
Nanodrop ND-K)OO spectrophotometer (Nanodrop Technologies, Rockland, DE).
RNA Amplification and GeneChip Hybridization. Purified RNA was subjected to whole-
transcriptome amplification using the WT-Ovation FFPE system including the WT-Ovation
Exon and FL-Ovation Biotin V2 labeling modules, with the following modifications. Fifty (50)
nanograms of RNA extracted from FFPE sections was used to generate amplified Ribo-SPIA
product. For the WT-Ovation Exon sense-target strand conversion kit 4 ug of Ribo-SPIA product
were used. All clean-up steps were performed with RNAClean magnetic beads (Agencourt
Biosciences). Between 2.5 and 5 micrograms of WT-Ovation Exon product were used to
fragment and label using the FL-Ovation Biotin V2 labeling module and labeled product was
hybridized to Affymetrix Human Exon 1.0 ST GeneChips following manufacturer's
recommendations (Affymetrix, Santa Clara, CA). Of the 30 samples processed, 22 had sufficient
amplified material (i.e., >2.5 ug of WT-Ovation Exon product) for GeneChip hybridization.
Microarray Analysis. All data management and analysis was conducted using the Genetrix
suite of tools for microarray analysis (Epicenter Software, Pasadena, CA). Probe set modeling
and data pre-processing were derived using the Robust Multi-Array (RMA) algorithm. The
mode of intensity values was used for background correction and RMA-sketch was used for
normalization and probe modeling used a median polish routine. A variance filter was applied to
data pre-processed using the RMA algorithm, by removing target sequences with a mean
intensity of < 10 intensity units of a normalized data range. Target sequences typically comprise
four individual probes that interrogate the expression of RNA transcripts or portions thereof.
Target sequence annotations and the sequences (RNAs) that they interrogate were downloaded
from the Affymetrix website (www.netaffx.com). Supervised analysis of differentially expressed
RNA transcripts was determined based on the fold change in the average expression (at least 2
fold change) and the associated t-test, with a p-value cut-off of p<0.001 between different
prostate cancer patient disease states. Linear regression was also used to screen differentially
expressed transcripts that displayed an expression pattern of NED>PSA>SYS or
SYS>PSA>NED and genes were selected with a p-value cut-off of p<0.01 for two-way
hierarchical clustering using Pearson's correlation distance metric with complete-linkage cluster
distances.
Archived FFPE blocks of tumors were selected from 30 patients that had undergone a
prostatectomy at the Mayo Clinic Comprehensive Cancer Center between the years 1987- 1997,
providing for at least 10 years follow-up on each patient. Twenty-two patient samples had RNA
of sufficient quantity and quality for RNA amplification and subsequent GeneChip hybridization.
Three clinical categories of patients were evaluated; patients alive with no evidence of disease
('NED', n=6), patients with rising PSA or biochemical recurrence (defined as two successive
increases in PSA measurements) ('PSA', n=7) and patients with rising PSA and clinical evidence
of systemic or recurrent disease (e.g., determined by bone scan, CT) ('SYS', n=9) after
prostatectomy. No statistically significant differences between these three clinical groups were
apparent when considering pathological factors such as Gleason score or tumor stage (Table 1).
As samples from older archived FFPE blocks are typically more degraded and fragmented than
younger blocks, the distribution of block ages was similar in the three clinical groups so as not to
skew or bias the results due to a block age effect. Fifty nanograms of RNA extracted from FFPE
sections was amplified and hybridized to whole-transcriptome microarrays, interrogating > 1.4
million probe target sequences measuring RNA levels for RefSeq, dbEST and predicted
transcripts (collectively, 'RNAs').
Table 3 displays the number of target sequences identified in two-way comparisons between
different clinical states using the appropriate t-tests and a p-value cut-off of p<0.001 . At total of
2,1 14 target sequences (Table 3) were identified as differentially expressed in these comparisons
and a principle components analysis demonstrates that these target sequences discriminate the
distinct clinical states into three clusters (Figure IA).
A linear regression filter was next employed to statistically rank target sequences that followed a
trend of either increased expression with poor prognosis patients (i.e., SYS>PSA>NED) or
increased expression in good prognosis patients (NED>PSA>SYS, alternatively decreased
expression in poor prognosis patients) (Table 4). Figure IB depicts a two-way hierarchical
clustering dendrogram and expression matrix of top-ranked 526 target sequences and 22 tumor
samples. Patients in the 'PSA' clinical status category displayed intermediate expression levels
for genes expressed at increased levels in SYS (n=3 13) and NED (n=213), respectively (Table
4). Figure 1C depicts a two-way hierarchical clustering dendrogram and expression matrix of
148 target sequences and 22 tumor samples. These target sequences were a subset of the
differentially expressed transcripts (Table 3) filtered using a t-test to query 'recurrent' (i.e.,
'SYS') and 'non-recurrent' (i.e., 'PSA' and 'NED') patient samples (Table 5).
The expression levels of these genes were summarized for each patient into a 'metagene' using a
simple linear combination by taking the expression level and multiplying it by a weighting factor
for each target sequence in the metagene signature and combining these values into a single
variable. Weighting factors were derived from the coefficients of the linear regression fit analysis
(Table 4). Figure 2 shows a histogram plot of the metagene expression values for the
summarized 526 target sequences in each of the three clinical groups. This 526-metagene
achieved maximal separation between clinical groups and low variance within each clinical
group. Metagenes comprised of smaller subsets of 21, 18 and 6 target sequences were also
generated (Figure 3, Tables 7 and 8). The distinctions between clinical groups with respect to the
metagene scores were preserved, although increased within-group variance was observed when
using fewer target sequences (Figure 3).
Next, Patient outcome predictor ('POP') scores were generated from the metagene values for
each patient. For the 18-target sequences metagene, this entailed scaling and normalizing the
metagene scores within a range of 0 to 100, where a value of between 0-20 points indicates a
patient with NED, 40-60 points a patient with PSA recurrence and 80-100 points a patient with
SYS metastatic disease (Figure 4). In contrast, Gleason scores for patients could not be used on
their own to distinguish the clinical groups (Table 1).
Using the Nearest Shrunken Centroids (NSC) algorithm with leave- 1-out cross-validation,
smaller subsets of RNA transcripts were identified that distinguish 'recurrent' (i.e., 'SYS') and
'non-recurrent' (i.e., 'PSA' and 'NED') disease (Tables 9 and 10). NSC algorithm identified 10-
and 4 1-target sequence metagenes used to derive patient outcome predictor scores scaled and
normalized on a data range of 0-100 points. Figures 5 and 6 depict box plots showing
interquartile range and distribution of 'POP' scores for each clinical group. A 148-target
sequence metagene (Table 5) was similarly used to derive 'POP' scores depicted in Figure 7 . T-
tests were used to evaluate the statistical significance of differences in POP scores between
'recurrent' (i.e., 'SYS') and non-recurrent (i.e., 'PSA' and 'NED') patient groups (indicated in
the figures) and show that increasing the number of target sequences in the metagene
combination increases the significance level of the differences in POP scores.
The data generated from such methods can be used to determine a prognosis for disease
outcome, and/or to recommend or designate one or more treatment modalities for patients, to
produce patient reports, and to prepare expression profiles.
Table 1. Clinical characteristics of different clinical status patient groups evaluated. Note Chi
square tests for homogeneity reveal that the three clinical groups do not show significant
differences in terms of patient composition based on known prognostic variables such as
pathological TNM stage or Gleason score. Also, the block age of the samples was not different
between clinical status groups; so that sample archive age effect is mitigated (i.e., older samples
have more degraded, fragmented nucleic acids that could skew or bias results if not evenly
distributed in clinical status patient groups).
NED PSA Systemic X2 Tests for(n=6) (n=7) (n=9) Homegeneity
PathologicalStage T2N0 2 3 3 jx θ.07
T3aN0 1 4 1T3bN0 0 0 3TxN+ 3 0 2
Gleason Score 7 2 7 4 jx θ.068 3 0 2
9 1 0 3
10-14 3 4 5Block Age jx θ.913-15 1 1 2
Group (years)16-20 2 2 2
Table 2 . Comparison of differential expression between patient clinical status groups. Two-way
comparisons for differential expression using the following statistical criteria: a) at least 2-fold
mean difference in expression between comparison groups; b) expression levels > 50 intensity
units of a normalized data range (approximately the mean expression level across all transcripts
in all samples) and c) significance cut-off of p<0.001 determined using a t-test.
ClinicalComparison Differentially Expressed RNAs
Status
NED vs PSA 316PSA vs NED 442NED vs SYS 213PSA vs SYS 194SYS vs NED 269SYS vs PSA 323SYS vs NED & PSA 310
NED & PSA vs SYS 77
Differential Expression is indicated by at least 2-fold change expression betweenmean of clinical status variable and comparison variable; also mean value > 50intensity units of a normalized data range and p<0.001 calculated using a t-test
Tab
le3
.D
iffe
rent
ially
expr
esse
dR
NA
tran
scri
pts
iden
tifie
dfr
omco
mpa
riso
nte
sts
desc
ribe
din
Tab
le2
.Se
quen
celis
tings
are
anno
tate
dw
ithth
e
Aff
ymet
rix
Hum
anE
xon
1.0
ST
prob
ese
lect
ion
regi
onID
,pr
oxim
alan
nota
ted
gene
from
Ref
Seq,
and
over
lap
with
codi
ngse
quen
ce(C
DS)
.
SEQ
Aff
y.ID
Gen
eC CD
DSS
RN
ASe
quen
ceID
Sym
bol
No
135
0927
8—
NO
ctgg
tccc
tctc
ctga
taga
gtttc
aggt
ttgcc
ctga
tgat
ctag
atga
agca
aagt
gtgg
tgac
acttc
gctg
aatg
ctct
gtca
gtgt
gcct
agaa
atag
agtc
tcta
tcag
ggcc
gtttg
cttc
ctat
caca
ctc
227
5044
0N
Ogg
catg
atct
aggc
taac
tccc
tggc
334
9866
5__
_N
Otg
ggca
agag
ctttt
gtat
gtttc
cag
431
5953
2—
NO
ccca
acac
ggtg
cagc
cgat
ttattt
atttt
tccc
tcag
cattt
ttagg
tgga
ttgac
tggg
atgc
tttat
tcaa
ctca
ggga
cgca
ccac
gaat
atgt
tttttg
535
2160
0U
GC
GL
YE
Stg
gcgc
cagc
gaaa
gcca
cgaa
cgtg
2
638
2059
7—
NO
gctg
ggcg
cagc
gttc
tgag
ggga
tgtg
gggt
ctgg
gagg
tgtc
tcga
ggtg
agag
ctcc
aagt
cacg
g
739
5534
9N
Ogt
ggcg
caat
attg
caat
acag
ctca
ctgc
taca
gccc
cagg
ctta
agtg
atct
tcttg
cctc
agtc
tccc
gagt
agct
ggaa
ccac
aatc
tcat
acca
ccac
gccc
cacg
cctg
gtaa
atttt
taaa
tgttt
ttgta
aaga
cagg
gtac
tgct
gtgt
tgcc
cagg
ctgg
tctc
taat
tcct
agtc
tcaa
gcga
tcct
tccg
ctttg
gcc
tccc
aaag
cact
ggga
ttaca
ggtg
tgag
ccac
tgtg
ccca
gccc
tgct
ctag
gtttt
cattt
ggat
ttgct
gcct
agtg
gaag
gcac
agga
tggg
gcag
tgcc
ttctg
ccag
tgag
ggag
gctc
cagg
taga
tgtc
attg
ctga
actg
gagc
tccc
ctgg
832
0452
3PI
GO
YE
Stg
cttc
ctct
tcta
cgct
ggca
ttgcc
ctct
tcac
cagt
ggct
tcct
gctc
accc
gtttg
gagc
tcac
caac
cata
gcag
ctgc
caag
agcc
ccca
ggcc
ctg
ggtc
cctg
ccat
gggg
gagc
caag
ggaa
acct
gggg
cctg
ctgg
atgg
cttc
ccga
ttttc
gcgg
gttg
tgttg
gtgc
tgat
agat
gctc
tgcg
atttg
actt
cgcc
cagc
ccca
gcat
tcac
acgt
gcct
agag
agcc
tcct
gtct
ccct
accc
ttcct
gggc
aaac
taag
ctcc
ttgca
gagg
atcc
tgga
gattc
agcc
cca
ccat
gccc
ggct
ctac
cgat
ctca
ggtt
933
7389
9SL
C43
AN
Oag
ccat
ggcc
gtag
attta
taaa
tacc
aaga
gaag
ttcta
tttttg
taaa
gact
gcaa
aaag
gagg
aaaa
aaaa
ccttc
aaaa
acgc
cccc
taag
tcaa
cgct
1cc
attg
actg
aaga
cagt
ccct
atcc
taga
gggg
ttgag
ccttc
ttcct
ccttg
ggttg
gagg
agac
cagg
gtgc
ctct
tatc
tcct
tcta
gcgg
tctg
cctc
ctgg
tacc
tcttg
gggg
gatc
ggca
aaca
ggct
accc
ctga
ggtc
ccat
gtgc
catg
agtg
tgca
caca
tgca
tgtg
tctg
tgta
tgtg
tgaa
tgtg
agag
aga
caca
gccc
tcct
ttc
1037
0764
5R
AB
EP
YE
Sat
ggcg
cagc
cggg
cccg
gcttc
cc1
11
3860
354
LO
ClO
ON
Ottc
agat
tcac
gagg
gtaa
tcca
gatg
aagg
tata
tcat
tgta
cctg
gccc
gcat
cact
cagt
aact
gtca
ccttt
gattt
attg
attta
cttg
agat
ggag
tttgg
t12
7980
tctg
tccc
cgag
gccg
gagt
gcaa
tggc
gtga
tctc
tgct
cact
acaa
cctc
tgcc
tccc
aggt
tcag
agtg
attc
tcct
gcct
cagc
ctct
ttagc
tggg
att
acag
gcac
ccac
caac
atgc
ccag
ctaa
tttttg
tattt
ttagt
agag
agga
gcttt
cacc
ataa
tggc
cagg
ctgg
tcttg
aacc
tcaa
atga
tctg
ccca
cct
tggc
ctcc
caaa
gtgc
tagg
atta
cagg
ctgg
agct
acag
agcc
cagc
ctgt
cacc
ttgat
ttaaa
tgaa
tgca
gcttt
cttg
gtgt
cttg
atgt
ttgtg
aattt
ctgt
ggag
tttga
tact
gctc
ctttt
gctg
cttg
tttttc
caga
gtcc
cctta
tcac
ttgac
gtgc
tgtta
ta
1224
0677
2M
RPS
lN
Otg
tcgt
gaca
gcct
ctct
ttggg
gcca
gcttc
tgct
tttgc
cccc
atct
ttgca
gtac
aggg
ggta
aatta
aaca
agag
gatg
cctg
aatg
aacg
atat
cctg
g5
gttc
ttgag
agac
aagt
ggga
gctg
ataa
ttctg
aaaa
ttcat
tagt
caaa
gcat
ggag
ataa
aggt
ggca
gcag
gaag
ggga
gagg
caag
gagt
agac
ccg
tgac
agttt
taga
atct
tattt
gtgc
caaa
atac
tttac
tgca
ttggc
ttgga
cctc
taat
acaa
tgttg
aattg
ttaac
catg
atag
cact
gtat
cctg
gtct
aatt
cctg
aattg
aatg
gcta
gtct
tacc
atta
agaa
tgct
atttg
cggc
cagg
c
1338
8132
4H
Ml
3Y
ES
ttatg
aattt
gaca
ccaa
ggac
ctgg
tgtg
cctg
ggcc
tgag
cagc
atcg
ttggc
gtct
ggta
cctg
ctga
g14
3271
321
—N
Oag
gagc
aggt
cgtc
aagt
ccac
cagg
atgg
aatg
ctca
gtag
gaga
aattg
cgct
gtgg
ggcc
atttg
aggc
gcct
gtcc
atgc
gggt
ccgt
cctg
gccc
ttctg
aa
1533
1637
4—
NO
aggc
atta
cgcc
agct
cccg
gatg
cctc
agcc
tcgt
gaat
tcgg
ggta
ggac
gctc
aggg
ccca
tggt
cagc
acag
cggg
tggg
tggg
tttca
gcgt
ggt
cact
tctc
cagg
ggtg
cata
attg
agca
gcttc
tcga
tcag
gtcc
acgt
g
1633
3369
5—
NO
gttg
agga
gcttg
atgc
tgca
gtga
gattg
tgcc
actg
cact
ccag
catg
ggtg
acat
agca
agac
tgcc
tcag
gggg
aaaa
aaaa
aaaa
aaac
acca
aaaa
aaaa
ggta
tatg
gacc
ctag
ttttg
gcca
ggct
g
1739
3085
5...
NO
gtgg
ctca
ctgt
agcg
tcag
tctc
c18
2945
934
CM
AH
NO
ggac
gattc
gtttc
tattt
gacc
caag
actg
atgc
tgcg
tggt
gaag
gaga
ggaa
ctgc
gttc
cagg
caga
ggaa
gtcc
agat
gcgc
aaga
gcag
agat
ggg
aggg
acag
caca
gtgg
cttc
tcag
aattg
aata
tggt
gtta
gact
tcta
aaag
agct
gtgg
taga
ggga
gaaa
actg
tcaa
agga
aatc
agta
actg
ttaaa
ggaa
aata
cctta
aacg
tcct
gtttt
cagg
ctga
ctaa
tgcc
ttagg
ggag
caaa
tgag
caac
a
1925
8140
5N
Ottg
gtat
gtag
ggttc
gtat
gaat
gaaa
cgtg
acta
ccag
ggca
gata
atga
aagg
ctttc
aatc
ccag
agta
agag
atat
aggg
ccag
gc20
2465
840
...N
Oat
gtgg
caca
attta
cgaa
aacc
aaag
agag
gcta
tgct
gagt
ccct
acac
cacc
aaga
ccgt
gatg
ttctg
ccca
ctcg
cctc
ctga
tggc
ctgt
gggg
atag
acca
gcttc
caga
tttca
caca
gatg
ggct
gtat
ggag
gagg
c
21
3431
367
...N
Ocg
atct
ctca
tggt
gggc
tcaa
gtga
tcct
ccca
tctc
agcc
tccc
aagc
acct
ggga
ctac
aggt
gcac
atca
acac
accc
ggat
aatt
2234
5811
4N
AC
AN
Ogg
gcca
ctgt
tcct
caag
catc
taaa
gggc
ttcca
gcaa
agaa
aggc
ccca
cagc
tctg
aaag
aagt
acttg
ttgcc
ccag
ctcc
agaa
agca
cgcc
aat
catc
acag
ctcc
cact
cgga
aagg
tcca
caga
ccaa
aaag
agttc
tgct
acttc
acct
ccta
tatg
ccca
gatc
cctc
agct
aaga
atgg
2338
7525
4...
NO
ttctc
cctc
agac
caag
ttaaa
ctgt
tttttt
ccca
agag
ctag
gaat
cata
tctg
atca
cact
ggga
cttc
ccttc
actg
tcag
tcta
agag
tttgc
atttt
tgag
gttta
ggaa
aagg
gaaa
aaaa
agga
agaa
agaa
agaa
attg
tatc
tgag
aaca
aaga
agct
gcca
catg
gtcc
ctcg
gtat
ca24
2515
646
ITG
A6
YE
Stg
tgct
cacc
gata
tgaa
aaaa
ggca
gcat
gtta
atac
gaag
cagg
aatc
ccga
gaca
tcttt
gggc
ggtg
ttatg
tcct
gagt
caga
atct
cagg
attg
aag
acga
tatg
gatg
25
3202
904
—N
Otg
aggg
cgttg
gacc
acac
accc
ttgaa
aaag
gcct
cgcc
ccgc
tcat
ctct
gaag
atct
tcct
ccaa
cagt
cgac
ggtg
cccg
tgta
catg
atgt
cagc
tcct
ttgcg
ccgg
gact
gcat
catc
gtcc
gccg
ccgc
actg
tgtc
gaag
ggtta
g
2627
6830
4N
FXL
lY
ES
ctgt
ccac
cttg
tgat
caaa
actg
tgga
cgga
cttta
g27
3736
631
—N
Oaa
agcg
gtca
ctgg
ccat
atga
tgcg
gaga
tggt
tttat
ctga
ggcc
tgag
aacc
aaca
gatg
tgcc
tgcg
ccct
ggcc
tgtg
cagc
tgtg
ggca
gatg
tgtc
ccca
gggg
cctg
ccct
cact
cgtg
acca
tgca
atac
cctg
caac
ataa
acat
tttct
tttcc
aagg
atct
gcag
gggt
ggac
atga
tgct
ccag
gcac
agag
tagg
aaga
aaag
gggg
tgac
ctgg
ggttc
ccag
caaa
cagg
tcca
cctc
atgc
tcac
tgcg
tt
28
2787
529
INPP
4BY
ES
tgtg
ccct
ggta
tgtg
aatg
taca
gccc
cgga
aagt
gtga
gcgg
aaaa
gata
actta
cct
2929
8018
8__
_N
Ottt
cccc
acaa
attg
ctga
tgta
cc30
3273
235
—N
Oca
atat
tcac
tgtg
aggt
caag
ctcc
taga
ggta
aaaa
ctca
aaac
tgtg
gcag
ccct
cccc
cgtg
actg
ggtc
cctc
tgaa
gtttc
taac
tctc
agag
ttgtc
cata
ctgg
gcat
ctag
cgat
tcat
31
3402
918
USP
5Y
ES
gagg
aaaa
gatc
aagt
gcct
ggcc
acag
agaa
ggtg
aagt
acac
ccag
cgag
ttgac
taca
tcat
gcag
3235
2922
7—
NO
cgaa
gtgg
gtct
ccaa
attc
cgcg
ccca
cccc
accg
cccg
agaa
gccc
acta
cgca
tgcg
tccg
cacc
ccac
cggc
gccc
cttc
ctat
tgag
catg
cgc
ggga
gccc
cacc
tattt
ctct
ctac
cgttt
cctc
cccc
tacc
tggt
accc
catc
ccta
gctc
agcc
attg
ctttt
ttttc
cacg
accc
tccg
ctgt
ttcttc
cgcg
agc
ttcct
33
3839
571
KL
K2
NO
ggag
ggaa
tggc
tgtg
tccc
acag
gaat
aaca
gcgg
gatg
cttc
c34
2697
538
CE
P70
NO
gaga
ctcg
ttggg
tgat
cattt
cattg
agat
caac
ctga
atga
ccag
gtgt
aaag
tgca
agag
taat
atgc
tatg
actg
a3
525
2107
6N
Ogg
atcc
atgc
caga
caac
gcac
attc
tgca
gaca
ctgg
ttact
ccag
tggc
tcct
tact
ggaa
acat
ataa
tatc
agtg
ataa
gtgc
tgtg
aaga
aaat
aaaa
caat
gatg
tgct
ggta
tgga
ttttta
acttt
tttat
gtga
tgaa
ccac
agaa
tgat
ggttt
taaa
tgta
tgaa
atac
atag
aattg
caac
agaa
acca
gtta
tgaa
ataa
tgaa
gata
ttaaa
tatg
acat
ctat
atttt
agta
aagc
atta
gtga
ggac
tgta
aatg
atct
ttaaa
gaat
ttggc
ttaaa
tttaa
tcta
aaat
tgct
atca
ggta
tttc
acat
cgct
gtaa
tttttg
cctg
cattc
gtaa
ctga
agag
ataa
gtaa
atgt
caga
ggtta
agat
aaat
ctttt
tcttt
tttta
cctg
tcca
tattt
acaa
acat
tctg
cgt
tccg
taca
taga
cgcc
tgga
taag
aacc
cctg
tgca
agaa
tgac
tttgg
tgct
acttt
aaaa
aaag
tggt
tggg
gaag
acct
cata
ggaa
atga
catc
aat
aata
gattg
catg
ataa
ccta
tact
agttc
ttact
gtta
ccaa
tttaa
attc
taac
tgct
taaa
agct
aaca
cggt
ccttc
ccttt
ttttg
agca
cctg
atttg
aagt
acttt
gagt
aggc
ctca
agtg
tcgt
atgc
aagt
a
3628
1102
9N
Oct
gagt
tctg
atttg
gagc
agcc
tgtc
gcaa
tacc
aaat
cacc
tttga
ataa
actta
tcct
ctca
gcttt
tattg
ttcag
aattg
atttc
gagg
atag
aaat
tcca
tttcc
ttgtc
atat
atac
ctga
gccg
ttgca
tcaa
attg
tcag
a
3727
3592
0N
Ogg
gtgt
agcg
cacc
aagc
atga
gctg
aa
3838
4376
7Z
NF1
35N
Oaa
taac
ctag
catg
gggc
ggca
ctaa
atgg
ctgc
agga
aagc
cgag
tcttc
ttcca
catc
cggc
ggct
cccc
tcgg
atgc
gagc
gctg
gccc
aggg
tgt
gttta
caga
ggtg
aggg
cttc
ccgt
ggac
ccttc
tcgt
tggg
agcg
ctta
gcct
cagg
agcg
gattc
aggg
caca
ggca
gagg
acgt
ccac
aaac
acca
cagg
aagc
cgcc
accc
aggg
gcgt
ggaa
aggc
ccaa
tgcc
tcgt
ctgg
gattc
acgg
ccgg
caaa
gcgg
cccc
tccg
gaac
ggga
cagc
acag
cggc
tcac
ctct
gcgc
ctct
gggg
gtgc
gggg
ggag
cctc
gccc
tcca
cgct
ctct
gggg
gacc
gccc
gccc
tagc
cccc
gcct
gggc
ttcgc
gggt
gga
cggt
tggg
ggcc
ccgg
gcgc
cccc
cagc
gcgt
agct
tttct
ccttc
tcgt
gggt
cttc
tggt
gctc
ggct
aggg
ccag
gctg
aagt
ggaa
ggtc
ttcca
gcag
ccct
gaca
ggcg
tagc
gcttc
cggc
cgcc
atgg
ctgc
tgtg
gtgc
tcca
tcag
gtgc
gaga
gcca
cgcg
aagg
cctc
cccg
cact
cgcc
gcag
gca
tagg
gcttc
ccgc
ggga
caag
cccg
gctc
gtca
gccc
cagg
gccc
tgcc
cggc
ctcc
agcc
ctgc
gctg
tcgc
cgga
gcta
gaga
cgcc
ctcg
aggc
tct
gccc
gtcc
ccat
cctc
gggg
tgcg
gcct
cttg
gttc
ccag
tttcg
ccgt
gccc
ctgt
ccgg
ggca
ggct
gctg
gatg
acgg
actg
cctc
tgcg
atcc
ggt
ggca
gagt
cgga
ctcc
gcgt
cctg
gggg
tcct
gcgg
gtcc
gggg
ccttc
ccag
gctg
ctct
tcct
cctc
ctca
gtgg
tgcc
cgac
gggg
gatc
ggca
aggg
cgtc
ccca
gggg
gcgc
ctcc
gtag
gcag
ctca
ggca
gcac
cccc
gcgg
gggc
ggcc
tcct
cgca
ggcg
cacc
cagc
gctc
tcct
gccg
cctc
ccgc
cttc
tgga
acaa
ggtc
agag
ccct
agcg
tgag
cgcc
c
3938
9527
5Pr
oSA
PiN
Otg
ggca
ctcg
gcat
tttga
caca
tgtc
ctgt
caaa
aggc
caga
gtcc
ccag
tgtc
ccct
cccc
tcca
tctc
tcttc
ccca
taga
cccc
ataa
cccc
agac
caa
Pl
agag
gttc
tcta
agca
gctg
tgac
cagg
ttcct
ccct
cccc
acct
gccc
tcct
agct
ccag
cact
gccc
ccgt
ggca
gccc
acttg
gacc
cccc
taaa
agg
aggg
aata
ggag
gagg
gcag
ggtg
agtg
gggg
caat
ccta
ggtg
gtgg
ggga
gtca
tgct
ccct
ttctc
ggca
cccc
cttg
ttgga
gatg
gagg
cagc
aga
cgtg
cagt
gcca
taag
gtgc
ccca
gtcc
ttctg
gagg
cctg
ggct
gcta
ctgt
tggc
cacc
ctgt
gtct
agtg
atgc
tctc
tgtg
ctca
cctc
ctag
gcca
tgg
agcc
tgag
gggg
cctg
cacc
aggt
ttgct
gaaa
ctga
caga
gcct
gggc
tcca
gacc
tctc
tccc
tcct
acag
tgct
ctcc
ctcc
ctgg
gcag
attg
gcag
gaca
agtg
ggag
caga
tggc
ctgc
ctttg
gctg
agag
ggct
acct
gccc
agcc
cctc
cccc
aaca
agat
ctct
tgga
ctca
ggcc
tcag
agcc
tggc
ctgg
ttgtg
agtg
tgtg
tccc
tgtg
tgtg
tgttg
cggg
aggg
gagg
actg
gggc
tgga
agtc
cagc
accc
aggg
aaga
tctg
tcct
cctg
ttcttg
ggaa
gcgt
tgcc
tgac
ggct
tctc
ggct
ctac
cctc
accc
ttctg
gcca
ggat
cccg
cagg
gcaa
cagc
ccca
tctg
cttg
gctg
accc
caca
ccca
ggac
cact
gtcc
ggct
ctaa
caca
gcta
ttaag
tgct
acct
gcct
ctca
ggca
ctct
cctc
gccc
agttt
ctga
ggtc
agac
gagt
gtct
gcga
tgtc
ttccc
gcac
tcta
ttcc
ccca
gcct
ctttc
tgct
ttcat
gctc
agca
catc
atct
tcct
aggc
agtc
tcttc
ccca
aagt
ctca
ccttt
tcttc
caat
agaa
aattc
cgct
tgac
ctttg
gtgc
a
4040
3109
8C
Yor
fl5
NO
tcac
ttaac
atag
tgcc
aggg
catg
caat
tttgt
ttctta
ctct
ctgg
atgt
ggga
tatg
cgag
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgta
ttaag
ctttc
tgttt
ctg
Bat
gaaa
ttttc
atat
gaaa
tttac
tgga
gata
agat
taaa
ttagt
ggaa
aaac
aaga
taat
tttgc
tttct
atgt
agtg
gtca
ccaa
gtta
aaga
attg
tgcc
agtta
ttagg
caag
atag
gcttt
gaga
tttag
ggac
agtta
aattt
taca
ccaa
ctta
gtga
acat
gaga
cttc
tacc
tagt
gtta
cattt
cattt
ttaaa
taag
caat
ttaaa
aattg
gtaa
atga
tttgt
ttact
ttgat
tata
ttctg
gtat
aattt
tctg
acaa
aatta
tctg
tgtc
ttggt
cagc
attg
ttgct
agaa
tatg
tattc
agat
tttgt
ctgt
cca
taat
tgag
aaca
caga
aaaa
atct
aattt
gact
atac
ccat
ttacc
ccca
tgga
aatg
aaac
tata
ttcta
tgaa
tgaa
aaat
gattt
taat
aatg
tggt
gtat
taca
ttttc
ttcat
tcaa
gtaa
tgta
ggcc
ctga
gtag
agca
tgtta
tgaa
tattt
agttc
ctta
gtgt
tttct
cattc
aagc
ctct
cata
ttgaa
tgag
tcta
gggt
ttgga
gag
atgt
ttcac
acag
catg
tctc
ttcag
tgca
agct
gaca
gata
tcag
tgca
caat
taaa
gaaa
ctta
attg
cacc
tttca
acttg
gagt
ataa
atttt
gtat
gtat
cta
tgta
tatc
tatg
tgtg
tattt
tgtg
gttta
aggc
ttattt
acat
aata
tgtg
acat
tttac
ctca
gaaa
ttcag
tgac
tgaa
tttca
cagc
tgct
tccc
atgc
atct
ttatta
tcta
tgttt
ctga
aaaa
ctca
aata
ctaa
ctaa
tctc
ttttc
ttccc
ttagc
tgttc
ctttc
ctgt
ggttt
taaa
aaag
tgac
caga
aact
aggt
ctct
atttt
cattg
ctttg
ctgc
atat
tcttt
taac
ctgc
tttta
tc
41
2927
637
KIA
A12
YE
Sgt
ccca
cagg
atttc
ggga
atca
ag44
4238
3957
5K
LK
2Y
ES
ttctg
aagc
atca
aagc
ctta
gacc
agat
gaag
actc
cagc
catg
acct
ca4
325
6662
0T
SGA
lN
Oat
ccat
tgat
ggttt
gata
gtgg
gctg
ggaa
ggaa
agct
gtgt
tcct
ccac
atta
ggca
gcaa
a
o0
44
2959903
LOC728
NO
tcat
ttattt
gaca
gtgc
atgt
ctgc
aatg
ccgt
gctc
atca
atga
t052
45
3100926
—Y
ES
agta
acag
ctat
ccac
caat
gtca
atcc
atac
46
3853655
—N
Ogc
aggc
aata
tctc
cgca
ccttt
aat
47
3947544
—N
Oct
gctg
gaac
gaca
ccaa
ctct
ttgcc
tccc
atcc
ttggt
tgga
tatg
atttg
aaat
ttggt
aatg
tgtc
tcta
cttg
aaac
agat
gcac
tgttt
ggat
gtttt
cagg
tcgg
tgat
ctac
ttgtc
cagt
ttgat
cgct
acca
gaag
48
2466
070
ZN
F692
NO
ggat
tagt
ccgc
tcca
ctca
ctgt
cagc
atta
agtg
gggg
tgcc
caag
acgg
ggtg
gatg
gggg
gcgc
cctc
caga
cctc
tgac
cacg
gcct
cacc
gcc
actc
gacc
caac
tatg
aaga
gcgc
cccc
agct
gcac
gcca
ggac
acga
ccttt
ccttc
ccct
agaa
acca
gtaa
aggc
cgct
gccc
tattc
aaga
tgaa
atgt
gtgg
accg
cccc
cagc
ccag
ttgaa
atttc
ccgt
gaaa
gtct
ctcg
cccc
ttccc
caca
gctc
cact
tcag
tgga
ctgg
aggg
cgca
ggcc
tttgt
tctg
actg
cttc
tgtc
tgcc
tgcc
tccc
accc
gacg
acac
tcac
atgg
tagc
gctg
agct
tcaa
cacc
ctgt
a
4930
6581
8R
EL
NY
ES
ttcga
tagg
tttga
gggg
aagc
tcag
ccct
ctgt
ggta
caag
ataa
cagg
tgcc
cagg
ttgga
actg
gctg
tgga
acac
ttaac
gatg
gcaa
atct
ctct
act
tcaa
tggc
cctg
5033
3218
3ST
X3
NO
ctga
tttca
ctcc
agac
tggt
gtgg
ccac
ccttg
tcttc
agat
gaga
at5
136
8141
0PA
RN
YE
Sct
gtca
atac
cagc
aaat
atgc
agaa
agct
atcg
gatc
caaa
ccta
tgct
gaat
atat
gggg
agaa
aaca
ggaa
gaga
agca
gatc
aaaa
gaaa
gtgg
actg
aaga
tagc
tgga
agga
ggct
gaca
gcaa
acgg
ttaaa
cccc
cagt
gcat
accc
taca
ccct
gcag
aatc
acta
ttacc
g
5238
7334
2—
NO
gcca
cagg
aggg
tttaa
acac
agca
gtaa
tgag
acttt
atttc
tgct
ggga
aaac
attg
ccgt
tattg
ccc
53
2572
055
—N
Ogt
gcct
gaga
gaaa
acgg
ccta
atcg
aaaa
cgtc
cgcg
gcat
acat
ccat
tctta
aaac
ttgag
tggc
tgct
tttct
gggt
ggaa
aaga
gcgg
tatc
agac
agg
gtga
gcag
tcgg
ggaa
cgga
tgaa
caaa
gact
tgca
ccgt
ggcc
ctga
tgcc
tttgt
tccg
agttc
tattc
agttg
tact
tgtg
cgttg
ttaca
ggac
ttta
gaat
gcag
ccct
gccc
ccca
accc
ccac
ctcc
cagg
gccg
acct
gtgc
tcct
agga
aggc
aaac
ctc
5434
3454
9M
LE
CN
Otg
gcac
ctct
tgca
tcca
ggca
gtct
tgtg
agat
gggg
gcac
atag
cact
gggg
aaag
caga
actc
cattc
tcac
ctct
atttt
gagc
ttcag
tgct
ttattt
cagt
atga
ggaa
aaac
aaca
acaa
actg
aagt
gcgc
tttcc
gtcc
tttca
aagg
acaa
ctgt
cggg
aagg
gaga
gccg
agttg
cgag
gtag
gagg
ggag
cact
ggca
ggga
gaga
cattc
ttgac
tcct
ctct
tccc
tggt
gtgt
tgtg
atcc
aggg
aatg
aaaa
gaaa
tttga
ccct
ggat
tggt
tctc
tcct
tgga
ctta
agga
atct
tacc
ttttc
cttc
caca
aagt
tctc
ccag
gcaa
ggac
cagc
tgcc
cattc
tgag
ccca
gggc
agcc
tcttc
aacc
atta
ttgg
55
4007
482
—N
Otc
caag
cagg
tcttc
gtag
taga
gaac
gaac
cagc
cctc
gatc
acca
ggtg
aatg
aacc
caca
cact
gcaa
acca
gcac
aggg
acag
tcgc
cgcc
aagt
cccc
aatg
ggac
aacc
gcag
cacg
acct
gaca
acag
ccat
gtgg
tcac
gact
aaga
cccc
tgtg
acag
agaa
gagg
ccag
ccag
tata
tgcc
aggt
gggg
cggt
catta
ggta
caaa
gttg
tc
5627
5139
1C
LC
N3
NO
cccg
agtta
gagc
atgg
attc
agttt
tagt
ctta
aggg
ggaa
gtga
gattg
gaga
ttttta
ttttta
atttt
gggc
agaa
gcag
gttg
actc
tagg
gatc
tcca
gagc
gaga
ggat
ttaac
ttcat
gttg
ctcc
cgtg
tttga
agga
ggac
aata
aaag
tccc
accg
ggca
aaat
tttcg
taac
ctct
gcgg
taga
aaac
gtca
ggta
tct
tttaa
atcg
cgat
agttt
tcgc
tgtg
tcag
gcttt
cttc
ggtg
gagc
tccg
aggg
tagc
tagg
ttcta
ggttt
gaaa
caga
tgca
gaat
ccaa
aggc
agcg
caa
aaaa
cagc
cacc
gattt
tgct
atgt
ctct
ga
5728
7164
5—
NO
ctct
gtgg
ggttc
ggttc
cact
ttgta
gttc
gggt
ttgta
atgt
aaag
aata
atgg
gaaa
ctgg
tcta
atta
tgag
agtta
agag
agta
gagg
cctg
tagt
tg5
836
2957
6PA
RP
16N
Otta
gcca
taat
aggg
cccg
gtac
gattt
ggga
gtga
cattt
ctgt
aaag
aaga
ggaa
aaat
cattt
ttcta
taat
ttgta
aagt
tgtg
aaag
agcc
acta
ccac
agt
tttta
cattg
atta
ttgga
acat
ttc
5925
4593
0IF
T17
2Y
ES
tggt
ggct
acaa
cattg
gcac
cgtc
agcc
atga
gagc
cgtg
tgga
ttggc
tgga
actta
atga
gact
ggac
acaa
gctc
ctct
tcag
6025
6973
4—
NO
ttccc
ttggc
tttgg
acat
gtga
ggga
agta
ggta
tccg
ctgc
ctgg
ggaa
agtc
aaca
ggac
ccct
cctc
cccg
tacc
tcttg
gtat
acaa
agct
gagc
ggtg
61
2832
639
PCD
HG
YE
Stta
ctttg
aagt
ttgcg
gcat
gatg
gtgg
aaag
tgta
aatg
ctaa
aaca
ctga
tgag
taga
atttg
AlO
6231
1334
3N
Ogg
ctga
ttgac
cttg
actg
gcct
catg
tatg
tgtc
tcaa
agttg
gctg
gggc
tgtc
ggct
gaaa
ttcct
tagt
tcta
ctcc
acat
ggcc
tttcc
ccat
ggcc
agc
ccag
gctg
agat
ctga
gattg
aaga
gggt
gaga
acag
aaag
taca
agat
cttc
ttttg
acttg
aaag
ctag
ccca
gaat
cgag
ggaa
ataa
accc
cacc
ttttg
atgg
aagg
agct
tcaa
agaa
tttgt
agtc
atttc
caat
ccac
gaag
agta
ggta
ccca
ctat
ttgttg
aata
aagc
ccta
ttgaa
agct
tttta
gcta
tcca
gatg
caaa
ttact
ggcc
aacc
tttgg
gtag
aatta
taaa
attc
tgcc
caaa
attc
tacc
aaat
tact
ggta
cttg
taaa
attc
tacc
aaat
aact
ggcc
aact
tttgg
gta
gaat
tgta
aaaa
ttaag
ttgta
aata
aagc
ctat
tgcc
taac
agga
acttc
aaag
tcag
agaa
tcct
aaaa
tgat
tgtg
ctac
acag
ggcc
acat
agag
cac
agct
ttgtc
tttttt
tttttg
agat
ggag
tctc
gctc
tgtc
cccc
aggc
tgaa
gtgc
agtg
gcgc
aatc
tttgc
tcaa
ggca
acttc
tact
tccc
aagt
tcaa
gcaa
ttc
ttctg
tttga
gcct
cctg
agta
gtgg
gact
gcag
gcat
gtgc
cact
atgc
ccca
ttatg
tattt
ttagt
agag
atga
gattt
cacc
agat
tggc
cagg
ctgg
tctc
gaac
tcct
gacc
tcaa
atga
tctg
cctg
cctc
ggcc
tccc
aaag
tgttg
ggat
tata
ggca
tgag
ccac
catg
cccg
cccg
cttc
gtct
tttca
ctgg
acta
atga
agat
acag
gtgt
ggaa
tggt
ttcac
agca
tgcc
caac
ctca
tgag
gcttg
ttggg
tcag
agct
tgtg
ccag
atgc
cagt
cttc
tgcc
ttcca
gtct
tata
gttg
ttacg
ctat
agcc
ttctg
cagt
ggat
ggta
acgt
ggcc
aaaa
taag
ccttt
ggttt
tgga
gtat
tcgt
aaaa
tg
63
3875673
—NO
ggga
acta
acgc
cttc
caga
acag
aaat
atgt
ggtc
cgat
gagt
attg
aatg
caac
ttaaa
atct
agttg
gaat
gtga
tgga
taag
aaga
gacc
caaa
gtca
tca
acaa
gcct
tctc
atga
acttt
cagt
catta
acct
tcct
atat
agta
agga
caga
aatg
atga
tttaa
gcca
tctta
ggca
aggc
agtg
ctaa
caat
atgc
tttt
gtaa
agtc
acca
tccg
gtg
64
3959071
—NO
ttctc
taat
tcta
ggcc
tgcc
gtttt
tctc
tgtg
agtg
gacc
tgat
ctcc
ttttg
ctttg
gctc
ccag
agttc
ccta
ttcgc
ttctc
cttg
tccc
ctga
gtcc
tgct
ctcc
-4ct
cctc
tgga
taac
ccct
tcca
gccc
ttcac
ttctc
cctc
ctgt
tattt
gctg
gaat
ggct
aattg
gctg
accc
cccc
accc
agtc
ccca
tggg
agac
cccc
tcc
agca
ccag
aaga
ggca
gact
aaca
ttttc
tttgg
tagc
agaa
tgta
aaat
tgta
aaat
gtga
aaga
tagg
agag
aaac
atca
ttaac
tgttc
tgtta
tttat
taac
atttt
aattt
atag
aatta
taga
ctta
acag
caaa
aattc
aaac
actg
aggg
tcat
cggt
gaaa
ata
65
2864793
—NO
gcct
attg
gcttc
aagt
tgttt
acgc
tttgg
tagg
ttttg
gcttg
ttccc
tcaa
agga
tccc
ttcttc
atgt
cctc
ccat
gatg
ttgca
ggca
aggg
tctc
ttgtta
tat
gtgg
tact
aact
cggg
ccca
cctg
gtca
taat
ttc
66
2976430
—NO
gctc
ctac
agta
acac
tcaa
tcaa
ttcac
ccca
ctcc
agtg
ttcttc
ctgc
aaca
tcac
tgat
ttaga
tggc
ccac
tctg
67
3862771
—NO
ggga
atct
atta
atag
tgga
ggaa
gtat
ttagt
gggt
ggga
68
3192159
BAT2L
NO
tgga
tata
tggc
attg
accc
gcttg
ctttg
atac
gaaa
caaa
aaag
caga
cgac
tcct
tcat
ccca
tctg
ctcc
tacc
gtga
ctgt
gga
69
3062031
—NO
gggt
ttgtc
atgc
tacg
ctgt
gcca
ttcat
gtgt
tttttt
gctc
atgc
tgttc
cttg
tgcc
tgga
aattc
tccc
ccta
attc
ccct
gagc
agtg
gatg
cagt
ggat
accc
tccc
caga
tccc
cccc
aact
tccc
tgag
tgttg
gcta
ccaa
aagc
tctg
agtg
cacc
cagg
gagc
taag
tcct
cctg
ccca
cagg
caat
aatg
gact
gaca
cggg
gtag
actta
aaga
aata
agag
gcac
atca
atca
attg
aaat
gtgt
ggga
cctta
cctg
gact
ctga
atta
70
2505555
PTPN18
YES
gagc
agta
cagg
ttcct
gtac
caca
cggt
ggct
caga
tgttc
tgct
ccac
actc
caga
atgc
71
2361309
—NO
gagc
cgga
cttc
cttg
tccc
acca
g72
3569829
—NO
gtac
tcgc
ctcc
tgca
tgga
gcac
gctc
aggg
gtgg
73
2946327
—NO
ttact
gttg
ctgt
ccgt
aaac
tgga
acct
ctgt
tcac
tcaa
g
7427
5143
2N
Ogg
aact
cctc
agaa
ctta
taat
atgt
tgat
attc
tttga
ttccc
agat
gagg
ggat
gggt
aata
ggat
acat
ggttt
tcca
gact
tgttt
gaaa
atgc
aact
atttt
tgg
gttg
cagg
gaag
gata
tagt
agaa
ctca
tggg
aact
ggtg
tttct
tgga
acat
gcttt
ggaa
atgc
tggg
ttatg
ccct
gtta
actc
ttaca
tcat
tagt
tttta
gccc
aaaa
ggaa
acag
caaa
taat
gtttt
atat
gagc
caca
ttttg
cgttg
atttt
cc
75
3907
608
ZN
F335
YE
Sgg
acgt
ctac
ggaa
gtgg
agca
cctc
cacc
aaga
gcca
agag
gaag
aggg
acca
gagg
agga
ggac
gatg
atga
cattg
taga
cgct
ggag
ccat
tgat
gacc
tgga
76
2316
543
—N
Oat
gaga
cctc
agac
aaag
ccac
cagg
a77
2453
070
Clo
rfll
NO
tgcg
acca
gtac
agac
cctg
tcct
ggct
gaac
aaga
agag
acac
atgc
tcca
cttg
ggag
ccttt
gcca
ccac
gcaa
ctca
gggc
tcaa
gatg
aatg
gga
ggga
gaga
tttga
gtcc
aagc
atac
attta
tattc
agtg
ttgtg
ccat
tgag
ttccc
atgt
ggat
cattc
tgaa
ggtg
atct
ccac
aaga
gggt
gtgt
gtgt
gtgt
gtttg
gtgt
gtgt
gtgg
aggg
gggg
ccgc
tgga
taca
tcac
tgaa
gcta
ttgat
ataa
caca
atga
gtca
ctgt
tcag
aattt
tgct
cttg
ttaga
tgttt
tctta
cat
tggg
taga
gtcc
agcc
78
3119
603
RH
PNl
NO
tcca
gctg
gcag
caag
cacc
gagc
atgc
cctc
ccca
ccca
gagg
acct
ccgg
gcaa
tgcc
tgtc
ccgc
ctca
tgct
ggag
gctg
cctc
gggc
acct
gcc
tgcc
catta
aaga
ctgg
tcag
acct
gt
7939
0880
1ST
AU
lY
ES
cact
gcag
aagg
aacg
ggca
ccaa
caag
aagg
tggc
caag
cgca
atgc
agcc
gaga
acat
gctg
gaga
tcct
tggt
ttcaa
agtc
ccgc
aggc
gcag
ccca
ccaa
accc
gcac
tcaa
gtca
gagg
a
8025
2566
4—
NO
gaag
ctgt
tttaa
tgga
ggcc
gaga
gttg
gaca
atttt
gtgt
tatg
gaga
attc
tggg
aaga
agga
aaga
cccg
gatg
ggac
tgttt
atta
aggc
ttatg
actg
gact
ggag
ata
81
2395
926
CL
STN
YE
Sta
cggg
aaag
aaca
tcaa
tata
agct
gacc
gtca
ctgc
ctat
gact
gtgg
gaag
aaaa
gagc
caca
gaag
atgt
tttgg
tgaa
gatc
agca
ttaag
ccca
c\1
ctgc
a
8225
7114
5A
NA
PCY
ES
ctca
aaag
tgttt
atta
catc
tgac
ctat
gtgg
gcaa
aagt
tcct
gtgc
ttttta
gtag
agtc
ccag
ctcc
a\ 1
8328
3600
6N
Ogg
atgt
acac
tgat
ttggc
ccttt
aaat
gaca
ttata
ccaa
gaag
ttggt
ataa
ggga
gtag
gaga
tgag
aaaa
ctttt
tgaa
aaat
cattt
ggtg
catg
ctaa
ggt
agct
tgta
ggac
atgt
ggttt
taaa
aaat
tcat
ggat
tttga
agaa
ggat
aggg
tgga
agta
aaat
ttagc
tgat
agca
tagg
gatc
atttg
tgat
ttata
taga
aaat
aatta
tattt
ttgag
tgag
aagc
atca
gact
taaa
tttttt
tcttc
tattt
tggt
tatta
ccaa
aagg
aaca
gaga
aact
ctca
gaat
tcttt
gaat
cata
gaat
gtta
gtgc
tgga
aagg
acta
ggaa
gtga
cttg
taca
accc
cttc
atttt
acaa
agga
tcag
agag
aattg
gtta
gtag
agtc
ccaa
atct
gcttt
taga
agta
ata
tggt
gatc
atca
gttta
gattc
actg
tagg
aagg
gcag
ccct
gtta
aagt
ttgtc
agga
gaaa
aagg
gaag
gtat
cact
ttttg
tatg
ttcac
ccag
ttgct
tctg
tagt
tttca
tacc
cctg
ttttc
tcaa
ttttc
ttaga
actg
tcttt
aatg
gccc
agct
ctac
cagg
gcttg
ttgtc
taag
gaca
ttaac
ttgtg
ctcc
cctc
aggg
atgg
gttta
ctac
tagc
tgtc
agaa
agct
attg
ggta
tcct
aatg
tgtta
atag
ctga
aact
cagc
tgta
atttc
tcct
aaat
acttc
agca
ttttg
cattc
tgta
cattg
tgg
tgct
ttttc
cacc
ttgta
ttgttg
taac
tgta
agct
ccta
gggg
gcag
caat
ttggt
ctta
ggcg
tgta
ccct
gttta
gtgc
ctgg
caat
gcca
tgttt
atat
caag
gtc
ttaat
atta
ctttt
gagt
tattt
ctcc
cact
cttc
atgg
aagt
gatg
ctcc
agcc
ttgct
aaaa
cact
gagg
gtcc
tccc
gata
agcc
aggc
attg
tgat
tcct
tggc
ttcaa
gtgc
cttc
tgtc
ttccc
cgaa
ggtc
cact
gaga
tcta
caat
cccc
cctt
8425
1777
3N
Ota
gcgg
agaa
tgac
ctac
ctac
ttgat
taat
gatg
tctg
catg
ggct
tact
aaga
gggt
gtgg
ggaa
tggt
ggca
gggg
taac
tatc
atcg
agaa
tata
ggg
caat
gggt
atat
tcta
acttt
gcat
aact
ttgct
agat
tata
tata
tcttt
actc
acaa
taag
caaa
gttg
aaca
actta
aaat
tgat
tccc
tgttt
tctg
ttttc
acta
tcca
acta
ggat
gaac
tctta
agttc
atac
atat
acttt
aaaa
tggg
aaca
aaca
tagc
ctgt
catg
tcttt
attta
ggct
aagt
gcat
atgt
atgt
ataa
gtaa
tgtg
tat
atcc
ttcac
ttatta
atag
tgcc
ttcat
gaat
tcaa
agac
ccttg
tcac
atgg
tgct
cagg
aaaa
aaaa
agtg
ccaa
tattc
tttttt
tttttt
tctc
taag
aagt
act
agttt
aggt
aagt
gaaa
ttcaa
tagc
atat
cagt
gcttt
acct
tagt
cact
ttttg
tgttt
tgat
cacc
ctta
gtca
tcac
tgga
gaga
agag
tttga
attc
atttg
tgg
ttttg
cttta
gtgg
caaa
cctc
ctta
cagg
gatg
agttg
ccct
ccat
tctc
cttg
ccat
atttg
gtta
atct
aggc
catg
acct
ttgag
ggga
aaag
gcac
agat
tgc
aact
tttct
ttcct
gcttt
tgtg
aatc
aggc
catg
ttcac
gcag
ggtc
cttg
aagg
tgac
ttgag
tgag
gaag
ctat
atac
cgat
gact
tacc
aattt
ttggt
ctt
ataa
cattc
tctc
tcttt
tttttt
tttttt
ttttg
tagc
atat
ggct
tcct
ctac
tgta
tgttg
gcat
ttgcc
acttt
tggg
gattt
gact
cctg
cagg
tattt
tttttt
ttttc
ttttg
gggg
agtg
tgac
ggca
gtgg
atat
agtc
ttaga
tgat
gtttt
taat
agttc
aagt
ccat
tagt
gtct
tact
ctag
agaa
aacg
aaag
actc
actg
tgtg
agttt
tct
tgcc
ctct
ggttt
tcag
aattc
tgtg
tagt
ttctc
aaca
t
8530
6495
7—
NO
aaat
ccac
agcc
acat
ctttg
aatg
8632
3918
1K
IAA
12Y
ES
tcat
tctc
accg
caga
gtca
aaat
ggcc
gagc
accc
cctc
17
8736
2901
7C
SNK
lN
Otc
acca
tcta
taag
gtct
caga
gcag
agga
ttattc
atgg
taat
aagt
gggg
gtgt
ggtg
cagc
cattc
cagt
aaca
ccca
-4 4-G
l
88
3709
453
NO
ggtg
ggac
ttaca
ggtc
tcgc
tata
gcac
caga
taaa
atcc
atac
taca
ccct
atca
ttacc
tggg
aatg
aggg
taca
ggag
aaaa
cagt
aaag
cctc
aaa
agga
ggaa
attc
gtaa
agag
tcct
taaa
aacc
ttaaa
tgat
ttcca
aaaa
ttatta
gggg
acgt
taat
tgga
taag
acct
acttt
agga
attc
ccac
ctaa
gcaa
tgtc
taaa
ctgt
ttgct
ttgct
gaga
tggg
atcc
aaat
tcat
gtag
caaa
agaa
ctct
gatg
cctg
ctgc
agct
gaag
agtta
caaa
tgat
tgag
gaaa
aaat
tcaa
agtg
ctca
agtc
cgta
gaag
agat
ccta
gtgt
tcca
ttaca
gctc
ttagt
gttc
ccta
ctct
ccat
tctc
ctat
agga
gtga
ttgttc
aaaa
tgttg
actta
gtta
aattg
tcct
tttta
acgc
acag
tacg
acta
gaac
tttct
caat
ttattt
agat
caaa
tagc
tattc
ttata
ggcc
aaat
acaa
ctga
gaat
agtc
agac
tcat
tggc
acc
gatc
gaga
taaa
aatg
tagt
tcct
ttaaa
caaa
gcac
aggt
tcga
caag
ccttt
atta
attc
gtca
gaat
ggca
gtca
aatc
tggc
tgat
tttgt
tggc
agta
tag
atat
tcat
tacc
ctaa
aact
aaaa
tgttt
cagt
tcct
aaaa
attg
acta
ccta
aattt
tacc
caaa
ataa
ctag
ttcca
ttcct
ttgga
ggga
gccg
taac
tatc
ttta
8927
3201
2FA
M47
YE
Scc
ttata
agcc
aaag
tggg
tgaa
gatg
aggt
atgg
agca
tggt
atttg
aacc
ccaa
gttg
tgga
aaaa
gcaa
agag
taga
cgag
cctc
tggt
tgac
cctg
aE
ggtc
tcac
ata
9027
8758
8__
_N
Ogg
ttgct
gccc
ttgag
cattt
tgcc
91
3630
110
___
YE
Sat
ggag
cacc
tgttc
gtgt
acct
cc92
3738
711
—N
Ogt
agct
aaac
atat
gcgt
gggg
ataa
aaat
cggg
tgga
caag
gcca
gata
catg
aaga
catg
ctcc
actc
tcat
ctgt
gaa
93
3675
240
NO
cccg
gact
gatg
tgat
ttcttt
gaca
atta
caaa
aaac
aaaa
aaca
aaca
aaca
aaaa
aacg
caaa
gcaa
tagc
aggg
aaag
gaca
gtttg
agaa
gacc
tgc
aagg
cctg
ccag
gcgg
ctcg
cttg
tgca
gaca
caac
cggg
acgg
ctttg
aggg
gacg
tgca
cacc
caag
9434
5892
2C
TD
SPY
ES
ggga
cctg
agaa
agac
cctc
atcc
tgg
29
526
9433
7__
_N
Oga
ggaa
gact
tgtg
cgcc
cggc
ccac
acga
acca
taga
gccg
atct
ccgg
g96
2990
414
—N
Occ
ctcc
acac
gtgt
tcag
ttaaa
gtgt
gagg
aata
ccct
cccc
agac
aaac
atca
caga
tttcc
gaaa
tcaa
acac
gctc
cagc
aagt
gttc
tgca
cacc
ccac
tg
9731
9032
8—
NO
cttc
tcag
ttggg
tcaa
gtcc
attg
acct
ttctg
ttaca
gcttg
agttt
tgtg
tctta
cgta
gaaa
aggt
gccc
ctgc
tctc
acttt
ctgc
acag
ctat
cagg
tcca
gct
cata
gcag
ttcttc
ta
98
3745
435
MY
H3
YE
Sct
gaga
ttgca
ggat
ctgg
tgga
taaa
ctgc
aagt
gaaa
gtca
agtc
ctac
aaga
ggca
ggcg
ga99
3110
076
LO
ClO
ON
Ogt
aaaa
gcgc
ggca
ccga
cacc
agct
gtgt
gcag
cagt
ggcg
gcgg
cggc
cgaa
ggag
aaat
agaa
cagc
gcag
gcaa
aaga
agaa
aggc
gcgg
1341
28gc
tgg
100
3471
539
...N
Oca
cggg
tccg
ctag
cttta
agta
caac
agga
tttcc
gtgt
tcaa
aaga
ctat
acta
agtg
ctta
atca
aatc
ccgt
caca
ctaa
agcc
aaat
ttagt
tctc
tagg
aatc
ggag
tctg
cgta
ta
101
3918
710
SON
YE
Sca
gagc
agcc
tgta
gacg
tacc
atcg
gaga
ttgca
gattc
atcc
atga
caag
accg
cagg
agttg
ccgg
agct
gcct
aaga
ccac
agcg
ttgga
gctg
cag
gagt
cgtc
ggtg
gcct
cagc
gatg
gagt
tgcc
gggg
ccac
ctgc
gacc
tcca
tgcc
ggag
ttgca
gggg
cccc
ctgt
gact
ccag
tgct
ggag
ttac
ctgg
gccc
tctg
ctac
cccg
gtgc
caga
gttg
ccag
ggcc
ccttt
ctac
ccca
gtgc
ctga
gttg
ccag
ggcc
ccct
gcga
cagc
agtg
cctg
agttg
cca
Ul
gggc
cctc
tgtg
acac
cagt
gcca
cagt
tgtc
gcag
gaat
tgcc
aggg
cttc
cagc
acca
tcca
tggg
gttg
gagc
cacc
acag
gagg
tacc
agag
cca
cctg
tgat
ggca
cagg
agttg
ccag
ggct
gcct
ttggt
gaca
gcag
cagt
agag
ttgcc
agag
cagc
ctgc
ggta
acag
tagc
aatg
gagt
tgac
cgaa
caa
cctg
tgac
gacg
acag
agttg
gagc
agcc
tgtg
ggga
tgac
aacg
gtgg
aaca
tcct
gggc
atcc
tgag
gtga
caac
ggca
acag
ggttg
ctgg
ggca
gcct
gagg
caac
gatg
gtgc
tgga
gttg
ccag
gaca
gcca
gtgg
caac
gaca
gcgc
tgga
gttg
ccgg
ggca
gcct
tcgg
tgac
tggg
gtgc
cag
agttg
ccag
ggct
gcct
tcgg
caac
tagg
gcac
tgga
gttg
tcgg
ggca
gcct
gtgg
caac
tggg
gcac
taga
gttg
cctg
ggcc
gctc
atgg
cagc
tggg
gcac
tgga
gttc
tcgg
ggca
gtct
gggg
cagc
tgga
gcac
tgga
gcttt
tggg
gcag
cctc
tggc
aaca
gggg
tgct
ggag
ttgcc
aggg
cagc
ctgg
ggcg
ccag
agttg
cctg
ggca
gcct
gtgg
caac
tgtg
gcgc
tgga
gatc
tctg
ttcag
tctg
tggt
gaca
acat
c
102
2792
431
TM
EM
lY
ES
tcct
tgga
tatc
accc
agag
tattg
aaga
cgac
ccac
ttctg
gatg
ccca
gcttc
tccc
acac
cact
catta
caag
ctca
cttta
gacc
ccga
ttcca
tcct
ctt
92cc
taca
gtca
tcat
agtg
aatc
ttc
103
3828
963
DPY
19Y
ES
gaag
tgga
gcga
gaaa
tctc
attc
agaa
caga
gtgt
ggcc
tgta
ttact
ccta
ctac
aagc
agat
gctg
cagg
ctcc
aacc
ctcg
tgca
aL
310
438
7492
7C
DS2
YE
Sgt
tcat
tgtc
ccca
tatc
ttgtg
tgat
ctgt
aatg
acat
catg
gcct
atat
gtttg
gcttt
ttcttt
ggtc
ggac
ccca
ctca
tc10
530
1246
8A
KA
P9Y
ES
ctgg
ggtc
aggg
aattt
atct
taca
caca
gtca
ggga
tttga
cata
gcat
caga
aggc
cgag
gag
106
3541
309
PLE
KH
YE
Sga
tgtc
atcc
ggaa
acct
caag
gcca
agtg
gatc
tgaa
ctcc
cgct
gcca
aattg
ttcga
gggg
aggg
ttcac
agac
gtttc
agH
l
107
3543
492
PSE
Nl
YE
Sgc
aagt
gaca
acag
ccttt
gcgg
tcct
taga
cagc
ttggc
ctgg
ag
108
3878
896
—N
Oaa
cgtg
ctca
ggga
gaat
cagc
tggt
ttct
109
2669
445
—N
Oat
ggct
aaaa
ctcc
aata
gaca
gcat
gtag
tgct
tggg
tgac
agac
tttgg
gaca
atca
cagc
cact
acaa
agtg
aaag
ggga
gaat
ccag
aaag
taca
gag
ccag
aaag
ggag
agcc
ccaa
attc
tatg
taaa
aact
gccc
aaat
ctttg
gctg
acca
ctga
acta
tgca
tgca
tggg
gcag
acgc
taag
caac
ctag
ctaa
ggat
aaaa
taac
taag
ccga
gaat
tgag
gtag
tgcc
tc
110
3953
763
PI4K
AY
ES
gagc
aacc
tgga
cata
actg
tcgg
ctct
cggc
aaca
agcc
accc
aagg
ctgg
atca
acac
atac
cccc
tgtc
cagc
ggca
tgtc
cacc
atct
111
3529
048
NO
gccc
agta
cctc
tcca
aggg
ttgta
tatc
cggt
gggg
gtgg
aggg
atcg
gcag
aggt
ggtg
tctc
agcc
ctttc
gttc
aagg
agcg
ggttg
atgg
tggt
ggtg
tagg
aattg
ctgg
gtct
tgga
aact
aggg
gcac
cagg
tact
gcgt
tgtc
ccga
aacc
acttt
agttg
ccca
tgct
tcaa
ggca
tagc
gtgt
gtca
ccaa
acca
ctga
atga
tctc
aggc
cgag
gtaa
acca
gtga
tctg
ttcta
acttt
tggt
aatc
ctca
cgcc
gtgc
ccac
tggc
actg
taaa
aaaa
agga
tttaa
ggat
agc
cagc
tgct
ctttg
gtttt
gcgc
ttggt
ctttc
gctg
gcgt
tgca
tatc
tggg
gaaa
ggta
ctct
gtgg
ggcc
aact
ctac
ctgg
tact
ga
112
3691
111
—N
Ottg
acgt
ggct
tatc
aacc
tttgt
ctct
tacg
tgct
ctttg
ttccg
gcca
tcgt
cttg
tttaa
caaa
113
3845
902
TIM
Ml
NO
gggt
ccgc
atgt
acgt
actg
cctg
cccg
gggc
ttagg
aggg
tggc
accg
gtgc
tggg
acac
acgg
gact
gtgt
cctc
gcca
cccc
ccgc
cctg
cccc
ct3
gcca
gcca
gtgc
agct
tgga
tctc
gggg
gtgt
gggg
ccct
gtgc
cttc
ctga
agtg
ctgg
cagc
ccag
tggc
acct
ccttc
aggc
ctttg
gggt
attc
ccct
agtg
tgcc
caag
tcag
cctc
atat
tctg
ggcg
gaca
gcttg
tctg
gact
tcgg
agttg
gggg
tggt
caga
cacc
acag
gagc
tgtc
acct
cctg
cgga
tgg
gcaa
ataa
att
114
4009
521
PHF8
YE
Sga
ccct
gcttt
gaaa
tctc
gacc
caag
aaaa
agaa
gaat
tcag
atga
tgct
ccat
ggag
tcc
115
3817
782
—N
Oca
tgcc
cggc
ccag
taat
tagg
tttttt
tttaa
aaag
cccc
aaac
ctta
atttt
atttg
cccc
tcat
ttcac
ggag
ctcc
cagc
gtag
tggg
gaga
agct
ggcc
cag
agaa
cccc
acgt
cctg
caca
gtga
gggc
cagc
ccttg
ggga
cccc
atgt
cctg
cacg
gcgg
gggc
cagg
ccct
gggg
ccac
gtgt
gttc
cccc
ccc
accc
ctat
tttct
gccg
cgtg
ctcg
ttctg
ctgc
acgt
ggct
gtca
cttg
aatg
ggaa
tttaa
aaat
ataa
ctta
atgc
acac
tgac
ttccc
ttgat
tggt
gcaa
ttaat
tattt
gaaa
agaa
aaat
atca
cagt
gttc
ttaag
gact
ggca
tttttc
caag
ctgt
gtca
gtttt
gattg
atgt
ctga
cgcc
gtgg
ctgg
cttc
tgaa
agcc
tca
ccag
ttctg
atgg
ctta
ttggc
gact
tcag
tgta
tcca
gcaa
tctc
tgct
catc
tttct
gggg
gaat
atat
aatg
atca
gcag
gcct
gaga
actc
ctac
aaat
caga
gata
agaa
aaca
aatg
gccc
gatg
aattc
ggcg
tgag
gggc
ttgag
caga
cgcg
ttacg
aagg
agct
gcag
gaac
gaaa
gcca
tggt
tcag
tgcc
cag
catc
ggta
gaag
tcag
aggc
agcc
agct
ggga
gaca
ccgc
cagc
ccca
ggac
ggcc
acgt
caga
acac
cgcc
acca
gatg
ccag
tgag
gagg
gcg
cagc
tgca
cgct
gctc
ccag
ccgg
ggtg
gatg
cagg
gcgg
tgcc
atgg
ccat
ggaa
aacc
gttc
ggca
ttttc
ttgga
aacg
gaag
catg
tacc
ctgt
gccc
cagc
gatg
ccac
tctg
tgca
tattt
actc
agga
gaaa
tgaa
aaca
tatg
tctc
caga
aaga
ttggt
acat
aagt
gttc
ttagc
agttt
tcttc
acga
gagc
ccca
aact
ggaa
acag
cccg
cacg
tcca
tcaa
cagg
agga
taga
taaa
taaa
taac
cata
gatc
cacg
cgca
accg
agta
ctgt
ttaga
aatg
aaaa
gga
gcg
116
3948
736
—N
Ocg
ttgat
ccca
cctc
cttg
atct
ggttt
ctaa
gacc
tcat
acaa
actg
ggcc
cgtta
gtct
ttgct
ccct
gata
ctat
ctga
tacc
atct
ggtc
tcac
117
3104
724
ZB
TB
lON
Oct
ggtc
ggac
ggaa
acgc
tccg
ccgg
cttta
ttgtc
gcttc
gtta
tgtg
gcgg
agcc
gagc
agttt
agcg
tgcc
tctc
accc
tcag
cgcc
tgcg
aagc
cgg
c11
833
6456
5PL
EK
HN
Oac
tggc
ctgt
gcta
ctta
ttacc
gggc
ttgta
atag
cggt
tcttg
tctc
cata
gcct
gttg
agtg
ttccc
agat
gtga
ctca
ccttt
ctgc
tgcc
ctct
tcat
gcag
A7
gcct
actg
a
119
3540
221
NO
atga
ggcc
ggaa
agat
ggcg
aattt
gcag
ggct
gg12
034
5807
5N
Occ
aaat
tttag
ttgtg
agga
aaca
gaca
tacc
aagt
attc
ttttta
gcaa
tgag
catta
gttc
tgttt
ttctta
acag
tgag
gcaa
acta
tgta
tattt
acat
tttgt
agaa
tattt
cctc
caaa
attg
gtttc
tgca
tact
ttttg
tttttg
ctat
cact
gtttg
ggct
taga
gtag
tatc
cttg
aggc
tcgt
ctca
tcttt
tttat
ttgttt
tgttt
ttttg
tttgt
ttggt
tttgt
tcttt
gaga
ctgc
tgca
gtac
aatg
gagc
gatc
ccgg
c
121
3773
290
NO
cagg
ctca
tcat
tcct
tgttg
gattt
cttc
ctgg
actta
tttaa
aagt
tgtta
tagc
tatta
tgaa
tgag
ctttt
ttaaa
aattg
taat
tcct
aact
atat
tact
ggtg
tag
aaga
aaac
taca
gagc
tggt
atcc
agct
acttt
attg
atttc
ttact
gtttc
tattg
gcttt
tattt
ttcat
tagt
agag
gttg
agaa
tttca
gtat
ttcat
tccc
agat
ccat
atct
cattt
ctttc
tcttg
ttttc
tttgc
gttg
ctgg
agcc
tcca
aaac
agtc
tggg
gtaa
ctgc
tatg
ctag
cagg
cctc
ctca
tc
122
2835
195
FLJ4
160
NO
ttatc
ggtg
tagc
agag
aggt
tccc
aaga
ctct
tgac
tggt
cctg
ggag
tggg
tgtg
acca
agtc
atag
ttctg
gaat
gtgt
gtag
gcaa
attc
agag
gctg
tt3
ccag
ggaa
gagg
ggat
tttga
tact
gtgt
tagg
tggg
gtgt
gtga
ggct
gttg
gcag
cagg
tgaa
cagc
tact
gctg
tgttc
tcag
gact
aggg
aaca
aag
gggt
atgc
aaat
cata
gagg
aaac
tctg
ggaa
ggcg
gtga
taag
gcct
ggtg
ggtg
ggga
ggtta
ggga
atgg
cttg
ctttc
ctgt
ttctg
gtta
gaag
ggga
gcca
gggg
gaac
cccc
agtg
gtttc
aggt
ggcc
cctg
aggt
cctg
gagg
cagc
cgtg
gatg
tgat
gcaa
ttggc
tgtg
ggac
ctta
gatg
tagg
aca
caac
ttcag
tgttc
ccat
ccag
aaag
acct
cact
caca
gggt
tgtg
ctga
gaat
gacg
tggg
gcta
agca
tgca
gagc
tccc
tgta
aact
gtga
agtg
tgat
acaa
atgt
aaat
gaca
gcag
tgat
ctcg
gggt
ggcc
cccg
gcat
gctg
ccct
cccc
cacg
ccca
tgcc
tgtg
gcag
caaa
ccttg
ttcat
cagt
atag
cttt
ctttc
ctgt
aacc
cagg
atct
acct
tggg
gggc
ttctc
aata
ctgc
attc
tatg
tagc
cagc
ctct
ttaac
ttggt
aagt
gagc
cacc
ccat
tcta
gaac
ctgg
aaat
tgga
gccc
ctca
aaaa
cagt
tcct
gttc
aagg
agga
ctga
cctg
ctgg
ggca
atgt
tggg
tgca
gtgc
agtc
cctg
cttg
gggt
ggtc
atgt
ctag
gctg
ttg
ctct
gggc
aaag
ataa
gttg
caag
attc
acag
aaat
ggga
aaat
gtga
ccaa
gtgt
gatc
ttaac
aact
gaca
aagt
ttgta
acca
accc
aagt
taga
atg
tgtg
tcaa
acag
gagg
tagt
ttaga
tatg
cttc
caag
aaca
tgtc
tgtg
ttata
acca
tagt
gcct
aagc
agtg
agct
ctgg
tttttg
aagg
gcttt
taag
aa
123
2988
321
FOX
Kl
NO
aggc
tctc
cggg
caaa
tcag
aagg
ccac
gaga
gaga
gagg
agcg
ggga
gagt
ggtg
agga
ggat
tcgt
ctct
gact
gatg
aacc
tcgc
cgtg
cctg
tctg
tcac
atcc
aagt
ctgt
gcca
gctg
ctgg
gagg
tcag
actc
ctgc
cctg
agaa
124
3632
941
UB
L7
NO
ccca
gcaa
gttg
caga
ggct
actg
ccct
tggg
aggc
actc
atga
aggt
gcct
ccat
ctct
ccct
tccc
caat
atac
ctga
tggt
caac
tctc
125
3855
574
—N
Ogg
cagc
aggg
gtca
gattt
tcct
catg
gcct
ctcc
agca
ggcc
agcc
cggg
tttct
cctg
gtgg
ctgg
gagc
cagg
agag
cggc
tccc
atgc
agag
tgct
ggttt
caca
tttgt
gggt
ggcc
catc
cgttg
aggc
cagt
cacg
tggc
cagg
ccca
ggac
caga
tggg
agag
gacc
gcac
aagt
gcag
cgtg
agct
gtgg
gagg
tgtg
gctg
acca
cgct
taga
gagt
gtgg
tgcc
agga
aaga
tctg
atgt
gtca
ccat
tcta
tatc
tgtg
atgg
gcag
acgc
aggc
aggt
caaa
gggg
ccct
gggg
actta
ccga
gccc
ctgc
ctat
tggc
cagg
cact
gtga
ttgtc
ctcg
gtgc
catg
tgcc
atgg
aagc
cctc
tcag
aggt
cgag
ggga
ggca
ggtg
gaga
caga
gcct
ctgt
aggc
ctttc
aagc
tgta
gccc
caga
gccc
agag
gcag
ccga
attc
atgt
gtgc
agtc
agag
agca
ctta
gtgg
gcac
tgta
gtca
atca
gtga
gcgc
ttact
ggac
acca
cagg
cagc
aagc
attta
ctga
gtac
ctct
ggca
gttg
atgc
atac
ttcct
gagt
acct
gttc
atag
cagg
ctta
gct
ctgg
atgt
gggg
tggt
gagg
aatc
ccac
aggc
tgag
gggt
ggcc
tggg
agct
ttcca
gggg
gcct
gaag
cagg
aggg
gctg
cctg
agtg
ggtc
agca
gccc
agcc
cagt
gccc
tacg
cagc
cctg
ccca
cacc
ccca
accc
agcc
tctg
tctc
ccca
gtcc
catc
cccc
acac
cctg
gtcc
ctcc
tact
ttgtg
cctc
gag
gaca
ctgc
cccg
ctgc
cctc
ctca
cttc
ccag
cgtc
tgct
gtcc
cctc
cgct
gctg
ctgc
ccgg
cagg
attg
ctct
gtgt
ctca
tcca
cgttt
gtcc
ctgc
aga
gttc
ctcg
cctc
tccc
tgga
accg
ccag
cctc
atgc
cgca
tctc
agag
gcttt
tttttt
ctct
ctttt
tttttt
ttttg
agat
ggag
tctc
actc
tgtc
accc
aggc
tag
agtg
c
126
2648
215
—N
Ottt
cgca
gttg
ccaa
aaat
ggtg
ccgt
aagg
ctttc
ctgg
aatta
agta
gtca
gttg
gtat
ttggt
tagc
agat
gcttt
gtaa
aacc
taat
atat
tccc
acta
tattt
cct
ttaaa
aatc
aagg
gggg
gtta
acac
cttta
gattg
tcat
gtttt
attg
agaa
aaat
ccac
ttctc
tgaa
actc
ccta
ttact
g12
727
1400
9—
NO
atct
gagg
ccaa
ggat
aaat
tcat
caac
tggg
tagc
cacc
acta
taga
agaa
gcca
acca
caat
caat
gtcg
gcta
tgcg
tcca
gctg
ccag
aggc
cgcc
agga
atga
gcta
ccttg
aaga
atcg
tccg
tgac
aaca
tttct
gaat
ggct
atgt
cact
atca
atgg
ggcc
acaa
caac
aaca
cttg
caat
ccaa
cctg
aact
tO
Occ
tatg
acca
aacc
aagc
aatc
tattt
atgc
ccaa
gtca
agtg
aaag
gtga
actc
cacc
ttcac
catg
catc
aaaa
gcct
tggt
atcc
tgcc
caat
atgc
ctgg
aacg
gtat
atat
tggg
aacc
tgct
gtgc
tggt
ggcc
ggat
tcca
tata
gctc
c
128
3342
827
NO
gagc
aggc
agttg
gcaa
accg
gaaa
g12
934
9423
5L
MO
7Y
ES
ggaa
agaa
gccg
cagg
atca
gcttg
ttattg
agag
a13
038
1634
4SF
3A2
YE
Sgg
agct
acct
ggca
cata
cgca
gggg
aaga
agc
131
2340
361
DN
AJC
NO
cggc
ggct
tcgc
ctcg
cccg
gcga
agct
tctc
tccg
gtgg
ccgc
tcct
tcttt
tccc
tcct
ctttg
cgtc
atgt
cggg
cact
taat
ttttt
613
235
5728
6H
OM
EZ
YE
Sag
agca
gctg
gcta
tcct
taaa
tcct
tttttt
taca
gtgc
cagt
gggc
acgg
cgtg
agga
ttacc
aaaa
gtta
gaac
agat
cact
ggttt
acct
cggc
ctga
gatc
attc
agtg
gtttg
gtga
caca
cgct
atgc
cttg
aagc
atgg
gcaa
ctaa
agtg
gtttc
ggga
caac
gcag
tacc
tg
133
3911
399
NO
atttt
cctg
cattc
acga
tggc
ctcc
134
2612
381
EA
FlY
ES
cagc
agta
aaat
ccag
gccc
gaat
ggaa
cagc
agcc
cact
cgtc
ctcc
acag
acgt
caca
gcca
ccac
cacc
tcca
ccac
ctat
gcca
ttcag
agct
cca
acga
agcc
tcca
gttg
gacc
caaa
acttc
tccc
ttgaa
agat
aacc
cctc
acct
gaac
ctca
gttg
gatg
acat
caaa
135
3354
739
...N
Ottt
gaga
ggtta
tttgt
ccat
ggga
tgct
cgtg
ttaaa
acaa
aaa
136
3434
425
RN
FlO
YE
Sta
atcg
caaa
cgtg
aact
ttcct
accc
caaa
aatg
aaag
tttta
acaa
ccag
tccc
gtcg
ctcc
agttc
acag
aaaa
gcaa
gact
tttaa
caag
atgc
ctcc
tcaa
aggg
gcgg
cggc
agca
gcaa
actc
tttag
ctct
tc
137
3734620
KCTD2
YE
Sag
gaag
cgga
gtttt
acaa
catc
gcgt
ccct
tgtg
cggc
tggt
taag
gaaa
ggat
acg
138
3802333
LOC728
NO
ttcag
ctac
gttg
tccc
agca
cttc
actg
gtta
acct
tttat
gtcc
acca
tttgt
ggat
ttcac
agct
acttg
tcaa
tggt
gaat
attg
atca
tcat
catta
tcta
ctga
606
gctg
ctac
cata
tccc
agct
actc
cttg
catg
ttgttc
atta
ttttc
tcaa
cact
cagc
atat
ttgca
atat
gtta
tgta
atat
caca
gaca
agga
aact
gaac
gca
gaaa
tgttt
tattt
cttg
ccaa
acat
caca
tgag
gatg
aaca
atga
aacc
gattt
gaaa
ccag
gattg
tctg
attc
caac
atct
ctgg
gtcc
tttttc
actc
tgat
atg
139
2896736
NO
agga
tcta
gctc
tcttg
ctca
agct
ggag
tgca
gtgg
tatg
atca
aaac
tcac
taca
gcct
gcaa
ctag
tgat
cctc
ctgc
ctca
gcta
tagc
taaa
acta
tag
140
3053930
NO
ttggc
acac
taac
tcca
cttc
taga
catc
tatc
ctaa
gata
ataa
tcaa
cata
taaa
aatg
caca
taca
aata
ttgac
tata
agaa
taaa
aaaa
tttca
gcat
ttaa
gctg
taca
ggtg
aatg
tttaa
atta
tggt
acac
ccat
ttgtg
ttattt
atttt
aaaa
aattt
ttttg
aagt
ataa
ctat
gtag
aaaa
ctgt
ccta
gcat
aatg
ttaag
cat
cgttc
ccaa
tgtg
ttt
141
3459700
NO
ctaa
ggtg
agac
aggt
ctgt
gaat
ccgg
ggtc
tctg
tcac
ttggg
gaaa
aaaa
acgg
gcag
aaaa
gctc
tgaa
gtttg
aaac
acat
aagg
aaaa
tctg
cta
tctg
tacc
ccca
ccct
ccca
aaat
atag
ttgac
gccc
ccgc
gtga
tgaa
gagt
ttatg
gggg
tgga
ggct
tgga
ggag
atgt
cact
gcgc
ctgg
gagc
ttgt
gtcc
gattc
cggg
aggt
ggcc
gtgt
ccga
gtgc
gtgg
ttgta
cattt
ccga
aggt
ctca
gtttc
tctc
acttt
cagc
ccgc
ga
142
3223562
MEGF9
YE
Sgt
ctgc
cagt
tgcg
atgc
cctc
acag
143
3888086
ARFGE
YE
Sgg
aaag
gcct
tgac
atgg
caag
acgg
tgta
gtgt
gacg
tcca
tgga
gtcc
acag
tgtc
ctcg
ggga
ccca
gaca
actg
ttcag
gatg
accc
tgag
caat
ttF2
gagg
tcat
caa
144
3378887
TMEMl
YE
Sgg
attc
cagc
cgaa
cttc
catc
cgca
g34
145
3280240
—N
Oac
ctttc
cctg
aggt
tgga
tagg
cact
gctc
ccct
tctc
tgtg
cccc
tggc
tccc
acat
ctaa
tgttc
tcct
ggca
ctta
cgac
actc
tcat
tcaa
atttt
ctag
ccat
gcac
ctta
agcc
c
146
3379391
—N
Ottt
tgta
catg
cact
tgca
gtat
tgag
gtta
atca
147
2351210
FAM40
YE
Sgc
agca
ccca
cctc
aaaa
gcca
aaac
agac
tcaa
tcaa
catc
ctag
cgga
cgtc
ttgcc
tgag
gaga
tA
148
3758440
—N
Ogt
ttcac
tatc
gaga
gtcg
gact
gcttt
tgga
tttac
gtgg
ctttg
tggg
atgc
ggta
tgaa
gatc
gttg
cattc
aagt
ttacc
gcct
ctac
ttcac
tgca
gaat
ctga
ctgc
cgtc
agat
ctgt
atga
c
149
2617
213
ITG
A9
YE
Sta
ttggg
ctgg
aacc
atca
aagt
gctg
aacc
ttacg
gaca
acac
ctat
ttaaa
actg
aacg
acga
agtg
atca
tgaa
cagg
cggt
acac
c15
036
7783
5C
RE
BB
YE
Saa
ccag
agga
gtta
cgcc
aggc
cctc
atgc
caac
ccta
gaag
cact
gtat
cgac
agga
ccca
gagt
catta
ccttt
ccgg
cagc
ctgt
agat
ccP
151
2332
491
—N
Oat
ccttc
taag
aggc
gcaa
gttg
gctg
ccttt
ccag
aatc
ctct
gattt
tcgt
tgat
ccaa
gggg
ctgg
aact
gagc
ttcct
ttttc
atgc
caag
tagg
gtcc
aaa
caac
agag
aaac
ataa
agct
cttc
ctttc
tccc
caac
ctca
ccag
ccct
tacc
atgg
ctgg
cttc
ccag
tgat
tcat
tcaa
ggca
aca
152
2932
426
RB
M16
YE
Stg
gggt
ccgg
ccat
ctaa
tgttt
ccag
tagt
tctg
ggat
tattg
cagc
ccaa
ccac
caaa
tattc
taaa
taac
tctg
gaat
attg
ggaa
taca
gcca
ccca
gtgt
gtca
aata
gttc
tgga
ctttt
ggga
gtgc
tacc
ccca
aata
tacc
taac
aattc
tgga
cttg
tagg
agta
cagc
cacc
aaat
gttc
caaa
tact
cctg
gact
tct
ggga
acac
agcc
acca
gctg
gacc
tcaa
aact
tacc
ccct
ttaag
tatc
ccta
atca
aagg
atgc
ccac
aatg
ccaa
tgtta
gaca
ttcgt
ccgg
gact
aat
acca
cagg
cacc
tggg
ccaa
gattc
ccttt
aata
cagc
ctgg
aattc
cacc
ccaa
cggg
gaat
ccca
cccc
catc
ggta
cttg
attc
agct
cttc
atcc
acc
accc
cgtg
gacc
ttttc
ctcc
agga
gata
ttttta
gtca
acca
gaaa
gacc
tttttt
agct
cctg
gaag
acaa
agcg
taga
caat
gtta
ctaa
ccca
gaaa
aaa
ggat
acca
cttg
ggaa
tgat
aaca
ttcaa
cagg
aagg
agat
agag
atta
ccgg
tttcc
tcct
atag
aaac
cagg
gaaa
gcat
tagt
agac
ctcc
ccct
gtgg
atgt
taga
gatg
tggt
tggg
cggc
ctat
agat
ccaa
gaga
aggt
cctg
gacg
gcct
ccac
taga
tggt
aggg
atca
ttttg
gaag
acct
cctg
taga
tata
aga
gaga
atct
tgtg
aggc
cagg
tata
gatc
atct
tggt
cgaa
gaga
ccac
tttgg
cttta
atcc
agag
aagc
cctg
gggg
cata
gagg
agat
tttga
tgag
aga
gagc
atcg
ggttc
tacc
ggtc
tatg
gtgg
tcca
aaa
153
3273
785
AD
AR
BYYE
ESS
taaa
gtaa
gcat
attg
tcaa
ccttc
ctcg
ctcc
tttca
agca
cctg
agtc
ctgg
catc
acaa
acac
ggag
gatg
acga
cacc
ct2
154
4015
409
TSP
AN
YE
Stg
agaa
ggct
ttgaa
gcag
tata
actc
taca
ggag
atta
taga
agcc
atgc
agta
gaca
agat
ccaa
aa6
155
2428
715
PHT
FlY
ES
ggcc
acag
cgttc
agttg
atgt
ggttg
tatc
ctcg
gtttt
ccta
ctga
cact
ttcga
ttgct
t
g15
632
6150
0N
OL
Cl
YE
Sgt
ctgc
caag
gtcc
caga
gcga
aagt
taca
g
157
3144
008
NB
NY
ES
caga
ctttt
gact
ggcg
ttgag
tacg
ttgttg
gaag
gaaa
aact
gtgc
cattc
tgat
tgaa
aatg
atca
gtcg
atca
gccg
aaat
catg
158
3527
425
PAR
P2Y
ES
atct
gtga
aggc
cttg
ctgt
taaa
gggc
aaag
ctcc
tgtg
gacc
caga
gtgt
acag
ccaa
ggtg
159
3836
942
—N
Ota
ccag
gagc
ccat
ggat
ggcg
tgac
ccag
cagg
cagc
acag
aatg
tact
tgcc
aagg
cggg
caaa
gagt
aaac
c
160
2746
889
—N
Ogg
agtc
aaac
gtca
gagc
ctga
gacc
tttga
tgaa
gggt
caca
tcgc
caga
gctc
agtg
g
161
2853
401
—N
Ogc
tgag
agct
agct
ttgct
ttata
atttt
gtac
ttact
aaaa
aatta
aata
actta
taaa
tgga
ggaa
aact
tgga
tcag
ttgat
agat
tttgg
ggga
gtta
gtga
aattt
cttg
aggg
gattt
tcaa
tcga
atgc
tttta
ttttc
tgtg
gcag
taat
gatc
tggt
tggt
ttatc
tcat
aaat
aact
gatg
ctgc
tgct
ttttg
tgttc
tttga
tgtc
tca
cctta
tgttt
ttact
aagt
aagg
acag
caat
ggat
tggc
ttctg
atgc
162
3593
678
USP
8Y
ES
tgtc
ctgc
gcaa
tgag
ccttt
ggttt
taga
ggga
ggct
atga
aaac
tggc
tcct
ttgtta
tccc
163
2466
363
—N
Otg
ataa
ttggt
gacg
acaa
ggaa
atta
agac
gattg
ctca
aaga
gatg
ca
164
2961
018
—N
Oga
tcttc
caca
caaa
gatc
aggt
gaag
aaat
ctat
gcag
ctat
acat
gcta
ggac
ctgt
tgta
cacc
atcc
ctgc
aa
165
3307
825
NO
ctga
ggaa
tctta
aatc
tggg
tata
aacg
tgaa
atgg
tttta
tttttt
cctta
ctgg
gtct
ccag
tcat
acag
tgta
aaac
actc
ctaa
cctc
tgta
ctttc
ctct
gca
gtgc
agtc
cttta
agaa
tccc
ttggt
ccga
tgta
acttt
gccg
ccac
t
166
3375
666
FTH
lN
Oga
cccg
cagg
gcca
gacg
ttcttc
gccg
agag
tcgt
167
3510
380
C13
orf2
YE
Sgc
agct
gcca
ccgt
ttctg
gaat
gaac
ctgc
tgaa
tact
gtcc
ttcct
gtgt
tccc
aggg
cagg
tctc
ctca
gccg
ttcac
acac
ctca
gcca
tcaa
tacc
aaa
3cc
caac
agtta
tcag
aacc
ccttc
attg
ccca
ctgc
acct
gtta
catc
catc
caca
gtac
aacc
acca
ctcc
tgttc
cttc
cattt
tttct
ggcc
tagt
gtca
ctgc
cagg
tcct
tctg
ccac
tcct
accg
cagc
cact
ccta
cccc
agga
ccta
cacc
acgg
tcca
ctct
tggt
tcca
gtga
agca
tttgc
ttcta
cttc
tgca
ccttt
cact
agcc
tccc
ctttt
ccac
cagc
tcttc
tgct
gcttc
tacc
agca
accc
aaat
tctg
cttc
attg
tcat
cagt
ttttg
cagg
gctc
ccttt
gccc
ttacc
acca
acat
ccca
aggc
ctat
ccaa
cccg
actc
ctgt
aattg
ctgg
tggc
tcta
ctcc
cagc
gttg
ccgg
tcca
cttg
gtgt
gaac
agtc
ctct
tttgt
ctgc
gtta
aaag
gttt
tctg
acat
ccaa
tgac
acca
attta
atca
actc
ctct
gcttt
atcc
tctg
ctgt
caca
agtg
ggct
ggct
tca
168
4026
959
HC
FCl
YE
Sca
agcc
agcc
aaca
agcg
gccc
atgt
cctc
169
2888
352
RN
F44
YE
Sag
agca
gctc
ccgt
cgta
ccgc
tttaa
cccg
gaca
gcca
tcag
tcgg
agca
gacg
ct
170
3320
634
USP
47Y
ES
gaga
aattg
agcg
caat
acat
gcaa
g
171
3752
442
—N
Ota
ggtg
catg
acac
tgct
cagt
ttgat
gtgc
taat
ctga
ttcca
gtaa
agag
aattg
ctga
tttaa
tgttg
aggc
ccga
tcttc
caca
ggtc
ctaa
atgt
ttccc
aa
aata
catg
gata
gcat
acga
ctgg
gtttt
tttaa
aatta
tccc
aggt
caag
catg
agta
tctc
172
3188
731
NE
K6
YE
Sca
ggac
tgtg
tcaa
ggag
atcg
gcct
cttg
aag
173
3303
115
—N
Oct
ggac
caac
cact
acttc
cgtg
tttca
agttc
atttt
tagc
acat
gaac
atga
tcac
tgtta
gcca
gtga
agta
gcttc
tgac
taga
tcat
tgat
taga
tctc
aca
aaaa
acca
aaaa
tagt
atat
ggaa
acca
agat
gaga
gcaa
aaga
aaac
gtta
cctg
agta
ggca
gcta
aaac
atat
tttga
catta
ttttta
agca
gcat
tcat
OO
aagg
ttcac
gga
174
3820
685
ILF3
YE
Sgg
ttctg
gcat
ttatg
accc
ttgtg
aaaa
agaa
gcca
ctga
tgct
attg
ggca
tcta
gaca
gaca
gcaa
cggg
aaga
tatc
acac
aga
175
3882
554
—N
Oat
aatg
ggat
ggca
aata
gtct
cccc
agcc
caga
gttc
aata
tcct
cctg
gttg
ctgg
ctga
gctta
gact
agaa
ttatg
ttgag
tttcc
agga
cacc
cattt
tacg
aatg
ttattt
tgaa
ttgcc
tgttg
tatg
tgat
aata
176
3054
154
NO
ctac
acgc
tgcc
gagg
aaaa
cgca
acag
acat
ggca
ggtg
cctc
acga
caag
aaac
actta
caaa
taga
atca
aaac
tgat
tcaa
gtca
taca
177
2624
472
CA
CN
AY
ES
atcc
gcgt
aggc
tgcc
acaa
gctc
atca
acca
ccac
atct
tcac
caac
ctca
tcct
tgtc
ttcat
catg
ctga
gcag
cgct
gccc
tggc
cgca
gagg
accc
cID
atcc
gcag
ccac
tcct
tccg
gaac
acg
178
3652
530
POL
R3
YE
Sga
ctgc
tgtg
taag
ctat
ggtg
gcat
gtgg
tacc
ttaaa
ggga
cagt
acag
tcttg
aE
179
3980
537
AR
R3
YYEE
SSac
tacc
acgg
agaa
ccca
tctc
tgtc
aatg
t
180
2808
239
NO
caca
ggtg
aattg
tctg
ggaa
gtca
ggat
gttg
caca
gaag
ctat
gcaa
gtca
taga
ttaga
aaag
caag
aatc
caca
ttcat
gcttt
ccag
cata
gatg
agga
gata
ctttt
tctc
cacc
ttctta
gattc
agtg
gctg
ggca
gaga
gtgc
tgtg
aattc
atct
gaca
acag
gcag
atta
acac
gaga
aaag
gaat
ataa
gtttt
ttat
ttttta
tttttt
tattt
ttttg
ttttg
agac
agag
tgtc
actc
tgtc
tccc
aggc
tgga
gtgc
agtg
gctc
actg
caag
ctcc
cacc
tccc
gggt
tcac
acca
ttctc
ctgc
ctca
gcct
ccca
agta
gctc
agac
taca
ggcg
cccg
ccac
catg
tcca
gcta
atttt
tttta
tttttt
agca
gaga
cagg
gtttc
acct
ccca
aagt
gctg
ggat
taca
ggca
ttagc
cacc
gcgc
ccag
ccaa
gtttt
ttaat
gttta
atct
aatg
tgca
tgga
ggca
tcat
agaa
agaa
ctga
atat
ccaa
aagt
gtgg
tgaa
atttg
agac
tttag
atac
catc
ttaat
gata
gtgg
cagg
aggc
agac
aaat
ccta
ggga
gaca
aggg
tggg
tccc
tggt
gaaa
cccc
acct
tcaa
acca
aaga
cagt
ttaaa
gcat
gaaa
gcca
agct
acaa
gtct
ccag
taaa
tcca
tgga
ccgg
atta
agaa
tctc
tc
181
2811059
—NO
ccat
cagc
ttgtg
actc
atgg
ctta
gctg
gtct
caat
gggg
aaca
aacc
agag
agtc
tatg
gatc
aaca
caaa
gcta
taac
tgc
182
2858392
___
NO
ggcc
cagc
ttcat
ggat
atac
aaac
tata
cattc
acat
gggt
tcca
cacc
taga
tggg
ctct
taca
tcat
gtag
ctgg
tcct
acct
ggga
gaga
gc183
2779739
___
NO
tctg
ttcac
ggag
ctta
gttc
tact
tcct
ga184
3298972
___
NO
gtta
acta
tagg
tagg
ggac
agag
aaag
gagc
agag
ttaat
gtat
gcta
ttgaa
gctg
agttg
ctat
cagt
ctga
acta
gact
ttgaa
ctca
gcat
g185
2933580
TULP4
YES
gaaa
ctgg
ccac
gtgc
gatg
cgga
cgga
ggca
tattc
gtgt
ggat
tcag
tacg
aggg
cagg
tggt
ctgt
ggag
ctgg
tcaa
cgac
cgcg
186
3528128
TOX4
YES
caac
tagg
ccaa
acca
gtac
agct
acta
tcc
187
3528025
NO
cggc
gcta
cgtta
acttc
gact
atat
taat
gagc
atca
cgat
cttc
atttt
catg
atcc
cgta
gtcc
tcgc
ggtc
gtag
aaaa
tgaa
agga
aagc
agcc
gtta
aO
Oca
cagg
aaga
gcga
aaaa
agga
cattt
tccc
tggc
gatc
gtgg
c
188
3581
134
AK
Tl
YE
Sga
tgat
ctct
ccac
ggta
gcac
ttgac
ctttt
cgac
gctta
acct
ttccg
ctgt
cgcc
ccag
gccc
tccc
tgac
tccc
tgtg
gggg
tggc
catc
cctg
ggcc
cct
ccac
gcct
cctg
gcca
gacg
ctgc
cgct
gccg
ctgc
acca
cggc
gtttt
ttta
189
3676391
—NO
tggg
ccat
ctcc
tcgg
taca
ggtc
agct
gccg
ccgg
cgga
gctc
ctca
tctg
tctc
ct190
2319003
—NO
ccaa
gagc
ccgg
aattt
tcat
taca
acttt
caaa
gaga
gcaa
gagg
agga
ggga
aaaa
gattt
caca
aaag
catta
tcaa
ggcc
ccaa
ccca
ggat
gccc
tgct
gtac
aacc
aaaa
tttgt
aaga
ggtc
agcc
tttca
ggga
gtca
aaaa
ccca
gcat
gtgg
a
191
2858313
—NO
ctgc
agag
caaa
atat
agag
gcac
taat
tttgt
tata
tttgc
agcc
caca
tact
gtaa
tacc
agat
ttgct
cctg
ttgttc
t192
3048750
—NO
cggg
aaac
ttcgg
aaga
cagc
tgtg
cctg
gctc
tgtg
gctg
catg
cagt
gcttc
acttg
gcca
gcag
aggt
cagc
tgtg
ccga
gctg
cccc
agcc
atga
gaa
gaga
agcc
tgcc
cttg
ctgg
cagg
tggc
tatg
gccg
gccc
agag
ccttc
ctgc
ccag
ctcc
tgca
gccc
tgct
gcct
ggga
tcag
gctg
ggag
atgg
gcct
tcct
gacc
gcca
gcct
tcct
ctcc
ccga
gcac
acgc
acat
gtag
attc
gggg
ggaa
gctg
cctg
ctct
tcct
taga
ggag
ccgg
ggca
gcta
tctg
ctg
gtcc
ctttc
tgaa
caac
tgttg
a
193
3360
848
APB
Bl
YE
Stg
agga
gggg
acgt
tgac
cttc
cca
194
3387
192
CW
C15
NO
gcca
agag
tggt
aatg
gcgt
ctgt
atga
tcttc
ggag
cctg
ctgc
atcg
gacc
tcgg
c19
538
9388
0—
NO
actc
cgca
gggc
tggc
acttc
ctga
ggga
gtgg
gggc
tcag
gcttc
agac
ggac
tttat
taaa
atgt
gtag
tttgt
gtga
atca
tttca
gtca
aaag
tttca
gct
atac
tcgg
ccgt
taag
acaa
ctta
atga
196
2393
792
KIA
A05
NO
ggaa
gggc
tgcc
atag
ttgcc
gcag
t62
197
2395
248
RE
RE
NO
tcac
agca
gcga
gcat
ccag
ggttt
gcag
ggac
gatg
ttaca
gact
ctgt
tttct
gcct
ggcg
tttca
cttg
tgtc
tgct
ccta
gcct
gtgc
tctg
ccag
cagc
acag
acat
ctgc
tcca
tcag
acct
cttc
cattt
tgca
cagg
gagt
gcag
gagg
tgaa
tgttc
acttt
ctgt
tctc
cagt
gtca
ctgt
tctg
tttcc
acgg
gatg
gaaa
gcgc
atgg
gcct
gtgt
ccat
tgta
gattt
ccttc
taga
tttct
gtgt
acac
acac
ttgat
tgttc
tgga
tgaa
tgtc
tttttt
aata
ctcc
gaaa
atttc
atca
tcta
agaa
aatg
attc
cata
caaa
taac
tcag
caca
caag
tgac
ccag
gaca
tatg
cctg
ccaa
aggg
atgt
gtta
gaag
gctg
ccttc
tcat
gcgc
attg
tcac
ttgg
atct
tgtg
gtga
ggac
ggcc
ccat
ctttc
ttgcc
acag
attg
aggc
cact
tttga
gcaa
ggga
gatc
ctgg
agtta
agac
aggt
gttg
gggg
cagc
ctgt
atttt
accc
tagg
ggca
ggtc
tgca
tggt
gacc
ccac
attg
cact
ggta
aacc
atttg
agtc
ccac
tcttc
atcc
tgga
agtg
ggaa
ctgg
agtc
ccac
ccac
agtg
cattc
agaa
agca
tgct
gtgt
gggg
gctg
cttc
tcag
gagg
ccag
gccc
ttctg
agcg
gaac
cgtc
ctgg
agag
agcc
tgcc
ctcg
tttcc
aggc
tgca
gccg
taac
gcac
tttc
198
3403
226
UN
Q29
NO
ctca
ccct
tgag
gccc
gggc
acttc
6319
939
2095
6—
NO
ccca
aatg
tgag
ttgca
tcaa
attc
ggta
aatg
taga
gtat
gtttg
ttttc
tctg
taca
caat
actg
acgt
cagt
gcta
gtgc
ctgc
ctaa
actc
agac
acca
gta
gatc
ctttt
ccag
a
OO
200
3970
256
RE
PS2
YE
Sttt
gaaa
agta
ctat
caat
gaag
cctta
ccaa
agga
cgtg
tctg
agga
tcca
201
2724
115
—N
Ogc
aggc
ctta
ctta
tttcc
acag
gacc
ttata
agct
cata
cact
ctat
tgct
tatta
cgtta
ctgg
aatc
tatc
taag
caga
aaga
ggct
ggcc
cacc
ttctg
ttat
ctat
gtttc
tata
ctct
ttgga
cttg
cta
202
3351
955
MIZ
FY
ES
caac
caca
tgcg
ctttc
gtca
cagt
gagg
accg
gccc
tttaa
atgt
gact
g20
331
3323
4PL
AT
NO
ggaa
ttctg
cttc
actc
gctta
acat
atac
acaa
cacc
tgta
acat
acaa
ggca
atgg
gcta
ggtg
ctcc
agac
cggg
aaaa
ggag
ggac
agga
atgc
ttggt
ctga
tggg
ctaa
tatg
gcat
ttaga
gaag
tacc
aagg
taca
gtgg
agcc
ggtc
acaa
aagg
gcag
acttg
t
204
2723
089
NO
tttgt
ggct
gtta
atag
gaag
gatg
ctgg
t20
531
0265
2N
Otg
tggc
aaac
tacc
acgg
caga
tgttt
acct
atgt
aaca
aaac
tgta
catc
ctgc
atgt
gtac
ccct
gaac
ttaaa
aagt
tgaa
aaaa
aaaa
aaaa
agaa
aca
aata
ggct
gggc
atgg
tggc
tcaa
tgcc
tgta
atcc
cagc
acttt
ggga
ggcc
aagg
tggg
agga
ttgct
tgag
gtca
ggag
ttcaa
gacc
tgat
atag
tgag
acc
206
3392
531
—N
Ottg
gact
tgaa
tgac
tagg
agaa
tgac
tact
ctaa
tgaa
caaa
aatg
taaa
tgtta
tatg
tggg
tttag
taag
ggaa
tgca
atga
acttg
gtttt
ggat
atgt
taaa
cttg
agtta
tcac
tgga
acat
tcag
gtag
aaat
gttta
gaaa
gcag
tttga
aata
agag
tttag
taga
aagg
ttgag
agca
gcgc
tgca
gata
caga
tttga
aagt
catc
tgca
aaga
gatg
tgtc
ttagc
gttc
tc
207
4015
009
—N
Otg
gagg
cggc
ccta
cacc
tggc
aatg
gagg
t20
825
9617
0IN
O80
DY
ES
gatg
agttg
ccgg
atga
cattg
ccaa
tgag
atca
ctga
cattc
caca
tgac
ttgga
attg
aacc
agga
ggac
ttttc
agat
gtcc
tgcc
acgg
ctac
ctga
tgac
ttaca
aga
209
3591
676
—N
Oaa
aacc
tgga
atgc
tgta
ggac
atgt
accc
actg
aggc
ggat
gcaa
ggag
cgat
ggag
agga
agga
210
3680
412
NO
tttac
ccct
tctc
ggtg
acat
ttgc
211
3906
169
CH
D6
NO
ctgc
ttgac
ttact
gggt
gata
ttggg
ctac
aact
gaga
ctca
ctttg
ctcc
tgtg
taaa
gtgg
ggca
acag
catc
tacc
acgg
aggg
ttgaa
gattt
acgg
aga
tcat
gcta
agac
agttt
gttc
caaa
ctca
aagg
aaat
cattt
gcaa
acaa
caga
aaag
gtca
gaaa
ccgt
atac
ctgg
ccaa
gaga
gatc
ccttt
tacg
ctttg
tttaa
tgttt
catg
ttgta
ttttg
ggttt
tccc
attta
ctgg
attc
tgag
ctcc
aaaa
cagg
aatc
acat
tttat
gtttt
ctta
tttta
aaag
caat
aata
ataa
ctag
tac
ttattg
agtg
ccta
ctgc
ttacc
atgt
tcaa
ggct
ttaca
tgat
tatc
ttgttt
gttc
ctca
caac
cact
ctga
agat
gggc
tttgt
tattc
ccat
tttat
aggt
gag
gaaa
ataa
ggct
ttaaa
gcat
taaa
taat
ttccc
caag
atta
tcca
gcta
gtaa
gtga
caaa
gcag
gaaa
ccaa
acca
agga
ccttt
ttctc
tagg
ggct
gag
a21
228
0310
2N
Oct
atat
gtag
ctac
cctg
tggc
ctcc
ccct
aagg
taac
cact
ttact
tttta
gtag
tgaa
gaat
gtct
aaaa
tata
ataa
aggt
atat
ctag
aaca
caaa
cctc
caag
cagg
aaca
accc
atta
tctc
tcct
ttgca
tgcc
ctgt
caat
ggct
tcaa
ggtc
aaag
ctag
ctaa
gtct
ctga
gtgt
gtag
ggag
ctga
catc
ttggt
gaaa
agtc
cttc
agta
catg
ctag
atca
tagc
tgac
ttgca
ctat
ccttc
caaa
ctttg
gaat
atct
ctct
ggta
ttgga
ttggt
taca
atgg
aaac
agaa
agat
tctg
atca
taca
aaat
aaag
atttt
cattt
gtga
gctc
aacc
attg
gatc
tgtc
atct
tcca
ttagt
agaa
aaaa
tgag
gaaa
tcaa
atac
tttgt
tttgt
tttta
acca
cgca
gag
aagg
aaga
gagt
agag
tgtg
ggaa
attg
atat
atct
gcct
tact
gccc
cagc
ccct
ccct
ctgg
agtg
tgca
agtg
gtca
gaag
tcag
caaa
ttact
tttc
catc
atat
ttggc
ccca
aact
accc
tggt
gagt
gagg
agaa
aggg
aatc
atttg
ataa
ggttt
ggaa
aata
gcat
ctgt
ttttc
ttagt
gtttc
tgga
tgaa
tttat
cagg
catc
cggt
aattt
gtac
tttct
gttg
aaaa
ctat
aata
gaat
accg
tgca
taga
catg
aaac
ctttt
ggta
ttagt
caga
agct
acat
actg
actg
ctgt
tgg
agaa
gacc
acat
ttaca
gtca
aggg
taga
atca
tctg
aaat
agttt
cccc
tgga
aaaa
aaaa
aaac
atgg
ccat
atat
gcct
acat
agat
ctcc
tcttt
gcttc
atg
aattt
ggaa
ctaa
acct
gggt
gttta
gata
attc
cagt
tcta
ggat
tttct
cacc
ttgag
tttat
tccc
ctta
ttctta
ctag
ttcag
tttgg
ccac
tgta
cata
tgga
gctta
ctag
aggc
aaag
cact
ctgt
gagg
ggca
atgg
ttata
ca
213
4025
793
—N
Ottg
tgtc
tgct
ccaa
gtgc
gaac
cctc
catg
cact
gcag
agaa
cacc
agaa
ggag
agga
gcac
tacc
tgag
cttc
tcct
tccg
ggtc
taaa
ggtg
gaac
ttgct
ttgca
gcac
ccag
cattt
cact
gtac
tgat
tcct
tttttt
ggat
cacc
acct
cctg
cgttt
agca
gccc
atat
gctg
ggct
cact
cagc
tata
cact
gcttg
214
2616
236
FBX
L2
YE
Saa
ttttg
gagg
ctgc
ccga
tgct
ccca
tttga
ctga
cgca
ggttt
taca
ctttt
a21
529
5983
2L
OC
lOO
NO
tccg
gtgc
ttacc
tcta
actg
ataa
aact
agct
ccaa
ccca
ccac
cctc
cact
gact
tcct
tcgc
actc
agcc
cact
cgca
cccg
ggtg
aata
aaca
gcct
tt12
8757
gttg
ctca
cact
tagc
ctgt
tcaa
gttg
tctc
ttcaa
ttaga
tgct
cgca
taac
atttg
gtgc
cgaa
agcc
cggg
atag
ggga
actc
ctcc
ggca
gacc
tctc
ctct
atcc
tccc
ggta
ccca
cgttc
tccc
atgc
aaga
gact
tccc
tcgc
cctc
agga
cctc
agac
cagc
tccg
cgag
cact
ccgg
cctc
tgtc
tatg
ga
216
3072
195
NO
ttagt
gtcg
tcag
gaga
ctcc
ctcc
atca
gaat
tact
cagg
ctga
ttgttt
tgag
tcct
ggga
ccca
gttg
gtat
ccag
aatc
ttggc
ttgttt
gcct
agga
acc
217
3757
471
AC
LY
YE
Sgg
gtgt
caac
gagc
tggc
aaac
tatg
ggga
gtac
tcag
gcgc
cccc
agcg
agca
gcag
acct
atga
ctat
gcca
agac
tatc
ctct
ccct
catg
accc
gaga
gaag
ca
218
3986
531
NO
gagg
gagg
aatta
cgtg
gtga
agga
tgaa
gtga
acag
cggt
gcttg
gcat
ataa
gcca
gccc
tcaa
acaa
g
219
2696
767
MSL
2Y
ES
cttc
ctgt
agct
ggtg
caaa
gact
atga
gcag
tttga
ggaa
aaca
agca
gtta
agca
tcct
agtg
aact
gcta
caaa
aaac
tatg
cgag
tata
taac
acag
acta
cact
ggca
cggg
atat
aata
gaag
cagt
tgac
tgttc
ttctg
atat
tttgg
ctttg
ctta
atga
tgga
tcat
tgttt
tgtg
agga
gaca
gaaa
aacc
ctca
gatt
catc
cttta
ctttg
tgttt
gaca
cattc
ccct
ttacc
ttcaa
cctc
agaa
ccca
caac
tgat
cctc
aagc
tagt
ttatc
tcca
atgt
ctga
aagc
accc
tcag
cattg
ctat
tggc
agttc
tgtta
tcaa
tggt
ttgcc
tact
tata
atgg
gcttt
caat
agat
agat
ttggt
ataa
atat
tcct
tcac
ctga
acat
tcaa
atac
gattg
acgt
atgt
aata
ctgt
tgac
ataa
aaac
tgag
gatc
tgtc
tgac
agcc
tgcc
accc
gtttg
tgac
acag
tagc
cact
gact
tatg
ttcca
cagg
cattg
atat
ctgc
agttt
cag
tgaa
gata
taaa
acct
ggag
actc
tctg
ttact
gagt
gttg
agga
agta
ctcc
gcag
ctta
gaaa
ctgt
ttcaa
atac
agag
gtct
gttg
ccct
aattt
gcag
ccga
acttg
gaag
ccac
tgta
tcca
atgg
acct
tttct
gcag
ctttc
ttccc
agtc
tctta
gcca
taat
gtttt
tatg
tcca
ccag
tcct
gcac
ttcat
gggt
tatc
atgt
acag
cagc
aact
ccga
agat
agca
aaat
tgaa
taga
aaac
gatc
caga
tcag
agag
tgac
agtg
agaa
agttc
agcc
acttc
caat
ttcta
ccat
tatc
cga
ggcc
caac
actg
gggg
catc
tgct
cctg
tgac
agtg
aaac
ggga
gagc
aaaa
tttct
cttc
aacc
tata
gcaa
ctgt
tccc
aatg
gagg
caca
acac
ctaa
aatc
agca
aaac
tgta
ctttt
atct
acta
aaag
catg
aaaa
agag
tcat
gaac
atgg
atcc
aaga
aatc
tca
220
3188
391
NO
cagg
ggta
cacc
caca
tcta
ttgta
ttaga
caaa
tcac
agag
ggat
tgca
acag
aaca
caga
tttttc
tcac
tgttt
tttct
ctgg
ttgat
cggg
atgc
atta
tcc
acca
gaaa
acac
tgta
gacg
actc
actc
acca
ggaa
aaga
gcat
atgc
cagt
tgga
ataa
ataa
aggg
gaag
agta
agga
agag
gctg
taga
atta
tttta
taaa
ttatc
aagt
ctta
tgag
ccag
gtgc
221
3846
196
—N
Otc
tggc
caat
gcat
atta
ctta
aggg
caat
gtcg
tggc
cagc
tgtg
gtgg
tctg
ggct
ctcc
ctct
gtat
cgcc
tggg
gagg
ctgc
tgag
gtga
ctttt
tgga
a00 U
lga
aaac
acgg
gatg
agtg
tatg
atgg
tggc
tgtg
gaga
ccac
ccaa
aatc
ccgg
ggttg
gggg
caat
agtg
aatg
aatg
ggac
catc
tgcg
tggg
tccc
tac
acga
gatg
cttg
222
3070
747
NO
gagc
tcat
tctg
cgttt
ctta
ttgct
tg
223
3821
522
___
NO
gtag
ataa
ccat
ttgtc
gctg
ctttt
tctg
ttaag
tgttc
acca
cttg
tatc
attg
gctg
gcaa
tacc
tggt
ttgtc
atag
ttt
224
3875
001
___
NO
agct
cctc
ccca
gaat
acca
tgct
c
225
3279
450
C10
orf9
YE
Scc
aaag
tttaa
ttaat
ctcc
tgct
gacg
ggac
atgc
tgttt
ctaa
tgta
tggg
atgg
tgat
agag
a7
226
3541
293
PLE
KH
YYEE
SStc
tttct
ctgg
cctg
gtct
acaa
gaat
gtca
c
Hl
227
3547
709
TT
C8
YE
Scc
tgga
acgt
ctttg
aaac
tccc
tgga
acta
atca
gaca
ggag
ggcc
tagc
c
228
3875
261
—N
Ogc
caac
cttg
gtttg
tgca
gttta
agga
aaaa
agcc
atct
ccat
aaca
tgga
agtg
caag
atga
agca
gcag
gcac
tagt
gggg
aagc
tgca
gcaa
gtta
tac
agaa
aatc
tagc
taat
gatg
aggg
tggc
taca
ctaa
aaac
agat
tttca
atgg
agac
aaaa
cacc
cttc
tattg
gaag
aaga
tgcc
ctct
aaag
ctttc
ata
ggta
gaga
gagg
tcag
tgca
tggg
cttg
aaag
aaca
ggct
gattc
tcttg
ctag
aggc
caat
gaag
ccag
tgag
ttta
229
2584
791
NO
cact
ggaa
gaat
atag
gttg
gctc
agac
agtc
cata
ggaa
ggct
aaat
aacc
acgc
tcag
gaaa
tggg
cctta
gaga
gtgg
gaag
agaa
aatg
tatc
taa
actta
tatc
tcag
taac
agct
tgat
agga
cattg
tcac
tagc
agtg
ccac
cact
aggt
cctc
a
230
3204
563
KIA
A15
NO
ttgcc
agca
ctct
gaac
ccat
gcgg
gcta
atga
cctg
ccca
tcct
g39
231
3356
165
APL
P2Y
ES
cgag
actc
tgga
tgtta
agga
aatg
atttt
caat
gccg
agag
agttg
gagg
cctc
gagg
aaga
gc23
236
5089
1—
NO
taca
ctcc
ctca
ggga
cagt
cttg
ccat
ttccc
gtttt
tcat
tctta
taaa
caat
gccc
tggt
gcag
gggt
cagc
aaac
ttttc
tata
agga
gtca
gata
atct
ttttt
tttttt
ttgta
ttccc
ttttg
gcaa
tttgt
ttaat
tgag
ataa
aattc
acat
aact
taaa
actc
accc
ttttg
gtgt
atac
gattc
agtg
ctttt
tagt
atat
tttca
agat
tgt
gcaa
ccaa
cacc
acta
tcta
attc
caga
acat
tttta
tcac
ccca
aaaa
ggag
ccct
gtgc
caat
tagc
tatc
a
233
3829
699
GPI
YE
Sac
tgaa
gccc
ttaag
ccat
actc
ttcag
gagg
tccc
cgcg
tctg
gtat
gtct
ccaa
234
3971
251
CN
KSR
YE
Scc
tctta
tacc
taga
agtc
ccac
aagc
agcg
ttgcc
acgc
cttc
cagc
acca
tcag
taca
ccca
ccaa
aaga
gaca
gttc
tgcc
ctcc
agga
tc2
235
2359
255
...N
Oag
ttgtg
gagg
catg
tttat
aagc
agat
ctga
gtttt
gcca
cact
ggaa
tttat
ttat
236
2811
000
...N
Oct
gggt
ctag
gaag
ccag
gtac
tttgt
gctg
tttca
acaa
gagc
tcttc
tgct
ttaac
tagt
tt23
733
9198
6...
NO
gctg
gaga
agttc
gtct
ggga
caaa
gagc
tctg
gcct
catg
cctg
tggg
tcca
gcag
tcaa
actg
ggac
agac
tatg
tgaa
aagg
gacc
cac
238
3888
361
...N
Oag
cagc
ctcc
cgct
tagc
ggag
agtg
ggat
gagc
acat
gcgc
ggcc
gaca
ggga
agag
tgaa
tcca
gagc
agaa
gtca
aggg
caag
actc
gtgg
gggg
ggga
agaa
aggg
acag
aagc
cagc
tccc
agca
ataa
aggt
cggg
ttattt
tgttc
ttttg
tgac
atgc
ccta
tatc
OO
239
3897
502
NO
tctc
ctgg
ggac
ttaca
tagc
tgttt
ggttt
ttcaa
cttg
attg
aaga
aagt
tattc
gggg
gatc
tgtc
caat
tcac
acct
tctg
ttccc
tgat
caaa
tact
ttagg
gtca
aaca
taag
gtttt
attg
atttt
tgct
caag
caaa
gaat
taac
atttt
tgat
aagg
aact
gtaa
tggc
aatta
taag
gtaa
tatg
gtgt
atgg
taat
atta
tgct
gcc
agag
gaaa
cattt
gact
atga
ttaaa
ttcaa
gccc
agta
ctgg
gtag
ctct
agat
aatta
aaga
taat
tttat
ccat
tcaa
gtcc
cctc
tgtta
agct
gcac
attg
aca
240
3235
381
DH
TK
DY
ES
acag
tata
ttgtg
agca
tggt
cata
aagc
tgcc
aaaa
tcaa
cccc
ctct
tcac
cgga
caag
ccct
gctg
g1
241
3286
657
NO
tcca
gtga
ggcc
acat
catg
acttc
cccc
atga
agac
atcg
aattc
acgg
ctgc
cttc
acct
ttgtc
ctgt
caat
gtct
gatc
tttcc
tttca
atta
cagc
acga
ggc
tggg
gttc
ca
242
2714
224
MY
L5
YE
Stg
gacc
agat
gttc
cagt
tcgc
ctcc
atcg
atgt
ggcg
ggca
acct
ggac
taca
aggc
gctc
agct
acgt
gatc
accc
acgg
ggag
gaga
243
3331
851
GL
YA
TY
ES
tgaa
agta
gagc
attc
gaga
gcac
tcct
cttg
gtta
cgga
agat
attc
tgaa
gctc
aatg
cctc
cagt
aaaa
gcaa
gcttg
gaag
ctgg
Ll
244
3356
559
NO
gcca
tggt
tctc
ttgtg
gtca
gctc
ttgttc
taat
cctta
gcgc
ccat
ttgga
aacc
agtta
tggg
gaaa
gcac
aatg
tagg
actc
ttggg
actc
tgca
cctg
actg
gctg
gcttt
ccaa
gatc
actg
agat
ggct
cttc
ctcc
agca
taga
aattg
atgg
ctgg
ttatc
caga
cagc
aacc
tcttt
tgcc
catg
tgct
tggt
gctc
tgtc
atgg
cggg
aaaa
gtaa
ctat
gcac
agat
catg
atta
tggt
atta
gattg
ggta
caca
tgaa
attg
ctag
tatg
cacc
tattt
tgat
gtat
gaaa
atat
cattt
tcat
gaga
ttcat
ccta
attta
tgct
atgt
atgg
ggcc
tgat
tgcc
ctaa
cagc
tcat
c
245
3636
325
NO
atca
gaca
ccgc
ctat
tcac
cagc
tcat
ctat
aaaa
cccc
ctgc
atttc
accg
caga
actg
gaac
ctat
t
246
2539
613
NO
tcca
ttcttg
cgct
taaa
aact
gtac
ctaa
ctcc
ctcc
cata
gcag
acag
aata
ggat
aggc
ttcttc
ccta
cctg
gtgt
tcaa
gttc
tacc
actg
ccct
cctc
cc
ttccc
ccat
ttacc
tctc
taga
atta
ccag
gtat
ctat
gcat
acat
gaat
tctc
tgct
tcca
ttaga
tttac
ttgtc
attc
ccca
aata
cacc
atgc
acttg
cata
ttag
caaa
gcca
agttt
gtct
ttcac
aaat
gcag
aggt
taaa
taag
agtta
agga
tgaa
ggta
aacc
aaac
tttat
ttata
tact
atat
tttac
caaa
tatg
attta
agga
tct
ctag
gcag
tgttg
tctta
247
2860
647
—N
Ogc
aatg
tgga
ccag
acca
tcga
caac
ccct
atgc
cacc
tttgt
caag
atgc
tacc
ggat
aagg
attg
cctc
tacg
ccct
ctat
gac
248
3143
676
MM
P16
YE
Scc
cgat
gcgg
tgta
cctg
acca
gaca
agag
gtag
ctcc
aaat
tt
249
3492
475
___
NO
tggg
ctga
ccac
agac
tttac
caca
aa
250
3698
355
ZFH
X3
NO
gtcc
gagc
ctcc
gtac
tggg
tgca
atga
aagc
251
3364
564
—N
Oat
tggc
gtgg
ttcttc
ccca
ggat
acct
tata
gagt
ctag
tgaa
ctttt
cctg
tttgt
ggaa
acaa
tcag
gctg
ggttt
aaat
gagg
acttg
tctc
tattt
ggag
cat
ttgtg
agcg
tgta
agga
aatg
agtg
ttt
252
3101
645
AD
HFE
YE
Sag
ggag
ctttt
gatg
ccta
tgttg
ctgt
cggt
ggtg
gctc
tacc
atgg
acac
ctgt
aagg
ctgc
taat
ctgt
atgc
atcc
agcc
ctca
ttctg
atttc
ctag
atta
t1
gtca
gtgc
cccc
attg
gcaa
ggga
aagc
ctgt
gtct
gtgc
ctct
taag
cctc
tgat
tg
253
3015
159
ZK
SCA
YE
Sag
ttcat
ggac
ctga
gatg
ctcg
caag
gggg
atgg
tgcc
tctg
gatc
cagt
tcag
gagt
cctc
gagc
tttga
ccttc
atca
cgag
gcca
ccca
gtcc
cact
t
Nl
caaa
cattc
gtct
cgg
OO
254
2686
624
IMPG
2Y
ES
atac
ctttc
aagc
tgca
tggc
cctc
agca
gatg
aatc
catc
acca
gcag
tattc
cacc
acttg
atttc
agct
ctgg
tcc
255
2897
644
NO
atgt
tggg
tccc
acct
taca
tttag
tttca
tgtc
ccct
tagt
ttcct
ccaa
tctg
tgac
agttt
ctta
atttt
ttcttg
tcttc
tgtg
acct
tgac
acttc
tgaa
gaaa
aca
tttcc
ttgat
atttc
tgat
gaaa
atga
ttatta
tgta
gaat
gtcc
ttcaa
cttg
aattg
gtct
ttttc
tcat
gatta
gaat
gagg
ttctg
ggttt
ttggc
aagc
agag
aac
agaa
gcga
tact
atat
ttttc
tcag
tgcg
tcat
atcg
gggt
catg
atgt
cagt
gtgt
ctta
tact
ggta
cact
aacc
catg
tctg
ctag
gtttc
tttcc
aata
cagt
ca
256
3909
740
—N
Ogg
tgga
gcaa
gagc
cctg
ttggc
tgat
gg
257
4049
026
___
NO
ggca
caga
aggg
caaa
ttccg
caca
ttctc
atttg
tggg
258
2321
582
___
NO
atgg
agcc
agaa
aaac
gcgt
gcag
cagc
tcat
c
259
3304
498
AR
L3
YE
Sgg
cttg
ctct
caat
tttgc
gcaa
gttg
aaaa
gtgc
acca
gacc
agga
ggtg
agaa
tact
tctc
ctg
260
3533
002
___
NO
tact
gaat
gtgc
ttggt
tttgg
acag
261
3568
039
PPP2
R5
YE
Sct
cttg
ctgc
cgct
gcag
atcc
agtc
ttcct
ccE
262
3764
440
RN
F43
NNO
Otg
cgga
atttc
tcgg
cacc
tacc
tgta
gtat
gggg
cact
tggt
ttggt
tgca
gagt
aaga
aggt
ggaa
gaat
gagc
tgta
cttg
gtta
agca
gttg
aaac
ctttt
ttgag
cagg
atct
gtaa
aagc
ataa
ttgaa
tttgt
ttcac
cccc
gtgg
attc
cagt
gggc
ccga
ca
263
3790
080
NE
DD
4YYE
ESS
ttggg
aaag
caga
taac
cctg
aatg
acat
gga
L26
426
5800
5NN
OO
gtat
tatc
acat
gact
gcca
ggtg
c
265
3029676
—N
Ogt
ctgt
gggt
ggat
agat
ccat
taat
ggat
aagt
aaat
gagc
aatg
cttc
ctgg
gttg
aggt
tttgg
atta
ctgt
ggtc
atgc
actg
cattg
gaga
tgag
gtgg
aaa
aacc
atc
266
3390272
—N
Oga
gtgg
gcac
tagg
aaac
ggat
tcaa
agag
gcga
tgaa
taat
ctga
gtttt
acag
gttta
acag
gaag
gcat
aaca
ttgta
agca
aaga
aaga
aaag
atat
tcc
agac
aaag
cata
tgtg
g
267
2642810
DNAJC
YE
Sct
ggat
ggag
taag
agcc
tctg
gtaa
taga
gatg
tttgt
gtaa
aaat
gaca
ccaa
ccca
taaa
ggtc
agcg
atgg
gggt
tact
cagc
atgc
ctgt
tgat
gag
13
gaag
taga
gagc
cttc
acct
cagg
ttctta
gcta
cgcc
tcca
a
268
2811129
—N
Oga
tgag
atat
taac
tgct
gcac
tgg
269
3070222
AASS
YE
Stg
agaa
aatg
gtgg
atca
taga
ggag
tacg
ggta
gtgg
cattt
ggac
a270
3907114
TOMM
NO
cctg
cctg
acct
tacc
caga
gaag
ccat
gggc
cacc
tgct
ctgt
gccc
gctc
ctga
aacc
ca34
271
2790942
—N
Oat
gttc
ccat
aagc
aaca
gcct
gtaa
attg
tcac
atgt
aagt
catc
taat
aagt
ttaga
acaa
tgaa
caac
tcat
tttct
gaac
tggg
acaa
acaa
ggaa
gcta
aat
tca
272
3043075
—N
Oaa
tcag
tgttg
cctg
gtgg
atag
caaa
tcaa
gtttc
aagg
ttaca
ga273
2354095
WDR3
YE
Scc
atgg
tgaa
atgg
tggg
acct
tgat
actc
agca
ctgc
tttaa
aaca
atgg
ttggc
cacc
ggac
tg274
2713294
—N
Otg
gattg
cgta
aagg
tggg
gacc
tcca
gatc
ttcca
cggg
aaag
gtac
gtttt
tcttg
ttgat
attg
tgaa
gtgt
gtat
gggg
ttgta
ctttg
gcac
cttg
tagt
g00
agtc
agttt
cccc
ccat
tggt
gatc
attg
ccaa
atca
275
3220
577
KIA
A03
YE
Sag
atgt
tgca
tcaa
aggg
ccttg
ggttg
gttta
tgaa
ctag
gcaa
tgaa
caag
atca
acag
gaat
tggt
ttcta
cact
tgtg
gaaa
cact
tatg
actg
gcaa
aa6
8
276
3531
076
SCFD
lY
ES
tgtta
ctcg
tattt
tgga
caat
ctta
tgga
gatg
277
2505
303
RA
B6C
NO
gcag
gaag
aaaa
cttc
gagt
taca
ggtc
agga
aaag
c27
826
4404
0PC
CB
YE
Sga
gatc
atgc
ccaa
ttatg
ccaa
gaac
atca
ttgttg
gtttt
gcaa
gaat
gaat
ggga
ggac
tgttg
gaat
tgttg
gcaa
ccaa
ccta
aggt
ggcc
tcag
279
2938
530
—N
Oga
gaat
gccg
gtgg
ggcc
ttcag
cagc
agca
tcct
aaca
gtgt
tgca
280
3625
346
CC
PGl
YE
Sga
ggcc
ctac
tatg
caaa
atga
tgga
agga
aaga
aaag
ccag
ttcac
tttaa
agaa
ttcag
aaaa
aata
caaa
ttcaa
agaa
atgc
agtc
ctgg
gcat
gatt
gtag
agaa
aattc
tcat
tcttt
caga
aagg
cttg
ttctg
gtgt
atttg
attg
tgct
caac
aaga
gtcc
atga
gcct
tttta
acac
agtg
gtga
atc
281
2971
361
—N
Oct
gcct
agta
ttgtg
ctac
agta
ccag
acaa
aatc
acttt
aggg
ttttg
ggga
ttttg
tttttt
taaa
caaa
ctgg
tatg
taat
ataa
actta
ggaa
ccac
cacc
acca
tctc
gagt
atca
atgt
ttcag
aatg
gagg
cata
gttc
agtg
aaaa
aaat
gccg
gcat
tggt
tttag
ttttg
attc
ttgttt
cacc
agaa
agca
tggt
ggttg
gggg
agct
gtca
aagt
actc
cgga
taga
ctca
gcgc
catta
aata
agct
cagg
cata
tacg
gcca
ggag
caa
282
3026
661
TR
IM24
YE
Saa
gtgt
gagc
gcct
acttt
tattt
ct28
331
0890
6-.
-N
Oct
gcat
cgaa
ccag
cgac
ctgg
gtgc
ctcc
ccac
ccaa
gacg
gacc
tcca
catc
ggga
a28
436
4913
4M
KL
2Y
ES
ccag
cagc
ccttt
atca
ataa
ggcc
tcca
acag
tgttc
ttcaa
tcca
gaaa
tgct
ccgc
ttcca
tccc
tgca
285
3729183
CLTC
YES
ttattg
aagt
tggc
acac
cacc
taca
ggga
acca
gccc
t286
3900156
C20orf7
YES
tccc
aaca
aaat
cgtg
gccc
aggt
agct
tgcg
atgt
ccttc
agttg
ctgg
tttcc
tact
ggga
gaag
cttc
agat
gtttg
aaac
ctct
ctgc
ctcg
gaaa
atgg
4c
287
3907653
NCOA5
NO
tttcc
tcttt
ggaa
ctct
tgtg
ttgtt
288
3102604
—NO
ttggt
cttc
acgc
actg
ggtg
tgtc
tgta
aata
cagc
gctg
tcca
cggt
gtct
gtcc
tgct
tcca
tccc
aaat
tcag
gcttt
agct
taaa
acat
cttta
gttta
atttt
ttaaa
aaat
tcttt
tcttt
cccc
caga
taaa
atct
taat
ctttt
gcac
taga
ctgt
tagc
cagg
gaaa
289
3863282
NO
gaag
tgttt
ccag
acat
ccca
attc
290
2920506
-__
NO
gcttc
ttcgt
ggtc
ttttc
catta
t291
2319069
___
NO
ctga
agca
gggt
gccc
gtttc
taca
ccaa
agag
cggc
aaaa
tgaa
gaa
292
2573693
CLASP
YES
gcttc
tggg
cctg
caga
actta
ctga
aI
293
3022667
SNDl
YES
gtgc
ccga
gtag
agaa
agtc
gagt
ctcc
tgcc
aaaa
taca
tgtc
ttcta
cattg
acta
cggc
aac
294
3570810
—NO
atgt
gggt
ggag
atga
tcttt
gcttt
gaga
acag
gattc
atta
tgtg
ggct
ggtc
agga
gtgg
gaac
aaga
gcag
gaag
gcca
cagg
ggat
tgag
agta
cttc
tgtc
ccca
acac
actc
atca
agga
cgtg
caat
cggg
ggcc
aagg
ctca
ttatg
gtga
ttttc
a
295
3847162
—NO
tggc
cagc
tccc
tagt
ttgtg
ctta
ctat
acct
ggcc
acgc
ctcc
ctac
ctaa
ggcc
gctg
gctta
accc
tagg
ggca
ggca
gtgt
taga
tcag
accc
agac
cttc
tcat
ccca
ccct
catc
acat
cggg
gaga
gggg
actc
cagg
ggcg
ggaa
ggca
ggcg
tccc
tcca
tttgg
ccag
ggtg
ggcg
gcga
ggag
gggg
tcac
tctg
cagg
aaca
ctga
gctc
tgaa
cacc
tct
296
2535304
NO
tcaa
caac
atga
atgg
cggc
aaca
ctag
gaga
catc
c297
2697615
-__
NO
tgca
gtgg
tact
atttt
agct
caat
gcag
cctta
gattc
ttgga
ct298
2931795
-__
NO
agat
ggca
tgct
gcac
aagg
gattc
atgg
ttaca
gcgg
gcttg
tggg
actg
gggc
tctc
caat
acgt
ggttg
ggttt
gtaa
agaa
atca
ga299
3011882
STEAP2
YES
cagg
ttattg
aact
tgcc
cgcc
agttg
aattt
cattc
ccat
tgac
ttggg
atcc
ttatc
atca
gcca
gaga
gattg
aaaa
tttac
ccct
acga
ctct
ttact
ctct
gga
ga300
3464921
—NO
cctg
ataa
taag
agga
acga
ggac
agct
caga
aagt
acaa
gtca
agaa
gccc
tgag
aggg
gagt
gaat
tcag
gaaa
ccgg
acat
ttcag
gtttg
cctc
agc
attg
catg
agttt
gagg
aaca
agaa
aaga
tgag
gatg
gattg
tgaa
tgat
ctcg
aatg
tgttt
tgaa
ggttt
ttgga
ggta
tttgg
attc
accc
taag
a
301
3541610
—NO
aggt
cgat
gggc
tcag
aattc
actg
acag
gttc
agat
ccag
ccga
gaag
tgga
gcat
gtat
ggta
ttagt
ggca
tttga
aggt
catta
gtgg
cata
acta
ttgt
agtg
taaa
ttcaa
gtttt
attg
gtat
attg
tgta
tagc
cctg
caga
302
3872499
NO
ttagg
agcc
atca
acga
gttc
acgc
cgga
gaaa
gacc
tttca
agtg
tgga
gaat
gtgt
gaaa
tcttt
cagt
cata
agcg
cagc
cttg
ttcac
catc
agcg
agt
tcac
agtg
gaga
aaga
cctta
tcag
tgtg
gaga
atgt
ggga
aatc
tttca
gtca
aaag
ggca
acct
cgttc
taca
ccag
cgag
ttcac
actg
gagc
aaga
cct
tatg
agtg
tgga
gaat
gtgg
gaaa
tcat
ttagt
tcaa
aagg
acat
ctta
ggaa
ccat
cagc
aaat
tcac
actg
ggga
caga
cttta
tgag
tgtg
gaga
gtgt
ggga
aatc
tttta
gtca
taaa
ggca
ccct
cattc
taca
tcag
cgag
ttcac
ccta
gaga
aaga
tctta
tggg
tgtg
gaga
atgt
ggga
aatc
tttta
gttc
aatc
gggc
acct
tagg
agcc
atca
gcgc
gttc
ata
303
2470335
—NO
aagg
cgttc
gtta
tgtg
ggat
atgc
tgtta
ccac
tgaa
cttg
aagt
tatc
aaag
aagt
304
2887649
___
NO
ggta
agtg
gattg
acct
ctaa
actc
ccta
gaag
ggct
gtag
ctttg
aagg
tgga
cattt
attg
ggct
cgca
cgtg
acac
ttattg
gggc
taa
305
3522210
___
NO
gtga
gaac
ctac
atgt
tggg
tgcc
cacc
306
3674221
—NO
agtg
agcg
tttga
gtga
acca
gcca
cagt
ctct
acgt
gtca
tcca
agga
gccc
ggca
caga
cccc
gtgt
cacc
ccca
tgtc
accc
gcag
accc
cgcg
tca
ccca
taga
tacg
caca
307
2560167
NO
ggga
caca
gagc
agttg
agga
tatc
tcttt
tcttt
cact
ctgt
tctg
aaat
ctgc
ctct
tccc
aaag
gtac
tcca
tgac
ccct
aaga
tact
cact
ctg
308
3041159
___
NO
ttaat
tcag
caca
actc
acca
ttccc
actg
gaaa
cata
ctgg
caat
tcac
atct
tcca
aaca
agg
309
3183335
FKTN
YES
gagg
cact
cagg
ccaa
aaca
ggaa
aaa
310
3204049
___
NO
gggc
gcca
tcgg
atgg
ggca
aaga
gatc
cctg
aaaa
cgta
ttgcc
gggg
gacg
ggga
cttg
cagg
acaa
gaaa
accg
tggg
agaa
t311
3393003
KIAA09
NO
gttg
attg
taaa
ccca
cagt
atct
agca
gcgt
tgtg
ccaa
attg
ccct
tgtg
tttct
ctcc
accc
aaaa
tatc
acag
ctgc
tttcc
tcac
atttg
gttc
atcc
gtgt
g99
ctgt
tcttt
t312
3598130
—NO
gcca
agta
caaa
gtgc
tgtc
ccat
atat
tacc
tttat
catc
atga
caag
cact
tgag
atgg
taat
taac
aact
ccac
tttaa
gtct
gaga
ataa
tgaa
gcaa
ctg
accc
aagg
ctcc
aaag
acag
tgaa
tggc
agag
atga
gattc
aaac
ccag
gtct
ccct
gact
cgaa
agcc
cacg
ctttc
cccc
aaca
ccat
gttc
ttttc
cttc
cctg
tcag
aaag
ggaa
atac
caaa
aaca
aatg
aggg
tatg
tatg
aggg
tttgg
ctgg
agcc
ccag
tcag
ccaa
agtc
tgat
tcca
agtg
gact
gcag
ggt
ccgt
gtta
tctc
ccttg
ggca
caat
ctca
aaag
ccca
gaat
tcag
ctgc
cttg
ctaa
ccttc
catc
ccag
tctta
gtga
gaag
ctaa
acta
acttc
tggg
cctc
aO
atta
cagc
agat
ta
313
3681604
LOC728
NO
tgcc
atag
accc
tcac
gtgt
ttgttg
catta
attg
gata
attc
aaaa
tata
aagt
atta
tcca
gtttc
agga
taat
atta
agat
ctaa
acca
atga
atgc
tcta
tgaa
138
aagt
tgta
gatta
actg
atcc
ctta
aaac
taaa
aaca
gcgt
cttc
ataa
gcttc
tact
gggg
gcag
ctac
cact
ttcttc
atca
cattt
ctta
tcct
caat
agct
gtg
ggca
atgc
caaa
atag
c
314
2410882
ATPAF
NO
gttg
gccc
ttcac
ctag
ttgac
tcag
ccct
cgat
agtc
taga
gccc
accc
cctc
ctca
ggaa
ctca
agag
ctca
gcat
ttata
atga
gcag
ttggt
aatg
agtt
1gc
ccta
tgtg
cttg
tcgc
aagc
agtc
acag
agat
gagc
ccta
ttact
tgat
attc
agga
acaa
aggt
acct
gaac
attc
tgat
aatta
tctc
agca
tact
tgag
gttt
ccttt
tttaa
gtgt
tcga
ggtta
taac
aaga
gaca
gcca
agga
ccta
caag
acag
ttgac
ttgat
tttgc
acag
tgta
acag
cgca
gttg
cattc
tggc
cact
ttgac
ctta
tagc
tccc
aaat
gatg
agttt
gtca
tcttt
atga
actc
atga
cagg
ataa
taag
cttg
aaga
cctg
ctgt
agtta
gata
tggg
cttta
atcc
ttccc
aggc
acca
gtca
gctg
aaca
aaag
cata
agcc
aaac
atcc
tgttt
aaac
tgta
gaat
aacc
agat
attc
ccat
cagg
ttaaa
gact
tcat
ctag
atga
tgcc
ccc
caga
gatg
ccttt
agtg
taag
tagc
tggc
ttggg
gtat
cagc
aaat
ttcag
gtat
agtta
gata
aaca
ggta
cagg
gcct
gcat
acta
ttaaa
ccat
agttt
gtg
gcac
ccgc
ttttc
ta
315
2821
250
CA
STY
ES
aggg
ccag
atga
tgct
atag
acgc
cttg
tcat
ctga
cttc
acct
gtgg
gtcg
ccta
cagc
tgct
ggaa
ag31
631
3407
0PR
KD
CY
ES
gattg
gagc
aatg
atgt
aaga
gctg
aact
agca
aaaa
cccc
tgta
317
3888
286
NO
agct
ccttg
ttaca
tggg
cctg
ttaac
atgg
cagc
tttgt
ctat
aagc
aaac
ccag
gaga
gaaa
gaca
tagc
agag
atgg
atgt
ttgaa
gtct
atac
cttc
cac
cccc
tttaa
agag
aaag
taac
acca
ctcc
ttttc
tgtg
tccc
ttggg
gaca
ctac
ctcc
atgt
ctgg
tcac
atgg
ctgg
acttt
acag
caga
taag
cata
ctgt
ggc
ctga
gacc
atga
ttgta
tgct
ttcct
tctg
ctga
ccttt
acaa
tccc
tcaa
taaa
ttgag
ctaa
caca
ggga
agct
ttttta
ccaa
ataa
ctgt
gttg
catc
atcc
tcca
gtttg
cctg
gtgt
cctta
atca
atgg
aagg
ggaa
taag
caaa
ctga
gtttt
ctta
cacc
ttttg
agta
tagt
gtttt
tgcc
atca
taga
tgtg
gctc
ctca
taa
ttctc
caac
tttta
tatta
aaaa
acca
aaac
ctca
aaaa
ttgta
gttc
atgt
cagt
cagt
gatg
actc
atct
taga
agta
ttttg
tttttg
gatg
tgtg
aatg
tgca
tag
ttctta
aagt
ccaa
cattc
atgt
aata
agac
atct
tgca
tata
acaa
tgac
cctta
cgtc
taag
atgt
taaa
taga
tcct
aagc
ctgg
tata
acttt
attc
aagt
atcc
ttattt
gccc
ctaa
aatg
tcttt
aata
caca
ttact
tggg
ttattt
cttg
aatg
aaca
taca
ggta
tccc
aattt
ctgt
tttta
agag
aatg
gggt
cttg
ctct
gtca
ccc
aggc
tgga
gtgc
agtg
gtgc
agtc
atgg
ctta
ctgc
atcc
atga
tcct
cctg
cctc
agcc
tccc
aagt
agat
ggga
ctga
aagc
acac
actg
ccat
ccct
ggct
aatg
ttttc
atat
tttgt
agag
ttgca
gcct
tgct
acgt
gacc
cagg
ctgg
agtg
tagt
agct
attc
acag
gcat
gattg
cttg
aaac
tcat
ggct
tcaa
ggga
aact
ccca
318
3919790
—NO
gcag
ggag
tgca
gaaa
ccttg
gaga
aagt
gaga
gtttt
ccag
ccaa
aggg
aact
ttgtg
tttcc
ctgg
ctgg
gact
cttg
ggga
tcttt
ttcag
gtttt
ctgc
agt
ttttc
tgaa
ttgag
cttta
aggc
aact
ggat
ggat
tttga
act
319
3948965
—NO
ctta
gctg
ttcca
gcgg
ccca
tgttt
aaaa
gaat
gtgc
ttcttt
ttcca
agta
tttct
gccg
cttg
catg
cact
gagc
ttcttt
ggaa
agga
gcac
catg
cagg
cata
ttttc
caga
cagg
accg
gattt
gctc
gtta
ctca
ga
320
2738676
—NO
ctga
gacc
tgaa
gcac
gaga
gcca
agtg
tgta
agca
agtc
gatg
ggat
tggg
aaga
agttt
aggc
tgct
aaaa
aattt
ttaca
taaa
gcttg
ataa
gagg
gga
tattg
catg
tgaa
ataa
acag
aaat
tcag
taaa
tatg
gtgg
aagt
gcag
gattt
aggg
gatg
gaaa
aaaa
tact
agttt
caaa
aattg
aggc
taga
agtg
aag
gtag
gggc
caga
ttatg
aaaa
gcct
tgta
agca
atgc
taag
aaac
aata
gatg
tcag
aagg
acat
gcca
ccat
ctaa
aaag
cagt
cttg
agaa
caag
taat
tgga
taaa
ggtg
acct
gact
aggt
atttg
tttttt
taag
taaa
ttgtg
ttaat
tttgt
tagg
tgta
aaaa
tggc
atta
gggt
catg
ttggg
gaaa
aaag
aatc
tctt
tctta
caga
tgaa
atta
catg
gtct
ctga
gact
gcttc
caaa
aa
321
2929182
—NO
tggg
tgta
aaca
ttgat
gatc
cttc
taat
actc
tgtc
ctct
taat
ttctc
taag
ttctc
tttct
agtg
atct
tgtc
tccc
agtg
tggt
gggt
atat
tctg
gcta
gacc
aggt
cattg
tttat
ggtg
gcag
cccc
tctg
tcat
ctga
attta
agcg
tccc
atttt
ttcgg
tact
agca
tggc
tcac
tgc
322
3004581
—NO
gcaa
atgt
tttaa
tggg
ccct
ccta
cctta
cctg
acat
aaga
taat
tcat
acta
gaaa
gaat
acct
acaa
atgt
gaag
aatg
tggc
aaaa
gcttt
agtg
tattc
tcaa
ccttt
acta
atca
taaa
gcaa
ttaat
actg
gaga
gaaa
tctta
caaa
tatg
ttgaa
tgtg
acaa
tgttt
ttaac
tagg
ctgc
aact
cttg
ctg
323
3300811
C10orf4
YES
tttgg
attta
ggtg
gcga
gtag
aaaa
a324
3426258
—NO
atgg
agat
taag
cagg
aact
tcaa
gaaa
gcca
agct
cttc
ttggt
agca
cacc
agag
tctg
aagg
325
3645807
—NO
agtg
gaat
ctct
gaaa
ctca
ggtg
tggc
atca
acaa
ag326
3976960
—NO
caaa
tccc
gggt
ggca
gcca
taat
cccc
aaag
atga
cccc
cacc
ccaa
gattc
caaa
gaag
ctag
tggt
ggtg
gaag
caaa
agga
atgc
agca
aggt
cag
ggttt
cattg
tcca
agcc
ggcc
tgac
acct
gccg
ccct
gccc
ttgcc
cagt
gcac
accc
taga
ccct
gggc
cggc
ctcc
atgc
agct
ggag
gcca
gaa
gaca
gcaa
ccca
tatc
tttgc
accc
tcct
ccat
gccc
catg
gcct
gcct
gccc
agaa
agat
gcca
ccttc
acag
agcc
agtg
ctgt
cgtc
tata
tcat
tt
327
3981
121
OG
TN
Occ
gtcg
ccgc
cattt
caag
accg
tact
ag
328
2391
399
CPS
F3L
NO
gtat
gcac
cggc
ttatg
aacc
cacc
aggc
tctg
tccg
agcg
gcat
tgaa
atgg
ccag
agtg
ggac
aggc
tggt
gccc
tcag
agcg
cttg
gagc
ggcc
agag
actc
gctg
gagc
tccc
gggg
tgttg
ccgg
gcag
ggag
gcag
gggc
tgga
gggg
agat
gggc
cagc
tctg
gagg
gtgc
ctgg
ttctg
ggtg
tggg
tca
ggga
agaa
gtga
aggt
taaa
tcag
cgag
tgac
aggt
gaaa
ttcca
tttct
gaca
aaga
ttatg
ggttt
gtct
gtgt
ctca
ctcg
aatc
tctg
gg
329
2748
879
GU
CY
lY
EJ
tgcg
aattg
gact
gcac
tctg
gatc
agttt
ttgct
ggcg
tcgt
tgga
gtta
aaat
gccc
cgtta
ctgt
ctttt
tgga
aaca
atgt
cact
ctgg
ctaa
caaa
tttga
gA
3tc
ctgc
agtg
tacc
acga
aaaa
tcaa
tg
330
3063
716
___
NO
gtga
gtgg
ttaga
gaac
aatg
gcaa
acaa
aggg
cgag
gcgt
ttaag
cttc
tgta
gaga
gg33
133
6187
4T
RIM
66N
Otg
ggct
tccg
tgat
cccg
ttttc
tagt
ttggg
gtaa
ctga
gtct
tgaa
tgct
ttact
agtc
cggc
aatc
tttgg
actta
ggct
tctg
ccct
ttgag
actc
acat
gact
ttctg
gtttg
gggt
ctgg
tcat
ttccc
tttca
atttt
tgaa
tctc
cttc
tctg
ttcag
ttggc
ttggc
aaag
tacc
ctct
gttc
tcat
g
332
2812
780
NO
tctg
gaat
gaaa
cagc
gtaa
tgag
atga
gccc
tga
333
3065
419
—N
Oct
acaa
ggat
actc
aggt
caga
aagg
ccca
ggag
tacc
tacc
tcat
tcag
caac
agtc
cagt
caaa
atct
tcct
cccc
gaca
atgg
ccta
tgac
cagc
ctgt
tcta
caca
ttcct
ctct
gg
334
3077
669
NO
aaga
gccc
ttctc
aatg
gctta
accc
atag
agcc
c33
536
5296
1__
_N
Oaa
gcgg
aggt
tgga
ggaa
gccg
tgat
tgta
cc33
628
9567
8SI
RT
5N
Oaa
agca
gccg
tgga
gaca
acca
tctt
O33
729
7805
7N
Oct
gtgt
tgtc
tcag
tcaa
gcct
agaa
ctga
agct
acct
gaca
tcaa
gcca
attg
tcca
ggac
cagc
atttc
cagc
acttc
cagt
tgtc
cata
tctg
cact
tcca
gga
gtaa
acag
gca
338
3595
946
RN
FlIl
YE
Sca
aggc
cact
tcat
catc
aagc
ttctg
cctg
cccg
cattc
tcat
ggaa
accc
ccct
cctc
agac
tcag
cctc
cgcc
tcaa
gtgg
atta
tgtta
ttcct
catc
ctgt
acat
gcttt
ccat
tctc
aaat
atct
tctc
atgc
aaca
tctc
atcc
tgtg
gcac
cccc
acca
ccaa
ctca
ctta
gcca
gtac
agct
gcac
caat
ccct
cagc
atct
tcc
tcct
acac
acca
gcca
atttc
gcac
cata
ttcca
gc
339
3628
693
HE
RC
lY
ES
tata
ggaa
atca
tcag
gagc
caag
tgtg
tttat
cagc
tgcg
ggga
caca
tcac
tcct
gttc
ggac
tgttg
ccttt
agttc
tgat
gggt
tggc
cctg
340
3734
777
MR
PS7
NO
gtga
gcta
ctgc
cacg
ctga
aaac
tacc
tgtg
ggtta
agga
tgta
gttc
ctttg
taag
ggtg
ggca
ggcc
tcgt
aaga
aaga
tgta
gcag
cata
ttcac
tatc
cgtta
atcc
ttcttt
ctttg
aggc
tgga
acttg
ctct
ctct
gccc
ctat
ttcct
tgta
aaga
ggga
gcac
attg
acttg
ggaa
tttcc
tcca
ggaa
actc
aggg
ctg
tttt
341
2806
269
DN
AJC
YE
Sac
agag
ctgg
atga
ctat
ggcc
aattt
ggag
aaag
ag2
134
228
4912
1D
NA
H5
YE
Sat
gaat
ccca
aagc
gatta
ctgc
ccca
caga
tgttt
ggtc
ggct
ggac
gttg
ccac
aaat
gact
ggac
tgat
ggga
tattt
tcta
cgct
ttgga
gga
343
3012
989
___
NO
tctta
ttcttg
gcca
cggg
acca
accc
tcac
aggg
gcac
tcta
tgca
gaag
cttc
cttc
ctaa
attg
gaac
tccc
acttt
gacc
actc
tcct
tt34
431
8465
5N
Ottg
caag
gccc
acac
cacg
tggc
tgag
aagt
caac
tact
acaa
gttta
tcac
ctgc
agcg
tcca
aggc
ttcct
gaaa
agca
gtct
aaca
gcaa
aaga
aaag
ctta
tatta
aata
aata
gtta
aata
ttaaa
ccttc
aaaa
gaaa
gact
cgac
acaa
tgac
acaa
agca
cattc
tgga
ggtc
ctc
345
2428
776
RSB
Nl
NO
gctg
ccca
agat
gttg
gaac
acca
tgtg
caat
aaag
ggct
tcgt
cttg
gattt
tggc
ctta
ttacc
acaa
ccag
346
2502
833
DB
IY
ES
acgg
cccg
ggat
gttg
gact
tcac
gggc
aagg
ccaa
gt34
726
0858
9_-
_Y
ES
ttcca
ttgat
gaat
tgga
taat
gctg
ag34
827
8161
0_-
_N
Oct
taca
ggag
aaat
cgaa
acac
agct
gcta
c34
928
1098
9_-
_N
Otta
gagt
ttatg
gacc
ttgct
cttg
gtta
350
2489
148
—N
Occ
tctg
ccttc
gctc
gcgc
cgct
ctcc
ttcag
tctc
ccca
gggc
ctgt
ttcat
ggat
aggg
tccc
cttg
cccg
cgtg
aaag
tctc
attc
ctc
351
3239
162
—N
Occ
actg
gcta
ataa
gcag
gtct
ttattg
agtg
ccca
acac
tgag
tttag
tcac
aggt
gctta
ttctg
gata
atta
atct
cagt
agat
taac
cttc
ttcag
aatg
acc
atgt
atat
caga
cttc
tctta
tccc
caat
tctg
ggtc
ctta
ttaac
ctct
tcca
gggc
taag
tattt
accc
ttcca
taaa
tcgg
atgt
tttct
taag
cattt
tttct
cctg
tca
tcac
tgaa
actc
cagc
ctag
ttcttc
acttg
tatg
caat
tttct
tgta
accc
tgtc
tctc
gtttt
ggttt
attg
tcca
tatg
ccta
ttgag
tgct
tttcc
agttg
352
3573
019
POM
T2
YE
Sct
gata
ggtc
ttgct
ggct
acct
gagt
ggat
atga
tggt
acct
ttttg
ttcca
gaag
cctg
ggga
taaa
tatg
agca
tcac
agct
acat
ggga
atg
353
3937
196
DG
CR
8Y
ES
ttgtg
cccc
caaa
aaga
ggcg
aaca
gagg
aaaa
atat
ggcg
gaga
cagc
gacc
atcc
gtcc
gatg
gaga
gaca
agtg
tgca
gccg
atga
tgac
caag
a35
431
5433
1N
DR
Gl
NO
tgct
gggc
gtca
agct
tgca
taat
gccg
tact
ttgag
ggcg
gttg
tcac
aagt
atca
caaa
atgg
gtgg
ctta
aagc
aaca
gata
tttat
tccc
tttta
gttc
tgg
aagc
taaa
agtc
cgaa
atca
aggt
gtgg
gcag
ggcc
gtgc
tccc
tctg
aagg
tgct
aggg
gaga
atgc
tgcc
tgcc
tcttc
cagc
ttctg
gtgg
ctgc
tgg
ccac
acttg
gttc
acct
tagc
tagt
agac
acat
cact
ccag
tctc
tgcc
tcca
tcct
cact
cagc
actc
tccc
gattg
tatc
tctg
cgtc
tatg
tcca
aattt
ccttc
VOttc
tttta
ataa
catc
agcc
gtta
gcttt
aggg
cccg
ctct
aatc
caat
atga
cctc
atct
taac
ttgat
tttac
ctgc
aaag
accc
tattt
ccaa
gtat
agcc
acag
tgat
aggt
actg
gggt
tagg
acttc
agta
tgtc
tattt
tggg
gaca
caat
tcaa
ccac
aacc
taca
gcaa
aggg
tagg
tggc
ctga
ctga
ggtg
gggt
ggcc
gattt
gatg
gtag
atcc
agaa
acag
aact
cact
tagg
cctg
attc
tgga
gact
tcca
cagc
gtca
gtct
aatg
tgct
gagt
gccc
cgcc
tgga
ggaa
aaat
tcgg
gcat
c
355
3697
772
PHL
PPY
ES
tccc
agcg
aatc
agta
ccgt
ggat
ctct
cgtg
ttaca
gcct
cgag
gagg
ttcct
gagc
atct
cttc
tata
gtca
agat
atta
ccta
cctc
aact
tgcg
acac
aac
Lttc
atgc
agtta
gaaa
gacc
cgga
ggcc
tcga
taca
ctct
acaa
356
2611
934
SLC
6A6
YE
Sgc
tatg
acct
cgct
gggg
agct
acaa
caag
t35
731
2780
7N
Oca
agga
agcg
agtc
tccg
tgac
acgt
ggta
caat
atca
tgca
gtac
acac
ggaa
ccag
aacc
aaga
agaa
atgg
ctgg
aaaa
gatg
aaag
aaaa
aaaa
agttg
atat
agtc
tcaa
atgt
agtg
aaaa
aaca
tcaa
ctta
taga
ttcaa
aaag
ttcat
aaac
ccct
cata
aaaa
taaa
tgta
aaga
aaac
ccta
cctg
gaca
catc
atag
tcat
cata
caga
gcaa
atttt
gtgc
attc
acag
gtaa
caaa
gata
tgaa
cgac
agct
gact
caaa
acaa
tgga
ggct
cagg
atgt
gtga
aatg
gtga
aaga
aagc
tgttg
actc
acta
ttcac
actc
attg
agaa
tatc
cttta
aaaa
tagt
aagt
gaag
acgt
tgtg
gcag
gttg
aact
ttcc
358
3909
039
SPA
TA
NO
aggg
ctac
atca
accg
acct
ttcgg
gtttc
acgg
tgaa
gaag
tact
aacg
cattg
atct
caga
ggcg
gagg
cctg
cact
tgac
catg
tagg
tggc
agag
at2
cgtg
ggct
ggct
gtgt
ccac
gtgg
gagt
tcac
ttagc
gact
caga
tatc
tcaa
ccaa
tggc
tgct
ttgttg
tctg
acaa
ggga
gaag
gtgg
cact
tccg
ttca
gattc
atttt
gcta
atct
ctcc
actc
cctg
ttcag
ttggt
tcttt
ttttg
gggt
tttgt
tttgt
tttgt
tttgt
tgttt
tgttt
tttcc
tttaa
agag
gattt
ctgt
ctct
ggga
ccct
cgct
gcac
ctac
ccct
ccca
ttgca
aagc
caac
ttgga
ctgg
gaag
ggcc
cttc
aggt
cacc
aaac
atcc
acct
ggag
cggc
gagc
taga
ggcc
tgtt
ggct
tgtg
aaat
gagc
cctc
ctgc
caca
cggg
gcct
ctcc
caaa
ggct
catc
ccttg
gccg
cccc
cttc
ctaa
acac
aagg
atcc
ccag
ctgg
actc
ccca
cccc
ctgg
cttc
ccca
cctc
tcca
ggtg
tcaa
aggt
gaac
cgag
tcca
gtat
tagc
tgaa
tgtc
atttc
gtac
acca
cagc
tcag
tcag
ccca
tggc
cttc
gtga
acttt
gctc
c
359
2412620
NRDl
NO
gtttc
tttga
ggaa
cgca
gcct
gaa
360
3220615
KIAA03
YES
gacc
ctaa
acta
ctgt
caat
ggca
tattc
ag68
361
3521230
—NO
tgct
gctg
acat
cacg
ggcc
ctgg
gtt
362
2327492
SNHG3
NO
gaca
atct
ttggc
agac
ttgga
gcaa
aaga
ttgag
gtgc
atttc
atgc
ctcc
tt363
3557645
AP1G2
YES
tgtc
aaca
ttcttg
gtcg
cttc
ctac
tcaa
cagt
gaca
gga
364
3709227
—NO
tgcc
gtcc
tcag
tgaa
tcac
agca
gcttc
aaag
gact
caac
tggc
aaga
tttcc
caca
g
VO365
4021348
ZDHHC
NO
gaga
gcaa
gtaa
gctg
tccc
tttta
actg
tttttc
tttgg
tcttt
agtc
accc
agttg
caca
ctgg
cattt
tcttg
ctgc
aagc
tttttt
aaat
ttctg
aact
caag
gca
gtgg
caga
agat
gtca
gtca
cctc
tgat
aact
gg
366
3030
989
ZN
F862
NO
aggg
cttg
accc
tgcc
taga
tcct
cctta
ccaa
gaaa
gggc
tgaa
aaga
gagg
gcat
ctgg
catg
ctttt
tgcc
acttc
tgttt
ctag
gtta
tgta
cttc
atgg
aat
atct
gggc
attg
ctta
gaaa
cctg
tgtc
ataa
attg
gcttt
gggc
aaga
ggct
ttgag
ttgga
agaa
aagg
tgga
aaga
aagt
gcttg
gggg
cgga
gttc
aggg
atttg
acgg
gattg
ccag
cgct
tcca
ggcc
gtgt
catc
caag
acca
gtgt
ggttg
tggg
caca
ttagc
ttgga
gaca
gtgg
cagc
ctct
tggg
acta
acat
cttt
367
3035657
—NO
agtc
cggt
ggct
taga
tgac
aaat
ggag
gtag
agga
cgca
ttctc
gcct
gtta
tttcg
atttt
tttct
ctcc
tcac
cctg
accg
tcac
gttc
tcac
ctttt
ccac
ctc
aatc
ccca
gtca
cact
agct
tccc
agga
ttaaa
atgg
tctc
t
368
3626930
—NO
tgct
ttcac
ttaag
gcag
ggct
gttg
agtc
ggtg
aggg
gatg
aggg
catc
tcaa
aaga
aggc
tggc
tttca
ggat
tgca
caga
gatg
ttcct
gctc
ccag
caga
ctgg
ggta
aaat
ttcct
ggga
atga
gggc
atttc
tga
369
3091396
—NO
caag
tgag
agtg
atttt
gtta
catg
ggat
acat
tgca
tagt
agta
aagt
ctgg
gcttg
tagt
gcaa
ccat
cacc
cgaa
ta370
3323928
—NO
cagt
gcat
cccg
gaaa
aaag
agag
caaa
371
3326853
TRIM44
YES
tggc
caag
agga
agtg
tccg
gacc
atgg
gcttg
atttg
agta
ccta
ttgcc
ag372
3402333
—NO
caaa
gtcc
tttgc
aagt
ctca
ccct
taag
cgcg
tggg
gact
gtgt
373
3796317
—NO
tcac
gttg
ctgg
ctac
cctc
atat
gtgt
gatg
actg
ttagt
tgta
caat
taca
tttaa
gaaa
gagg
atta
cattg
ctag
ctgg
aattt
cttc
tact
atta
taaa
aata
gaa
ctgt
tttac
caac
aagt
ctct
tctc
cgag
tggg
acttg
atgg
taac
tata
actg
ggat
aggg
caaa
tcaa
cgtc
atgg
gtcc
acag
gtttt
gcttt
aa
374
4020467
—NO
tccc
aaca
agttg
aacc
ttgcg
ttacg
ataa
aagc
agat
gtgg
catc
caat
ccta
ccac
ccag
caaa
gaga
aacc
ccag
gcat
tcaa
ctgc
agc
375
3180404
—NO
catc
gctg
tgtg
catc
taac
tcttg
attc
acgg
tctg
accg
acgc
cttg
gtttc
tgga
gaac
tgcc
ctcg
tttat
ggaa
gagt
cact
gtca
tcag
gagc
catc
ccag
ggag
aggc
tcag
tggg
aagt
gtaa
aggt
ggag
caag
taaa
ccgg
ctcc
agaa
gaga
aggc
gaag
aagg
acta
tttct
gttta
gattt
gctta
accg
cattt
tcac
tttct
tggg
aacc
cctg
atgg
agttc
aagg
tcat
agaa
gaga
agaa
atca
gaca
ctgc
taca
ggac
tcttg
ctct
tggt
t
376
3781848
NO
gagg
aagt
gtct
ggtc
cggt
ccct
agag
catg
gcag
gtat
gcac
tcaa
tgct
377
3868844
KLKIl
YES
tcaa
ggca
ttatc
tcct
gggg
ccag
gatc
cgtg
tgcg
atca
cccg
aaag
cctg
gtgt
ctac
acga
378
2396267
___
NO
tggg
gacg
gcaa
cacc
gcgc
tgca
agtc
gtta
tcat
tccg
379
2862752
GFM2
YES
tggt
ggat
gtag
taat
gaaa
gaaa
a380
3869672
___
NO
ggag
ttgca
tcct
tgct
tgac
taaa
atgg
aagc
cgta
ttgat
tttga
aatg
aaaa
gcttc
atga
tgta
gcat
ctgg
acttt
cact
gtcc
381
3876038
PLCB4
NO
taag
gattt
caac
acac
gtaa
ggcc
atgt
ttctg
atgt
tata
g382
3879228
—NO
gggg
aggg
acat
gact
tcct
gtta
cggg
atga
ggaa
gtag
aggt
tcat
ggag
ga383
2399914
___
NO
ttaga
cttg
aaag
aacc
tcgt
tcac
ctgc
caca
tgct
atcc
tagg
gaat
taat
agct
tccc
atgt
ggac
actc
actc
tatg
t384
3142073
—NO
gggg
aacg
aagc
ctga
caac
tcct
aagt
ttctg
gcttc
taca
acca
tttag
tgaa
ggttt
catta
atgg
cagg
cagg
agta
cagt
tttag
aagg
tctg
gact
ggag
agac
atttg
gggg
aatc
atcc
gtga
gtat
cgag
agtta
aaga
catg
ccag
gaat
agtg
tata
caat
taaa
atag
acag
gggc
ttggg
aaat
gcca
gttc
gO
caga
tata
tattt
gtca
taag
tgaa
ctaa
agaa
aagg
tata
tggt
tgga
tgag
ctta
aacc
cagc
atg
Ul
385
3777286
ARHGA
NO
cgca
gttc
atga
cttg
tacc
agtg
aagg
tagc
aagg
tctg
tgtg
ttggt
gatg
aattt
gcag
ggga
tcag
gctg
aaga
aaca
agac
ttggg
agag
agga
tgt
P28
gaga
ccttt
ttatg
acat
gcat
agag
ttggc
acct
cagc
ctttg
catg
gtag
ggag
ctag
ctga
aggc
tacc
tcca
tgac
agat
gtac
acttt
gaaa
agac
aggc
cggg
tgcg
gtgg
ctca
cgtc
tgta
atcc
cagc
atttt
ggga
ggct
gagg
tggg
caaa
tcac
ttgaa
gtca
ggag
ttaga
gatc
agcc
tggc
caac
aca
gtga
aacc
tggt
ctct
acta
aaaa
aaat
aaaa
taat
tagc
cagg
catg
gtgg
cgca
cacc
tgta
atcc
caac
tattc
ggga
ggct
gagg
caga
agaa
ttgct
tgaa
cacg
ggag
gcgg
aagt
tgca
gtga
gctg
agat
ggca
ccac
cgca
ctcc
agcc
tggg
cgac
agat
caag
actc
catc
tcaa
aata
aagg
aaga
taca
gcta
atct
atat
ttcttt
cacc
agag
atca
ccac
cact
aaca
attc
atac
tggt
cgttt
ttccc
tgaa
tgttt
tacc
tatg
tagg
aaaa
tttta
catc
ccaa
ttttta
attt
aaca
ctat
gaaa
agca
tttct
tttta
taat
aaag
acac
tcta
ttagt
atac
ttttg
atgg
cttc
aaaa
ctac
ttttg
tata
gatg
caaa
atta
aaat
atca
aattg
tttct
tttgt
ttgttt
gtttt
tttgt
tttttt
tttga
gaca
gagt
ctca
ctct
gtcg
ccca
ggct
aaag
caca
gtgg
catg
atct
cggc
tcac
tgca
agct
tttcc
tccc
aggt
taaa
gtga
ttctc
atgc
cgca
gcct
cccg
agaa
gctg
ggac
taca
ggtg
tgca
ccac
cacc
cccg
gcta
atttt
ttgta
ttttta
caaa
acac
aaag
agag
acgg
ggttt
cacc
atgt
tggg
cagg
ctgg
tctc
aaac
tcct
gacc
tcag
gtga
tccg
cctg
cctc
agcc
tccc
aaag
tgct
ggg
386
3851492
NO
tttttt
acgg
attc
agtg
atct
ggac
gggc
acat
acta
ccat
387
2712928
NO
gtga
gtaa
acca
tggc
gtgt
ttgttg
gcca
aaag
ccgc
aagt
ctttg
aaaa
ggtg
aatc
atca
ctttt
ccat
tta388
3060349
NO
ctct
ggag
gcttc
ttgtc
agta
ctag
aaac
taat
acttg
gtga
atat
ccttt
tctg
cttc
catta
ccttg
tatg
aaac
tggt
tcca
tttat
acac
tgag
atta
gaag
cattc
ataa
gtga
tgcg
gcac
caca
aaca
caga
agat
caaa
ctac
tcac
agtg
tgga
gaac
catg
tcttt
tata
agtg
tgat
gtgc
ctaa
aacc
atca
tc
389
3504
762
ZD
HH
CY
ES
aaac
ccca
aaac
tgtc
cagg
catg
g20
390
3543
824
—N
Oag
tcttt
tata
agtg
ttggc
tatc
atttt
ttctc
atct
atttc
cttg
cttc
tggt
tttgc
acac
tttac
catc
caca
ctgc
catc
agag
tgat
caca
cctc
tcttc
agta
gctc
ccat
catc
tact
gcat
aaag
ctga
gact
gtta
gcta
gact
gtta
ggca
ccttt
ccca
cacg
atct
ggct
ctgc
ccca
cttc
ccag
cctc
agcc
caca
tggc
taa
acaa
391
3781
951
LA
MA
3Y
ES
gattt
gggc
ttacc
acct
actg
acca
cctc
cagg
cctc
atttg
gattt
caga
ccttt
caac
ccag
tggc
atat
tatta
gatc
atca
gac
392
3216
926
—N
Oaa
cact
gatc
tcca
cgag
caca
atca
tcca
aacg
gaga
tggg
aacc
cctg
ttcta
actc
tctc
agag
gcct
gaa
393
3448
246
—N
Oct
ctttg
gtat
atta
ttgca
gatg
ttgtta
tttca
cagt
tagg
taaa
ttaga
cata
aaaa
ggtc
aaac
gact
tgaa
cgtg
ctac
cata
aaag
gtca
gtga
taaa
gcca
gaag
tcat
ttctc
agtg
tcca
tcat
aagc
cacc
394
2693
224
OSB
PLY
ES
ggag
atgg
ctgg
gttta
tcat
aaac
cact
ttgga
aaat
aattc
caac
aaca
caac
cagc
agag
tga
11
395
2672926
SMARC
YE
Sta
tgttc
atgc
ggat
gctc
ctac
caat
aaaa
cact
ggct
gggc
tggt
ggtg
cagc
ttcttc
agttc
cagg
aaga
tgcc
tttgg
gaag
catg
tcac
caac
ccgg
Cl
cctt
396
2914136
—N
Occ
atcc
gttc
tttta
gctg
acttg
aatg
actc
taat
gcag
tggg
ttgca
ggttt
ggct
actta
ccta
tgta
ctct
gtaa
acat
ccag
aggc
ccag
ggat
attc
tga
gcca
ctgt
gtgt
gaat
ggag
tttgg
ccat
ctgt
agat
ttccg
cagt
ctct
ccag
gtga
tcgc
actg
tgca
gcct
cattg
ctc
397
3210858
—N
Ottt
gctc
tgaa
aggc
agttt
ctgt
atag
gttc
ctcc
gagt
gaag
atca
gaat
gtta
tcag
acac
tggc
tgta
gctc
agat
gaca
398
3853401
AKAP8
YE
Sac
ttgga
actc
tggg
acaa
atag
agL
399
3605225
—N
Otg
gacc
catc
tcag
cagt
gctc
aaac
caat
gccc
aaac
tctc
ctgc
ttagg
acac
ttctc
cact
ggcc
accc
tgag
gcca
ccac
acct
cact
ggcc
actc
ctcc
tcta
tcttc
tctg
ctgg
ctct
tcct
cctc
tatg
gcca
ccta
aagg
atgg
gtgt
taca
cact
caat
cct
400
2377461
—N
Otg
gagt
tggc
cttg
ttaac
tatta
aaca
caaa
agca
tgtg
atag
ttacc
tgag
agat
ccat
gcag
ggga
gagt
ccag
ggga
aggt
gact
gatc
actg
cttc
cca
gggg
gagg
caga
aaat
gcttc
ctgg
agga
gatg
atgg
gtgg
atta
agga
agat
gagt
tgag
cagg
agttg
tagc
tacc
agcc
aggg
agga
aggg
tattc
agag
caga
ggaa
gcaa
ggtg
agca
aagg
cacc
aaga
tgtg
acac
tact
ggac
tgcc
aagc
ta
401
2934549
—N
Ogt
gtgc
tact
ggct
tatg
gctg
atac
gtaa
gaga
atct
catc
tatc
tcta
aatg
ttctg
cctta
agcc
gcac
aaag
ga402
3185031
—N
Otg
gcac
atgt
ggat
tcta
tacc
ttaat
taaa
gcca
ctcc
ctta
cata
caca
cattt
aggt
atcg
ccag
actc
gcct
gatg
agga
ataa
ttattc
tttcc
tcca
tattt
ctta
actc
ctgt
ctttt
ccat
gaaa
cggc
ttggc
atta
aact
ccttc
ccag
a
403
3560964
—N
Ogt
aatg
tggc
ggtc
aaaa
ccgg
cgct
ggaa
ccac
ggga
catg
tt404
2732616
MRPLl
YE
Saa
cgtg
gagc
cattt
acca
gtgt
tctta
gtttg
ccat
accc
atttg
cttc
cgaa
atca
ataa
agttg
ctgt
attta
caga
g405
3150536
—N
Ogc
cact
cattc
atgg
ttgttc
tatg
ttcca
tgaa
ctct
aata
gccc
aact
tata
catg
gcac
tcca
aggg
gatg
cttc
agcc
agaa
agta
aagg
gctg
aaaa
agta
gaac
aata
caaa
agcc
ctcg
tgtg
gtgg
gaac
tgtg
gcct
cact
ctta
cttg
tcct
tcca
ttcaa
aaca
gtttg
gcac
ctttc
catg
acga
ggat
ctct
acag
gtag
gtta
406
3774
169
NO
atcg
ctgt
acca
ttctg
gggc
actg
gaga
tggc
agct
gaga
agca
gaga
agct
ggag
acct
gaag
cctc
cacc
cttg
gctg
tggc
tgcc
gaga
gagg
cccc
tgca
gagg
gagc
ccca
ggcc
acac
ccag
acac
caca
gcca
gctc
acag
tgcg
gcca
gagg
gcac
aggc
tgct
tggg
gggt
gagg
gtcc
ccat
tcct
gcca
cctc
tcac
agac
catg
accc
acag
ggct
gggg
gcttc
ccct
ggga
gaaa
gtcc
ttccc
tccc
catg
gccc
agcc
ctgg
ggaa
agaa
agca
gag
acta
actc
aggg
actg
cagt
aggc
tgag
gaca
aggg
aggt
gggc
ttggg
gtga
aggc
cccc
cctc
actc
gcag
agaa
ggga
cact
accg
gcag
taga
ggac
aggg
aggc
agga
cgac
ggca
gtta
gaga
cgag
gcgt
ttcag
gtga
actg
tatg
tagt
gtta
ctcc
ggac
gtga
gcat
ggcc
tggg
tcag
tcga
atga
aatg
agct
gggc
ctcg
ctgc
cctg
tgcc
ggcg
gccc
ctgt
gcct
gcgg
ttgct
gggc
cacg
gggg
gctg
ctgc
tggg
gggg
accg
ccag
aggg
tgcc
atct
gtgc
agga
ggaa
agca
ggca
tggc
cctg
gtcc
tccc
ttccc
cagc
ccag
ggag
gaag
gcttt
tgtc
ccag
caga
gcca
caga
ccca
caag
tgc
ttgta
atcc
atgt
ccac
ggac
tgca
gctc
atg
407
2987860
NO
cccg
tgttt
cgtg
taca
acat
atga
taca
tgtg
acag
ttgac
aaga
gtgc
ctga
gtct
caga
gcag
gaag
agtta
tcct
gcct
ccac
atgc
attg
aagg
gaat
atat
gtca
cacg
ttact
agcc
aggc
tcag
agga
gaac
acat
ctca
tttac
aagt
tcaa
atct
gaaa
ggga
gaaa
tgtg
ctag
tattt
agtg
gatc
ttgag
gaaa
ctag
ggat
gtgt
tacc
catc
tgtg
cagg
aaca
ttaga
tcta
gatg
gggc
tctc
ctag
tcaa
aacg
ctga
408
3829652
—NO
atgt
gact
ttaca
accc
tcgg
tgtc
ctcc
tggc
tgct
gggc
ccca
gcct
cctg
actg
ttcag
ctca
caca
cgtg
aacc
taac
acct
gcct
tggt
cccc
tgct
ccat
cttc
taga
caac
tcat
gctg
agga
ccag
ccca
cagg
agga
ggcc
tccg
agttc
tctc
ttgac
gtct
tcac
cttg
acct
tttttg
ccac
cattt
tagt
ctgt
cca
ggc
o409
3900228
C20orf7
YES
gacc
aaag
cctg
tgta
catta
ctac
cact
cgag
acaa
tgag
aaca
tttac
agta
caaa
gattc
cata
tatg
gcag
ctcg
tgttg
ttttta
ttaag
tgga
ttgta
ac4
cttc
tttttg
gaaa
aaaa
gtat
ctaa
ctgc
aaca
caaa
acac
taaa
aatg
gagt
tgat
gtat
tgcc
taaa
atc
410
2858280
—NO
ataa
aata
caaa
ctct
ccaa
caag
acct
tttgg
ccat
cagg
aacg
cagc
a411
3088555
ATP6V
YES
aact
gccc
ggttc
ttcaa
atct
gac
1B2
412
3850502
LOC
147
NO
gact
gtttg
ccaa
tggt
tgta
tttag
taag
atttg
taga
ctct
gtttt
tcttt
tgac
acag
ctgc
aagg
ccaa
cagc
tgtg
caaa
gcca
caag
ttatg
ctaa
gtca
g727
cagt
tatg
ctat
agat
taca
tgac
ctgt
gact
gtat
catta
actg
ctttt
gtttt
gcttc
tgta
agttt
gcct
ataa
aaac
caca
ctca
gtct
ttgttc
aatg
gtca
gctt
ttcag
atac
aaat
ccac
tgag
ccgg
tgta
catc
taaa
taaa
413
3946288
TNRC6
YES
tgtg
gaca
agcg
agcg
atga
atct
cggg
gattt
taat
gata
tcat
gagg
aagg
atcg
atct
gggt
tccg
tcca
ccta
attc
caaa
gaca
tggg
aacc
acag
aB
tagt
gggc
ctta
414
3430162
POLR3
YES
agtc
tggt
tgaa
tattt
agat
gtga
atga
agaa
aatg
attg
taac
attg
cact
gtac
gaac
acac
aatta
ata
B
415
3751732
_--
NO
gaat
atgc
atct
ttaca
agca
tcca
cag
416
2347259
_--
NO
tggc
tgcc
tgac
atta
ttggg
tgct
417
2421262
NO
ctcc
ctgc
ttggc
ttgat
gacc
ttt
418
3886689
PABPC
NO
gtat
ggtc
ttggg
tttct
gctg
agaa
atgg
cttg
tagg
tggg
gggt
taaa
gttc
ctca
cacc
tctc
ctga
acgg
gagg
aagc
tgct
gttc
acca
gtttc
cagt
gIL
ctca
atgc
gggc
aggt
gggg
catc
tgtg
cgttt
ggta
aata
catg
atgt
t
419
3951993
___
NO
ttgag
ccca
gtgg
cggg
actc
cacg
acgt
agac
ctct
ctcc
ttctg
cc420
3966982
NO
ttcct
gctg
acca
acct
aattc
tggt
ttcat
acag
ggca
gcca
ggtg
ccca
ccta
aaag
actg
cact
tctc
aact
tcca
gttg
tttct
atgg
ataa
tggt
tgga
agca
gaaa
ttggg
ttggg
gcttc
caga
taat
cccc
cctc
acag
ggac
tgct
tcaa
gaga
aagg
gatg
ccct
ttggt
tcttt
ctcc
tttcc
actg
cctg
gaat
gtga
tgt
ctga
cact
gtgg
cagc
cata
ctgg
cata
atga
ggcc
aact
ctct
gcca
tgct
aaga
gtgg
caga
gctg
ggaa
ttgaa
aaga
gcct
gcta
gcct
tggg
ttgc
ttccc
cttg
atcg
gctc
atta
cttg
gaa
421
2756
515
MFS
D7
NO
tgag
cgcc
ttgta
gtcc
aggt
tgcc
cgcc
acat
cgat
ggag
gcga
actg
gaac
atct
ggtc
cacc
tgcg
ggcg
gggg
cgaa
aggg
ctcc
ttgcg
ggct
ccg
ggag
cgaa
ttaca
agc
422
3261869
NO
tttttg
tattc
tagt
cgttc
agcc
ccct
cccc
tccc
atct
tcca
gcgc
ttctg
aaca
actc
ctac
tgttc
tcttc
c423
3391839
—NO
ccct
atag
attg
tttga
ggac
ccta
ttaaa
tattg
tata
gcct
ctag
gaat
aagc
ctttc
cattg
acttg
ctgt
gata
aaca
gatc
ttttg
tcag
cgac
tctta
aaat
aca
aaac
agcc
ttggg
cagt
agag
tacc
ac
424
2437406
ClorflO
NO
gcttc
cctg
acct
agac
agtc
ctga
ctga
tggt
ccaa
cctc
aatc
ccac
ttattt
ttggc
tagg
ccttc
ctgg
gagt
cata
aaag
agat
gaat
ccat
tcta
gagg
4tg
caca
gcct
gtct
cttc
cctc
acaa
atgt
cagt
cccc
aagt
cattc
tgat
ccac
cttc
ctaa
tattt
ttgcc
acct
ccaa
cttc
tttca
agat
gaaa
agga
aatg
tag
agaa
gcaa
ggtc
aggg
taga
cact
taat
ccca
ctga
ctgt
cttta
atcc
actc
ttctc
cctc
tcaa
cctg
gatg
atct
ccac
actc
ctat
ccat
actc
agat
aVO OO
cagg
atat
attg
ttccc
ctat
tatg
tgct
aagc
acttt
cata
tccc
ttgcc
ttgct
taat
cttta
cagt
cctg
tgaa
gtag
gaat
tttat
cccc
agct
gagg
aaag
agac
tgag
cgag
accg
acttg
ctca
aggt
caca
cagt
ttttc
acca
gggg
tagc
agtg
ttcac
g
425
2802671
NO
taga
gtta
gagt
cagg
gccc
tggt
ccct
ggtg
ctct
ttcct
ggtc
gcttc
tcat
agcc
agga
cgttg
ctta
catta
gttc
tcaa
cccc
actg
ggac
caat
cattt
tta
ttgcc
aatc
tattt
tggg
gctc
tgttc
tcgt
ggca
ataa
atca
ttcttt
caga
gctc
ccta
tttaa
ccttg
tgag
agcg
tgtg
tgtg
tggt
ggag
gtgc
ttaca
taaa
tgca
gtat
tttta
acct
gcct
ggca
cgct
ttagg
ttggc
gtta
426
3415
835
ZN
F740
NO
aagt
aggg
cacg
ctct
caga
gacc
agga
taga
gagt
cagg
gaac
ctgg
gaaa
gagt
agtg
ccag
ttccg
ggta
ggct
gctg
cacc
aggc
ccta
ccag
ctc
aggc
tata
aaca
gtct
gcttt
gccc
caag
tcttt
tccg
tcca
cccc
caac
agcc
ttgtc
atca
ctta
gcta
ctga
tcac
gccc
atgg
cttg
acat
tgga
gggt
taca
ttagt
ggag
tccg
ccac
agct
tcga
accc
tctc
ccca
gatg
ccct
ttgcc
tctg
ctgt
ggcc
ctgg
ggttt
tatc
aact
g
427
3764797
—NO
tggg
actg
tcaa
tcat
tggc
tctg
cctc
tctg
acca
caga
ctaa
atat
atga
ccag
tcta
acct
gtta
agca
taac
aata
tgga
cact
ggga
gaga
agcc
tctt
gaaa
tcct
ttgct
gaac
ggat
catg
taca
agct
gcca
agag
ttcat
ctc
428
2431813
—NO
tgat
gcac
agttt
agga
gcag
gaaa
gcca
tgtc
ttgga
tcca
gga
429
2468734
—NO
tgcc
caag
tcct
tgaa
tcct
acag
acct
tgct
aatg
gctc
cgttc
acga
cacc
caac
tggt
cctg
aact
ctta
ggat
taac
ccgt
ttgct
tatg
gctg
agga
a430
2512616
—NO
ccta
acta
ggag
agtg
ttttc
aagg
gaga
gattg
ctat
agca
tcag
cctc
ttggg
tttgg
agaa
tgag
ctta
acca
tgag
attta
ggttt
gagc
atgt
ttact
ata
agta
aaaa
gcat
tttcc
tgga
gtct
ggca
aata
ttgtg
gaag
atgt
ac
431
3242
348
—N
Oaa
gaga
atgg
gcaa
aggg
agcg
ggta
ggat
ggcg
gccg
cacc
acca
gcca
cgga
ctgg
gaga
ggcc
cttc
tgtg
tttcc
agat
gaga
agag
agct
caag
gaat
ccca
ggaa
taga
ctcc
atgg
aaaa
gaat
gaaa
aaaa
tcag
agaa
aagc
caaa
tatc
caaa
agag
432
3316
358
CD
151
YE
Stg
ggtg
agttc
aacg
agaa
gaag
acaa
catg
tggc
accg
tttgc
ctca
agta
cctg
ctgt
ttacc
taca
attg
ctgc
ttc43
334
7382
8T
A0K
3N
Otc
ccac
tagc
gatc
agct
gaca
ttcct
aact
gaag
gctg
caat
gtgt
tgct
tattc
atttt
gtac
cgtg
ggag
ctgc
gggg
acta
gcag
agag
ctaa
acta
434
2453
074
Clo
rfll
YE
Scc
cgga
agct
gcca
ccta
atat
tgttc
tgaa
gagc
agcc
gaag
cagt
ttcca
cD
435
3144
603
C8o
rf83
NO
cagg
aatg
caag
cact
gcta
ccta
acta
aaga
ccttg
gcttt
ggaa
acat
ctta
cccc
ttggc
agct
ctcc
agag
tttag
acttg
gcag
catc
aaag
ac43
630
3593
7G
NA
12N
Oag
cact
cctc
ccac
gggc
ctgg
gaag
gcaa
atgt
gtttt
ctga
actg
tgttg
agac
cgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
ttggg
gtgt
cttg
tgaa
gtcc
tgct
aacc
tcttg
tcag
caga
atat
tcga
gaaa
tgc
437
3854
106
C19
orf4
NO
cttc
aaag
tgga
gtag
tggc
acag
agac
cata
tggc
ctgc
agaa
tgga
aaat
gtca
cctg
tttgg
cctc
ttacc
agaa
aaat
ttgcc
aacc
gctc
aact
agg
2gt
gtga
tgga
gaat
ccca
aggt
gtcc
cttc
cctc
ccaa
agcc
actc
aggg
aatc
atct
gtac
c
438
2604
179
USP
40Y
ES
ttgca
atgg
ggag
tgac
gttc
aacc
tggg
acag
aaat
ggaa
atcg
tagt
agaa
gaaa
caat
atct
gt43
926
2569
6PD
E12
NO
tagt
ggga
ctttt
aagc
atct
ctga
aata
aaaa
acttc
ttttta
caga
caag
catta
tagt
ttgag
ttaca
gaca
acag
tgtg
tata
tatg
taat
atat
atat
agta
aaat
gaaa
tttaa
atat
gaag
ccaa
acttt
ttaaa
atta
gaaa
ctac
aaat
ggtta
tact
gatta
gtgt
ctag
ccta
gagt
ggta
acca
tgct
ttact
aattc
agtta
tgaa
atac
atta
tttat
aatg
catta
gctg
tatta
gctg
ttgct
tttttg
atgt
tcag
gata
acta
tgtta
tctc
atttc
tgca
tttaa
ttaat
agct
cgag
tatta
aaag
ccc
actc
ccttc
aaga
aaag
ctttg
atttt
cccc
agtc
atga
aagc
ccttg
tttca
aattc
tttaa
tctc
tgaa
ccta
gtat
cata
agaa
tttcc
tcttt
tgat
aaca
tctg
tact
ttcat
attc
tgct
cact
atca
aatg
tattg
ttaac
actta
gtaa
gtttg
aaaa
tgaa
gggg
tttta
tctg
cattt
gaca
ttgaa
ccttg
aagt
acttt
aagt
actc
caag
ggga
aaat
taaa
gtgg
aagt
ttcttc
ggat
cttg
ttta
440
2410
545
POM
GY
ES
catg
tgga
tgcg
gatg
cctg
aaca
acgc
cggg
gccg
agag
tgca
tcat
ccct
gacg
tttcc
cgat
ccta
ccac
tttgg
catc
gtcg
gcct
caac
atga
atg
NT
l
441
2545
098
HA
DH
NO
atgt
gtgt
gcag
aattc
tagg
cagc
acct
tagg
gagg
gact
ggga
tgag
agaa
agtg
ggac
ctgg
tggg
ctca
acca
caca
cacc
tgtc
tgtg
caga
tgc
Attt
gccc
aggc
ttctc
acca
cggt
gtac
cggg
atat
ta
442
2749
561
ET
FDH
YE
Sat
gctg
gtgc
cgct
agcc
aagc
tgtc
c44
328
4912
5D
NA
H5
YE
Sttc
ttcgg
acct
tggg
agca
gcaa
aaag
agcc
aatc
caat
ggat
acgg
agtc
cacg
attg
tcat
gcgt
gtac
tacg
ggac
atga
atct
ttcta
444
3022
356
—N
Oga
acac
ctgc
ccaa
tgtc
agct
atac
ttcag
ctct
cattt
gcac
tattc
aaac
ttata
aagt
tgaa
gagc
aacc
tacg
cctg
ttg44
524
7402
6__
_N
Ogt
ggta
gcag
ttttc
caga
tgtg
gatg
cttc
atga
tctg
gtg
446
3160
004
___
NO
tgct
ttgttg
aggg
tgta
aggg
gaa
447
3795
785
NO
aatg
ggaa
tgaa
atac
atgc
ctttt
tgttt
ttagg
ggtg
ctca
gtta
taaa
tatg
aaag
gctg
agaa
aatg
ctaa
ttgta
cagt
aaga
tcaa
ctgg
acttg
tttta
ataa
gaca
ataa
tttga
aatta
gtaa
acta
tttta
taca
aata
gtgc
taaa
acttt
ggaa
ggtc
ctca
acct
cctg
g
448
2990
169
NO
cttg
ggta
tcac
acaa
gcat
tact
gaca
ttgag
ttcat
ggaa
aata
caga
aaat
atat
acaa
aatta
atgc
caca
gaac
atga
caaa
tggc
tgtg
aggg
gag
gact
tttga
tcaa
ttgct
tgat
ctta
aact
tact
ttaga
agag
gtgg
ccag
tgcc
tgcc
aattg
gctg
ccat
gtgg
atac
ttctta
tgct
tgat
ctta
agtg
ctttt
gtgt
gtga
cagc
tcta
caca
attc
ccta
gtta
agag
ttact
tctc
acca
acca
aagg
449
3858
632
NO
gctg
actg
caga
ttcaa
ccttc
tgag
ctca
cgaa
450
2481
311
KL
RA
QY
ES
gctg
agttg
cagg
ggaa
gtac
caga
agct
g1 1
451
3726
479
RSA
Dl
YE
Stg
gccc
tggg
gcta
cgca
ccga
tgtg
ggga
tcac
tcac
cag
452
3847
164
NO
ccat
gcga
agta
ccaa
gccc
tagt
tgttc
tcca
tcca
ggag
gccg
tgtg
gctc
atcc
ctta
ccca
caga
ccct
cact
cacc
acct
tcgt
ccct
acca
tcca
cagc
accc
gctg
gccc
tgaa
gtga
ttcct
tggt
cctg
gacc
tggg
gtttt
gcat
tttgt
ctct
gagg
agga
ggtc
gcat
ggag
gggg
aggg
gatg
ggga
tgtg
ggtg
tatg
gacg
ggtg
gtgg
gtag
gggc
acgt
ctga
tcca
ctgt
cacg
gcac
ctct
ggtg
agttg
taat
gcgt
tcgt
gact
ccaa
ctga
ctgg
tcttc
acct
aac
ctgc
ttattt
ctct
gcca
acac
ctgg
gatg
gggg
tttgt
aaat
ccag
cagg
gctc
a
453
2945
519
GPL
Dl
YE
Sct
tccc
atga
ccac
acac
agga
gtgg
ggaa
acct
tact
tgta
cttc
atcg
gtttt
cttc
caaa
tcag
cgac
ggga
gtaa
cattt
ttattc
ctat
cacc
aaac
cac
ctga
aaaa
tgga
ctcc
taaa
cttc
ctca
tatg
acat
caca
ttagg
ctaa
tcat
gtac
cctc
aata
tcat
cagg
gtaa
gtgt
attta
ttctg
ttatta
ttaca
tttgt
agt
gcaa
gagc
tgga
cact
ctag
aatc
ttcca
tgca
cact
acttt
ggga
gg
o45
429
6060
6—
NO
ctaa
ggct
ggat
ctcc
gtttc
tttcc
acct
gctc
caac
tgta
ccca
cagc
tgtg
tctg
actg
gctg
ccgt
catta
tgct
ccag
tgtg
taca
tctc
tcca
ctga
cact
ggtg
ctct
t
455
3275
073
—N
Oga
ggag
gttg
actc
agga
gaac
ttggg
456
3392
454
—N
Oga
tgct
gaat
tgaa
ctgg
gatc
ttcttg
tctg
tcaa
gccc
ttgaa
cagc
tttcc
cttta
agca
acaa
agat
gaca
agttt
ttgct
atttt
ggat
tagt
gaga
gtgt
acga
aatg
gtttc
ccttt
ccttc
cagg
caaa
aata
ccaa
caga
atct
ggct
tcgg
aatg
aaag
tgct
aattt
cattt
ggtg
acttt
caga
tttga
aata
cggg
cttg
aac
aggc
attg
tgca
ttagt
cagc
tgag
gcag
tgga
ggtta
ttttta
cgat
gcta
atta
ttattt
cctg
acag
tgat
ttact
ttatg
ttgtc
tact
gttta
cgcc
ccca
aac
tagc
cattg
tttta
ggaa
caga
attg
gttg
ggttg
tcat
gctc
ttttc
aaag
acaa
atct
tact
atttt
atgc
atag
aaat
acttt
cctg
taat
aaag
aaga
aagc
ttta
ttgaa
gaga
atct
tgca
tatg
aaat
gtaa
tctg
gata
aata
ttcaa
aact
aata
ggttt
atat
tttag
tcttg
atttt
taaa
gtat
atttt
tctg
acat
ccta
acca
ccttg
cttta
tgtg
catta
aagt
taaa
tctg
tttta
ctgg
gatg
cggc
tgaa
t
457
3428624
MYBPC
YE
Sca
tttga
gatg
caga
tcat
caag
gcca
aaga
taac
tttgc
agga
aatta
caga
tgcg
aggt
cacc
tata
agga
taag
tttga
cagc
tg1
458
3869530
—N
Occ
atta
cagc
agca
ggat
ccag
tgac
ccgg
gatg
ctca
catc
tctc
cctg
acgt
gggc
ggag
tagc
ccct
tcct
ccca
aggt
cact
gtcc
tgtc
caac
ccc
gtgc
tccc
ctag
cccg
ttggg
aggt
ggac
agtg
agac
atct
tccc
a
459
3180781
—N
Ota
ttggg
agtc
ctgt
caga
agct
gcct
gcta
cagg
gtct
ttagc
tttcc
ttaga
ttttc
aaag
aggt
ttgtg
atcc
aaga
agaa
taga
gtttt
gaat
ctag
atttt
atac
taaa
cgat
cact
ttaga
ttttc
agga
gtgt
gtgc
ctag
tttgt
gatta
agtta
460
3319155
PPFIBP
YE
Scc
acag
tgct
gcta
gtaa
tgaa
acct
acca
ggaa
cgct
tggc
acgt
ctag
aagg
ggat
aagg
agtc
cctc
atat
tgca
2
461
3428
631
MY
BPC
YE
Sct
tgat
cctg
cata
tcag
gttg
acaa
agga
ggca
gagt
gagg
tttgt
tgtg
gagc
tggc
agat
ccaa
agttg
gagg
tgaa
atgg
tata
aaaa
tggt
caag
aa1
attc
gacc
cagt
acca
a
462
3435
265
LR
RC
4Y
ES
agcc
gaaa
gccg
tgat
tccg
atct
acga
aggc
gatta
ccac
cctg
agcc
cctg
accg
taga
ggtg
caga
tcca
gctg
aacc
agtg
ccgc
tcgg
cgga
gg3
aggc
tctg
cgca
tgttc
gccg
t
463
3697
431
___
NO
ggaa
agcg
gggc
agac
aaag
tatc
caaa
agt
464
3928
125
NO
gggc
actc
accc
aagt
aatg
tcca
aaca
atat
ttttc
catg
tccc
ccct
cccc
caga
acac
acac
aggc
aaag
gcag
agat
gttta
ctgt
gagg
tcac
atgg
catg
aagc
aaat
gcag
cctta
tctg
tttat
tatta
caac
agcc
agtta
gcat
tggc
aaat
gtgc
tttgg
ttaca
aaca
tggt
ttctg
ctaa
taat
ttcat
gaag
aag
ctttt
agga
tatg
tgac
cattt
attc
ttagt
ataa
tcaa
tcat
tgaa
ggat
aaaa
atca
gagt
ataa
aagt
ttata
ctct
ggca
aata
taaa
taca
tgca
caga
agttt
ctga
cagt
ttaaa
tttga
gcat
atttg
aaat
atta
aagg
catg
aaat
ttaaa
aata
caca
tacg
tgaa
acta
tggt
gaac
atgt
tatg
ggttt
gtcc
catg
gttta
actc
tgct
cagc
ccac
gattc
ttagc
tggg
tcag
tcaa
tagt
ctat
tttta
tcat
tttct
accc
tgttg
taca
gcat
gatc
atat
c
465
2769
166
—N
Ogt
gggg
tcag
agaa
gata
ttagg
aaa
466
3545
551
C14
orfl
NO
tggc
gcat
atca
gcag
acgg
cggt
cact
actg
tggc
tcat
ttaaa
tctg
tgttc
agct
ccgg
gcct
ctttt
ttcag
gaat
ccttg
tttcg
ggac
accc
agtg
gac
78
a46
738
9944
3—
NO
gact
tacc
tgga
gagc
gcga
gaac
tggt
gtag
ctta
cgtg
gagt
gaga
agag
atgg
ggaa
catg
tgaa
tgag
gtca
gagc
ggttg
cagg
aaac
ggac
tgct
gctg
tagg
ccat
tttaa
agac
ttttg
atttt
cctc
tgaa
ctgc
cgag
tattt
tcag
caga
ggag
taac
atga
tctc
actta
ttttta
acgg
gctta
ctct
ggct
gc
468
3182
036
NO
aaga
ttgag
gaga
agag
tagt
gaca
a46
930
6308
4__
_N
Oga
gctg
ggta
caga
gtgc
ttcat
gctg
agcc
tccg
ggca
tgct
tcgt
gtac
actc
cgca
cact
ctcc
acac
acac
acgt
taca
tggc
agca
gtcc
470
3387
617
MA
ML
NO
gaaa
gcct
gtgt
atcg
ccgt
gact
ccgg
gcgc
gagc
cagt
gtca
gcaa
agcg
gcta
acaa
caga
cgag
aaag
agaa
agga
aaat
acaa
gcta
ctttt
ttt2
ttcca
tcta
taaa
gcgg
agca
aata
cagg
agat
agaa
ccag
attg
ctta
ttgcg
agtc
caga
ccct
caga
tcca
ctgg
ccgg
ggat
ggaa
tgta
caaa
agt
ggac
agaa
aagt
ggct
ggac
atga
ctcg
gtgc
aattt
gctg
gaag
tttgt
aagt
ttgac
catc
gtttg
taaa
ttact
ctcg
gaag
agttt
gtct
ctct
tgat
actg
tat
taga
atag
agcc
gggg
gtga
ggaa
taga
aacg
taag
cggg
aaag
aaaa
aaat
gtgt
tgaa
ggat
ctct
ctca
gtgg
ctag
cga
471
3566
496
—N
Occ
taag
tcac
gcct
ctgg
aact
ttggc
aagt
tcta
taga
agag
tcag
cagc
aggt
gtgg
gtca
aggt
ccca
attc
acttc
ctgc
agct
gtac
catc
ccttc
ctc
gttc
tgca
aattt
tctc
ta
472
3088
581
NO
tgat
ctca
tgat
ctca
aaac
aaag
gg47
331
2074
0—
NO
gagc
ttggt
ttttg
gtca
gaag
ccaa
gaaa
aaag
atcg
gaga
agaa
aaga
agga
atga
cctc
aact
tgct
ctgc
acgg
agga
cggg
gacg
gctc
agag
ata
aata
gc
474
3399
570
NC
APD
YE
Jcc
tgga
cgcc
agtg
agtta
ctct
caga
cacg
tttga
ggtc
ctca
gctc
aaag
gaga
tcaa
gcttt
tg3
475
4010
188
SPIN
3N
Otg
tatc
aaag
taat
gtgt
ccac
cggt
gtga
gcac
cagc
aact
cact
tcttc
ctca
gaca
tctc
taaa
gctg
gaaa
gattt
tgag
ggac
aata
tggt
ttcct
tcac
ccca
cacc
aatg
ttatg
aatg
agac
cagc
agac
caac
agca
aaca
ccta
ga
476
2638
236
NO
tcag
ctct
gcag
ccct
acag
ttacc
agct
aacc
ttg
477
3683253
NO
tgaa
tgaa
tatg
gtcc
ctgg
ggct
aaaa
gtaa
gaaa
ggag
aaaa
aaat
aaag
caac
cccc
aaat
ttgat
gaga
accc
tata
gtcc
ctga
ctga
ttgttt
cctg
gcca
tcat
catc
atca
gcttc
aact
ttgtc
aaca
cacc
ctta
atat
gggt
gtac
cttg
tttta
cagt
ttgcc
gagc
acttt
caca
ccca
ttattg
gata
tgac
acac
aac
ccgg
tgag
agtg
ctct
c
478
2502720
MARC
NO
ttcac
ttctc
tgct
cccg
aggt
gtcc
tcgg
gctc
atat
gtgg
gaag
gcag
agga
tctc
tgag
gagt
tccc
tggg
gaca
actg
agca
gcct
ctgg
agag
ggg
Occ
atta
ataa
agct
caac
atca
479
3378495
___
NO
aaaa
ctgc
cctg
gagg
atcg
gcac
g480
2642712
___
NO
atca
ttttg
ctca
ggcc
acgg
ccac
t481
2798891
___
NO
cattt
ccca
gtgc
tgta
tact
ccta
tctc
ccag
tgct
gtttg
ctcc
cattt
ccca
482
3428610
MYBPC
YES
agag
aagg
aggc
cgga
acta
cacc
agca
aaag
483
3691119
NNOO
ggtc
tcgc
gcgc
cgtc
ttctc
gaga
g484
3821139
LOC
126
YYEESS
agac
cattg
tgat
gattc
ccga
ctcc
caga
agct
cctg
cgat
gtga
acttg
agtc
actc
aa075
485
2349300
—NO
gaga
attg
ggaa
tcaa
gaat
cagc
cctg
tttcc
atct
tagc
caca
ccaa
ctta
tatc
tttat
gattt
tcaa
agct
tttgc
catg
tgat
tctg
cccc
caca
aagg
cat
Ocg
gtat
ttcct
a
486
2638933
NO
tgcg
gatc
tcca
aagc
ctag
ggat
ttttc
cgta
aaag
agag
tggg
ccgt
tctg
gtta
ccct
tttat
taga
aggg
tattc
cacc
acag
agag
ccgg
aggt
tttcc
agat
gtgt
gtaa
gaga
gcag
gtgc
gcaa
ggca
agca
aatg
agcg
caaa
cagt
atta
tgga
aaac
atttg
agaa
gtta
gctc
catg
agga
ctgt
gggc
tcc
acaa
gagg
actc
gact
gggt
agcc
tggt
ctga
caca
ggta
cgtg
aaag
caga
gtat
tgct
tcaa
agct
ggaa
acct
tcca
tagg
agcc
tcac
actg
ttgga
agat
gtac
tgct
gctg
gtta
aggt
caac
ctgg
ggtg
caat
gctg
ctgt
cttc
atct
tcgg
tccc
gaag
taat
gctc
aata
agat
caaa
ggcc
ttttg
gtag
atct
cctg
487
2661085
—NO
ttcct
caca
aact
ggcc
ttcca
tttcc
agca
gagg
caaa
tcaa
attg
ccct
ttccg
ccct
catc
tcttt
g488
3037551
—YES
ggca
gtga
ggag
atcg
gtaa
cttg
ggag
t489
3215176
—NO
tggc
cgtc
tgta
caga
accc
acct
gcaa
ccac
caca
ccct
gagc
ccat
gtgg
ccgt
ctca
cctc
taag
tcac
ccaa
490
3410744
DNMlL
YES
gcat
taca
agga
gcca
gtca
aatta
ttgct
gaaa
tccg
ggag
actc
atct
ttggt
ga491
3439960
—NO
caga
gcgc
cagc
aagc
ggcc
acag
aa492
3461262
—NO
tcat
tttaa
ctct
gtgc
agag
tatc
aaat
ccat
ccat
t493
2316284
—NO
atgc
caac
gccg
acgt
ccct
gcgg
gg494
3882865
—NO
tcct
agca
gcac
acat
atgc
agct
ctt
495
3685651
—NO
aggg
gagc
ttcgt
gatc
ctgt
gcac
caag
tctc
catg
cccc
ttgttg
tacc
caga
gcac
catg
ctcc
ccgc
cagc
cccc
tgtc
cacc
cctg
ctta
gtta
taca
gcca
ttgtc
cgttt
tgtg
taga
acag
tggc
tttca
agct
tttgt
cacc
atga
tcca
tattt
taaa
ttgca
accc
tgttc
ccta
tgat
acct
496
2854
135
—N
Occ
gagg
actg
ttgaa
ttgtg
gaac
atct
ggtg
gaaa
atag
caag
aggt
gaca
gaaa
tgct
aatg
gggc
agtg
agat
gtag
acttc
aggc
agaa
ttgttg
tac
ctgt
cgag
ttgga
g
497
3314
277
—N
Ogt
ctaa
ggaa
aaca
acag
gctg
cgcg
aaat
cagc
cctc
aggg
taac
actg
aacc
tggc
gctc
ttccg
gcac
caca
cagg
aggc
gacg
cccg
ggaa
agaa
gact
gagc
ggct
cctg
tctg
aaga
cgtc
tcca
tcag
tcca
gcca
498
3724
236
NSF
YE
Stg
atgg
cgtg
gagc
agct
aaac
aaca
tcct
499
3746
906
NC
OR
lY
ES
caca
ggta
tgaa
acac
ctag
cgat
gcta
ttgag
gtga
taag
tcct
gcca
gctc
acct
gcgc
cacc
ccag
gaga
aact
gcag
acct
atca
gcca
gagg
ttgt
taag
gcaa
atca
agcg
gaaa
500
3909
730
AT
P9A
YE
Sga
ccttc
ttcag
attc
gatc
gtat
gtgt
acgc
agaa
gagc
caaa
tattg
acat
tcac
aact
tc50
134
2846
0U
TP2
0Y
ES
ttggt
tgta
cagt
tggc
acga
gatc
tgca
gatg
gattt
ctac
ccac
acttt
ccag
agttt
tttttg
acta
tcac
ctcg
atcc
tgga
gact
cagg
acac
agag
ttgtt
agaa
tggg
ctttc
acct
cgtta
tcat
atc
502
2751
975
GA
LN
TY
ES
ggga
ttattt
gcca
ttgaa
cgag
agttc
ttcttt
gaat
tggg
tctc
tatg
atcc
aggt
ctcc
agat
ttggg
gtg
750
338
9410
0C
20or
f9N
Otc
ccag
cacc
tgga
gcct
tgga
tcat
ttact
tcca
ggac
cgga
tctc
cattc
agac
cctg
atct
acag
tctc
cctg
ctcc
ctct
gccc
ttcct
ccct
ctttc
tttcc
6ct
ccct
ccct
ccct
ccct
tcttc
cccc
cttc
ccttc
cctc
ctcc
ttcct
tcct
cctc
tccc
tccc
tccc
tcct
ttcttt
cttc
ctgt
ggttt
tttcc
tctc
ttcttc
ccttc
tttct
ggttg
gtgc
tgct
gggc
cagg
tggg
aattt
ctga
ttaS
504
3908
236
—N
Oac
cttc
aggg
tgct
ttcac
aaat
gagc
tact
ggag
tattt
ttcgt
atct
ttggt
tgat
gaaa
aggc
tcat
ctcc
aaac
ttaat
ggttg
tacc
aagg
tgttt
tcat
tcct
tcaa
caag
tcac
tgag
tgct
ttcttg
gaac
acac
agcc
ttgca
ctgg
gggt
tatg
ggga
ggga
ggaa
gagg
ctgg
acag
ggtta
ctgc
atac
tgcc
ggaa
ggag
attc
tcat
ttggc
tgca
gagt
gtcg
caca
505
2407
923
—N
Oga
ggtc
tccc
gtca
ctgg
gcat
ccag
ctgg
gtgc
tacg
gccg
tggc
tcct
gcct
tctg
cagc
cctg
tcag
ggcc
caag
gcct
tcct
catc
agca
cttg
ctcc
agcc
tact
cact
ggag
ttgtg
ttctg
ggtg
ggtg
tgta
gaat
actc
gcac
tcct
acttg
gggc
atca
ggga
gctc
tcta
gcaa
tttc
506
2571
603
—N
Ottg
gaag
ctaa
cctg
ccac
attg
actta
actg
agtg
accc
cagt
tcca
aggc
aggg
cctc
taca
atttc
cagt
ttatc
tattg
tttcc
tgtg
taag
agca
gata
ctta
ctgt
aagt
ccgt
aagt
gcta
ctcg
tagg
tcaa
acaa
ccttg
atga
tatc
acac
ttcaa
ttgtc
ttata
aatc
ccat
ctga
acca
ctcc
tacc
ctat
ggta
tatg
agc
cctg
ggttt
gggt
gcta
atgc
gggg
atcc
acca
tctc
atct
cattg
ccgc
ctga
gaca
ccta
catg
gctt
507
3634
205
NO
agca
gatg
tcag
catc
cggg
tcac
ctag
508
3746
967
NC
OR
lY
ES
ccac
tgta
taac
cagc
catc
agat
acca
aggt
gtac
catg
agaa
catc
aaga
c50
936
7528
6W
DR
24N
Occ
gcct
ggtc
ttgtg
cccg
agac
gggc
ggag
gctg
gaac
ttgag
acct
caat
aaag
gaag
taga
g51
038
0965
1FE
CH
YE
Sag
ctgg
cacc
attc
atcg
ccaa
acgc
cgaa
cccc
caag
attc
aaga
gcag
tacc
gcag
gattg
gagg
cgga
tccc
ccat
caag
atat
gg51
127
9347
8A
AD
AT
YE
Sca
tattg
agca
gagg
acca
aaat
cgat
gatc
tcct
tggc
tggt
ggct
tacc
aaat
cca
512
3139
792
NO
tgtg
tcac
ttctg
cattg
ccca
gtat
tttga
gttg
gggg
aaaa
atga
actc
acct
ggcc
ccac
ctcc
tgttc
cagt
gtga
aata
ctga
cact
tgct
gcca
gccc
agc
ctcc
tcac
ataa
gcat
taga
aata
tcag
acag
tcca
ctag
gtta
gagc
acct
cattc
atga
ggttc
tgtta
tttat
tcgg
ctct
ccca
ctaa
aatg
cagg
gcat
aaat
ttatg
tgtc
tgct
ctaa
gact
gcag
ctgt
ctgc
cccc
cctc
cacc
ctcc
tcct
gccc
ttcct
gtgt
cagt
gtta
cattc
gctc
cccc
cact
tttgg
tctc
aggt
ctgt
ctcc
tttttg
ttttta
tggg
tgat
ttgca
gtga
ttcag
acag
ggca
tcct
tgca
tgttg
gcac
ttcct
ggag
gtta
ccag
gagg
cagc
catc
cttc
cttg
gctt
ggcc
aggc
atcc
ccga
ctcc
accc
cact
ttccc
cttg
gtgg
tgga
aagg
tggg
ttgga
ggtg
tgta
ctgg
ggca
cata
gccc
agga
atgt
tgct
ctga
gaa
gagg
acat
gtgc
agta
gaca
caaa
aata
taag
gaag
caat
tatg
ttcat
caag
ccga
aagt
tgca
gattt
tgag
gaaa
tgtg
tttaa
gata
ccac
taca
taat
aaa
aggg
aaaa
ccac
ttggc
attta
aata
actta
cttc
aaga
ggaa
atta
tgat
atttt
cagt
ctac
aaat
agag
tggt
tttta
agcc
aacc
ttgag
tagg
taca
agca
aatta
taaa
cagt
ctgc
tcaa
atga
tttta
ctat
actta
gact
agaa
aatg
gctc
ctct
tctc
tctc
cctc
tggt
ccct
atgt
ccca
gtgc
tgtc
cctc
atat
gtta
caa
513
3296
171
—N
Oac
atgg
ctct
ggtc
gaaa
tgtg
actg
aagc
tttgc
at51
434
7005
8PR
DM
4Y
ES
ctta
tttag
ctga
caga
ccac
ctcc
515
2830
812
NO
tggg
taac
agaa
cgca
acttt
gtct
ccaa
aaaa
taaa
taaa
taaa
ttgtta
caat
gaaa
gtgg
ctct
gtgt
tagc
cggg
agtg
g51
631
5150
4—
NO
ctta
ggga
atct
gtttc
ggtg
gctc
agag
aagg
atct
gttg
ccaa
tcag
ccga
tgcc
aggt
ctcc
ggtg
ct51
729
7882
2N
Ogg
cagg
tcca
ggaa
ctgg
gtct
ggag
ccttg
aaag
caga
ag51
834
3217
4—
NO
gtgc
gatg
gaaa
aggt
atcc
aaag
tgat
actc
ccttt
gact
ctga
gcat
cagt
tgcc
cgtg
tttca
gtca
cttc
aa51
939
2285
4PD
E9A
YE
Sga
ccaa
ctgc
ccct
gtaa
gtac
agttt
tttgg
ataa
ccac
aaga
agttg
actc
ctcg
acgc
gatg
ttccc
actta
ccc
520
3225
896
—N
Oct
ggat
ctgc
cacc
tcttc
tttcg
gctg
tgga
tggc
cctg
ggag
acac
gcct
ggcc
tcag
atgc
ccct
tgac
ccaa
ctgg
agga
aggg
tggg
gggc
ccag
caga
aaag
cctg
cagc
ctcc
ctgg
gcac
cgttt
caaa
gctg
atgc
tgct
ctcc
ctgc
tctg
ggtc
tcag
ctcc
tccc
ctct
tctg
cattg
ttgac
caac
agcc
tgg
gtct
gacg
cgga
agca
gaag
aattc
c
521
3981
141
OG
TY
ES
aatc
acgc
agtg
gtgc
acgg
caac
ctgg
cttg
tgta
tact
atga
gcaa
ggcc
tgat
agat
ctgg
caat
agac
acct
acag
gcgg
gcta
tcga
acta
ca52
230
4036
6T
WIS
Tl
NO
cctc
tgca
ttctg
atag
aagt
ctga
acag
ttgttt
gtgt
tttttt
tttttt
tttttt
tgac
gaag
aatg
ttttta
ttttta
tttttt
tcat
gcat
gcat
tctc
aaga
ggtc
gtgc
caa
tcag
ccac
tgaa
agga
aagg
catc
acta
tgga
ctttc
tcta
tttta
aaat
ggta
acaa
tcag
agga
acta
taag
aaca
ccttt
agaa
ataa
aaat
actg
gga
tcaa
actg
gcct
gcaa
aacc
atag
tcag
ttaat
tcttt
ttttc
atcc
ttcct
ctga
gggg
aaaa
acaa
aaaa
aaac
ttaaa
atac
aaaa
aaca
acat
tcta
tttat
ttat
tgag
gacc
catg
gtaa
aatg
caaa
taga
tccg
gtgt
ctaa
atgc
attc
ata
523
3847
114
PTPR
SY
ES
gagt
acca
gttc
tgtta
ccag
gcgg
cact
ggag
524
3903
914
—Y
ES
aattc
caga
gaag
ggttt
gagt
ttgga
gtttt
atct
gttg
taaa
atgg
ccct
gcttt
tag
525
2545
695
GT
F3C
2Y
ES
attc
atta
tattg
acgc
tggt
tacc
ttggt
ttcaa
ggcc
tact
tcac
tgct
cc52
624
6874
2—
NO
ggag
gttc
gttc
ataa
caca
ccct
agaa
acca
tgaa
aaac
atgg
agag
ccaa
gtgg
agtg
agtg
accc
gagt
ttaca
aagc
ctgc
aatg
atca
ttcttt
tatt
acct
gaca
ggat
ttagg
gtaa
ctag
aaaa
acaa
atac
aact
tatc
ttagg
caat
cact
tctc
tatg
atct
cagg
gagg
ctat
a
527
3332
623
TM
EM
lN
Oct
gtct
gggg
ttggc
tctc
ttaac
ccttt
ctct
gctc
ccag
cctg
cctc
acca
ggga
aggt
tgga
gggg
cctc
cctc
tggc
ttctg
catc
tgcg
ccag
caaa
c09
atca
ctgc
cgttg
gtct
ctca
tgac
ttaac
tggc
ttccc
tctg
ctgc
tgcc
ttggc
ttcct
ccta
atgc
tcgt
gctc
tcct
gtcc
ttctg
aagt
tgct
ccttg
gcca
aat
ctcc
agct
528
3457
669
—N
Ogg
cttta
tcgg
atag
ggac
agac
gagg
taag
gaag
gttg
tttaa
tgag
aggc
cttg
cttg
ccag
ccta
agct
tttaa
atttg
atgc
cagt
ggca
acaa
taaa
gtg
ta
529
3575
429
EM
L5
YE
Sgt
ggca
gaga
tggt
tgta
ttcgt
ctttg
ggat
tta53
036
7606
5...
NO
atgg
ggga
cgaa
tgcc
ggct
ggat
tcaa
g53
135
4135
3PL
EK
HN
Ogg
gccc
tgcc
caga
taaa
attg
ttggt
cttg
tctg
taat
tagt
attc
caaa
ggttc
aggt
ggtg
ttgca
tctc
cgta
tcaa
tgttc
agct
agta
ggta
aagg
gaca
Hl
tattt
ggag
ggaa
ctaa
ctct
ggcc
tagc
tact
atta
tgct
cttc
acttt
ttcttt
tttat
tctc
tcaa
gccc
cagc
atca
tgca
gtcc
tcag
ttggg
tgct
tacg
tgct
cttc
tcttc
cttg
cagc
agtc
ctca
aggc
aggg
caga
ttcct
cttta
aggt
gctc
gcat
gtta
cccc
tggg
ttctg
ctga
ccca
acat
tagc
taca
aagt
ggga
cag
tccc
acag
tgct
ccct
cttc
cttg
ttgtg
ggga
agac
cgca
aaaa
gacc
tcttt
tgct
ttagt
gact
ttgttc
ctac
tcaa
taca
ggaa
gtgt
tgac
cagc
aaa
aata
ataa
ttaca
aagg
ctaa
gttc
tgat
ttattg
ttcct
gact
cccg
tgtg
atta
actg
gaaa
ccct
g
532
2428
388
NO
tgga
gctc
tcca
aatta
agaa
tggg
tagt
gaca
aact
ttgaa
cagg
taac
aaac
aata
aattc
aaag
tata
tttag
gatta
aaca
gtca
atttc
taaa
tggg
agt
ctat
catg
tctc
ccct
ttgca
atac
tggc
ttccc
aaca
ggaa
ttcta
tact
tgtc
cagt
ccac
cata
atgg
gta
S53
328
8306
8N
Ottt
ctat
ctct
tcac
ctta
caca
cac
534
3686
201
GT
F3C
1Y
ES
aggg
ctcg
attc
cagc
ttcta
cgga
cacc
tcaa
gcgc
aact
ggat
ctgg
acca
gcta
catc
atca
acca
ggcc
aaaa
535
2933
137
—N
Oat
gagc
aagg
ctgg
tatc
tgga
gtttt
gata
aagt
536
3098
268
...N
Oag
cctc
agat
tcta
ccga
ggca
ggcg
cccc
aaat
accc
tccg
gacg
ctgg
cagc
agttg
cgtg
cttc
atct
gtcc
t53
738
0415
8—
NO
cagt
gtgc
cattg
taat
ttattg
gtgg
agtg
ctat
cattg
gtaa
caat
atca
tata
aatg
tattt
aacc
cctc
ctag
ctttt
gaag
taag
attc
atgt
taga
gcta
gaa
tttgt
ctgg
gtgc
538
3570
485
NO
tgga
gggg
cagg
atta
caca
ctga
atca
ttggc
a53
936
6605
8—
NO
ctct
tggc
ttcaa
gtga
ttgat
cctc
ccac
ctca
gcct
ccca
aaat
gctg
gaat
taca
gacg
tgag
ccac
tgtg
ccca
gcca
gctc
atct
tcga
ataa
taac
agttt
tact
tcttc
cttc
ttaat
tttta
tact
ttttc
cccc
ctttt
tcttg
cttc
attg
tattg
acta
ggat
ttctg
atac
attg
ttgat
taga
ggtg
gtga
tgga
ggac
atcc
ttgtc
tttttc
ccaa
actc
aggg
caaa
aaag
gattc
atag
ttgtg
ccat
taag
taat
ttgct
aggg
tttg
540
3692
317
NO
acaa
tgtg
gctg
tgttt
gatc
ccgc
gacg
tgtc
acttg
gaat
ctca
attta
ttgcc
aata
gacg
acgt
tgtc
aggc
acta
atttt
ctgg
tggc
gctc
catc
541
3749
798
MG
C33
NO
tcgg
gggt
caca
agcc
aaag
ccac
accg
ctag
aagg
gctc
agga
ccca
gccc
agct
cccg
tcct
cttg
gtgg
catc
aagc
ttgtg
tgat
atga
ggtc
ttat
894
gtc
542
3965
785
MA
PKl
NO
atgg
agct
gatc
cagt
aacc
tcgg
agac
ggga
ccct
gccc
agag
ctga
gttg
gggg
tgtg
gctc
tgcc
ctgg
aaag
gggg
tgac
ctct
tgcc
tcga
ggg
1gc
ccag
ggaa
gcct
gggt
gtca
agtg
cctg
cacc
aggg
gtgc
acaa
ta
543
4051
547
LO
ClO
ON
Otg
acct
tggc
ccag
aatc
cacc
aggg
cagg
gcct
gtga
acag
ggcc
aact
tggg
gccc
tcct
ccgc
cctc
tctg
gatg
cctc
ccct
tcag
cact
gggc
ct12
9722
gcca
ggca
gaca
gatg
gaca
cgtg
cttg
agga
aaca
cggg
tttta
tctg
ggcg
aggg
gttc
acgc
ctgt
gctc
cggg
gccc
ctgc
ccag
gttg
tgcc
gggt
tcag
agttg
actg
gctg
ccag
tgga
acag
agtg
tact
ttgct
aatg
aagc
ccttg
tcat
tgga
caga
tagg
ctcc
tcca
ggtc
ctcc
ctgc
cctg
taat
aaa
cgttg
ccac
cg
544
2849256
—NO
aaga
cagg
ctgg
tgct
ttcaa
aatc
agta
tttta
ttgtg
aggt
caag
caga
aatg
tact
agca
gcac
aacc
acag
atga
cact
tgttt
ggct
taaa
acaa
gaca
caca
545
3166772
—NO
tttct
tgtc
tcttg
agtg
gcgg
atca
tgcg
acac
ctcc
cgac
caca
gcat
ttggg
cgag
atat
agac
atgc
ctct
aaa
546
3886907
PIGT
YES
acaa
cgag
acat
taga
ggtg
cacc
cacc
cccg
acca
ctac
atat
cagg
acgt
catc
ctag
gcac
tcgg
aaga
ccta
tgcc
atct
atga
cttg
cttg
acac
c
g547
2474079
—NO
aggg
caca
gaga
gttc
aaca
aaca
ctga
aact
taag
tttgt
cctg
gaca
tcttc
agca
tccg
acttt
ctgg
tgga
aag
548
2661946
—NO
gcca
tcttc
ctat
cctg
gtcc
tgat
549
2963039
—NO
ttggt
ctct
atct
gggc
acag
tctc
tttgg
tact
gtgt
gtta
ggac
taga
tgttt
ttaac
atga
atga
aaac
gtgg
actc
aaaa
gaaa
cact
tttca
ggtg
tcta
taa
atct
tctta
ttttc
aagt
ttact
tagt
atttg
gtcc
tggg
gtgt
ctac
atgt
cact
cagt
tttga
ggaa
aaaa
tgag
atga
cctta
gaaa
taac
atag
aaat
taat
gtg
attg
atga
atat
cctta
gaaa
tttaa
tgag
attta
tact
tgga
atat
ggca
atta
caat
atttt
attg
aaaa
atga
atag
gtca
ccag
cctg
gcc
tgaa
tgca
tttag
acac
cgga
tcta
tttgc
atttt
acca
tggg
tcct
caat
aaat
aaat
agaa
tgttg
tttttt
gtat
tttaa
gtttt
tttttg
tttttc
ccct
caga
ggaa
gO
550
2991
676
TW
IST
lN
Oga
tgaa
aaaa
agaa
ttaac
tgac
tatg
gtttt
gcag
gcca
gtttg
atcc
cagt
atttt
tattt
ctaa
aggt
gttc
ttata
gttc
ctct
gattg
ttacc
atttt
aaaa
tag
agaa
agtc
cata
gtga
tgcc
tttcc
tttca
gtgg
ctga
ttggc
acga
cctc
ttgag
a
551
3535
849
NO
ctga
atgc
taca
acct
cgga
gaga
gaat
aggg
gagt
gttg
gtgt
ttgtg
gtga
atcg
gaaa
gcac
gtgt
ttaaa
gagg
acaa
tcac
atta
attc
agttc
cagc
tatg
agat
taca
ttgtc
aaat
catc
tgat
ttttc
aaga
gaaa
tcac
aaat
ttgaa
tttct
taaa
atga
gaaa
tcta
ctttt
taaa
taat
gacg
ggat
tttca
agta
tgtta
aaaa
tact
gtag
gcta
atac
cgaa
taag
ctaa
gcaa
aact
tttgt
ggat
caaa
ccga
aacc
atgg
gcta
ttttc
ttctc
ctcc
tcttc
ctcc
tcct
tctc
atag
gaa
taaa
acaa
ctaa
atat
aatc
ctgt
cttta
taat
ttcca
tata
ccaa
gcttt
aatta
tattt
aata
ccca
tcaa
agtta
gtta
aaat
gaac
tctg
agta
attta
tacc
ttat
ggaa
aaat
tacg
gtgt
tcta
agtc
tgac
caaa
tgtg
taga
aagt
acc
552
2488
753
CC
T7
YE
Sag
ccag
ctat
ggtg
cgga
tcaa
tgcg
ctga
cagc
agcc
tctg
aggc
tgcg
tgcc
tgat
cgtg
tctg
taga
tgaa
acca
tcaa
gaac
cccc
gctc
gact
gtg
gatg
ctcc
caca
g
553
2575232
SAP130
YES
gtga
tgtc
cagt
tcta
aagt
aacc
acag
tcct
gagg
ccga
cctc
acag
ctgc
caaa
tgct
gcta
ctgc
tcag
ccag
cagt
acag
caca
tcat
tcac
caac
554
3076375
BRAF
YES
cacg
ccaa
gtca
atca
tcca
caga
gac
555
3187656
CEPIlO
YES
aattt
gcag
caga
tatc
ccag
cagc
agaa
aggg
gaaa
taga
gtgg
caga
agca
gctc
cttg
agag
ggat
aaac
gaga
aata
gaac
gaat
gact
gctg
agtc
ccga
gcttt
acaa
tcgt
gtgt
tga
556
3286398
ZNF32
YES
actg
aagc
ccac
caca
aata
tgac
cact
ct557
3840338
ZNF808
NO
acag
atgt
cggc
cact
ttcgc
catc
ctgt
tttttg
t
558
4019
888
—N
Ogg
aaca
gagt
gtgc
caga
accc
tcag
ggtg
tttct
cagc
tcttg
tttac
gggc
cacc
ag55
938
3150
4—
NO
tggt
ttgcg
aata
tgct
caag
tctc
tgga
tgaa
ttgta
tgac
agtg
aagg
ggta
tgtg
atta
gact
gttg
tggc
tttga
ggttg
atgt
gtat
gtttt
aaag
gatta
tgat
tgag
ttgga
gatc
a
560
2540
638
—N
Oct
ccttc
tatc
ccaa
tcgg
acag
tgat
atga
ccct
gggg
atgt
tcca
tgaa
tcat
ttgttc
ctct
ttggc
ccca
tctg
attg
tgta
ctta
tctc
ctta
caca
gtcc
ttctg
acct
tgaa
atga
tcca
gaaa
agat
gtaa
aaga
gtaa
attc
tata
gtca
gttta
aaag
gcag
cagg
aaag
gattg
ggaa
gcat
tttgg
acaa
gggg
tcta
gga
gaca
ggcc
cagc
tctg
cact
tttcc
gtgg
gaca
aagg
ccttt
gcat
tctc
tcct
gta
561
3229
612
UB
AC
lY
ES
acga
ggat
gagc
gtgt
ggac
gagg
ctgc
cctg
cggc
agct
cacg
gaga
tggg
ctttc
cgga
gaac
agag
c56
238
3957
0K
LK
2Y
ES
ctca
tcca
gtct
cgga
ttgtg
ggag
gctg
ggag
tgtg
agaa
gcat
tccc
aacc
ctgg
cagg
tggc
tgtg
taca
gtca
tgga
tggg
caca
ctgt
gggg
gtgt
cctg
gtgc
accc
ccag
tggg
tgct
caca
gctg
ccca
ttgcc
taaa
gaa
563
2380
069
KC
TD
3Y
ES
ggca
gcaa
gtgt
ttacg
agcc
cata
tttgg
attg
gact
atcg
aacg
agta
gcttt
aaat
gcaa
aggt
ggttg
gagg
gcca
catg
gaga
caaa
gaca
aaat
ggt
tgct
gttg
cctc
agag
agta
gcat
catc
ttg
564
3428
646
MY
BPC
YE
Stc
tgat
tgac
aaga
cgaa
gttc
acca
tcac
aggt
ctgc
caac
156
529
9843
4R
AL
AN
Oag
taac
tgtc
cgct
agaa
gtct
gtcc
aaat
ttaaa
atgt
gtgc
cata
ttctg
gttc
ttgaa
aata
agat
tcca
gagc
tcttt
gatc
gcttt
taat
aaac
tgca
agttc
Oat
tttaa
atga
aggg
ccag
cata
tata
cttg
caag
ataa
ttttc
agct
gcaa
ggat
tcag
cacc
agtta
tgttt
gaat
gaac
cctc
ctttt
ctct
gaga
ttctg
gtcc
ctgg
aaat
ccct
ttctg
ctag
tggt
gagc
atgt
aagt
gtta
agttt
ttaat
ctgg
gagc
aggg
cata
ggaa
gaaa
atgt
cagt
agtg
ctaa
tgca
ttttg
cact
aga
acgc
ttcgg
gaaa
atat
tcat
gcttg
ccat
ctgt
tcat
ttcta
aattt
atat
tcat
aaag
ttaca
gtttg
atac
agga
atta
ttagg
agta
attc
ttttc
tgttt
ctgt
tta
taat
gaag
aaca
ctgt
agct
acat
tttca
gaag
ttaac
atca
agcc
atca
aacc
tggg
tata
gtgc
agaa
aacg
tggc
acac
actg
acca
caca
ttagg
ctgt
gtca
ccat
tgtg
tggt
gtac
ctgc
tgga
agaa
ttcta
gcat
gcta
cttg
ggga
cata
atttc
agtg
ggaa
atat
gcca
ctga
ccga
tttttt
tttttt
cctc
tttgc
agtg
gggc
tagg
acag
ttgat
tcaa
caaa
gtat
tttttt
ctttt
ttctc
agtc
ctaa
tttga
acag
gtca
aaga
tgtg
ttcag
gcat
tcca
ggta
acag
gtgt
gtat
gta
aagt
taaa
aata
ggct
tttta
ggaa
ctca
ctct
ttaga
tattt
acat
ccag
cttc
tcat
gtta
aata
tttgt
cctta
aagg
gtttg
agat
gtac
atct
ttcat
ttcgt
atttc
tcat
aggc
tatg
ccat
gtgc
ggaa
ttcaa
gtta
cc
566
3544955
—N
Ogc
tttct
cctc
ttccg
ggtc
ttgga
ctg
567
2981913
EZR
NO
cagc
taag
atgc
catg
tgca
ggtg
gattc
catg
ccgc
agac
atga
aat
568
3013230
CASDl
NO
agac
tgtc
tgca
cctg
tattc
attg
tgga
acttc
ctct
ttcat
tgga
aact
ttctta
ctca
agaa
tgac
ggca
gtat
tgttt
tctta
tatg
tgca
atga
agtg
gaat
gata
aaca
gtat
gcct
ttaat
ttata
tgtg
ttcttg
ttctg
atgt
tgttt
cctg
aaat
gattt
ttcttc
ctaa
ctgt
ggttt
tcgg
gtat
gcaa
gcct
aaat
ctttg
taca
ctttg
tctc
acag
aata
gttc
tgag
gctc
catg
acag
ggttt
tgtc
attg
ttgat
gtta
ttgttg
cttc
gtttt
ataa
aaaa
gcca
aaat
tttttt
tcca
atcc
aaac
gttc
acct
gtttc
ctttc
ctca
agct
atac
cagt
gtaa
tacc
agtta
ccct
gtgg
atcc
attta
a
569
3739
279
NO
gtgg
cagt
atct
gtga
cgat
ggga
agt
570
2409084
NO
agat
tttttg
ccac
tgcc
ttgag
gatc
cctg
attc
ctat
tgat
tttac
ctgc
ccttg
gacc
tgtc
cacc
tcta
tcttc
acct
ccac
ctcc
ctga
tccg
gtct
ctaa
tca
actc
tcac
ctgg
acaa
caga
aatg
gcag
ccca
aattg
tctc
ccac
at
571
2848243
NO
tcca
cagg
acta
gctg
gcag
atct
aacg
ttggg
gaga
gaga
caaa
ggtg
gggt
caca
ggca
gccc
cagc
gtgt
tggc
ccaa
gtta
ctgg
aagt
ttgga
gttg
ccttg
ggaa
gtgg
aagc
ttgtg
tcta
gcac
ctta
gacc
accg
ggcc
atcc
tgcc
acac
gcttg
gagt
tgct
ttgaa
ctat
gctg
ggga
aggt
gcag
gcg
gagc
tggc
tggg
aaag
tcag
catg
ggca
tgtg
aagt
ttggg
tgta
catg
acat
cagg
ctgg
cagt
ggtg
ggtg
agaa
ggag
tctg
gggt
tcgg
gcca
gagg
gtgg
gctg
gaga
tgta
ggttt
gttg
ggca
tccg
caca
gatg
gagc
ctgg
gagt
agct
gaga
ctcc
atgg
ggag
cacg
agga
tgag
ctgt
tgaa
ggca
tgag
ctct
ggac
cctg
gcat
taga
ggtc
tagg
aaat
tagg
ggca
acca
gtag
gact
tgag
aggg
gagc
ctgg
ggag
gagg
aagt
gcca
ggag
tgag
ttac
actg
gcgg
ccca
gtga
aacc
gggg
ttcct
gggg
ggcg
ggat
gtgt
gtgt
gtgg
cagg
gctg
ggtta
tctta
tacc
tttca
gaac
tatg
tttaa
agttt
tgta
gatg
taat
taga
atag
gatg
agttc
atgc
cact
ttgaa
acct
ataa
ttgta
tgaa
taat
tttga
attc
cttta
cgca
gaca
tgac
acat
cacg
gcttc
tcttt
gcgt
atga
tatc
a572
3775781
—NO
tgac
tgag
ctgt
ggac
cgca
agca
gtgg
cagc
ccct
tggc
tgct
cgct
atttt
gagt
ctgg
agag
attg
ttcac
gagg
agca
cagg
ctgt
ttggc
aaca
tca
gagc
t
573
3990538
—NO
ctgg
gtag
tcta
cgag
aaat
gtca
atta
ttatc
tcta
ctac
aact
actta
cata
tatc
taat
ggga
aaag
agtg
gggc
ttagg
tgtc
agag
tgga
tggg
agac
aaa
ggag
aagc
taca
ctaa
taaa
taca
acaa
gtgg
aagg
tacc
tgtc
ccat
tcct
aaaa
ggat
ttgtg
ggca
atgc
tggc
acttg
gtgg
ccag
gaga
atct
tctg
accc
cact
ctcc
ctcc
tcttc
agtc
ctga
agac
ccca
agaa
ccca
gtta
ggat
cccc
tggc
caga
ggtc
tctg
tgac
tgcc
tctg
gact
cagc
acgt
gca
Ogc
agct
tggg
agga
tttga
gcca
gtct
caaa
aact
tttag
cccc
agaa
tgag
acca
gtga
cccc
aagc
agga
gggc
tggg
atct
ggag
ggaa
gaga
gg00
gggt
ccaa
gggg
accc
tgtg
gctg
aggc
catg
gaga
acca
gtgc
cagg
gccc
aaga
gacc
cattt
ttcca
gtta
tcag
aggt
gact
gaca
tcttc
tgcc
act
gcct
tgag
ttcag
aaat
ttaaa
aaag
cttg
cagc
aaga
aaat
gcca
gtgt
gcaa
ctgg
gtga
ctaa
agac
ca
574
2411004
KIAA04
NO
ggac
tgac
gcgt
tccc
acttt
ctta
ctct
gcac
ctcc
ccac
tctc
ttccc
attc
aaga
acga
gtttc
tcttc
cctc
tcca
ttgga
aatc
ttgct
ccgg
aggt
ccga
a94
acac
gtat
tttaa
gcgt
gccc
ttcct
cacc
tcct
ggag
cacc
ctta
gaag
tattg
ccta
ctta
tccg
ggct
gaga
atcc
ttcat
ttttg
acct
ggct
tttttt
tcgc
ccttt
ggga
gata
aagg
tccc
tctc
cacc
ctct
acta
acac
tctg
cacc
caag
gcct
tatc
ctttg
gggt
cacc
ag
575
2642748
—NO
acag
gggc
gcat
ttgcc
tcac
aagg
aaca
tttgg
caat
gtcg
ggag
atat
tctg
ggtta
taca
agtg
ggag
atta
ggaa
tgct
actg
gcat
ctag
tggg
cag
aggc
cagg
atac
tgtg
aaac
atcc
tata
atgc
acag
gaga
gctc
ccta
caac
aaac
aatt
576
2661956
NO
atgg
acac
ccac
tttcc
atgc
ttag
577
3307945
NO
aaaa
tcca
gtcc
caac
tgca
gatc
ccta
tgca
ctcc
ccca
ac578
3789999
NO
ctct
cttc
ttgtg
gtag
gtcc
caga
aact
tcta
cttg
gggg
taca
tgag
gccc
agct
ggga
ttggg
caca
gaag
gtca
tggc
tctg
cacc
tgac
cctc
acct
tag
gggt
cagg
agag
agcc
aagg
acac
gagt
gatg
ggtta
ggga
ggac
tcag
gcac
catg
gaat
actg
tggc
catc
cttc
cata
gagg
agag
gtac
agg
tg579
3963093
EFCAB
YE
atgg
cgat
tata
ccag
actg
gctta
ggtc
gcat
cctc
acac
acga
aaat
ttaca
cattc
aaga
cccc
attc
ttcac
cgtg
taga
gtat
attc
a6
580
3589632
EIF2AK
YE
caga
gaag
ctga
cgag
gaga
gaag
t4
581
3816
037
—N
Ottt
ccat
cacc
cagc
aaac
gccg
tccc
gcag
cacc
cggg
aagc
tgcg
gagt
cggg
ctgg
ggcc
gccg
cgttg
cgca
catc
ggtc
cttg
582
3417
925
LR
PlY
ES
cggt
gaat
tcct
cttg
ccga
gcac
aaga
tgag
tttga
gtgt
gcca
atgg
cgag
tgca
tcaa
cttc
agcc
tgac
ctgc
gacg
gcgt
cccc
cact
gcaa
ggac
aagt
ccga
tgag
aagc
catc
583
3696
235
RB
M35
YE
Sac
ccgc
tgat
gact
acac
cagt
ctga
tgcc
tgttg
gtga
ccca
cctc
gcac
tgtg
ttaca
agcc
ccca
agga
B58
434
6299
8O
SBPL
YE
Saa
gaag
ctta
tcca
acgc
caac
caaa
gattt
gcat
cagc
catc
tctta
gtcc
agca
agtc
ct8
585
3962
740
TT
LL
lY
ES
ctttg
aatg
ctat
ggct
acga
catc
atca
tcga
cgac
aagc
tgaa
gccc
tg58
626
8199
7—
NO
ccca
gagc
gaga
cgac
tattc
catg
gccc
tgag
caga
caat
ttctc
tttta
aaaa
tggt
ggcc
cttta
atga
tgtta
ttatg
aggt
tttta
aatg
acat
atca
aaa
gttg
tttat
atca
caaa
aacc
caac
agtc
attta
atta
tact
aatg
aagg
tcttt
atac
gtgt
caga
tttttt
aaaa
aaga
cttc
taat
ggcc
atga
gaaa
tgga
ctg
ttgct
gctg
cttg
catc
acag
cagt
ggtg
gcag
cattc
tttgt
agag
gaag
aaca
tattt
tata
atgt
ataa
ggga
atttg
cattt
cgct
tttca
cggc
aaac
ctttc
tgtg
tttttt
gtaa
atct
aaca
attg
caac
aaca
tgga
atat
ttagt
tggg
agat
gtttg
gagt
aagt
gttg
aaag
caac
tatg
tttgc
ttgct
tttttt
ttttta
tggt
taga
gaac
aact
gtgg
cgac
tttca
caga
ttgaa
ctag
taca
gttc
aaag
gaaa
atca
tcga
cttta
gcac
atct
gata
gcta
tata
ggca
tggc
agag
gtgt
ctt
gaaa
aaac
587
2743
344
PHF1
7Y
ES
ggtc
ctag
agga
atttg
agca
gcga
tgct
acga
caat
atga
atca
tgcc
atag
agac
tgag
gaag
gcct
gggg
atcg
aata
tgat
gaag
atgt
tgtc
tgtg
aO
tgtc
tgcc
agtc
tcct
gatg
gtga
ggac
ggca
atga
588
2992873
NO
ttccc
aagc
actg
tacg
gggt
attg
ggaa
ggta
agaa
caaa
aaag
atgt
agtc
cttg
ccct
taca
ggtc
tagt
tgag
gaaa
gaaa
gggg
gattg
ttttc
cagt
tgca
ctga
atat
gaat
caga
atga
ggaa
gaaa
gtac
ttgga
agcc
tcta
gctg
actc
tggt
acca
ttcca
cggg
gcca
tgat
acct
ttctta
gtc
589
3568027
NO
acat
atttt
atgc
tatg
ccct
gggg
gtaa
aggg
tgac
taag
acgt
gtaa
ggca
cttg
cattc
taca
ggcc
attg
ctttt
catg
ggta
tagg
ttttc
tctc
ccca
gtt
agac
tgta
agcc
ttgttt
590
3830
188
FXY
D3
NO
ctgg
agac
ttcct
atgt
gtgc
attg
gggt
gggg
cttg
gggc
acca
tgag
aagg
ttggc
gtgc
cctg
gagg
ctga
caca
gagg
ctgg
cact
gagc
ctgc
ttgt
tggg
aaaa
gccc
acag
gcct
gttc
ccttg
tggc
ttggg
acat
ggca
cagg
cccg
ccct
ctgc
ctcc
tcag
ccat
ggga
acct
cata
tgca
atttg
ggat
ttac
tagt
agcc
aaaa
ggaa
tgaa
agag
agct
ctaa
ccag
atgg
aaca
ctgg
aaca
ttcca
gtgg
accc
tgga
ccat
tcca
ggaa
aact
ggga
cata
ggat
cgt
cccg
ctat
gatg
gaag
tgttc
agac
agttt
ataa
tagt
aagc
ccct
gtga
ccct
ctca
ctta
cccc
gaga
cctc
acttt
atta
caag
atct
ttcca
aata
ccca
aatg
tccc
tgca
agcc
cgtta
aata
591
4011
203
OPH
Nl
NO
tccc
aggc
tcct
atgc
aagt
tttttt
tccc
catta
tatc
acac
ttatc
tagc
aagg
gacc
ttgtg
gtttg
tggc
tttag
tggc
catc
atttc
tggg
ggttg
gcttt
tac
ccttt
ttcttg
aata
tttgc
cacc
aagt
gaaa
aatg
ttagg
acat
aaac
ccttg
ccag
gtcc
ctttc
atttg
ctat
ctct
atta
ttgga
aagg
acct
aaaa
attg
gtgt
aatg
gggc
agaa
atct
gagg
aatg
gaca
tttct
aattc
ctgt
ttgttg
aagg
gaag
ttgct
ggaa
agag
catc
agta
cttg
tttct
atgc
agat
gcct
gggc
cgta
gcttg
tctg
tagc
gtct
gtat
aatta
taat
gttg
ccca
gtgt
gagg
gaaa
gagc
tttcc
tact
tgca
ctct
tcta
ccaa
ggcc
ctgt
tagt
gcac
tgat
tata
gta
ctga
caga
taaa
ccta
gata
aaga
gata
gaga
gtga
gtac
atgc
acac
tcat
gtgc
aaac
ccac
tcag
agat
gcat
ttgga
acag
tgct
actg
aaag
gcag
tagt
cattt
tcaa
gact
gaat
tcca
aaca
tggt
ttattg
gtga
gtta
ggaa
catg
taag
gcca
agta
cact
gaga
gcct
ttttg
aaag
taat
tgag
tgga
aact
tgat
gcca
ttcta
aatc
aagg
cata
tcca
ggtg
gccc
ggttg
aact
cact
ccac
tgta
ccca
gtct
caaa
ggcc
aggt
tgct
aaga
aacc
agga
gtaa
aaga
gtca
agtg
acca
tcat
ttcac
ctgc
tgct
tgcc
ccca
atag
tagt
ctct
gtga
ggcc
ttact
gacc
tcac
ctag
gaag
tgat
ttttg
agcc
cttg
tttca
gggc
tgtg
gcct
ccct
gctc
tatc
ctga
ataa
agca
gaca
ggtg
tgca
gattt
tggc
catg
aaag
catg
gcta
atag
ggcc
acag
tccc
tttaa
agaa
acat
ggttt
gact
ctgg
ttttc
ttggg
ggaa
aata
ccac
aatc
accg
atgc
aaac
attg
gaag
atta
ttgag
agcc
ctag
aaag
ctgc
t
592
3349
796
—N
Otg
caga
aaag
gtgg
ataa
ccca
actc
tggt
ggg
593
3040
457
TW
IST
NO
tgcc
tcag
ccat
tcta
gtttt
gttg
tgtg
ttgga
ggac
agtc
taat
gaag
acag
gata
aatg
tgat
tctg
acta
gtgg
gaaa
tttca
gatta
ctttt
taat
gattg
gaN
Bat
gggc
tata
agat
attg
tgct
gaag
agaa
aaca
gttg
ttctg
ttcac
tgct
ttcat
tgag
aaaa
tgtg
gaat
gtct
cttg
acta
attg
acaa
agtg
taat
gaaa
tgac
aagc
tggt
tgaa
gctg
gtaa
tcaa
tgca
tggc
cata
tttat
tcat
tttcc
acac
acttg
acca
tcta
ctttg
tata
acac
attc
tgct
aagg
ccca
ggga
taca
atgg
caaa
caag
atag
atgt
agttc
cttc
ccttg
tggc
atgt
cata
agag
aaac
agat
gtgc
atat
acac
acaa
taaa
taag
aaaa
ttaat
aattg
tatta
agtg
gtat
gaaa
aaaa
gggg
gagg
attg
ttcac
tcac
tgtta
cctg
tgtta
gtat
ctgc
aaat
gcat
gaat
ggttg
aact
ccca
tgtc
agca
tttttg
ttgcc
attta
ttta
gtac
caga
cata
gtgc
tggt
ctca
cagc
ttaat
atttg
gtaa
tgaa
taaa
ttctg
ctag
tggt
atat
gttg
atct
gaac
ttaca
atga
tggg
atat
agaa
ttggc
agag
tggc
agat
gttc
acaa
ttgtc
taca
agta
gatg
tgct
agac
aatg
gaag
gatg
cagg
ccaa
cctc
tttga
ttaca
attg
agat
tcat
gtga
ctat
tgag
ccct
ggaa
atgt
agct
tgtc
ca
594
3895
503
—N
Ogt
ggcg
gacc
gtag
aaga
gcaa
gccc
a59
539
0765
4N
CO
A5
NO
ccag
gccc
agct
agcc
aagt
ttgga
atgg
cattt
gtca
tgtc
agta
gcca
ccac
ctttg
ttcat
tgtg
aacc
tacc
aagg
ctttc
cagc
ttcat
acac
attg
acc
agag
ctca
agct
cctg
cctg
caac
tcct
gcct
agag
ttgaa
gaaa
agca
aact
ggcc
ttggc
aggc
acag
tgtc
atca
tacc
ctca
cccc
atat
gtttg
ggg
tctg
cttg
agga
ttcat
aaat
cagc
cact
ctgg
attg
ttgag
gaat
ggcc
atgg
cagc
caca
gaaa
aaag
aattt
ttctc
tctg
agcc
aagg
ttgttt
tttgt
ttttt
tctc
ttttc
tttttt
gtttt
cattt
cattg
gaag
atct
ccaa
tgga
ctga
acag
ctcc
agtc
agca
gcag
ttacc
acaa
actg
tgaa
tctg
ggcc
ccac
cact
cttc
cct
gtta
acca
gttc
tgtc
agca
tccc
cctc
tcca
gcag
cact
tcca
tgaa
gttg
gttc
tgag
actc
tggc
cgtg
aaca
cccg
tttct
tcag
tgat
ttgttt
tggg
ctttt
ggct
caaa
accc
cagg
ctct
tgttt
ttgtc
taga
ctct
tattc
tgttt
cctg
agca
gcag
gagg
tagg
gacc
acttt
gatg
tcag
acttc
tggt
agct
ggac
atgt
tctc
gaga
tggg
tggc
tgttc
gcga
ctttt
gtac
caga
gtga
aattg
ttaga
agga
gggt
ttctg
gctg
tggt
tcta
aatg
gagc
ccca
ggaa
gctg
ccct
ctc
ccca
gggt
ttgtg
ctca
gtag
agcc
596
3679
982
—N
Oaa
gaga
cact
tttcc
ctaa
gcag
gaag
ttaat
aatta
agtta
acca
agaa
atag
aagt
gcat
gagt
attg
tttgg
aaat
atgg
agaa
ggga
agaa
gagt
aga
aaca
gctg
aaaa
gatta
aaag
tgat
tgct
ttggc
gagg
t
597
2858
366
—N
Oag
ctac
acag
gaaa
taac
acca
ccaa
aaat
aaca
cattc
aaac
tcag
aggg
caat
cttc
ccta
a59
830
4821
5M
RPS
2Y
ES
acca
tgcc
gcag
agcg
aacg
gtgg
agga
tgttt
tcct
tcgc
aagt
tcat
gtgg
ggta
ccttc
ccag
gctg
cctg
gctg
acca
gctg
gtttt
aaag
cgcc
ggg
4gt
aacc
agttg
gaga
tctg
tgcc
gtgg
tcct
gagg
cagt
tgtc
tcca
cac
599
3168
775
—N
Oat
ctgt
agcc
gtag
taaa
ttttg
gaag
ggca
aatc
aata
tggc
gggt
agag
gcta
gttg
gtga
agct
tatta
acac
agat
tcttt
tcca
ggct
aaag
ttgtc
ttca
gtgg
ttaac
ctttg
tttct
ccta
gtag
agag
gtgg
gcag
gata
ccttt
tgtc
tttgc
aaat
ctat
gtct
tgat
ttagg
caag
tgga
aggc
agag
aggt
ttcttg
cat
ctgc
tcct
tcaa
aact
acct
tcag
ta
600
3643
388
NO
ggag
gtat
tact
gagc
gcca
cctg
gctg
601
2927
513
TN
FAIP
YE
Sgt
gacg
gcaa
ttgcc
tcat
gcat
gcca
cttc
tcag
taca
tgtg
gggc
gttc
agga
caca
gact
tggt
actg
agga
aggc
gctg
ttcag
cacg
ctca
agga
a3
acag
acac
acgc
aact
ttaaa
ttccg
ctgg
caac
tgga
gtct
ctca
aatc
tc
602
3911
005
___
NO
atgc
ctga
gggc
atcg
acca
gaac
gaac
caga
cacg
agtc
cctg
ccct
ctgg
aagc
tcac
gttc
cagc
agag
gaga
gg60
339
4279
7IN
PP5J
YE
Sttc
atcc
tggg
ctac
tata
gtca
caac
caca
gcat
cctc
atcg
gcat
cact
gaac
cctt
604
2802
457
—N
Ogg
ccta
tggt
agga
ttgta
ggca
taag
tgct
ggat
cagg
agtc
atcc
ctct
tcta
cctc
tgct
gcct
tctg
atgt
ttcttt
tgcc
aacc
agca
cgcc
ggac
cagc
ccttc
ttgttt
tctc
ttttg
gttc
tagg
tcac
aact
ctttc
tc
605
2874
851
RA
PGE
YE
Sca
acac
ctga
gcgt
ctca
taat
gcat
ttaat
agaa
gaac
attc
catc
gtgg
atcc
aact
tata
taga
agat
tttct
atta
actta
cag
F6
606
3147
604
AZ
INl
YE
Sga
aattg
gcttt
acga
tgaa
catg
ttaga
cattg
gtgg
agga
ttcac
ggga
actg
aattt
ca60
738
2230
3C
CD
Cl
YE
Stg
ccat
ataa
catc
tggt
gcga
tggc
tgca
agaa
ccac
atcg
gcat
gg30
608
2767
313
—N
Oct
gtct
ttgga
ttgat
ggct
caga
gaag
atgg
cagc
atag
atct
ttaaa
gcag
tttga
gtgg
ggaa
ttcttg
gatc
ttggg
gtac
ttggc
catg
tgga
cagg
att
ggct
ggag
aagg
aggg
tcag
gtgg
cggt
cctg
ctgc
caag
gtct
gagg
gtaa
tact
gctc
ctaa
g
609
3159
245
—N
Otg
tgtg
tctc
aaac
ttcaa
gaag
tctc
tagg
ataa
ccca
aagg
caac
aagg
gaga
ccaa
aaca
agga
cgct
ggag
acat
tttag
cctc
tgat
actg
acag
ctgc
agca
aaca
gaaa
acgc
agcc
taac
cgcg
ggcc
agat
aaaa
acct
caca
610
3403
122
PTPN
6Y
ES
gaag
cagc
ggtc
agca
gaca
agga
ga61
138
4959
9__
_N
Otg
ggca
cccg
ttatc
ctag
ttact
cggg
cggc
tgag
gcag
gaca
attg
ctta
ccca
gagg
ccg
612
3534
950
KL
HD
CY
ES
tgcc
tatg
atca
ccttg
catg
gaca
gcaa
tcct
gtaa
acat
caca
gagt
ggca
tcat
ttgta
taat
tata
tgca
ttgttg
tagt
ttgca
cctg
ttggt
tttaa
tgtg
ca2
tgtg
aatg
gcct
agag
aacc
613
3631
427
...N
Occ
ctgt
tttga
tatc
ctgg
ttgtg
aaat
gtct
tatg
ttttc
tgct
aatc
tattt
gtgt
taaa
gtaa
gcag
ggttg
gttta
cagt
accg
tggc
tcaa
aagc
tagt
gtag
gca
gcttt
cctc
tcct
g
614
3779
594
NO
acgc
caag
agtc
cagt
gcaa
cgat
ccct
gcca
cacg
gtca
gccg
accc
cact
tggt
gcca
cagg
gtcc
agcc
tcgg
ccac
tctg
ggct
caac
ctttc
cag
cacg
tctg
ggag
ccca
atga
ggct
atgg
gaca
gtgt
gcat
ttccc
tcaa
ggcc
tccg
gaaa
gtca
cctg
ccac
ccac
tgtg
acct
ctga
ggct
gaaa
gga
ctaa
ggac
tcct
ttccc
catc
atttc
agct
gggc
cagt
ggct
ggct
aaaa
tcag
cgca
gcat
tccc
attc
ttctg
aaag
ccac
agct
gaaa
taat
gcac
cagc
tga
aacc
acag
tgga
caga
ctga
gatg
gcct
ccct
ctgg
gcct
ctag
ttggc
aact
ttctc
accc
gggg
gctc
ctct
ttctc
ctttt
cccg
ctcc
ttgttc
cagt
cct
taag
gcct
tctg
gaag
tgag
gtgt
cctc
atcc
aagc
tgcc
agtg
ccttc
caca
aattc
ttcat
aggt
agat
ttttc
cgca
aatg
gtct
gggg
ccca
aaa
615
3931
281
NO
cccg
agga
gcttg
tgat
gtca
ccag
tgtg
ccct
gcgg
gatc
tggg
gaac
agga
gaag
tagg
ccag
agaa
gtcg
ggttg
aaag
gtgt
gtcg
tttgc
gtcc
tgg
atct
ccct
taat
ttaac
ccca
taac
attc
cattt
taca
gata
agga
gact
gatg
ctca
ctaa
gaca
ctaa
tatg
ccca
ctca
caca
gcaa
gtcc
caag
tctc
tcc
ccaa
aatg
atttg
agaa
ttttg
aggc
acac
aaaa
ggga
gaaa
attg
agtg
cccg
tgat
ccaa
cgttc
tgca
actg
tttgg
aaac
cagg
aaag
tgga
gga
gtgg
tagt
acttc
tgaa
ccct
a
616
3110077
—NO
tggg
aaga
ggat
tcgg
actc
gtca
cact
gcag
agca
gcag
agcg
agaa
agga
tgag
aaga
ggca
gaga
aggc
gacg
gcag
aaga
aaaa
agga
aaa
actg
cggc
cgcg
atca
gagc
ctga
617
3419463
—NO
gcgc
cctc
gccc
tggg
aaac
tcac
tgga
gtgg
ggac
agat
ctca
ggcg
tttgg
cagt
cac
618
3903370
—NO
attg
tattt
ctat
tgga
cgct
gctg
ctct
aggt
tact
gagt
tt619
3609638
—NO
agaa
tcag
gaaa
attg
gtgc
aggg
attg
gcac
actg
cagg
gaga
atgc
atgg
gctc
catg
620
3227685
—NO
tcttg
ggta
ctca
ccttc
agcc
ctag
agca
ttcttt
agca
ctca
cagt
ttctc
ttctc
gtcg
ggtta
ttcttt
atat
taat
ctttt
cctg
tcca
aatta
ctct
ctat
gtct
cctg
atca
gtcc
ttgac
tgga
agca
tcaa
g
621
3349840
—NO
cata
acct
ctag
agta
gaaa
gtttc
agaa
ggaa
gaga
ttccc
caaa
ctgt
gtaa
ttccc
ttgta
aatc
tcat
tcat
tcaa
tggg
atttg
agaa
ccaa
ccac
ttgtg
cagg
tgcc
cgca
tccg
aaga
taac
ttgcc
ccca
ggag
acag
caga
taag
aaga
gggc
agag
acaa
agga
ggtc
tttttg
cgac
ttgca
tc
622
3035683
FTSJ2
NO
ccca
tgtg
tgcc
ttgac
ggga
cttc
atct
tata
gact
gtta
aact
gtca
caca
caaa
cagg
ctttc
cacc
cctg
ctct
gaga
gcac
cacg
caca
gattt
ccag
ttc
ttagt
gtgg
ctgt
ttaaa
gtag
aaaa
tctg
gggg
ctgg
gtga
ggcc
actc
atgc
ctgt
aaac
ccag
ggct
ttaga
aggc
tgag
gctg
gggg
attg
cttg
aag
tcag
gagt
tcaa
gacc
aacc
tggg
caac
atag
caac
accc
ccca
tgtc
taca
aaaa
tgaa
aaac
caaa
aagc
aaac
caaa
agaa
aaat
ctga
aattt
cca
tctg
ggga
ttaac
ttctg
tcttt
ctgg
tgaa
caat
atag
caat
tcac
gcat
tcttc
aagc
agca
aaag
ttccc
ggaa
caat
tagg
gaag
acgt
atgg
tctg
aattt
atcc
aggc
agtg
ggtc
tgct
ttggt
ttttg
ctgg
aaat
ttata
tcag
tgtc
tggg
ctcc
caag
aaca
taaa
tgta
attg
ccaa
agca
agca
gtga
tgtg
gtgt
gttta
ttttc
tttta
ctca
tcta
ggaa
cttg
acgc
agct
tta
623
3439354
ZNF84
NO
tcaa
gaaa
gagt
cccg
attg
tgttc
cagt
acct
ggttc
ttctg
gtct
624
2812078
—NO
ggca
gtgc
tgta
ggga
gcttc
atta
atta
tgat
agtc
tcgc
tgtg
tcaa
atgg
atta
caaa
tgga
acca
cctg
tggg
gagc
ctgc
aagg
ttacg
gaca
gagg
eta
625
3038635
NO
gata
tgaa
aaag
cacg
catg
caac
taac
tact
gttg
atct
gcttt
gcta
626
3088092
___
NO
acaa
gagg
gcgg
gtac
aaac
gcac
agct
tcca
gggc
tgaa
atac
a627
3337338
ALDH3
YES
atga
agga
cgag
cgtg
tgcc
caag
aacc
Bl
628
3420
557
—N
Ogt
aagt
tgcc
tgtc
gctc
ctct
gaat
ttt62
938
4126
7C
NO
T3
YE
Sat
ggcg
gaca
agcg
caaa
ctcc
aag
630
2929
962
STX
BP
YE
Stc
aagg
agcc
ttgca
cagc
atat
tcct
ggcc
ctgg
tggc
attg
aagg
cgta
aaag
gggc
agca
tctg
gagt
tgttg
gtga
atta
gcac
gagc
cagg
ctgg
c5
acta
gatg
aaag
aggg
caga
aact
tggc
gatc
tgga
agaa
agaa
ctgc
ggcc
atgt
tatc
aagt
gcag
agtc
631
2387
074
MT
RY
ES
gttg
agga
ttatg
cattg
agga
agaa
cata
tctg
tggc
tgag
gttg
agaa
atgg
cttg
gacc
catt
632
3388
958
—N
Ogt
aggt
gccg
aaca
ccgc
gagt
ctcc
gcgc
ccga
cccg
acct
cgtg
ttcga
gtgt
gtgc
acgc
tggc
gttta
tgga
gaga
ctag
aaga
gccg
ccca
cgtt
aaga
gtct
acgt
gccc
tgca
gg
633
3620
562
NO
ccca
aact
gaaa
catc
atac
ctat
ta63
438
7027
8N
Ogt
ttgac
ttctg
acta
aagg
tcttg
635
2833
045
RN
F14
YE
Sta
cctg
caag
cgga
tgag
gcta
ataa
aag
636
3030
300
CU
Ll
NO
gattg
ctgc
actg
gacg
acttt
agaa
catc
cctc
aca
637
3767
510
NO
agat
cgag
gaca
gtgt
tgag
acac
cacc
acct
c63
830
3944
4—
NO
gttc
aaac
agta
gttta
ctcc
tgcc
tcag
<>63
930
6765
3—
NO
agta
tggg
agag
gggt
aatg
ctga
gca
640
3139
926
TR
AM
lY
ES
aatta
acag
gcga
atgc
acttc
tcca
aaac
aaaa
caca
gcaa
gttta
atga
atct
ggtc
agct
tagt
gcgt
tcta
ccttt
ttgcc
641
3203
423
B4G
AL
YE
Sat
gtct
atat
ctcg
ccca
aatg
ctgt
ggtc
ggga
ggtg
tcgc
atga
tccg
ccac
tcaa
gaga
Tl
642
3875
984
___
NO
tcttt
ctct
gggt
tcat
ttcct
aact
gtct
aact
ttggg
643
2848
542
—N
Oat
ggtg
tggc
aagt
tgac
tgta
tcgt
gatg
ttagc
cgtc
ttgtg
gcag
aga
644
3134
299
NO
agac
caca
gaaa
ctgg
gcgg
gcag
caga
tgca
gaca
cagc
catc
ctat
gcat
ttag
645
3538
031
—N
Otg
acca
cgtg
cgca
aatta
tttttc
tttttt
agta
gaat
ggtg
cattt
tttgc
ctttg
tattt
gaaa
tttag
ctaa
atta
caat
agct
atca
gaac
taca
attg
acct
tagt
aattt
caaa
acca
tagt
tata
aggg
tttta
agttg
atttt
taaa
aatc
ttaag
atat
gata
ggga
tttca
aaac
tatg
tgtg
tgta
tctc
tgat
aggg
aggg
gttg
gggc
acag
acta
ataa
gtgt
gcca
aaaa
tctg
tgaa
tcca
aatg
tgta
acaa
gtgt
tgtc
tgaa
taaa
tgtc
caca
aatc
catg
ctat
tattg
646
4037
525
NO
tgta
ttaga
atgt
aatg
aact
ttaat
ggaa
tgta
ctcg
aatg
gattc
gact
ggaa
tgga
atgt
tctg
gaag
tgaa
tgga
ctcc
aatg
gaat
ggat
tcaa
aagg
aat
ggaa
tcgt
acgg
a
647
2814
589
BD
PlY
ES
agct
tctg
acat
ctct
ggag
gtttc
agca
agaa
aaga
ttgtg
tagg
ttcca
aaga
gtct
gcttt
ggca
aaaa
taga
tgcg
gaat
taga
agaa
gttg
gacc
atc
aaga
aggg
ttgga
gagg
aaac
tgta
ggag
ataa
ttcac
catc
ttc
648
2823
139
—N
Otc
ccaa
cctg
gcag
tgct
gagt
ggg
649
3113
389
CO
Ll4
YE
Sgg
acag
aacc
agct
acaa
ccat
agtg
ccta
ccac
Al
650
2853
728
C5o
rf42
YE
Sca
gcag
tgag
agtc
gtcc
agtc
catg
gctc
gtttc
atgg
ctgc
ctat
ttcac
caat
cagc
agct
ttgca
ttttg
cccc
ctca
651
3895
815
RN
F24
YE
Sgc
tcgg
atttc
ccac
atta
caac
ttcag
gatg
ccta
atat
tgga
ttcca
gaat
ctgc
ctct
caac
atat
atat
tgtg
gtttt
tggt
actg
ctat
atttg
tcttc
atcc
ttagt
ttact
cttc
tgttg
ctac
652
4013
748
BR
WD
3Y
ES
ttgttg
attta
gattc
agac
ggtc
ctgg
tact
tcat
653
3384
440
—N
Ottg
tcct
tggg
caaa
cgtg
gtat
acttt
ctca
gtgg
tggt
tgga
agga
aattt
gttg
ccct
gtca
tttcg
caag
tatg
gaaa
aaga
ccta
aagg
agaa
tttag
cagc
tgca
gccg
ctgt
tttta
ttgcg
tact
ttact
agcg
ccaa
ctct
ttaat
aatc
gcttg
ttagt
tgtta
cccc
tgttt
tatg
ggtc
aaga
gact
caat
atat
cagt
actt
tggt
aagg
ccag
gcag
c
654
4026
568
FAM
58N
Oag
gccg
ggag
tgta
ctgt
gtgc
agct
gacc
caag
gcag
ccac
atct
gcgt
ttgtc
ctttg
agag
gact
ttgac
taca
atac
aggc
atga
catc
aatg
aaag
gA
aaag
tcat
gaaa
tcga
tgag
actg
aatc
ccta
cgga
tttct
taaa
agcc
agat
ttgta
ggga
gaat
gaat
gtgc
a
655
2675
946
WD
R51
NO
gcca
ggga
tttgt
acca
tggg
acttg
gA
656
2941
689
___
NO
tctc
caac
tgca
gcga
cctg
tggc
tcttc
agat
657
2330
052
EIF
2C4
YE
Sga
aggc
agtc
atgt
gtca
ggac
agag
caac
ggcc
ggga
tcct
cagg
ccttg
gcta
aggc
tgtg
caaa
tcca
ccat
gata
ccca
gcac
acga
tgta
ttttg
658
2726
487
OC
IAD
NO
cgca
gtga
ggtc
tgac
gtca
tttcc
L
659
2902
605
LY
6G6
YE
Scc
ccag
tgac
cttg
attc
acca
acat
ccag
cctg
cgtc
gcag
D66
027
3233
2__
_N
Otc
aaaa
gattt
ggag
tcta
gagg
gcag
aaat
tgaa
gagg
ttggt
gctc
tgca
cgtc
tgag
tccg
ccca
caga
ctga
ataa
at66
123
2945
5__
_N
Ottc
ctgg
atca
caaa
gggc
tgta
gggg
ttcca
gaga
tgct
ggag
ctct
ggta
caac
agga
gggc
caca
ggc
662
2373
856
PTPR
CY
ES
gtat
ttgtg
gctta
aact
cttg
gcat
t66
324
6506
9SM
YD
3Y
ES
aaaa
agct
tggc
caga
ccac
aagc
ggga
atgc
aaat
gcct
taaa
agct
gcaa
accc
agat
atcc
tcca
gact
ccgt
tcga
cttc
ttggc
a66
430
4689
9V
PS41
YE
Stg
atgg
cagc
tgaa
atta
gcca
aaa
665
3418
387
GE
FTY
ES
ttgcc
gcttt
gcac
tgac
ctcc
agag
ggcc
agag
ggtg
ggat
ccag
cgct
atgt
cctg
cagg
ctgc
agac
cctg
ctat
cagt
cagg
cctg
gatc
aagc
atg
tggc
tcag
atct
tgga
gagc
caac
ggga
cttc
c
666
3568
670
NO
ttcttg
cagt
agtg
cgtg
agct
ctca
ctct
tgct
ctgg
ggag
tctg
gatta
gtcc
ttgag
ggaa
tgga
ttagt
tcct
gcaa
gggt
aggc
tgtta
taaa
gcca
gga
tgct
ctca
ggct
ctcc
tctg
tttgc
atgg
gttg
gcttc
c
667
2824
307
NO
aggc
acag
gctc
aatta
gtttt
tcaa
actc
tagc
caag
gcag
tattt
cattt
ggga
aatc
atgc
aaca
gaac
tgct
caat
tctta
acttc
tcct
gctg
ttaac
attta
cact
taga
ctgc
cagc
aaca
gtta
actta
aattt
tggt
ctca
aggg
aaca
aaaa
aaaa
ttgca
ttcag
aattt
aata
tagt
atttt
aaaa
ctaa
tttta
gcct
gtaa
gtc
atta
tgag
caat
agta
acttt
tgta
cctc
ctca
tcttg
tctg
ataa
tata
ttcta
tatg
ctgt
caat
ctga
ttata
tagt
ctat
atgc
taga
agttg
ctga
ttttc
attc
tgc
cacc
aaga
aaac
tgta
ctttt
attta
tggg
aaaa
ggat
ttaaa
tact
ccta
gata
ctta
aaat
tttttt
aatc
taaa
agttg
atttt
ctcc
caag
aaca
atttt
gtttg
ctttc
taca
aagt
tttaa
gttta
ataa
aatg
ccgg
taat
cacc
actg
ttttc
ctcc
ccaa
aata
aatg
ctct
gatc
cata
ttata
ttaga
gtga
tact
gtca
ctta
gtat
tgat
atct
ttaat
atttt
ttaca
tcat
gaga
cattg
ttatg
atct
tacc
tgat
tgtta
tctc
aagg
tgag
ctaa
tcat
cttta
tttgc
ctta
aaaa
atac
tgta
ctgg
ctgg
atat
ttaat
cttg
ttagt
ttagt
tata
cact
tgttt
tagt
cgag
tttaa
ttcaa
aagg
actc
ttaac
agta
tggt
gtaa
aact
caga
ttttc
tagt
ccag
acaa
attg
ctc
tcta
taac
gattt
ggca
gatc
aatg
gaga
catc
cgttt
taac
tttac
tgttc
tgta
ctta
ctat
gtag
tcat
gtgc
agct
tatc
aaca
caaa
gaat
acgg
atga
ggg
cattt
aaat
ggac
taca
agtg
actc
tgac
ttgta
aact
aggg
aagc
ctct
tgtg
tttga
cctta
aagg
ccag
agaa
aaac
tgaa
gata
gattg
actta
acta
ttgcc
aagc
agat
tatg
ggtta
cttta
tact
tcct
ttcat
tc
668
3204568
—NO
agcc
ctca
ggcc
ctac
caac
cttg
actg
tcct
gtcc
c669
3410647
—NO
ccca
ggag
tact
agttg
aggg
ccag
gctg
ctga
gaag
cagt
tgaa
gtag
ccttg
aagc
aagg
atgg
gaga
ttgct
gggt
gaga
tcaa
agat
acgc
tggg
atgg
gggt
gagt
gaaa
gtgt
gagc
aatg
agta
gagg
caag
attg
taga
tgct
gcat
ttgga
ggaa
tattt
ctga
ctgc
ataa
tcag
aggt
caga
atgg
ctag
agaa
atta
atac
agag
gtac
aagg
ctag
attta
gaag
agta
agct
aaga
acag
actg
caat
gggc
ttttg
aatg
2555029
catc
cacg
taca
cata
tgag
ctgt
acac
atag
atgt
taat
cagg
ttgct
tttttc
atttt
attta
aata
ttcaa
aata
ttgtg
tgat
tgtg
atcc
attg
agat
cattg
aag
Ul
670
—NO
tagt
atta
attta
gctg
ggaa
ttagc
agct
tatg
ttgtg
tggg
ccca
gttc
atca
atat
gtgt
gtca
aata
tcca
ccct
caga
tatc
agca
gcct
ttgac
ttgca
gtca
ga
671
3238290
MLLTl
YES
tagt
acct
taat
tggc
ctcc
cttc
a0
672
3490689
___
NO
tata
gagg
aaac
ctat
cagg
gcag
ggac
agt
673
3571835
—YES
catc
catg
gattg
ggcc
cact
actc
atag
cagc
tcct
gagg
cagg
taaa
tggg
gcaa
ggac
tcac
gcca
aata
ccac
acgc
gtat
gtca
aaca
ccca
agg
tcgg
attc
tggg
gccc
ctgc
cctc
agga
ggct
gtca
gact
gttc
cccc
taaa
caca
cttc
tgaa
ggttg
tagg
gacc
ttgcc
ctgc
tgtc
t674
2318698
PER3
YES
gggg
cacc
cgttc
atta
cttc
gaga
agca
gctc
accc
ttgca
gtta
aact
tact
tcag
gaag
agat
gccc
agac
cctc
tgaa
tctc
caga
tcag
atga
gaag
gaac
acgt
gccc
acaa
actg
agta
675
2330382
NO
tggc
ccag
ctac
atttg
caat
gatg
tgca
cgta
aaca
ctgt
taag
aggt
taaa
gcttg
tata
caat
ctgt
tact
gtga
aata
acta
aattg
ggct
ttaaa
aaaa
tct
tagt
attta
ttgat
cttc
attc
acat
atac
agttg
aaat
ttaaa
ataa
caga
tggt
tattc
caat
gctg
ctga
aacc
ttttc
taaa
aaat
acttg
ttttg
ttggt
tgaa
tgt
gatg
agag
gcgc
ttctg
ggca
gtct
ctct
tctc
tccc
accc
gtct
ttcct
cctc
cgag
tacc
ccttc
tcca
gcttt
gtac
tagc
catg
taaa
accc
aagg
ttttc
ttta
aaac
atca
gaag
agat
ctcg
tcct
ccat
gccc
caaa
aaag
ccaa
ctca
ttgga
ggtg
ttacc
c
676
3837
059
GR
LFl
NO
ttggg
tgttg
aagt
ggaa
tcgt
ttcat
ccca
gcc
677
2582
942
NO
tcaa
tcag
gtca
acca
cagg
tcat
atga
aacc
atct
tccc
aact
cccc
cacc
tcca
ctgg
ttctg
aaga
gtgg
gtgt
aaac
aatg
ggtta
aaca
atcc
agga
t
gcag
acag
ctgt
taat
cata
actg
gctc
caca
678
2876
012
NO
tgtc
tcat
ctac
caat
tcct
gcta
ttata
agat
caac
ttctg
caaa
aacc
ttatc
cacc
tcaa
gtat
cctta
gcag
c
679
3372
066
NU
P16
0Y
ES
ctttg
cctta
acttc
cacg
gata
tctg
ggcc
ctgt
ggca
tgat
gctg
agaa
ccaa
acag
tagt
gaaa
taca
tcaa
ctttg
a
680
3781
765
C18
orf8
YE
Sga
gatg
cctc
ataa
atttg
tgat
agcc
gtgc
tgat
ggaa
taca
ttcgt
tctc
ttaac
cagt
ttcag
attg
cagt
acag
681
3278
980
DC
LR
EYYE
ESS
ctgg
atca
cagg
tgca
tacc
gcca
tgcc
1C
682
3435
777
C12
orf6
NO
gttc
aagt
gcaa
tgac
acac
gcct
ataa
tccc
agca
ccttg
ggag
gcta
agac
agga
agat
cacc
tgag
ccca
ggag
ctca
agat
tgca
gtga
gcta
tga
5ac
acca
ctgc
actc
cacc
tgtc
cact
tgttc
ttgtg
tgac
agaa
caag
accc
tgtc
tcta
aaaa
ataa
gata
aaac
cata
aaga
aaca
cagt
cagt
acta
taca
agaa
taat
ggct
acttc
taga
ggga
agga
gttg
tcat
tgtg
atga
ggca
cttg
gagg
ggttc
tggg
gtgc
ctga
caaa
gttc
tgttt
cttc
acct
gggt
ggta
gtta
gaag
ggtg
tccc
cgta
tttca
aattg
tacc
tttgt
gaga
ttgta
tg
683
3524
572
EFN
B2
NO
gcta
ggct
gtct
gcga
agaa
ggct
agga
gttc
atag
aagg
gagt
gggg
c
684
3790
001
___
NO
caga
acaa
gtgc
atct
gctg
gcag
ccag
ctta
agga
gtta
gcag
ccct
caat
tcca
ataa
cgaa
cgtg
taaa
tcag
aattc
685
3301
881
TM
9SF
YE
Sga
agga
gatg
gata
aagc
agat
gttta
ttggg
gcat
tcct
tatc
ccag
ctat
ggtg
tgtg
gcac
tgcc
ttcttc
atca
atttc
atag
ccat
ttatta
ccat
gcttc
a
3ag
agcc
at
686
2745
373
LO
ClO
ON
Otg
gcag
atta
tgtg
gacc
atca
acaa
acta
ccaa
aa
1301
78
687
2774
241
...NN
OO
tgca
caga
caat
tccc
tttac
tagg
tgtc
acca
tagc
tacc
atct
tcat
ctcc
ttcaa
atgt
tgct
ttggc
attg
aggc
ttacc
ctca
cttc
ctta
tttaa
aatta
caa
ccag
ctgg
gcgt
ggtg
gttc
acat
ctgt
aatc
ccag
caat
tttgg
aggc
caag
gtgg
gcag
atcc
cttg
agct
cagg
agtta
gaga
ccag
cctg
ggca
aca
tggc
aaaa
cccc
atct
ctat
aaaa
aata
caga
aaaa
actta
gctg
ggta
atgg
tggc
acac
acct
gtag
tccc
agct
attc
ggga
ggct
gaga
cggg
agaa
tcgc
ttgag
ccca
ggag
atcc
aagc
tgcg
gtga
gtc
688
2852
754
AM
AC
YE
Sta
cgac
ttaca
ggac
agca
gatg
ggga
attc
atgg
ctgt
tgga
gcaa
taga
accc
cagt
tcta
cgag
ctgc
tgat
caa
R
689
2390
973
—N
Ottt
gcca
gtag
cctc
ataa
tgag
cgtg
aggt
agta
tctc
attg
tagt
gtta
atttg
cattt
acct
aaca
attc
gtga
tgtg
gagc
atct
tttca
tggg
cttg
ttggc
tgct
gtgt
ctct
tcttt
ggag
aagt
gtct
gttg
aagt
tgttt
gaca
tttttt
aatc
aagt
tgttg
gaca
ggttc
ccttg
ttttta
agtc
agag
tagg
tgat
gtttg
tcttg
aa
ggtta
ta
690
3457
600
AN
KR
DY
ES
acca
agag
aggc
gaac
tcca
ttgca
tgct
gctg
ccta
cgta
ggcg
atgt
ccc
52
691
2853
708
C5o
rf42
YE
Scc
ttattc
cagg
gatg
ctga
cattc
cattt
ctaa
ctag
tttttc
tgga
aagc
ttaga
gaac
atga
actta
attc
tttac
tttttg
atgt
acat
acaa
catta
aaac
gac
atca
gagc
aaaa
ctaa
aagc
caga
atgt
gttta
gagc
tggt
tcttg
ctttg
ttgttg
ctcc
tgag
tcct
atga
atca
gaaa
aatc
atcc
tcttt
aaat
gatg
aata
tgg
catg
cattt
agaa
aacc
agaa
acttt
catc
atca
gtac
tggt
taat
caag
ggat
ca
692
3055
843
—N
Occ
caga
tgga
gaca
ccca
tagc
aggg
cgcc
ggat
ccaa
tctc
ttcct
ctcc
tccc
tcgg
ccttc
ctct
cccc
acaa
accc
cagg
gtct
agaa
agta
gaag
gttc
tccc
ctct
tctc
tcct
tcct
cata
atga
actc
gatg
g
693
3139
045
AR
FGE
NO
catg
gtttg
cgga
ggtta
gattt
actg
gaaa
tgta
ttcat
actg
tgaa
ttgtg
ctct
gatg
gtta
aaag
acaa
gattg
tcaa
gcat
tccg
tatta
acag
tgga
tgta
Fl
gaaa
atttt
ttcag
atgg
acaa
aatg
tata
tggt
acag
atgt
aaag
ttttc
tatg
taaa
aaat
tctg
taca
acttt
ctgt
acaa
tattg
attc
ccat
ctgg
ca
694
3607
791
WD
R93
YE
Sct
gcct
atttg
caat
gcca
ccgg
aagt
caag
ggcc
c
695
2666
907
SLC
4A7
YE
Sac
caag
aaag
aaat
acgt
ggat
gctg
a
696
3887
309
___
NO
ccag
cagg
gggt
tccc
atcc
ttcct
697
2333
654
DPH
2Y
ES
tgga
agac
gagg
gattg
ccat
cgcc
tatg
agga
tgag
g
698
3333
566
LO
C75
1N
Ogt
ctgc
ggac
gctg
tata
ccct
cctc
cac
071
699
3278
417
FRM
D4
YYEE
SSaa
gcca
gaga
agct
cgac
accg
tcaa
gtga
aattg
gagc
cacc
cA
700
3986
088
NR
KN
Otc
gaca
cgga
gcac
ccttc
tagc
ttcttc
gtct
ccag
gact
gacg
ctca
ggct
cctc
tctc
gcct
tagc
ccaa
cttg
ctttc
ccgc
ctcg
caaa
ctcc
gg
701
2558
398
LO
ClO
ON
Otg
ccca
ccag
ccaa
ctag
acag
atgt
ggcc
tgca
ttgtc
cccc
tgga
ggca
aagg
atgt
gtgg
gtat
ttattt
attta
agat
agta
agtta
tggc
cagg
tgtg
g
1283
33tg
gctc
acgc
ctgt
aatc
ccag
cact
ttggg
aggc
cgag
gcgg
gtgg
atca
cctg
aagc
cagg
agttt
gaga
ccag
cctg
gcca
acat
ggta
aacc
ctgc
ctcc
acta
aaaa
atac
aaaa
atta
gctg
ggca
tggt
ggtg
catg
cctg
taat
ccca
gcta
ctcg
ggag
gctg
aggc
agga
gaat
cgct
tgaa
ccca
ggag
gcgg
aggt
tgca
gtga
gcca
agat
cgtg
ccac
tgca
ctcc
agcc
tggg
caac
agag
caag
actc
catc
tcta
aaat
aaat
aaat
aaat
aaat
aaat
aaat
aaa
taag
acaa
taaa
ttata
taag
cagt
gctg
aata
cagc
ttcat
cattt
aaaa
atcc
aaac
agttt
ggaa
atat
atag
acaa
gtat
ataa
atgc
tcttc
ttgca
ccc
cttc
cccc
atcc
taaa
gcct
acca
taca
ggta
ggca
ctgt
taat
tattt
ggtg
gaaa
tgttt
acac
tttttg
ctttg
cattt
acac
atgt
gagt
ggca
ggga
tagt
atct
gaga
aatta
atttt
tttct
ctcc
tgaa
ttttc
tcttt
ttctg
ccct
caca
tttat
ggag
ctca
ttacc
tcca
gagc
agct
tttca
ggtta
gaaa
actg
aattt
atct
tca
ggca
tctc
tggc
ttttg
tgtg
ctga
aagc
ttaag
cattt
aggg
caac
tcca
ggat
ttgct
gagt
ggct
catc
aaag
gatg
agga
ttctc
ttgca
caga
ctga
a
702
2830
633
—N
Ogc
attg
tgta
tgct
gaag
ccat
catc
actc
tcgc
tcac
agga
ggttc
atta
tcca
tttct
gaca
agta
gcct
tctc
cttg
atcc
tcttc
ctta
ggc
703
2546
259
___
NO
tata
ccca
catc
acta
agct
ggga
ggca
cattt
704
3615
631
TJP
lY
ES
ttccg
tgttg
tgga
tacc
ttgta
caat
ggaa
aact
gggc
tcttg
gcttg
ctat
tcga
attg
gtaa
aaat
cata
agga
ggta
gaac
gagg
catc
705
3729
022
GD
PDl
YE
Sga
gctc
ccac
ctta
ccttg
gcaa
actg
gatg
tctc
atttc
aaa
706
3729
073
YPE
L2
NO
tgat
ggcc
tcttg
tcag
gaga
gcag
tggc
acgg
gggc
gtga
ggaa
gagg
gaaa
gggg
aaac
tcta
aggg
tcct
ggcg
cggg
gaag
gggt
ggaa
ggg
tgga
ggta
ggaa
caaa
attg
cgcc
gctc
ctgg
agac
ctga
taac
ttagg
cttg
aaat
aattg
acttg
tcta
aaag
ga
707
3043777
—N
Otg
cact
acat
cgct
ggca
agaa
ttgttc
agaa
aggc
tggc
ctgc
taac
gact
tctg
ggga
gattt
tc708
3080463
—N
Oaa
ggca
ccta
caca
acat
cagg
ccac
ttgg
709
2969705
REV3L
NO
ttgct
aata
aatg
tagg
ccgg
aggg
gatg
gctta
gtgg
tcta
agca
tcag
atttg
aaat
acct
agag
tccc
tgga
actg
cagg
tttga
atgt
cagc
agag
tca
accc
agc
710
2904894
MAPKl
YE
Sat
tctg
gattt
tgga
ctgg
ctcg
gcac
acag
atga
tgaa
atga
cagg
ctac
gtgg
ccac
tagg
tggt
acag
ggct
cc4
711
3421016
—N
Oat
gcct
ccag
tatc
ggat
atgt
gggc
aggc
acaa
712
3724753
NPEPPS
YE
Stg
ggttt
tatc
ggac
ccag
taca
gctc
tgcc
atgc
tgga
aagt
ttatta
ccag
gcat
tcgt
gacc
tttct
ctgc
cccc
tgtg
gatc
gact
tgga
ttaca
gaat
g713
2600045
GLBlL
YE
Stg
ttgtg
gagc
gaaa
tatg
agag
acaa
acta
tttttg
acgg
ggaa
actg
gggt
ccaa
actg
gata
tcttg
gtgg
agaa
catg
ggga
ggct
cagc
tttgg
gtc
taac
a
714
2893581
—N
Otg
cattt
cagc
ttgga
gtgc
gcag
catg
aggc
atttg
tggt
tcag
aaaa
gagg
tcttc
ctttt
tcct
cctc
ctgt
tttct
tttcc
ttcct
tctc
ccca
actc
ccca
aag
gctta
ctgc
ctttc
ttctc
aggc
cacg
tgtg
taga
taac
ctttg
agga
aaag
atgg
tttcc
gtgc
tggg
atat
ttgga
tatta
ccta
aagg
gaca
agat
gagc
ccttt
ctttg
ccttg
ttttc
tttct
ctgg
cctc
atca
gagt
gaat
tata
tctg
actg
tgtg
acag
ttaat
tgta
ccat
ccat
cccg
tgtc
ctaa
gctg
ataa
gccc
715
3712
473
MPR
IPN
Ogt
gtgt
ccca
tcca
agttg
agca
cgcg
ccttc
ccca
gcttg
cagc
agca
cacc
ccaa
gcgc
tgct
tttca
cctg
tacc
tttgt
tttat
tatta
ttatta
ttattg
ctgt
tgttg
tcat
cgtta
actg
tggg
catg
gaa
OO
716
3026
735
—N
Occ
atga
tgtg
aaag
gtcg
ggtg
atat
ctga
gagc
tgaa
ggag
gtta
agcg
gatc
catg
ccag
gctg
acaa
gggg
aaca
ggtg
cctc
tgcc
ctgg
actg
gag
ccta
cgca
acca
tcttc
tgct
ttatc
tagc
tgct
gtct
gtct
cctg
tctc
tgcg
tgag
tgtg
tg
717
3603
221
IDH
3AY
ES
agtc
gtgc
agag
tatc
aagc
tcat
cacc
gagg
gggc
gagc
aagc
gcat
tgct
gagt
ttgcc
tttga
gtat
gccc
ggaa
caac
cacc
ggag
caac
gtca
cggc
ggtg
caca
aagc
caa
718
3820
895
CA
RM
lY
ES
ccag
gtgg
acca
gacc
ggct
ccaa
gtcc
agta
acct
cctg
gatc
tgaa
719
3183
801
RA
D23
YE
Sag
aatg
agaa
tttgg
ctgc
caat
ttB
720
3420
741
CA
ND
lY
ES
agaa
aaaa
gtgt
gaag
accc
gaca
gtgt
tgttt
taa
721
2356
132
TX
NIP
YE
Scc
tgct
atat
ggat
gtca
ttcct
gaag
atca
ccga
ttgga
gagc
ccaa
ccac
tcct
ctgc
taga
tgac
atgg
atgg
ctct
caag
acag
ccct
atct
ttatg
tatg
cccc
tgag
ttcaa
gttc
atgc
cacc
accg
actta
tact
g
722
2625
835
SLM
AP
YE
Sct
ttcaa
agag
tggc
gggg
actg
cact
tttat
tcat
caat
723
2996
070
AV
L9
YE
Sgt
ctgt
cacc
tcga
gatc
ttgtc
cttc
atttt
724
3175
548
___
NO
cctc
aatg
ggaa
ccct
ggtg
caga
a72
538
1003
4N
Ota
gctg
ttcag
gaaa
gctg
tgtg
gctg
gagc
agag
aata
tact
ggaa
atca
gaga
ggtc
aaaa
acaa
tgag
cagg
agat
atcg
gaag
aaat
gttc
ctaa
ccac
ttcac
tcttc
726
2767
097
—N
Oag
tattt
catg
acaa
ctct
aaga
aggt
tcag
cact
cagg
ggca
gaat
ccgc
aaat
atca
ccta
tttcc
aatta
tact
tcac
atgt
gcat
atgt
gcac
acac
gacg
gctg
tc
727
2853
747
—Y
ES
ttattc
tggg
gaat
gcct
gaag
ttaac
atttc
tagc
aattc
ggtg
gcat
gaga
atg
728
3016
374
—N
Ocg
cagt
ttgtc
agca
tcga
tgtg
ggac
tggg
agag
gaag
gata
ttgcc
agtta
tggt
tga
729
3845
911
LM
NB
2N
Ottt
atgg
cctg
ggaa
acaa
tttgc
atttg
tccc
caaa
tacg
ctta
gctg
tgtg
ccgc
ttaga
acga
tgcg
aaac
catc
cctc
tgtg
taag
cccg
tg73
026
4276
3—
NO
gtgg
cagg
agta
ttttc
acca
aaga
aatta
aatg
ctac
aaat
ccta
ccac
caca
ccac
tggt
ttggg
ctca
taga
aagt
ttgtta
agag
tctg
tgac
atga
ggt
ggcc
tcta
atac
agtg
agttc
aata
tttga
acttc
tgta
aaga
aaag
gatta
gattt
attc
agta
tgat
ctta
aaga
ggga
tgct
aggg
gcag
cagg
taaa
aattt
cagg
gaga
caga
ttttc
gttc
agtg
ttggg
aaac
tcct
gagg
taag
aagt
gccc
atta
ggct
gggt
ggcc
actc
acct
taga
agag
agat
acta
caga
gaa
ggct
gaaa
catg
gaag
attg
tagg
ggttg
agtg
gctc
atta
gtttg
gcca
ccaa
atc
731
3642
784
PDIA
2N
Otg
tcac
cccc
gcca
tcac
tgct
ggac
agga
gcca
cccc
cttg
ggta
ccag
aggg
agct
gtgc
attg
tgaa
taaa
ga73
231
7862
1SE
CIS
BY
ES
tgtta
ccga
ggttt
tcaa
acag
tga
P2
733
3681
988
KIA
A04
YE
Scg
atcg
gtgc
agtg
aata
gcct
ccac
agat
acaa
aattg
gcag
caaa
aaga
tcct
ggtc
tcac
ttgcc
accg
gggc
tgcc
agca
aatc
actc
tct
30
VO73
429
7375
0—
NO
ggag
cgtg
ggag
ttcat
atttt
acac
tatg
ttggt
actta
ccat
gcaa
atga
acgg
ccct
gctg
ggaa
aaga
acct
ttatc
caga
gagg
ggat
ccga
ggaa
ata
gtaa
gagt
tggg
ttccg
agaa
gctg
tgct
gcat
cacg
ggag
agct
cggt
gtta
735
3284
084
EPC
lN
Oat
ggtg
cgga
acct
gctg
tttta
tcta
tttat
tgtg
ccgt
gttta
cagt
tttttg
taca
ctgt
acct
tcat
tggt
tcct
gtgc
tgta
gtaa
atgt
gtta
ggta
gctg
tgga
ctcc
ttggt
atttt
gtaa
atgg
tata
acat
aact
tggt
tccc
ctct
gggt
ccttg
agttt
tctg
tgta
tcat
gtga
aaaa
aaat
ggtg
acat
acat
acag
aattt
taca
aaaa
aaaa
aaaa
aaaa
aaaa
ggca
tcag
tttttt
taaa
aatg
ggga
atgt
acta
ttaaa
tggg
gatc
ttcct
ggtc
tact
atca
ttagg
acaa
gtaa
caag
ctaa
aaat
gaag
tctc
ttgaa
tcttt
ccat
cccc
aact
tgcc
caca
aagc
tgtg
ggtg
gtttc
tggt
actta
aagc
acta
atat
tatg
ttgct
gctc
tgcg
aaca
gccc
agt
agtc
ccat
tt
736
3642
301
—N
Occ
acag
gaga
cacg
aggc
cgag
gtgg
cgca
gcac
ccac
ccta
caca
gcag
tggc
catg
tgcc
aatta
cacg
tgct
ggga
ggag
737
3133
236
PLA
TY
ES
tgtg
tctg
aacg
atgg
ccgc
atga
ctttg
gtgg
gcat
catc
agct
gggg
cctg
ggct
gtgg
acag
aagg
atgt
cccg
ggtg
tgta
cacc
aagg
ttacc
aac
tacc
taga
ctgg
attc
gtga
caac
atg
738
2858
415
NO
gctg
gtaa
taat
cgcg
tttttg
gtaa
aaac
acct
catg
gaat
tttttt
ccttt
ctca
taaa
atag
ctgt
ttact
gtaa
aattg
agat
agcc
tctc
aagt
ctgg
aaca
ccttt
caat
tcat
caaa
aagg
gaca
acct
aata
ttcca
aaga
agac
tcaa
tcct
tttaa
caca
cata
cattt
tagg
gcca
gtca
agag
aagt
ggcc
tgttc
tgta
ggaa
aaca
tttct
tggc
acat
tatta
tttta
agtg
attg
cagg
agac
caca
gagg
gaga
gaaa
agag
acaa
caac
aact
tcta
gcat
gcct
gggg
gatg
acttg
ctc
tttca
tattt
gtgg
aacc
ctat
gtca
agag
agaa
aaca
tcta
aaaa
taaa
aacg
cattt
actc
agat
tctc
tagg
gcaa
ggtg
caa
739
3301253
NO
cttg
tggc
attg
tgct
aaag
caaa
aagg
aatg
gaat
ttgca
atga
gaga
gcct
ttcaa
gttc
aggc
tcag
ccttt
agct
gtgt
gaac
ttgaa
ccac
tcac
aattt
ttctg
aatc
tcaa
tttcc
tcat
ctgt
aaag
tttat
ctaa
tatg
ttggt
ggtg
aggt
aatc
atat
tatta
tctc
aaat
agga
gattg
tata
tgaa
aggg
ctttg
gaaa
ggca
aatc
ccat
tattt
gttta
aatc
aattt
tttcc
ccta
aaaa
agat
agct
cata
aaag
aatg
tgaa
ctta
atct
gtat
gtgt
aattt
ctttc
aagt
ccta
tctg
atgt
tgct
tgtg
ac
740
3068212
DOCK4
YES
aatg
cctg
tagt
gtcc
agta
ccga
cgac
ccttt
ggct
gtgc
agttc
ttagc
atcg
ctga
cctg
ctaa
cagg
agag
acaa
741
3664307
—NO
ccag
caat
tctg
aggc
aggg
acca
atca
ga742
2357616
LOC388
NO
atgt
ggat
tgtc
taac
tgga
ggttg
ggag
tcca
gggt
gcag
aagg
agaa
gcttg
gagt
gcag
gattt
ggtg
gtat
gtgt
gtgg
cagt
aggc
acta
tgttc
taa
692
ttgcc
ag
743
2824047
___
NO
atga
ccag
agtc
atag
gtgg
tgat
gt744
3348756
Cllorfl
YES
attta
agcg
agag
cctc
actg
gttc
ccag
gaca
tcaa
cctg
aact
ggat
cctc
cccg
ata
745
3561991
NO
ctta
tttga
tggg
tggc
caag
atac
tggg
tagg
gtgg
tgtg
atttg
ctat
actg
agta
gatc
agta
aaca
ataa
agta
taaa
tcttg
ctttg
tagt
tgga
aggt
cagt
taac
aaat
attta
gtca
atat
gatg
ctgg
agttg
gcta
atgt
atcc
ccca
ggtc
taat
taat
gaaa
ataa
tcttg
cattg
gtct
ttagg
aaaa
tcat
tttta
ttctc
tttc
tgaa
tact
taac
cata
gttg
tttac
atgt
tttac
tgtg
gaag
taat
atct
tttat
gaat
gtttt
agag
aaaa
caat
tatg
gttc
acttg
gacc
attc
agta
atttg
tcc
agcc
tggc
tagt
ttcta
gtta
tctg
a
746
3051857
NO
agag
aaga
atcc
tccg
acgg
cttc
gtta
ccat
cctg
tctg
aagc
ggat
tgca
cgag
ccc
O747
3459696
NO
tgag
cttc
acat
gggg
tcaa
actg
gaaa
gtat
ctct
ttgct
atgc
tccc
attta
atta
aatta
ggaa
atct
gcct
aaaa
cttg
aatc
actc
ctct
tttca
gctg
gtgg
ccag
gcag
gaat
gctc
cctg
aaga
gtac
tcttt
caat
agac
ttcag
aatg
tgtta
cctg
aaaa
gtgc
tcat
gatg
aatc
aaga
catc
actg
tgttt
tcca
tact
tctg
aagt
agag
attg
tcca
ctaa
cagc
ttagg
gaaa
gcag
agta
cccg
ctca
gctg
ctct
ttttc
tggt
gtgg
tcag
acca
gctg
ggtc
ctct
gaaa
catg
aat
ccat
agtg
cttc
ctttg
ttata
ctat
ttctg
actc
agttt
tctc
taaa
cggc
tgta
aaat
ttaac
tgcc
tagc
tgta
ttttc
tttct
gtgg
acttt
atgc
tctc
atat
ttagg
agaa
agaa
taag
agaa
cacc
aact
taca
tctc
tgta
gctc
tgga
tgat
tatta
aggc
cctg
gtct
aatg
ctta
aagg
taag
catc
tgtta
tcta
tattc
aata
aaa
gagg
aaaa
tgaa
cccc
agag
agaa
atca
agtta
cttta
gaat
cgca
ggct
gtcc
catg
accc
tggt
gacc
atca
ctga
atta
748
2614179
—N
Ogt
gaag
ccat
cagt
gtct
acag
cttc
agct
attg
cccg
tgtg
caga
tggt
gctta
atct
ctct
tgtg
ccttc
caga
cctg
catt
749
3047613
—N
Oat
tagg
tgat
ggta
gcgg
acta
gccg
acgg
aggg
cagg
cagg
ggag
gggg
agag
gact
ttaca
gaaa
agga
attc
tcgg
tcga
gctc
tgcc
tgga
gat
gact
ggct
taca
ctta
ctaa
accc
agcg
ggtc
a
750
3677890
CREBB
NO
atca
acag
ccgc
catc
ttgtc
gcgg
accc
gacc
gggg
cttc
gagc
gcga
tcta
ctcg
gcc
P75
131
7792
9D
APK
lY
ES
tgac
aattc
aaga
tagt
ttgca
gcat
ccct
gg75
232
1161
3T
LE
lY
ES
tcct
ggtc
ccag
acag
tcta
agag
gcac
agat
aaac
gcag
aaat
ggac
ctga
atttt
ccaa
tgac
atca
agaa
aagg
aagg
tgga
tgat
aagg
actc
cag
ccac
ta
753
3452723
RAPGE
NO
tgtg
agaa
ggtg
aagc
ccat
aagg
caca
gaag
gagt
tcag
tgag
acca
gtga
cttc
aggc
tcag
tgtg
ggag
acca
gggg
aagg
atgg
gcat
gcag
gF3
gggc
gggg
gact
aggc
tgtg
gggg
aggc
agga
tgca
tgga
aggt
aggg
aagg
ctga
ggag
gtgg
agcc
aaag
tcgg
aggt
ggga
gaag
ggct
gtg
ggtc
cctg
tggg
accc
tcat
gggg
ctca
gggg
ctgg
aatg
aagg
aggt
agat
gaga
gggt
ctaa
agct
ggaa
catc
atga
gttg
agca
tggg
atg
754
2845333
...
NO
gggg
ctaa
catg
gacg
gtca
ggag
ctac
gt755
2852760
...
NO
tgaa
tgcc
tgaa
tctc
aaac
ttgta
cttc
atca
tgga
ctcc
agtc
ttcct
ttcac
atcc
aattt
caat
gatc
atca
atac
agtc
atttt
gctc
ctta
gccc
actc
ggac
tattt
ctct
ccat
acct
gctg
caac
tgcc
ttgac
actg
gtgt
gaca
gcct
tttaa
aaac
tggt
gtttg
gggc
tggt
atca
ctcc
agtg
gattc
756
2750640
...
NO
caat
aaga
taga
cgga
aata
taat
attta
aggt
taga
gaag
aagg
ctga
gtgt
ggtg
gctc
actc
ctgt
aatc
ccag
cact
ttggg
aggc
tgag
gcag
gaca
atca
cttg
agct
gggg
agttt
gaga
ccac
tcag
actc
acttg
agac
ca757
2807722
NO
ttctta
tgca
aaca
acac
catc
tggg
ctct
ggaa
cttc
acaa
aagg
acac
tgtta
agtc
acac
758
3338718
—NO
tcat
ccat
cagg
ctgt
gtca
gacg
cacc
tgag
ccct
c759
3482760
—NO
tgac
ccca
gacc
tagg
gttg
ctgg
tgga
760
3884900
...
NO
tgct
tcga
ttgct
ttcga
gtca
tgag
ttggt
g761
3038653
—NO
aaat
gtgg
gccc
gtta
aaga
gcaa
gact
ctag
aaag
gaat
ctag
ccaa
agag
gaca
gcgt
tggt
ggga
tggg
gata
a762
3340340
RNFl69
YES
agtc
agag
ctgt
agtg
acac
agcc
cagg
aaag
agcg
aaga
gcag
agtc
agag
cagt
tcca
ggca
acaa
agcc
aag
763
3441973
...
NO
agtg
ctgt
ctac
acgc
tgag
tatta
aatc
ctaa
atgt
gtat
cttta
gctg
acaa
cctc
tccc
ttgaa
ctct
gcac
tcac
ataa
ccaa
caat
ctaa
acct
ctcc
ctcc
cata
tgct
accc
catc
tcag
taaa
tggc
aact
tcat
tcttc
tagt
tgct
caag
tcaa
a
764
3629351
SPG21
NO
accc
ggtg
tgttc
ttgta
tagt
cagt
ggca
tcag
cacc
cgtc
agcc
ggcc
ttttc
cttc
aggt
tcgt
cagg
ctca
ccg
765
2611420
—NO
gtgc
tggc
acca
tcga
gcca
caac
tcag
gggc
ctca
tc766
3371610
—NO
acag
ttttg
agac
aggt
ggttc
aggt
g767
3795789
...
NO
gcct
ggct
cttc
ctta
ccac
ccgg
atttt
gtttt
tgct
ttctg
ggta
gagc
tctg
atgt
tgcc
aaac
agcc
tgtg
ctcc
tcgt
gaac
aatc
tctc
caga
ctca
aaat
agcg
agca
gcca
aggg
gctg
ccac
tgct
tgcg
gtcc
acag
768
3118031
NO
agga
aata
acttt
acat
gtgc
tcct
ct769
2976043
IL20RA
NO
tcct
gtgc
aaac
aagt
gagt
cacc
ccttt
gatc
ccag
ccat
aaag
tacc
tggg
atga
aaga
agttt
tttcc
agttt
gtca
gtgt
ctgt
gaga
atta
ctta
tttct
tttct
ctat
tctc
atag
cacg
tgtg
tgat
tggt
tcat
gcat
gtag
gtct
ctta
acaa
tgat
ggtg
ggcc
tctg
gagt
ccag
gggc
tggc
cggt
tgttc
tatg
caga
ga
770
3330211
NO
atta
ctga
aagt
ggcg
tgca
ctga
ca771
2397801
...
NO
atgc
caga
gact
gtga
tgttg
gaaa
772
2537317
TMEMl
YES
aata
catc
aatg
aggc
ggct
gcga
t
773
2875
953
NO
caat
agtg
caat
atca
cgac
cagg
atat
taac
attg
gtac
agtc
aaaa
tacg
gggc
atta
ccat
cacc
acaa
ggat
atct
cata
ttgcc
ctta
taaa
gcca
ccc
cact
tccc
tcct
acct
cacc
cctc
ctga
actc
ctgg
taac
caca
aatta
tcca
tttgt
ataa
gttg
gtaa
ttttg
agaa
tctta
cata
aatg
gaat
aata
cagt
atgt
aact
tttgg
gattg
acttt
ttttc
actta
gcat
aattc
tctg
aaga
ttcac
ctag
gttg
ttaca
tgtg
tcaa
tagt
ttgttc
ctttt
taaa
tttta
acttt
tattt
tttcg
agag
taa
gtct
tgca
ctcc
aggc
774
2882
417
—N
Oag
gaaa
gcat
gaga
ggtc
agca
gca
775
3095
010
—N
Oca
tgca
gtac
attg
gata
tgat
ataa
tttat
ggttt
cttg
tttgc
agct
gtttg
tatc
ttttta
atcc
caaa
ccag
acaa
aatta
taaa
cattt
tata
taca
atgt
tatc
ttgg
aaaa
agtta
gatg
taaa
taat
tcat
ctta
atct
atat
ttgag
aaat
ctga
gggg
tatta
ggaa
actc
atga
gtga
atga
acat
atag
attg
gatc
aaag
gagg
agag
tatg
agag
tagg
gaga
ccag
gtaa
aaag
gtat
cata
gtca
tc
116
3450
794
KIF
21A
YE
Sgg
atgc
tcct
ttaaa
cagc
ccag
gatc
aga
111
3627
349
—N
Otc
ttctg
gctc
ttccg
tgac
gcat
taca
acag
cca
778
2360
857
FDPS
NO
ggga
gcag
ctga
tcag
gtttc
tgac
779
3821
204
PRK
CS
NO
gtgc
ggtg
gata
ctga
ccttt
gctc
cggc
ctH
780
3665
595
FAM
65Y
ES
tgga
ggat
gagg
acgt
gcag
actc
gagt
ggct
ggA
781
2852
747
AM
AC
NO
cagg
ccca
cggc
tcaa
gtga
atttg
aata
ctgc
attta
cagt
gtag
agta
acac
ataa
cattg
tatg
catg
gaaa
catg
gagg
aaca
gtat
taca
gtgt
ccta
cR
cact
ctaa
tcaa
gaaa
agaa
ttaca
gact
ctga
ttcta
cagt
gatg
attg
aattc
taaa
aatg
gtta
tcat
tagg
gcttt
tgat
ttata
aaac
tttgg
gtac
ttata
ctaa
atta
tggt
agtta
ttctg
ccttc
cagt
ttgct
tgat
atat
ttgttg
atat
taag
attc
ttgac
ttata
ttttg
aatg
ggttc
tagt
gaaa
aagg
aatg
atat
attc
ttga
agac
atcg
atat
acat
ttattt
acac
tcttg
attc
taca
atgt
agaa
aatg
agga
aatg
ccac
aaat
tgta
tggt
gata
aaag
tcac
gtga
aaca
gagt
gattg
gttg
catc
cagg
ccttt
tgtc
ttggt
gttc
atga
tctc
cctc
taag
caca
ttcca
aact
ttagc
aaca
gtta
tcac
acttt
gtaa
tttgc
aaag
aaaa
gtttc
acct
gtat
tgaa
tcag
aatg
ccttc
aact
gaaa
aaaa
cata
tcca
aaat
aatg
agga
aatg
tgttg
gctc
acta
cgta
gagt
ccag
aggg
acag
tcag
tttta
gggt
tgcc
tgta
tcca
gtaa
ctcg
gggc
ctgt
ttc
782
3004
680
—Y
ES
tgga
tgag
tgta
aggg
acac
caag
gagg
t78
327
2223
3—
NO
tgaa
attc
ataa
aggc
atgt
gtgc
ca78
428
0596
0—
NO
ccca
gggc
atta
gagt
gacc
tcac
aaga
gtat
ctca
gaat
attta
aaaa
tttga
acac
acaa
acac
aatg
aaac
tcaa
catc
tgtc
agac
atca
cgtg
aact
act
agtg
aggt
agttc
attc
tttca
acat
aaga
ttgca
ctat
aatc
cttc
tgat
aattt
cata
tttttg
tgag
actg
ggttt
tcca
ggcc
atct
gata
aaaa
atca
agta
ctat
atgg
acac
caac
atgg
agca
gaaa
atga
gggc
agca
gtgt
ccaa
tctc
attc
cagg
tg
785
2952
396
—N
Oag
gaat
ggaa
ttaat
aggt
tgtg
gcaa
tatc
aa78
626
1292
7—
NO
caac
ccaa
tcta
cagt
tgtg
ggca
gtgg
ct
787
2640231
NO
tgcc
cggc
ctgc
aata
agta
tttct
aaat
aaga
tcac
attc
tcag
ctac
aaga
attta
ggat
ttcaa
catc
ttttg
agga
ggac
tcaa
tttaa
ccca
taag
aata
ctat
gtgg
gcca
caca
tggt
gcttc
acat
ctgt
tatc
ccag
tattt
cggg
aagt
tgaa
gcag
gagg
atca
ttgct
cagg
gcct
atgg
ggac
accg
ggtg
cata
aga
ccac
a
788
3371122
SYT
13
NO
agca
ggaa
atgg
aata
tgcg
ggtc
acac
tg789
2972930
___
NO
gctg
tgac
acaa
cagt
gtgg
cgat
gtcc
caca
gacc
tgaa
a790
3529011
___
NO
agcg
gctg
cagc
acgt
agttg
gaga
aggt
gagg
gcga
tgac
agcc
tggt
tggt
gggg
taga
tcac
cagc
acag
caat
cc791
3670860
LOClOO
NO
gagc
cagc
cacc
acgt
taga
tttta
gagt
ctcc
tgga
gcac
gtga
aaac
aact
gaaa
aagg
gtaa
ccac
acat
cattt
cact
tgtg
atgt
agct
tgcc
tgtc
tc129617
caca
ccat
gccc
ctga
agaa
tagt
atat
cacc
taca
gccc
cttc
ccca
gtca
ggaa
tgga
agtg
catg
acac
atgt
gctc
ctct
accc
cttc
catg
ctca
tgg
caga
catc
atta
atca
atta
tagc
actc
tttct
gtag
agcc
agag
acag
catc
acac
tcttt
cccc
tcct
gcat
tcca
ggcc
acca
ctac
caac
tgaa
atcg
tgt
tagt
acca
taat
gaat
gcta
tgta
ccat
tctc
tacc
ctaa
gcga
ttgca
aact
gtaa
atga
attg
ttgct
gattt
ctga
gccc
ctcc
taga
tttgg
ggta
aattc
attt
cttg
ttttc
agaa
caca
gggg
atag
ggac
accc
tgtg
cagt
tcttt
ctcc
agga
caag
gaga
ctcc
ccac
tggg
ggat
gggg
cggg
gtttc
tgcc
ttaat
ttggg
cgct
cata
gtttc
aagg
agga
gctc
tttct
ggct
ttggc
cagc
taga
agga
aagg
tgcc
ctgt
ttgtta
acttt
aaaa
tcac
tacg
ggtg
tagt
gtat
ggag
tgg
gctg
tgcc
atgc
tgga
gttc
agag
caaa
ggttc
ttcag
gtttt
cttg
cgaa
ggac
ctta
acttg
tcaa
tggc
agag
ccac
accc
ccgg
gaca
tact
tggc
agag
gaat
gcct
cttc
aggc
acat
aaac
atttt
tgca
tact
ccat
gtta
gtca
ataa
accg
tttca
taag
ggttc
tttga
ggac
atct
gact
tcaa
aggg
aaaa
aat
tcat
aattc
agac
aggc
tctc
gggg
cttc
acca
taca
acgc
ctttc
ttgta
tttgg
ttagt
tttat
gggc
ctgg
agtg
ttgac
catg
tatta
792
3844568
SHC2
NO
ttcct
ccgc
atga
gcct
ctgg
catg
gtcc
ttcct
ccag
ctgg
cccc
gggc
tggg
caga
gcct
cctc
ctgc
cggg
gccc
ctgc
ccac
cccc
tcct
ttgcc
tgg
agtg
aggg
tgttc
atac
caaa
gacg
gaac
cattt
cgcc
tttaa
agaa
aata
tatc
caga
agca
gccg
ctgc
ctcg
gagc
cctg
gccc
ttggg
tccc
cctc
tcgc
ctgg
ctgg
ttcgg
tcta
acgc
cccg
gaga
gtca
gggc
tccc
agga
ccct
gggg
agga
ggcg
gattc
cggg
cctg
gctg
ggct
tcct
cttc
ccac
ctg
agga
ctgg
gtgc
acag
ttgtc
tttga
gggg
ggac
gctta
aggt
gcttt
gggg
ttctc
aggc
cagg
atac
atgc
tggc
gctg
aagt
gcag
gcag
ccttg
agg
tcac
ctgg
gatc
tcgg
ggta
gcca
ggct
gctc
caag
acag
tgga
tatc
gagg
cagt
cgtg
aggc
ccct
actc
cacc
tggg
agca
gggg
aagg
acgt
ggt
tgcc
tggc
ctgg
gctg
cgcc
cagc
agct
tccc
ccca
gtcc
tgcc
ctcc
cagc
tgtc
gacc
caga
tggg
atgt
tc
793
2810517
GPBPl
NO
cagc
ttaca
ctgt
tttgc
ttgca
gagt
cata
tcttt
ttcgt
acaa
tgga
aatc
ctca
agtc
cact
ttgtg
cggt
ctcc
ctct
ccttc
cccc
aaaa
aaca
acaa
caac
aaaa
caaa
aacc
aaaa
agga
aaat
gtag
catg
ttggc
taaa
actg
gagc
aaag
tgca
ctaa
aaca
atttc
ctga
actc
acct
gttg
tact
attc
acct
tttaa
acc
ataa
attg
ctct
ttagc
cattt
gtag
tgca
gtaa
atgt
taca
ggaa
aaga
cttg
gcac
atttt
cttc
caaa
tttta
agag
gtga
ttttc
aaaa
gcttt
attg
gggt
atg
ttgtc
agac
cagg
gtttt
caga
gttg
atgg
aaaa
gagt
cttg
tgag
aaaa
ctta
ttttg
ataa
atta
ttaca
cacg
caga
aaaa
ctga
tcac
actg
actg
gat
ctgt
ccac
gaca
tgga
aaa
794
2377968
—NO
gtaa
gaca
cgttc
ccag
aggc
agga
c795
3020324
—NO
tatg
taat
gagc
atgt
gcat
agtg
tgtg
tatg
tgttt
gtat
gtgt
ttgtg
gggg
gtaa
tggt
ctc
796
3133424
—NO
cctg
agct
ggct
tggt
ggca
tccc
aggc
tcgt
gcca
ggac
ccag
cagg
gctg
gtttc
catg
ggcc
tgaa
aagg
ccca
catc
agaa
ttcca
gggc
atag
gcg
agga
ttact
797
3225212
RPL35
YES
cttc
ctct
ttccc
tcgg
agcg
ggcg
798
3731955
___
NO
tgga
accc
tcag
gtgt
tcgc
aggt
gt799
4012443
___
NO
ggag
tagg
ggtc
aact
ggat
tttgc
cagc
800
3145052
KIAA14
YES
ttgat
gacc
ccag
aagg
agttg
gcct
tacc
actg
cctta
cgtg
ttctc
tgta
atgt
tgca
tgcc
cacc
acct
c29
801
3542308
___
NO
ggga
cggt
gcac
atgt
gcat
tcac
atgt
gtttt
atgg
atgc
taca
gagg
aata
tcag
aaga
gagg
aaga
gggc
ccca
aaca
gccc
cttta
gctc
tgaa
a802
2385170
___
NO
atgg
cagg
cccc
aatg
tggc
gtgc
attg
ag803
2617336
CTDSP
NO
ggta
gaac
tgcc
catg
ccac
aaat
attta
tttgg
aaaa
gtag
tcat
taaa
tgaa
ccca
ctgc
ctta
aatg
tcttg
aatg
ttgca
gtca
agtg
tctg
tcat
gtgt
tga
Lta
tcca
caca
gaat
tagg
ccct
aatg
agag
cctta
gacc
ctca
acca
tgcc
ccct
tcgt
tggc
atca
cagg
gcct
tattt
ggaa
gagc
gggc
aaag
agga
tgg
aaat
cata
aaat
atttc
atgg
gaat
cgaa
ccta
ggga
tagt
gctc
cact
tctg
acga
tgga
gtga
agac
acttg
gcag
acttg
agcc
agac
acttc
acct
agt
agttc
ctga
aact
gtga
gcac
cact
gcac
taag
ccag
tgcg
gagc
tgtta
ggga
cggg
ccca
gctc
ctgc
acca
cgga
caca
gaat
gtct
ggag
agg
gcca
gcag
gccc
tctg
aggg
ttctg
gaat
ctgt
gcac
ctta
tttga
ccac
actc
caaa
attc
tgttt
ttattt
taac
ccttg
aatc
tgct
ttatg
taca
taat
caaa
atat
ctat
atct
atat
ctat
atct
atat
ctat
atat
tttta
atca
tcta
catg
taaa
tgaa
gcaa
taga
attc
taac
ataa
ggcc
aaga
aatg
agac
gaat
gtttg
gggt
ttat
gtttt
ttaag
gtaa
atac
gggt
attg
ttttta
atta
ttacc
atgt
atta
aattg
tggg
ctttg
aaac
ctaa
tgaa
acct
gtta
gcca
cttc
tctg
tgcc
atat
acttc
cca
tgtta
ccaa
aata
cccc
caac
tcttt
agcc
aaaa
gaga
accc
tgac
ctcc
tgag
tttcc
atgc
tcct
ttctg
tacc
aggt
ttaaa
tgta
gtct
tctg
gaga
agt
atttt
tgac
attg
agct
ctgg
gaca
ggac
acct
tggg
tttgt
ggac
tgca
gccc
acta
tgat
gtta
ttact
tctc
tggc
cagg
cctc
cagt
ggaa
gtgc
acag
gca
ctcc
caat
gttg
ttaat
gctc
tgtc
ttcca
tttgt
tctg
gaat
ccta
cgtg
ttggt
ctgt
ggttc
catg
cat
804
2982
034
RSP
H3
YE
Sca
tggg
gaag
acac
acat
cagt
ctcc
agaa
cccg
agga
tgag
cctg
gtgg
tcct
ggag
caat
gaca
gagt
cact
ggag
gcct
ctga
attc
ctgg
agca
gag
catg
tcac
agac
acgg
gagc
tgct
tttag
atgg
aggc
tacc
taca
aaga
acaa
ca
805
2335196
—NO
gtac
agga
atcc
agct
gtgt
ggag
cag
806
3561721
—NO
tcca
gcta
taca
cccg
taac
ccaa
cagc
ggct
caac
cctg
ggcg
ccaa
ctac
cgct
gcgt
cctc
cgcc
gcttt
tcg
807
2819696
—NO
gata
atag
tagt
atgg
atgg
cagt
ggca
t808
3408052
—NO
catg
atgg
ttgtc
ttcac
tctc
ctac
tccc
aatg
attc
ctag
aaaa
gaaa
cttta
aaac
atgt
ttcta
cttc
ccat
tctg
cagt
ttagt
aaaa
tgat
ttggt
taac
tctt
aaaa
ctttt
tgtg
gaat
aatg
ttctg
tgaa
ttcaa
gtat
acaa
ggca
ggag
gatc
acttc
aggc
cagg
agttc
cata
ccag
cttg
ggca
acat
agca
agac
cccg
tctc
taca
aaaa
acaa
aaag
agaa
aaaa
aata
tata
atac
ttgag
gtag
ttgaa
ttgcc
gtga
tttat
tgtc
acca
atat
tttta
aatc
ttata
tttat
tata
ataa
tac
acat
taca
ttttta
taac
tggg
ttata
acat
tgat
tattc
tcac
ttattc
taga
tgtg
actta
tttaa
ataa
gttta
gaat
tata
tatta
aata
caat
tata
ttaaa
aaca
tttt
tgct
tttat
gtaa
aaca
ctat
ttgcc
atga
cattt
tttgt
cacg
taat
tttgt
ttttc
tttgt
gtttg
aata
tgat
caaa
tact
tccc
aaaa
ttaaa
ctta
caaa
tgta
tat
caac
atat
tata
attc
aaaa
gctg
ttttta
tttgg
gaaa
taaa
taac
aatg
tgta
aaga
ccgg
gaat
ttgttc
tcct
acct
aagc
ctga
tgat
taaa
agac
agtg
taac
tatg
cagg
tgtta
aaca
ttttc
tata
ttctta
cagc
catc
tctg
ttcat
atat
ccat
tttta
actc
cacg
cccc
tgta
agcc
tttga
gtttg
tgat
ccct
ggtg
aaga
cagt
gacc
caca
ttgat
tttttt
tttttt
taag
gtttc
tctc
tctc
tttttt
gcttt
ttctc
tcga
atat
agga
cc
809
3628
699
HE
RC
lY
ES
attc
agcc
tcat
tggg
tatc
tgcc
ctgg
cttg
gcca
gaag
aggg
tccg
gcta
cagc
ctgg
tcag
gaga
gtct
ccag
aattg
ttgttg
gtgg
gacg
gatg
gat
ggat
ctct
ggga
ctga
ttgaa
gttg
ttgat
gtgt
ccac
catg
cacc
gtcg
agaa
ttgga
gcat
tgct
a
810
3350
729
—N
Otg
caga
cgtg
caca
ttccc
agtc
cgtg
cccc
agcc
gtcc
tggg
tccc
ctta
caag
cccg
ggtc
tgcc
acttc
accc
ttgttt
ttccc
catc
ctcc
gggt
ccac
atttc
atat
gccc
ctgc
cctc
attc
cctta
tgat
tccc
tagc
tgct
ggttc
caga
gtct
tctg
ggct
taca
gtct
cccg
gc
811
3380
411
—N
Oaa
agca
agga
aggt
attg
aaga
caca
aagt
caca
tgtg
atgg
gctc
agga
gcca
gtgt
gagg
aagg
gctg
ctca
taca
gcac
agtg
gagg
at81
223
7758
8—
NO
ccca
taga
gctc
ctaa
ccag
atgc
aaca
gtga
ctgc
gtca
ccat
tccc
cact
tttcc
caac
tacg
cagt
gaca
aacc
tacc
agct
ttacc
813
3219
836
—N
Otg
ggag
ccgt
ggat
gaaa
ttgtg
ctc
814
2737
650
BA
NK
lY
ES
catc
atga
aagc
agga
agac
atac
gggc
agag
tgca
gatg
815
2843
633
—N
Oag
gctc
gcct
tctc
aata
gcgt
gtat
ttgga
tgag
atga
gtttc
ttctg
taaa
gaga
aaaa
gatg
ttaaa
acct
cattg
tcta
aggc
ccct
catc
tgag
aagt
ctt
gtct
gacc
ctct
agcc
cagc
agga
ccaa
ggtg
tggt
gcct
ggtc
ccag
cctg
tcct
ctgc
tccc
ctgg
gctg
cagt
tggc
ccag
ttgcc
tgcc
tcca
ttaga
tac
ggag
ttgct
ctgg
ggct
gaga
tgcc
catc
tcaa
gtgc
attc
tgtc
ggaa
gggt
tctc
tgct
ggaa
ggcc
tttgg
tttta
gttg
ggat
tcca
tgag
cttta
gcc
caaa
ggtg
gcct
ccca
tact
agca
tattt
816
3478
420
—N
Occ
actg
tcca
caga
ccat
ctgg
ggcc
ctgg
aggg
agcg
ggct
gggc
cagg
gagg
aaca
actg
ggaa
gggg
cagg
gaag
aatc
actg
aaca
tcag
tgt
gaag
gtgg
cctg
a
817
2749
415
TM
EM
lN
Oag
ctttt
caag
acac
ttcct
gcat
ctct
gacc
tgttg
cacc
tctg
ttatc
ttggc
acct
ctgt
tatc
ttagc
44
818
2921
363
...N
Otg
acgc
gggt
ctta
cctg
ggct
aaat
tcaa
ggtg
ctgg
cagg
aatg
cattc
cttta
tgga
ggttt
tggg
gtag
gggg
caaa
tcca
tttcc
ttgttt
attta
gttg
ttgg
caga
attc
agttc
tttgc
agtta
aagg
atca
aagt
ctcc
tttcc
ttgttg
tctg
ttagc
tgag
ggtc
atttc
tagc
ttctg
aagg
ctgc
cttc
attc
cttg
gctc
atga
tccc
ctttt
cttc
agag
cctg
caat
agtg
ggtg
aagt
ccct
ctcc
cattt
tgaa
tcat
tcct
gtta
cttc
tatc
atcc
ctct
ctct
gacc
tacc
cttc
tgtc
ttcca
tttc
cact
gtta
aatg
tcta
tttga
ttact
ctgg
gccc
actt
819
2703
246
KPN
A4
YE
Jta
gtttt
gaac
tgtg
atgc
tcttt
caca
c82
032
2661
5W
DR
34N
Ogc
tcgc
agcc
tttct
tcgg
agag
tgg
821
3373
866
SLC
43A
YE
Jat
tact
ttaag
gatc
tgtg
tgga
ccag
atgc
tggg
ccga
ttggc
aatg
ccac
aggg
caJ
822
2852
746
AM
AC
NO
ctgg
tagc
aagt
tctg
gatc
ttata
ccca
acac
acag
caa
R82
330
2310
0N
Ogg
ctcc
tatc
ccta
cattt
caaa
tagc
atttc
tctc
acaa
ttgat
tatg
tatta
aggc
ttgtc
aacg
aaaa
ggaa
acttt
tttct
ctca
gaat
ttcta
gaac
tttttt
cttt
aaaa
agttt
tttat
tctg
tagc
caga
aaaa
tttaa
agttt
gtgc
tggg
ttgct
ttggt
tata
gaac
taat
gaag
cttg
agta
atag
tttca
gaca
ttctc
aggg
tttttc
ttctg
cagc
caga
tttta
cact
tcta
ttatc
tcag
ctgt
ctta
taaa
tgta
ttccc
tcca
tcac
acaa
tata
atta
gaat
gtat
tgat
gaac
agat
gtaa
cttc
atca
ctag
agag
acgt
taaa
taaa
ttatg
tgttg
gtat
catg
acttt
tatc
aaat
catg
aagg
atgc
catc
acttt
tatg
tgat
agtc
cattg
gtat
agta
ctaa
agttt
gtaa
gta
aaga
atgt
caga
taga
taac
aaga
taac
ttcat
tttcg
agat
ctca
taac
ctgg
gcc
824
3431
525
AT
P2A
NO
tggt
acca
tcca
gtta
agcc
cgtg
acaa
aaat
ggaa
atttc
taaa
tagc
ccaa
gtct
ccag
ggca
attg
ggaa
cagc
tttga
accc
acta
agat
gggg
tctg
2ct
atgc
caaa
acat
agat
acag
aattc
acag
tttgt
cctg
catta
ggac
attc
tcttc
aact
ttgcc
actg
taga
aagt
ggag
gtag
gtca
gcgg
atgg
tgcc
aca
ttaac
agcc
gcct
tact
gaag
tgta
gtcc
a
825
3695
469
NO
tttat
cctg
atgg
aaag
agtc
cagt
agag
agga
agag
attg
atga
tgta
ggag
agag
ga82
637
6153
5H
OX
Bl
NO
ctgc
cacc
gcca
ctaa
cgga
gatg
gccc
tggt
agag
acct
ttggg
ggtc
tgga
acct
ctgg
actc
ccca
tgct
ctaa
ctcc
caca
ctct
gcta
tcag
aaac
t3
taaa
cttg
agga
ttttc
tctg
tttttc
actc
gcaa
taaa
ttcag
a82
738
2388
4N
WD
lY
ES
actg
caga
gccg
gtat
tcca
tatc
ctgg
gaga
tgcc
tctg
atcc
ttgga
tgtg
catg
gccg
tgct
ggcc
tccc
aggc
caca
ctgc
tgac
agtg
tcca
ggga
tgg
tgtg
gtca
gtct
gtgg
agct
cagc
tacg
ggaa
aact
tcag
ggga
agca
acat
atgt
ccag
catc
aaag
aaga
aaca
ccta
cctg
tgcc
gtct
cagt
cca
gaag
c
828
2475
018
F0SL
2N
Oct
cagc
tctc
acag
gggt
aatc
atct
caag
tggt
atttg
tagc
caag
tggg
agct
atttt
ctttt
ttgtg
cata
taga
tattt
ctta
aatg
aagc
tgct
ttcttg
tcttt
tat
ttcta
aaag
cccc
ctta
tacc
ccac
tttgt
gcag
caaa
gatc
cccg
tgca
ggtc
acag
cctg
atttg
tggc
cagg
ctgg
acaa
attc
ctga
ggca
caac
ttgg
cttc
agttc
agat
ttcaa
gctg
tgttg
gtgt
tggg
acca
gcag
aagg
caaa
cgtc
cagc
caac
acac
agga
ctgt
aaga
ggac
tctg
agct
acgt
gccc
tgtg
aaga
cccc
cagg
ctttg
tcat
agga
ggtc
gttc
agct
tccc
caaa
gtca
gagg
tgat
ttgat
ttggg
gaag
actg
aata
ttcac
acct
aagt
cgtg
agc
atat
cctg
agttt
tact
tcct
tatg
gcttg
ccct
ccaa
gttc
tctc
tctc
atac
acac
acac
accc
ttgct
ccag
aatc
acca
gaca
cctc
catg
gctc
cagc
tatg
ggaa
cagc
tgca
ttggg
gctg
ccttt
ctgt
ttggc
ttagg
aact
tctg
tgct
tcttg
tggc
tcca
ctcg
cgag
gcag
ctcg
gagg
tgtg
gact
ccga
ttggg
ctgc
aggc
agct
ctgg
gacg
gcac
aggg
cggg
cgct
ctga
tcag
ctcg
tgta
aaac
acac
cgtc
ttcttg
gcct
cctg
gcca
gtct
ttctg
cgaa
tagt
cctc
tcc
ctgg
ccag
ttgaa
tggg
ggaa
gctg
ctgg
caca
ggaa
ggag
aggc
gatc
ccgg
ctga
ggct
tagg
aaat
tgct
ggag
ccgg
ctcc
aagc
agat
aatt
cact
gggg
aggt
tttca
gagt
caaa
catc
attc
tgcc
tgtg
ttggg
ggcc
aggt
gtgt
caca
caag
catc
tcaa
agtc
aaaa
gcca
tctg
gggc
tgct
gcttc
tgttt
ctca
ggct
ctgg
ggaa
agga
atct
ccct
ctcc
tctc
acttg
attc
caag
tgtg
gttg
aattg
tctg
gagc
actg
ggac
tttttt
tc
829
2987324
—NO
gttg
gtgc
ccaa
cata
tgcc
tgcc
a830
3379421
SUV420
YES
cgcc
tttcc
ttcga
ggag
ctca
aggc
attt
Hl
831
3109159
___
NO
cattc
agca
ctct
cacg
ggat
ggac
agag
cagt
gttc
atgc
ag832
2480318
___
NO
ctct
gacg
cact
tgag
gggt
tggc
tc833
2841516
C5orf41
NO
gcag
ggat
ctag
acgt
cagg
ccaa
ctga
gagc
tgac
aggg
aaga
gaga
gcta
gcta
gtat
agac
atag
acat
gaga
atca
gctc
agct
gctg
tagg
aga
aagt
cact
gaca
gaca
gtag
catc
caag
gcat
ccag
cgtta
tatg
accc
acga
aaac
cccc
aaaa
ttaaa
tata
caca
ataa
aagg
tcaa
atgt
ttttc
ttga
acaa
aagt
atat
ttata
ttgaa
aaca
atca
ggtg
aaga
gtag
accc
taat
cgcc
agta
ttt
834
3738805
FLJ3576
YES
ggag
cttc
ttacc
tgct
caca
ctgg
7
835
2771
466
NO
aaaa
gaga
gcaa
tcct
gcaa
gcca
atcc
cgac
tctg
aaaa
tcat
cact
a
836
3322962
—NO
tggt
ggtg
atgc
ttata
gtgg
agag
cctc
tacc
ttgct
gcag
ctc
837
3906050
—NO
taga
tctc
caaa
atga
cctc
tcac
ca838
3119642
—NO
ccag
agaa
agcc
gctg
acag
ggag
gaga
ggcc
ccag
cccc
tatta
caaa
ccca
gcca
tcaa
839
3125570
—NO
cccc
aaaa
ccat
ctgc
aaca
gcca
g840
3933835
—NO
ctga
tgga
ccttc
tggc
cgct
tcta
tgttt
cttc
tctg
attg
841
2802644
—NO
ggga
tcac
ttgcc
acct
ggga
tgtt
842
2854122
—NO
aagt
tctg
tgtta
gata
gtca
agta
gggt
gact
gtag
ttagc
atcc
atgt
atta
tata
tttca
aggt
agct
agaa
gact
tgaa
atgt
tccc
aaca
caga
aatg
ata
atac
tcat
gaga
taat
ggat
a
843
3147
610
AZ
INl
YE
Sgc
aaga
gttg
ggtg
tacc
tcca
gaaa
acat
tattt
acat
aagt
ccttg
caag
caag
tgtc
tcag
ataa
agta
tgca
gcaa
aagt
tgga
gtga
atat
cctg
acat
gtga
caat
gaaa
ttgaa
ttgaa
gaaa
attg
cacg
taat
cacc
c844
3398488
SNX19
YES
gggt
gaac
aaat
gccg
gctg
agct
ggg
845
3629947
NO
gacc
agct
ctga
tggg
caac
ccaa
actc
aaga
agtg
taac
ttttc
tgca
gaac
ccttc
agaa
ctta
aaag
atca
ctga
tgta
aggt
cact
agct
ggca
cctc
ttct
tata
atta
caat
taaa
tggg
caat
aact
tacc
ctta
cata
atga
caac
tact
atta
aaga
gaaa
tcat
ggca
tttga
gaac
aatg
acat
aaga
ttaaa
tcaa
ggca
ccat
ggct
aact
attc
agga
3677823
CREBB
ttttg
tgtg
cgac
aact
gcttg
aaga
aaac
tgg
846
YES
P847
3101370
NO
tagg
tgcc
ctga
aagt
tattg
ttgct
tttttt
gtttt
tttttt
ttcag
tttgt
gcgt
gtca
cttg
aatc
agaa
acca
aaca
catg
taaa
aaaa
tatc
atcc
tcaa
tgcc
cccc
atta
actc
tctc
tcca
gaag
gtga
caat
gtta
gtga
actc
aaga
ctct
cact
gatg
atgg
tattt
taca
atga
aaac
acaa
ggaa
accc
tttga
ggtc
caat
tttc
acat
cata
ttctc
caaa
tagt
aaaa
tagc
agct
ctac
atgt
tgat
gaaa
agaa
atttc
aattt
cttc
ctat
ttgttt
ttact
cata
tcaa
catta
atat
gtat
ctgg
att
tatta
atttc
caaa
aaga
aaat
tttag
ttacc
aaat
atttc
agaa
attta
ataa
agca
ttaca
tata
tgta
atta
gcac
ttatc
tacc
aaaa
aaac
atat
gtgt
atgt
att
tattt
atct
tacc
ttcac
tgaa
gttc
tttttt
ctgg
ctgg
acat
gaga
aaca
ggat
taag
tgat
caat
gctg
gcttt
atttc
ttcat
aagc
agta
atttg
ggtc
tttttc
attc
aaca
caac
gcag
cattt
tcat
aata
aattc
acaa
aaga
caat
acaa
agaa
acac
ctac
tgaa
taga
actc
tgtc
gagc
aattc
atgt
tttaa
agttg
gact
ctat
acca
aact
ggca
ttatg
gtat
tata
ggca
tttga
tttttg
ttttc
ttattt
tcag
tttgt
cagt
ttcttt
acta
ccat
tattt
ttttc
tagc
cgga
gata
acgt
ataa
tca
848
2977512
NO
gcca
agat
tagc
atat
gtga
gtgg
caca
849
2329726
NO
caag
cacc
tcgg
tata
gcat
tatta
ctga
aacc
actta
attc
ccag
ctttt
tgag
tttttt
aaaa
aaac
ccac
tgca
ctaa
gattc
acaa
ttcat
tgct
acat
acaa
atta
aagc
tagt
aaga
acac
acta
acgt
caca
agttt
ctca
ttcta
aagt
850
3626942
NO
tgca
tgct
caca
gtaa
cacg
tata
ctaa
aat
851
3823
576
RA
B8A
NO
gccc
acgg
ccac
caga
atgc
aattg
agaa
atcg
tttat
tttag
taac
tgtc
tgat
ctttt
tcaa
ctttg
gaga
tgga
ataa
gtta
aaaa
tttgc
tattt
ttcct
gtaa
cat
ctgc
tgaa
cggg
ccca
ccca
cacg
ttgta
tattc
agag
agag
agag
ggag
tcaa
ggtg
tgac
cgtc
gacc
acag
ccag
tgtc
aggc
ctct
gcct
ctgg
gcct
ttgct
ttgtg
gcct
cact
gcaa
caca
aagc
tcca
ccag
gagg
ctgg
ttcac
gtcc
ccta
ccac
ggaa
gcga
ggtc
ccag
aagg
ccag
cggt
ggttc
cagg
agca
acag
ctcc
caaa
ccct
gagc
aagg
caac
cgat
cgcc
agga
ccag
gaag
catc
accc
agga
gattt
ggcg
ccca
cttc
852
3837475
GLTSC
YE<
cgga
gcttc
ctac
aatc
catc
ctttg
aaga
ccac
R2
853
2802804
—NO
tcga
gtac
tatg
tcttt
tgtg
gcac
tgaa
acct
tata
acac
tatg
atag
ctct
gcttc
caca
gcag
tgaa
aacc
gaaa
gagc
cggg
ctgg
gcac
ctgc
ctct
ttga
ggtg
gaac
tcgg
gtca
g
854
3410550
NO
atat
ctcg
tcta
accc
ctac
cattg
tgcc
tggc
ccttt
gaaa
tgtg
tttct
tgct
gacg
tgct
tgttt
ctcc
t855
2808525
...
NO
aaac
agag
tagc
agttc
agcc
ctta
tttgc
cttc
aaaa
aaag
aaaa
gtac
aaag
tgttc
tcta
attc
tggc
tttta
gaaa
gttc
tagg
aagt
taaa
tgaa
cact
tttttt
tcac
tatg
aatta
tatg
agtg
aattc
atca
gaac
ggct
gcttc
taag
aagg
atttt
ggtg
gtttt
agaa
aggt
aggt
gttta
ttcct
ttgcc
tggg
aagc
ctca
cca
gaga
tgttg
actc
taca
accc
atct
cattc
gtta
gtttt
tgtg
gaca
gcag
ttgcc
tcat
cggg
aattt
tttttt
ttcac
ttcct
tacc
atgt
ttccc
ttcgg
atgg
atag
acta
gcc
856
3484
743
—N
Oat
ttaca
cttta
acaa
cttg
ctgg
gaat
ttaaa
caac
ttgct
ggga
acat
ccttt
gtga
ggca
ctgt
agcc
tttttt
tattt
gaaa
ctac
ttaaa
ctcc
aaga
cagg
ctc
acta
actc
tttttc
aagt
gcaa
caac
aaat
gtgt
tagc
tgaa
aata
gcta
attta
atga
ttattt
gtga
caca
gaca
cgat
tatg
tgag
ccaa
gaga
857
3765
643
INT
S2N
Occ
gaag
gattt
tgca
gtcc
tctg
tcag
taac
ttcca
ttgat
tagg
caga
cata
ttcag
gtaa
accc
taat
catta
aaaa
aaaa
ttatc
aatg
taga
aagt
aattc
cct
tttttc
tctc
tgag
atat
acct
caat
caca
cact
tccc
cacc
ccca
cttg
aaac
agac
ctct
tcac
ttgtg
tttttt
tttttt
ttttc
ctga
ggtg
gagt
cttc
ccct
gtt
gccc
aggc
tgga
gtgc
agtg
ggat
gatc
ttggc
tcac
tgca
acttc
tgcc
acct
gggt
tcaa
ggga
ttctc
gtgc
ctca
acct
cctg
agta
gctg
ggac
tgc
aggc
acgc
gcca
cctg
tattt
ttgta
ttttta
gtaa
agac
gggg
gtttg
ccat
gttg
ccca
gact
ggttt
tgaa
ctcc
tggc
ctca
ggtg
atct
gccc
acct
tggc
ctcc
caaa
gtgc
tggg
atta
cagg
tgtg
agcc
accg
cacc
tggc
caga
ccgc
ttcac
ttgta
aaag
aaat
tagg
ctaa
taag
aagg
tgta
gtttt
tgag
aaat
gaaa
tttaa
cttta
gcct
tttca
ctag
taaa
tagt
caca
tctc
atttt
cttc
ctttg
taaa
atgg
ggtta
ctac
tggc
ccta
cctc
atat
tcta
tgag
aatg
agttt
gtag
ctgt
ttcaa
atca
tgaa
gtgc
atag
tatc
acat
gtga
taga
atat
ttata
acttt
ttatta
gatg
ctta
atgt
tcaa
ttaag
taat
tttga
tgtg
aaaa
ataa
aagt
aata
aaag
tatc
ttaaa
aata
gcat
aaga
atttt
cata
ttttta
aaca
aggc
agttt
tgta
gtcc
ctta
agat
taaa
taca
actg
ctcc
tttttt
tttta
aact
gagg
ccttg
cgat
atttt
gtg
858
3810029
—NO
ctgg
tctc
agct
ggac
tatc
gctg
aaca
gagg
cgttg
gccg
caca
tccc
cttg
gaag
gact
ctttg
cttc
caca
ttcgg
aaca
ctgt
cctg
ct859
2658003
—NO
cttg
aaat
ttgag
caca
tccc
actg
tatta
tttca
ctgt
atgg
caaa
agtg
aagg
aggt
aggc
atgt
tctg
tcat
cctta
atta
agct
ttggg
ctgc
ttattg
aagt
atc
atttt
ttcta
taga
ttgga
acat
ggttt
atat
agca
gact
atct
tgtc
cact
gaca
ttgcc
cagt
actg
agttc
c
860
2788748
NO
atgt
agta
cattt
atgg
gctg
tgat
861
3242486
__.
NO
ctca
gcta
ctta
ggca
ggag
aggc
a862
3540024
MTHFD
YES
aggg
gagt
ggat
caaa
cctg
gggc
aata
gtca
tcga
ctgt
gg1
863
3439941
—NO
tgttg
ggaa
gcag
caac
aatc
agcc
atac
agct
ttcaa
g864
3457676
CNPY2
YES
ttgtg
gagg
aata
cgag
gatg
aact
cattg
aattc
ttttc
ccga
gagg
ctga
caat
gtta
aag
865
3843592
NO
gcct
ggcc
agttg
taac
tattt
atca
ctgt
ctgg
actc
ataa
acat
tttga
gatg
acttt
ggga
gtgc
ctgg
ggag
tttca
ttcag
cctg
gtgc
atat
gggc
agtg
aaaa
atat
gctc
atct
gtga
tagt
agat
gtgg
tttgg
ttcac
aaat
ccag
gggc
tggg
cttta
aacg
gcat
tagg
tgga
caga
ggag
ggttg
ctac
agct
gag
gtct
cagc
tggt
gcag
caca
aaag
tgaa
atag
ggcc
atgg
ctat
ctga
agcc
atat
gtgg
ttctg
tgtg
gcta
tgga
gtca
ttcca
ggag
acgt
tcac
agga
tga
tttgg
gatta
ccac
tgct
attg
taaa
ccat
ggaa
attc
tgtg
ttaat
attg
gatg
ctat
ttgta
tttgt
caag
cata
gtct
ggat
gctta
tcca
cattt
tggt
ataa
ggc
caga
attg
aagg
ccca
atat
gatg
tgtc
acttt
gaca
tcttg
tgaa
agtg
ggac
aacc
ttaaa
aggt
gtga
gtac
aaat
ctca
ttccc
actc
tgct
ctct
tga
ataa
tgta
tcct
agcc
taac
agcc
cgttt
tatc
aagg
tgat
aagg
cagg
gttc
cagg
ttgtta
tattt
tgtg
gtgg
gtttc
aattc
catc
ccta
ccag
caga
gtta
tgg
aaac
aaac
cattg
cata
ttccc
aagg
gaat
catg
ggcc
gtta
ta
866
2518951
NO
gcaa
ccct
gcag
tggt
agta
gtgg
tg867
3187461
NO
cctg
ggaa
tattg
gcgc
taga
gaac
agtg
atat
aattt
gagg
tgtc
taca
aatg
ttgtc
taaa
aaac
ctct
ctcc
tctc
atcc
catg
tttcc
tctg
ttctc
tact
ctta
aaat
ctag
gtat
cact
ggga
agag
taac
gcac
tagg
ccca
cagg
acct
tagt
atta
atat
aaat
gagc
tttag
ttttc
aaat
gtca
tttttt
tctc
tata
actg
gag
aaat
tcca
ttaat
aaaa
gaac
ccta
gggg
atag
aggg
taga
ggga
gcga
gagg
ggag
taag
ggag
gaca
cgga
cata
ctta
acac
tacc
tagg
gcac
cta
acca
tcag
aaat
g
868
3730738
—N
Ogg
caga
taaa
gtac
ccat
acta
aaaa
gatg
tcttt
agct
tggt
tgaa
aaat
aattg
tcag
tcct
acca
ctgt
gagt
tcaa
agaa
ctta
agaa
aata
caga
acttc
aagc
agag
atat
agag
gattg
869
2898998
SLC
17A
YE
Sgt
gaga
agag
atac
attg
tgtg
ttc4
870
3397924
RICS
YE
Sat
ctaa
agag
ctgg
tcta
cctc
gtgc
agat
tgct
tgtc
ag871
2494772
___
NO
taaa
tgac
tgtg
gact
agtg
cgcg
tttttt
gtttt
cag
872
3322617
___
NO
catg
accc
accg
taca
tgca
tggg
catg
atct
gattt
tttttt
taat
taaa
gttg
tcac
agtta
tcaa
agtc
tcat
ta873
3735156
___
NO
taaa
attg
aaat
gcag
gttg
ggca
c874
2346360
NO
gtat
tctc
tttca
tgtg
gtca
gtgt
atga
875
3757401
NO
cacc
agttt
agcc
tttga
gtgt
gcag
agct
ctgc
cctc
cctc
ccac
ccct
cagc
ccca
aatc
caag
atttc
atag
ccct
aaca
ccca
ccca
agca
gcttc
cct
caca
catg
ccct
ttgttt
tcttc
ctct
cttc
tatg
gttc
ctta
ggga
aagg
agcc
ttcttt
aggg
atga
aaag
ctaa
ctac
agcc
cagt
ctgg
cctc
cagc
agcc
cag
ggtc
agct
cagc
ctcc
actg
gagg
cgag
ggag
gagg
gcaa
aggg
catg
ggag
aggt
aggg
ctgc
cctc
cagg
agcc
ttccc
cttc
ccta
ggag
ccag
tcag
gattg
ggga
ggaa
ggca
gagg
ggtc
ctag
ggag
ctgt
caca
taga
ggaa
aagg
ggct
ggga
gtgg
ggat
gaca
agaa
gtac
aaag
aaag
aga
aagt
ttggg
gaga
tgga
taac
aagc
tcag
ctgt
gtca
gtga
tgtg
gagg
ggag
gtat
ggtg
gggg
acca
gcca
tggc
ccta
tcca
accc
cagg
ctcc
acag
gccc
caaa
ttggc
tttgc
aaat
ccaa
acat
tttaa
ggaa
gtgg
tttag
ggat
ggaa
gaga
aaca
cggt
agag
gtct
gtgc
tgtg
gattt
tcat
ctag
ggtg
gtgg
gaac
acca
gaga
tgtc
tcct
ctgc
cattg
tttta
gttg
gctc
ctgg
acct
catc
ccag
caga
ggga
gggt
gact
gtca
gcag
gtca
gtcc
cacc
gcca
ctgt
cccc
ctcg
cagt
gggc
ctct
gtgt
ctca
ggcc
tggc
cagg
ctcc
ctgc
ccct
caga
gctc
ctcg
tgga
cccg
ctcc
tcgt
cctc
atct
gact
tcag
cttg
agct
cgtc
gact
gtga
tcttg
ccgt
cctg
g
876
2386
080
—N
Ogg
gcgc
tttta
tgag
gcat
gagg
gagg
cttg
atag
gggc
tttga
aggc
actg
atgg
caat
tggt
gaag
taga
ctttt
atct
aaag
gggg
caga
ga87
730
1360
8__
_N
Ottt
gcct
cttc
gagc
ttggc
agtg
cgag
ccac
aga
878
3460
675
___
NO
atga
acac
tcgc
aggt
ggtc
aggt
g87
925
6460
1M
RPS
5N
Oca
cagc
ttggg
atgt
tacc
ttgcc
ttttg
t88
029
3937
4SL
C22
AY
ES
tgat
aatg
cctg
gaag
gtcc
atat
cgct
aagt
tctc
ctta
ctgg
ttgga
ttaat
ctttg
gcta
ccta
ataa
ctgg
atgc
attg
23
881
3320
209
___
NO
ccac
catg
attg
tgct
tgcc
ggga
g88
238
5262
7L
PHN
lY
ES
gtgt
aaca
accg
cacc
cagt
gcgt
ggtg
gtcg
ccgg
ctcg
gatg
ccttt
cctg
accc
ctgt
cctg
ggac
ctac
aagt
acct
ggag
gtgc
agta
cgac
tgtg
tccc
883
3349
758
—N
Oaa
acag
gcca
tgtg
actg
agga
gcag
gtag
cattc
atat
tattc
atat
ttcga
gcttg
gtag
atgt
agaa
ttttg
cagt
ggag
gcca
ggag
agga
tgtg
agag
ctgc
agga
caga
ggca
tagg
tttttg
gagg
tggt
tgtg
gtgt
tccc
cgat
ggca
atca
acttc
acgg
agag
aaga
ggtta
ttttc
tact
tagg
ttgcc
tgga
gcet
a88
434
2750
3N
Oag
gtcc
ttcag
gtgc
catta
atttt
tcttg
ttattc
aaca
cata
ttattg
agcc
tctg
taat
gtcc
tagg
cttg
ttcaa
gatg
ttcag
aata
aagc
agtg
gata
gaac
aggc
tttaa
agcc
ctgt
acttt
attg
ccca
agtc
aacg
tcct
taga
atac
aggt
acct
caca
acta
ccttg
aggc
tttta
ttatg
ttattt
tcttc
tgca
ttcag
gaat
tcc
ccag
agcc
aata
ggat
acaa
tttta
aaat
gtca
aacc
ttttta
gatc
tcat
acca
tttgg
cttc
agtc
cttc
tttcc
aact
gtat
cattg
tcta
tttct
aaat
aggg
atg
gtct
acac
tgta
aggc
tgat
ttgct
cattg
cccc
caca
aaat
ttgtc
ttgca
ttttc
tctg
ttcat
tcct
gcac
ctga
aata
atttt
gcca
ctgc
cttc
tcaa
agcc
ctca
aacc
ttcaa
tgtc
tggg
tcaa
agcc
gttg
gctc
ccag
gaag
cctg
tttta
gaaa
ttaaa
tccc
caag
ttcttt
tatc
tgca
ttgat
tattt
aaca
cctg
acag
ttg
caac
aact
aaga
tgat
ttgag
cgag
ataa
tgct
ccac
atttt
ttacc
gtgt
ctgt
gtttg
tagg
tgtg
ccca
actg
acat
actg
gcta
agag
ttcag
ggtc
cca
885
3835463
NO
gtcc
tgga
tcat
caac
ttggt
taca
acat
ggtg
ttaac
atgt
ggga
ctga
ttgtg
tttct
tgtta
atat
ttctg
tttct
ccca
acca
cgct
gaat
att
886
2809576
NO
tcaa
atct
atgg
ctta
ccag
agag
aaga
ccag
agcg
caat
atct
ccgt
ggcc
ttagt
cctg
act
887
3601075
NO
atgg
gcca
aagc
tggg
gcac
agga
aa
888
3864620
C19orf6
YES
gact
tatg
ttttc
cggg
ccca
gagc
gctg
aaat
gaag
gaac
gagg
gggc
aacc
agac
cagt
ggca
tcga
cttc
tttat
tacc
ca1
889
3888182
CSElL
NO
ggaa
gttc
tcct
tttga
acttg
tcac
gaat
tcca
tcttg
taaa
gg890
3558243
RIPK3
YES
cttc
ggga
tcct
aatg
tggg
cagt
gct
891
3702633
___
NO
tttac
tcct
aagc
acca
gtga
tgac
ctcg
tcaa
gcag
ggtg
ggct
gcag
ggag
atga
cagc
gggt
atga
aagg
gtgt
ga892
3433106
NO
tagc
tcaa
tgca
accg
acgt
ttctg
agcc
ccca
gaat
ctga
tcca
gatc
ccgg
acat
ataa
acca
cgcc
aaga
agac
aggg
gctg
tctc
tagc
acat
tctc
tcg
aggg
agtc
cggg
gccc
cttc
ccag
acgc
aact
gcaa
aagg
aagg
gcta
acgc
catg
gcgg
gccc
gtgg
tttat
tttat
atcc
gaca
aagt
gcac
ggc
agcc
tgaa
ctgg
actg
gaat
gaaa
agac
gtgt
ctgg
gacg
ggtc
tacg
ggga
cgcg
ctgc
ggga
cctg
tccg
gcttg
gcttc
caag
ccgc
taac
caac
cgc
tgga
tgct
ctgc
agttc
gctg
gtgg
ccgc
ctct
ttctc
cgcg
caga
gtttg
ggcg
acaa
893
3544
695
TT
LL
5Y
ES
gggg
cggg
gcgt
ctac
ctga
tcaa
caat
894
3219
716
—N
Ogt
ggca
tctc
tgct
tgga
agtc
ctat
tctta
gtct
cagc
ttcag
taca
ccct
caag
ctat
ctta
tttgc
ttcta
aaaa
tctc
ttcta
gtac
ttctg
tttca
gtct
ttgta
tagt
tcct
atct
gttc
tttgg
atat
taat
tatg
tata
gtca
cgta
tata
tact
gtgc
agac
ttattt
agag
gcttt
ttgtc
atat
tttgc
ttaaa
cctg
ctac
tgtg
caat
ctta
cattc
cagt
tcttt
catta
gtat
ggtc
tttttt
tttcc
tttct
cctc
tcttc
taca
cttc
tattc
tact
tttcc
ttcttg
gtcc
ctttg
cgta
gcat
ctttt
gcct
ggcc
atct
cagt
tttat
ctttt
aagt
ccttt
ttcttt
gttc
atcc
cctg
cttc
agtg
agct
tttgc
ttctc
tgaa
ctttg
ataa
caag
tatta
ctga
acca
tttat
ttctg
ggtg
taga
attc
atc
caaa
ctga
gcttt
tgag
ggaa
cctta
ttggt
tatc
taat
tcat
tctc
ctgg
ccag
taca
gctg
tcat
tctc
tgaa
acat
ccat
gaca
gtat
ctac
ttgaa
tgttt
aatg
gaca
tctc
aaat
gtaa
cgtc
agag
actg
aattc
caga
ccttc
ctcc
agttg
tctg
ttcttc
tcat
agtc
895
3442
242
ZN
F384
NO
aagg
agaa
ggaa
taag
acgg
cagg
agg
896
3293
761
—N
Oac
cctc
ctta
ccttg
tcct
taga
tgct
taac
atttt
tgttt
ggttt
tctta
taaa
aata
acag
atct
tatg
aagg
taga
ggtc
agat
gctg
aatg
agcg
tctg
gcaa
aag
tggg
atct
gagc
agct
gatg
cggg
ctgc
cagg
gcca
agcc
aagg
ttgtc
ccca
aggc
ctgg
ccttg
gagg
gtcc
cccc
cgac
cggc
cctg
tccc
tgcc
cttg
ctcc
ctgg
gagg
ggac
ggtg
gatg
ggtg
ctgc
tcac
accc
acaa
ggct
tccc
tgtc
tagt
caca
gctc
tgtg
atct
ccag
gggg
gtct
ccat
gatc
acgt
cgcg
aa
897
3829701
GPI
YES
ccca
ggag
acca
tcac
gaat
gcag
agac
ggcg
aagg
agtg
gtttc
tcc
898
3539958
SYNE2
YES
agtg
agaa
ctac
agaa
ggcg
agga
gg899
3620286
EHD4
NO
ccct
atat
tattc
tatc
cgtc
ctcc
ag900
3621069
___
NO
atga
atcc
acct
aaaa
gggg
caga
ggca
901
2496816
—NO
ccac
ccat
ggga
agct
gatg
agttt
tgtg
tattg
aattc
gaag
agca
gaga
atgg
ggga
aaag
gaga
ttccc
ccat
tgaa
aacc
taga
taat
gaga
gagg
cgg
aacc
aaaa
gcag
actc
agaa
cttc
actg
agaa
taat
gggg
gtgt
agta
gtca
ggtg
agag
agac
tgga
ctac
caaa
tcat
cgttg
ctat
ccag
tta
902
2644239
NO
tgaa
gacc
agtc
gaag
gagt
gcag
acca
ggaa
aagc
ggga
cgttg
atttg
gatg
gga
903
2832085
ZMAT2
YES
tgga
ctttc
gccg
aaag
tggg
acaa
agat
gaat
atga
gaaa
ctcg
ccga
gaag
aggc
tca
904
3757844
STAT3
YES
tgct
gaac
cctc
agca
ggag
ggca
g
905
2943556
—NO
cctc
atct
gctc
ctac
tacc
ttgca
906
2459948
—NO
aaac
ttttc
tcga
tacc
cttc
tgtg
atga
ctt
907
3119958
GRINA
YES
ctttg
tccg
ggag
aatg
tctg
gacc
tact
atgt
ctcc
tatg
ctgt
cttc
ttcat
ctct
ctca
tcgt
cctc
agct
gttg
tggg
gact
tccg
gcga
aagc
ac908
2567115
—NO
cctg
atta
gcat
atca
cagt
gctc
tagg
agtc
ctgg
aacg
ccttc
ctct
tcca
tgtg
cact
gctg
ctga
catc
cagg
ccat
tttgg
aacc
tgca
aatg
tctc
ttcc
c
909
2893649
—NO
taag
tctta
gtaa
ctcc
gtgc
tgga
tgtg
cattt
tgca
ttcag
cttc
tttat
cata
aggg
agga
acaa
agac
tggg
gctc
ctgc
tgac
atct
gcag
acac
aaag
ctga
gtcg
gaat
ttgtg
gtct
tagc
ttcgg
ttcag
cacc
aagt
gtat
ttata
agttt
gcag
tcat
aggt
tttgt
tgtc
caaa
tccc
aaca
aagc
ccttt
ccca
cctc
cac
ccca
cccc
ggaa
gtta
tata
gggt
tcct
tgaa
ggga
caaa
gcct
ctag
aaaa
agaa
aagt
cact
gaaa
gttg
taat
gtga
aacc
agca
aaga
gaaa
ata
catg
tttgg
tttag
ttgtg
gtca
cagt
ccat
ggtg
910
2346547
—NO
ccct
gaca
gatc
tcgg
tctg
gggc
acag
ga911
2852745
AMAC
NO
ctgg
agct
tccc
tgga
ctca
acttc
ctaa
aggc
atgt
gagg
aagg
ggta
gattc
caca
atct
aatc
cggg
tgcc
atca
gagt
agag
ggag
taga
gaat
gga
Rtg
ttggg
tagg
ccat
caat
aagg
tcca
ttctg
cgca
gtat
ctca
actg
ccgt
tcaa
caat
cgca
agag
gaag
gtgg
agca
ggttt
cttc
atct
taca
gttg
aga
aaac
agag
actc
agaa
gggc
ttctta
gttc
atgt
ttccc
ttagc
gcct
cagt
gattt
tttca
tggt
ggct
tagg
ccaa
aaga
aata
tcta
acca
ttcaa
tttat
aaa
taat
tagg
tccc
caac
gaat
taaa
tatta
tgtc
ctac
caac
ttatta
gctg
cttg
aaaa
atat
aata
caca
taaa
taaa
aaaa
tata
tttttc
atttc
tattt
cattg
ttaat
caca
acta
ctta
ctaa
ggag
atgt
atgc
acct
attg
gaca
ctgt
gcaa
cttc
tcac
ctgg
aatg
agat
tgga
cact
gctg
ccct
cattt
tctg
ctcc
atgt
tgg
tgtc
cata
tagt
acttg
atttt
ttatc
agat
ggcc
tgga
aaac
ccag
tctc
acaa
aaat
atga
aatta
tcag
aagg
atta
tagt
gcaa
tctta
tgttg
aaag
aatg
aac
tacc
tcac
tagt
agttc
acgt
gatg
tctg
acag
atgt
tgag
tttca
ttgtg
tttgt
gtgt
tcaa
atttt
taaa
tattc
tgag
atac
tcttg
tgag
gtca
ctct
aatg
ccc
tggg
tgcc
ttggc
acag
tttta
gaaa
tacc
agttg
aaaa
tattt
gctc
agga
atat
gcaa
ctag
gaag
gggc
agaa
tcag
aattt
aagc
tttca
tattc
tagc
cttc
agtc
ttgttc
ttcaa
ccat
tttta
ggaa
ctttc
ccat
aagg
ttatg
ttttc
cagc
ccag
gcat
ggag
gatc
acttg
aggc
caag
agttc
gaga
ccag
cctg
ggg
aact
tggc
tgga
cctc
cgttt
ctac
gaaa
ta
912
3659
213
NO
acct
ccaa
ggtta
cagc
acttt
ctaa
gaat
gtgc
ctat
gtag
catg
tctg
tccc
ccca
ctgc
ctcc
acca
aaat
tctg
agaa
caaa
gaca
ctgt
aggg
attg
aacc
tgaa
gagt
attc
gctg
gaat
ggta
tgct
aagg
tttct
ttcct
gctg
taca
cctc
ctct
gccc
caac
aagg
tgct
ttaag
ttctg
tttttt
aattt
agga
aacc
tgtt
ctgc
tatta
taga
aggc
catg
aaaa
gaga
ccag
tgat
ttatc
taat
cata
agca
aaaa
tagt
agaa
ggct
attg
tcttg
tcac
ttaag
ttgaa
tgaa
tccc
agtg
attg
tagt
tgct
tcac
gaaa
aata
acca
atttg
ttcta
caga
ggta
gtct
gttc
ccag
agat
tgaa
aaaa
aaaa
agat
gtag
gaaa
gttta
aata
aata
aaat
tgt
ttaaa
ttcac
aatg
gagg
gcca
attc
acat
ggcc
ctct
gcat
tttga
tgta
attc
taac
agttt
tact
agca
gaat
ctaa
caaa
ttttg
aaag
tcag
agcg
tatg
tata
gata
tata
taca
catg
caca
cattt
tttta
atga
tacg
ttatc
cact
taaa
aaaa
aaag
tctc
aaga
atgt
cata
agaa
actc
cctta
acct
caat
gagc
ctca
ggt
ttcta
atcc
ctaa
a
913
3296
517
POL
R3
NO
ctca
tgta
acct
ggga
agag
ccat
aaag
gggt
accc
ccca
cccc
aact
gcct
tcct
ggaa
gcat
gcct
ggga
agta
agct
gtag
gcct
gatc
cgag
ccag
Aga
ccac
tggg
atcc
gtgc
aaag
gcca
gggg
aggg
taga
attg
aatc
atag
gagt
gaac
tccg
gcac
ctgt
tgtc
ttctg
cggg
tgac
gcac
acttt
aaat
gg
914
3028090
—NO
cttg
gtcg
tcac
attc
tgct
tttag
915
3291113
—NO
aaga
tcttt
gagt
tgta
cttg
gtgt
gtgc
tgttt
ttgttt
gaga
tgca
gtga
atgt
gcaa
atag
ccca
gccg
taaa
tagt
tact
taaa
ataa
tctc
ttggt
actg
ggt
agca
gcag
taga
aagt
tgat
ctgt
tta
916
3616816
—NO
gggc
cgaa
actg
ctgc
cgtg
catta
gctc
tgtg
gagg
ggag
tcat
aaat
tctg
gttg
tcag
cgttt
tccg
tggc
tttgt
g917
3842744
—NO
ctcc
atat
atta
gctg
gggc
tgcc
ataa
caaa
acat
cata
gact
gggt
ggct
taaa
caac
aattt
acttc
tcac
agttc
tgaa
gtct
ggaa
gtct
gaga
acag
gtc
gcca
gcat
ggtta
ggttc
tcgt
aggg
ttctc
ttcct
gttc
918
3956320
PITPNB
YES
tccc
gagg
gctc
cttg
gtgt
ttcat
gaga
919
2473315
—NO
ctgg
gctg
ccaa
tctc
ttgcc
ttggg
ggca
gagc
ctca
tctc
tctg
gcta
ctgg
ggtta
gcta
ccttg
ttgga
gctg
gcaa
agcc
attg
gtgt
tgac
atcg
gggg
tgac
tcct
gaag
ccgc
cata
tacg
tgct
gcca
atgg
ttttg
atct
tggt
gatc
accc
ggaa
cttg
ggat
tgtc
cagg
agct
gagg
ccag
gattc
agga
aaga
attg
tcag
cagt
caag
ggag
agga
atga
ccca
accc
agtg
acag
atgt
acct
gttta
caag
gacc
ccac
taaa
ccca
agag
acac
tttcc
tatc
aaag
ttgt
tact
tagt
ctac
cgct
tatc
tgcc
acat
gatc
ctgg
gaga
acttc
920
2659117
NO
tgcc
taca
tgct
ccttc
tagc
tttgt
ttccc
tgat
gttc
cttg
ccac
ctgc
tact
cctc
tctg
agct
acttt
tcca
ggttc
caca
ggga
gagg
ttcag
ctgt
ccgt
ggta
gaca
tgag
ggta
gaga
atga
ggtg
gttg
ggttc
cact
tacc
tttct
atca
tcttg
cagt
atgg
atgt
ctct
tgac
tggt
accg
tgta
921
2826873
NO
gata
tttat
ttggg
tgca
tgttt
cct
922
3498544
—NO
tggc
gtct
gatc
agga
aggg
actc
tgct
gcgg
ctgt
ggat
ctga
cctg
ggta
tcag
atgg
tgat
gcttt
gtaa
atag
tgtg
cctc
atag
gggt
ctgt
tctta
gtt
aaaa
ggag
cctg
cttg
gtga
ttcct
tttg
923
2756844
SLC26A
YEJ
cagg
ccat
cgcc
tact
cattg
ctgg
ccgg
gctg
cagc
ccat
ctac
agcc
tcta
tacg
tcct
tcttc
gcca
acct
catc
tact
tcct
catg
ggca
cctc
acgg
ca1 1
tgtc
tccg
tggg
catc
ttcag
c
924
3088854
NO
atgt
gccc
tacc
caga
cagt
gcct
g925
3173548
PGM5
NO
cggg
agat
gcttc
actg
atgc
cttc
ttgct
acct
gtttg
tgcc
tctta
tgac
tttgg
aaaa
acaa
aaga
tattt
tgct
tttgg
ggga
taga
gggt
gggt
ggga
aaa
gaaa
aaaa
atcc
atttg
gtttt
ggttt
tgtc
ctat
tcct
ccaa
atgc
agca
gggc
cttta
gttg
tctg
ttaaa
gctg
cact
ataa
tttgg
tatc
taca
tttta
tcac
aca
aagg
aacc
tccc
ctttt
gaca
acaa
ctgg
gcta
ggca
gctg
ttaat
caca
acat
ttgtg
catc
acttg
tgcc
aagt
gaga
aaat
gttc
taaa
atca
caag
aga
gaac
agtg
ccag
aatg
aaac
tgac
ccta
agtc
ccag
gtgc
ccct
gggc
aggc
agaa
ggag
acac
tccc
agca
tgga
ggag
ggttt
atct
tttca
tcct
aggt
cagg
tcta
caat
gggg
gaag
gtttt
atta
taga
actc
ccaa
cagc
ccac
ctca
ctcc
tgcc
accc
accc
gatg
gccc
tgcc
tccc
ccat
ccca
tccc
caa
catc
cctg
tacc
acct
tctc
tcac
atct
tcta
aagc
tttgt
acaa
atca
caat
ggtg
cact
tcca
acaa
aata
tatc
aata
ggtg
ttttc
ctct
ctta
ttttg
taaa
tag
tatta
tttta
gcta
ttaag
ctgg
atac
cttc
tttca
aattc
agcc
attc
agttg
taaa
gttg
ggaa
gaag
tttct
tgac
aaga
ctct
gcaa
ttaaa
tgct
taaa
atttg
gag
ggga
tcct
tcct
tgat
taca
tcaa
gtat
gttg
gtac
atgg
gttta
taca
agttc
ctct
tgag
aagg
caaa
aaga
ccac
catg
tgtg
agag
ctct
ttgac
ttgg
ccaa
tagg
ggcc
tatc
ttaat
gcac
ttgttt
ggac
acat
ttctg
atct
tattt
gtaa
aggc
tgca
aaag
gaga
ggat
gaaa
tgct
gtaa
aagt
agga
aatg
aagt
ggaa
gctg
gaag
aaaa
tgta
attg
gtgg
taca
gcta
tggg
ccag
atgg
tgga
gggg
aggg
tggg
gacc
cctg
c
926
3200
057
BN
C2
YE
Stc
cctc
ccaa
ccag
tccc
atca
ttcca
acca
gtgg
tacc
atag
agca
gcac
cc
927
2974
443
MO
XD
lY
ES
ggcc
atga
gagt
ctgg
tgca
ccac
atcc
tgct
ctat
cagt
gcag
caac
aact
ttaac
gaca
gcgt
tctg
gagt
ccgg
ccac
gagt
gcta
tcac
c92
823
8590
5__
_N
Otc
taaa
aggc
gtgg
aaac
attta
taag
aaat
acca
atta
tgca
gtgt
tggt
gtca
gtgt
ggac
ttaac
tgcc
gaga
ctga
ctta
ttgcc
ag92
936
3956
6—
NO
acag
gtta
tttct
atgg
acag
gctc
ttgac
agaa
actc
tgaa
catg
gata
ttact
gtcc
ccttc
attc
aaat
gagg
ctga
actg
aggc
tcat
aagt
attta
actc
tcac
acct
agtta
acac
agta
agat
ttgaa
acca
gata
tata
tgaa
tcca
cgaa
gtct
tatc
tg
930
3200
565
NO
tggc
atttc
atga
gggg
aggc
ttatc
tgtg
g93
130
8378
1M
CPH
lN
Oat
gtag
tggc
ctat
gttg
aagt
gtgg
tcat
ccaa
tgga
acag
aaaa
ttattc
aaag
acat
ttaca
acac
agct
tgtg
gata
tggg
ggca
a93
238
2937
6__
_N
Oat
ggcc
gtcg
tcag
gagc
attg
tgga
933
2359
017
___
NO
gtaa
ccgg
ggga
tcttg
cttg
tcag
tgcc
tgga
c93
431
8098
2H
AB
P4N
Ogg
ggag
acttt
tcca
gctg
ggcc
aagg
gagt
caga
ctct
aaga
acaa
taga
tgttg
ctttt
cccg
tgtc
atgt
aaat
ttgttg
cact
tttttg
ggct
gagc
tgtta
gagg
ggct
tctc
caga
ggct
cgag
agca
ggcc
atttc
ccaa
gaag
atga
agaa
tggt
gact
gtgt
tttta
ttgaa
ggaa
tttca
aatg
aaga
ataa
tgttt
aaa
atgt
gtat
atag
agat
agta
taga
ctcc
tccg
cgga
agca
tgga
ggga
aagg
aggt
tgta
aaat
agac
tcca
tgga
gact
ctta
ggaa
gcag
taga
ttccc
ggg
ggct
gtgc
cttta
gcgt
taga
ggaa
acac
atag
agct
ggaa
ctgt
taat
ggaa
agca
gtca
cagc
tgag
ttttc
ggag
acca
a
935
3915
249
—N
Occ
aagc
tgta
gatc
ttccc
aaga
tattt
ttcag
gaac
gaag
catg
taaa
ctta
catg
agtg
aaag
acag
tggt
aaaa
aaaa
aaaa
aaac
agtc
tattt
tggg
ctttg
gctc
ctac
ctaa
atct
agct
tgtt
936
3263
765
—N
Otg
caga
gtcc
taca
gctg
gttc
aagg
cagc
atca
ggct
caga
accc
ctttc
tctc
agct
ctct
gttg
tgat
tattt
cagg
acct
tgttc
tgttt
tcca
catg
gcct
cta
atct
agac
acct
gacc
tggt
ggga
gtaa
ccttt
ca
937
3303
272
ER
LIN
lY
ES
ggag
ctca
aaaa
gtac
cagg
ccat
tgct
tcta
acag
taag
atct
atttt
ggca
gcaa
catc
ccta
acat
gttc
gtgg
actc
ctca
938
2683
681
—N
Ogg
tgac
ccag
gcac
gata
gcta
agaa
ggca
aagt
cgct
tgttc
tatta
gctg
tttag
gcat
ctttg
cagc
catc
cttg
atttg
gagt
ttttg
atct
tgat
ctaa
ttct
atcc
ctca
aaac
cagc
cctta
caat
ctca
cgtg
ccca
cctc
ttcca
tgac
agtc
cttg
ggcc
taga
ggga
gggt
gattg
tata
gtttt
agca
gcag
ggca
tttg
cagt
gaaa
aaca
gattg
ggcc
cagt
ggga
tgcc
aaat
gaga
gagg
gtcg
catc
t
939
2683
865
RO
BO
lY
ES
tggt
gtgt
ggct
tcat
acat
ttggg
gacc
ctat
ttcca
ctcc
ctcc
tcttg
gcat
gaga
ctgt
atac
agga
tcca
cccg
ag94
037
8111
6—
NO
ggca
gctg
ccct
gtaa
tgttc
tacc
agta
cttc
agac
tcaa
catg
cacg
aaga
ttaa
941
3050
281
—N
Oat
gctg
ccct
tcga
tgac
cata
gtga
atgt
tcca
caga
gtgt
acta
cgag
ccac
aaac
acca
ggga
gaga
accc
cagc
atca
gtgg
agta
gaaa
caaa
aagg
tgttc
tgtg
tttcg
cact
tggg
tcct
g
942
3903
591
—N
Oat
cggc
atag
ccta
gatg
ggtc
ttctg
tttca
gggt
ctct
caca
ggag
gtag
tcaa
ggtg
tcag
ccag
ggct
ggtg
tctc
atct
tagc
gcttg
gctg
ggga
ataa
tctg
tttcc
aaac
taac
ttggt
tggt
gaca
gaat
tctc
ttcttg
aagg
gatg
ttgaa
ttgaa
gggc
ttggt
ttctc
actg
gctg
atgg
ctgg
aggt
tgct
ctga
attc
ctttc
catg
tagc
ttcttc
aaca
tcac
aaag
ctga
gaaa
ggaa
tagc
aagt
ctcc
tagg
aaga
tgga
aatc
acaa
tttttt
ataa
tcat
ggag
tggc
atcc
tatc
acat
tttcc
atag
tccg
ttagt
taga
agca
aatc
acca
ggcc
agtc
cata
ctca
ggag
ga
943
3115
066
—N
Ogt
ctag
gggt
taca
ggat
catc
atct
aagt
tctg
cttc
tgcc
tgga
cttg
ctgc
gtaa
tttga
atga
gtgg
ctaa
acct
gact
ctttc
tcct
cttg
944
2509
773
MB
D5
YE
Sca
ctaa
caga
aggt
ttgga
agcc
taca
gccg
tgtc
cgga
aaag
gaac
aga
945
2322
410
NE
CA
PNNO
Ottc
tcttt
ggtg
gtttc
taaa
gtgc
ct2
946
2837
342
CY
FIP2
YE
Sttt
tgtg
cgga
ctgc
cattc
ctttc
accc
aaga
acca
caac
gaga
caaa
cctg
ccaa
cgtc
cagc
ctta
ttacc
tcta
tgga
tcc
947
2936
910
—N
Otta
aagg
tgta
gagt
aaga
agaa
tcaa
tacc
gtgt
gttg
gata
ttaag
caat
tggt
aatta
atct
tgct
tcct
ttatc
ttgac
aaga
tgtg
ctttt
tcac
ttac
948
3958
553
SYN
3Y
ES
atcc
ttcaa
gcca
gact
tcat
cctg
gtcc
gcca
gcat
gcct
acag
catg
gccc
tggg
ggaa
gact
accg
cagc
ctgg
tcat
cggc
ctgc
agta
tgga
ggg
ctgc
ctgc
tgtc
aact
ctct
ctac
tccg
tc
949
2901
722
NO
acag
taat
ggaa
gacg
ggag
ttgca
gtgc
tcag
tcat
ggaa
ttcct
ccta
950
3092
902
NR
Gl
YE
Scc
aaat
gagt
ttact
ggtg
atcg
ctgc
caaa
acta
cgta
a95
124
3710
2K
RT
CA
NO
tgcc
acgt
gctc
agaa
attc
acag
caga
tgca
atct
catg
taaa
accc
tcat
gtgg
taaa
acaa
agat
ctat
catg
gttg
ccct
taat
ctttt
ctct
tttttt
tttttt
ttP
2ttt
ttttg
agac
aagg
cttta
ctct
gtcg
ccta
ggct
ggag
tgca
ggga
tgtg
atca
ctga
agcc
ttgac
ctgg
tctc
aagc
aatc
ttccc
tcct
cagt
ctta
cca
gtag
ctgg
gact
acca
gtga
gtgc
cacc
acac
ccag
ctac
tgttt
taag
tttttg
taga
gatg
gggt
ttcac
caca
ttgcc
cagg
ctgg
tctc
gaac
tcct
ggg
ctca
agtg
atcc
tcct
atttc
agcc
tccc
agag
tgct
ggaa
ttaca
ggtg
tgag
ccac
tgtg
cctg
gctg
ccct
cattt
ctttg
cccc
cctc
tagg
acttg
ctttc
tcc
gcat
agcc
ctttt
gcag
gcttc
agag
ttcttt
ccat
ccag
tagc
cccg
ggac
ttctc
tctg
ttagg
ttttg
Ul
952
2657
991
CC
DC
5Y
ES
tatg
ccga
gattt
tgct
gtcc
tgga
gU
i0
953
2520
359
—N
Oct
ttgag
cgat
gaaa
acag
ttttta
tcttt
ttagc
aaag
ttgta
gtta
ggat
cttc
tgtc
tgttt
ttagg
ggtc
tcca
ctgg
tatta
aagg
cctg
a95
425
9227
4ST
AT
lN
Ogc
tgag
gttta
gctg
tcag
ttcttt
ttgcc
ctttg
ggaa
ttcgg
catg
gtttc
atttt
actg
cact
agcc
aaga
gact
ttact
tttaa
gaag
tatta
aaat
tcta
aaat
tct
atta
atct
ctca
ttaat
agta
tttaa
tata
aaga
ttctta
aaat
tact
gacg
ttatg
aattg
gtttg
atgc
955
2681
309
—N
Oat
cttc
taag
tgtc
catg
cagc
ttgaa
tcat
tgca
caca
gtct
tcta
tgag
ttatg
aaaa
taaa
taaa
accc
cctc
ttttc
ccca
tttta
tggt
tgtg
aaac
tgac
acag
ataa
caca
aaag
ggta
gatta
ggga
aagc
cata
tccc
ctct
gcca
ccca
cctc
caat
actg
atcc
agct
ttcct
ggat
cagg
ctag
tagg
gcgt
tgag
gcttc
ttact
agaa
956
2961
267
CO
L12
YE
Sttg
gaat
ccat
ctcc
tagt
ccag
tgac
tggc
taca
aagt
catc
ctca
cacc
aatg
actg
cagg
aagc
cgac
agca
cgct
ctga
gtgt
gggg
cctc
agac
aaA
lcc
acgc
tcag
tgttc
gcga
cctc
tcag
caga
caca
gaat
acca
gatc
agtg
tttcc
gcca
tgaa
957
2854
242
RIC
TO
NO
ggct
gcat
ccta
ttacc
acaa
tggg
gtgt
gcta
taac
tgct
ggta
ttaga
gagg
gaac
tttgg
ccct
ttcac
gtttt
tctta
atgt
ttgta
acac
tact
tcag
aggt
tR
tata
acct
caaa
gcag
aaga
agag
cctc
aaca
accc
ggga
ctta
taag
ttattt
ttatg
ttact
agac
ttgca
taaa
gattc
ttgttt
tcca
actc
ttcat
tttgt
tgc
aatg
tgtta
ttaca
ggat
atat
gaac
caat
taag
gtttt
tcac
taca
gttc
ttgaa
taaa
attta
aaaa
tcat
ttttta
tttta
atta
aaaa
tattt
ccca
tttat
agaa
tgc
atat
atttg
caat
ggac
ttcca
ctttc
atca
acttt
ccat
ctca
tcgc
tttaa
acag
gaac
ttgaa
caag
cact
gtta
gttta
gacc
taaa
ggat
agga
aagc
atta
aata
atac
tttgg
atct
cctg
agga
aaag
ataa
gtttg
cttg
caat
ttaca
cattc
catg
ggga
aaga
agag
ccat
atttc
ctta
aaaa
aaac
atta
ataa
agct
tgt
tattg
agaa
aaat
tgta
gtga
aaag
cctta
agta
ccaa
atttt
aaag
cagc
agta
actta
atttt
tata
tcag
tgttt
ttgttt
tgca
caaa
ctaa
atgc
agtg
gta
ggtg
ggttt
atga
gtat
atta
attg
ccttt
atcc
atttg
tgaa
gtta
agttg
atga
gggc
aagg
tttttg
tttgt
ttaat
ttgta
tatg
tcta
aagg
tattt
ggaa
ctttt
taca
ggaa
ttaaa
cata
tatg
caaa
tttgt
atat
aaaa
atag
catg
gcca
tcat
ttgaa
tgct
tgta
aatg
aaag
gatta
tcttt
tttga
gatc
tata
tata
aata
gaaa
tag
aaaa
tcca
gctg
gact
gatta
ggat
tcttt
tttaa
ttcat
ttgtg
tata
acat
tttta
ttaca
atta
caca
tcag
ttttg
acac
agtc
atag
caac
atta
atat
tttcc
cat
gatg
caga
tcct
ttttg
taat
gggc
ttgttc
tttga
958
2957
263
—N
Otg
ggga
ggca
cata
acag
tttttt
attc
tggg
aaga
gcca
gttc
ccca
ctca
acat
attc
aata
ggca
caga
gacc
aggg
gacc
acgg
aaag
ctcc
agtg
acc
cccg
accc
ccgc
caac
tcttc
ctaa
caac
atttg
actc
cttg
ccct
cctc
cgttg
gaac
tgtg
cttc
ctgg
aagg
aaag
tgat
tgaa
gaag
aaga
gatg
tagt
tctg
taaa
aggc
ataa
aaac
agct
tgttt
tttta
aaaa
aata
atat
ttttc
tgtta
tgat
gcaa
atttt
ttcat
gact
cttc
tttct
ctca
ctct
ccac
agtc
atttc
atc
ggca
ggtc
ctgc
cagc
tctg
cctc
ccaa
acac
attg
agac
tgtc
tgct
gcttt
ctgc
ctgc
acca
ccaa
ccct
agtc
tagt
gacc
tttga
ccag
ggga
ga
959
3429
874
OC
C-I
NO
aaca
gaag
aatc
cgta
acag
aaga
tgac
a96
037
6691
5—
NO
ctcc
tcca
ttgaa
gcag
attg
atta
aaac
agct
tagg
aaag
ggca
aact
tgga
tcac
gagc
agtg
gattt
ttttc
atat
ctga
tagt
gaat
ttaac
tttttc
atttc
tgg
cgaa
atta
aaga
gatc
tgtg
acca
aaag
tggt
caag
cact
ggag
tctg
a
961
2691
040
—N
Ocg
cact
ggca
ttgga
acat
aata
aact
aaaa
ttaaa
atgt
ttcttt
actg
tgcc
tcac
cattg
caca
atca
aaaa
aaat
tttttt
gcat
acac
tgtg
gtga
aata
att
tttaa
aact
tgga
cttg
ctac
tgtg
aaga
ctga
tgttt
aaag
tttat
aact
gtat
aaac
atttt
ctat
gtcc
cccc
ccac
acca
tgga
tttct
tata
acat
tgga
tata
ttttt
ccttg
ctgg
acta
ttctc
ttttg
gattg
a
962
2824
389
DC
P2N
Oag
tgca
ctgc
tttct
taca
ggaa
gaca
actc
agaa
caca
cagc
cata
atct
gcttt
ggtg
tcag
tcac
atca
cagt
ggtta
taca
tttgt
cctg
aatg
gtga
ttca
ttcct
ttcca
tcta
acttc
cttg
tgac
aaat
agca
tttac
tattg
aaat
agtc
ttaaa
atag
ggaa
aaga
gtga
aagt
tact
ctta
tgca
tggg
agtg
ggct
taaa
ggg
ttgac
ttcca
ccag
ctga
gtac
aaac
aagt
gtaa
agaa
aaac
tatg
ataa
agta
tgga
ggtc
ttgag
aattg
ttcat
aatg
aaat
cttta
atct
gggt
tatc
aga
gtaa
tgaa
atca
ccaa
taaa
aaca
ttcca
tttat
cttg
gatc
ttctg
tggt
tagc
agag
gagt
atgg
tcta
gcta
tgtta
gcag
cttg
aagg
caca
acgt
aaa
agga
atga
gcttt
caca
aggt
gctta
cgtta
gctg
ggta
gtat
tggt
caaa
ggta
ctct
gaag
agtg
aatg
caga
aatc
agttg
gctg
tgttt
gtaa
tcttg
tcc
caca
tgga
ccct
aacc
tttag
taga
cttc
agtc
tttta
gtcc
agta
gaag
acaa
tggc
cctg
aata
tgtg
ttttc
tgct
ttttg
gtgg
aagg
tttcc
tgca
ggtg
gtt
gggt
ggttt
ttggg
ttattg
gcta
ctgt
tcgt
tcag
tgcc
atttg
aaga
cctg
ttgttt
ctta
atcc
tttttc
ttggt
gtga
ctgg
ccag
gaaa
aagt
ccat
tgag
tcct
tttttt
ctcc
ctgg
tgtc
ttacc
ccttc
tgta
gaat
aatc
cata
tttta
aaat
ttgttt
caca
tcttt
atag
caaa
cctg
ccac
agac
ttgca
agca
gtga
tttta
tcttt
cct
tcct
gaga
aggt
gatc
tcac
ttttg
atgt
ttggg
tcac
ctgt
ggac
acct
gtgc
tcttt
tgag
tctc
aagg
ctta
tagc
tttca
cattg
gcttg
ggaa
caac
aga
cttg
tgtg
tggt
tcta
gagt
gaaa
tggg
cagt
gttc
tgct
ggtc
ctca
gtct
ttgag
ag
963
4054
485
GA
BR
DNN
OO
ctct
gcag
gatc
ggga
tcag
agcg
tggg
agga
ggtg
gggg
tgga
cgtc
catc
cggt
gaac
agtg
a
964
2871
027
EPB
41L
YYEE
SScc
cggc
ctga
tcag
aatg
tgac
aaga
agtc
gaag
caag
actta
ccct
aagc
gaat
agca
caaa
caca
gcca
gctg
4A
965
3468
170
—N
Ogc
tgag
ccca
taac
ctgc
gtct
tctg
ctct
agcc
tcttg
tctg
tctc
attc
ctcc
tctg
catg
gattg
tctc
tattt
ggaa
ggtc
cgtg
gtttt
tatg
tgta
aaat
gaa
agct
gccg
tgttc
ccttt
ccaa
cacc
aagt
ctag
ttctc
tagc
tccc
attc
tatc
agtg
tcat
acct
ccca
gctta
a
966
2739
471
EN
PEP
YE
Sga
tact
gcat
tcaa
acga
aaca
tgtg
gcca
ttctc
tgtg
cggt
ggtg
gtgg
gtgt
agga
ttaat
agtg
ggac
ttgcc
gtgg
gct
967
3725
508
___
NO
ccag
tgct
ttgca
ctgt
agct
gctc
aata
a
968
3946
498
MC
HR
lY
ES
ttcgc
ggtc
gtga
agaa
gtcc
aagc
tgca
ctgg
tgca
acaa
cgtc
cccg
acat
cttc
atca
tcaa
cctc
tcgg
tagt
agat
ctcc
tcttt
ctcc
tggg
catg
ccct
tcat
gatc
cacc
agct
catg
ggca
atgg
ggtg
tggc
acttt
gggg
agac
catg
tgca
ccct
catc
acgg
ccat
ggat
gcca
atag
tcag
ttcac
cagc
acct
acat
cctg
accg
ccat
ggcc
attg
accg
ctac
ctgg
ccac
tgtc
cacc
ccat
ctct
tcca
cgaa
gttc
cgga
agcc
ctct
gtgg
ccac
cctg
gtga
tctg
cct
cctg
tggg
ccct
ctcc
ttcat
cagc
atca
cccc
tgtg
tggc
tgta
tgcc
agac
tcat
cccc
ttccc
agga
ggtg
cagt
gggc
tgcg
gcat
acgc
ctgc
ccaa
ccca
gaca
ctga
cctc
tact
ggttc
accc
tgta
ccag
tttttc
ctgg
ccttt
gccc
tgcc
ttttg
tggt
catc
acag
ccgc
atac
gtga
ggat
cctg
cagc
gcat
gacg
tcct
cagt
ggcc
cccg
cctc
ccag
cgca
gcat
ccgg
ctgc
ggac
aaag
aggg
tgac
ccgc
acag
ccat
cgcc
atct
gtct
ggtc
ttcttt
gtgt
gct
gggc
accc
tact
atgt
gcta
cagc
tgac
ccag
ttgtc
catc
agcc
gccc
gacc
ctca
ccttt
gtct
actta
taca
atgc
ggcc
atca
gcttg
ggct
atgc
caac
agct
gcct
caac
ccct
ttgtg
taca
tcgt
gctc
tgtg
agac
gttc
cgca
aa
969
2818
169
___
NO
agac
acag
acgg
gtgg
accc
aagc
attg
aggg
c
970
3092
610
UB
XN
8Y
ES
ttttg
ctttg
tggc
cgga
ttttg
ctac
tgct
tgct
cttc
tta
971
2560
754
___
NO
gccc
gcta
tctg
catc
tgcc
ttttg
gact
ccttg
gggg
aaat
aaaa
caat
cgtg
taca
gttg
ggag
gctc
ttcgt
tttca
ccat
cagt
tgac
tcca
tcag
t
972
3850
324
—N
Oga
gttc
aggg
agac
gcca
gaca
attg
cagc
agttt
gtag
agtta
tgtg
ggtc
aggc
tagg
atta
gtttt
gctg
cagg
aata
agca
atcc
cccc
aaat
ctta
atca
aggt
ttacc
taaa
atcg
gccg
agca
ca
973
3209
807
NO
cttc
tctg
aaaa
gacc
agcg
gccc
tatg
acta
tc
974
2639
722
KA
LR
NN
Occ
ttcttc
tcta
ctgg
gtgc
aattc
gagg
ttgct
gagc
ttctc
tcca
aagt
ctaa
aatg
gtgg
ggca
gtag
ggac
ctgt
gaga
ggcc
caat
ggcc
caat
gtac
ttc
cccc
agat
ccca
ctca
gaac
agca
ggta
cgcc
cagg
ctcc
tgct
gccc
taga
ggtc
tgca
acat
gagt
gaag
aggt
taat
taga
ggga
caca
ctta
tct
ctga
agttt
ttctc
cagg
ctga
acat
ttcta
ttatc
agtg
gccc
ctaa
tctg
gaaa
aacc
catc
attc
taat
ctag
cttg
tatc
ccca
catc
atga
gaaa
gagg
gaag
aaga
agaa
ggga
tgat
gtgt
ggta
gaga
agat
gagg
gtta
acttt
agcc
tttcc
caaa
cact
ggca
acaa
ccac
ttcct
caac
aattt
ttcta
tttgc
ttcag
ccta
ctca
gattt
tttca
ggttt
ttcta
gctc
ctcc
ataa
cctc
actc
cctc
ccga
ggtc
tctg
gttc
aaag
acca
ccct
acca
gccc
ctat
ttgct
tcag
gtta
tcc
tgttg
aggg
tggg
tggg
aaga
gtga
gaag
atat
aaat
gaaa
aact
ggcc
acat
gttg
ataa
ttgttg
aaac
taga
tgat
ggta
cgtg
gaca
ttcat
tata
ccat
tctc
ccca
cttc
tgta
tttgt
ttgca
atttt
tcat
aata
aaaa
agttt
taaa
agtc
ctcc
agttt
ccaa
caca
ctca
agag
agag
cccc
aacc
ccaa
acac
agag
tttc
atgg
aaac
ccca
cacc
aagg
caag
aggc
agag
atga
tgat
tcca
tttca
tata
taca
ctca
ttttc
taac
tttttt
taaa
ggcc
cact
gcttt
atttt
caat
agat
taaa
cctg
atttc
tgag
aggt
cctg
aagt
tggg
ctta
tttcc
ctgg
c
975
2972
483
—N
Ogt
gatg
acta
tggg
ccag
gaca
aca
976
2518
314
ITG
A4
YYEE
SSgc
agttg
gtgc
ttttc
ggtc
tgat
tctg
ctg
977
3680
796
___
NO
tgtc
tgct
tcaa
cctc
acac
ttctc
tcag
978
2329
274
—NN
OO
ctgt
ggga
agga
ctct
tcta
ggttt
tgga
ctct
ctaa
ttcttc
tgct
gaac
ccag
tgac
tgtg
taaa
catc
caga
ggcc
cacc
ccac
ccaa
agaa
tgtc
agac
t
979
3379
673
CPT
lAY
ES
ttcag
gcag
caag
agcc
ggca
acgc
catc
catg
ccat
cctg
cttta
cagg
cgca
aact
ggac
cggg
a
980
3549
257
___
NO
tgag
tcct
ccat
tcca
aact
ctcc
t
981
2360
474
FLA
Dl
YE
Sga
catc
tagg
gcct
ctga
acttt
ctcc
gggg
cgca
gcgt
gacg
gctg
gcat
catc
attg
ttgga
ga
982
2584
988
SCN
3AY
ES
aaaa
taag
atgc
ggga
gtgt
ttcca
aa
983
3630
231
LC
TL
YE
Stc
acat
gcaa
atgg
ttacg
gaga
tcgt
ggta
ccca
ctgt
ctgc
tccc
tctg
tgtc
ctca
tcac
tgct
gttc
tact
aatg
ctcc
tcct
gagg
984
3801
606
___
NO
ctgg
tgcc
aggg
ctcc
tctc
actg
ccct
acac
ttgca
ggga
ggtg
gact
tccc
tccc
985
2476
381
NO
agca
tcac
atat
tcag
ggga
ccttg
aaaa
gtttg
gtgt
gcct
gagc
atag
tgta
gctg
gaag
gtag
tggt
ggga
gatg
agcc
tgga
gagg
aaat
cagg
ggct
agaa
gata
aaca
ggaa
tata
tcac
agag
ttact
tacc
ttaaa
agta
atgt
tgag
ccac
tgaa
aggt
tttca
gcag
ctct
catc
aggt
ttgca
ttata
gaaa
gatg
gtct
tcag
gtgt
agttg
aatg
gttg
gagg
ggaa
gttg
gaac
aaga
ccaa
acta
ccat
ggaa
gaga
ctttc
atct
gtttt
gtct
cagc
agat
agag
ctgt
atat
gtct
tctc
cctg
tcct
actg
ttttg
gggg
actg
agat
tttgg
tact
catc
agca
ctct
tgct
ctca
tatta
tgga
OO
986
2522
188
LO
C26
0N
Ogg
tgct
gacc
caat
tcgc
tgcc
aaaa
gagt
gtca
atca
gaat
atac
aaat
cccg
tatg
gttg
tgtc
atcc
tctc
ttaat
cattt
ttact
aattc
taat
aatc
agct
ct
10ag
cttg
cttc
ataa
ttttc
atgg
ctttg
cttg
atct
gttg
atgc
tttct
ctca
tcaa
gact
ttgca
gc
987
2673
516
UQ
CR
CY
ES
atgc
cagt
gtgg
tacg
tgag
atct
gctc
caag
taca
tcta
tgac
1
988
3659
195
NO
gaag
aagc
agtg
catc
tggg
tggc
a
989
2652
475
NO
tctg
tccc
tctg
ctag
ttagt
tctg
taca
gtta
tccc
caca
gtcc
tcttc
ctaa
agta
gaca
ttact
ttcttg
ttcag
gtct
agag
ttggt
tgcc
ttcgc
atgt
agtta
aag
ttctg
tagc
atgg
tgta
taat
atcc
ataa
cact
caac
cctg
ctta
cctc
tcaa
gagg
cacc
tttac
ctgt
cacc
ccac
agtc
tatg
tgtta
aattt
ctgt
tccc
taga
tacc
ttggg
cctc
tttct
aaat
acac
acttt
catg
tcca
ttattt
cccc
ctgt
ctgg
aata
ctct
agag
ccttt
cattg
acaa
ttcac
ttgttc
tttttg
gtgc
ccct
cct
caat
gtga
ccct
tttag
aacc
ttcat
caca
gaaa
tagt
tgttc
tgttt
tttca
gcat
ctgt
gattc
tttcc
ctcc
ctgt
cata
gaat
aaaa
tgtg
tattt
atag
tcat
cac
caag
aact
cata
ttgcc
tcat
gggg
attg
tatc
tcag
catc
990
3198
107
—N
Oat
gaag
cagc
acag
gtac
gagg
tgca
ag
991
2994
022
—N
Ogt
agat
tcga
tggg
cccg
aggc
tcag
992
3548
367
—N
Otc
ctgt
tttct
gcat
tagc
tgttt
gcat
gtat
ttagt
aggt
taga
ggtg
ggaa
ctag
agat
caga
gaat
tgttt
atgg
cagc
agag
ttagc
agta
acttg
agag
ggca
tagc
taag
tcaa
agac
ctac
t
993
2440
185
CO
PAY
ES
ccaa
aaag
gtac
aggt
gccc
aact
gtga
tgag
atct
tcta
tgct
ggca
cagg
caat
ctcc
tgct
tcga
gatg
cgga
ctct
atca
cact
ctttg
acgt
acag
caga
agc
994
3480
073
—N
Oat
gggg
ctta
tcct
tact
cgca
gtca
ccta
aaag
995
3513
910
KPN
A3
YE
Sgg
tcca
attg
agtg
ctgt
ccag
gca
996
3978
139
___
NO
gaag
gcct
cggc
aaag
gact
gtgc
t99
725
4009
1__
_N
Oca
gctc
cagg
gtga
aatg
gtgc
tag
998
2896
688
—N
Otc
ccga
gggc
tgat
cact
ggaa
acag
ggca
acct
ctgg
tgac
ctgg
tttttc
atca
ccag
ccag
ggca
gaag
aagt
caaa
gtgt
gctc
aagg
gcttt
gcac
gctg
ctcc
ctag
aact
gg
999
3022
504
NO
actg
tcttg
gtgt
tggt
cagg
atta
atat
aagg
gagt
g10
0031
0404
4__
_N
Oat
ggcc
tgaa
agct
ttagg
tggt
gcaa
caca
gggt
gaa
1001
3939
535
SLC
2A1
NO
caat
caat
ggtg
agcg
tggt
attc
cagg
ctaa
aggt
aatta
actg
acag
aaaa
tcag
taac
aaca
taat
taca
ggct
ggttg
tggc
agct
catg
actg
taat
c1
ccag
cact
ttggg
aggc
caag
gtgg
gagg
atca
attg
aggc
caga
gtttg
aaac
cagc
ctag
gtaa
cata
gtga
gacc
ccct
atct
ctac
aaaa
aattt
taa
acat
tagc
tggg
catg
gtgg
tatg
tgct
aaca
gctc
tagc
tact
cagg
aggc
tgag
gcag
cagg
atca
cttg
agtc
caag
agttc
aagg
tagc
agta
agct
acaa
tcac
acca
ctgc
atgc
caga
ctgg
gtga
caga
ggga
gact
tcat
ctct
ttaaa
acat
aata
ataa
taat
taca
gact
cagg
aaat
gcag
tgaa
agaa
aaa
taca
ggttg
gcca
ggtg
aggt
ggct
gatg
cctg
taat
ccca
gcac
tttgg
gagg
ccaa
gatg
ggaa
gattg
ctttg
agac
caga
agttt
gaga
ccag
cctg
ggcc
acat
agta
agat
cctg
tttct
acca
aaaa
aaaa
aaaa
aaaa
aaat
tagc
tggg
tgtg
gtgg
taca
tgcc
tgtg
gtcc
cagc
tact
cagg
aggc
tga
aatg
ggag
gatc
acttg
agcc
tggg
aggt
cgag
gctg
cagt
gagt
ccta
attg
agcc
actg
cact
ccag
cctg
gaca
acag
aaag
agac
catg
tttca
aaaa
aaat
aaat
acag
gttg
tagt
gggt
atgg
gtat
gcat
acca
ggag
gccc
gacc
tcgt
ctga
gagg
ggag
tggt
caga
aaag
aattt
tctg
agga
attg
atg
ttttta
atca
acttt
attg
aggt
ataa
tttat
acat
aacc
aact
gcat
ccat
tttaa
gtat
atat
ttggc
gagt
tttga
gttc
taag
tata
gtttt
ggtg
agtg
tata
cact
tgg
gaaa
cacc
accg
tgat
caag
atgg
aacc
tttac
ccta
cccc
aagg
cacc
caca
tgcc
cattt
gcta
tcag
taac
ccac
ccca
gtcc
caga
cctg
gg
1002
2420
794
NO
taga
tctc
caca
tgat
aacc
tgttg
aact
ttttg
agta
tctg
ttgat
caag
aaaa
taaa
gtca
aaag
ctac
aatta
atta
gtaa
tgtg
ctca
aaat
aaat
ttgta
tttt
ataa
atca
taac
atct
tttcc
atgt
tact
ctgt
ctat
atgg
agttc
ttaat
acca
caca
tttgc
ttagg
gaga
tgga
atta
gcaa
ctag
tctg
tgct
accc
tgta
tttttt
tttta
acca
tgta
gttg
gattt
tttaa
aata
caat
aaaa
gagt
tcac
ttttc
cagg
tttga
caac
atttt
tatg
aattg
gaaa
ttttc
tgtg
gcta
caga
tact
atta
acat
tttgg
aaag
gtgt
aaag
aaca
ttgttt
taat
gtaa
ggct
cttc
tcta
acct
ccct
cccc
tctc
caaa
ttctc
cata
atat
gaag
tagg
aaga
tatg
tattt
ttctg
ttgcc
atta
aaat
gttc
tggc
catc
gtgg
gagt
gggg
atgg
ggag
ggcg
ggga
tggg
aatta
gaac
ttact
agaa
taaa
attc
caaa
taga
ggttt
gggg
aaac
ctac
ttgtg
agtg
ctttt
taac
ctca
aaat
aagg
atttg
gggt
gggt
ggag
aagg
tgat
gggg
caga
agaa
gttg
tttttg
gagg
agag
aaag
caga
ttgaa
agg
aaaa
ggac
agct
gata
gcca
ctta
aata
atgt
catc
gtta
gagt
gata
ccta
tttttt
gtaa
aaag
aaaa
caat
atgc
tcca
ttctta
caaa
gcat
tgat
tgta
tttt
aata
tata
caat
gtaa
atac
tatc
atgc
agtg
gtgt
ccaa
ggga
gaga
atac
agtc
atga
gttc
ttagt
ttctg
ttattg
gttg
ggcc
aata
aagc
ccct
tcat
catc
cttc
ttttc
tgct
tatc
acta
gaga
caga
aact
aaaa
acca
tggc
ttcag
gctg
ctaa
aagc
ctaa
aaca
aaac
agaa
caac
aaca
acaa
ataa
ggca
ggt
tgga
caag
cttg
ctga
gtat
ttggg
agcc
tttttc
1003
2444781
—NO
taat
gggg
ccaa
aggg
gcaa
caca
aagc
attg
aaaa
catc
actg
gctc
acaa
aaac
agtc
acct
tgtta
ccttc
tcag
ttgca
tttgt
ttattt
caca
aggc
ttcat
tcac
acat
aaaa
acaa
gata
ctaa
tcca
attc
aagt
tcat
aacg
atta
taaa
agta
aaca
tttgt
tggg
acaa
tgta
caat
aaat
tgca
ctttt
taga
caag
cat
taca
tttac
attta
taga
gtgt
acta
taca
taat
acat
ggaa
ttacg
gaaa
cgtc
taat
tggt
cattg
1004
2537266
—NO
tgat
acac
ttctg
aacc
acag
ccag
ggtc
cagg
cctg
aaca
gatg
tcag
catc
ttcag
ggag
ctttt
gcca
ggac
gccg
ccca
gcca
catg
gcca
ccac
gttc
attc
aaac
acgg
ggct
tcct
gtca
aact
acac
taga
1005
3061935
—NO
aaga
ggtc
accc
ttgcc
gtttg
atgg
aatg
ccta
ttattt
gata
gctg
ctga
tacc
acga
taac
acag
agat
tccc
tttca
atta
aaag
tgcc
tgga
cacc
acag
ccag
acaa
cttc
tgtc
ccac
caca
tgta
tact
gaat
ttcag
tcttc
ttaca
tctc
atttc
aaat
atat
tttga
aata
tattt
ccag
tttat
ataa
atag
tata
acac
ctaa
caat
tccc
attta
tttgc
aaat
atta
ttgtg
agga
tcga
ctgg
gaca
taaa
tg
1006
3893061
—NO
ttcct
gtcc
agct
aaag
atgc
tgcc
tggc
cgtg
gcct
ccac
gctc
tgta
aaca
tgac
ccac
accc
ccca
gggc
actc
cacg
gaag
ccttg
gaga
taag
ggac
acat
cact
tcta
tgct
cacc
tgaa
gtga
cccc
attg
ttaaa
ggac
acag
ctca
actc
aatc
aggg
cacc
agca
gcga
ctcc
aagt
cagg
gtgc
aaac
tct
ggag
gctg
ctgt
gtga
gggg
ggca
ctag
agat
ggtg
gtaa
acag
aaag
gggg
gcttc
cttc
acca
gcca
ggtg
accc
cagg
caag
tcac
aagg
aacc
gctc
tgat
cca
1007
3996137
—NO
ccat
ggaa
tcct
gttg
gagc
tggt
ctac
agaa
1008
2424640
DPYD
YES
tggg
agaa
ttgttg
ctat
gcag
tttgt
tcgg
acag
agca
agat
gaaa
ctgg
aaaa
tgga
atga
agat
gaag
atca
gatg
gtcc
atct
gaaa
gccg
atgt
ggt
catc
agtg
ccttt
ggttc
agttc
tgag
tgat
ccta
aag
1009
2823618
—NO
ttcct
gcta
ctgg
ttggt
taat
gagg
cact
gagc
tagg
aacc
ctgt
aagt
gttc
aagt
gtta
aaat
tgtg
ttgttg
tacc
tttaa
atat
aagg
agaa
gaaa
ttggg
agag
atca
gatta
aaat
atta
aata
ctta
tatg
gttta
gaaa
agtg
acag
cata
aggg
agat
caac
cagg
tttaa
gggt
aaat
actta
ggtg
acca
gaag
atct
acag
taaa
cttg
agga
cata
tccc
acca
agac
ccac
aact
cagc
tcca
gctg
attg
tttcc
acaa
tagg
actg
aatg
cctg
c
1010
2491474
—NO
atct
ggcg
tgga
gatc
gttc
aaag
gtgg
gaaa
ctac
ggga
ggaa
ggta
ctca
ggtg
ttgag
tgca
gcca
tggg
gcca
cttg
aatta
gcat
ccag
gcag
cccg
cgcc
cctc
ccca
gaga
caat
cagc
ggtg
tttca
1011
3780190
—NO
ccca
tgat
gtct
gtcg
tgca
gatg
aatg
tgtc
cacg
cgtg
ctgg
gcta
gtac
agaa
gatc
g1012
3443271
—NO
ccac
agta
tggt
cattc
agcg
aaag
aagt
acca
agta
tttct
ttctg
ttaat
gagt
caga
ggta
ttaat
atat
gtat
gtga
gtcc
ccca
tttac
cctg
cgca
agat
aagt
tcttt
taaa
tgca
atta
gaat
atcc
taag
ataa
atta
caaa
ctcc
tctta
tgta
tcct
tttct
ctga
ggtg
aaat
gaga
cact
gcac
agat
gaga
ggta
ctat
t
1013
3249840
—NO
gcac
tgaa
cgct
aatc
ccttt
ttcta
tttcc
gcttc
tcac
aggc
acag
gaac
acag
gcac
aaat
gcac
acca
caca
1014
3365366
HPS5
NO
gggg
ctac
ttttg
tcag
tgtc
tgta
ccct
tggc
atcg
gcat
ctgt
gact
cttta
tcca
tgac
ctca
gtgt
ttctta
acca
aagt
tgta
ctca
gcat
ttctta
acca
aagt
tgaa
ttttg
aaaa
gagt
cagt
ccttg
tttgc
tgga
atta
gaat
gtta
atgt
ccta
gtat
tattc
cgaa
ctac
agta
ttaac
tgct
tgttg
ctag
tgga
ttaga
caga
ttcttt
tctta
ctgt
ggct
tcca
tgttg
ggag
caga
agct
tttca
tcct
ggtc
acat
gaag
acag
atgg
tatta
ttgac
tgga
gttg
aatta
ttttta
tatc
ttgtc
tgg
caca
atat
ggaa
atta
ctga
aata
agac
ggtg
tata
atgg
aatta
acac
ccaa
aata
agta
gaac
actg
aaga
tttga
atttg
atat
ttaag
taaa
atgg
gact
ggg
tgca
gtgg
ctca
ggcc
tgta
atcc
caac
ccttt
ggaa
ggta
aaga
cggg
agga
tcac
ttgag
gcca
ggag
ttcaa
gacc
agcc
tggg
caac
atag
tgag
aatg
catc
tcta
caaa
aaat
aaaa
aaaa
ttagc
tagg
cata
gtac
ctga
ggcc
agga
ggtc
cagg
ccgc
aatg
agat
gtgt
tt
1015
2883
453
AD
AM
lN
Oct
ggct
gtat
ggct
gcat
gtga
caag
ccac
gtcc
cctc
ccac
ctct
cccc
aaac
ccct
gcat
ccct
gtat
tcac
acgg
gtca
ctct
gact
ca9
1016
2924473
—N
Oag
cagg
gact
cagc
gtga
ggct
tgaa
gcaa
acag
tgga
atct
ttaga
agcc
tgct
ttgtg
tgc
1017
3378661
—N
Occ
ctgg
gtgt
aacg
aggc
cattt
tgca
acta
catc
cagc
cagg
ctga
aaac
actta
cttta
ttccg
agtc
catc
agaa
ttcttg
aaaa
tcag
agga
tctc
tcaa
gcca
cccc
gctg
aata
tact
ctac
atta
aaat
ctgt
cacc
atag
gaaa
atat
taga
aaga
taag
tggt
ataa
ttaaa
aacg
ataa
aagg
acca
taaa
aagg
atcc
taag
taat
acag
tata
aatg
gtgt
ttgtta
ttctg
tgta
tata
gtta
tgct
gtta
gtca
ataa
aata
tatg
ctta
gtaa
gaag
catc
tctta
gaaa
atat
ggag
agc
acat
tgag
aagc
aacg
cttg
aggg
gtttt
cca
1018
3754749
DDX52
YES
aggc
tggg
tgtc
ctgt
acca
gaat
acat
aaaa
g1019
3150460
—NO
gctg
ccca
cggg
acttt
gcag
gagg
1020
2664515
—NO
ttctc
tttag
actta
ccag
ttagg
ctct
aattt
agga
aaaa
taaa
acaa
aaac
ttaca
tggg
tttgg
agta
aaat
tagc
tgga
gtaa
aatc
caag
gcta
gattt
cta
ttgaa
1021
3123616
—NO
agga
gggc
ccaa
tatc
cctg
ggtc
g1022
3432252
—NO
agtg
gtca
aacc
caga
tgag
ggtc
ccgc
ttctg
gttta
tata
caat
aa1023
2593632
—NO
ggag
cggg
aagg
agat
tgac
tgtta
gtttc
tta1024
3054956
—NO
tgtg
ccag
ccac
agta
cgat
gtac
ctcc
tcca
atct
atgt
gtcc
aaat
cagc
gcat
ccttt
aagg
ccct
actc
agat
ggca
cc1025
3096135
—NO
ggga
ccca
gaag
caac
cgtta
tgag
caac
aaaa
ggaa
gaca
ctgg
tttgg
atgg
acgg
gaag
cggt
tggg
tggg
ctgg
actg
acgg
aggc
ccct
ttag
a1026
3541539
—NO
tgtg
ggct
ggcc
atct
atat
ccat
ccat
ctat
ctat
tattc
cacc
catc
cctc
catc
catc
catc
cattt
attta
atta
agtta
ttttta
acgg
ttggc
tctg
gagt
tgac
attc
attta
tttta
aaag
aaca
gtga
atca
tagc
tcttt
tcct
caag
atgc
ttgct
tttat
ttgca
tggt
ggca
tattg
atta
agta
ctca
gaac
acta
tttga
cata
tgg
taag
ccct
caat
aaaa
gtttg
ttatta
ttatg
actg
tact
tctc
ccct
ctaa
tgta
taga
ggta
aaca
ttttg
gttc
ctat
tgtg
acac
gtgg
gtta
taat
gagt
tagg
gcct
actta
agaa
gaaa
gtgg
aaag
aaag
cggg
taaa
attg
ttgct
caga
tact
agca
tgtg
ggtg
ga
1027
3762591
—YES
gggg
cggt
ggtc
atta
attc
tgct
atct
tagt
atct
ctct
ctgt
tttgc
tgct
tgtg
cact
tttct
atttc
tacc
ggtg
tgcc
agct
ct1028
3305557
-_-
NO
acct
gcca
tcct
cata
gtca
ccgg
gag
1029
2394713
TNFRS
NO
tggc
aaaa
gagc
aatc
tgga
tccg
cctta
gcca
gata
cata
aggg
tattt
gcct
tcac
tttca
gcca
gcat
tccc
ccca
gcga
tcct
agcc
agat
atta
cF25
1030
2406204
—NO
ctga
tatg
tatc
tgttg
ggca
agta
gttc
catg
agga
gtttg
tttttg
ttcag
aatg
atct
ttagg
aaaa
atgg
aaat
acca
cggt
gaag
ggaa
agaa
gtca
gaa
tatg
taca
caaa
gcct
a
1031
2722321
TBClD
YES
tgttt
ttccc
ggga
taca
tctg
tgttg
agtc
acttt
gcat
tcaa
cagt
gcct
cgcc
acca
aaat
cata
cata
a19
1032
3532235
NO
atag
tgtta
tatc
cgtg
ctgc
cata
tcac
taaa
atag
gcttg
ccaa
ggca
ggtg
aggt
gtat
gaat
gctc
aagc
ctca
caga
actg
caat
caag
tgcc
aact
ataa
ataa
tact
gaaa
aaag
ttgac
catc
tgac
cagt
ggat
aata
ctttc
aagg
cattc
aatta
gctta
tcct
ttgca
gtat
tcta
agct
attc
acat
tgac
gatc
acat
acat
tgta
gtgc
ttgct
gcaa
ggga
ggca
tata
acca
gttg
ttttg
gcta
aaat
atga
cagg
aagg
cattc
cttg
ggttc
tata
taaa
agta
acag
tattc
aaca
gtct
aatg
gcaa
tcac
tgat
aggc
tgct
taaa
1033
3053937
NO
tgca
cgct
ggaa
tcat
ggct
cattg
cagc
ctcc
aact
ccta
ggct
caag
ggat
ccttc
cacc
tcag
cctc
ccga
gtag
ctgg
gagt
acag
gtgt
tcgt
cccc
acca
tgtc
tggc
taat
ttaaa
aaaa
tgtc
tttgc
agga
tggg
atct
tgct
atgt
ttccc
aggc
tggt
ctca
aact
cctg
ggct
caag
tgat
tttcc
catc
ttggc
ctcc
caaa
atac
tggg
atta
tacg
tata
agcc
acca
cgtc
tggc
ctga
gtca
tctc
ttttg
accc
agaa
caaa
acca
gtcc
agga
gtgg
ctgt
actc
ccca
gga
gtgt
acac
tggg
gagg
ccca
cagc
tccg
gggt
gcag
tggg
gctc
ttgca
attta
atga
ctga
ttcat
cttc
aaaa
aagc
acaa
tctc
agcc
aggc
g
1034
3304915
NO
tggg
agac
tgac
ctaa
ctga
gcat
caac
tatg
ccaa
gcac
tgcg
caag
gtgt
cgag
gcta
tagg
aacg
tgcc
tttct
accc
taga
gaag
tttgc
agtc
tgct
gtgg
gtga
tgag
ctat
atag
aaga
atgt
acta
ttaga
tgga
ggga
agtg
aggg
agca
acat
ggtta
cctg
agga
agag
cagt
ccag
gcca
aggg
caca
gca
agtg
caaa
ggcc
ctgg
ggtg
acgg
tgtg
tgct
ggct
gctc
atga
tgca
gtag
ggag
gcct
gtgg
ctgg
gatg
aaga
atag
tgga
gggt
aaag
tgga
aag
aggt
gaga
gcag
agat
gtag
ctgg
cagt
gggg
aggg
aata
gggt
gaag
cagc
acca
tgta
gggc
cttg
cagg
acat
tcta
agga
ttttg
acttt
catc
gagt
gaga
tgaa
gattc
actg
aggg
gttc
taag
taga
ggca
ggga
catg
atct
ggct
tcaa
tcat
agaa
tgat
gggg
attg
acta
ggca
ggat
gggt
tatg
atg
tgca
aggt
cgca
gggg
gctg
ctag
ccag
gtga
gata
ggat
gtgg
gctc
tggc
agca
gtgg
aggg
ggca
aggc
actg
cggg
tgttg
gaca
gact
ctg
atga
ttgga
tgtg
ggcc
tcaa
ggga
gagt
ttgtg
tggg
agct
ggga
aaca
actg
catc
aggt
tgcc
tggg
gtcc
cact
ctta
ggcc
gaca
acttg
4-
1035
3545436
NO
gcgg
tcac
tcag
ctat
ccaa
tggg
atga
caca
cagc
aaat
cagc
tcttc
actg
cctc
aaca
gaca
gtca
ctaa
aata
atttc
tttaa
aggt
atat
aaat
gttg
gcca
gtgt
aatc
ctct
acaa
aaca
cattt
gcaa
atat
aaaa
taat
ggag
gtat
gatg
ctgc
tcag
agaa
tggg
aatg
ctct
ggca
gtcc
ccaa
catc
ctat
ccca
cttg
gggc
tttgc
tccg
aggg
gaag
agca
ggtg
cagg
atat
attg
ccct
ttgat
gtga
tgta
gatg
ctac
tagg
gacc
cact
ggga
agag
tccc
gttc
cttg
agt
tatg
ctac
tgta
ggga
taga
ccct
tggt
gtga
ggaa
tcta
acaa
tgag
tcaa
agga
aaca
tcac
aaag
aagg
tgac
acct
caga
ggag
acaa
agga
ctct
gaaa
tcct
ggaa
tcac
ggga
tatc
tctg
attg
gtta
ggga
attc
ctct
gcac
tcaa
tctc
cagc
aaat
gggc
agat
tccc
tcac
aact
taga
atttt
tcat
ggg
gagt
gctg
gact
gtac
ccca
aaca
tgga
gtaa
ggtg
gcat
catc
catc
tctg
aagg
gcca
tact
cctc
gcaa
agc
1036
2632506
NSUN3
YE
Stg
aata
tgat
agtc
tgag
attg
aggt
ggct
aagg
caga
cgttg
gaat
ctttc
atcc
caca
gcct
ttgat
aaat
gtaa
ttaaa
gtgt
ctga
attg
gatg
gcag
a1037
3333132
—N
Occ
tgag
ttcct
gggc
actc
tttcc
tgag
cttg
cttg
gggc
tgat
ttgaa
ttcag
1038
3459329
—N
Oct
ctgg
atttt
gcca
ctca
caca
gttg
1039
3597530
APHlB
YE
Sgc
cagt
gaag
gtttg
aaga
gtat
aaac
ccag
gtga
gaca
gcac
cctc
tatg
cgac
tgct
ggcc
tatg
1040
2757682
—N
Oaa
cagt
gcaa
gaaa
ggac
gcat
cact
ctct
tctg
gtgc
cgat
gtgc
tctc
ttgtta
a1041
2811303
—N
Otc
acgt
ggct
ttgtg
tacc
aggc
gtcc
aggg
agca
atta
ggag
ggca
atct
gtta
agtc
ttttg
cctc
aggg
caga
acca
ttagg
tgtg
ctgt
gtat
gtgc
ctc
aagg
ggtg
taaa
taac
taag
ctat
tcat
gtac
tcat
cagt
ttcct
ttcaa
ggca
tctg
tgaa
acct
ttaat
ctgt
ggaa
gaat
caga
gaaa
gttg
acct
attg
tatt
atta
tcct
ccct
cacc
tgaa
tattt
tga
1042
3611
032
TT
C23
NO
gagt
ccttt
tgtc
ctgc
ggtc
cata
taag
aggg
tctg
gatc
tgga
gaca
ctgt
agaa
cacc
aagc
agct
atca
tgag
gcta
tgga
agtg
tcgc
tgag
agca
gga
aact
aagg
tctc
ccttg
aatg
agag
aaag
atga
cctc
attc
ttagg
ccct
gctg
tgat
ggaa
ttagc
atgc
tcct
ggga
tgtc
tcct
agtg
actg
acct
ttgg
tttgg
atgt
tctg
aggg
atct
aggc
aaaa
gtga
acag
tttta
gcac
tattt
caca
aaac
tctc
tgtg
acta
tctg
tatg
caaa
aaat
agat
ttaaa
agta
gtat
tttt
aaaa
ataa
aggc
aaga
acct
tggt
actg
acca
gagg
atgg
gctc
cagt
ttcta
ttttta
aatg
tggc
ccttg
ttata
gcct
ggaa
aaga
tttaa
aaaa
atag
tttt
gtttt
ttttg
taaa
caca
tact
ctac
tttta
ttaat
atat
aggc
tgtc
tata
ctct
tgac
taca
cagc
aatg
tctta
gtta
ttagg
aacc
aaat
attc
agct
tatg
gact
acc
cagg
gctg
gcttt
ggct
tgat
ttctc
ccttc
tgct
tact
ttaag
tcttg
ccat
ttgat
tgct
tgta
gtag
gcta
gggc
aaat
caag
gaga
ggag
aaag
atta
aagt
tgtg
ccca
aaat
ggcc
aaaa
taat
tttga
gaaa
gaat
gagg
tgga
gagg
tgga
ggaa
cttg
cttta
ctga
atgc
caaa
gacc
ctttt
atttt
tttaa
ataa
agt
aatta
agac
agca
tgttt
gtca
cccg
gggt
agac
aaaa
agac
cagt
ggaa
cagg
gcag
agag
ccca
agcc
ttgac
agat
gaca
gaaa
tggc
cttg
cggg
attg
gtga
ggaa
gagg
tggg
ctag
tcaa
taaa
tggg
tact
ggga
ccgc
tggt
tttcc
aaat
ggaa
gaaa
atgg
atca
ctac
ccca
aatc
aaac
acaa
atat
taa
gctc
aaga
aaga
tttga
agga
ctaa
gtgt
gaaa
tgca
aaac
ttgga
gcttt
taga
tgaa
aata
ctgg
aaaa
tata
ta
1043
3977071
—NO
gtgg
ctgg
acca
tgcg
gaac
tgga
t1044
3363301
DKK3
YES
tgat
ggag
gaca
cgca
gcac
aaat
tgcg
cagc
gcgg
tgga
aga
1045
3644748
___
NO
tggg
ccac
gtgg
tgct
ctct
gaag
ttaga
a1046
3799937
___
NO
catg
cagt
cccc
aagg
ctat
cggt
cctc
ctca
g&
1047
3336773
FBXLIl
YES
gagg
agct
tgcc
aaca
gcga
tccc
aagt
tagc
cctc
actg
gagt
tcct
atag
taca
gtgg
ccaa
1048
2822962
___
NO
tgtg
agaa
caga
ctcg
gcag
tgac
atcc
actc
caaa
tgga
1049
3627325
___
NO
ccag
acct
gagc
cacg
tagc
tatc
aaac
ac1050
3780263
___
NO
catc
tgaa
atta
atgg
gact
aatg
taag
gattg
atat
tttca
tattt
gctc
aattg
catc
ctgg
gtta
agca
cattc
aaca
g1051
2656010
VPS8
YES
atgg
cgct
atct
ctgc
cctc
agta
tc1052
3028587
___
NO
tcac
ccag
gattc
tcct
gtac
ctgc
tccc
aatc
tgtg
ttcct
aaaa
gtga
ttctc
actc
tgct
tctc
atct
ccta
ctta
catg
a1053
2419066
ZZZ3
YES
gttg
ccttt
gagt
gatg
gtcc
agaa
g1054
3081481
___
NO
atgg
ccac
ctgt
gttg
acgt
ggag
a1055
3935290
___
NO
atga
tgaa
agcg
tgga
ggta
ccag
ct1056
3051677
ECOP
NO
ttgat
gtgt
gaac
gctg
acct
gtcc
tgtg
tgct
aaga
gcta
tgca
gctta
gctg
aggc
gcct
agat
tact
agat
gtgc
tgta
tcac
gggg
aatg
aggt
gggg
gtg
ctta
tttttt
aatg
aact
aatc
agag
cctc
ttgag
aaat
tgtta
ctca
ttgaa
ctgg
agca
tcaa
gaca
tctc
atgg
aagt
ggat
acgg
agtg
atttg
gtgt
cca
tgct
tttca
ctct
gagg
acat
ttaat
cgga
gaac
ctcc
tggg
gaat
tttgt
ggga
gaca
cttg
ggaa
caaa
acag
acac
cctg
ggaa
tgca
gttg
caag
cac
agat
gctg
ccac
cagt
gtct
ctga
ccac
cctg
gtgt
gact
gctg
actg
ccag
cgtg
gtac
ctcc
catg
ctgc
aggc
ctcc
atct
aaat
gaga
caac
aaag
caca
atgt
tcac
tgttt
acaa
ccaa
gaca
actg
cgtg
ggtc
caaa
cact
cctc
ttcct
ccag
gtca
tttgt
tttgc
atttt
taat
gtct
ttattt
tttgt
aatg
aaaa
agca
cact
aagc
tgcc
cctg
gaat
cggg
tgca
gctg
aata
ggca
ccca
aaag
tccg
tgac
taaa
tttcg
tttgt
ctttt
tgat
agca
aatta
tgtta
agag
acag
tgat
ggct
aggg
ctca
acaa
ttttg
tattc
ccat
gtttg
tgtg
agac
agag
tttgt
tttcc
cttg
aact
tggt
taga
attg
tgct
actg
tgaa
cgct
gatc
ctg
1057
2699603
NO
aaag
aagg
atct
gtca
gcca
agtc
ttttc
t1058
2836754
NO
agtg
tggc
cgttt
gcat
tact
gctg
gaat
tttga
agac
tact
acat
attc
cgtc
agtg
tcag
gattt
tgttc
ttcag
tgtg
gtac
tttttt
tttttt
tttttt
aaga
tgga
gtct
cact
ctgt
cgcc
tggg
ttgga
gtgc
agtg
gtgt
gatc
ttggc
tcac
tgca
acct
ccag
ctcc
cggg
ttcaa
gcga
ttctc
ctgc
ctca
gcct
cctg
agta
gctg
ggat
taca
ggca
tgcg
ccac
caca
cctg
gctta
tttttg
tattt
ttagt
agag
acgg
ggttt
cact
atgt
tggt
cagg
ctgt
tcttg
aact
cctg
acct
cgtg
atcc
gccc
acct
cagc
ctcc
caaa
gtgt
tggg
atta
cagg
cgtg
agcc
accg
cacc
cggt
agtg
tggt
acac
tctta
aaat
cata
gttc
ctgc
tgtc
tg
1059
3504560
—N
Ogc
atgg
ccat
ggat
cggg
caag
ttggt
tttac
t1060
3924596
PCNT
YE
Sag
acct
gaag
gcac
aatc
acaa
gaag
agat
cagg
cgct
tgtg
gtcc
cagc
ttgat
tctg
ccag
gacc
agta
gaca
ggaa
tt1061
2408919
—N
Ota
ggtg
acgg
attg
ttagg
tgca
gtaa
acc
1062
3219942
PTPN3
NO
ggac
cagg
cttc
tacc
atta
attc
acat
gcttt
aaaa
aaat
taaa
aatta
aaaa
aatg
aaaa
ataa
aaaa
actta
aaag
atttt
acct
atat
tcct
ggaa
cata
gtt
atct
tacc
ccag
ctttg
tagt
tcct
gttta
agga
aattc
cctc
tgaa
ggaa
atttt
agct
ctca
agag
tcaa
agaa
gatg
tata
gagg
tggt
atct
gtta
gtttt
tgtt
tgttt
gtta
agga
tcta
ggca
gaga
ggtc
atta
cattt
ttctg
aagt
tgag
gtca
cagg
tggg
gtac
ccac
aaga
gagg
tcat
taca
tttct
gaag
ttagg
catc
ccag
gcat
ctct
tcta
aaga
acta
tcct
gtat
tcta
attg
ttaaa
taaa
tatc
ttcttc
attg
tgag
cctc
caca
actg
ctgt
ttctg
ataa
attta
gatc
attc
tttgc
aga
atttt
atat
actg
gcaa
tctc
aaaa
ctag
gctc
cttc
aaga
ttcct
agga
agtc
tccc
aagc
tcgg
gttg
ggttc
cact
attc
tag
1063
3240958
DNMlP
NO
gtca
acaa
gacc
ttgct
ggac
ctcg
tg17
1064
2830809
—NO
gtgc
agat
tgtta
atcc
ctag
tgtg
cagg
gata
ggga
ggga
gagg
aaat
gtcc
acac
agac
gagg
agaa
ggcc
tgta
aatg
caaa
tttca
cact
gttg
atg
gtttg
tcct
gttg
gttg
acca
gcat
tccc
aaga
taac
cagg
tgag
1065
3020549
NO
cctc
atgc
atgc
ttcga
gatc
attg
tttgc
ttcat
ttccc
taaa
atct
acag
ttcct
ttgaa
agta
gact
ctgg
cgtta
tcac
ctttc
cccg
gtct
ttttg
cagt
ttcct
gttg
acct
ctga
gcca
ttctc
cctc
actc
ttgttt
cact
ctgt
ttgca
acac
tacc
tcttc
ctgt
atcc
tggc
ctct
cattt
ctca
caga
tcat
catg
ctgt
tcttg
aaac
atttt
attta
cttg
atttt
agaa
aact
atac
tctta
gtttc
ccta
aaat
ctca
taat
ctct
ttgcg
ggat
tttcc
tcat
ctttg
tgac
ttcca
aaat
ttgaa
gtgt
cccc
tgg
ccag
tcct
gggt
ggct
acat
tcaa
tctc
1066
3282482
—NO
aaaa
gaag
tcgt
ttattg
gtgc
atgg
gaca
ctct
ggga
ggta
gtat
ccat
ttgga
tgtc
ctgc
tgcg
gttg
gaga
catg
gaac
aaaa
1067
3192070
—NO
cctg
tcgg
tgga
ctga
atgg
gcca
agtg
ttcag
aagt
ccttg
ggaa
gagg
atag
tgtc
ctga
atca
taat
gtgg
tattt
tc1068
2473609
—NO
aaag
tgtc
ctat
tctc
gggg
tggc
aggg
aggg
gggc
aggc
caca
aagg
acat
gagt
gtta
ggaa
aaga
agaa
agca
cagg
gaat
taag
gcag
gaag
gga
tagg
ctaa
ctac
tgttc
tgtg
caaa
tgca
attg
ggttt
atga
tcag
gact
gact
gccc
ttacg
tata
aagt
tgct
aacc
ccca
agac
ctga
aggg
aaaa
gtc
ttttg
aagc
tgcc
agttt
tttga
tcat
tcaa
caag
aaga
cctg
aaca
ctga
gaat
ccttt
ttctg
gattg
gttc
catc
aata
ctttg
tccc
tgaa
atca
ggaa
atac
cttg
ccat
tatg
ggac
agcc
cttta
aagt
tcttt
tgat
gttg
gaca
gtgc
ccct
ggcc
accc
agaa
cccc
atga
gttta
acac
cgaa
aaca
ttgaa
gtgg
tcta
cttg
cccc
cgaa
caca
gtct
ctaa
ttcag
catc
taga
tcag
ggga
tcat
gagg
acct
ttaag
gtta
ataa
caca
cggt
actc
taac
ctca
atag
aaca
tcat
gaa
aatc
tgga
ggaa
ttaca
ccgc
tgaa
gatg
1069
2814
779
NO
actc
tgct
gact
gtaa
tggc
tctc
acgc
agttc
ctaa
tggg
cact
gtgg
aacc
actc
agca
taga
cagg
cata
ggca
gg
1070
3558342
—NO
gccc
agtg
aagg
ggtg
acag
aact
agcc
1071
3611698
LRRKl
YES
atct
cctg
ccag
atca
cgga
gctc
gacc
tttct
gcca
actg
cctg
gcga
ccct
cccc
tcgg
ttatc
ccct
gggg
cctc
atca
atct
ccgg
aagc
tgaa
cctc
tcc
gaca
acca
cctg
gggg
agct
gcct
ggcg
tgca
gtca
tcgg
acga
aatc
atct
gttc
c
1072
3701429
NO
tgcc
gcttt
tccc
tcca
cttg
tgta
acag
cata
ctca
ggttt
atac
caac
tggg
agac
acgc
aggc
atct
cccc
tcac
aagc
agct
gtct
actc
aattc
tgta
gtt
atgt
ctgc
taag
taat
caga
atgc
acaa
tagg
ttttta
gcat
ctgt
atct
catc
ctca
gagt
agag
agta
cctg
tttta
cgag
agaa
aact
gaga
aaac
ggtc
tct
cagg
gcct
ctgg
ggca
gcct
tcaa
ttgga
cagc
agta
gaat
ttcgg
tcca
gtttt
caaa
tcttt
tctta
ttgttg
gattg
acta
tgcc
atcc
tgca
aaag
gaaa
atta
cttta
aatta
ttattt
caaa
aaaa
caac
acaa
cagg
ctgg
agtg
gaaa
cgga
atga
ctga
aaac
gagc
acat
tttgt
gaaa
tact
tttaa
ttaag
agttt
ttcca
tcta
gtgt
cagc
agta
agag
aaaa
agag
tatg
agta
gatta
aacg
aaat
gtgg
tgag
aata
ctaa
gcaa
gact
ctaa
aattc
tgca
ttcat
ttgac
catta
ctg
aattc
ctct
tagt
caat
ttaca
ttcca
ggga
gatg
cccc
ccttg
tatg
ttcct
aggc
tgtg
aaat
gctc
ca
1073
3933
333
C2C
D2
YE
Sttc
cgcc
ggcg
gcat
caac
agaa
agac
ccag
gcat
gagt
cagt
caca
caat
gacc
ttgtg
ttcct
ggag
cagc
caga
gggt
tccc
ggag
gaaa
ggca
tca
1074
3799
749
—N
Ottg
gattc
cact
aatc
accc
gatc
accc
c10
7524
8713
1A
NT
XR
YE
Sga
aagc
tgca
ctcc
aggt
cagc
atga
acga
tggc
ctct
ctttt
atct
ccag
ttctg
tcat
catc
acca
1
4-10
7631
8771
9—
NO
gggc
aaac
ttcag
cctc
atta
gata
gagg
ctgg
acttc
cgga
agag
ctgg
gaag
gggg
acca
tagt
ttctg
ggga
gggg
tgga
gaac
tcac
attta
ccta
gctg
ctgg
ccag
gcta
ttttc
atac
ccac
agac
cgat
ttaat
tctc
aact
actg
tact
cattt
agta
gatg
agaa
gaat
gaga
cattg
aaaa
catg
caca
gtgg
agg
tggg
aatg
aaag
ccag
ctct
ccaa
ctct
ccag
tcct
ctttc
ctgt
cact
gcat
cagg
ctgc
aggg
tgaa
gggg
aggt
ctgg
gata
caaa
gaga
actta
gag
gtgg
agca
gttg
gattc
tgtg
cagt
gcta
ggag
ggag
gaga
gggg
ttgga
gtag
gtgg
gaag
tagg
cctc
cttc
agttt
ggat
agca
cttc
cctta
acc
caat
gact
ctag
tggg
aggg
tgga
ggga
tggg
gatg
ggaa
ggga
gcct
ggga
gtga
ggag
gaaa
ggca
aact
ctgt
cttc
cccc
aggg
gagt
caat
gaa
tagt
acc
1077
3474744
NO
ttagt
gtac
ccttc
ggtg
acag
gtaa
gggc
tttct
gaaa
cgca
ttgtc
ttttta
ttata
gcca
ttcta
gtag
taag
gtgt
gaag
tggt
atct
cata
gtgg
tttgg
gtt
gtat
ttccc
tgat
attg
ggat
gttg
1078
2489696
NO
atgc
agtta
caaa
atcg
gattc
agca
tcag
aaca
g1079
3195395
ARRDC
NO
gtga
gtcg
acag
ccag
ggct
tggc
a1
1080
3497381
—N
Ogg
tgga
cagt
gatg
gtttc
aaag
ggga
aa1081
3130174
GSR
YE
Stc
attg
ttggt
gcag
gtta
cattg
ctgt
ggag
atgg
cagg
gatc
ctgt
cagc
cctg
ggttc
taag
acat
cact
gatg
atac
ggca
tgat
aa1082
3180421
—N
Oga
gagg
tgga
tctg
tctg
gact
tcag
gctg
tgag
aacc
tcac
agca
caga
tagg
atgt
gagc
tttac
ccct
tgtg
ttcag
gccg
ctcg
cacg
gagt
cagc
ttgc
ggaa
gcac
gttc
ccag
gcct
gctc
tgcg
tgta
atgg
tgga
att
1083
2384882
GALNT
YE
Sga
agtc
ggtg
aagc
acat
ggat
ttgtg
cctta
ctgt
ggtg
gacc
gggc
accg
ggct
ctct
tata
aagc
tgca
gggc
tgcc
gaga
aaat
g2
1084
2601621
NO
tctc
ctgc
ttcct
agac
gaaa
tcta
agca
atat
tttat
ccac
ttcttt
ttagt
aaaa
gaaa
taca
tattg
cctg
ttggg
gtca
tgag
ctat
gtag
ggaa
tgaa
aaaa
tttt
ttaaa
ggag
aatta
taaa
gaga
agga
ggat
acaa
gaga
aagg
aaaa
cgaa
agct
ggtg
ggaa
gttg
agcc
atgt
ttatc
tcta
gtgg
aatc
ctta
ccttg
tgttt
1085
2624206
ITIH3
NO
tgga
tggc
ccgg
atttt
atgg
catc
tgga
ac1086
2626149
RPP
14
YES
tcaa
gattg
tgga
gttg
gact
gaat
gctg
caca
gttc
aaac
agct
gctta
tttcg
gctg
tgaa
ggac
ct1087
3466862
NO
tgtc
actta
gtct
actg
ccag
tcta
ctgt
ctgt
gcca
ccat
ggga
aattg
catg
agttc
agac
ctgt
gaaa
aatc
agaa
taaa
gata
aaaa
caca
tttttt
gaag
gtta
cgga
aattc
ttaag
atct
tggt
tttct
tgaa
ctta
ctac
tagt
acta
ctga
tttag
aaat
ggcc
tacc
tcct
atag
tagg
aaag
ggca
aaga
aaac
aata
gtt
aata
ggga
aggc
ttaaa
ggag
cttg
aggg
ctgt
cctg
gcaa
taat
agaa
gaga
tcct
tgttt
tttaa
agga
gagt
taat
gtac
aaag
attc
tgaa
aagt
ctag
aaa
catg
ggta
aaat
tttta
tgat
tgcc
gggc
gcgg
1088
3400762
NO
tcaa
tgac
cagc
gtag
acag
agct
agta
gcag
caat
gaga
tgag
tgac
taga
gagg
taga
gaga
aaac
cagg
aaag
tgtg
gagt
catg
gaag
ctaa
ggaa
gatg
gtgt
tggg
aata
gagg
tggt
gaac
agag
tccc
ctgt
tcttg
ctga
tagg
agag
gtca
aatg
agat
aagg
acag
aggc
gtgg
ccttt
ggat
tccg
aga
tgtg
ttgat
cacc
ggtg
cact
tggc
aagc
atgc
ca
1089
3511392
KIAA05
YES
aatc
ccgc
ttccc
tgta
cttta
tgaa
tatg
actg
ggaa
aagt
ggct
tcttt
gtgg
acttt
tttga
tatc
ttccc
aaga
acag
ccaa
tggc
gtttg
gcac
ccttt
tgtg
64
acag
tggc
accg
ctgg
gaag
tcct
ctca
aagg
tcaa
gtgg
ttctc
catg
agca
gca
1090
3762555
—NO
tgca
catg
tggt
aaga
gcat
ccag
cagg
aaag
aaga
gggc
aagt
tgag
tgtg
cgtg
ggca
tgaa
ggat
tcac
tgta
cgca
tccc
agcc
ctgc
tcat
ctca
gccg
catg
gact
gggc
acac
attta
agca
tgga
gggt
ggtg
gtttt
cagg
ctgt
tttct
agag
ctgt
agaa
ctca
aatg
aaat
gctc
tgag
tctg
tgag
gggg
ggag
gagt
caat
aaag
ggtg
gcta
agtg
catg
gagt
ttgag
gtac
ccgc
tctg
ctgt
aacc
1091
3853067
ILVBL
YES
gcttg
ctgc
tctc
acgg
gaga
acga
ggat
cagg
tggt
caag
gt1092
4015570
—NO
ttcat
ggttg
ccta
gtgc
tgcg
agga
gtgg
ggaa
tggg
gaat
gagt
gctta
atgg
gtat
gggt
ttcttg
gggt
gatg
aaat
gttc
cggc
acta
agttg
tggt
aatg
actg
catg
acttt
gtac
taaa
acta
ctaa
attg
taca
cttta
aatg
ggtg
aattg
tatg
gcat
atga
gtta
tt
1093
2458440
NO
tgca
ctttt
gcct
ccga
actc
tcgt
gttta
atta
1094
3333734
SLC3A2
YES
atgg
gttc
cagg
ttcgg
gaca
taga
gaat
ctga
ag1095
3379257
___
NO
ggcc
gtgc
agct
tcttg
acaa
attg
caaa
ggtg
ccca
cgag
tttcc
aagt
1096
3662461
NLRC5
YES
ctca
acttg
atca
cgag
gttc
ctga
cacc
gtcc
gagc
tcct
ttttg
atct
1097
3670791
___
NO
gtgc
aggt
ctaa
tggg
ctcc
ttcct
tc1098
2689662
___
NO
ctca
tggc
aact
tcttc
cgct
tcct
agttt
ca1099
2698749
XRNl
YES
gctg
ggat
cact
atgg
aagc
aact
atgc
attg
gggg
ca1100
3940626
—NO
tgca
ggct
ttacc
tatg
accg
aaca
tgca
tgga
aatg
ccct
ctga
tgga
ctca
ggca
tgat
acat
acat
caaa
cagc
cttg
cacg
gtgg
ta1101
2577438
—NO
gaga
gcat
ggct
acaa
cagc
aata
tttct
ggaa
acca
gtta
tcgg
ggga
attc
acag
g1102
2885881
...
NO
atgg
actc
atcc
caga
acaa
tcct
cagg
cact
ccga
aatg
caga
tgcc
c
1103
3473848
TA
OK
3Y
ES
atgg
ctcc
agag
gtga
tctta
gcta
tgga
tgaa
ggac
agta
tgat
ggga
aagt
tgat
atttg
gtca
cttg
gcat
cac
1104
3721713
NO
agaa
cctg
gtcg
tgtc
ttgag
aacc
cagt
ccaa
acag
aatc
aggc
ctct
ggac
tggg
agca
acac
tccc
ttcac
ccgc
aaag
attc
agga
aaag
cacc
ccaa
ggac
aagg
aaac
caat
gagg
tctg
ggct
agct
ctgc
agct
ttagg
atac
tagc
tcta
ggga
agga
tttttt
ccttt
ttaaa
cagc
gtat
cact
ctgt
tgcc
tagg
atgg
agca
caga
ggca
ccct
cata
gctc
actg
cagc
ctca
aact
cctg
ggct
ctgg
cgat
cctc
ccgc
ctca
gcct
ccca
agta
gctg
agac
caca
ggca
cgtg
ccac
catg
ctcc
tagg
gaag
gagc
ttgag
aaga
aact
gcca
ggag
tgaa
ccag
ggct
ggct
gctc
tgtg
atgt
tctc
tccc
cacc
tccc
ctcc
agct
ctca
acttg
gtgg
cagg
gccg
gcac
cctg
ctct
ccct
ccta
actc
ccag
cctg
ctgc
tgcc
ccct
tctg
ggac
ccta
atttt
ctgg
acttt
gaga
aatg
ggc
tgcc
cctg
gggg
tgcc
tcca
agag
ccca
tttga
ggga
tcgg
gtgg
ggct
gacc
tctc
tgtc
ttcttt
ggat
catc
gcct
tctc
acac
tgtc
ctcc
ctct
tgat
tct
gaaa
aatg
gtcc
tgct
gccc
atgg
agaa
ccac
agta
agat
agat
ttctc
atgc
agct
agtg
aggg
gact
tc
1105
2924660
...
NO
acca
tgttt
ttgcc
tcca
gtct
gtc
1106
3242466
.-.
NO
atgg
ctat
ggga
atat
cggg
agttg
agga
agaa
aaag
gctc
cagt
ggca
ctga
ag1107
3268238
BTBDl
YES
agca
gctc
acca
ccgg
ctgc
gaga
agtg
gctg
gaaa
tgaa
cttg
gttc
ctct
aggg
ggga
cgca
gatc
cacc
tcca
caaa
atcc
caca
ggac
ctgc
tcc
6ac
aaag
tgct
g
1108
3293847
SPOCK
NO
tgtg
gcat
gcgc
tgac
aaat
gtgt
ccttg
atcc
acac
tgct
cctg
gcag
agtg
agtc
accc
aaag
gccc
cttc
ggcc
tcct
tgta
gctg
ttttc
tttcc
ttttg
ttg2
ttggt
tttaa
aata
cattc
acac
acaa
atac
aaat
tgac
aggt
caaa
atcc
a
1109
2416565
JAKl
NO
aatc
ttgct
ttttg
tggc
ctgg
ggat
tggc
tgtg
tgat
cagc
attg
aaat
gggt
tgttg
cagt
ttgaa
gtct
ggaa
gggg
tatg
tccc
tggg
ttcta
attc
tttcc
ctct
aggt
ttctg
tact
cagc
tcag
agtc
taga
atgg
tagc
tttta
ggat
tgga
aaag
acca
aagt
tacc
taat
tcaa
ctct
ttcat
tttttt
ttcag
tgga
aaaa
ctaa
tac
ccag
aaaa
ggga
agag
agtta
accc
aaaa
atta
catc
tagt
gaat
tgtc
tgat
tagc
tttac
aattg
aggc
attg
taga
gctc
tttga
ggca
ttagt
cttg
gtt
gtct
gagt
gcct
ggct
atta
aagg
ctgt
ctttg
cctg
ttccc
tcttt
ctgt
tagc
tgcc
tgga
agtg
cagg
gttg
cttc
acttt
tagc
acct
tgac
tctg
tacc
tgac
atgg
tgct
tcaa
cagg
atat
tgc
1110
3198506
—NO
gtct
acgc
agct
gcca
taga
ggag
gggg
attg
tttgg
gaga
ccttc
ccgt
ggca
gaat
caaa
gctc
gtag
ttcag
ctgc
ggga
atct
ggtg
tact
taa
mi
3384336
RAB30
YES
ggga
agac
gtgc
ctcg
tccg
aaga
ttcac
tcag
1112
3649726
—NO
ttgtg
caac
agaa
aact
caac
ccaa
1113
2732288
—NO
aggt
ttagt
tgtg
tggc
cgta
atca
catc
caac
acca
agtc
ggcc
aacc
atat
gtga
gctta
atcc
caca
gaat
tctta
tgtta
gacc
tccc
tcttc
agat
aata
gag
gaga
tttttt
ttaag
tata
cgtg
caca
gagg
aagt
gact
tcttt
ctgc
gcac
tctg
ttgtc
actg
gaa
1114
3291868
—NO
tacg
agga
agtg
acca
gtgt
tcat
ctga
aacg
gatg
ccttg
tttct
caga
aatg
tcat
tctc
tttat
ttcat
cttc
gtag
ttctg
tgcc
ttctc
tcag
caat
ttctg
attt
cttta
tttac
tgcc
acac
agat
gaca
aatg
acat
gcat
gtcc
atga
tacc
cttta
ctca
attc
acat
gcga
tttcc
aagg
ctgg
ctat
aattc
ccaa
a
1115
3572359
—NO
gtgg
gggg
ctgt
aatc
taca
aaag
ccat
cttc
agta
ccag
gttta
attc
catc
attg
tttttc
gggt
tttag
gtct
ctgg
acta
ttgtc
cttta
acaa
caac
a1116
3613987
—NO
gcct
gcct
gttc
atgc
caaa
tata
catc
ctca
caat
gcac
agta
caca
acat
gggc
acat
aggc
acag
aggc
tatg
ccgg
1117
3768275
NO
ggct
ttgaa
gggc
aact
gacc
aact
ttttta
taca
taaa
ggat
ataa
acta
caac
tattt
tccc
actc
tgtg
taga
atgt
ggaa
cgga
aaca
gtgt
gtgg
caca
cttt
aaat
gatg
ctaa
acttc
taca
gtgt
gtaa
actg
gcttg
ttcaa
gagg
ccaa
agtg
cttc
tgttg
agaa
acac
tgat
gacc
aatc
tcct
ctttg
tggt
cctg
tga
tact
gtga
aata
tata
tttga
tcttc
ctcc
ctgt
ttccc
agca
taca
attc
ctaa
aacc
ctca
gaat
ctgc
aaaa
taag
atttt
ttttta
atgc
taat
gattg
actg
atg
gctg
gagg
ctcc
taga
gagc
ctca
ggat
tggg
gctg
attc
tcag
ggct
ttc
1118
2367
980
RA
BG
AY
ES
gacc
atct
tctc
cagg
tgga
ctac
ctga
agaa
gata
gtgt
tttat
ttaat
aaac
tgac
ctac
ttagg
atgt
atga
aggt
ttcttc
ccca
cgta
atga
agta
gagg
cPl
Lttt
acgg
gcaa
tggc
aacc
atg
1119
3063
344
ZN
F394
YE
Sgt
gtga
agaa
tgcg
agaa
gagc
ttc11
2031
4713
7—
NO
tttag
gaaa
aggc
tgct
atac
tgtg
gacg
aagt
ctca
gatc
agaa
1121
3737
089
CC
DC
4N
Oac
ggga
cgct
ttctc
tggg
atgc
attg
agtc
aggg
aata
gaag
atgg
ggtg
aatg
atgt
ttctg
accc
acag
accg
ttggg
atgc
ttttg
ctgt
cact
ttccc
a0
tcat
gctg
catg
aggc
atga
cagc
agct
gt
1122
2686
311
FIL
IPlL
YE
Stg
ccaa
gcat
gcga
tattc
agag
tctc
ccca
gacc
ggca
gtca
tcat
ggca
gtttc
agcg
ttcaa
acag
caat
agct
caag
tgtg
ataa
ctac
tgag
gata
ataa
aatc
caca
ttcac
ttagg
aagt
cctta
catg
caag
ctgt
agcc
agcc
ctgt
gaga
cctg
ccag
ccct
tcag
cacc
actg
cagg
ataa
ccga
actc
aagg
ctta
atta
acgg
ggca
ct
1123
3307
401
NO
gcca
ctct
gctc
aaag
ttggc
acga
gttg
gaat
gtaa
tcac
catc
tctg
cact
caag
gcat
gcag
aaat
gaag
tcta
cctc
agcc
ctca
aaga
cctc
acct
gag
aaaa
agac
aagg
ctcc
aaac
agaa
ggca
aaac
agaa
tgag
cata
gaaa
gaga
ccac
tttttc
acat
acct
ttaca
aagc
actg
ctca
cccc
tcct
taac
ctcg
caca
gaca
gagc
atgg
ggcc
tcag
tgtc
actg
ccac
tgcc
accc
tccc
actg
acta
gctg
tact
ctct
ga
1124
3648042
CIITA
NO
ctct
gagg
acac
taac
cacg
ctgg
acct
tgaa
ctgg
gtac
ttgtg
gaca
cagc
tcttc
tcca
ggct
gtat
ccca
tgag
cctc
agca
tcct
ggca
cccg
gccc
ctgc
tggt
tcag
ggttg
gccc
ctgc
ccgg
ctgc
ggaa
tgaa
ccac
atct
tgct
ctgc
tgac
agac
acag
gccc
ggct
ccag
gctc
cttta
gcgc
ccag
ttggg
tgga
tgcc
tggt
ggca
gctg
cggt
ccac
ccag
gagc
cccg
aggc
cttc
tctg
aagg
acat
tgcg
gaca
gcca
cggc
cagg
ccag
aggg
agtg
aca
gagg
cagc
ccca
ttctg
cctg
ccca
ggcc
cctg
ccac
cctg
ggga
gaaa
gtac
ttcttt
tttttt
atttt
taga
caga
gtct
cact
gttg
ccca
ggct
ggcg
tgc
agtg
gtgc
gatc
tggg
ttcac
tgca
acct
ccgc
ctct
tggg
ttcaa
gcga
ttcttc
tgct
tcag
cctc
ccga
gtag
ctgg
gact
acag
gcac
ccac
catc
atgt
ctgg
ctaa
tttttc
atttt
tagt
agag
acag
ggttt
tgcc
atgt
tggc
cagg
ctgg
tctc
aaac
tcttg
acct
cagg
tgat
ccac
ccac
ctca
gcct
ccca
aagt
gct
ggga
ttaca
agcg
tgag
ccac
tgca
ccgg
gcca
caga
gaaa
gtac
ttctc
cacc
ctgc
tctc
cgac
caga
cacc
ttgac
aggg
caca
ccgg
gcac
tcag
aaga
cact
gatg
ggca
accc
ccag
cctg
ctaa
ttccc
caga
ttgca
acag
gctg
ggct
tcag
tggc
agct
gcttt
tgtc
tatg
ggac
tcaa
tgca
ctga
cat
tgttg
gcca
aagc
caaa
gcta
ggcc
tggc
caga
tgca
ccag
ccct
tagc
aggg
aaac
agct
aatg
ggac
acta
atgg
ggcg
gtga
gagg
ggaa
cag
actg
gaag
caca
gcttc
atttc
ctgt
gtct
tttttc
acta
catta
taaa
tgtc
tcttt
aatg
tcac
aggc
aggt
ccag
ggttt
gagt
tcat
accc
tgtta
ccat
tttgg
g1125
2568862
NO
tcttc
ttcta
gcag
tgct
cagt
gggt
1126
3405549
DD
X47
NO
tggc
gtga
gcga
gata
aacc
tcct
aaca
gtag
gtttg
taca
ataa
ggcc
atag
ggca
ccga
tgcc
tgttt
ccat
aattg
ttgta
atttt
atgg
gcttt
gaag
tgtt
gctc
taga
tact
tact
ttctc
ctttc
aggg
tagc
gaga
ggct
ccca
ttaga
atat
ttcat
atgt
acct
gtct
ggttt
gcaa
gctg
ttgag
ttgat
ttattc
agcc
ctag
tggt
aata
atga
ctc
1127
3479
324
AN
KL
EY
ES
aact
gaag
gagc
ggat
caga
gagt
attta
211
2837
3614
0—
NO
tgga
tgtc
tact
ttgca
cgct
gcga
ttggg
agag
ctgt
cccg
ctgc
atgc
gttc
cctc
tgta
atttc
ctca
gagc
tcac
atac
gtac
ctct
ctca
cgag
tgaa
ctca
gattt
tcca
ttgttt
tgct
ttattc
tatc
atttg
ctttg
tggt
tttgc
tgta
aata
ttgaa
atct
taat
catc
atcg
agag
gcac
agcc
aagc
tttcc
agct
ctcc
acc
cccg
tggc
ccat
ccaa
gtct
gttc
atct
ggta
acttc
tgttg
tctg
ggac
ggca
gcaa
gaag
atgc
cggg
gcttg
cctg
gagt
cctg
gcag
agga
gtgc
cact
cact
tttgc
ctag
ggtc
cagt
tggg
gctta
aaaa
atat
ttgga
gaga
agag
taat
gagg
atttt
ttgtg
tttcc
taga
cattt
tatg
catc
tattt
ttgta
aaat
cac
cttc
gcta
acttt
cacc
aagc
acttt
gagc
acag
tgga
acttc
agaa
gcac
aacc
agcc
taac
ccca
gaat
ctca
ggtg
gcac
agct
agaa
attg
agtc
cctt
ccta
ggac
actg
gccg
tggc
cttc
aggg
tgac
cagt
cagt
gcca
ccgg
gttg
gcat
tggt
gtta
caga
cagg
cctc
tgaa
gaca
aacc
gagc
accc
cag
cggc
ccct
caaa
ctaa
gcaa
ggat
ctttt
ttctg
tctg
cgct
ccct
gggg
caag
gtca
aagg
tgga
cccg
ccca
ctgt
gggt
caat
tcct
tcga
aaag
atgg
gacc
cgca
cccc
cgcc
acag
gctg
gccc
cccc
ccac
ccat
gggg
aagg
tggt
gtgc
tgct
ggct
ctga
ccat
actc
ttttg
gaaa
ttgag
aagg
aaag
cattg
agtg
ggac
ctaa
tccg
gaga
agaa
atta
aaga
ccag
aaaa
agaa
ggaa
gctg
ggaa
tgaa
actc
aaaa
tgca
cttg
aacc
tgga
agcg
gcaa
ccct
cagc
tctg
cgcg
gccg
agcc
tcag
caag
agttc
gttt
1129
3954
953
NO
catg
gtct
aacc
acag
aggg
gaat
acta
ctca
gcaa
taca
aagg
agtc
tctc
ctaa
tacg
tgca
acaa
cattc
atga
atcc
caaa
aact
ttatc
tgga
gcca
agc
caga
caca
aatg
tgta
cata
cagt
acgg
aatg
acttc
agat
tctg
aaaa
aagg
caaa
tctg
acca
attg
aggc
agaa
agca
ggtc
agtg
gttc
ccca
ggtc
tggg
actg
gggt
gggt
tact
gata
gcaa
atgg
gcat
gtgg
gtgc
cttg
gggt
aggg
taaa
ggttc
catc
ttgat
cacg
gtgg
tgttt
ccca
agtg
tata
4-
1130
2396
759
FBX
O2
NO
tcga
gggt
agat
aggc
ctta
actta
gtcc
atag
cgtc
ctca
ccttc
ccca
agcc
acac
atcc
tcct
ccca
tccc
ttgct
ccga
tccc
agcc
cctg
agca
ggga
gaga
gaag
ttttg
ttggc
atag
gtttg
ctta
ggta
gccg
gcttc
taga
atgt
agat
ccgt
gagg
gcgt
gagc
ttgga
ctgg
accc
actg
ctga
ttccc
tgca
cct
agca
cagt
gcttg
ccac
aaag
tggg
ctct
ca
1131
2519
281
ITG
AV
YE
Sgg
tgag
cggg
acca
tctc
atca
ctaa
gcgg
gatc
ttgcc
ctca
gtga
agga
gata
ttcac
acttt
1132
3640
619
___
NO
tctg
ttgtc
tcgc
ccttg
ctct
gctc
ctac
ttcac
ccat
cttc
tgag
atcc
aagt
caaa
atca
tctg
cacg
1133
3976
861
HD
AC
6Y
ES
acct
aatc
gtgg
gact
gcaa
ggga
tg11
3440
4829
3—
NO
gggg
taga
acaa
tcaa
gctta
ttcct
aagg
atttt
ctttt
gaca
aata
aatg
ggtg
gtag
tgttg
tttat
tgag
atag
gaaa
aact
atgg
gagg
aaat
gattt
gaa
gtgg
gtgg
tttga
aata
aaag
ttttg
tttaa
atat
gaga
tgat
tgat
tgac
attta
tgtg
gaga
aatc
cgaa
ggtc
aatg
gcat
ttaag
agac
tcat
ggtg
aggc
cag
ggct
tcag
gtat
ttatg
ttggc
agca
gcaa
tacg
tgta
gtgt
gtta
aattc
cagg
gcgt
g
1135
2393
657
KIA
A04
NO
ccct
gtttg
tctg
agcc
ctttg
gaga
tttag
gttg
agtc
atga
gaac
cggt
cattg
gaac
atac
acttt
atta
tgtta
caaa
aaca
aaaa
tccc
cact
gaaa
caca
95
gcta
aaaa
aata
acac
atttt
ccca
agat
taca
ttacc
aaaa
acag
ttgtta
tgtc
attg
gagg
gcgt
ccat
taat
actg
ctcg
gaga
agca
cgat
ctta
ca
1136
3389
355
CA
SPl
NO
aggt
ttttg
gaat
tatg
tctg
ctga
1137
3584
178
NO
attg
tagt
ttgaa
gagc
tgcc
cttg
ggaa
ctca
tggg
acag
gcat
cagg
cctg
gaat
ggga
tgga
caag
tggg
gcct
ccag
gaga
agga
gcat
tgca
ggag
atgc
tgga
gagt
gggg
agca
gtac
gtgc
agca
gata
gggg
a
1138
2365
984
MPZ
Ll
YE
Stc
acca
gtta
agca
ggct
cctc
ggaa
gtcc
ccct
ccga
cact
gagg
gtct
tgta
aaga
gt
1139
2468
456
NO
aaag
caat
ctta
acag
tgag
ggct
atga
aacc
gttg
aaaa
ttctc
ccaa
ggga
agtg
atgg
aaac
ctcc
ttgct
tgag
tcat
ttcaa
agtc
gaca
aaac
aacg
aaaa
atac
atgg
aggg
aggg
atcc
tgta
gtgt
tggg
gagc
agac
tcga
tgat
ttcat
agttc
ttttc
tgtc
tctg
atttc
catg
ttctg
tgaa
atag
tattg
cagc
taa
gaaa
catg
aaat
ccttc
actc
cgac
agta
tg
1140
2954
656
YIP
F3Y
ES
acct
gcttc
ggct
actg
gctg
ggag
tctc
atcc
ttcat
ttact
tcct
tgcc
tacc
tgtg
caac
gccc
agat
cacc
atgc
tg11
4134
9918
9IT
GB
Ll
YE
Sag
atgg
cata
ttgtg
ctcg
ggga
agg
1142
3102
170
PRE
X2
YE
Sga
agga
tatta
ttaca
gaga
caat
gtttc
tgtg
gaag
aattt
caag
ctca
gata
aatg
cagc
ctca
ctgg
aaaa
ggtc
aaac
agta
caac
caga
agct
c11
4331
5137
2-_
.N
Oca
ggca
ctgt
atca
ggtg
ctgc
gtac
ttctg
ctag
ctca
tctc
acct
ggtc
atgc
cgct
ctga
1144
3153
640
-_.
NO
ttcttc
cttg
atgc
agga
tctg
aggt
tgaa
g11
4537
0835
1-_
.N
Otc
acat
tctg
gggt
taga
aggg
gccc
aatg
gatg
ggaa
ttcttc
atat
aaaa
gagg
aaat
gcc
1146
3793
991
...N
Otg
caac
aatg
ggag
ctttt
taac
tagt
ctct
gtaa
ggcg
ttgtc
atca
catc
tg11
4728
4018
6D
OC
K2
YE
Sca
tgag
ttcat
gagt
gaca
ccaa
cctc
tcgg
agca
tgcg
gcca
tccc
cctc
aagg
cgtc
tgtc
ctct
ctca
aatg
a11
4829
4591
3C
MA
HN
Ottg
cctc
ccgg
agtg
tgta
acca
catg
agaa
atag
aggg
aagg
ttctg
tcct
taag
aagt
taaa
agtg
tgac
tggg
aagg
taaa
acat
actta
ttaag
ggtta
acta
gcat
catg
ccat
tctg
tgtg
atgg
tgag
tttgt
gtgt
gtgt
gcac
atgt
gcgt
gagt
acac
acat
atac
attta
aaaa
ggtc
ttgaa
tata
ccaa
aata
ttat
gtct
aggt
gatg
ggta
gcttg
tata
tttttt
ctgt
atcc
ttgtc
tgtg
tttgt
caaa
tttta
taga
aaca
ataa
cagt
aaca
gctc
tctg
acct
tcta
ccct
gtat
gttc
tgc
gtcc
tgtc
taat
tcct
caag
ttcct
tcac
tgcc
tgtc
actc
tgat
gcttg
ctat
atat
cagc
ccat
cctg
ctttt
tgttt
ctta
gacc
caaa
gtgt
caag
tgca
cac
ctgc
cctg
ggtc
tttgt
gtttg
ctgt
ttcca
tttcc
tgga
gtgc
agct
ggcc
ctga
tttgc
atgt
gacc
agca
1149
3706
405
PAFA
HY
ES
tcgt
cact
ggca
gcgt
agat
caaa
caIB
l
1150
3898
481
—N
Oat
ggcg
gcag
agta
tctg
gttc
tctt
1151
3962
245
NA
GA
YE
Stg
ctct
tgct
ggga
catg
tggc
ccag
gtgc
tgat
gctg
gaca
atgg
gctc
ctgc
agac
acca
ccca
tggg
ctgg
ctgg
cctg
ggaa
cgct
tccg
ctgc
aaca
ttaac
tgtg
at
1152
2877
309
BR
D8
YE
Sca
caga
gcca
tggt
ccat
ccga
gaga
agct
atgt
tt11
5329
7372
0—
NO
gtttg
gtgt
ctgg
tact
taga
gctc
cctg
aaac
tgtg
gtgt
tt11
5430
9176
2FZ
D3
YE
Saa
tcat
ttgcc
tctc
ggcc
acat
tgttt
acttt
tttaa
ctttt
ttgat
tgat
gtca
caag
attc
cgtta
tcct
gaaa
ggcc
tatta
1155
3418
179
—N
Oct
gagc
tcat
ccag
taaa
actc
tcaa
aaga
tgaa
taga
tgcc
ctttt
ccttg
agga
gcct
tctg
cctg
atttc
tttgt
cact
gagt
ttggg
tccc
tggt
tgga
gtgg
cagg
agga
aggg
gtcc
acca
cgtc
ttctg
ac
1156
2685
385
NO
tttgg
tgtc
tttct
gaag
ggat
tcgg
cccg
gag
1157
2912
792
NO
ggat
gaaa
tcgt
gcac
gtcc
ctct
tcgt
ccct
ctca
ggac
agaa
attc
ttttg
tcca
gcat
atat
ctat
gttg
cata
tacc
acca
gcca
atta
gtca
ctta
gtag
ctgg
tgttg
gtta
tgag
atca
actg
tcat
agta
tcac
agtg
cttg
tttca
ggta
accc
ttatg
ttattt
aata
gtgg
ccca
agag
tgca
agag
tact
gttg
ctga
caat
ttgga
ttcac
gaga
agc
1158
3590
727
JMJD
7Y
ES
tctg
gtat
gaca
tgga
atac
gacc
tcaa
III
JMJD
7-PL
A2G
4B11
5923
6302
8_.
_N
Oga
aacg
ccgt
cctg
cgga
aaca
tcca
ccat
ccgg
acc
1160
2374
733
—N
Ota
gaga
ctgg
atca
taga
agga
gttg
tttgc
catg
ctta
ccct
taaa
ggct
ttctg
agca
gtca
tcct
ggga
tgga
cagg
gcag
ggag
tgga
gaga
ccag
cga
gcag
gtttt
tgtta
cggt
tcga
gtga
gggt
tgaa
caca
ggca
gtgg
tggt
ggag
atgg
gttg
cagg
gata
atct
tgag
agag
attta
ggag
acaa
gattc
acag
gttg
1161
2448
301
TPR
YE
Sga
gattc
agcg
attg
cagg
agga
cact
1162
2596
900
___
NO
gaat
ggtg
ggaa
atgt
atgc
ttgac
tcaa
aaca
gcca
g11
6327
1380
5Z
NF5
95Y
ES
ctat
atgt
tctc
ctttc
agcc
aaga
cct
1164
3878
694
—N
Ogg
ggtg
tctg
cgtg
ggga
gaaa
tcag
ttgtc
ggca
tttta
aatc
tctg
gata
atga
g11
6524
3463
4A
RN
TN
Ocg
gctg
ttgat
acga
ttgtc
tgtta
tcga
acac
attc
agtg
ataa
agct
gggt
tact
gctg
ctttt
ggtg
ctct
cacc
ttatc
tgga
agat
ctgc
aaac
atta
ccta
aata
ggct
ggca
agat
aaac
acttt
ctgg
aacc
cgag
acttg
gcca
taaa
gata
atgc
tgca
tttttc
tgtc
agaa
tcac
atat
gatg
tgtg
ttctg
taga
ggtta
tttct
gcat
ggaa
actc
aact
tcttg
gatta
gccg
tccc
agtg
aaaa
tcct
cattg
ttgga
gtgt
aaac
caaa
tacg
aagc
cctc
ttgca
aagt
agcc
tcttt
cat
ccca
tact
caaa
atac
ccag
tttag
caag
caac
tgag
attta
agtc
tctc
tggc
ccta
agag
gtttt
tcct
ctttg
ctcc
ctcc
aatc
ttgag
attg
ggttt
tgct
ttaga
gtgc
aagt
atca
taat
tccg
tatg
atag
atgg
ggcc
tgga
cacc
catc
tcaa
cagg
gtca
cttg
gtaa
ttaac
aata
gcca
tata
aatg
cgga
taca
ggtt
acta
ccct
cacc
cttta
ccttc
ctca
ggta
acag
tcgt
agat
acca
gcttt
tttttt
tttttt
tttaa
attg
gcttt
ggcc
agta
gcta
aagt
gcaa
gact
gaat
taat
gag
aaga
tata
ttaaa
tgta
gtca
tagg
ggac
tgag
gagc
aagg
gtgg
ccttg
aaga
ggcc
aaag
gaat
gtcc
atttg
ctga
gtttc
ccttc
ctta
tgtc
tcca
gtc
tggt
gcca
ggta
gtgg
agta
aaaa
agga
gaca
gttta
tttttt
tattc
tatg
tgca
cact
taca
gtat
acat
atat
attta
tatc
acaa
tttac
gaaa
ccaa
aaag
ttgag
tttcc
aatg
gaac
ccttg
tttttt
aata
atcg
acttt
ttaaa
tgtg
atca
ggac
tata
atat
tgta
cagt
tatta
tagg
gcttt
tggg
gaag
ggga
ggat
agc
gaga
agat
gctc
tggg
ggttt
tgttt
ttgct
tttcc
ttcag
ggttt
tattt
ttgac
tgttt
tg
1166
2502
158
—N
Otg
cact
ttccc
agcg
tttct
gtct
gc11
6725
1134
8—
NO
agac
aaca
gtaa
agaa
atac
cagc
aattt
gaag
ccct
gtcc
ctga
atcc
tact
cctg
ttgcc
ttcat
atcg
agag
ttcca
agtg
ttggc
ctgc
agtta
agttc
tga
1168
2697
262
DZ
IPlL
YE
Sgg
aact
gcat
gaag
agca
catg
gctg
a11
6938
2972
7G
PIN
Ogt
tctc
tgcc
aagt
gctg
gcca
gagg
cgcg
tgtg
ttggt
cctg
gtcc
cccg
ctttc
tccc
ccac
tgtc
ctgt
ccct
cccc
tccc
cgtg
cagc
tgct
cag
1170
2566
885
___
NO
ccta
cgag
gtcc
atgg
cagc
gggt
acttg
aagg
attta
1171
2644
609
NO
ggct
ttgag
gatc
tctg
tgcc
acttt
g
1172
2681
773
FOX
PlN
Otc
ctta
cgag
gcta
gaga
ttttc
tgac
aaca
gtat
gtta
gtttc
atttg
gtcc
gtga
atgg
agat
ggca
gcaa
atat
tctc
agac
cctc
ccag
atct
ggct
aaga
catc
ccct
tattt
agga
acga
ccct
cccc
tggc
atac
ccta
ctct
ccat
cgcc
tgttg
gcaa
taaa
cgtg
ccttg
tgga
tattc
ctta
aaag
ccttc
ctgc
catc
cct
ctag
cagt
cact
gtaa
agttc
aaat
ctca
aaga
tgct
tagg
agcc
aaga
ttctc
tctg
ctcg
tgcc
cagc
agcc
cctg
ctgt
gagc
tcac
tgcc
ttccc
cag
cacg
acct
ccta
ggct
gctg
ccttc
ctttc
tgttg
gagt
ctgg
agtc
aggc
taac
tcct
aaaa
atct
gcag
agag
gtct
ctgg
aggg
agga
ccca
agcc
tgtt
acta
ccta
agcc
agaa
tttta
cacc
tagt
ttcct
tgaa
taga
gaag
tcca
accc
caga
ggtg
tgca
caca
cacc
cgcc
ctgc
aaat
cagt
gacg
ggga
gaa
agcc
tgac
ttttc
ttggg
ttttg
ctag
tggg
ggaa
attg
ctca
tttgg
aagg
gcac
cagc
aagc
tta
1173
2941
811
NO
gtgt
cgct
tttat
tcta
actg
ctgg
aact
atttg
ttgta
cctc
tttct
gaag
atca
gtttg
gggc
taaa
ttaca
agct
ccat
cagc
acag
gggc
catg
gctg
cctc
ctg
actg
ctgt
aggc
tgag
cact
tggt
agag
catc
agcc
c
1174
3037
970
NO
tcac
ctaa
aagg
acga
atcc
atct
gcttc
tgtc
tctg
gtac
tgag
ggct
tttta
gattc
cgga
ggct
ttctg
aaga
1175
3235
439
SEC
61A
NO
acac
tttga
cgga
tcgt
ttttg
tcag
atga
211
7636
7057
6-_
-N
Occ
aagg
tcac
acaa
tacg
aggt
gggg
tcag
aatg
caaa
ccca
ggca
gtgt
gcct
acct
agcc
caga
ctct
taac
cact
aaga
tgtc
acta
aaga
agca
1177
2682
104
—N
Ogg
agat
ttgga
gtac
ttgga
gcat
ggag
1178
3809
910
—N
Ogg
gacc
ttgtc
cctc
tttgc
atgg
aggg
atgg
tttct
ttagg
tcct
gggg
gata
gccc
aggc
cact
tcca
ctct
gaag
atct
gtgt
tcttt
atca
gtct
ttcaa
tct
agta
tattt
gcat
ttaat
aggt
gaca
aatg
ggcc
tgga
agct
gtgg
tggg
gatg
gagt
ggga
aacc
agct
ctca
gtct
gaca
ttgac
tcct
ggag
ggga
gggt
cacc
tctg
ttcct
gggc
agcc
tttca
gtgg
cgaa
ggcc
acca
tccc
ccac
agcg
tgat
tgag
aagc
ctgg
cact
tctg
caat
gagt
actg
tgga
gcct
gacc
tttgg
ctag
tagc
ttggt
cttta
ttgac
aggg
ctgt
c
1179
2428
310
NO
cccg
ggtc
tgtc
ataa
ccgc
aaat
ttcta
tacc
actc
tgct
gcttt
aaga
tatc
aaca
ggca
aaag
ataa
ccac
tgtc
caac
tcac
ataa
aaat
agga
catc
aagt
tttgg
ggga
cttc
atgt
ccta
cctc
cagc
ttact
aagc
ctgc
tatc
tctg
gaga
ttcca
ctac
tgca
aacc
tagt
tagg
caat
gggt
aata
gcat
tacc
catta
agcc
aaat
gtgg
agtg
aaca
ataa
taat
aatc
acta
ggac
ttgga
gagt
cact
aaat
aaag
atttg
ttgag
taag
tgga
ggaa
ggct
gcct
tcta
agct
gtgt
atca
ctttg
gccc
caac
ctgc
agta
ctcc
ttccc
actg
tagc
cact
gact
cctg
ctcc
cagc
ttcaa
aggc
tagg
atcc
tctc
ataa
agaa
ttacc
aaag
ccag
agat
ccct
acct
gcct
aata
tata
ctga
tgcc
tcat
gtgg
gccc
ttga
1180
2429
289
—N
Otg
tggt
ttgat
tagt
aact
accc
tgga
gtgt
ctca
acttt
1181
3246
430
—N
Oac
ccga
tgttg
acag
tttct
ccct
ccct
ctct
ctct
ctgc
tgttt
cctc
tata
gagc
taag
tctg
gtttg
cattg
aaat
aagc
atgc
agta
atat
gcct
tttgc
ctta
cca
acttg
tattt
gcac
acag
gcag
ca
1182
3314
235
NO
gtgt
ccta
agcg
ggtg
catg
tcca
g11
8323
3284
1E
RM
AP
YE
Stg
gaga
agag
cccg
gttg
cattt
tg11
8424
5207
7PI
K3C
2Y
ES
catg
gtga
tgca
tattc
gggg
cttg
B
1185
2592323
NO
atag
aggg
ttcttg
cagc
tgtc
caag
gtag
aggc
agtg
gaag
cttg
cgct
cagt
agaa
tcag
tgga
ggta
ggaa
gagg
tgga
caga
ttcag
gtta
tgct
ttgg
atgt
gata
ccaa
tata
gaag
gatc
ctta
tgttg
aagg
atta
gatg
cagg
agtg
agag
caga
aagt
caag
gatg
gctc
taca
gcag
ctac
taga
accc
ctt
gaaa
aatg
gtaa
tgct
gtac
acgg
agct
ggga
aggc
aagg
aggg
ggca
gggg
cagg
agttt
tgta
ggac
tcaa
tagg
tttgt
ggag
agaa
aagg
aggc
tgat
ggga
aaat
caag
ggct
ttgttt
tgag
caac
atca
agat
acct
atta
aata
ttcaa
ggca
tcaa
acag
taaa
tgag
tcag
agttg
tact
agta
tcaa
gattt
ttg
atca
ttgttt
tatta
atcc
taat
acaa
acct
tcct
agttg
tcag
ggttc
agc
1186
2895598
NO
atgt
tctc
ggaa
caag
gaac
cagt
aaag
taca
gaca
tctg
gccc
cagg
atttt
gctg
gctc
cagg
gacg
tcac
tata
aacc
cact
aaca
ctaa
ttgct
ttcac
gtct
gctc
ccat
ttcta
1187
3578431
NO
catg
cgaa
agca
acag
tctg
atct
ccag
gtta
tgaa
aact
ataa
ttaaa
aagt
cact
acac
agaa
aagg
tcca
agttt
ccag
cttg
gcaa
aact
tggg
tgca
gtta
catg
aaac
gttta
ataa
actg
aattg
ttttc
ccca
atgt
gtaa
acaa
aatg
acaa
gact
aaat
ctgt
gcct
ggta
atttc
aatc
taac
tctg
aagc
taca
caaa
acac
acag
acac
cttg
taca
tttca
cctg
ttagt
atga
ccat
ctttt
ttcttc
ggtg
gtgg
aagg
aagg
gcaa
ggtg
actg
tgta
aagc
aaag
atgt
ggga
agt
agaa
atat
tgttt
acta
gaca
ttata
aagt
catg
gtga
aaag
aaag
cttc
gatta
atgc
atag
ccta
aggg
1188
3590373
NO
tcttg
tggt
tcct
taac
ttgtg
gcag
cata
actc
cagt
cttc
acat
gaag
ttcttc
ctgt
gtgt
gtat
ctga
gtcc
taat
ataa
ggag
acta
gtca
tattg
gattg
gggt
ccat
cctg
ctct
gtta
atga
cctc
atct
tagc
tgct
gaca
tctg
taat
gact
ctat
tacc
aaat
aagg
tcac
agta
tgag
gtag
tagg
ggct
agga
ctttt
aac
atac
gaat
attg
ggag
gaca
cagt
tc
tgcc
tgct
tcac
acaa
gtcg
aact
gcaa
ctgg
gatc
ttggc
ttcttg
tcaa
gtca
gtag
catg
caag
ccag
tttaa
gtca
aaag
agga
caca
ctcc
tctta
ctt
(Jl
1189
3277334
NO
gattc
ccct
tgcc
atct
atttt
ctcc
ctca
atct
tagc
cact
tgga
gaag
aattt
aata
ccttt
tttttt
tcca
gagt
ccaa
aggc
tgtg
gatc
taag
tggg
gcat
tttg
tgct
caag
cgaa
tctg
ccat
gcac
tacg
gtttc
aact
c
1190
2323027
...
NO
gagc
ggta
tcttg
aagg
tggc
tttaa
gtttg
ggaa
gttc
ggga
agag
cagt
aaag
gcac
gata
ctgg
gact
gagc
ccag
tagg
agt
1191
2360843
HCN3
NO
tgtc
agca
gatg
tcttg
ggtc
ctga
gt1192
2834307
STK32
YES
gaat
aaac
aaaa
gtgc
gtgg
agcg
caat
gaag
A1193
2890310
C5orf45
YES
aaga
agaa
aacg
tggg
acac
cagc
aggc
1194
2510000
—.
NO
gata
atga
agcc
aacg
caaa
aatc
cacg
tctc
cagg
ctaa
1195
2575065
NO
tcttt
ccag
ctca
tgtc
tccc
gggg
ggct
gggc
tgtc
tcct
gtcc
tgca
gccg
cagc
acttg
atct
cagc
aggt
ggaa
tgca
gagc
agag
gaat
ccgc
cag
acat
taca
ggga
tcta
tctg
agta
taaa
acgg
gact
gctc
tttcc
acta
aattg
ctttt
ggag
aata
aagt
taag
tgaa
ttaaa
atttc
ccca
gtct
caat
tgct
aat
atgg
taaa
tattc
gcag
ttaca
actc
tcat
taac
aaaa
tctc
ttagg
agtc
ctca
gtaa
tttga
gact
gtaa
agac
gtct
tgat
acta
caat
gcct
gaga
acga
gttc
tctg
gatc
ttacc
tgcc
cctc
ttaca
tgag
gagg
cacc
ctga
tgtc
tggg
tggc
ttacc
ctttg
gcct
ctgg
cggg
cattc
agca
tgag
tgaa
1196
3336
723
FBX
LIl
YE
Sttt
aatg
taga
gtat
attc
agcg
gggt
ggct
tgag
agat
cctc
tgat
tttca
agaa
ttctg
atgg
actc
ggaa
taaa
1197
3660
063
—N
Oca
tgct
ctga
cgct
ctat
gcca
ctgc
tgca
cct
1198
4029
263
—N
Ota
aaag
actg
aaca
agaa
gggc
acag
tgttc
acgg
agat
tgaa
gtga
acaa
gaaa
1199
2663789
CHCHD
NO
gatg
aaga
tttgg
accc
ttcca
ttcat
aatc
ccttt
ctaa
gtga
aggg
agag
gctg
gcttg
gctg
ttcct
tgtta
ttccg
aaag
ccct
ggttt
gggg
ccca
tgttc
4ac
actg
gctc
tcag
tcta
gtca
ggtg
ca
1200
3743045
___
NO
gcca
ctct
cgtg
ctgc
ttgag
ctgc
1201
2826559
NO
agca
aaga
ctct
cggg
aaca
ccct
atta
cagg
aaaa
tctc
tcat
acct
gacc
tctg
ctta
accc
actc
gtga
gatta
aatg
atag
tcttc
attc
tctc
tgta
gaat
ttgct
gact
gatg
ctca
tgct
acat
tgaa
tgac
agta
gact
attc
tttag
tttta
tcttg
aact
tttat
tccc
ccca
gtta
ctgt
agac
ttacc
aaga
aaca
tttgt
gttt
gtac
ccaa
caca
ctgt
tatg
gcac
ttgat
gttg
taat
caca
tagt
ttaat
gaat
tatta
ttcag
tgaa
ctct
gaag
ataa
aaag
gcgt
ttgta
tgag
aacc
aattg
gct
acttt
gaaa
tcaa
gtga
gtag
ccaa
aatc
aatta
tggt
tgaa
atag
atga
gggt
taaa
taat
tttaa
atga
ttagg
gaaa
aaaa
aaca
aaaa
acac
ctaa
cgat
ctaa
aaaa
atct
gcaa
taat
atta
tttttc
acgt
ttgta
aaat
gtct
ttctg
tgttt
aaga
aaca
gaaa
ctgg
aaat
caca
ggtg
ttgttt
tatg
gggg
tagt
ttttg
caag
aaaa
agga
aata
gaac
tttct
tcga
taga
ccca
cact
gga
1202
2882611
FAMIl
YES
gtga
gcca
gcca
agaa
ttctg
agtc
tgttg
acca
aggt
gcca
aacc
agag
agta
aatc
agaa
cctg
tagt
ttcca
ctcg
gaa
4A2
1203
3044784
—NO
gacc
cacg
gcaa
ttttg
aata
cttg
gaag
agca
gtct
aagt
accc
tcac
acaa
1204
4016017
ZMATl
NO
tgta
ggta
ttccc
atgg
tttca
tcct
tgac
ctta
ctct
acaa
acac
gattt
ctat
ttccc
tgat
ttcat
ttctc
atct
gtat
gctta
tgac
ctttc
tgtc
aagt
atta
gattc
acat
atcc
aaat
ggca
gcta
ctgc
ttcac
ctgg
atgc
ctca
tagc
caU
l1205
2872559
—NO
ttggt
gatg
cttta
tcga
cctc
tgag
aatc
ccttt
gtga
tctg
cagt
gcta
ctgt
ggttc
acaa
aatc
tgtg
ttcttg
ctct
ccct
gata
gcat
ctga
cattt
gtca
tcag
gggt
tggc
aatc
1206
2810393
MAP3K
YES
tgct
agtg
caac
tact
gctc
catc
gatc
ccttc
acat
ttgtc
tcct
ggttt
acga
gatg
tggc
tcttc
gttg
tttag
aact
tc1
1207
3309863
NO
aagt
gtag
cggc
aggc
atcc
tggg
a1208
3349873
NNMT
NO
gaat
gctg
ttagc
ctga
gact
cagg
aaga
caac
ttctg
cagg
gtca
ctcc
ctgg
cttc
tgga
ggaa
agag
aagg
aggg
cagt
gctc
cagt
ggta
caga
ag
tg1209
3849058
NO
gtac
atca
ttacc
cggc
acag
acac
a1210
2609601
—NO
cccc
tcaa
atct
taca
gctg
ctca
ctc
1211
3489275
NO
tctc
ttgat
tagt
ctag
gtgg
tggt
ttagt
tctg
ttctg
atttt
ttattt
gtttt
agag
atgg
catc
ttgcg
atgt
tgcc
taag
ctgg
actc
aaac
tcct
gggc
tcaa
gtga
tcct
ccca
cgtc
agcc
tccc
aagt
acct
ggga
ctgt
aggt
gtgc
acca
ctgc
acct
ggct
gtct
gctc
tgat
ctgc
acca
ctgc
acct
ggct
gtct
gctc
tga
tcttt
atta
tttct
tcca
ctaa
ttttg
gattt
ggttt
gttc
ttgct
tttgt
agttc
ttgag
atgt
gcat
tgct
agat
tgttt
att
1212
3490771
NO
tctc
cgta
ttgct
tagg
gtgg
tctc
gaac
ttctg
acct
caag
cagt
ccttc
ggcc
ttggc
ttcct
gaag
tgct
agag
ttaca
ggcg
tgag
ccac
agca
cctg
gcct
acat
atta
cattt
aagt
ctgt
gatg
cattt
tgaa
ttttta
tgtc
aagt
gtga
gattt
aact
tgag
gttc
ctttt
tgtg
gggt
ggat
agta
tgttc
gggt
gttc
tggc
acct
atct
tttct
cattg
aattg
ctttt
gcac
ctttc
ttaaa
aatc
agttg
agcg
tattt
gtgt
gagt
ctat
ttcta
agttc
ccca
ctgt
gtta
gattg
agct
gtat
gtga
tcta
tatg
tatg
ttcct
ctgt
cagt
ccca
ctgt
ctta
agct
gtag
ctgt
ttaag
tctg
aaat
tggg
caga
ctca
ttcct
ccta
cttta
1213
2353
513
NO
taat
gtca
tcat
cagt
gggc
aagt
ctag
aacc
ctaa
tatc
ctgg
tgct
caaa
atta
gtgt
tatc
aattc
agtc
ttctc
caca
ttaaa
caat
ttcca
attc
ctttg
agg
cctg
gtgt
gatg
gattc
tgtc
tttca
aatg
ctttt
tttct
ttcag
tgac
tcta
tcgt
tcat
aaat
gtta
aata
aaac
cctc
actc
tgtta
taaa
aata
taga
cttta
aaaa
tttt
ccaa
agta
aaca
ctcc
agag
aaag
gtag
gcgg
gctg
aata
ggct
gctg
ttgtc
cttc
ttact
gccc
a
1214
2610
157
CR
EL
DY
ES
catc
tgtg
tgaa
ggag
caga
tccc
a1
1215
3092
298
LE
PRO
YE
Stg
ggga
gcttg
tgca
cttg
ttctc
acag
gaaa
caca
gtca
tcttt
gcaa
ctat
acta
ggct
ttttc
ttggt
ctttg
gaag
caat
gacg
acttc
agct
ggca
gcag
tT
Ll
1216
3241
630
CC
DC
7Y
ES
taaa
gaaa
atcg
acca
gaag
cagt
gaaa
agttg
1217
3489
976
NO
ctct
gagg
tttag
gcct
cgat
attc
agga
gagt
ggtg
tagg
agga
agat
gaac
cact
gggt
tttag
acac
atgg
aggc
ttagt
tcag
ggct
ttggg
cctc
ccgg
ttagt
ttgttt
agtc
ttgttg
atct
agga
ggag
atca
cacc
agat
ga
1218
3671
050
PLC
G2
NO
ggag
aacg
tgcc
ctat
tcac
actc
tggg
aaga
cgct
aatc
tgtg
acat
ctttt
cttc
aagc
ctgc
catc
aagg
acat
ttctta
agac
ccaa
ctgg
catg
agttg
gggt
aattt
cc
1219
2476
149
—N
Ogt
cagt
taga
caga
gtcc
acaa
ggga
gtcg
1220
2670
672
—N
Oga
actc
acgg
tcttc
agcc
ttcct
gc12
2127
9284
8—
NO
tggc
catg
gtga
aaca
gtgc
ctct
a12
2231
6812
7—
NO
ttcca
atca
gcag
tatc
atgg
atta
aat
1223
3227
785
—N
Ogg
gcag
tttga
ctat
gtcg
catc
cagc
ttccc
atgc
cgtc
catc
ccta
ggag
tctg
tctc
acca
taca
aggg
cgca
gaca
ggcc
agct
tgag
ggtg
ttcat
caa
cgca
gcat
tgttt
gaag
agca
gaag
ttaga
atct
attc
attc
gggg
gcct
gact
gcat
aaat
acac
tgtg
gttg
ccgt
ccag
ca
1224
3367
153
—N
Oct
tgta
tgac
ccttt
gtct
cggt
tgac
tagt
tgttt
gctc
taag
aata
tttca
tgttt
tgct
gtttc
tttca
gtttg
tagg
tgct
aagc
tattg
aaag
gttta
gaga
ctaa
tct
aggt
ctttc
taaa
ttatg
gctg
cttg
cagt
tttat
tcct
ggag
aaat
ggttt
aaca
gcac
ttcttt
gagt
tggc
actc
ttctg
cctg
tctc
agct
ctaa
ctttc
ccgc
ctcc
ttctc
agta
gtcc
ctct
ttcac
cctc
cact
cccc
aagc
ctga
aatg
gaaa
gaac
ttgct
ttgtta
ctga
tgat
aata
tttaa
gctc
taaa
atttt
aaca
gtgt
ttca
gttg
caaa
taaa
tgga
aattg
tgta
ttctg
tttaa
tgtg
tttca
gaaa
atcc
ctgc
tattt
tttttt
tcttt
aaat
acaa
gaaa
acag
gcgt
gtaa
tgcc
tcat
tgaa
aa
1225
3483
227
—N
Ottt
agtc
aagt
aggt
tgag
agca
ccag
ggat
cattt
tttct
tcaa
ccttt
tagc
tcag
tgat
accc
agag
caat
gtgc
ctac
tcag
tgga
gaag
atag
tattt
gtgt
gaat
aaat
ttact
gttc
tgtg
tgtta
catg
tcta
gtta
ggag
tggt
cctc
taat
tctg
ta
1226
3766
878
POL
G2
YE
Saa
aatg
tggt
tcct
tgtg
ttctc
tctg
taaa
tggg
gacc
taga
ccga
ggca
tgct
ggcc
tacc
tcta
tgat
tcttt
ccag
ctg
1227
2555
150
NO
tggg
cgtg
aata
agga
agca
aaac
tcaa
ttata
ataa
catta
ctat
tgttc
aaac
ttcac
atgg
cact
tgca
accg
gagg
ctac
tttgt
ctgc
ctgt
ctcc
atac
tca
tgct
gtct
tatg
ctca
tgttc
ttca
1228
2601
747
DO
CK
lY
ES
gata
ccac
ttccc
actc
gtct
tcca
aggg
gggt
ggag
gagc
ggga
ggaa
ctgg
tgttt
tcaa
gtcc
ggct
ggct
ctac
aagg
ggaa
tttta
acag
cacc
gt0
gaac
a
1229
2664745
—NO
ttatc
cacc
gcac
atga
aaga
gagc
tcag
cgcc
aact
gcca
gtcc
actg
aagg
gtgt
cgca
gtga
tattt
gtttg
a1230
2698409
—NO
atgt
gcta
ctaa
cagg
tcag
gagc
ccag
atca
aaga
gctg
gtttg
ggag
gtgg
ataa
tgaa
ggttg
tgaa
gtct
gtgc
tccc
tctg
agaa
gcct
gagg
aaa
atgc
aggt
ggct
ccct
caat
caac
agac
atcc
aaaa
tgg
1231
2704917
PHC3
YES
atca
gcag
attc
ctct
tcat
tcac
cacc
ttcca
aagt
ttccc
atca
tcag
ctga
tatta
caac
ag1232
2823922
CAMK4
YES
aaat
ggga
ttgtc
catc
gtga
tctc
aaac
caga
gaat
c1233
3717013
___
NO
ggtc
tata
tgtc
tgta
ctta
tgct
ggta
ccac
1234
3498270
___
NO
atgt
ccag
cgca
agga
cact
gtcc
agcc
ttctc
ttcct
gcac
atca
gctg
cctc
ccgt
cagt
cctg
ctga
acgg
ggat
ccttc
ccag
atttc
1235
3737381
—NO
cttc
aaag
ggtg
tcca
gatg
ctat
gaag
cattg
ggttc
gact
tggg
gtcc
acgt
ggttt
gtgc
ctca
gggt
ccaa
aacc
acct
gcat
tcta
ccgg
tggc
gtct
gcag
aaga
cgaa
tggc
ttgaa
ggag
ctgg
cact
ccgc
cggc
taga
tgat
ccaa
acca
tttca
tttct
ttgtc
gtgt
ggaa
aagg
ccct
ctgt
gatg
cact
tct
gttc
tctg
ga
1236
2512319
LOC643
NO
ctgt
agct
ttgag
tttgc
accg
ttcct
tgtc
ccag
gaaa
aagt
taag
aaac
ttaaa
atgt
tttgt
tttgc
tttta
aaga
agac
agga
caga
aaac
ggga
tttaa
aa072
ctta
gtttt
taaa
aaaa
caaa
agct
acag
agta
ggga
gcat
aggt
caac
catc
tcca
ccta
aattt
tttttg
ttcct
aaac
tcag
aattc
taca
accc
aact
aaat
taa
tatg
ccaa
gattt
tgcg
gatg
ttaaa
actc
aaaa
aaag
cagt
atgc
aaga
cggt
cccg
gggt
aaaa
ttcta
ctgc
ctttc
cacg
gaca
agtta
1237
3596425
NO
agtc
ctcg
ctgt
agca
tggg
gtaa
agcc
agga
tatg
acttc
acat
agttt
aact
ccag
aact
taca
ctgt
catc
ttgttt
tgttt
ttctg
cttc
tcttg
tgtg
tcaa
aaga
attc
ctcc
ttagt
tgttc
tcat
atgg
ggca
acca
ttaaa
ttagg
cagt
gtaa
aaaa
caag
ttatg
caga
ataa
agta
gcaa
gagc
aatc
tact
ctga
aatta
gtg
ttttc
cctc
acttt
agca
ccttt
agct
gacc
cagc
ttcct
accg
ccca
gaca
tttta
caaa
aatc
tctc
tctg
gagt
tgcc
tcct
gatc
tgga
ttagg
agtc
acat
a
1238
3809494
NO
ctgt
gttta
acac
ggct
gtgc
tact
tcag
gctc
ccat
ctcg
ctga
gggt
gccc
atgt
cagc
caat
gacc
tt1239
2442446
NO
gtct
tttgc
agta
ggtc
aggg
ctaa
gagg
acta
atcc
ctca
agag
tgaa
gaaa
aaaa
ctat
tgat
ggtg
gcttg
aaag
agtc
aact
gctg
tagc
ttttc
ttgga
gatg
atca
caaa
atgg
ggag
caga
aaac
aggt
tgaa
atta
caga
agat
taat
tagt
aaga
aaag
atgg
gtta
aaga
tgta
aatta
gagg
caaa
gtta
agaa
gaat
ttggt
ttcttc
atta
tact
caga
aagg
taag
gatg
gaaa
agttg
atgg
agca
agat
tgtg
aagt
atct
tata
tgca
agac
taag
aagt
ttgga
ctag
attc
tga
agac
ttagg
agaa
ccag
tgaa
gaaa
tctta
atag
agga
atcg
gatta
gatc
ttagg
gtca
gaac
aatta
ttagc
aagc
aggg
tagg
tgaa
gttc
ctgt
tgac
ccac
tttg
1240
3281111
—NO
agca
tctg
tcgc
tggt
gaga
ccct
cagg
aagc
ttcta
ctca
tgttg
gaag
gcag
tagg
gagc
tggc
atgg
cttg
gcga
gagg
caag
gaag
gaga
gggg
aag
gctc
ttttc
cata
agca
gttc
tccc
tatg
gcac
caag
ccat
ttatg
aggg
accc
accc
ccat
gacc
cagt
cacc
tccc
accg
agca
ccac
ctcc
aaca
ttggg
gatc
acat
tgca
acat
gagg
tttag
aggg
gaca
aata
ttcaa
actg
tatc
actg
ggag
atgg
ggct
accg
gaag
tcaa
gcca
ttcc
1241
3770080
—NO
ctgt
acta
agtg
ctga
tgtg
ccag
cgct
atg
1242
3821078
—NO
acag
atgt
gatc
aaaa
ggcc
taat
atat
gtgt
aattg
gagt
ccca
ggag
tgaa
gagt
gaaa
atga
ggca
gaaa
aaaa
ttcaa
gtga
gaat
ggct
aaga
attt
cccc
aaac
tgat
gaaa
ggca
tcca
ccca
caaa
tccc
aattt
tgta
cttc
caac
cact
gtta
tacc
accc
ccct
caac
aatg
ccta
taaa
tgttt
gttg
aaaa
tga
gtga
atga
atgt
actc
caat
cattt
tctc
acttg
gatg
ttccc
tact
ttttg
c
1243
2531265
SP140
YYEE
Sat
aata
gcaa
agcc
gacg
gcca
ggtg
gtct
ccag
tgaa
aaga
aggc
gaac
gtga
atct
gaaa
gacc
tt1244
2719554
CC2D2
YYEE
Sac
tggt
ggct
cgat
atgt
gtcc
ttgat
tccc
ttcttg
cctg
acac
tgtc
tcat
ttggt
ggta
tctg
tA
1245
3342924
—NO
tgag
gtcc
acaa
atca
gtca
agttg
aata
aaga
ggaa
attc
taag
gaaa
atat
taaa
aatta
caga
aaga
ggaa
gctg
ggga
cact
gagc
ataa
acttt
gat
gaca
tcaa
tttta
ccgt
gtgc
ctat
ccac
tattc
ctcc
actta
gtttc
tcag
atct
tgcc
agca
acct
tagg
gcttg
catc
ctcc
atttc
1246
3811421
NO
gtac
gtgt
gagt
gcgc
tgta
taaa
gtct
ggaa
ccac
aatta
g1247
2448106
___
NO
ttgtg
tgat
tgca
atgc
tgaa
tttgt
c1248
2661177
___
NO
ccttg
ttccc
acat
cggc
acat
tatc
1249
2748212
NO
aggc
atttt
ttgtc
caca
cact
tgct
ttggg
gattt
ttatta
ggta
gacg
gtaa
ttaga
ggtg
tcag
tcat
atga
aagt
aaat
cact
agtc
tttcg
tatta
ctgt
tttct
cagt
ggtc
ctta
tcag
tttttt
ttgttg
ctct
gtaa
tcat
tcat
atgg
taag
actta
acttt
tagt
catg
taaa
tttgt
ttagg
gaca
attta
aact
atttt
gtca
aatta
gtc
gtttt
attc
caag
ttgta
cgtc
ttatg
cattt
attta
gtat
agtg
aaag
cctta
tata
caca
tttaa
tata
atta
gtct
ctgg
atat
ttgga
ttattt
tttag
tagt
acttt
ttta
gagt
gtgg
gtttt
ttcttt
tcttt
tctg
tttttg
agac
agtg
tctg
gttc
tgtc
accc
aggc
tgga
gttc
ggta
gccc
agtc
tt
1250
3081649
NO
cacc
tttgt
gcat
gtat
atac
ttcaa
gttg
tatg
aattt
cttta
tgca
aaga
gcct
ttaaa
atat
aaac
tgtg
atttt
tgttt
ctaa
tttttt
acttt
ggat
tgac
aggg
cct
tttttc
aagt
gttg
gaat
ttcct
tttag
atttc
ttatc
agag
tttat
cttg
atttt
actg
aaat
tcat
ttagc
tata
tgaa
attc
cttta
gcca
tagt
cttg
attc
agat
cccc
ccac
ctga
gtct
ttcca
ggac
cttg
tgcc
atct
gtta
ttgcc
attg
ttgta
ttcac
ttaca
gcct
cttc
tcca
tgag
taca
agcc
cacc
atca
tgct
c
1251
3415065
NO
gagc
cattc
ctttg
gcaa
ctct
tgct
gtca
gcca
tcttc
caaa
gagc
tttg
1252
3436225
NO
tgtc
tacc
atgt
tggg
cagc
acag
atct
aaag
cattt
agaa
ccat
ggct
agag
ctat
gtgt
ttact
atca
ttgct
tcaa
tatg
gaga
gtca
tcca
aaca
ggttt
cattt
gctc
ccag
ataa
actg
tcat
ggga
cctg
ccgt
ggcc
agtg
ctca
ccttg
tact
gctg
gaat
aattt
aaca
atttt
tttca
actg
ccat
catc
atca
tgcc
atttt
atgt
cttc
gtag
cattt
atca
ctat
ctaa
aggt
aggt
cgct
ta
1253
2546677
—N
Oct
ccca
aact
cttg
agga
catg
tggg
gaac
taca
gctg
g1254
2723423
___
NO
aagg
tagt
gtgc
aaag
cgaa
aaga
gggc
acag
acca
aaat
catg
caaa
a1255
3291647
___
NO
gtgg
gtta
catg
ttcct
gcct
ggca
1256
3449366
IPO8
YE
Sgg
acct
caac
cgga
tcat
ccag
gcgc
tgaa
gggc
acca
tcga
cccg
aagt
tgcg
gattg
cagc
cgag
aacg
agct
caac
cag
1257
2935290
___
NO
gtttt
acaa
cacc
caaa
gcaa
caga
aa1258
3296065
KCNM
YE
Sgt
tatg
gtga
tctg
ttctg
caaa
gctc
tgaa
aaca
tata
atat
gcttt
gtttt
ggaa
tttac
cggc
tgag
agat
gctc
acAl
1259
3414403
SMARC
YE
Sta
cggt
gtac
tgtc
ctac
tgat
gctg
gatta
cca
Dl
1260
3611215
___
NO
ctag
tcta
ggga
tagt
ggta
gaag
tacg
taat
gtag
atg
1261
2997513
___
NO
ccttt
gctc
tctc
atat
gcat
ccgg
caca
gcag
c1262
3180413
NO
ttcaa
gggc
atag
gcgt
cgga
attg
tgat
ttcgt
gcgt
gttt
1263
3732
510
—N
Oga
acca
ctttg
ttgac
cttc
tgct
gcaa
1264
2572
597
CC
DC
9N
Oct
ggac
ggtg
tcta
tcaa
aagt
gcaa
atgc
ataa
gctc
tttga
acta
gcaa
tgct
actc
ctaa
gaat
tgta
cctg
tgta
caag
gtac
ttgtg
taca
agag
tttttg
t3
tgaa
acat
tgta
atag
caaa
aagt
tgga
gagt
gcca
ccaa
taca
gggg
aatg
gtta
aata
catta
tggc
cagc
ttaaa
caat
ggaa
tagt
ttgta
gaaa
tttaa
aaca
atgg
atcc
tgtg
tgtg
tgta
ctac
cata
gaat
gggc
ccca
agat
gcat
gatta
agtg
agga
aaaa
agcc
catg
gtat
ccta
atgt
ttgga
atgg
agaa
agt
gtgt
agat
gtat
gtgt
gtgc
atat
gtttt
caca
caga
cgta
taca
cgtg
tata
tcat
atgt
ccac
ctgc
atat
gcct
agga
tgtc
tctg
ggag
gata
tcta
gca
gccc
ggca
gcct
gaat
tgcc
tctg
ggaa
1265
3216
409
—N
Oag
gttc
tgaa
gtcc
cagg
caaa
tgta
atag
gggg
tagg
caga
ggac
ttact
taga
gaga
tggt
ggtg
aaag
tctc
aaac
tgca
gattt
ctag
aaca
aaca
ggt
tact
tgta
agaa
gcag
acag
agat
actta
gatc
agta
aaat
acttc
tgta
acac
taga
agtg
agaa
gaca
tgag
aacg
ttatc
taga
aact
atca
agag
gaaa
tgac
tgca
gacc
aaaa
atcc
tacg
c
1266
3294
290
—N
Otc
caca
gctc
gattt
ggcc
aattt
cgag
ttgaa
agga
tggt
actg
gacc
cagg
cctc
ataa
gtag
cttg
tcat
caag
atct
ctag
tagc
caaa
tcag
aaac
ccac
aatg
aagt
aaag
atca
gtat
gtca
agga
agac
aaga
ctaa
cggg
gtaa
aaag
cctta
gact
agaa
acta
cattt
aggg
ggaa
ataa
agca
aaaa
taaa
aacc
tttca
ataa
gtaa
cact
agttt
ccag
atgg
tact
ctag
ctca
ggca
tgag
aatc
tgttc
ttaaa
agtg
ggac
atta
tttca
atca
taaa
cata
attta
ctag
aat
ggac
atttg
gcaa
agca
agtc
catta
tcca
cctg
ccat
caca
gtca
ctttt
ttagt
cacc
agtg
aagg
caag
cctg
aagg
acac
ctac
tcca
aagt
gcttc
cca
aaaa
tttgc
caca
gggg
ctga
ttcct
actc
tgttt
gagg
cattc
ttttg
tgct
gaaa
gattg
tcac
caga
aaga
caac
tgaa
agac
aaat
attg
ctgc
ttggg
agca
gttg
tggc
acca
gtgt
ggct
ccca
ctgt
agca
atta
aag
OO
1267
3446505
—NO
gtct
tgta
gcac
tcaa
acct
ttgtc
tttcc
caca
agac
tgaa
aact
ccat
gaca
cagg
aata
tttct
tatc
ctgt
gaac
ctttg
tgtc
atac
cacc
tata
gtag
tgttt
agta
taaa
ataa
atta
attg
tttgg
aaat
ctat
ccta
aaaa
aaaa
aacc
taga
ttcag
taaa
agct
ctgt
ggcc
aagg
atgt
ttgtg
ttaca
gtga
acac
cttc
gagc
acttc
agg
1268
3616936
NO
ctca
tagc
atgg
tggt
ctcg
cagt
agtta
gcct
tctta
cata
gtta
ctgg
cttt
1269
3628351
MGC15
NO
cctc
cgtta
cccg
tcag
ccca
gcat
ccgg
ctttg
cttc
cata
a885
1270
2745112
—NO
gcaa
aattg
gtct
ggta
tctg
tcta
aattg
gtca
atac
aagt
gtta
ttctg
gaat
catg
taaa
aatc
actta
ctttc
ataa
ctgt
ttaga
accc
caaa
tgttt
tccc
tttga
aggg
gaac
atct
gtgg
aata
tgat
tttga
atag
agga
cttc
tatg
aaat
atat
ttcaa
atct
taaa
aata
ttttta
atac
ttcct
aaag
tggt
atag
gattt
accc
tcg
ctga
ctat
gcct
tcca
tttaa
gaac
agtc
ttgca
tttgt
ggct
tttca
1271
2824232
APC
YES
tgtc
cctc
cgttc
ttatg
gaag
ccgg
gaag
gatc
tgta
tcaa
gccg
ttctg
gaga
gtgc
agtc
ctgt
tcct
atgg
gttc
1272
3101817
SGK3
YES
agttc
tggt
ttcag
tggg
aaga
agtg
1273
3129414
NO
ccta
gctg
tctg
tcag
gtag
aatg
aggg
tgaa
ggag
atct
agga
tgct
tcag
gcat
tgcg
cttg
aact
taaa
aaac
agga
tcag
cagg
ccct
gact
tcat
aagg
ccca
taaa
taca
aatg
acta
gctc
ccttt
ctca
aggt
cattg
aaaa
tata
cagt
agttt
caga
catc
acat
gggt
ttggg
caaa
gggg
gcag
atttc
caag
cta
ggtc
actta
atgg
tatc
1274
3576514
—YES
gtgg
ccgg
gagg
caaa
acttc
tcta
tgag
tgga
aacg
tttga
agaa
taca
tcac
ccat
tacg
atttt
gcta
atct
tagg
tttca
gaca
ccac
agac
acat
tgat
tct
atttg
gaaa
gata
gcca
tcaa
tcag
ccag
c
1275
3650814
COQ7
YES
ataa
agaa
atttc
ggga
tgaa
gagc
ttgag
cacc
atga
cata
ggcc
tcga
ccat
gatg
caga
attg
1276
3723777
—NO
gcca
aaga
gcat
gtag
agca
aact
aggc
tcta
ttcag
cattt
ccat
ttctta
atgt
atct
atttg
atttc
aata
aaat
ttcat
cattt
tgaa
gatc
tctc
ccat
tagc
ttgg
gaaa
ctac
agaa
acat
gcac
ataa
ctac
tgag
aagg
tatc
aagt
gttc
ctac
acta
gttta
aaaa
aatc
attc
aaca
ccca
cata
aattc
tgat
atgc
acta
ggct
gaac
agct
gtta
acca
ctga
gaag
ctgg
gcac
tgca
tcct
ctcc
ctgg
gtca
aaac
tcta
tatg
ct
1277
2550931
NO
atgc
acat
ggga
cagt
ctaa
gatg
gtag
tggt
gg1278
2681842
___
NO
acca
agtg
acat
tttgc
attc
agcc
tt1279
2868846
___
NO
gctc
tagt
gcca
cagg
aagc
tgcc
t1280
2880139
PPP2R2
NO
agcc
aagg
gaga
accc
aaga
caga
gaa
B12
8133
0907
3—
NO
attg
aatg
attg
tttag
aggc
agtg
tga
1282
3636
536
HD
GFR
YE
Sgc
aattc
agca
acag
agct
cttc
agaa
P3
1283
2347802
RWDD3
NO
ggga
taca
ttcag
gtct
cagt
tgttg
aggc
tgttg
agca
gcaa
tcca
ggta
tgaa
aatg
ggaa
agat
ttgac
tgct
tctc
tcat
gcag
ggga
ataa
agca
aag
gc1284
3202177
PLAA
YE
Stc
ctgc
acttg
acat
tcttc
ggttg
tcaa
ttaaa
cacc
ccag
tgtg
aatg
agaa
cttc
tgca
atga
aaag
gaag
gggc
tcag
ttcag
cagt
catc
ttatc
aatc
ttct
gaac
ccta
aagg
aaag
ccag
caaa
ccag
ctgc
ttgct
ctca
ggac
tttttg
caat
tgttt
tgttg
gcca
ggca
ggac
aaaa
actc
atga
tgtc
ccag
aggg
aatc
actg
atgt
ccca
tgca
atag
aact
ga
1285
3945822
—N
Oaa
ggaa
gcga
gaga
cttg
gcag
tcct
gtag
gcttt
aatg
gaat
cagc
aatg
gacg
ctcg
1286
2682207
—N
Ogg
ggct
ttaat
tctta
atcg
ctgt
gagg
attg
ttatc
cgtg
ggat
gtga
gctg
tctta
gtgg
tattc
aatc
aagt
gtcc
actta
gcca
tgac
tgga
agta
attg
ctc
ccaa
gtgg
gcc
1287
3263773
DUSP5
NO
agcc
atta
cggg
agca
cagc
atgt
gctg
acta
ctgt
acttc
caga
cccc
tgcc
ctct
tggg
actg
ccca
gtcc
ttgca
cctc
agag
ttcgc
ctttt
cattt
caa
gcat
aagg
caat
aaat
acct
gcag
caac
gtgg
gaga
aaga
agttg
ctgg
acca
ggag
aaaa
ggca
gtta
tgaa
gcca
attc
atttt
gaag
gaag
caca
atttc
cacc
ttattt
tttga
acttt
ggca
gtttc
aatg
tctg
tctc
tgttg
cttc
gggg
cata
agct
gatc
accg
tcta
gttg
ggaa
agta
accc
taca
gggt
ttgta
ggga
catg
atca
gcat
cctg
atttg
aacc
ctga
aatg
ttgtg
taga
cacc
ctct
tggg
tcca
atga
ggta
gttg
gttg
aagt
agca
agat
gttg
gcttt
tctg
gattt
tttt
tgcc
atgg
gttc
ttcac
tgac
cttg
gact
ttggc
atga
ttctta
gtca
tact
tgaa
cttg
tctc
attc
cacc
tcttc
tcag
agca
actc
ttcct
ttggg
aaaa
gagt
tcttc
agat
cata
gacc
aaaa
aagt
cata
ccttc
gagg
tggt
agca
gtag
attc
cagg
agga
gaag
ggta
cttg
ctag
gtat
cctg
ggtc
agtg
gcgg
tgca
aact
ggttt
cctc
agct
gcct
gtcc
ttctg
tgtg
ctta
tgtc
tcttg
tgac
aattg
ttttc
ctcc
ctgc
ccct
ggag
gttg
tcttc
aagc
tgtg
gact
tctg
ggat
ttgc
agat
tttgc
aacg
tggt
acta
ct
1288
3497166
NO
cgcc
agca
tacc
agct
catg
tttca
gaat
ttctg
tcac
aaaa
agga
acac
aata
tttta
agttg
acta
ctgt
ggga
aagc
tatg
atca
ttcaa
taac
ctac
ctttc
tga
tagc
tcttt
ctca
gata
atat
aagc
tata
aacc
aaaa
tagt
cagc
cttc
acct
gattt
acag
tgcc
ctttg
aaac
ttcct
aagt
tttct
agcc
cagt
aaat
gctg
agc
tacc
agttc
tact
tcca
agct
agag
aacc
tgttt
acta
aagt
taga
agtc
tagt
gact
tcct
ctgt
caga
ctgt
ccca
gaat
gctc
tccc
tacc
ccca
acaa
ctgt
gtaa
gaac
cgtc
atta
gcca
tctc
caaa
ttgcc
aaag
tgct
cagt
ccct
tcg
1289
3618874
NO
gtaa
catg
catc
tgtg
tggg
aacc
t1290
3202840
NO
tttca
ctgc
tact
gcga
actg
gtcc
tgaa
ggtg
caga
tctc
ttaga
agga
ggac
tccc
acttc
ttggt
ggtg
gtcc
tcttt
ttact
ggat
gtgg
tccc
ctgg
aag
aact
catg
ttaaa
attc
atgg
aata
tcca
ccgt
catc
catg
tgtc
ctcc
ccgt
gagg
gagg
tccc
cttg
ttcct
ccac
ttcct
cctc
ttcca
cctc
taag
acct
ctt
ggag
ggcc
tcta
cttc
ttgga
ggtg
gagg
cggt
ccac
gtct
acca
ctttc
aaat
gatg
gtttg
ttggc
ttgttc
cacc
tcga
tggc
ttttc
catc
taat
gact
ttcc
attc
atgt
ctct
ggct
gcat
cctta
gcat
ctgc
tggg
ctttc
aaag
gtga
caaa
agca
aatc
ctcc
tgat
ttgttg
gtttc
atgg
tcttt
cacc
aaga
gtac
ttcca
cta
ttcgt
ccat
atttg
ccaa
atac
tgct
tcaa
gagc
tttct
cattt
gtttc
cgtg
ttaag
ccca
ccag
tgaa
cggc
tttcc
tggg
caat
ctgc
1291
3755656
—NO
catta
gggg
accc
tgcc
tgaa
tcttt
gcca
acct
gcaa
ggaa
actc
1292
2469949
—NO
tgcc
agtg
acat
ccttg
ggtta
ttcgt
atac
ctca
tgct
ttaga
ttcgc
ttctg
gttc
atgg
tgcc
agcc
ctta
gaaa
ctct
gggt
tcct
acct
gaag
ccgg
tctta
tca
aaac
gcca
cccc
tggt
tcca
gtttg
atca
gggt
aacc
ttgtg
aatta
gcaa
gcag
cagt
cccc
agtg
tgtg
ctga
catg
gagg
ttgga
gaga
aaat
agat
atgt
ccca
tgcc
tggt
ttcca
gcca
atac
tgat
gtcg
cggg
ttagt
ccag
cagg
ctga
ggct
gagg
cctg
tgac
aaag
tccc
tgca
tgga
ggtc
ttcct
gcgt
caga
gcct
agcc
ctgt
tcaa
ta
1293
2830605
—NO
gagc
tttct
aatg
gggg
tggg
aaga
1294
3235906
—NO
ttggg
gttg
atac
ccat
catc
gctc
ttaga
ctcg
gggt
tgat
actc
gtca
tcgc
tgtta
cact
cagg
gccg
atac
ctgt
catc
a1295
3616555
—NO
aggc
agac
atca
accc
tcta
agac
atttt
tttcc
tatc
ctct
ggga
atat
tact
ttttg
gaca
atct
tggt
ccat
tggt
aagc
tcat
ggga
atttg
tcag
agttt
ttttg
tttct
tttgg
ctca
tgttt
agca
tcga
ttggc
agag
tgttt
ggag
tcat
cc
1296
3369121
ELF5
YES
cagg
aattt
tgga
gcgg
gttg
accg
1297
3908978
—NO
tccc
atca
gcaa
catc
cgtc
atac
cctg
acgc
cc1298
2338495
—NO
aaga
gcag
ggga
ccta
tgat
gcag
aaag
tcct
ttgct
ggag
ctca
cagg
tcga
atttc
aggg
aagc
tgcc
caag
ggct
tgaa
catc
cctg
ctttc
aggt
tcc
ag1299
2452931
—NO
ccac
atag
tacg
gtgt
ataa
caag
gagt
ggtc
aaag
agttg
tggc
tggt
ggttt
ccttg
gcaa
agcc
aaag
tcag
tgag
tttca
ggat
ggcg
ttggg
ccttt
tgg
aggt
gtat
aaga
gattc
tc
1300
3448040
—NO
tgtta
agcc
ttcct
gaga
tcac
acca
aatg
tcct
tgat
ggga
attg
taaa
caac
cctta
cttc
tggt
ttcta
atga
tattc
cagg
aaag
gagc
ctaa
ctag
agtc
a1301
2823916
CAMK4
YES
gattg
tgga
aaag
ggat
atta
cagt
gagc
gaga
1302
3799998
—NO
gggc
atca
agaa
tttgg
acat
actg
gtgt
ctgg
tatc
gagg
ttcttc
agct
actc
aatg
cacg
tgat
gatg
gtga
tgat
catg
atgg
taac
aaca
gac
1303
2978095
SHPRH
YES
tcat
gaat
attg
gatg
gctc
tgag
gaat
cgtg
tgtc
tgct
gttg
atga
acttg
caat
ggct
acag
aacg
acta
agag
tgcg
tgat
ccta
ggga
gcca
aagc
ctaa
tccg
cctg
ttcttc
atat
c
1304
3475
634
—N
Oga
cacc
tcag
gtga
catg
cttta
tagt
tggc
ctga
gcct
tgac
ccag
tggt
tcaa
cctg
agttt
ggtc
ttggt
acca
ttcag
gtgt
ttgtg
agct
gaga
ctga
tgag
tagg
aaca
ggtg
tttta
tatc
tcgt
ctttt
gtca
aaag
taaa
gact
cattc
ctttg
gtaa
ccca
acca
aatc
agga
tcag
1305
3683
092
SMG
lY
ES
gtat
tctg
agca
cacc
caac
taca
g13
0640
2204
8R
AP2
CN
Ocg
ccaa
gtct
ctgg
gcta
tttttt
atttt
tgca
aatg
tgct
ttcta
atag
ccat
tgcc
ttcca
tgttg
tttac
ctaa
tcag
cata
tttttg
tctg
aata
cttg
aaca
tttta
aca
gtaa
cgca
ggtg
taga
atca
gaaa
ggaa
actta
tgca
gagt
aata
ttttg
gttc
agttt
taac
atcg
tgac
aatg
aggg
ctttt
tcta
gcaa
tgat
tttta
aatt
gtgt
aagt
ttgac
agta
tttta
ttgttg
ggttt
ttattt
gattt
tagt
tgtg
tgct
tttca
tttgc
agaa
gtta
gtaa
ctgc
agct
cacc
tact
gcac
caaa
gttc
tcga
tttta
ggag
ccca
gcttt
agtc
atttg
aaca
tgct
tcta
aata
aaat
aaaa
caaa
acca
aaac
tata
ctttt
gatc
tata
ataa
gagc
tcaa
taac
tttgt
caag
gaaa
gctc
taat
atat
gcag
tgat
ggttt
atga
aagg
gtgt
ggca
atttt
aaat
ttata
ttgtg
tgtg
atgt
tcaa
ataa
agtg
gtat
ctac
attc
atgt
gattt
atgg
gtca
gcat
gacc
atta
atta
ctga
gtag
aaat
tgac
taaa
ctttg
atttc
ctttt
tttaa
atcg
tgttg
cattt
gattc
ctga
gcaa
attc
cctc
aaag
tgaa
ctct
tgttc
ttaaa
ttttg
aattt
tatg
gtga
gattg
taaa
gata
gagg
caat
tgaa
acat
tgttc
ctta
tttat
gaac
tgct
tgaa
gtga
atac
ttaat
ttaag
tttgc
acttt
aata
cca
aact
taaa
acca
aaca
ctca
tttaa
aagt
aggt
taag
tgat
catg
gatc
attg
ttatta
gcttt
gtgg
ctttg
tgaa
attc
taaa
ggaa
tcaa
ataa
ttcat
catg
att
taaa
ttttc
taga
gattt
tgat
tttttt
ataa
tgttt
ctttc
ctgt
agat
tgtg
ttcttg
tttct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
ctct
caa
aatta
cagt
gttc
attg
tcat
tgac
c
1307
3153
356
—N
Ottt
tctg
agga
atag
ggct
caat
ggtg
ggct
taga
atat
tgag
taca
ccat
gctg
taaa
caga
tatg
ctgt
catc
tagg
ctttg
ttgta
ccat
ttaga
gagc
acag
gca
1308
3529
618
PSM
El
YE
Sgt
ggat
gtgt
ttcgt
gaag
acct
ctgt
1309
3830
025
—N
Oaa
ttcat
tcgc
ctcc
ggta
cttg
caag
cctc
gctc
agtc
ttaag
caag
aggg
gatg
gattc
gccc
gcag
cact
gaga
atcc
aggg
gcag
gcgg
gatg
gcg
ttcag
gcgc
tgttg
ctag
aaat
ctct
gtct
ttact
ctgt
tttga
aggc
agca
tggc
aggg
tgaa
caca
agca
caga
ctga
aggc
agct
tgcc
gcgg
ttcac
atcc
tggt
caca
ccac
ttcct
gacc
atgt
cacg
tggg
caaa
tta
1310
3866
580
KPT
NN
Oag
ggat
gatc
ccat
tcca
ccct
gtcc
actc
tgga
tgag
aattg
gcca
cctg
atgg
ttattt
atac
gtcc
agaa
gaac
1311
2620
390
TG
M4
NO
gcca
cctg
ctga
cgac
ccttg
agaa
gctg
ccat
atct
tcag
gcca
tggg
ttcac
cagc
cctg
aagg
cacc
tgtc
aact
ggag
tgct
ctct
cagc
actg
ggat
gggc
ctga
taga
agtg
cattc
tcct
ccta
ttgcc
tcca
ttctc
ctct
ctct
atcc
ctga
aatc
cagg
aagt
ccct
ctcc
tggt
gctc
caag
cagt
ttgaa
gccc
aatc
tgca
agga
cattt
ctca
aggg
cca
1312
2693
884
NO
cagt
ctga
gttc
ctgc
accc
ttcag
1313
2724
817
___
NO
gatg
atgc
tgga
caaa
ctgc
aaat
cctg
aact
taga
gact
gttg
cagc
aata
acca
ttttg
actg
tgat
aata
tcat
tctg
ctca
agttg
1314
3718
581
SLFN
5Y
ES
ccag
tgac
ccgg
aaaa
ccttc
atga
aaaa
caac
tttga
acac
atcc
agca
catta
tcat
tgat
gacg
ctca
gaat
ttc
1315
2645
299
NO
ctgg
cagc
ctgc
tcaa
cact
ttaaa
tttta
tcag
tttag
tacc
tgct
gatg
cgtc
taaa
actg
ttggt
ccca
ttcta
gtttt
atac
ttaca
tatg
tagt
atta
aaag
cac
ttaag
agaa
atgt
agaa
tata
taat
cttta
taaa
agta
ttaaa
ggaa
ttattt
aaat
tgaa
tttgt
atcc
cattt
atct
tgct
tgct
aaat
agat
ttaat
tgga
gcat
cca
tttgt
ataa
cgtc
tggc
tattt
ctac
aatg
ttaaa
acag
tggg
ttgag
gaag
taaa
atttg
agtta
ctca
gtta
ctga
cata
gaaa
agaa
ccct
gtga
gacc
aaat
tcag
ctttg
gaaa
caat
ttagt
gtaa
ttattt
tgca
tatg
taga
taca
ccttt
gaaa
gcca
agaa
aggt
ttatg
attg
gatc
tgta
ctat
atta
attta
aaaa
caaa
ttattc
ttaag
tata
ttgta
gcat
ttctg
ttcct
gtaa
gtac
tttac
tcat
ctta
aatg
tact
gatc
atta
gcca
ccttt
ataa
aaag
aatg
tgcc
tgac
ccac
tagt
atttg
ata
gaaa
tatta
aaac
agtta
agat
cctta
aaca
tttct
gcat
ataa
tttttt
attg
agta
aatg
taac
ttaac
gtaa
gtag
ttcta
tgaa
gtct
gtgt
aaat
taaa
accc
tgttt
tgtg
taga
tgaa
gcaa
tcttt
ataa
gtttc
tgtc
atca
agac
tgta
ttttg
tgtg
ctat
ttttc
cata
gcta
ggaa
ggtg
gcaa
gtaa
aaca
tatg
cact
atcc
tag
aaac
atac
tgttc
cagg
taga
tctta
attta
ctac
ctgg
aatg
agcc
atct
tagg
tgac
cacc
a
1316
2927067
NO
aatc
tgta
gttg
taat
gtag
ccttc
tacc
taaa
taaa
agag
atcc
taac
ataa
tggt
tctc
atta
ctta
agcc
tcat
tatg
atca
agct
cttc
ccaa
ataa
tctg
acc
aa
1317
2434911
NO
ctga
gacg
atga
ttagt
agaa
ctga
gaag
tgcc
cattg
gtttt
ggtg
agat
agag
gtca
gaca
ttacc
ttagc
acttt
ttggg
1318
2916250
C6orfl6
YES
ggct
caga
gggg
tgcg
caca
acaa
cctta
t2
1319
3152957
___
NO
tctg
gcaa
aacc
cagg
tgcg
gtaa
ccct
gcag
cagc
ttcca
ctca
ctcc
tgca
atgc
ctga
ca
\1320
3190622
SPTAN
YES
gaag
ctttt
gaga
caga
cttc
accg
tcca
caag
gatc
gcgt
gaat
gatg
tctg
cacc
aatg
gaca
agac
ctca
IN)
1
1321
3508990
—NO
ggat
ttcta
tgcc
acac
tacc
cgta
acttt
gaaa
aata
acttt
aggc
tgca
gtttt
cagc
aaac
agga
cagt
cctta
gctg
ccac
atag
ctca
acat
aaag
tgca
caaa
aaac
ttcac
ggtg
ggac
agtg
aatc
ataa
attc
ccaa
actg
acgt
gtgt
ctac
agaa
caga
tg
1322
3567717
—NO
ttttc
cata
gcttg
gttc
tcag
gggt
tata
gatg
caag
aagt
tcct
ggaa
agag
ctct
aaac
caga
aaat
aaca
gtct
gctg
gtca
aact
caaa
actg
tcttg
gttg
atat
atga
gctg
gctg
tcttt
actg
ttacc
tcct
gtca
tgat
cattt
tgct
aattt
tctg
ctat
aact
agag
ggga
atgt
c
1323
3780094
—NO
tgcg
ttaga
agac
ctgc
cgac
aggc
cgga
taag
gcca
atga
gaga
ggga
acgg
agaa
gagg
ggca
cctg
ggttc
acag
gcct
taag
aaag
aattt
agt
tact
tattt
aaca
aatta
aaca
atgt
gtgt
cacg
catg
tagg
tctg
ggaa
atgc
atta
ttaaa
aaca
caca
gtgc
agag
gtgc
agca
ccag
cata
cggg
tgga
tt1324
3799938
NO
ggac
accc
tcaa
ccag
taac
acac
actc
tcct
tcttc
aatg
acgt
ttgtg
ttcct
agaa
tcca
ttttc
ctca
gtca
g1325
3932154
—NO
tcttg
ccat
accc
aagt
aggg
aatc
acttc
cagg
aaaa
caag
tcct
caag
cact
caat
gact
ttgta
aacc
catg
actg
aaac
aaga
ctca
tctg
aggc
agta
ggac
tggt
ggga
ttcta
ctga
cttc
ttagt
atca
catg
gcac
catc
acaa
atgc
attg
ttaga
gcag
caac
atcc
agag
ttctta
agct
tgta
ccct
aaca
1326
2534308
NO
cctc
acag
tgac
acgc
gatg
ttttc
aaaa
gtcc
atct
ttcca
cagg
atgt
ttaca
gcac
ccag
cggg
tagg
cagt
gcta
tttat
taac
cctg
ttgta
ctga
tgaa
caaa
ccag
ccgg
ctca
gggc
aggg
ggtg
agct
tgat
caag
atca
caga
cagg
aagt
gaca
gcgg
ggtc
tcag
ccca
ggcc
atgc
ctga
cacc
caca
ctc
ttctc
tggt
gggg
gcgc
ttcca
ccttt
gtcc
ctgg
aagc
ccag
cctg
acag
cttc
tgca
gtgc
ctct
ggca
gccg
ccca
caaa
ccca
gcca
ggca
aaat
ccca
gggt
gcca
gaca
ccag
gaga
aattt
ccag
ggga
atttg
gcca
tttgg
ctgt
ctct
tcac
ttccc
tgaa
aagc
caaa
caga
gaat
ggat
ttctta
tcac
act
gtag
caag
gcaa
tcaa
ctgg
aaaa
gaga
cagg
agga
gaag
tgtg
agga
ccaa
cagt
caca
ggga
ccca
ggaa
tcgt
gtta
gcca
ca
1327
3696944
—NO
gctc
ccaa
ggcc
caat
caga
taca
gctc
tcca
aacc
tgcc
agtg
ctgg
tgaa
cgcg
ctgc
tggg
agca
ct1328
3712853
—_
NO
ggcc
gtga
agga
agaa
tgcg
gtgt
t1329
3916807
—NO
ttcac
cagg
atac
gact
gttg
gacc
agct
gctg
gaga
tgga
cctg
ctac
ccct
cagc
agcc
tccc
cacc
acaa
gaca
agtg
atct
caat
gtcc
ccaa
acct
gtgg
gacc
ctgt
tcta
caca
cctc
atttt
tgttc
cggc
gtttc
atcc
tcct
tgtg
tgat
tgta
ctga
ttttc
atga
gaca
caag
ttact
tcttt
acat
ccat
attc
ccaa
agca
gggt
taca
tggt
agga
aaga
aagg
aagt
tgga
ggta
ctaa
gctc
attg
tgtc
tcct
c
1330
2381295
—NO
agga
agta
acta
gagc
cggc
atttg
gggg
tgttg
cgga
cgat
tctta
cgaa
tcca
gaca
atttg
ctct
tttaa
acgc
ggcc
ttgcg
gttg
tcta
acag
tgag
aat
agat
gact
aact
gaat
ttgtc
agac
tccc
tgcc
ttctg
tcat
aata
acga
cgca
gaat
tgcc
accg
ttatg
tgtc
1331
2608169
...
NO
gtat
gggg
agaa
cgag
gatg
tgat
a1332
2363544
—NO
ccac
aaga
aggg
ctag
agaa
atac
agag
atat
ttgaa
gagt
gtga
ggaa
gagt
atta
acac
acca
gtttt
cttg
atca
atag
1333
2407863
—NO
acat
ccttc
ttttc
ggcc
ttgaa
tcct
ttcttg
ctttg
caag
gaac
aattt
tcttc
ctac
tata
aact
acag
cact
aagg
tatg
gacc
ttaga
aatta
tata
gtct
aatt
ctct
gctg
acac
acaa
atcc
cttc
aaca
acaa
agaa
tcat
aagt
tcac
attg
aaag
gtct
ctcc
aaat
ctcc
cata
aatc
taca
atat
tcttg
atgt
ttaaa
attta
ggat
aata
gaga
attc
acta
attc
ataa
gaca
aagt
acca
atgt
aata
attc
tata
aaat
gtca
taat
ctgc
tttcc
ttgta
acttc
tgcc
tattg
actc
1334
2920969
FIG4
YES
agga
tcct
cggg
ctta
tttcg
agcg
gtttc
agct
tttgg
tg1335
2949119
LTB
YES
tgcg
tggt
gcga
gtgc
gtga
atat
tggg
ggcc
cgga
cgcc
ca
O\
cattg
ggcc
ttcca
tcat
caga
aagc
tcct
catg
ccac
atct
tgac
acat
tattc
ccgt
cctc
ctg
1336
3241194
NO
1337
3447084
...
NO
agga
tgtg
agac
tgcc
gtgg
atat
cagg
gaag
ggag
actta
taag
aagg
agaa
atttg
gtca
actg
tgtc
aaat
gcttc
agcg
atca
ggat
gagt
aatg
agga
acgg
ccat
ttgat
ttgat
ttgat
cattc
agat
atct
ttaaa
gacc
tttaa
gagg
gcag
tttca
gt
1338
3626136
—NO
acga
ccat
gcgc
actg
agaa
acag
gcac
cagg
atgt
cgag
aacg
gcaa
tgta
tttgc
tcac
aaaa
agct
ctac
ttcac
acac
cacc
tgtg
catg
agcg
tgc
caca
tcaa
aatg
a
1339
3800039
NO
cata
atca
ggtc
ctct
gcaa
gtgg
gagt
ccaa
ctgt
gacg
cagc
cccc
ccac
gttc
aaag
catc
ccttc
agct
tccg
gcaa
tctc
ctct
ccag
g1340
3887175
NO
gccc
tggt
tgca
actc
tgct
ggtc
t1341
4011861
...
NO
atga
gggg
aacg
gtag
ctga
caat
agca
gagg
aggg
ttttg
cagg
gtct
ttagg
agta
aagg
atga
gaca
gtaa
gtaa
tgag
agat
tacc
caag
aggg
tttg
gtga
tgga
agga
agcc
acag
gcac
agag
aaca
caga
atca
cttta
tttca
tatg
ggac
aact
ggga
ga1342
2320367
FBXO4
NO
ggta
caat
ggca
tggc
ttctg
tcta
aggt
acag
aggg
gttg
gcat
ttcag
gaac
cagg
ccat
caca
gaaa
cagg
ttcat
gggc
agac
ccct
cagt
gagc
tg4
cagg
tatc
tcac
ctgg
cagc
cgtc
cagt
actg
ctgg
cttc
ctct
ggag
gccc
agcc
acag
gctg
gggt
tggg
gtgt
gtgg
acat
ctct
gggc
agct
cttg
agt
ccac
cttg
tgcc
agat
cagc
agtg
ccac
cc
1343
2680360
...
NO
gagc
tggg
tcag
ttgct
ccat
tatg
tact
ttggt
ttgca
g1344
2866718
ARRDC
YES
actc
gcta
cctc
attc
gaag
gccg
acat
ggca
gtgt
gcgc
tattg
ggtg
aaag
ccga
attg
caca
ggcc
ttg3
1345
3251759
N NO O
gaca
acct
tcaa
tgca
ctgg
tact
ttgta
ccct
gacc
tcta
ctgt
gatg
aaca
ctca
acttg
ggca
agga
1346
3742688
NUP88
YE
Stg
tgttg
agttg
gagc
ttgct
ttgaa
1347
3772092
LOClOO
NO
tggc
tgtg
cctc
tcga
tgat
gatta
agat
ttcaa
tattt
acag
caaa
acca
caaa
gcaa
atga
taga
ataa
agca
aaac
aatg
gaaa
atct
gagt
tcac
tcgt
g131096
agag
aggt
acgt
atgt
gagc
tctg
agga
aatta
caga
ggga
acgc
atgc
agcg
ggac
agct
ctcc
caat
cgca
gcgt
gcaa
agta
gaca
tcca
1348
2596545
—N
Oga
cagc
agag
ctat
ctat
cagt
tcct
ctga
acttg
ttctg
aatg
agct
tttct
tact
gttc
agtc
ttcac
tctc
ctat
tact
gtttt
gcttc
ctca
taga
attc
caga
ttagt
gggc
ccct
ggaa
gttc
agct
ggta
gatg
agaa
aact
gagc
ccat
tgaa
gtta
aagt
cacc
cagc
tagt
cagg
gttg
atct
tccc
aact
caga
gtcc
cctg
ctta
ctta
aact
tttat
atcc
aaca
ggttt
acttc
ccct
ggaa
actta
tttgg
tattt
tttat
tttcc
atct
ccat
gccc
tcga
tagg
tttc
1349
2928626
—N
Oca
gccg
ccaa
agac
gcct
gaga
gcgg
cagg
ggag
tgtg
ttgcc
ttagg
acaa
tagc
aaac
tggg
acgt
ggta
gagt
ctgt
aaat
tcca
agca
ctgg
g1350
4035053
—N
Oca
tggt
atca
tgtg
cctg
taat
gtc
1351
2487210
ANTXR
YE
Sca
gggt
caag
aaca
acca
gcca
gcc
113
5228
9195
5—
NO
ggcc
cttg
acaa
atgc
ctga
aatc
t13
5330
6118
5E
RV
WE
NO
ctct
tcaa
acaa
caac
cagg
agga
a1
1354
3104349
—NO
atgg
acaa
gtta
ggcc
ctcc
acttc
gaac
ggga
agac
ttgtg
caaa
ggttg
tatg
acag
tgat
acaa
aatc
agac
agtg
cc1355
3915245
—NO
aattg
ggttc
ttgag
gtgg
attg
gagg
agaa
cagc
ctct
ggat
gtct
gctg
ggca
gtga
cgat
gtgt
ctgg
aggc
tttga
cattg
gattc
atta
ggca
ttgct
aat
gtac
acaa
aaaa
tgta
gcaa
aaat
gatg
cctg
tttct
gggt
gaga
acat
ctg
1356
3961553
—NO
ctcc
cagg
tttac
aggt
cttg
actc
agta
cttc
atca
gcaa
tagg
ttgtg
gtca
cagg
aaat
ggaa
gctg
cctc
agga
cctg
tagc
aact
tgcc
tgca
tgct
ggg
ttatg
ttatc
ttttg
agtg
gtgc
ccaa
tacg
aggt
gtct
aggg
gaaa
gtct
taag
tggt
tgga
gccc
atga
ca
1357
2523501
NBEAL
YYEESS
tctc
acag
tgga
attc
gtcc
ctttc
tt1
1358
3601127
NO
gtgt
gtttg
tgca
agca
tgtta
agca
tttct
ctgt
cttg
tgttc
att
1359
3892321
___
NO
tgca
gggg
tgtc
caga
cggc
ctgg
tgca
ggat
ccct
ttgat
gtgg
agga
1360
3087823
___
YES
ggag
gatg
ttcgg
acta
taga
ttcag
ctgt
ggga
tctg
gttc
tgta
gctg
agag
caca
tcgc
taaa
cata
gatg
tgca
gtct
gagg
cttc
a1361
3337295
___
NO
aagg
ctac
aact
gtgc
tctta
gagg
caaa
agaa
gaaa
atga
gact
gcca
ggca
tgaa
taat
gaga
aatc
tttga
tgga
atta
gcca
tgca
gaac
agat
a1362
3528558
TRA
@NO
tcttt
ggtc
ccgg
aacc
agat
tgtc
cgtg
ctgc
cct
1363
3536444
NO
ctttt
ctct
gtag
ttggc
ggaa
tcag
ctca
gtta
cattt
tttac
taag
ttacc
caca
ttctg
acac
tcct
tgac
agttt
taag
atct
tcttc
taac
acac
ttgaa
taga
atgg
atac
tgga
atct
atttt
gaca
gctg
ttgaa
aatc
tattc
tgttg
ttaca
ggag
gtta
agga
ggtta
tttgt
aaca
ctgg
gatta
tttaa
tgaa
ccttt
tgaa
aagg
tgtg
caga
ctgt
tcag
gcaa
atag
tattt
tttag
aattt
aaat
gattt
tggt
tttca
cagt
taaa
ttatc
aaat
gtaa
tgct
tttaa
gaat
tata
cacc
tagt
aata
tttttc
atta
atttt
ctcc
acca
gtgt
agta
atag
taca
ttaca
atgt
tctc
aatta
ccgg
tgcc
ttcta
aaat
gcag
gtgt
agag
tcttt
aaat
acag
ctag
tcta
ttgcc
agct
gtcc
cata
gata
acct
tctc
tttaa
aagt
gacc
tttga
gcaa
tttca
taaa
gaat
aaat
atttc
tagt
tttttg
ttgct
gaac
tgct
aaaa
gatg
gttc
tata
catg
taa
cagg
tggc
tttag
ttggg
ttgct
t
651
1364
3823557
RAB
8A
YES
atgg
cgaa
gacc
tacg
atta
cctg
ttcaa
gctg
ctg
1365
2707806
—NO
caga
ccct
tgtg
gtcg
agttg
tttac
tttgg
taat
taag
tacc
tgac
tctg
ctgg
acgc
tcttc
gcgt
ttcag
aagc
gctg
tttgg
cattg
taca
gtct
gagc
gtga
gttg
aattc
c
1366
2849541
NO
gttc
ttcat
aaat
cctc
cact
caca
agct
tgca
gcct
ttcct
atca
cagt
cata
tcca
atct
caat
tccc
ctgc
ccag
tata
atgg
gact
cga
1367
3795889
—NO
acgc
tgac
cagt
gacc
gagg
acac
agttg
tgtg
ttagg
ctcc
atca
cctg
ctgt
acttt
gagt
tggg
aaat
tttca
tcat
ctta
gaaa
ctgg
gtca
tttta
tcag
agt
ctag
agtc
agat
atag
aaaa
agttt
gtgg
ctat
ttctc
caat
ttata
tgac
taag
gtcg
ggta
tcttt
ttcaa
agtg
tcta
attg
aaat
tgaa
aagg
cagc
aattt
aaag
ttgct
attg
caag
ggca
gaaa
atgg
tctta
agaa
agcc
agct
ttcaa
attg
aata
aaca
tgac
tgcg
ttcac
tttttg
agc
1368
3895772
NO
gaaa
caaa
ctga
caat
actc
gtga
aatc
aaca
ccca
gatg
aggt
agac
cagc
actc
ca1369
2920915
—NO
ccaa
ctca
cacc
atag
cagc
gctc
att
1370
2971825
MANIA
YES
ctgg
aggc
agca
gtat
tctg
gcag
aattt
ggaa
ccct
gcat
ttgga
gttta
tgca
cttg
agcc
actta
tcag
gaaa
ccc
1371
2990655
NO
ggttt
cttg
ataa
ctgc
tcca
ctta
gaga
aatg
agct
gtga
tctg
tgta
gtttt
ggaa
aatg
ggta
gctg
gggg
gatc
atct
ttaaa
caat
ctttt
tcat
gctc
atca
cgaa
atgc
tttta
caat
aggt
ttattt
tgac
tttgt
gttta
gaat
tttttt
tttttt
ttgct
ttttg
ctat
agac
tcttt
aaaa
aagt
tgttc
ttctg
gata
ttattg
atag
atgg
aac
aaaa
ttcat
gtcc
cttg
cttg
aatc
tttta
gcga
gcta
ttcag
agat
tcta
tatc
ccca
tttac
tcat
ggttt
tttca
aggt
gaat
gaaa
caac
atac
cctg
ct
1372
3198975
MPDZ
NO
tgca
ctgg
tcct
gaca
atttt
tatg
ctgt
gttc
agcc
gggt
cttc
aaaa
ctgt
aggg
ggga
aata
acac
ttaag
tttct
ttttc
tcat
ctag
aaat
gcttt
cctta
ctg
acaa
ccta
acat
cattt
ttcttt
tcttc
ttgca
ttttg
tgaa
ctta
aaga
gaag
gaat
atttg
tgta
ggtg
aatc
tcgt
tttta
tttgt
ggag
atat
ctaa
tgttt
tgta
gtc
acat
gggc
aaga
atta
ttaca
tgct
aagc
tggt
tagt
ataa
agaa
agat
aattc
taaa
gcta
acca
aaga
aaat
ggct
tcag
taaa
ttagg
atga
aaaa
tgaa
aat
ataa
aata
aaga
agaa
aatc
tcgg
ggag
tttaa
aaaa
aatg
cctc
aattt
ggca
atct
acct
cctc
tccc
cacc
ccaa
acta
aaaa
aaga
aaaa
aagg
tttt
ctaa
tgaa
aatc
tttaa
aaat
actg
tcag
tattt
taaa
atttt
caac
agta
ttata
aaaa
cattg
catc
tccc
cacc
tcta
atat
gcat
atat
atttt
tcct
gcta
aaat
tgg
tttct
acaa
ttgag
taaa
tggc
aaat
acat
gaag
caat
gtcc
ctaa
atttt
ataa
agaa
atta
tattt
aatg
caca
tttca
atttt
cattc
ttattt
ttgac
ctttt
gta
aaat
atttt
catg
ttgct
ataa
gtaa
atga
tgat
gcca
cccc
atgt
tgac
tatg
gtttt
tcta
gaaa
gcaa
ctat
gctg
ctaa
ccat
agag
gaac
atag
aagg
gtt
ccag
aatc
tttag
tgct
ggttt
taac
aacc
gatg
caac
atta
aaaa
tgtg
ttagt
gtgc
tgtg
caat
tggt
tttca
attc
atat
taat
ctta
atga
caga
gaac
aatg
tgtta
ctaa
ttattt
tggt
tgta
tgcc
atta
gtaa
attg
atag
aaaa
atta
aggg
gatta
acat
aact
tcat
ttcat
tgac
ttata
ttaac
atct
tata
atac
aata
gttta
agac
taag
ggaa
acag
atgg
agct
gttta
ttgag
acaa
ctgg
tga
1373
3652
974
—N
Oct
gaca
tggt
gtgt
caca
aaga
gctc
c13
7436
8029
6L
OC
388
NO
gtag
accc
taca
ccat
tcct
cagc
c21
0
1375
2530
462
CO
L4A
YE
Sttc
cctg
ggtta
atgg
gtga
agat
ggc
313
7625
5027
2—
NO
cagg
ggca
agag
ctag
ttaga
aaac
tgag
ctcg
ccct
gtct
ctaa
tgtc
tggt
gagc
tgct
ctcc
tctc
tcta
caat
atat
cacg
ggta
gtttc
atgt
caac
aaa
1377
3118
585
—N
Oga
atat
gttg
ttaag
tggg
cagc
at
1378
3403
090
C12
orf5
YYEE
SSta
agttt
gctc
gcttg
gtca
agtc
ctac
gaag
ccca
ggat
cctg
agat
cgcc
agcc
tgtc
aggc
aagc
tgaa
ggcg
ctgt
ttc7
1379
2524
930
___
NO
tgtg
aact
tggg
ttgtg
tcgt
agaa
aatg
tgag
tgtc
agag
ag
1380
3125
787
CN
OT
7Y
ES
ggtta
cgac
tttgg
ctac
ttaat
caaa
atcc
taac
caac
tct
1381
3171
102
___
NO
ccaa
ggac
tggc
agga
cgtc
agtg
atgc
tggg
1382
3250
113
KIA
A12
YE
Scc
tgag
cata
tagg
ggaa
gatg
ttcttc
gccc
tgcc
atgt
tagc
taag
tttcg
agttg
cccg
tctc
tatg
gcaa
aatc
atta
ctgc
agat
ccca
agaa
agag
ct
79
ggaa
aattt
ggca
acat
cattg
gaac
atta
caaa
tttat
tgttg
atta
ctgt
gaaa
agca
tcct
gagg
ccg
1383
3747
008
...N
Ogg
tgac
tgga
ccac
gcag
gtag
tttct
catg
aatg
attta
gcac
tgtc
cccc
tact
gctg
tctc
gtga
taga
gttc
tcat
gaga
tttgg
ttgca
taaa
aagt
gtgt
agca
cctc
ccct
ctct
ctttt
cctc
ttgct
ctag
ccct
gtga
agat
acct
gctc
tggc
tttgc
ctttt
gcca
tgag
taaa
agct
ccct
gaga
cctc
ccca
gtca
tac
ttcct
gtac
agcc
catg
gaac
catg
agcc
aatta
aacc
tttttt
cttta
taag
ttacc
cagt
ctca
ggca
tttct
ttata
gcag
tgtg
agat
tgaa
ccga
taca
gac
aaca
ataa
caaa
aagt
agat
aaat
atga
aggc
caga
gaaa
tggc
aact
tcttc
acga
aggg
aagc
tgct
tatc
ctca
agca
agtc
agttg
agga
ctat
cctg
tgag
ctat
tttcc
gtgc
cttg
ttttc
cact
cata
cagg
gaag
tatg
ccct
ggtg
tctc
acag
ttgct
tcag
ttcttc
tttac
tggg
accc
taaa
tata
ataa
caca
ctga
acca
ctca
acttc
tcag
acct
tcaa
tatc
cctg
gggc
tttttc
tcaa
atgg
acac
agca
tagc
tttca
gaaa
gctg
aata
atgt
agac
aaaa
gaag
tctc
cct
cacc
ctttg
atta
tgga
aaac
caaa
gattt
attc
agtg
ccaa
ggat
aaga
aagg
aagg
acttg
atca
acaa
ccac
tgtta
agat
tttga
ccca
ttcac
tact
tca
cgtg
aact
ccca
ggga
aatg
gaaa
atga
aaga
aggt
ctcc
gcgg
taga
ggga
taac
aatg
1384
2406
715
NO
acag
agcc
ctga
ttcca
cata
catg
agct
ccca
tgtg
tggc
tggg
gctc
tgct
gggt
cctg
gggg
cagg
agga
ggat
ggag
aggc
aggc
ccta
ccat
ggag
gagg
ttgtg
atct
aggg
gcag
gctc
ctaa
ccag
ttaac
atgt
gcac
agttt
ggca
agtc
tctc
acag
aagt
ctat
agca
tgtg
gggg
aacc
cagt
ggaa
gaag
tgat
cgag
ctct
agga
tgag
gagg
agac
tggg
gaga
acttc
agag
agcc
ctta
gaac
ttgaa
ctgg
gact
gggc
ac
1385
2808
962
ISL
lN
Occ
tcct
tggc
tgaa
agag
tcct
ttcag
gaag
gtgg
agct
gcat
tggt
ttgat
atgt
ttaaa
gttg
acttt
aaca
aggg
gtta
attg
aaat
cctg
ggtc
tcttg
gcct
gtcc
tgta
gctg
gttta
tttttt
acttt
gccc
cctc
ccca
ctttt
tttga
gatc
catc
cttta
tcaa
gaag
tctg
aagc
gact
ataa
aggt
ttttg
aattc
agat
ttaaa
aac
caac
ttata
aagc
attg
caac
aagg
ttacc
tcta
ttttg
ccac
aagc
gtct
cggg
attg
tgttt
gact
tgtg
tctg
tcca
agaa
ctttt
ccc
1386
2849
483
...N
Occ
tgttg
gtgt
gtcg
ataa
tggc
aagg
tctg
cagt
gacc
cagc
agtg
gctg
aggt
gggg
ctgg
ggct
gaga
acct
gatc
cctg
agag
gcca
tttat
tgta
aga
acag
gcaa
catg
gcag
ggaa
gtga
ccag
gaac
tagg
acag
gcag
taag
tggg
agag
aatg
acgg
caag
cctg
gagc
tgtc
ccag
aggt
gact
gacg
gatg
ctgt
acaa
ggtg
gaa
1387
2868
168
ER
APl
YE
Sat
ttgcc
ctac
agat
ggtg
taaa
aggg
atgg
atgg
ctttt
gctc
taga
agtc
aaca
ttcat
cttc
atcc
t
1388
3082
899
AR
HG
EY
ES
atgg
agttc
gagt
gaat
ttgaa
agtta
cgaa
gagc
agag
tgac
tcgg
agtg
caag
aatg
ggat
tccc
aggt
ccttc
ctgc
gcag
caac
caca
aaa
FlO
1389
3505
964
CE
NPJ
YE
Sgc
tgag
aacg
catc
tttag
ctaa
acttc
gcat
tgaa
cgag
1390
3522
535
DO
CK
9Y
ES
caaa
gcta
attg
agcc
actc
gact
atga
aaat
gtca
tcgt
ccag
aaga
agac
tcag
atcc
tgaa
cgac
tgttt
acgg
gaga
tgct
gctc
ttccc
ttacg
atga
cttt
1391
3404
455
NO
cact
agac
agca
attc
agag
cctc
caaa
ataa
agaa
tattc
ataa
aagt
aaca
atag
aggt
aaat
ataa
aacc
caga
atta
ctac
atgt
gtca
tata
gttta
taa
cttc
tcct
attta
tagc
tttct
atat
ttata
tttat
ctat
aact
tcat
aggc
aaat
gaat
aaaa
atta
taaa
tatg
atag
tggt
cata
taat
gtat
aaag
atgc
aatc
tgtg
acag
tctta
tgaa
gcag
ggat
gaag
acat
atag
gatc
aaaa
tgttt
gcat
agtta
ttgaa
gcta
tgttg
atat
tatg
aaat
tata
ttgtta
caag
tttaa
gatg
ctaa
ttata
attc
tcaa
ggta
acca
ctaa
taaa
atta
ccaa
aatta
tgca
gaaa
agga
aaaa
agaa
aaac
aata
cact
ataa
aaaa
ccaa
ttaaa
taca
aaaa
aagt
cag
taac
agac
aact
tgag
aaac
aaag
acat
ataa
gata
taga
gaaa
acaa
atga
ttaaa
tggc
aaaa
gtaa
atct
tgttt
tagt
aatc
acat
taaa
taga
aaag
gatg
aagc
catc
ctat
taaa
gggc
tgag
actg
acaa
gttg
gcta
aaaa
ctaa
aata
aatta
aaaa
gaaa
aaca
agac
tcat
ctac
atgc
tgtc
tata
agag
acttg
cctta
gata
taag
gaca
caaa
gaag
ttgaa
agta
aaag
gact
gaaa
aaga
tattc
cata
caaa
cagt
agta
acca
agat
agtg
ccga
gtgg
ctat
atttt
tgt
caaa
caaa
ataa
acta
aagt
aaaa
tttac
aaga
gaaa
aaga
aggg
catta
tgca
ttgac
aaaa
atttt
gaca
tagc
caaa
taat
tatg
ttata
aaat
atat
gtac
ttaat
aata
cagc
ctca
aaat
atat
gaag
caat
aattg
ctat
aattt
aagg
gaga
aaag
aaca
gttc
tatg
aaaa
gtta
gaga
atga
aata
ttcca
ctttc
aac
atga
gatta
aaca
acta
gaca
taag
atca
ataa
ggaa
atag
aaaa
tttga
acaa
cact
ataa
acca
atta
tcc
1392
3309338
—NO
ttcca
gctc
atcg
gcgt
tgag
gtgc
ac1393
2475878
NO
gctg
ctga
gtct
caaa
acag
ccaa
a1394
3032635
—NO
gagg
ctcc
tcttc
ctat
ggat
ttgct
tcaa
ggag
gtca
gggc
1395
3475556
VPS33A
YES
atta
cattg
agga
ttgta
tcgc
ccaa
aagc
actc
gttg
atca
aggt
gtta
agac
tagt
ttgcc
tcca
atcc
gtgt
gtaa
tagt
gggc
tcaa
ac1396
2930679
NO
gatc
aaca
ttctta
atga
cata
agag
aagg
gagt
tgca
ggtg
gatc
taag
ggaa
aact
gttc
cagt
agta
aagg
agca
gcaa
atcc
gata
gccc
taag
gga
1397
3489376
CDADC
YES
ggat
cata
aaac
agga
gttg
gggc
ag
1398
3933755
NO
gtttg
cattt
gacc
atgt
cgtg
ggca
ggag
a1399
2430088
...
NO
tcat
tctta
aggc
taag
gtgg
caag
ataa
1400
3420474
...
NO
atgg
aagc
atgt
tggc
gtca
tcag
caga
gaca
ggga
aattg
ggaa
ggag
agga
gagc
ataa
ttagt
gtgc
agaa
acttg
gaaa
tgct
gaca
ttgga
ctcc
tggc
aga
1401
3956302
PITPNB
YES
gact
gtcc
ccag
atgt
gtgc
ctat
aagc
tggt
gacc
atca
aattc
aagt
ggtg
ggga
ctgc
aaag
caaa
gtag
aa1402
2334264
hCG
18
NO
ctgc
tcgc
tgag
atag
acac
atac
ctga
ttgcc
tcct
ttgga
gagg
cttg
tcag
aaac
aaaa
taat
gcaa
ccat
ttgtc
tctc
agct
acct
gcga
cctg
gaag
c20661
cccc
tccc
cact
tcga
gttg
tccc
cgcc
tttct
ggac
ggaa
ccaa
cgta
cttc
ttaca
tata
tcga
ttgat
gtct
catg
tctc
ctta
aaat
gtat
gaaa
acaa
actg
tgcc
ctga
ccac
cttg
ggca
catg
tcgc
cagg
actc
cctg
aggc
tgtg
tcac
gggc
acgc
atct
tcaa
ccttg
gca
1403
2358
760
SCN
Ml
NO
cagc
ctgg
tgcc
ctat
taat
aaga
ctgt
caaa
aaga
ggtc
tttgc
ccttc
atgt
tgtc
tgcc
tgct
tctc
cact
acca
ctca
aagt
ccttt
cccc
cttc
aaat
caca
taag
gact
gcag
gtag
gcac
tagg
ggga
cctg
gcca
catc
tgga
aaca
gggc
tgtg
gcta
agttc
cctg
tgaa
agta
gaag
agtc
aaaa
ggca
ttggc
agg
gtta
tggg
gaac
acca
aggg
aggg
aaca
cccc
cacc
tcct
ccta
gtaa
cctg
aacc
tccc
tgcc
tgct
gatc
ccca
gagt
ataa
ataa
tccc
ctgg
tga
actg
gcag
taac
ccttg
gggg
ttagc
gcca
agat
tctc
accc
caaa
gccc
aagg
aagg
aggc
aggc
aaaa
tgga
taga
aggg
ctttt
attta
cagg
aaag
gagg
acag
atga
ggat
ttaag
tgtc
cagt
gctc
ccag
cgct
agttg
gtaa
aggg
aaat
gcag
tgtta
tctc
ttcttt
tgct
ggcg
acct
gtaa
1404
4005
886
CA
SKY
ES
tcat
caca
aagc
accc
agac
cggt
ttgcg
t14
0524
4019
1__
_N
Oct
cctc
accc
gcat
cata
actc
ctct
tcaa
g14
0625
6196
8SU
CL
GN
Oaa
ctgt
ggaa
tgga
tcac
gtag
acat
gtaa
ccca
gcag
cagt
ttgct
tctg
ttgtc
cact
gatta
atca
gcct
atgt
gcct
gaca
ctgg
tcttg
ca1
1407
3216
427
NO
atgt
tgag
aaag
cata
tgca
tgtg
acag
ggcg
gattt
gtta
gaag
c14
0832
3590
9__
_N
Ogt
taga
ctcg
gggt
tgat
atct
gtca
tcgc
tgt
1409
3469
137
—N
Oag
caac
agaa
agct
cgtg
gtgt
tcct
ctgc
ctca
aagg
ccaa
acgt
ctgt
gggt
tttttg
tcttg
gctg
ccga
ccgc
agca
cgtc
acac
tgttt
gaaa
tttgc
agc
tgcg
atca
tttat
cttc
aggg
gaca
gggg
t
1410
2343
471
NO
gaca
gattg
aaag
aggt
ggtg
aagc
1411
2464
386
—N
Ogg
taca
caca
ttagg
agac
tgca
gtttt
ggtg
agag
gtga
tgga
agct
cgag
acta
gtga
ggaa
aagg
gagg
tgtc
aaca
tgag
ccct
agat
tcttg
gagt
ccag
aact
tgaa
ga
1412
2672
266
AL
S2C
NO
gtgt
gagc
atgc
acag
gtta
gggt
gggc
tgL
1413
3730
349
NO
tggg
ccttt
caaa
ttgag
tcta
agttt
aaaa
gata
taaa
gagg
tgtg
gggg
aagg
gcat
tcct
ggca
gagg
gaag
agca
tatg
caaa
ggtg
tgga
ggcc
taga
acta
cact
gcag
cttta
aggg
gtgg
tgag
tgat
ggcc
gggc
accg
tggc
tcac
acct
ataa
tccc
agca
ctttg
ggag
gcca
aggc
agga
ggat
tgct
tgag
ctca
gtag
ttcaa
gacc
agcc
tggg
caac
atag
cgag
accc
tgtc
tctg
caaa
atat
tttta
aaaa
atta
gctg
agtg
tggt
gatg
cacg
cctg
tcta
ctag
gga
1414
3755
723
ME
Dl
YE
Sgg
ctcc
tcca
tagc
agag
aaat
ctta
tcag
aata
gtcc
cagc
tcag
acga
tggt
atcc
gacc
acttc
caga
atac
agc
1415
3861
643
—N
Oct
ggca
agca
gacg
ggttt
ggaa
taga
cattg
gaag
aagc
tttta
gggc
atcc
taga
cagc
gttg
cgat
gcttg
gggc
aggg
gcag
gaaa
aagc
cact
gat
ggac
tttca
gggc
agag
ctgt
aatc
gagt
tgat
gtttc
agat
ttagt
cgaa
gggt
gtga
gagg
gctg
agac
ggag
aaag
tggt
gtgg
caga
agag
gttg
ctgg
tggc
catg
ggtc
gggg
tgca
gatta
tgag
agaa
ctc
1416
3542
787
PCN
XY
ES
tttga
tcct
atgt
ggga
gcga
ctga
aaat
agaa
tgag
cttg
gtta
agca
cctc
tcct
ttgcc
1417
2468
417
RN
F144
NO
tgcc
acca
aagg
catg
gctg
tggg
cA
1418
2520
238
NA
Bl
NO
ggga
ggaa
cattt
aagc
tgat
ggaa
gtgg
aagt
ggaa
gttg
ctgt
acat
tggc
agca
aggc
ctcc
gagt
tagc
ttttg
aatg
cagt
taac
tggt
ttctc
ttaac
tgtg
gaat
tcat
tgaa
aagt
caga
ctcc
gagt
ggtc
gttc
cagg
atat
ctt
1419
3129693
—NO
cagc
ctgt
gtgt
actc
atttc
cttc
tcttg
agta
ccac
taat
ttgat
agttc
acgt
atcc
tcta
ctaa
gagg
taga
gtga
gaca
gctta
ggga
catc
agaa
gtac
tac
tttcg
gttc
ttcca
catg
gaaa
catg
agga
acga
ggaa
taag
tattc
tttcc
ctaa
cact
taca
gtgc
actta
agac
aaag
tttttt
tttttt
ttgga
ctga
atat
acg
tcga
tgga
caca
ggcc
caga
tatg
gaga
aagg
gcag
gtttc
ctct
aggc
aaga
tttgc
acgg
ggaa
cctg
a
1420
3845840
—NO
cctg
gatc
cctg
aaga
ctgc
tctg
agat
cgat
cagc
caga
gaga
aaca
gaga
gaaa
gctg
gctc
agcc
ctga
aggc
tttga
gcaa
aagt
ttcag
aagg
agc
tgga
agga
gaca
acag
cccg
ttctc
cctg
agct
gtgg
ccgg
gagg
gccc
aggc
cagc
acca
cgat
gtac
agat
ggcc
cagg
gggc
tcga
catg
agtg
aggg
ctga
agttc
gtgc
aacc
ccaa
ccgc
cacc
cctg
aattg
ggca
gaac
caat
tttttt
ctta
gcac
ctgt
ctga
ctct
gctta
ccat
gaca
accc
caag
ggcc
cccc
agaa
ttgat
atcc
aagt
gcct
ttagc
ctat
caaa
gaca
ggag
gggg
tgca
ttctc
ggcc
aggt
tagg
gttc
acag
agca
tccg
tata
tta
1421
2621
185
NB
EA
LY
ES
ggct
gtag
acct
ggac
catg
tgac
agat
gagc
ggga
acgg
aagg
ctct
ggag
ggca
ttatc
agca
acttt
gggc
agac
tccc
tgtc
agct
214
2228
7123
4—
NO
tttgc
tgcc
atcc
atag
tcttt
tcct
aagt
atca
caaa
agga
tatg
cctg
ccaa
tctc
ttaaa
aaat
gaca
atgc
agac
ttgtg
ctca
aaat
gaag
ctac
agat
ttaga
acct
caga
ttcaa
tgaa
ctaa
aagg
agaa
tata
ttaaa
tgtta
aattt
gacc
tcag
aaat
tcca
ctat
acta
agga
tgttg
ccta
agac
ctct
caaa
gact
gagg
acca
gcag
aaca
ctgc
ccat
ttcac
tcta
gaac
caca
caca
agtc
tgttg
ttccc
aagc
caat
gtga
aagc
tata
agcc
ttgag
actc
aaaa
gagc
aca
ggtg
tcca
cagg
tgac
ccaa
acat
caaa
1423
2904825
C6orfl2
YE
ccag
gggc
tcac
tttct
ccaa
caaa
a7
1424
3133366
...
NO
gtgt
cagt
acag
ccgc
gtgg
caga
gctg
a1425
2577848
NO
ggct
tgca
tcag
caca
cattg
taca
gcct
tgca
aatta
caca
gaaa
ttgaa
aaga
aggt
tcct
tgttc
ataa
gtgc
aatg
aatc
tttga
tgtc
cagt
gatc
cgct
gcaa
acaa
tgaa
aaca
aact
gtaa
agca
ctta
agaa
gtttc
cagc
actc
atca
aatg
cattt
cctta
ggtg
acaa
aaaa
tcca
caag
gaga
atgc
aaat
aagt
ggca
tttag
ttatta
gaga
gata
cttg
aaaa
aaaa
aaag
tgga
agta
ttttc
ctgc
ttaga
atga
tttct
gttc
ccct
ttgaa
tgta
agtg
gttta
agag
gcat
taag
aaca
taat
ttattt
tata
ctta
atgt
tcac
acta
ccta
aatg
aata
agga
aagg
gccc
taca
agga
agag
ttatg
tattt
tttgg
taat
tgttt
ctga
gttta
gaga
ttcag
ggga
agag
taat
cctc
tattt
gtgg
gctc
ttctta
gcat
ccct
aacc
ccca
agga
gata
gtac
atgg
ttttg
acttg
gtta
tttca
ctcc
tgca
ttcttt
ttccc
cca
aattt
taag
ttggg
tcac
attg
tgct
ggct
gcaa
aa
1426
3332349
—NO
gggg
atgt
ctgc
aatg
tcag
cact
gattg
cagc
acca
aact
caaa
gaca
atac
cact
catg
ccac
tgcc
aaca
tacc
cctg
gcttc
ctca
acaa
cttta
gaat
tatta
tattg
ccct
aagt
gcat
ctag
aata
1427
2416733
—NO
ggaa
aagt
acat
gcca
gaag
tcag
ctgt
gcag
taac
ctat
gaga
gagc
tact
cctg
tttca
cccc
agat
catc
acct
cgat
tcca
aag
1428
2437457
ASHlL
YES
aagc
acca
gttg
agat
tccc
agtc
cttc
tgaa
accc
cagc
taaa
ccttc
tgaa
cctg
aaag
tacc
ttgca
gcct
gtgc
tttct
ctca
tccc
aagg
ga1429
2704212
—NO
gaga
ttgat
gtgg
ttgtc
agct
aaac
ctgc
tcttt
cagt
agtg
gaaa
tt1430
3888795
—NO
gttg
tttgc
cttg
acag
tgcg
gctg
cgct
tcgg
cctg
ctcg
ggtta
ccag
ggaa
caag
gc1431
2556326
—NO
ttgcc
tcag
ctga
cttta
ccaa
aaat
tccc
tttcc
cttc
cctg
cttta
agtc
atttc
atgt
aggt
tagt
cttc
attta
ttcttt
tgcc
tctc
taca
gtag
ttattg
ctttg
tgc
tgta
cattt
gggt
taaa
ggag
ttgta
tatg
tgta
tggt
gaca
gcat
aatg
cctc
tgaa
gccc
agga
ccta
gcta
cttc
ttagc
tcct
aacg
taat
ttcat
ccat
g
1432
2878702
—NO
gtca
gagt
gtat
aagg
tgaa
atgc
cgttc
tgttt
ttgga
g
1433
3328538
—NO
gcca
tccg
gatg
tcag
ctct
gtgg
acat
tgac
tgct
ggttt
tcct
gggg
cctc
ggttt
tcct
ctct
ggcc
tggc
atgg
actg
cctc
ttgcc
ttgga
gttg
ctgg
gat
ggtg
gggt
gtgt
gtgt
gagt
gaga
gtgt
gttg
ttggg
gggc
agtg
ccct
ggag
agga
cttc
atag
cct
1434
3496801
NO
agga
ctgt
gtgg
cgga
ctat
gcag
cata
gt1435
3668198
—NO
tcca
cagg
gcag
aaac
tattg
caga
gttc
ac1436
3764547
4-Sep
YES
agca
gtat
ttccg
agac
gaga
gtgg
cctg
aacc
gaaa
gaac
atcc
aaga
caac
aggg
tgca
ctgc
tgcc
tgta
cttc
a1437
2544277
ITSN2
YES
ggag
tgct
agca
agtc
tgga
gcat
caaa
taaa
1438
2581567
___
NO
agca
gtttg
gcag
ccac
atgg
attg
a1439
2612556
___
NO
gcct
gggg
gaat
ccgt
gaac
gcct
a1440
2894809
SYCP2
YES
ggtg
tattc
atttc
cgtg
tattg
ctgc
tttL
1441
3956007
___
NO
gaca
gtct
tcag
aaaa
gggc
agct
gg1442
2407214
GNL2
YES
gaaa
caca
cgtg
tgat
taag
cagt
catc
atta
caaa
aattt
caag
agga
aatg
gata
cagt
tatg
aagg
atcc
atac
aaag
ttgtc
atga
agca
aagc
aagt
tac
caat
gtct
cttc
tcca
tgat
cgaa
tccg
gcct
cat
1443
2948537
KIAA19
NO
tcttt
ctca
catc
ctag
agac
ggtc
tttaa
tacg
catta
accc
tgtg
ctgc
caca
tctg
gctc
ctgc
cctc
attg
cctc
caat
ccgg
actc
ttcct
ctca
catc
a49
°1444
3991159
___
NO
acag
gtgg
actg
cagg
gtcg
tctta
caaa
atga
caag
1445
2444356
___
NO
taga
ccct
gaca
ttcag
ctcc
aata
atgg
atgt
ccca
gtat
gccc
tgaa
tcca
aaca
aatg
cctg
actg
caat
aa1446
2480997
MSH2
YES
agac
aggt
tgga
gttg
ggta
tgtg
gattc
cata
cag
1447
2671526
—NO
atgt
tcct
tcca
aatc
tcac
attg
aaat
gtaa
tcttc
tatg
ttgga
ggtg
ggcc
tgttg
ggag
gtga
ctgg
atcg
tggg
ggta
gattt
ctca
tgaa
tggc
ttatc
gtca
tcct
cttg
gttc
tgtc
cttg
tgat
agtg
agttc
tcgt
gaga
tcat
gttg
tttaa
aagt
gtgt
ggta
tccc
tctg
ccttg
ctct
tg
1448
2673140
—NO
aaga
ccac
attg
agcc
acct
atct
ccat
ctgt
gctta
tgta
ctct
gcttc
cttc
ttgtta
ggat
gaac
aatg
ggcg
aacc
ttcca
agttc
ctat
ttaaa
gtta
cccc
aatc
tgtg
caga
agat
cctg
tctc
ctcc
cacc
tact
caag
gcat
tact
ctgg
ca
1449
3394422
THYl
YES
catg
aagg
tcct
ctac
ttatc
cgcc
ttcac
tagc
aagg
acga
gggc
acct
acac
g1450
2485881
___
NO
gggg
tgac
attc
agat
tcaa
tcgg
tcta
cgga
tgag
aaa
1451
2572007
NO
aggc
ttcttt
gaca
atcg
tttct
gatc
tttag
ctga
gtct
agat
ggta
aaag
tgaa
ctag
tgaa
aggg
aaga
atga
gtttt
gtttt
ggat
ggca
tctc
atta
tttttt
atttt
ctttt
tattt
atat
atttg
tttttc
ttattt
tttttt
ctttt
tgat
tttaa
tacc
tcca
ccct
actta
caga
tattt
tttat
tttct
tgac
aact
ctga
taaa
tgat
ggat
tgga
ggat
caga
cttg
ttgac
atca
ctga
tgag
gcttg
tcttg
ttgct
tgtc
ttctc
tggc
taac
attc
ctta
gtaa
aaat
atttc
tata
tttttt
tcttt
atttt
atttg
gtca
ttattt
cat
tttga
gcttt
tttca
tttat
tact
agat
aagc
gagt
tttga
gtgt
caaa
tttgc
actta
atttg
tttttg
gtac
tgtc
ataa
ggac
tgta
gcat
acac
agca
tact
aca
actg
taac
ttttta
cgaa
aata
taca
tttttt
gaag
atga
cttc
agaa
actg
aggg
agag
agac
ccac
ttgag
acta
ggca
gcaa
acag
tgaa
gaag
aaat
gagc
aaca
tggt
gtgc
agga
gcaa
agttt
ccta
ctgt
atat
ctct
actc
taag
tgta
caat
gcct
a
1452
2804366
—NO
gata
cttc
atct
agta
attc
ctgc
ccag
gtttt
gaaa
cctc
atgc
tgga
ccct
cctg
tgca
aggc
tgcc
ctgc
tcat
tctg
ctct
catc
tgat
cctc
tcac
tggg
ccag
tc
1453
2369480
FAM20
NO
tcgg
caat
tgct
cattc
tagg
gttg
ggca
tcat
agttg
gtca
gtct
taat
tccc
atgc
caaa
ggac
aaac
aggt
gtga
cattt
ggat
agat
gaat
actg
ggat
tgB
gctc
tgga
gcat
gtgt
tttga
gttg
aacc
ttgca
gtcc
tttct
ctac
gccc
gtgg
atttt
gtgg
aaac
a1454
2556336
...
NO
agtc
ctgt
ctag
tggt
cagt
ccta
acag
tgtg
gtta
aagg
ggag
ctaa
atttc
aact
ttcag
gggt
cagc
tgag
acaa
ccac
ctag
atttt
ccaa
gtta
gttc
caat
ttcaa
gtac
aata
atcc
tttga
caac
atag
aaaa
gaat
tata
gtta
gcaa
aattg
caat
tgtc
acag
tcaa
ggtc
ttgttg
gtaa
taga
gggt
tggg
gaaa
aagt
taaa
acag
catta
aatg
aata
tattt
ttaac
ataa
caaa
agttc
ccag
aaga
catc
cact
tcaa
attc
aagc
cata
ttcat
caat
gtta
aaaa
cctta
ctct
tctc
cctg
atag
tgtg
agga
aaa
1455
2589618
TTN
YE
Scc
tatc
cctg
taaa
gcct
gtcc
caga
ag1456
3405368
—N
Oct
gggg
cata
aaaa
agca
agtg
ttctc
actg
aggg
atga
atag
tcta
ctag
caac
tgta
agac
agttg
cacg
aata
atga
tgat
acaa
agag
gtat
ccga
tgaa
tata
atgt
agca
ggat
aaac
aaaa
tggc
agtg
gaaa
acag
gagt
aaaa
ccaa
aggg
aagg
attc
tgag
aaga
ctat
ttata
atgg
gaac
taaa
attg
tca
gctg
aagc
tcag
atgg
cggt
agag
gcat
tcca
gatg
gagt
atag
aaaa
tgaa
tgaa
gacc
aagg
acag
gaca
gaaa
tatc
tggg
gtgc
actc
acta
ggt
gagt
gtgt
ccag
acat
tgaa
agag
gatg
agca
gagg
aaga
gaac
agga
gagg
taag
aact
aaaa
aggt
ggta
ttagg
tcct
ggaa
ggtc
ttaaa
tatc
aagc
taaa
gagt
tttac
tgcc
aagc
taca
gaac
tttat
caaa
tgcc
atac
tgag
aaga
ctga
tcga
tttat
taaa
gcaa
actc
ctaa
agga
tctc
ttcttg
gaaa
caat
tcat
tagt
ggtta
gtgt
ggaa
tctc
agtg
tggc
acgg
aata
aagt
aaag
taca
tggg
taac
ggcc
ccag
aaaa
acca
gagc
acag
ccca
gcgc
atta
ccac
tcat
cgcc
tggg
tgat
cttg
1457
3447472
—NO
tttta
agga
gggt
tgca
gaaa
gtga
1458
2331932
—NO
tgcc
ggtg
cagt
tcct
tggt
atac
tga
1459
2413973
USP24
YES
tacg
tcga
tgga
gttc
agca
caag
cacg
agaa
tttgg
gaat
cttc
acaa
taca
gtgg
cgtta
cttg
ttttg
c1460
2972147
—NO
aagt
cagg
aggg
ttcag
tgac
caga
1461
3568132
—NO
ttagg
tattt
cgag
tttga
agtg
tcta
taag
tcat
ctgg
agct
gtcc
agga
gatg
gttc
tata
catg
aaga
tcag
gacc
agttt
cttg
tcat
ca1462
3593984
—NO
ccag
gtgg
tcta
gcgc
tcct
aaca
aaca
cctta
tgat
tatc
cttc
attc
aaca
gagg
agga
gact
gcag
gcat
gcttc
agtc
agtg
gtaa
gaac
agga
agaa
aaac
ccaa
atca
gcaa
cattt
tcttc
ctta
atttg
gtat
aaac
gtcc
ttttc
atta
tttca
agtta
ctaa
tttgc
taat
ctca
aatg
ttttc
tgaa
tcag
gtttg
ctaa
atg
ctgt
aaag
aata
gcta
tcct
tcat
acaa
tact
aggt
tttat
tttttt
tagt
atgt
cagc
ataa
taag
gtac
atgc
gcat
tcat
tcca
ccaa
cctg
tttag
ataa
tact
cagc
ttggg
tcat
attc
aaat
tcta
caca
gaga
ttctc
tgttt
tcaa
taca
actg
ctta
gaaa
gcag
aattg
ccta
gtga
tgac
tgta
1463
3878647
NO
gcag
ccca
tccc
gaag
tgct
tatta
catta
gaac
attta
gtgc
tttta
aaga
tggt
gtgt
gtgt
gtgt
gtgt
gtgt
gtgt
gcgt
gcgt
gtgt
gtgt
gtgt
gaga
gaga
gaga
aagc
atat
ctaa
atgt
tggt
acca
aatc
atac
ccca
ttgga
aaga
ttcaa
ttttc
tgaa
attc
cacc
taag
aattt
ttgta
tgta
tata
tgaa
ggca
attg
tga
aatta
taga
gaat
tagc
aaag
tttat
ttaga
atttg
aggg
gtttg
gagt
gcag
tttta
caga
ttttc
tgac
tctc
ttttc
tttaa
tgtc
catg
caga
gact
aaca
taat
tgac
tttta
gatc
tttat
ccag
agag
tcaa
tcca
attc
aattc
aaaa
atgt
ttattg
ccag
tcca
ctct
gaac
a
1464
3271123
NO
gtga
gtcc
ctgg
gtca
gtga
gcac
a
1465
3291
284
TM
EM
2N
Oga
aagt
ggct
tattt
gagg
ttgttg
gtgc
tgttg
gtat
tattg
ccat
agttg
ttattt
aattg
cttg
atta
aact
cagt
caaa
tatc
cttc
cta
614
6638
5978
1D
MK
NN
Oca
agga
gcca
ccag
attg
gatg
ggag
cccc
caca
ctcc
ctcc
ttaaa
acac
cacc
ctct
catc
acta
atct
cagc
ccttg
ccct
tgaa
ataa
acct
tagc
tgc
c
1467
2326
309
—N
Occ
ttcac
ggat
ttggt
tggg
gtca
gaga
ga14
6830
9104
0B
NIP
3LN
Oag
tgca
ttgtg
ctct
ttcca
agttc
agca
gcag
ttcta
tcag
tggt
gcca
ctga
aact
gggt
atat
ttatg
atttc
tttca
gcgt
taaa
aaga
aaca
tagt
gttg
ccct
ttttc
ttaaa
gcat
cagt
gaaa
ttatg
gaaa
atta
ctta
aaac
gtga
atac
atca
tcac
agta
gaat
ttatta
tgag
agca
tgta
gtat
gtat
ctgt
agcc
ctaa
cac
atgg
gatg
aacg
tttta
ctgc
tac
1469
3823
592
HSH
2DN
Ogg
ggtg
cttc
atgt
cata
gcttc
tcag
aaag
cagc
agta
actg
aggc
tgtg
gatc
tgag
aacg
ggag
ctgc
tagc
caag
caat
gagt
gaag
ctttt
gtcc
gtag
tggc
atgt
tttat
ctga
ggcc
agcc
tttgt
actc
ctgt
gtta
1470
3551
323
CC
NK
YE
Sct
acac
caca
agtg
ccgc
aagt
acag
cagt
caca
gccg
tctc
aaag
ctcc
gaac
catc
ccag
ccc
1471
3602
745
RC
N2
YE
Sat
ggat
ttgtta
gtttg
gaag
aatt
1472
2332
284
NO
cccg
gcgc
gtat
cgta
ggca
gtgt
accg
tggc
cgtg
ccgt
caga
gtgt
gcgt
gtgc
gtgt
gtgc
cgtg
tcga
ggct
gtgt
ag14
7326
8989
1—
NO
ctag
aact
atat
tagc
tgct
cagt
gagt
tggg
caat
tgga
tattt
ttggt
tttat
gattt
caga
tttta
taga
ccag
tgga
atac
aggc
cttg
tgca
tatg
aaga
tca
ggtg
acaa
gtttg
c
1474
2952
713
NO
tgttt
gttg
tttgg
ttatg
atct
tgct
t14
7531
8128
9T
MO
Dl
NO
cggt
ggag
tcca
tgcc
tttga
actg
gatg
tgttc
tattg
atga
cctg
tgct
ctgc
aggg
gaaa
ccag
aagg
caaa
atgc
tggc
agca
tgaa
accc
ttttg
tgg
ttcag
ttcttt
atgc
acta
aggt
tttag
gttg
acta
gtgg
ttgta
gttg
aaaa
tttta
taaa
atac
cgtta
atgt
gaag
tttttc
tttag
tcac
agaa
gttg
aatc
tggt
tat
tattt
aaaa
acta
gaag
cccc
caaa
ccag
caga
tctta
ctga
agat
gatg
ttcca
gcag
cagc
gact
tagc
ccca
ggag
ccca
gtttc
aatg
gcct
tgct
gtg
tggt
gtttc
aagt
gcat
ttaaa
atgt
gtga
caca
gaaa
cggc
acac
tcttc
1476
3412
443
—N
Oac
ctttg
acat
ctct
agca
cacg
gagc
aattc
taga
tggt
tgat
aaaa
gagc
ataa
atca
atcg
acac
aaat
atcc
cctta
aacc
ttgtc
tttca
tgtc
cagt
att
1477
3886
313
TO
X2
YE
Sat
gagt
gacg
gaaa
ccca
gagc
tcct
gtca
acca
gcca
g14
7839
7655
7R
BM
3N
Oat
ggga
cgttt
gtag
aacc
tgag
tattt
ttcttt
ttacc
agttt
tttag
tttga
gctc
ttagg
tttat
tgga
gcta
gcaa
taat
tggt
tctg
gcaa
gtttg
gcca
gact
gac
ttcaa
aaaa
ttaat
gtgt
atcc
aggg
acat
tttaa
aaac
ctgt
acac
agtg
tttat
tgtg
gtta
ggaa
gcaa
tttcc
caat
gtac
ctat
aaga
aatg
tgca
tcaa
gcca
gcct
gacc
aaca
tggt
gaaa
cccc
atct
gtac
taaa
cata
aaaa
aatta
gcct
ggca
tggt
ggtg
tacg
cctg
taat
ccca
gtga
cttg
ggag
gctg
agg
cagg
agaa
tcgc
ttgaa
cccg
ggag
gcgg
aggt
tgca
gtga
gcta
agat
cgcg
ccac
tgta
ctcc
agcc
tggg
caac
agcg
agac
tcca
tctc
aaa
aaaa
aagg
aaat
gtgt
atca
agaa
catg
atta
tcca
gcgg
tattt
tcta
attc
agat
catc
aaac
tgat
tata
taga
agag
ttggc
tttaa
aatg
tttgc
aaat
gtc
tttttt
ttttta
atac
tgga
agaa
aaaa
tattc
tgttg
tgtc
tcat
acag
tgct
tagg
atgt
ctttc
1479
2635
686
PVR
L3
NO
ttcca
tagc
tgta
gctg
gagc
ggta
attg
gagc
tgttc
ttgcc
ctttt
catc
attg
ctat
ctttg
tgac
tgtg
ctgc
tgac
tcct
cgaa
aaaa
aaga
ccat
ccta
tct
tgac
aaag
t
1480
2680971
—NO
agttg
gctta
caac
ctgt
gctg
tattg
a1481
2715956
HTT
YES
ttgag
ttgta
cagc
cgct
ggat
cctg
ccgt
ccag
ctca
gcca
ggag
gacc
ccgg
ccat
cctg
atca
gtga
ggtg
g1482
2452913
__
NO
ctgt
gcac
agga
ctcc
tcaa
atat
tttcc
gtt
1483
2667796
—NO
caag
agaa
ggtg
agca
tata
acag
gtaa
ataa
caga
agct
ctga
tttttt
ttttc
ttttta
tagt
ttggg
acta
ttcat
ccag
caag
agaa
ggac
gctta
aaag
taga
tt1484
3602153
C15orf3
NO
aagg
gattc
tgat
gagc
cgat
gggc
cctg
gagg
cagc
ccat
taaa
gcat
ctgg
o y
1485
3752031
CRLF3
YES
tacc
aaaa
aggc
ctcg
caca
ttcag
ttgga
cag
1486
3765307
APPBP2
YES
ggaa
actg
aaca
gagg
ctgc
ttcaa
gaag
ctca
tgat
ttgca
cctg
tcttc
actc
caac
tagc
taaa
aaag
ctttt
gggg
aattt
aatg
taca
gact
gcaa
aac
acta
tgga
aacc
ttgga
agac
tt1487
3956644
NO
gtac
atgc
aaac
ggga
tggg
agca
g1488
2345202
NO
tgat
gccc
gcgt
tgtc
ttttc
agac
tctg
tttttt
gcct
ttagc
atgc
cttg
taat
tttttt
tttttt
ttttg
ataa
gctg
gatg
tgac
ataa
gggg
taaa
aaga
actg
aga
taaa
cagg
ccttt
agtg
tggc
ctag
aggc
ctat
ctgg
ctag
gagt
tagg
ctgt
gttta
ctgt
ttgat
gtag
ctttg
gtgt
caga
gatta
aaat
ttcct
ctcg
tgta
act
gcttt
tgtc
tcct
ttgttg
tcttt
gggt
ttccc
taat
aact
ccttc
ataa
gtag
gttc
cgag
gcttg
tagt
tattt
aagc
tgta
agtc
cctg
ttatta
caca
ggag
ccct
attg
atgt
ggtg
tgtg
tgtg
tgta
aaag
tgttc
tata
atct
tatg
atta
gctc
ttagt
gagc
ctgt
gtct
ttgga
ctgt
gacc
ttcat
gagt
gcttt
ttagc
1489
2655521
ABCF3
YES
aaag
gaga
gtcg
gttg
gaat
catc
tggc
aaga
acaa
1490
2920283
...
NO
gcag
gaca
tacg
ctaa
aacc
ggag
caat
acag
agaa
gatta
gcat
ggcc
ctca
ccca
aaga
tgac
acgc
1491
3102056
...
NO
agat
taca
aact
caca
tttaa
tgtta
taaa
tgca
gaaa
aagg
ccaa
aaac
acaa
aaag
agtg
gcaa
cgag
ggac
ag1492
3161310
—NO
tgaa
gacc
ttcac
gcttt
ctct
gtaa
gtttt
cattc
aaaa
catc
tttca
atttc
tttttt
ttcttt
ttcttc
tttttt
gccc
tcat
tttag
ttagt
ttgag
tttct
tgtg
gctc
tgta
gtga
ctgc
tcta
atag
aata
tccc
ttaca
acttt
gtgg
cagt
taat
ttctg
gatg
atca
ctgt
gact
tcca
tttac
atgt
atttg
g
1493
3289083
NO
acag
gtaa
atta
ggac
ggca
ctgc
gggt
actc
tgtg
ttccg
cctg
ctca
agttc
taga
cttc
tggt
gtcc
atag
caac
ccag
caga
1494
3304760
-..
NO
cggg
gagg
aaag
acag
ctct
acca
g1495
3895434
C20orfl
YES
tcttt
cctg
ggca
ctta
tcta
acag
94
1496
3406522
—NO
tggt
tttgt
ggat
aggg
gaaa
tcta
gtag
atat
taaa
tattt
ttatc
tgca
aaaa
gcat
ttggt
aagg
ttctc
aatg
ggag
atta
atga
ctaa
aata
aaga
atga
ggca
ttagg
agat
aaat
ttgct
tatg
agca
agag
acag
atac
taac
agta
gaag
gaaa
tact
tttta
gattg
gaaa
ggag
tgac
atat
agca
caag
ccag
gaat
cag
ttttg
ggac
cta
1497
2692051
NO
gtat
cgga
ttaag
ctca
aaac
ggga
ggag
gaaa
caag
gatg
actg
gagt
caag
gaa
1498
2835105
LOC728
NO
ggca
gggg
ttggg
acaa
gtgc
taag
tatg
caag
actc
aagg
gaag
agct
264
1499
3935012
N NO O
cftc
tcca
øffc
caga
tttaa
ccac
ataa
acac
atca
aatc
ccap
ct
1500
4038282
—NO
tggc
tggg
tgag
ttcta
aaaa
tgcc
caag
acct
ccaa
gact
aggt
gaga
tctg
aagt
aatg
aaat
gcct
ccat
tcac
taaa
ggcc
tgaa
attg
tcct
gact
gcct
cag
1501
2856770
—NO
atgt
tcaa
ccag
atga
tggg
ccat
gattg
t1502
3579731
—NO
ccca
tgtg
tatg
ctttg
agcc
gttc
ctta
gttg
agta
ggtg
attg
acca
tacc
agca
gcac
tggt
gcca
atta
agcg
gtct
gcttg
gttc
acac
agcc
tttca
ttcct
gttc
tagc
atgg
ctct
gtct
taaa
tctg
catg
catg
tattt
attc
ctta
atat
atta
cctc
tcttc
ctttc
ttcct
tttta
cattt
cttc
ccat
ttttta
ttgttt
ctct
ctct
ctca
ctct
ctca
ctca
ttctc
tcttg
cact
ttgtc
ttcgg
agct
cattg
ccat
ggag
tggc
ttggc
tgta
cgca
tcaa
aaag
ccac
ccgc
aaac
gtcc
atcc
ttggc
cgtg
cggt
tcca
cgtc
tcac
tcat
tact
tcct
tcct
gcat
tcat
agca
tcct
tcct
tcct
gcttg
ctct
ctttt
tatta
ggca
tttat
tcct
ctat
tagt
ggag
tcag
gtg
ccct
tatg
acaa
gtcc
tagc
1503
3636549
—NO
ctac
tgaa
agca
tgtc
ctgt
gaat
gtta
cttg
tcag
tgac
aaaa
tagg
taca
gaaa
ttgaa
ataa
gagt
ttaga
aattt
ttaga
gccg
tttga
tgga
gtag
ctttg
tga
atct
aata
attg
aggc
ttaca
ttttg
tatg
tcttt
atttt
tctc
ataa
tttat
tttta
tttta
taaa
catg
tctg
tttgc
aact
catta
gaac
tttaa
aaaa
aaaa
acta
gtca
ttcac
tcta
ggta
gagt
agaa
gtac
tgac
tatg
aagg
atct
ttaaa
ggtc
aata
gtgg
tcct
aata
cgta
aaat
aattt
gctc
atat
ggtg
tattt
aaac
tattg
gcaa
ctat
gtaa
aagg
ggga
atgg
gttg
acct
tcat
gtta
caaa
tatta
atttc
tcct
gcat
aaaa
gtac
cctc
attg
1504
3653150
—NO
tttca
tgag
atct
acac
ctgg
gctg
ggca
cgcc
tgttg
gtttt
ttact
gaca
acgc
ataa
tttttc
gagg
agtg
gtaa
tcag
aggc
aacc
gttc
tagt
gaca
gtg
attc
aggc
gtaa
acac
ttggc
tttgg
ccca
gagg
ttgca
ctgt
gtga
gacc
acag
gtga
acac
taga
atgt
ttaca
ctct
tccg
gagt
ctgt
aggt
ggat
gcc
a1505
4011856
IL2RG
NO
cagg
gtcc
tgta
gccc
taag
tggt
acta
acttt
ccttc
attc
aacc
cacc
tgcg
tctc
atac
tcac
ctca
cccc
actg
tggc
tgat
ttgga
atttt
gtgc
cccc
atgt
aagc
accc
cttc
atttg
gcat
tccc
cact
tgag
aatta
ccct
tttgc
cccg
aaca
tgttt
ttcttc
tccc
tcag
tctg
gccc
ttcct
tttcg
cagg
attc
ttcct
ccct
ccct
ctttc
cctc
ccttc
ctct
ttcca
tcta
ccct
ccga
ttgttc
ctga
accg
atga
gaaa
taaa
gtt
1506
2351151
—NO
aaat
agct
gggt
ctca
gtgt
ctct
tt1507
2577491
—NO
gttc
tccc
tagt
tttct
ctac
aaca
gtgg
aaaa
caaa
gcag
tgct
ctct
atgc
cagg
acag
taca
tttgg
acgg
cgac
acct
tgttt
gcag
aatc
ct1508
2693258
—NO
atct
acat
tttca
cact
gaca
ccca
1509
3156385
—NO
agtc
acac
ccttg
ggga
tctc
attg
tgttt
catg
actta
atta
tgct
gctg
actc
tcag
attg
ttatc
ttagt
ttttc
agac
ttcttt
cttg
aact
ccag
accc
gtgg
ccac
tatg
ttctc
gcca
tgtg
gaat
ctga
tgtc
tcat
aagc
atttc
agac
ttaat
atac
tcaa
aagg
aact
gata
ttcac
gcca
agcc
tgct
tctc
tccc
cttc
tcca
ctca
ctaa
atgg
cact
acttt
ccttc
tggc
tgg
1510
3175668
VPS13A
YES
cttg
gagg
tatta
tagc
agaa
gtga
atttg
gccg
agca
ttcta
cagt
tatta
cattt
ttaga
ttatc
atga
tgga
gcag
ctac
attc
ctct
taat
aaat
caca
caaa
gaat
gaac
ttgttc
1511
2389740
NO
atga
gtga
ttcat
tctg
ccct
gtgg
acac
ttggg
aaca
gaag
gcga
ctct
tgag
ctgg
gtttt
ccag
gctta
ttagg
cgct
cagt
gggt
taac
agaa
tgag
ctg
gtcc
agga
agag
atca
gcct
gtga
aagc
cgtg
gctg
ggat
atta
ttgga
gat
1512
3224
499
NO
atct
ctta
ggtc
atat
gtca
tcgc
atta
ccttt
aaaa
attg
tatg
gtct
aaat
gtaa
aata
gtta
tttttc
aaca
tttgt
tttaa
tacc
cttg
taac
tgct
tttta
ctca
aga
caga
cattt
cact
taca
ctaa
gtgt
gtgt
gtgt
gtgt
gtgt
gtgt
gtag
aaac
ttagg
atat
aaca
cgta
tgta
aatta
caca
caca
tacc
tgttt
aaat
gtaa
gaa
ttcag
tgat
agga
acag
ttcat
ctaa
gaag
atttt
aatg
acag
ggca
ctct
tttgg
ccaa
agca
gtgg
aattt
ccta
agca
gcaa
ggttg
gtaa
tagt
gctg
gta
ggat
ctgt
atgt
gtaa
gctg
ttaca
caag
ctga
ggat
caaa
gtgc
caat
ttctc
aagg
agaa
aaac
ttctc
cgag
aaat
gaaa
gtct
ctga
tctg
tgttg
tatg
ccc
taa
1513
2807
754
C7
NO
gggc
caga
acac
actc
taca
aaat
gact
agga
taac
agaa
agaa
cgtg
atct
cctg
atta
gaga
gggt
ggttt
tcct
caat
ggaa
ccaa
atat
aaag
agga
cttg
aaca
aaaa
tgac
agat
acaa
acta
tttct
atcc
tgag
tagt
aatc
tcac
acttc
atcc
tata
gagt
caac
cacc
acag
atag
gaat
tcct
tattc
tttttt
taat
tttttt
aaga
caga
gtct
cact
ttgttg
ccca
ggct
ggag
cgca
gtgg
ggtg
atct
catc
tccc
tgca
acct
ccgc
ctcc
tggg
ttcaa
gcga
ttcttg
tgcc
tcag
cttc
ccaa
gcag
ctgg
gatta
cagg
tgcc
cgcc
acca
cgcc
cagc
taat
ttttg
cattt
ttagt
agag
atgg
ggttt
cacc
atgt
tggc
cacg
ctcg
tctc
caa
ctcc
tgac
ctca
ggta
atcc
gcct
gcct
tggc
ctcc
caaa
gtgc
tggg
atta
caga
catg
aacc
acca
cgcc
tggc
tgga
atac
ttact
cttg
tcgg
gag
attg
aacc
acta
aaat
gtta
gagc
agaa
ttcat
tatg
ctgt
ggtc
acag
gggt
gtct
tgtc
tgag
aaca
aata
caat
tcag
tcttc
tcttt
gggg
tttta
gtat
gtgt
caaa
cata
ggac
tgga
agttt
gccc
ctgt
tcttt
tttct
tttga
aaga
acat
cagt
tcat
gcct
gagg
catg
agtg
actg
tgca
tttga
gaat
agttt
tccc
tattc
tgt
ggat
acag
tccc
agag
ttttc
aggg
agta
caca
ggta
gatta
gtttg
aagc
attg
acct
tttat
ttattc
ctta
tttct
ctttc
atca
aaac
aaaa
cagc
agct
gtgg
gagg
agaa
atga
gagg
gctta
aatg
aaat
ttaaa
ataa
gcta
tatta
taca
aata
ctat
ctct
gtat
tgttc
tgac
cctg
gtaa
1514
3929
304
C21
orf5
NO
tcca
caca
gtga
gagt
ccag
ggct
cagg
gctg
ctca
agac
caag
gcttc
ccttg
atgc
ttcac
ttcgg
gtga
tcag
acat
gggg
gacc
tgttt
atct
ttagt
aQ
gccg
gcaa
aggc
aatc
gtaa
aaca
tttgt
tgta
acttc
agga
ttgtc
actc
taag
aaca
cc
1515
3947
636
TSP
OY
ES
gtac
ggct
ccta
cctg
gtct
ggaa
a15
1627
4823
4N
Ocg
aggt
ttatg
ttgtg
ttagg
ctca
tgag
aagg
caag
aaat
tttat
tattt
aaat
gtttt
ctcc
agta
atgt
attg
gattc
aagc
cattg
agat
ttggg
gagt
gtttc
agt
gact
tattt
gggc
aaaa
caga
atat
ttgaa
attg
gcat
tgtc
tcag
aatc
tggc
1517
3630
746
ITG
AIl
NO
gact
gtaa
atac
gaac
ccaa
tctg
caca
ctcc
aggc
ctct
agttc
caga
agga
tcca
agac
aaaa
caga
tctg
aattc
tgcc
ctttt
ctct
cacc
catc
ccac
ccc
tcca
ttggc
tccc
aagt
caca
ccca
ctcc
cttc
ccca
taga
tagg
cccc
tggg
gctc
ccga
agaa
tgaa
ccca
agag
caag
ggct
tgat
ggtg
acag
ctgc
aagc
cagg
gatg
aaga
aaga
ctct
gaga
tgtg
gaga
ctga
tggc
cagg
caag
tggg
acca
ggat
actg
gacg
ctgt
cctg
agat
gaga
ggta
gccg
ggct
ctgc
accc
acgt
gcat
tcac
attg
accg
caac
tc
1518
3004
500
—N
Oct
catg
cttc
aaag
gcat
attc
tcaa
gaaa
cagg
tgta
atttg
tgca
gata
atca
cacc
tggg
aagg
gatc
acag
agaa
ggag
aaaa
agaa
agaa
atgg
ctta
ttctg
aggt
gaat
gtgt
c
1519
3552
925
DY
NC
lY
ES
tgcc
acct
tgat
gacg
cagt
gcaa
agag
gggg
caca
gaag
gaag
gcct
gatg
ctgg
actc
gcac
gagg
agct
ctac
aagt
ggttc
acta
gcca
ggtta
tH
lcc
gcaa
cctc
cacg
tcgt
gttc
acca
tgaa
cccg
tcct
cgga
ggga
ctca
agga
ccgg
gcag
ctac
atca
cc
1520
3649
810
—N
Oca
gcaa
gagg
tgca
ggttg
agta
tttgg
aggt
ggta
tattt
ggcc
agac
tttag
ccgc
atcg
atta
tgct
atttc
ttgct
ggct
gtttt
aaat
tttttt
taaa
aagg
agg
aggt
gtca
gata
catc
atttc
a
1521
2782
685
NO
aaaa
ctag
ggct
gtcc
aggc
ttaca
1522
3166121
NO
caaa
agcc
tgct
gctg
ggta
gata
g1523
2524769
NO
agtg
tagt
gcag
cattg
cctc
tgga
actg
gcct
gattc
ttgcc
tcat
aaaa
ttcttg
tgac
caat
aaat
gaaa
ttcta
caca
ttttta
tgtg
tcta
aaat
ccca
agac
tagt
atct
gaca
catg
gtta
gctc
tcaa
tata
tttta
gcta
ctgc
tatta
tcat
tattg
gtag
tact
atta
aaca
ttttg
aaac
agaa
aatc
acttg
gggc
tggt
cacg
gtgg
caca
cgcc
tgta
atcc
tagc
atgt
ggag
aggc
cgag
gcag
gtgg
atca
ctgg
agct
cagg
agttc
aaga
ccag
cctg
acca
aatg
gtaa
aatc
ctg
tctc
tacc
aaaa
aaat
acaa
aact
taga
tggg
cgtg
atgg
catg
cccc
tgta
gtcc
cagc
tact
caac
aggg
tggg
gtaa
a
1524
3101
675
AD
HFE
NO
ttaac
tgaa
agaa
ttacc
gctg
gcca
ttgta
gtgc
tgag
agca
agag
ctga
tcta
gcta
gggc
tttgt
ctttt
catc
tttgc
gcat
aact
tacc
tgtta
ccag
tata
1gg
tggg
atat
acat
ttatc
ttgca
ggaa
attc
ccca
aagc
tcag
agtc
cagt
tcct
tcca
taaa
acag
gctg
gaca
aatg
acca
ctat
gtta
gacc
ccca
ggc
tcga
cttc
aggg
gtca
gtgt
tcct
gtcc
caaa
cccc
acac
agaa
tact
ctgc
ctct
gtttc
atgt
agca
aatg
a
1525
3633
418
SIN
3AY
ES
cacc
tctc
acttg
cgta
tgaa
gaca
aaca
aata
ctgg
aaga
tgct
gctg
ctct
gatta
tcca
ccat
gtga
agag
gcag
acag
gcat
tcag
aagg
agga
caa
atat
aaga
taaa
acaa
atca
tgca
tcat
tttat
tcca
gattt
gctc
tttgc
ccaa
agag
gtga
tctc
tcag
atgt
ggag
gaag
agga
agaa
gaag
agat
ggat
gta
gatg
aagc
caca
gggg
cagt
taag
aagc
acaa
tggt
gttg
gggg
cagt
cccc
ctaa
gtcc
aagt
tact
gttta
gtaa
caca
gcag
ctca
aaaa
ttaag
agg
aatg
gatg
aagt
atac
aacc
tcttc
tatg
tcaa
caac
aact
ggta
tattt
ttatg
cgac
tgca
ccag
attc
tctg
cctg
aggc
tgct
acgg
atttg
ttccc
aag
ccga
acgg
caaa
ttgaa
gaag
aaaa
ccga
gaga
gaga
atgg
gaac
ggga
agtg
ctgg
gcat
aaag
cgag
acaa
gagt
gaca
gccc
tgcc
attc
agct
acgt
ctca
aaga
1526
3904
327
NO
ttcca
aagg
tcgc
taaa
gata
ctgt
tact
acta
ttgag
atat
tattg
gcta
cttc
acgt
ttaca
tagt
aaat
gtttg
cagc
atat
aaca
ttaca
gact
cata
aacc
cata
atta
actta
taag
tgtta
atgg
acaa
ctgt
gcttt
gattt
ttgcc
tttag
tgat
aaga
aaac
aaag
tagt
gaaa
tggg
tcac
tcct
caaa
gcat
ggaa
cattt
taa
ctttg
ccta
gtaa
ggaa
aaac
aaaa
caaa
atat
agca
atta
catg
tgga
accg
taac
ctgc
aaaa
gtaa
caca
aata
ttgtc
tcaa
aagg
taca
aata
ggttg
tac
ctgg
acct
taag
cagc
atg
1527
2652
509
FND
C3
YE
Sgc
acca
attg
acaa
cggt
tcaa
aaat
cacc
aact
acct
tttag
agtg
gB
1528
2441
680
—N
Oag
gcaa
gagc
ccttt
cgtg
caag
taag
caat
tttgc
tgcc
tccc
atttt
tttttt
tttttt
caat
ctgt
ttggg
acttt
gagg
gctg
gggt
ggga
aggt
agtg
gaat
gga
atag
ataa
acag
ccag
tcaa
gagc
tgtg
ggga
ggttg
acag
aattg
gggt
gcag
gtac
atgt
agga
taca
caga
agct
ttgtg
tctg
tgga
ggct
gtat
gagt
ctgt
gggt
gagc
agca
tgtc
taag
tggg
tgga
aa
1529
2694
895
NO
atgc
cttg
gaga
cgag
ccca
gaag
gag
1530
3048
170
C7o
rf44
NO
tcct
atca
tagc
tggc
accg
tgttc
atgc
ctgt
ggta
ttccc
tgta
tccc
tattc
ctgg
gaag
cagc
ccag
gagg
ggga
ttccc
ttggc
ctct
ccag
ggtc
agg
tcac
ctttc
ccttc
atta
gtgt
gatg
ggac
cttc
agaa
gaga
agca
ctttc
tgcg
gcca
aact
g
1531
3085
948
YE
Sga
ggcc
ggta
gtgc
tgaa
tgga
catg
gggc
tgga
tgaa
aaga
cttc
cgcc
tgcc
tctc
tcag
cccc
ggaa
tcgc
catg
agct
gcta
tt15
3233
0024
6C
PEB
3N
Ogg
ccttt
ccttg
gtgt
gaca
cttg
aact
agtc
gattt
tttga
aggc
tata
acct
aacc
tcga
ctat
ttgttg
ttatg
tgca
1533
3822
344
MG
C32
NO
ttgga
gaca
ccat
gtta
gtcg
tggc
tggt
ctcc
aact
cctg
acct
cggg
tgat
ccgc
gccc
cgcg
gcct
0715
3423
4552
0NNO
Oaa
gcct
gtttg
agag
cagg
actg
at
1535
2549493
—NO
gctc
cggc
cgag
agaa
gcca
ccat
tcct
ttc1536
2788417
ZNF827
YES
gaag
agtta
ctgg
aagc
ggca
catg
gtga
ttcac
acag
gttta
aaaa
gtca
tcag
tgtc
cgct
ctgt
ccat
tccg
gtgt
gctc
gcaa
ggac
aatc
tc1537
2544515
ADCY3
YES
acta
tgtg
acct
tcat
ggtg
gggg
ag1538
3329911
NDUFS
YES
ccac
tgtc
agac
cacg
gaat
gatg
tggc
ccac
aagc
agct
ctca
gcttt
tgga
gagt
atgt
ggct
gaaa
tcttg
ccca
a3
1539
3467685
—NO
cagg
gagc
tgca
tgtc
caac
tgttc
tcag
ggga
gaaa
gtttg
gcct
tctc
tagg
aaca
gacg
gaga
gaag
gcaa
ggga
tcct
aagg
gaga
tggt
gcaa
ggg
ctttt
gggg
agttt
gggg
tttat
ccaa
aatg
tagt
agag
aaga
gggt
tcttg
aaga
gtttt
cagc
aggg
atgg
gaca
tgtg
ggat
ttgtg
taaa
gcag
gga
gcaa
acta
tggc
tcaa
gggc
ctgc
tttgg
ctca
ctgc
ctgt
ttttg
gaaa
caaa
gtttt
attg
gatc
acag
acac
acca
ttagt
tttct
tagg
agct
acaa
aggc
1540
2535364
NO
atca
agga
agtc
gagt
aaaa
cact
tct
1541
2979944
SYNEl
YES
gcag
caaa
tagg
tgaa
agat
tgaa
tgaa
tggg
cagt
cttc
agtg
aaaa
gaac
aagg
aact
ctgt
gagt
ggttg
actc
aaa
1542
3210881
—NO
atgg
ctct
acag
cacc
ggat
tttgc
atca
gaat
ccct
tcga
gtttc
ttatc
ttcac
catc
tgta
gcat
ttaga
ttgaa
aagg
ctta
cctg
ggttt
tcttt
ttttg
tgttt
tat
gatc
tcaa
gcta
ttggg
agta
gacc
aattc
gtgt
ggct
caat
atga
gagt
taaa
gtta
cgtc
agaa
ggta
atga
aagc
tgat
tttgc
ccttt
tatta
acta
tagt
atttt
aatc
ctgg
tatg
taaa
atta
ttact
cattt
gtgc
ctaa
tgca
gata
atgg
gaag
cagc
tcac
tgtg
gtaa
agta
gggg
catta
agca
cagg
tgca
ac
1543
3984493
NO
ccag
ggaa
atat
ggga
actg
tgag
aggc
agca
ggca
agtta
tggg
caga
ggca
gata
caat
gagt
tcat
gtgg
gaaa
ttt1544
3294206
...
NO
tttttt
ctgc
actc
ccat
tgct
tta1545
2389550
__-
NO
atgt
attg
gtgg
acag
catc
tagg
aaga
ga1546
2593736
—NO
gctg
taac
acttt
acta
tgtc
tatg
caaa
taca
cattt
aaac
aata
tagg
aaga
tttca
ataa
ccaa
acta
actg
taca
aaaa
ataa
acaa
atta
acttt
atta
catt
gcta
cttc
aact
tcag
tcct
taca
ttgat
ttgttt
tttaa
aaaa
tacc
agttt
gaaa
caca
ttact
gaaa
gtga
gtgt
acac
aata
aata
gaaa
atag
ggat
gcat
agt
gctg
gaga
cattc
aacc
aact
tatc
ttcat
ctgt
tgcc
tact
gttg
1547
3315642
ATHLl
YES
gaat
gcag
acgg
gtca
ggcg
ctgt
gaac
ttcct
1548
2638195
—NO
gtca
acga
gggg
aatc
cact
ctgc
c1549
3973655
—NO
cagt
ttggc
aaag
agca
tgcc
actg
ttcct
gctg
tcta
tcaa
ag1550
2409486
NO
tctg
aaat
ccga
catc
ccag
tgtc
cattc
tggg
gcac
agct
tccc
ctcc
tttca
agtc
tctc
tcttc
ctcc
ctct
ccac
aaca
caga
ggcc
tttag
ctcc
ccag
ccc
ttact
gttc
agtg
cgtc
cacc
cctc
cttg
tcta
gacc
ctga
agta
tatta
catc
agct
actc
tcttg
tgag
ctcc
tcca
gtac
tctg
cccc
ttgtc
cttc
tattg
ccct
agtc
tagg
1551
2635963
PHLDB
YES
cctg
cctg
tccg
gaag
gaag
acttt
gattt
gcgg
agcc
atgt
agag
actg
ctgg
ccac
aata
ttgac
acct
gtta
ccat
gtat
caat
caca
gaga
agac
ctgc
2cg
agga
ttcct
catc
aaaa
tggg
tggg
aaaa
ttaaa
acgt
ggaa
aaaa
cgttg
gtttg
tttttg
atcg
gaac
aagc
gaac
attc
tc
1552
3730508
TANC2
YES
tcca
caat
gttg
ctgc
cttg
ctct
gccg
ctca
cctc
agct
gaca
gcct
atcg
ggag
cagc
ttcttc
ggga
acct
cacc
tgca
gagc
atgc
tgag
ccttc
gttc
ctgt
gttc
a
1553
2512118
NO
ggca
tatg
tttgg
tatc
tctg
gtcc
ttagt
tcct
gaac
aattc
tggg
ctaa
tgct
gtag
tca
1554
3181032
—NO
tgct
tcta
acaa
atcc
gccc
tgtc
acat
gcat
atgc
tttct
caac
attc
actg
tcaa
aaat
gccc
tacc
agga
caca
acat
agtg
acaa
taac
acat
tcaa
tttaa
ggat
aatg
gttg
agag
ggaa
ccag
ctaa
gaac
ttaca
ata
1555
2401404
ASAP3
NO
aggc
ctct
ggat
ggca
gaag
gaaa
g1556
2496802
MAP4K
YYEESS
tgat
ctttg
gtct
tgtg
gcat
taca
gcca
ttgag
atgg
ca4
1557
3048180
___
NO
tcag
attta
ctttc
ccgg
agta
gtca
cagg
actg
ggca
gtga
1558
3436406
—NO
ctct
gtgc
ctta
tcag
gaga
gggt
gact
ctgt
gccg
acttt
acaa
ggct
atgg
gaag
gaca
c1559
3840203
LOC400
YYEESS
gctg
tcgt
agag
tggc
aatc
atca
gtgg
c713
1560
2633650
NIT2
YES
ctgg
ctga
aata
cgcc
agca
aatc
cccg
ttttta
gaca
gaag
cgat
caga
cctc
tatg
1561
2835873
—NO
tgat
caaa
tagt
gccg
tctg
cctg
gagt
acag
catg
gggg
aaga
ggttt
ggct
gtgt
tttga
tgta
gtca
ctgc
ccat
agtg
ttgta
gttg
cttc
atttt
gatg
tgt
cata
cagc
taaa
gatg
ctcc
cttta
ggtc
atttt
tgttg
ccgc
tgcc
tctg
cggc
ttgtta
ctac
tgtt
1562
2849540
NO
agaa
gaca
agag
gaag
ccgt
tctg
ctgg
cattt
cgtc
ggga
tgaa
gaca
acac
agcc
a1563
2986834
___
NO
ctgg
ccgt
gtttc
ctgt
gagt
tttga
gg1564
3222790
—NO
tgct
gttc
tgct
tctc
ccat
tcttg
cata
gagt
aatg
gagt
tagt
gaaa
atgg
gcttt
ctttt
acaa
caca
aagt
gttg
ctct
tcttt
ctcc
tagg
gaga
aagt
agta
agct
ctgt
gggt
gtag
gagg
agga
aaat
agca
ggag
ctta
gcca
agga
gggt
gctg
cagc
acat
gtgg
gtct
tcat
agtc
cgga
aggg
tgag
tgac
ac
1565
3317133
NO
ccaa
aagg
actg
tcgt
ggca
gagc
agcc
atgg
ccaa
gtgg
actc
aggg
ttcat
tttcg
gaac
ccct
ttaag
ctta
taga
aagt
cccc
atca
tcac
ccag
ctc
ggcc
ccgg
ggca
ccga
gtcc
gtct
ggtg
tggg
gcgt
ccttt
cgac
agag
cctc
ctcc
acaa
accc
agac
tctg
gcttc
cacc
aggg
cccc
ggac
acct
gct
tcct
cagg
gcag
acag
agag
gggt
gaca
cact
gcac
cctg
tggc
tggg
aaaa
gggg
cctg
ggac
tgtg
cccg
tctg
ggac
ggag
caga
gcca
cca
tccc
atgc
aggc
acta
tcac
cgca
ggca
aa
1566
3600340
—NO
aggg
ccac
tcaa
ccat
gcgc
tctg
ggag
gagt
gggc
tgag
atgg
ggct
gcttg
gtca
gccc
tgac
tgtg
ctttt
gatc
tttgg
gcgt
cacc
1567
3929187
—NO
ctgg
ccac
cagg
gaac
agtta
cgta
aatta
tttga
tgtg
tcttt
ttaac
aaga
tatta
ggaa
gtgc
tcgg
cattg
ttagg
atttt
ttttg
gcca
cgtg
gaaa
aaca
cac
atga
tata
aagc
aaag
gaaa
ataa
ggca
gtct
gcaa
atgg
atct
gctta
gaac
tgcc
atta
cgta
a
1568
3989061
—NO
cttg
ctct
gtca
tacc
ttgtg
gaaa
tgta
aaag
atca
atga
cagc
tagc
ctttt
caca
caat
tattt
atgc
ctcg
gccc
tcta
attg
acac
agtta
cccc
tagc
act
catc
gttg
tgat
cacc
tgca
gcca
agac
gctg
atttt
ctgc
tcct
atag
cctg
gcaa
ccttg
tggt
acat
ggga
ctac
gtcc
ttta
1569
2735839
MGC48
NO
tata
cata
tgtc
tgcc
tcta
ataa
a628
1570
3660480
___
NNOO
tgag
accg
caat
tcca
gcac
gcat
tggc
agga
cct
1571
2655657
___
NNOO
tggg
agga
ggac
gtct
ttggg
attc
cttc
tcttc
atga
gctg
c1572
2848543
NNOO
agca
taaa
gaaa
gttg
gcct
gggt
tttcg
tgac
tcgg
cagg
gtcc
actc
agttt
aaaa
tgc
1573
3397172
—NO
ctgc
cgtg
ggcc
cattg
acca
gagc
gacc
tgct
gtct
gatg
cc1574
3790842
___
NO
atgt
ggcg
aatg
ggta
gaag
aatg
t1575
2824108
___
NO
ttttc
tgat
tcac
tacc
actg
ctgg
tgtta
cagc
gtga
acgt
gacc
tgaa
gaca
a1576
3361136
___
NO
agaa
gcag
gtat
aagc
caaa
tgttg
agag
ttaga
gaat
gcca
gcct
gcag
ttgtc
atca
ccca
ggct
tagt
acc
1577
3971846
___
NO
atgt
tcac
tcag
gctc
cttg
gtgt
g1578
3294447
ANXA7
YES
gtga
gtttt
ccgg
atat
gtag
aaag
tgg
1579
3850170
S1PR2
NO
gttg
cact
atttg
gggc
acag
aata
atca
ccaa
aagt
gaga
aaaa
cgag
tttgg
gtgg
ctgg
ggag
gact
ttggg
actc
ttgat
gcaa
ggcg
caac
ttgag
aaaa
ttctg
ggtg
tgat
atttg
caca
gaca
ccct
ccttt
caaa
aaca
gcca
cccc
ccaa
gcta
ttctc
agct
ccac
acct
gcag
cccc
agct
aagg
tacc
aggt
ctcc
tgag
caag
gcag
agag
aagc
cttg
agcc
ttctc
tgtg
tcttc
tttca
agaa
cccc
gctg
tgtc
ttcttt
caag
atttt
tttttt
gaga
cagt
ttcaa
gattt
ttgt
tttgt
ttttg
agat
ggag
tctc
actg
tgtc
accc
aggc
tgag
gtgg
cagt
ggttc
aatc
tccg
ttcac
tgcc
acct
ccac
ctcc
cggg
ttcaa
gcga
ttctc
ctgc
ttcag
cctc
tcga
gtag
ctgg
gact
acag
gcac
ctgc
cacc
atgt
ctgg
ctaa
tttttg
tattt
ttagt
agag
acag
ggttt
cact
acgt
tggc
cagg
ctgg
tct
caaa
ctcc
tgac
ctca
agtg
atcc
gccc
gcct
cggc
ctcc
ccaa
ttgct
ggga
ttaca
ggcg
tgag
ccac
tgtg
cccg
gcct
tcttc
tttca
agtta
tata
gaa
tgga
gcat
gggg
gtgg
cagt
ggct
aggg
acat
ttcct
gggg
acac
tctc
ccct
aacc
cccc
agaa
ggac
ttcac
aaaa
acct
gtgg
ataa
tgga
aggg
atgt
tacg
gtac
aaac
gtat
attta
tgtg
tgtg
tgtg
tgta
tgtg
tgtg
cgcg
cgcg
cgtg
tgca
cata
ggcg
tgat
gtct
gtga
ccct
c
1580
3918219
NO
tgcg
gattc
gtct
gctc
cacg
cagg
gtgc
tctg
acgt
actc
tgtg
tggc
ctgg
agct
caga
aacc
tgtg
ctgg
actc
agtg
1581
2370552
NO
gttg
gggt
gaac
tgaa
cctg
tcca
ttt1582
3476124
NO
gagc
ggga
tcga
ggcg
ttttta
ataa
ttcga
gttg
ggaa
gacc
cgga
tggt
tcat
caaa
atga
tggt
gttg
agca
cccg
ggag
gcag
acat
tggg
tattt
gtt
caca
atttt
gcac
ctgt
aaga
aaat
gttc
cttta
taac
tttca
agga
ttgcc
gtga
gcgt
ctttg
gggc
ctttt
cattc
atgt
acaa
gtttc
attc
atta
ctat
tcaa
ctt
ggta
gact
tggt
gaaa
aatg
taaa
agca
ttctg
tattt
ggat
cagt
cttg
ggaa
ctaa
atttg
tcgt
gaat
tggg
gcct
gtca
caga
ttgcc
tcca
gtgt
gcat
cctc
cacg
caga
ccct
ccttg
ctca
gtct
ccct
ctgg
cgcc
catg
ggga
agcc
gtgc
tgtg
gttg
gcag
tgcg
gcca
tctg
ggct
gccg
caac
gtgg
cagg
gca
ccgt
ccag
caag
actg
cccc
ttagt
gcag
ggct
aagc
aagg
aaaa
gaag
aagc
ccaa
aaag
gcgg
gttc
tttttg
atac
ttgta
attc
aggc
gtcc
tcc
agag
tg
1583
3929
424
C21
orf6
NO
ctgt
tgct
gattt
tggt
gaca
catc
tctg
ttgat
atca
gtaa
atat
caat
tgat
aagt
aaaa
aaat
atgc
aagt
ctta
acttg
gatg
tcttt
catc
ttgca
tgtg
ctca
c6
ctcc
ctcc
cctc
tcac
ccag
tcgt
ttggg
ttgttt
gcct
cccc
ttact
gtca
tatc
ttcat
agtg
tgga
acta
aaac
gtag
aaat
gagg
gaag
tata
gtgg
acag
atgt
ttccc
ccac
ccct
tcttt
tacc
tagg
caag
agat
tagg
ggag
tcttt
tttga
ttaaa
gaga
gtag
gtcc
aaaa
taaa
gacc
ctga
aaca
ctaa
aatc
tggg
gatc
ccca
acaa
aaga
actg
gctc
agta
ccta
gtga
tccg
acgg
agac
acct
ctgt
taga
cagc
tcct
gccc
acac
acac
agct
tcca
gtct
tgct
cttc
tat
atga
acag
gcag
ttaac
gatg
atca
taag
gacc
aacc
tgag
cagc
agaa
ccaa
aggg
actg
agga
aaca
gaca
gtgc
tggt
acaa
gtat
gcaa
gtttt
ctgt
aaac
agat
tact
ttcaa
acag
ttggg
agcc
ccag
ggag
ataa
taga
aggt
atcc
tact
caac
agga
gtat
atag
aatg
ctgt
agaa
gagg
aata
ttgta
aga
acaa
gaac
tatc
tcttg
aaaa
ctaa
aaat
atga
taac
tgaa
atat
aaat
ttaaa
tagg
attg
gaag
ctaa
agct
agga
aatta
ccct
agaa
tgttt
tttga
gaa
atgg
aaaa
tacg
aaga
ccaa
aaaa
ataa
aata
aaac
tatg
ctgt
gaaa
gaga
aaga
ttagc
atttg
agaa
ggcg
gaag
ttggt
cctg
ctca
gatg
cggt
tttca
gatg
cc
1584
2418445
—N
Oag
gtttc
aacc
caac
tgga
cctg
cttg
1585
2977983
EPM2A
YE
Saa
ttcta
ccaa
atat
ctgg
ctgg
gtag
1586
3074397
CNOT4
YE
Sttc
atcc
aatc
acag
tgca
cggt
cccc
ttttg
aagg
ggca
gtaa
caga
gtca
cagt
cgtta
ttctc
agac
aattt
tcgc
catc
ccaa
ccct
atcc
caag
tggg
cttc
ctcc
tttcc
ccag
ctcc
ccac
agac
atcc
agtg
actg
gcct
acag
cacc
agaa
1587
3320669
USP47
NO
cggt
ggag
cctc
atga
tctc
ttatc
ttttg
aggc
tgag
gcag
gtca
catg
caac
aaat
tgtg
accc
tgct
cccc
acaa
gtca
tgca
aagg
ttttg
aaga
gcttt
tac
cgtg
gggc
agat
gaac
ttgtg
tcaa
ccat
gcac
accc
tgtg
agaa
ccaa
gtac
ctgt
gtttc
taag
gcgg
gcac
tcaa
ggtg
aggg
gtgc
attc
tggc
cOC
Oaa
agaa
acaa
aagc
tgtg
gtttc
agga
ccat
gccg
tgtg
tagc
tgat
ctgt
acgg
gacg
tgta
tgta
agga
agag
caat
catg
atag
ataa
gaac
agtg
tgt
gaag
cagc
cttc
acac
taga
gtgt
ttggt
catc
tctta
taat
gtaa
ggga
aggt
acttt
aaaa
ttctg
ggaa
gatg
cgat
gaac
tcat
gtcc
cagt
caga
aaat
aat
ccaa
tgaa
ataa
gcat
tggt
tgcc
aggc
caca
gtta
ggaa
ttgta
1588
3562057
NO
gagc
aaaa
atat
tgtc
aggt
ttcttg
ctgt
ggttc
tgga
tgttc
agta
gcag
gctc
atttg
1589
3924690
DIP2A
YES
gggc
ctcg
tgga
gcat
tcgt
acttt
gagc
gtcc
acag
gtgg
cttc
tgtg
aga
1590
3029146
—NO
agtc
aact
cggg
gagt
caaa
ttagg
aaag
gcttc
gaag
gatg
ggca
ggac
gtaa
gaat
ccag
agag
gaga
ggga
tgttt
ggag
aaca
gaaa
gtgg
aag
gttc
ctgt
agga
aaat
gata
gagc
tata
cttg
gaga
ggca
agcg
gggc
1591
2480430
—NO
tcttc
actg
aagg
tgcc
agcc
tatg
ggtg
gact
cgtg
atct
agac
ctaa
aggg
ttagt
gtga
gttc
ctag
agga
aaag
aaag
ccat
gagg
caga
ttgtg
gaa
ggct
ttgaa
ggcc
aggt
ggag
ggcc
ttgaa
cttta
ttcca
taga
caat
gggg
agct
attg
a
1592
2923442
NO
ctgg
gaag
aact
tcag
gctg
ttgttc
aggg
catg
gata
tcac
cact
gtcc
a1593
3393487
FXYD6
YES
cctc
tgga
acct
gagg
cggc
tgct
tgaa
ccttt
ggat
gcaa
atgt
cgat
gctta
a1594
3879288
...
NO
gcttt
caag
aata
cggc
catg
tctg
cattg
ttggc
ttaca
cgca
gaca
cacg
ctcg
ggttt
ttcat
ttggc
aggg
tg1595
2889490
AGXT2
NO
catc
tcat
ccaa
atac
acgc
tattg
agaa
ggcg
agcc
tgac
ctcc
ctct
taca
gata
aagt
cagc
tttca
gagg
ctca
gggt
gggg
gggc
ctgc
ccga
ggc
L2
cata
atgc
tacc
cacc
ccct
cctc
ctaa
ccac
tggt
ctgt
tgga
ataa
ccca
1596
2915
334
DO
PEY
YE
Sag
attg
gttg
agag
tctc
cgttt
gcca
cagg
tgcc
aact
ctcc
attc
tcaa
gtgt
tcct
gtttt
tcag
agtg
ttact
tttaa
gaat
gtct
cccc
aaca
tctta
cctc
ac1
tctg
gcct
acca
tgat
taca
gaac
tt
1597
3186
622
—N
Oct
cata
ttagg
atgt
ggac
ctgt
ccaa
atct
tcag
ggag
tcct
ccct
cccc
aggc
cagg
caat
ttctc
cagc
acca
aggt
aacc
actg
ttttg
ttctc
tattg
cta
tgga
ttggt
tttgc
ttattt
gtaa
attta
tttgt
agac
ttggt
ccct
atca
ctac
gaat
tggt
tttgc
ttattt
gtaa
acttc
atat
aatta
gaaa
tcat
gagg
cattt
tcat
aagt
gtgt
tttca
acttc
tata
aaga
gaaa
caca
atgt
acct
tcttt
tttgt
gtcc
taaa
ccttc
tttttc
tcag
tccc
cttc
cctg
taca
tgca
ctttt
caaa
gtga
ttaga
ggta
caga
gaac
tcttc
ctct
aatc
taat
aaac
aaaa
tgat
acaa
actc
aata
atag
gtgg
aaat
gccc
ctca
ggttc
aatg
ctag
agag
aaac
tgca
ggga
atgg
tggg
gact
atgg
caaa
ctat
ggaa
a
1598
3720992
—NO
cattt
ctat
ggtta
ttcgt
ggaa
tgac
tcttt
gacc
acgc
ggag
aagg
caaa
acttc
agcc
atttg
tgttt
ttttc
ccct
tggc
cttc
cccc
ctttc
ccag
gaag
tcc
gact
tgttc
a
1599
3916840
NO
tggc
aattt
caac
agttc
gtca
ctgc
acgc
tgcc
tctta
aagg
1600
3959370
___
NO
catg
ggct
tctc
agat
ccag
gtga
tgcc
ca1601
2571999
SLC35F
YE
tggt
ttcac
tcag
cgca
ggcg
aatg
gctc
ttggg
attg
ttattc
ttctg
cttg
ttgat
gtga
tatg
ggttg
cttc
ctct
gaac
ttact
tcg
5
1602
3757901
—NO
tattt
cctta
ttaca
tgttc
caga
cagg
agtg
ctag
ccca
1603
4002896
___
NO
ctttc
atgg
agca
ccgc
tgct
cacg
acct
gtta
cagt
ctag
aaaa
atgg
aaaa
actg
aaaa
tggg
tatc
tatg
ccac
ctaa
1604
4013965
—NO
aaaa
tcta
ctgt
ggag
ggag
tgca
aa1605
2327098
—NO
atac
ttgag
agta
aatg
aact
gctc
ttgtta
cttta
ctat
gtca
atta
caaa
gctta
aaat
tcac
acaa
gcca
cacg
acat
attta
agaa
tgca
tgtg
ttggg
cca
ggtg
c1606
2381607
NO
gaca
tcag
agtg
gaac
ttggg
cgat
gctg
ccaa
catg
a1607
2995272
C7orf41
NO
cact
cccg
cttc
agtg
gggt
ttcta
tgga
gttg
tcttg
gtag
ccttt
gcca
ttttg
aattt
agag
tcca
ttttg
tggc
tgac
tattc
tctta
agttt
atgt
tgga
gaat
taac
attc
gctg
actc
gaat
gtag
agaa
ctct
gaat
gtat
taag
gata
ggttt
tgag
tcct
caca
ggtg
acct
tact
gagg
gaaa
gcat
ggca
gaga
agaa
atgc
agtc
tgca
ctttt
tatg
tact
tttta
agtg
tccg
taag
tgaa
aggt
tttgc
ttata
aagc
atga
atttt
aata
tcta
gtca
ttaaa
ctgc
acaa
gtgc
aaat
acaa
ggg
cagg
aaag
gata
atca
ctta
gcttt
ggac
taag
aggg
taag
agag
gccc
agaa
gcct
ttaag
tgttt
tgcc
atta
ctga
gtta
cctg
ggta
tgta
gcga
ctgg
ttc16
0832
6178
9SU
FUY
ES
gctc
caac
ctga
gtgg
tgtc
agtg
ccaa
gtgt
gcct
ggga
tgac
ctga
gccg
gccc
cccg
agga
tgac
gagg
acag
ccgg
agca
tctg
catc
ggca
caca
gcc
1609
3934
119
SIK
lN
Otg
tgcg
cgtg
cattg
atta
ctat
ccat
ttcttt
agtc
aacg
ctct
ccac
ttcct
gattt
ctgc
tttaa
ggaa
aact
gtga
acttt
ctgc
ttcat
gtat
cagt
tttaa
agca
gccc
aggc
aaag
atca
tcta
caga
ttcta
ggaa
ttctc
tccc
ctga
aatc
aaaa
cctg
gaag
acttt
tttttc
ttattt
tagt
tgag
aagt
ttcat
aaac
tgct
caag
gatta
gtttt
ccag
gact
ctgc
ggag
gaac
ggca
ggaa
gaac
ctca
gaga
gggc
agag
gtga
cttc
aaag
tgct
gggg
actc
cgtc
ctga
gggt
cact
tggc
cctg
agcc
cctg
cgtg
ccct
tgcg
gaag
ccca
gaag
cttc
ttcct
gctg
cacc
tccc
gtttc
cgct
gctg
ctga
cgttt
atgc
atttc
atga
tggg
gtcc
aaca
agaa
cacc
tgac
ttggg
tgaa
gttg
tgca
atat
tgga
ggct
gact
gtag
ggct
gggc
agct
ggga
gaca
ggct
catg
gctc
atgg
ctca
tggc
tcag
ggc
ggtg
cctg
ccct
gggc
cggg
accc
ccct
cccc
accc
ccca
ccta
ggct
ttttg
ggttt
tgttc
aagg
aagg
taaa
gtga
gagg
tttag
gtca
gtgt
tttta
agttt
ttgttt
tttttt
taaa
gcaa
atcc
tgta
tatg
tatc
taca
tggg
agac
aggt
agac
acta
ctta
tttgt
taca
ttttg
tact
atac
gtttg
tgttc
cagg
tttca
gct
tccc
tcgc
tcct
gttg
ttaag
aagc
gtcc
ctgt
cagc
acag
gtgt
gcat
tgag
gaag
gggc
ccca
gggc
cttc
gctc
cctc
agca
ctgg
ggtg
gagg
cgg
cagg
aagg
ggcg
gccc
ttacc
tggc
aggt
ctgg
gcgc
acct
ttagc
aggt
ggac
tccg
tggg
gctc
cacc
agcc
agaa
gcct
ctgg
aagg
caac
gaag
gcaa
tgct
gctc
cctg
agtc
cagt
cccc
gccc
ccaa
accc
agcc
cagg
tgcc
ttcag
ctac
ttcgg
cttc
ttaaa
ccct
gcag
tgtt
1610
2748
304
NO
actc
aagg
tctg
tcca
tgct
ggtg
actta
gagt
agtg
ggat
ttaaa
ccag
agga
tgca
cact
tgga
ggca
cttg
aaga
tgttt
caag
ggac
atgc
aggc
attg
aata
tttaa
agga
aatta
attg
ccga
ttctta
gatc
ttgag
tgtta
atca
gttc
taac
actg
accg
tcct
gaga
aaat
gcct
atgg
ccca
ggtg
acct
gctg
gtttt
ccttt
caca
ccttc
tcttt
catg
atta
ccat
tcttt
aaag
ggaa
gaca
tacc
tttca
ccta
cccc
ttatc
agag
tgcc
aagt
aggg
ggat
aatta
aaat
actg
ctgg
ggct
gctg
ggaa
agtg
agtg
actc
ta
1611
2358329
TARS2
YES
ctcc
agca
ccca
tgtc
ctgg
gggc
agca
gctg
aaca
attc
ctag
gtgc
tgttc
tctg
ca1612
2636804
NO
tctta
tcac
aaca
ctgc
ggcg
ggaa
aatc
agga
aatg
ggttc
acct
ttcag
cagt
cacc
acag
taaa
cttc
attg
agaa
cata
catg
ccca
gata
tctc
ttctg
tgca
catg
tata
tgtta
acag
ctag
ccat
tcag
ccaa
cagg
ta
1613
2907584
KLHDC
NO
cagg
gaag
tcac
taat
ggga
gagt
ggga
ggta
tttga
aaag
gggg
tttcg
tggg
tagt
ttttg
tcct
acttt
catc
tctc
ttttg
atcc
cgac
ag3
1614
3185561
NO
gcga
gtcg
actc
acct
acct
ctttt
caag
ccgg
ccta
gccc
cttc
ccgg
aacc
tcgg
ctcc
cccc
caac
gaaa
ctac
tgct
aagc
caac
tgga
ctac
acttc
c
1615
3752751
—NO
atgg
gttc
aaga
cctg
tttat
gtgc
gaga
acaa
cagg
ctttg
gtgc
tggt
gact
cacc
aaga
tggg
aaat
acag
gaaa
agaa
gcag
ggat
gtgg
tagc
agag
tgt
1616
2366872
—NO
ttccc
atcc
ctgc
tcta
gcag
cctg
1617
2389086
NO
tgag
agag
gtttc
tattg
ttcac
tttag
aaaa
atgt
aggt
caat
gtaa
tggg
tgat
atct
gttc
ttaat
tattc
taag
aatta
aatc
ctgt
atgt
tatc
agtc
acct
atta
tatg
gcac
accc
tcag
ggg
1618
2523976
NO
aggc
aagg
ggta
aagt
ccag
gatg
aggc
1619
3394423
THYl
YES
gcag
ttcac
ccat
ccag
tacg
agttc
agcc
tgac
ccgt
gaga
caaa
gaag
cacg
tgct
ctttg
gcac
tgtg
gggg
tgcc
tgag
caca
cata
ccgc
tccc
gaa
ccaa
cttc
acca
gcaa
atac
a
1620
3510
096
POST
NY
ES
aggg
agaa
acgg
tgcg
attc
acat
attc
cgcg
agat
catc
aagc
cagc
agag
aaat
ccct
ccat
gaaa
agtt
1621
2974447
—NO
atgg
atta
cggg
ctttt
atgt
gggc
agca
acttg
tgcc
tcag
1622
3000375
—NO
gtca
ctttt
gtgg
cctc
ttggc
tgta
ccttc
cctg
ggac
agga
cacg
gtcc
tttct
1623
3498077
—NO
gcag
ttatg
tata
caca
gggc
aatc
atca
gacc
ataa
tcat
tact
gttc
gagg
ccag
aaca
gaga
cgac
tagc
tctc
tgtg
tgcc
tttcc
caag
tctc
accg
tgat
ggac
tgtc
ctct
c
1624
3670758
—NO
gtct
tcag
agct
agcc
ggct
gtgt
cttc
tccc
cgttt
ctttc
actg
gtga
ataa
ccttt
cgttt
gaac
cagc
cttg
attg
ttctc
tcca
tgttc
cact
gtgg
ctcc
cca
tgga
cact
gcta
ggtg
gaat
ctct
gaga
aaac
aggc
acttc
gcag
tcgg
cgca
tcttt
ctct
gcca
gctta
ggga
ttgcc
gtca
aggt
ggag
gaag
ggta
ttttg
gcca
tgttc
ggag
tcgt
tgttt
tta
1625
3418194
—NO
cagt
tagc
tgta
gaga
tgtg
attta
gcaa
agttg
gtta
taaa
gtgg
gtttt
tgta
aattg
tttcc
ttatta
ctgt
tcttt
gtag
aattg
aaga
tgta
ttctc
tccc
agca
aaa
tagt
tcac
ctgc
agat
cattg
aaaa
gtttg
gctg
aaga
aagg
ggtta
ttttg
gtgt
gggg
gtag
gagg
cttc
tgtg
ggct
gggc
atgg
tggc
ttaca
tttgt
aatc
ccag
cact
ttggg
aggc
tgag
gcag
gagg
atca
cttg
aggc
cggg
agttc
aaga
cttg
cctg
acca
acat
aatg
aatc
tccc
atct
ctac
aaaa
aga
atca
gcca
ggcg
tggt
ggcg
caca
cctg
tagt
ccca
gcca
ctca
ggat
gctg
aggc
ggga
tggt
ctct
tgaa
cttg
ggag
attg
aggc
tgca
gtgg
agca
gtac
tcgt
accc
1626
3687863
ZNF764
YES
gagg
tggc
gaaa
tgtc
agac
acaa
acgg
ac1627
2438168
___
NO
ggga
atat
agag
aggc
tgct
gaga
g
OC
ggct
ctct
gtac
caac
aata
taac
agac
tttac
agta
aaag
aaat
agtg
gcag
ggtc
agat
gatta
caca
aattt
ggta
gggg
cattt
gtat
gcca
gtca
acaa
1628
3059966
KIAA
13
YES
24L
ttattc
cttc
tgaa
agta
aggg
tttcc
gagc
agcc
ttatc
atca
caat
c
1629
3281916
___
NO
aaaa
agta
aggt
ggat
atgt
gcag
tggc
t1630
2838994
___
NO
gtag
aaag
ggga
atgt
gtac
tgct
agag
gtta
cggg
ctttg
a1631
2724513
UBE2K
NO
gtgt
acca
agac
tagc
aaga
gtttg
cttc
agga
ttttg
ttgaa
taat
taag
ataa
tattt
tgag
tgtg
tcag
ggcc
attc
aaat
tgttg
gtgt
tgca
tcac
agct
acct
taac
tgttt
ttaac
atgg
atcc
tctg
tgcc
tgtg
aattt
acttg
catg
cttg
tact
tgac
ttctta
ggat
gggt
agct
gaaa
agac
cacc
atttt
aagc
atttg
aga
attc
ttaaa
tatg
aaat
ttattc
agaa
ttgaa
gatg
gtga
ccta
ttcag
agcc
tttttg
tcct
tgtc
aaca
gact
ggga
cagt
gtct
gattc
cccc
ttcac
cccc
ccc
accc
ccgc
cttg
ccac
acac
agct
aata
ttcta
atgg
taaa
tttct
ctgt
atca
ggtg
ggga
aatg
tgct
gaag
gaca
gtat
gtat
ccct
tgct
tcat
tttta
ggtc
gtag
gtttg
gaat
gtct
tgtc
ccag
ttcttc
aaac
actc
ttaaa
tttttc
ttaag
taat
gtaa
aaat
ggaa
ctgc
caat
tttat
ttctc
ttgca
aaaa
tagt
aaat
acttg
atgt
taca
ttattc
ccag
gttta
atga
aaga
accc
aact
tagt
ttttc
agtg
aattt
gaca
ccta
tttttt
agtg
atga
aattt
ttcttt
gaga
actg
gcaa
ggat
gcag
tcag
ctgt
ttgc
agttt
ttagc
ctga
1632
2415
597
NO
ccct
gtta
atct
tgtg
gtca
ccat
acac
tcttg
ggtta
gctg
tcaa
atta
gggt
ttctc
cacc
ctgc
ttttg
tata
atga
agttt
ctaa
tcct
aaat
gcag
gatg
ccc
cattc
tgag
cctg
tttgt
attta
acag
aaaa
tagg
gattt
tcgt
ccca
ttttta
tctta
ccaa
ttcac
tccc
caac
tgtg
ccag
aatg
ccca
gagg
taat
agcc
tttt
ggaa
ggag
ctttc
ccat
gcta
cattc
cagc
aggg
taaa
acaa
tctg
aaaa
tgttc
ttgcc
ccttt
tatc
actg
agttc
tcca
tatc
ccag
gcta
tttta
taaa
taga
atttt
tccc
taca
gaaa
atat
gtaa
agcc
attg
cgaa
gaga
aaac
ttttc
taca
cata
gaaa
acat
gata
atttc
tcat
tagt
tttgg
aaca
atac
aggt
ctac
tcta
agttg
tttct
tattc
tgga
gtac
aaaa
tttgc
atttt
ataa
tgta
aattg
ggac
actg
tata
ctag
ctttc
tc
1633
2818227
—N
Oca
ctat
gtca
actg
ggtg
tcttt
gcta
agttt
ggca
tcaa
tatg
gtga
cctc
tcgg
gagc
aggg
gacc
acta
ggtta
cata
agga
gggg
tgaa
ctgg
ccca
gat
tgga
aatg
gagc
aggt
caaa
actc
ccat
gcta
acca
gtag
tgag
atgg
agcc
t
1634
3412324
IRAK4
NO
atat
acac
ctat
ctca
acca
tttttt
taac
tgat
tttttt
ccta
aata
ttcttc
tttac
cttta
acaa
ggca
tagg
ctgt
tgca
ggac
agtg
gtta
ttaaa
gcat
gggt
tgaa
cttc
caaa
atat
a
1635
3598210
ANKD
YE
Sgt
cggg
gcca
tatg
gctg
tgct
gcag
cgac
ttgtg
gaca
tcgg
gctg
gacc
tD
lA
1636
3728
164
—N
Oct
tgct
aacg
ctcg
ccag
agaa
atat
tccg
tgga
ggtg
ttaaa
acaa
gact
agtg
gaaa
gaaa
tggt
aact
taaa
aaga
aaga
aaat
ccat
ggca
gtga
agaa
caga
tttca
tatta
gtac
tgtc
ataa
tattc
tcta
atgt
atca
tata
aata
aaaa
tctg
tttag
acca
gaat
tttaa
taag
tcac
caaa
ggaa
ggttt
tgaa
tcct
aca
gttta
tgta
tccc
ccttt
gcga
tgta
aatg
ttcac
atct
caag
tgaa
gtag
agtg
ggga
gacg
cttc
ggag
acta
gagg
ctgt
taat
tagt
gagg
tata
ttttg
atca
gggc
aaga
agac
tggt
tcat
tcta
tatc
ttcgt
atttc
tctg
gatg
atta
tttca
ttttg
taaa
atat
gtaa
gagt
tccc
cccc
agcc
cctta
tcttt
ttaaa
aatt
aaaa
tggt
catg
ctct
tcta
aaaa
gatg
actta
atttg
aagg
ctttg
gaga
catg
agga
tgat
gggg
atcc
atca
ctcc
ctgt
tcttt
gctc
ccttt
ggac
tgga
gtg
ttggc
tgtc
tgga
agag
ttttg
cttg
agct
cgtg
ggtta
ttttc
ttcta
tgtg
gagg
aacc
aaga
gact
cttta
gcat
ctttc
agca
agag
cgag
gtct
gggt
gta
ctca
ccat
ctgt
ctta
atta
tcct
tgtg
tata
agaa
aaca
ttccc
atct
tttcc
caat
tgcc
atttc
cttc
ctta
acttt
taga
ggcc
agaa
cctc
atct
gttg
atgc
agg
gaga
tctc
acttc
tttaa
ccta
tttct
agttg
ccttc
agtc
acag
cggg
gtgg
ccag
gggc
gagt
cact
taag
cctc
ccca
aggc
tcag
tttcc
tcct
ttgta
aaag
aggg
agaa
ttata
gtac
tcac
tgcc
tgca
gtta
gttg
tttgg
agga
taaa
tgca
aaat
tcttc
actg
ggag
tctg
gcat
aaaa
cagc
actc
cgca
aatg
tct
gttg
atct
tatta
ttata
gaat
tatta
ttttta
ttatta
ttattt
ttaag
acgg
agcc
tcac
t
1637
2779
706
NO
acct
ggta
aagt
gaga
agag
tgat
cagg
agaa
gata
atttg
cata
gtat
atat
agat
ttcta
ttcca
tttat
ataa
aacc
cttg
aatg
gtac
ctta
gtag
aaac
aat
ataa
aata
aagc
taat
atgg
gtta
gatg
ataa
tgtg
gtta
agtg
gattt
tttttt
cctg
tctg
aaca
aaag
aact
gaca
attta
atca
atag
accc
ctgt
cagt
ctgg
agag
atgc
ttcta
gtgg
tcag
cacg
gcac
tctg
gtct
ttgtg
1638
3394
582
—N
Oaa
cagc
tcaa
catg
tgtc
ggga
tgtc
cagg
gaa
1639
2429
120
TR
IM33
YE
Sca
tattt
tgaa
agca
cggt
gtga
tcct
gtcc
ctgc
tgct
aatg
gagc
aata
cgttt
ccat
tgtg
atcc
cacc
tt16
4034
2875
6N
Oca
gcta
aggt
aaat
ctgc
ttggt
ccaa
acaa
aaca
aaac
aaaa
caaa
acac
caga
ctgc
aaca
ataa
cagg
aaaa
gatc
ctct
tcag
tgat
ttatg
ttgttc
tct
tact
ttcat
aact
agttt
gaat
gcaa
ggct
ggta
aagg
gata
caca
gaga
atca
ttattt
taaa
taac
aaaa
gcca
ttcaa
aact
ctct
ctac
ctgt
caag
gat
gtttt
atgc
tccc
attc
ttattt
gtttg
gcag
taaa
cata
ccttg
ccca
cagt
cgcc
agca
tcaa
accc
acag
gaca
agac
attg
catg
cttg
gtca
caga
actta
tcag
cgag
ccag
gaat
tcgc
acat
c
1641
3595
186
NO
ttgttt
aggc
aaag
cacg
ttatg
gcttt
ttcttg
ccta
gttta
ggtc
tgtg
ctta
atga
ctac
tgga
atag
cagc
aaac
ataa
catc
ctga
taac
atttg
ggag
gaa
aagc
cata
tttga
aaat
tggt
gaaa
tgca
ataa
tcttt
aaac
cagg
aatc
atca
gtttg
ctaa
taaa
aggt
atat
ggct
ccaa
gtta
cgta
ttttg
tcag
ggtc
ctc
tggc
atttt
cact
agtc
atttt
gcta
gtgt
gtac
aatg
attc
ctat
cttg
ctgt
aaaa
cttg
gtga
tttttt
tata
ttgttg
aact
aggt
gaca
tgaa
tattg
agtc
agat
cac
ataa
tctta
gagt
aaca
cata
agtg
atta
tgta
gcgt
tgtg
ggtg
gaaa
caat
tctg
gata
aaaa
gtgc
atga
cagt
tcag
aact
actg
tgca
ggca
gcct
gtc
tgaa
atca
ggttt
aaaa
attg
gccc
tgtc
tcca
gctg
actc
gtttt
gtc
1642
3959
259
NO
ctca
agca
cagt
catg
attg
gtttt
gtat
tttct
tctc
cagg
ctac
ataa
taat
tttttg
acct
ggttt
tcttg
atgc
ctta
taat
tctg
aatg
ctgg
acttg
tctc
tttat
tacc
tgcc
agtta
gc
1643
2604457
—NO
acag
cagg
aatc
tgtc
agtg
tccc
tga
1644
2843218
PRR7
NO
ggcg
cgcg
cacg
acttg
agac
ctgc
cacg
ggca
gccc
ccgg
ccgc
gggt
cccc
gagt
gacg
ctgg
cggc
acct
gaga
gtgt
ggcg
cggg
cccg
gggc
cacg
cagc
ggag
ccca
gtgt
ccag
tgaa
gcgt
ctga
ggac
ccgc
cgcc
1645
3039818
NO
gctg
acct
attg
ctga
ggac
tatg
agaa
aaaa
gtta
ttaca
gaat
gagt
cata
tgga
aaac
acttg
caaa
c1646
3483658
___
NO
gcag
acaa
tccc
catc
agct
taag
gttc
ccgt
cacc
aggg
tttcc
cttc
atcg
aaac
tttgc
caca
gtca
1647
3605805
SCAND
NO
tgta
gtat
tcac
tact
ccag
acct
ttttc
aagt
tgaa
tttttt
ttcttt
ttccc
tatg
tctg
ttctta
cgta
tttttt
ttaaa
ctttt
atttc
agta
gtgt
ttggg
gaac
aagt
ag2
tgttt
ggttg
catg
gaaa
agttc
ttcag
tggt
gattt
ctga
gattt
tggt
gcac
ccat
cacc
caag
ggta
gtct
ttttta
tcca
tcaa
gtct
gtac
ccaa
tgtg
t
1648
3659197
—NO
agag
agcc
tagc
ctga
atat
gcca
cagc
cgca
caga
gtttc
tcct
1649
3775146
—NO
cagt
taag
tgca
gctc
ggtg
agtc
ccgg
cagt
tcct
tccc
ggca
ctgg
ctcg
tccc
tggg
ttctc
aagg
ttcca
tgcg
gcca
cagc
gtcc
gtcc
acct
gtcc
acgc
gagc
caca
tgct
gaaa
tgga
ggtg
gata
aaat
tcat
cagg
cagc
tgct
gtaa
cacg
gaaa
tgtg
caga
tgcc
agag
tagc
ttcgt
c
1650
2534837
—NO
tgtg
cacg
catg
cagc
actg
cgat
aacg
gggc
aggg
ggag
ttagg
tgat
tgct
taaa
gcac
aagt
gtgt
aggg
aaga
aaca
aagg
cagg
aggg
gcca
tta
agga
gcct
gtag
agtg
tttgg
gtcg
gggg
ctca
cattt
gtac
agag
tgag
ctga
1651
aaag
ctac
aagc
atgg
ccgc
ctgt
ggta
tcga
ggtg
ttgca
aaca
atat
ctgt
gttg
cgct
tcct
gtttt
aacc
tacc
tcgt
tttgt
ttgttt
ttgttt
cact
gttc
atc
00
2705269
NO
Ul
acag
cagt
gtta
tctc
cagg
agac
atat
agag
agct
caac
cggc
aatc
tcag
gtgc
attta
acat
tttta
aaac
gaaa
cagt
agttg
acca
aattt
ttcttc
ttaa
aaaa
ttgga
agtg
gggg
gaat
ccaa
tgac
aaaa
acta
atgt
ggct
tgttt
ctgg
agaa
aata
atta
ctgt
aaat
ggaa
caac
aaca
acaa
aaaa
aact
acga
tctta
ctga
ctttg
ccta
aata
caca
agca
gctg
atgt
acta
ttaat
gaga
acga
aata
caca
ttagg
aaaa
tgga
gcca
tttca
atct
agtg
gtttg
ggca
aga
tggg
gaag
agaa
gggg
aaac
attc
tagt
ttctg
gatta
catta
ttatg
cccc
tcct
gaaa
aggt
ggttg
tcat
ttgca
tttat
ttaaa
gcag
gtaa
tatg
cagg
aatg
taac
tgag
gatta
tcttc
aggc
aatc
agca
agat
atcc
tcct
catg
gtcc
cttta
gctc
tcaa
aagc
aatg
aaat
cctc
ctgt
tctc
atttt
tact
gctg
tggt
tgt
gctg
ctga
acaa
tact
atct
tctc
aaat
tcca
tgcc
acaa
attc
agca
ataa
ctttt
tgga
ttgaa
tttaa
caac
tact
gtaa
ttgga
tgct
gatg
tgga
caaa
atat
attg
atttc
gattt
cact
cccg
aatg
tgat
tgcc
acca
gctc
1652
3479
382
GO
LG
AY
ES
tcag
caaa
cagc
ccgt
ggga
aacc
aag
1653
2320
890
NO
gtttc
atga
gaca
gcga
tctg
ccta
acca
ttcac
tcct
ccta
gaag
caaa
agct
accg
tgat
cattg
aggc
acag
ccgg
tcttc
atat
atcc
tcaa
agga
ggg
ggat
tagc
ttcca
catc
ccttc
tcta
ttcct
ggag
gtgc
cagt
agtg
ggag
agtc
atgg
caga
agtta
ccag
aggc
aagt
gggg
caga
ggca
agta
gggt
ata
tgttt
ggat
cagt
ggat
cagt
gtcc
tttat
tgtg
caga
aaaa
gcta
aaat
taca
gattc
ttctg
aagg
gata
cata
gcag
ccgt
ttcac
caat
gtcc
ctca
gtc
cattt
gtcc
aggt
ccag
a
1654
2824597
NO
tgta
ggtg
ctgc
tttaa
tgct
tgga
ctga
tgtta
ttgttg
tggg
agtg
gatta
gtta
ttgtg
agca
tggg
ttggt
tata
aaag
caag
gtca
aggc
tggg
1655
3916352
NO
taat
gggc
ctga
gtgt
cacc
aaga
g
1656
2900
984
HL
A-F
YE
Sgg
aatg
aatg
gctg
cgac
atgg
ggcc
1657
3003
162
GB
AS
NO
ccct
atgc
cgat
gttg
tcct
ggat
tacc
tttttt
gtcc
tgtc
ttatc
acttc
gtct
cctg
tctc
gggt
ttctg
gctg
tctg
ttcct
atttc
tattt
gatg
ttgtg
ctct
cact
tca
cacc
cagc
atgt
agaa
tcac
cttg
ctct
tcca
acttc
cctg
tctg
tgcc
actc
acct
tgttt
tact
cagc
ccgt
tcct
tagt
tctc
tgtg
gtga
atct
gtca
ccta
agct
gtgc
attc
ttcat
cgga
catg
tattt
tggt
tctc
ttgttt
ccttc
tcca
tctc
tctc
tcca
agcc
tctc
gagc
ccca
ttccc
aagt
agct
tttttt
ctga
ctac
cttt
ctgg
gctc
caag
ctat
acct
gtgt
ctcc
ttcct
gccc
tgca
atac
atta
cccc
aaag
tttgt
caga
ttgat
ctgc
ttgag
ttctg
ttttc
tttttt
catc
acgt
tttttc
ttttc
tctg
aaac
tttca
gcag
ctcc
ctgt
ttgca
gaac
ataa
agac
ttaac
tcct
gcct
ggttt
ttagg
ggtc
ccaa
taat
ggga
ttctg
ccca
accc
atcc
agttc
tgg
gtac
aagt
actg
cctg
cctta
caat
ctcc
attc
aggt
aagg
ccag
ggct
ccta
agag
cagg
tttgg
ggtc
ccca
gcaa
agag
gagt
ggct
acaa
atcc
atg
cgtg
acttt
gtgt
tggt
gaaa
aacc
tggg
cccc
gttg
ccag
gccc
ctttc
tgga
cccc
acct
gaat
catg
tgta
ctgg
gcttc
ctcc
tggg
cact
gcct
cca
atgc
aggg
ttgtc
tattc
ctgc
tcat
gtca
tgcg
acga
aaag
gaaa
tgttg
gaca
taca
tttcc
ctag
caga
atcc
ctct
ttccc
tgtc
ctcg
atttt
cattg
atcc
catg
ttattt
ctga
acat
tcat
gagt
cacg
catc
agat
acat
ttaat
atca
gatc
ctaa
tttaa
aaag
taaa
tgct
actc
ttcca
aaag
taca
catg
atgt
ggtc
act
atac
tggg
tcat
aaag
atgt
atgg
gtttt
cagt
tagt
taga
ctta
gacc
gt
1658
3743421
—NO
ggaa
cagg
agaa
gccc
caat
tttca
gaag
agag
ggga
gttg
attg
agaa
gtgt
aaga
ttgtta
cttg
agaa
taat
cttg
aggg
tatg
aagg
tagg
gtaa
ggga
gaat
ggtg
ataa
actc
1659
2949593
DOM3Z
YES
ctgg
cggc
ccag
tcac
cgtg
tctg
taca
ccaa
gatg
cacc
ttacg
ccttc
ctgc
cc
OO
acgg
caga
agat
ggtc
cctc
gcgt
ctct
ccca
tcttc
cggc
tatg
gaac
caac
acac
1660
3824798
MAST3
YES
1661
3933890
___
NO
tctc
cgta
accc
tata
acag
tccg
ata
1662
3340811
___
NO
cattt
attta
tgat
caac
caga
gaga
ctaa
acag
tgga
ctca
tggg
tttgg
actc
tattg
caat
tcaa
1663
3888249
DDX27
YES
gatg
aaga
actc
acca
acac
aagc
aaga
aggc
cct
1664
3932432
___
NO
atag
ggaa
tgtg
ctttg
agct
gttt
1665
3941419
___
NO
cagg
aaaa
gagt
tggc
ctgg
acct
gca
1666
3976987
CCDCl
NO
gttg
actg
cgtg
tcca
ttgttg
ttata
gttg
gttg
aatc
tgtc
ccat
taat
tctc
cttc
c20
1667
2892314
___
NO
acat
ggcc
agta
ctgt
ttcag
ggga
atat
tggg
tggc
gctg
g1668
3379157
—NO
gggc
caga
gaac
tttac
aatg
atta
tgaa
gatc
aaag
ggca
ttaga
atca
agct
ataa
agag
ccac
tgttt
gatg
ttggg
atgt
gagg
atgc
tgca
ggtg
gat
gtct
gcac
gttg
atgg
tgag
aaca
tggt
cacc
ctgg
ccct
gctg
ggtc
tttgc
taaa
gaga
ctgt
gctc
tgttc
ttggg
gccg
ttttc
atca
cctg
atta
1669
3830022
NO
agga
cggt
cggc
gtgc
agag
ttcct
ggag
tgct
gggg
ggcc
tgag
atgg
ttgta
tgac
gtct
ggag
gattc
agga
gggt
gtat
gggg
tctc
aagg
acac
1670
3904149
CPNEl
YES
ccttg
gttc
agct
gtcc
atttc
ctgt
gacc
atct
cattg
acaa
ggac
atcg
gctc
caag
tctg
accc
actc
tgcg
tcct
tttac
agga
tgtg
gg1671
2408020
NO
tgaa
ccat
ccat
tgcc
taaa
actc
cagc
aacc
ctaa
aagg
accc
aatc
aacc
agaa
gaaa
aaat
acta
atac
ctca
tatc
atac
agca
acct
aaca
tttca
aagt
gcttt
caca
tgtg
ttagt
tcat
gcac
acct
gaca
acag
tttaa
ctga
ggga
gcca
gggc
1672
2437
930
ME
X3A
NO
ccct
gggt
ccag
taga
atgt
ataa
aagt
tgta
agga
aaag
ataa
atag
agga
ggga
agtg
gctg
agtc
cacc
ctga
gttg
ccca
atct
tcag
atac
cagg
gttg
gatc
aggt
tgct
agttt
aaga
ttggg
agct
tcca
gtct
gctg
gggt
tgat
tctg
agaa
tcct
tgga
ttttta
aattg
tagg
acaa
agaa
atga
gggg
ttcat
ttcc
cagg
gtct
tgga
aagg
atgc
acac
tgat
catc
tcaa
taag
acag
gggc
tggg
ttggg
ggca
gcag
agga
ggcc
aagc
acat
tcac
ctgc
accc
ctag
tacc
tggg
cagc
ccat
actc
caat
gtgg
ta
1673
3264854
—NO
ctcc
acct
ctttt
tgtg
cgag
ggcg
gcct
c1674
2701275
—NO
acttg
ctgg
aaaa
agta
catg
ctct
cc1675
3017578
—NO
tatc
tctg
atgg
ttgct
agtg
aaca
ttaag
ccct
caat
atttg
tttaa
tcag
tgct
atca
aagt
ggttt
aaat
ggat
gtag
tctc
cagg
ggtc
tctg
1676
3625878
—NO
ctac
ttcag
tgag
tggg
acttt
agcc
tgag
ca1677
2438152
—NO
cagc
tccc
tgag
atag
atgt
aaat
ccta
agca
ttgtg
ggtg
ttttg
gtttg
atgg
gact
ggat
tgaa
agat
ttaca
gact
taaa
ttgta
aagg
aagg
taat
ttaga
gaga
ggaa
ggaa
ataa
acat
tatg
ttggt
ttggt
tata
acca
ctgg
cttg
tctc
cact
gaca
tggc
ctgg
gggt
gagt
ggtg
tattt
gcaa
agct
ccttt
cagg
tct
gcat
taat
ctct
ggca
ttagt
tggc
tgtg
accg
atta
gcct
ccca
gtta
aagt
atgt
agtc
agttc
ttagt
gatg
gtaa
atgg
gtta
ctga
ggcc
ttctta
ttaca
tcca
atta
tgag
gtga
ctat
ataa
tttat
cttg
caaa
ctgt
tgag
tgtg
ttcta
aaca
gtat
gctg
tggg
ggcg
ctat
cact
aatta
tgct
gga
1678
2786550
NO
gtac
ccag
gagg
tttgg
catc
actc
aaga
ccct
caga
atgc
ttcct
ctgc
tttaa
atgc
ttgga
gtgg
ctct
ctgt
ccaa
actg
tatta
cccc
atcc
aaac
taaa
tca
cctc
tcac
tatg
tgga
tgaa
gcaa
acaa
caga
tcttt
tgttt
cgaa
atga
tttca
tgtta
acta
tattc
cagg
atat
taag
atct
tact
taag
gaaa
ctgc
tttgc
OO
agtg
ccag
ggat
ctaa
gcta
tata
1679
3122495
ANGPT
YES
cagg
gaca
gccg
gcaa
aata
agca
g2
1680
3252241
___
NO
cttc
tccg
cacc
acat
cgcc
ctta
ctct
aaaa
ttgac
caca
taat
tgga
agta
aaac
actc
ctca
gcac
aagc
aaaa
taat
a1681
2806568
—NO
agag
cact
caat
tcga
ctat
ccgt
ccag
ctttc
acag
aattt
acttc
tgag
ccca
caag
actc
aaat
tgta
taag
gata
gagt
ctgt
ttcat
tcat
caac
atat
ccag
cacc
tgac
atag
tttta
catg
gcac
agag
tggg
tgttc
aata
aata
tttgt
caat
gattg
aaca
gaaa
ttttc
agca
ccaa
atga
aaga
aaca
catg
acttt
gaa
caac
ggca
cgag
aata
ccaa
ctgt
ttgat
tatg
caac
gaca
ggat
ctttt
ggc
1682
3385452
NO
agttc
ctaa
atgg
gaac
actg
cctta
tgga
1683
3968192
—NO
gatc
aacg
ctta
attc
aaaa
ccag
gctg
taga
agaa
gaag
aaga
agag
ctaa
gaag
gttc
ttagt
gctg
atgg
ggag
aaga
gaag
gttc
tgga
gctg
tgc
catc
cccg
ggtg
gaaa
gagg
gaat
gtgt
tcttt
cttc
ccttg
acac
ttctg
tgca
gatg
aatc
aggc
agga
atca
agag
cggt
tg
1684
2397794
—NO
cttc
tgat
cacc
aagc
caca
tgtg
atga
ctcc
tggg
ccct
tcttc
ctgg
gaca
actg
cacc
agct
tcta
atca
gctc
gccc
atct
cagc
ctga
cccc
tctc
tgc
agcc
tgaa
gtcc
agac
tctc
tagc
ctgg
cacc
atag
tcct
ttcct
tctc
tctg
acct
aatc
atta
acca
taac
cc
1685
2412877
ZCCHC
YES
cgat
gtga
cattg
ggga
tgct
tcca
gggg
aagt
ttatc
ttcat
atg
11
1686
2904959
MAPKl
YYEESS
gtgg
gctg
tatc
atgg
caga
gatg
ct3
1687
3367083
NNOO
acag
actg
acag
aggt
gcat
ggtg
gag
1688
2762520
NO
tttac
catc
gtttc
attc
catc
catc
c1689
3463535
NO
tgcc
tgta
gctc
cata
cctg
tata
atca
acag
ctat
taaa
tatc
tgct
aagc
aaac
taaa
ctaa
gcat
tttca
aaag
taaa
cttg
tgaa
acct
gatg
gattt
cagt
agt
tgac
atca
gcat
ctac
ttttg
ctga
agtc
agta
gcct
ggac
atta
tttat
gaca
ctta
tttttt
tttcc
catc
ttgcc
ctta
tcca
ttcat
tcac
caag
ccttg
ttgag
ttta
tcttc
tgag
tatc
tcat
tgcc
atca
cacc
aatc
tagg
acac
catc
atct
ctta
cata
ggtg
atta
caat
agcc
tcca
cact
ggcc
tccc
tgtc
tccg
tttag
ctc
atct
ccca
atat
atttt
ataa
aaat
gata
ctgt
tact
ttctc
tgct
ttaat
ggct
tacc
atta
ccttt
aaat
ggag
tctg
aatc
agtg
gtct
ccaa
aggg
gggt
gata
tat
aaga
taat
ctgc
tagg
gtgc
agga
ggaa
aata
ttgga
acttt
tattc
atat
ttacc
ttttta
tttga
aaaa
cact
aagg
catta
atat
ttaaa
agct
acta
ctgt
gtaa
tagt
tattt
agtc
attc
cttc
tttgc
tcta
tgtg
tcag
atgg
tcat
ttggt
acta
aagg
tgtc
ctga
ggga
agca
tgct
ttgttc
acttt
ctgc
atgt
tgca
caat
acta
taat
ttgaa
tgtc
catc
tatg
cgga
tttat
gagt
tatc
tagt
ttaac
tatta
caaa
atat
gtaa
gtct
ggga
ttagg
gaat
ttgtg
gaga
aaat
caag
tgttt
aact
gcaa
ataa
gatc
acat
tgttg
ttcag
ctgg
taag
atat
acaa
aatc
aaat
ctgt
tctc
atga
atta
agaa
aaat
tacg
ctga
ttttg
aggt
tttaa
gtga
aact
tttta
aaaa
attc
tcat
tttttt
gtaa
ggca
tccc
tttct
gcttt
gata
gcca
ttaaa
aatc
aaat
attc
aaac
tgat
gtta
gaac
taga
tcttt
gaat
ctct
gtat
caca
cggt
gtta
agat
cttc
agaa
ataa
tgaa
gcat
attc
aata
tttta
1690
3533262
—NO
tggg
ggag
ggag
gata
ctac
tgag
caa
1691
3651809
—NO
caaa
gatg
agag
gtgc
gaag
ttgtc
caag
tcca
acag
ctca
actg
aact
ttcct
aagt
ggaa
ttgtta
1692
3719905
MLLT6
NO
ggttt
tgca
tctc
attta
cttc
tcca
ttggt
tcag
gata
aag
ctga
gcgt
gcca
tctc
ccag
tggc
c00
1693
3744442
—NO
1694
3994048
—NO
gcaa
gcag
cttg
tcag
acgt
gcat
gtgg
tttgg
a1695
2566874
—NO
ttccc
tcca
aagt
ggaa
gtga
gctg
gaga
gatc
tctg
ggtc
tgaa
agtc
acct
gcta
tgttg
gagg
aggg
tcag
gcttt
ttgag
gact
actc
tggt
caag
gtc
tgca
gact
tgct
tccc
tcct
ttaac
ccaa
atct
tatta
ttatg
gcta
caca
tgtg
ccac
acat
tata
gctc
caga
tact
cacg
caag
gttc
1696
3788902
—NO
agttt
ctttt
ggca
ggtg
gtcc
agaa
1697
2684832
VGLL3
NO
aaca
caac
cattt
acga
tctc
agtc
agca
gattt
actc
tact
caag
gaaa
aaaa
gaaa
caat
ctta
ttgga
agca
gatg
ttgac
actg
tgtc
agtta
ttgaa
gac
ggaa
ggag
ttcac
ttgag
ccat
tgca
gtta
caaa
gggg
tattg
atgg
cagt
1698
2808743
—NO
ggcc
aact
gtca
agat
gtttt
gaag
gcca
atct
ggaa
gtat
atgt
taaa
ataa
agat
gtgc
ataa
tttct
aact
gagc
aact
ctac
tttgt
ggta
tttat
ccta
gag
aaac
attc
acaa
ttatg
ccca
gaga
gata
tatg
tagg
gtat
ccct
tgct
acat
ggttt
ataa
caaa
aaaa
tata
ttagc
caac
tagg
gtcc
atta
gg
1699
3311746
—NO
tttcc
tcag
tgca
ttcttg
gtgg
tgag
attg
attg
gtga
gcat
cgta
taac
ccac
tctg
ggag
cata
gcct
atttg
tattt
gtgt
atgt
ctat
acttt
gtga
ggttt
agcc
cact
tc
1700
2398939
—NO
atga
aatg
gtttg
gaaa
ggag
cctt
1701
2948399
—NO
agag
ggcg
atca
ggtc
tcat
tagg
cccc
aggg
tgtc
tgag
gggt
gatc
tctg
ccag
tggc
ggtg
ggca
aggc
agaa
gagg
cgtc
tgct
gcag
tgga
agga
tcat
gaca
gcct
gagt
taaa
ttcca
cctc
ttctc
agct
gtga
ggtc
ttgag
taag
tgat
tttgc
tact
ctga
gtct
tagt
tact
ttg
1702
2974
531
NO
tctc
atttt
gattc
tggc
agtg
ggcc
caaa
tgct
aagt
gtct
tgcc
caaa
gtag
aaat
ttcca
gttg
ctac
ttcca
tggt
gtgc
cgca
cagg
caat
ggct
gagt
tct
t
1703
3385
006
CR
EB
ZN
Otc
tgga
tgat
tagc
acat
ggat
aaag
gaga
tttct
ggaa
tata
aaat
ggat
tgttt
ttgaa
atttc
tagg
tttgg
ctct
attta
ctgt
aatg
gttg
aaaa
caat
ttagt
Fat
ttggg
tgac
ccttt
tgttt
ttcttc
taaa
tgtg
cctc
tggt
aaaa
taca
gaac
taga
ctaa
agat
gtag
ctttt
taat
atttg
tcttt
tgat
ggtg
gcag
gagt
tcat
aca
ttaat
tgaa
ctaa
caca
tcat
atttt
gacc
tact
atttc
tatc
atat
tgac
ttact
gtttc
tgca
cttc
tttga
ccag
actta
tc
1704
3777
967
—N
Ogc
gagt
gact
ccgt
ttcttt
ggac
ctgc
ggttg
agct
ggca
gcag
aatg
gagg
ggtc
cgag
aaac
gggt
gagt
gtga
atcc
cttg
caga
aatg
ctct
ctgg
gggt
ctgc
ccttg
aatg
ttagg
cggc
atga
caag
tcaa
atgc
agtc
cacc
aggg
tc
1705
2446
212
TO
RlA
IY
ES
atgg
ccga
cagt
ggac
ttagg
gaac
ctca
agag
gP
2
1706
2527
475
—N
Occ
atttg
agat
gttc
tccg
aggt
gg17
0731
1089
2—
NO
ttggt
ttcaa
atgg
acag
cagc
aaa
1708
3229
044
BR
D3
NO
gatg
tttct
ggta
atca
tgga
ccct
tctc
c17
0935
1854
9M
YC
BP
YE
Sca
ggac
cagg
ttctc
ggttg
tcat
ctcc
taag
ccaa
agac
tctc
ccag
ccaa
tagg
tcta
gccc
atcg
ggtg
ctag
ttctc
cacg
ctcc
tcct
cacc
acat
gat
2aa
aaat
ctac
ctca
aaaa
agta
ctgc
tcct
gtta
agac
aaag
cttg
atcc
tcct
cggg
aacg
ttcta
aatc
agac
tctta
caca
cttg
atcc
agat
accc
tccg
caa
gaag
aaaa
tgcc
cctc
acag
aacc
tttga
gagg
acgg
tcaa
cgtc
acca
aaac
caaa
atca
gtac
caaa
ggat
tcta
caga
ttccc
ctgg
atct
gaaa
ata
gagc
tccc
tctc
ccca
tgtg
gtac
agga
aaac
ctcc
acag
tgag
gtgg
tcga
agtc
tgca
cctc
aagt
a
00
1710
3701324
—NO
caaa
ttaca
attc
aggt
tctg
tttga
ggtg
ctcc
tgag
aaga
ggaa
aaga
gtca
gaaa
ccta
caca
aaga
tgta
taga
agat
tagg
tcct
gaga
g1711
2941831
—NO
ttgcg
tttac
atga
cttg
agtg
gcct
gtca
gtcc
cgct
agta
aagt
ctcc
tcat
ccat
ccac
tctg
tttaa
acaa
tcaa
actc
tgaa
acct
gtgt
ctcc
tgga
ccat
ccta
taat
ccct
aaaa
tagg
ctgt
tatg
ctttt
cacc
tctg
ccct
actc
tgtta
gcac
tacc
ttact
tattg
acct
ctttc
aatc
aaat
cgtta
tcta
aagg
atac
ctgt
aact
caga
tagg
ttgcc
tact
gggc
ctgt
ttagt
ttta
1712
3403963
NO
tttttt
ctgc
acta
tggc
ttggg
ccc
1713
2337265
Clorfl7
YES
gaaa
ggca
acaa
caag
cagt
caca
gggg
5
1714
2774211
___
NNOO
catc
agga
gtgt
ggat
tctg
ccat
tagt
accc
aaca
gatc
aatg
ccca
gtgt
ttc1715
3626362
ALDHl
YYEESS
gcttc
agaa
aggg
gacg
tctg
ttgga
taag
cttg
caga
cttg
gtgg
aacg
ggac
aggg
cA2
1716
3760869
...
NO
gagc
gtct
ctcc
ttcaa
atac
ctgg
atttt
tttttt
tttgt
acac
tggt
tcat
agat
cggc
acttg
acttt
gaac
ctgg
cacc
aaaa
ggca
caat
atct
gata
ccct
gtac
aaga
gcta
ttaga
gatg
ctgc
cata
tgga
tggg
caaa
actg
agcc
aatc
ccac
ttagg
aatg
gaag
gcttg
gaca
tgga
aggg
agga
tata
aacg
agg
agttg
gaga
aaaa
cgca
agcc
cagt
ttttg
ctag
agtg
gaaa
tgaa
agtg
ggaa
tgag
ggtc
ttgttt
ttagt
cctc
taag
gacc
agga
agca
atttt
aaaa
cttc
cttg
gtttt
tctg
aaag
cagc
atat
tcaa
aatg
ccag
caaa
aact
ccta
acaa
ctgc
aaaa
ccaa
aaga
ggat
caaa
gctc
acca
acat
ccct
tctta
ttgct
gaaa
ggct
ctaa
aattc
agga
tgcc
ctgt
tccc
ttgta
aaag
ggaa
aata
atta
agtc
tgat
ttatg
gtaa
tcat
acca
catc
acac
ttcta
aaaa
aata
ttcaa
gtg
tgtg
acca
gggg
acgt
ttgac
acca
tt
1717
2772067
—NO
cctc
aaag
tgtc
tagg
aggc
ggttt
gata
tgca
gact
aaag
tgag
a1718
3350159
—NO
tttac
atca
tggt
gtgt
gaca
aaac
aattg
cttc
cccc
tcct
ccct
gccc
ttccc
cacc
cctg
acttc
taag
ataa
cta
1719
3946034
—NO
aggt
ctgc
ccat
ccca
aacc
tggg
t1720
2699092
—NO
gcttc
aaac
attg
gccc
ttcac
gtac
aaca
aact
ctat
cattc
gatg
tatc
aggg
cgag
caaa
ttcct
acaa
caac
aaca
cagt
taaa
gaaa
atgg
attc
aga
ccta
ccac
tata
taaa
acta
cact
tgac
ctaa
gtta
ctat
aattg
tgca
gaaa
atga
ttattc
tgct
gact
aaca
ccga
attta
atac
caaa
aaag
taac
ctgg
agca
tatg
ttaat
gcct
tgtg
catta
agag
ggtg
tggg
gcag
tggg
agta
ggga
agtg
ggag
ggga
tgga
aagg
tgag
ggag
ggta
ggaa
acca
caat
aca
gcct
cata
gtaa
tattt
aattt
tcta
aatg
tcac
ttattt
tcca
aaag
taaa
acat
taaa
ttata
aatc
acaa
ctaa
ttaga
aagc
tgca
aata
ttgat
atat
aaaa
tca
gtaa
ctta
gagc
ctgc
atgt
tatta
cagc
catc
ta
1721
3027187
NO
gttc
acgg
cagc
tccc
aaca
ggtc
tgag
gtaa
gacg
ccgc
tatta
tcct
cgttt
gctg
ggag
aaga
ctcg
ggct
taga
gcct
ctga
ttaca
cagc
tgca
ctct
aacc
atttc
ctaa
ta
1722
3496668
NO
tctc
atgt
acct
cgta
gtat
acct
acta
tata
ccca
gaaa
aatta
aaaa
ttaaa
aact
aaag
aaag
aaat
aattg
caaa
atgt
aata
atgt
ctttg
aggg
gaaa
ataa
aaac
ttcag
tgaa
atac
tcta
agag
gggt
ttgat
gcag
agct
gggg
gatg
tggg
aaag
taga
gata
ggaa
agttg
agga
ggga
ggga
tggg
ttaca
gtttg
ccaa
ggac
agaa
ttact
tgag
aggg
tggg
agca
ggac
acac
atgt
cagg
gcct
ctgg
aaca
ggaa
gagg
ttggt
ttatg
gcag
aaat
tgaa
agaa
tgct
ctca
taca
cagc
tcac
tttcc
tcct
tgct
ttttg
tctg
cata
gcat
tattc
aata
tgta
gcat
aata
taag
ttgaa
atca
tata
tcac
ctttg
tggt
cttc
tccc
acca
gaaa
tcag
gcat
tatg
aagg
caga
aggg
caga
ggta
tttgt
cgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tttga
ctttt
gtat
ttttg
ctgc
Ocg
tattc
ctgg
tacc
tgga
atag
cact
tggc
agac
aaat
gtat
gcaa
agtta
ttcaa
atcc
agtg
ggac
caga
ccac
agtg
1723
3718614
—NO
cagt
gggg
atgt
agaa
tgtg
tcca
aa1724
3759098
—NO
gggg
cagc
acag
gtac
tctg
atttt
gggg
ccac
gagg
gcca
aatc
cgcg
cctg
cacg
tag
1725
2721510
—NO
tattc
ctgg
acta
tagc
cgct
gatc
tcat
tgat
gccc
ttcttc
1726
2938424
—NO
ctca
agag
tgaa
cccg
gaaa
ggct
ccta
agga
ggtc
cctg
cctc
agtc
cagg
caag
gctg
tggc
ctag
acga
ggtg
gcg
1727
3606484
AKAPl
NO
tgct
gctg
cccg
catg
ggat
gcgc
aggg
gagg
cgtg
ggga
tccg
cagg
aggg
tggt
tggg
atac
accg
gata
cctc
tgct
ctca
ttgct
tgttt
gcaa
atg
3ct
ctat
ggac
atttg
tgtg
ctaa
atcc
ta
1728
3750237
—NO
ttcac
ccac
atac
aaaa
tagt
gcgc
gtttt
ggct
acttc
agcc
tcttc
ccttc
tcta
gaaa
taac
tcct
ccttg
tact
tagt
aaag
gcac
acat
tcac
ttcaa
gagt
agg
1729
3320010
—NO
cagg
aggc
ccga
ccaa
aagc
agat
caaa
tagg
agtg
acag
agttc
agag
ctgc
ctgg
gaca
aagt
tttaa
atttt
atttt
cact
gcaa
ccca
tagg
aaga
taaa
cattt
tata
tcat
gacc
cagc
acac
aca
1730
3907649
—NO
agga
atgg
gccg
catg
gact
acct
gggg
gtcc
ccaa
ggcc
ccag
ctcc
tctc
ctttc
cgcg
ccttc
tccc
aacc
tttat
tgca
gcgt
ctcc
tccg
gcga
gac
gagg
ccgg
gctta
gaaa
aagg
gcag
cgaa
gacc
cagg
ggcc
aaac
tggc
actg
agga
gctc
tggt
ctct
gcgc
ggcg
gggc
gccc
tctc
cgaa
tcag
cccc
aaca
ggcg
tggc
ctcc
gggc
ttcag
gcag
cggg
gtaa
gggg
ccag
gaca
cggg
caca
gggt
ctgt
atgt
aaac
ggtg
acag
cggc
1731
2546923
—NO
ggct
gtgt
cacg
tgga
cagt
gtca
catc
tgcg
ggtg
ggga
taac
aagg
cacc
tgag
tgcg
tgaa
ggag
ga
1732
2788764
—NO
ccat
ggga
acag
tcttg
cagt
gatg
gacg
gtttc
catc
cccc
atca
ctga
cgcg
ggaa
taac
tgtg
ctgt
1733
2886332
___
NO
ctttt
ggca
aaaa
tgcg
ggcc
atgg
ggtg
tcaa
ag1734
3307785
—NO
aact
tcaa
actg
tgct
catg
tggg
caca
gaag
catc
gtgc
aggg
ctgc
actg
tcac
cagc
ccca
gcag
ccac
aggg
cagt
ttgtc
acca
gctc
cctg
aga
atca
actc
cagt
gtga
tgga
ggtg
accc
1735
2339825
NO
tcttt
aagc
gaga
agta
gatg
tggg
gagg
ggttg
ttgca
ttttta
aaga
ttcct
gtat
tagg
ccgg
gtgc
1736
2922550
___
NO
gtga
acttt
gtac
ttacg
tgtg
ctttt
gtgg
tagc
aagt
atca
tcttt
ctgt
ttcct
tttag
cattt
ttggt
gtac
tggt
cta
1737
3319874
IPO7
YES
tagc
gcac
agttt
gaca
tgtc
aaca
ag1738
3571362
—NO
tggg
tact
cgag
cact
gaag
cattc
tcat
cact
tccc
atga
agac
acta
atcc
catta
aact
aggc
attc
ctct
ttccc
tgag
gttc
gttg
atag
ttgta
tcttt
tgt
cattt
ccca
tagt
ggga
aaac
taag
gtcc
gtgg
aggg
ggat
gtga
atcc
ataa
tttct
cttc
aaca
aaca
ggca
gcag
agcc
attg
tgga
atttc
tagc
ctaa
gcca
taag
atttg
tggt
ttgtc
ctga
tgta
agtg
cattc
actg
1739
2562256
—NO
acag
gcgg
gtat
ggta
atag
caat
ctca
ggag
ggct
cttc
tgttg
ctgt
gtag
aaaa
tgaa
ccaa
gtag
cata
gaat
cgtg
acca
gcttt
ccac
acct
ggct
gga
gcac
tcct
gggg
ccttc
agta
ggga
cagt
tac
1740
2599272
NO
ggaa
gtta
ctgc
ttccc
ctgg
tgtc
tgttc
ttctg
gcttc
agct
tcgc
tggg
gatg
gattc
1741
3019543
—NO
gtgg
ccttt
tagg
aact
gtag
agta
actta
atat
aaga
gaca
ttattt
ggtg
ggca
gaat
taag
atct
tggg
agcc
attg
ctgg
tgca
atcc
tcac
ttggg
aaaa
tttaa
caaa
atat
agtta
gacc
ttttc
cagg
tctc
accc
attc
catt
1742
3316144
TMEM8
NO
gagg
gcca
cattc
ggag
cctc
cgtc
cact
ccag
tttta
tcag
ctttt
gcct
tttgc
acgg
agtg
ctaa
acaa
attc
tagc
tctg
tgttt
ttttc
ccat
tccc
agat
tt0
acta
tcag
ttctc
ctta
aaaa
gtat
ctaa
gctg
ttaca
gtag
ctttc
ccttc
acttg
attc
tattg
tgtg
ttttc
tatg
tttgg
aata
atta
cacc
caaa
tatc
taga
tattt
tctc
ttcac
cgca
ttttg
taaa
ta
1743
3808909
TCF4
YES
gtct
atgc
tcca
tcag
caag
cact
gccg
acta
caat
aggg
actc
gcca
ggct
atcc
ttcct
ccaa
acca
gcaa
ccag
cact
ttccc
tagc
tcc
1744
2360953
NO
acat
tcta
tcag
agag
taca
tgat
atca
ccat
gcct
tatc
attg
gtga
agtta
acct
tgaa
cttg
gtta
aggt
gctg
tctg
ccag
gcct
cttc
actg
taaa
gtta
ctgc
ttttc
cctc
ttcat
actc
tgttc
ttt
1745
2458
084
WD
R26
NO
cagc
tgttc
tagg
atct
tgtc
atttt
tact
gaaa
gtct
cgtg
caca
tgtg
aagt
gccc
tctg
agttt
aagg
tttgg
ttaat
actg
gtat
atttt
tata
agat
tgaa
attg
tgt
cccc
atct
ttaac
ttaaa
cattt
tcat
tatc
aggg
tcaa
tgtg
ctag
aact
gaat
tgta
acat
tttta
ggca
caga
tgga
aaaa
aatg
ttattg
gctc
cttg
aaaa
tgtg
tgtg
tgtg
gcga
gggg
aata
gatc
caca
aaag
catg
tatg
tact
taca
aacc
aagc
tgta
gaga
tcaa
gaaa
agaa
ctta
agtg
ttgat
ctca
agat
ttcta
aattg
tcaa
gattt
acat
ggca
ttgtg
gtgg
aact
agtta
ac
1746
2775432
—NO
tgga
gacc
agat
aatg
gatg
ctgg
caag
atct
ggga
atca
aagc
catg
gaaa
gggt
gcca
ttggt
cag
1747
3577885
DICERl
YES
taga
taat
atgt
taat
gggg
tcag
gagc
ttttg
1748
2413468
TMEM4
YES
gctg
gagg
atag
ttgca
agta
ttgttt
ggtc
agtg
ctat
ttcta
ccca
tctg
cacc
acag
tattt
ataa
ttttc
agca
ggat
tgat
ttgttt
catc
ctat
acag
tggc
tQ δ
1749
2478883
NO
tgaa
caga
cctc
aatg
tcag
acaa
gcca
gggg
agcc
tgc
1750
2783808
—NO
atat
tttta
atgg
tgtc
agcc
aggc
aaa
1751
2406967
___
NO
ctcc
tgtg
actc
agag
ccct
cggc
tt1752
2436718
UBE2Q
NO
tgct
gtat
ttgga
tctc
acgc
tgcc
tctg
tggt
tccc
tccc
tcat
ttttc
ctgg
acgt
gata
gctc
tgcc
tattg
cagg
acaa
tgat
ggct
attc
taaa
cgct
aagg
1aa
aaaa
aaca
aaca
caga
actg
tttca
agta
ctca
agac
tgac
ttaca
gacc
aacc
aacc
acct
tgct
ggaa
ccct
tgct
agca
ggca
ttctta
taaa
agaa
act
ttcga
gcct
cctta
tattg
ctgg
aaac
tcag
ctgt
gctc
caga
ctag
agcc
tcct
tacc
tatg
ctat
gga
1753
2663594
—NO
ggca
tctta
tttca
cgca
ccac
aggc
ccttg
ttcttc
ccag
atgt
cagc
aggc
tttct
ccaa
taaa
tgct
tcttc
atttg
ctgt
tgcc
cttg
gaat
catc
ccag
aga
tttta
aacg
gctg
tgtg
tatg
tttca
tggg
gaag
tctg
tggc
ctcc
ttaca
ctgt
cctg
caga
actg
gcac
ctcc
ggct
gtttt
1754
2854791
NO
tttag
aattg
actg
agga
gcgg
ccgg
gtgc
ggag
gctc
acat
ctgt
aatc
ccac
acgc
cttg
ggag
gctc
tgag
gcgg
gtag
atca
cctg
aggt
cagg
agttt
gaga
ccag
cctg
gcca
acat
ggcg
aaac
cccg
tctc
cact
aaaa
atac
aaaa
atta
gcca
ggtg
cagt
ggca
caca
cctg
taat
ccca
gcta
ctcc
gaag
gctg
aggc
agga
ggat
cacc
tgag
ccca
ggaa
gttg
agac
tgca
gtga
gctg
agat
tgca
ccac
tgca
ctcc
agcc
tcag
tgac
agcg
agac
tgtc
tca
aaaa
aaga
aaaa
agtg
actg
agga
ggaa
gagg
ccag
gtgg
caaa
tgga
acag
aatc
acca
aagg
gtga
acag
gact
aagg
caat
gtag
tgta
tgg
ctca
gcta
cgtc
agag
tgga
aaag
gtgt
tatta
gagc
agaa
acta
tggt
ccct
gcgt
caca
ggga
agca
acct
acag
agaa
gcag
cagc
tccc
caag
agag
gaga
gata
agaa
gcca
gaag
cctc
agag
tgaa
caat
tgtc
cta
1755
3350791
SIDT2
NO
tcag
agttt
ggtg
tatta
gagg
aact
gcca
gttg
ttcat
actg
gcta
ggca
gggc
ctta
cattt
gagg
ggag
aagg
gtga
gaga
ttgag
ctgg
gtgg
agga
ggac
atga
aggc
ctttg
ggtg
ccat
κ1756
3447872
KRAS
NO
catc
ttcag
tgcc
agtc
ttggg
caaa
attg
tgca
agag
gtga
agttt
atat
ttgaa
tatc
cattc
tcgt
tttag
gact
cttc
ttcca
tatta
gtgt
catc
ttgcc
tccc
tac
cttc
caca
tgcc
ccat
gact
tgat
gcag
tttta
atac
ttgta
attc
ccct
aacc
ataa
gattt
actg
ctgc
tgtg
gata
tctc
catg
aagt
tttcc
cact
gagt
caca
tcag
a
1757
3709501
—NO
gtcc
agtg
cctc
ataa
catg
gtct
caag
ctct
taac
aaat
ggag
ccca
ggat
tcaa
aaat
ctga
actc
taat
aatg
gcttg
tcat
tctg
gtttc
ttacc
cttg
agg
aacc
tgtg
aggt
gtta
acca
acct
gtaa
gctg
cattc
cagc
actg
gcaa
acct
ggcc
tctc
aaat
atcc
agcc
aattc
taag
tttgg
tgcc
agga
1758
3606344
—NO
gaaa
aaat
tgtc
agttg
cagg
gata
cattt
ctca
ctaa
tgaa
gaaa
catg
gaaa
atat
ctgt
gtaa
gggg
atca
cgct
gtttc
ttaag
ttcag
atta
ttgga
aga
gggt
ggtg
atgt
aggt
gtgt
actc
ttcct
gagg
ttg
1759
3884906
FAM83
YES
ttctg
tctg
gcca
agtg
gttg
aaca
ctttg
atct
ggag
ttccg
aatc
ctgt
atgc
ccag
tcca
agcc
catc
agcc
ccaa
actc
ctgt
ctca
cttc
caga
gcag
cD
aaca
agttt
gatc
acct
cacc
aacc
gaaa
acca
cagt
ccaa
ggag
ctca
ccct
gggc
aacc
tgct
gcgg
atgc
ggct
ggct
aggc
tgtc
aagt
actc
cc
1760
3919041
SLC5A3
NO
ttgga
aaca
gaaa
cgag
gctta
ttgct
attg
caga
aatc
ccaa
actg
gcaa
aggc
cagt
atat
atgg
tattc
cata
atat
aacc
agct
tttga
aattt
atgt
gttt
ggat
tagt
gcct
tctg
gtta
ccag
tattg
actc
tgct
agttt
gcac
ctttc
cgttc
tta
1761
2847683
—NO
ctgg
gagg
ctttc
agat
gcag
catc
cacc
aagg
aata
cacc
gact
aaca
caca
tgac
agcc
tgaa
ctag
caac
ctgc
atca
acac
ccct
gcag
aaaa
gca
gtgc
atttc
aact
gctc
attta
ataa
gtat
ttgaa
ttcat
aatta
caaa
acat
cttc
tgta
atta
aacc
acca
tatg
cattt
aaaa
tattt
tggg
gaca
aaga
gtag
caaa
gaat
tata
ttgga
tattg
acta
aaaa
cact
tatg
aata
tcaa
taat
ttgct
cctc
ctcc
ctttt
catc
atag
actc
tttac
aatg
atac
tgac
cttg
gggt
tg
1762
3776975
NO
tttgc
aacc
caac
ctgc
tgga
agtg
caaa
gttc
agac
cagc
agca
agaa
ctct
tgcc
ccat
gcttc
tgct
gaca
t
1763
3543413
RB
M25
YE
Sat
ctcg
aacc
agtg
gagc
gcac
tcgt
aacc
tgga
tccc
agaa
ggtc
gcga
aggc
agta
ccgt
ttcct
cagc
ggc
1764
3653407
NO
ttgag
tgtg
gtag
tgtc
cgtc
tgta
gtcc
cagc
aact
cagg
aggt
tggg
gcag
gagg
atca
cttg
agcc
cagg
agttc
aagg
ctgc
agtg
agct
ataa
ttgc
acca
ttgca
ctca
agct
cagg
caac
agag
tgag
acct
tgtc
tcaa
aaaa
aaaa
aaaa
aaat
ttaaa
caag
cgac
agta
atat
ttcat
taga
aatg
tgat
gaat
gcta
tgaa
ggtg
taga
ggat
gtct
ttaga
atat
ataa
ctgg
gagg
gaga
ccttt
tgag
ttaag
gcag
gtgc
tggt
cagg
aaag
tcct
gagg
aagt
taca
tttg
gattg
a
1765
3670580
—N
Ogt
gaaa
gact
aaac
agac
gact
tcaa
aact
ctca
tcca
acat
ccaa
gagt
tctg
ccat
ttcca
agat
tcag
caat
cctc
1766
3767260
___
NO
tacc
agtta
gtac
aaag
atca
cagc
caat
atag
aacc
taaa
agta
ttcac
aatg
aaaa
tgac
aatg
tagt
tgtc
gtttt
gagc
tgca
tggt
actta
aatc
tg1767
2456432
___
NO
cacc
tgct
ccga
gggt
cagg
gcta
agtg
gtttg
agca
agtg
gcaa
gctc
acag
gtcc
agttg
gctg
aggg
cctg
accc
atga
cgat
ccat
tt1768
2750681
___
NO
tatg
acac
tgaa
tacg
ggaa
agtg
atgg
gagt
gagg
ctgg
ttaaa
atttt
tcac
ccac
gcag
tctc
cctg
tgtt
1769
3071991
LOC653
NO
cacg
tggt
cact
gaca
ctcc
catg
gc852
1770
3436046
ATP6V
YE
Sac
tcgg
atcc
gcac
caac
aaat
tcac
cgag
ggat
ttca
0A2
1771
2383133
—N
Otc
agcc
ctgt
gaag
atgt
cattc
catta
tcttc
tgtc
ttcta
ttgttt
ctgt
ggag
atgg
tagc
tctc
agac
aaat
tgttg
gttg
ctcg
tttgt
acat
actc
ccca
ctcc
Oca
tcct
gccc
tggg
tact
ctaa
cattt
tttca
ctttg
tgttt
agttt
cagc
agttt
taag
ataa
cgtg
ccta
ggtc
gggc
1772
2434773
FAM63
NO
acttt
ccag
ccgc
agag
tagt
gcag
A1773
3338526
PPFIAl
YE
Sgt
ttgga
gcaa
tgat
cgag
tgat
tcgc
tgga
tcct
gtca
attg
gcct
taaa
gaat
atgc
aaac
aatc
ttata
gaga
gtgg
tgttc
acgg
agca
cttc
tggc
ctta
gatg
aaac
cttc
gact
tcag
tgca
ctgg
cact
gctg
ttaca
gatc
ccga
cgca
gaa
1774
3671856
KLHL3
YE
Sca
ggca
gacg
cgag
tgtc
tcgg
ccat
acaa
gatc
agcg
aatc
atca
6
1775
3060641
NO
ctcc
catg
gctg
gata
aacc
aaat
ctga
taca
tcca
cattt
aagg
ttgttt
tcca
agttg
gtttc
cata
aaag
gcct
ttaac
aata
atag
gcttt
taac
aaca
aaaa
ggta
tccc
tccc
atca
caat
gaga
gcttg
atga
gggc
tcaa
aagt
gact
tcaa
aaac
tgta
aata
atta
ttttc
cttg
gacg
gcttt
aaaa
acag
ctac
tgat
agca
aatc
agaa
acac
taaa
gaaa
aaag
acaa
taag
gaaa
cagc
tgttt
gtct
agtg
aatc
cata
ataa
atac
caat
ttgag
gcta
taga
ttaca
aagc
caaa
atat
tctta
tagg
aaag
ttaat
gttta
tattt
acaa
tccc
atgg
acta
aaaa
aact
gtct
aata
tctta
aaga
ctga
gtct
acct
ttattt
aaca
ttgttt
atac
acaa
aga
ccca
aggg
atgg
taga
attc
ttgat
ccttc
tgga
caaa
agca
tagt
gaga
gggg
actg
aaat
taac
aaaa
aggg
aaaa
ttaac
atta
atct
caaa
attc
tacc
catc
gttg
tctta
aaat
gatc
aaga
ctca
gcta
catc
tgag
aaaa
agga
aaag
gatc
agaa
gtga
aaga
aatc
agaa
gcca
gtaa
ttaaa
aaaa
tcat
aacc
ttgtg
tgct
ttcta
ctga
cggc
attta
aa
1776
3379
623
MT
L5
NO
ctgt
gcct
aggc
tgttg
acaa
ctttt
gctg
agttg
gaca
gaat
aaga
ctgt
gggc
catg
ctgc
ttgct
tcac
tctc
tggg
tgtg
atgg
atcg
gtga
gggt
gcag
ttgct
atga
ttgta
attg
gtca
tctc
atca
tggc
tgtta
acat
ttctg
gacc
aaac
aaaa
atga
cttc
agct
acttc
caga
ttctg
ctga
taca
ttaca
cagg
atga
tct
aaaa
ggct
acag
ttctg
aaga
tact
catc
tttct
tgta
atct
gggg
gcat
ctgg
tcaa
tagt
attg
cctc
tgtg
ctgt
attg
tgtc
taga
acat
tcaa
ggac
attt
gata
cttc
tcat
acttt
aattg
cgtg
ctct
ccca
gttg
cagc
accc
tctg
aggc
ttcta
tgtg
ttgca
gccg
atgg
gcat
ttgca
caga
gcag
cagt
catg
acag
cgttt
tcag
ggca
gcta
tttct
ccttt
tccc
ccttt
tcct
ctcc
cccg
gcttc
ccat
tctc
ctcc
tcct
ccttc
ctct
tctc
gtcc
cctc
ctcc
tcct
ctttg
ttctc
ctcc
ccct
gctc
ttcttc
ctct
ctcc
ttttc
ctcc
ttctg
tgtc
tata
ctag
atta
ttttc
aatc
agtc
acat
gtttt
tcttt
atttt
gaat
gtta
caaa
atat
ttctc
tattg
taaa
cggt
gtgc
cagg
aaaa
acat
gaga
tgtg
tattt
gtat
ttttta
tgca
cata
atta
ctta
atcc
aatg
cctc
tcca
ctta
attg
ggcc
tgat
ga
1777
3547548
—NO
ccag
tgttg
ggtg
caaa
tata
ccta
ggat
a1778
2779848
___
NO
atga
gaag
ggta
ttgga
tacg
atta
ggtta
gaag
ctcc
tagc
tcct
cagt
gaag
tggt
acat
aagg
gctc
tctc
tgc
1779
2869655
___
NO
ttcttt
ttcta
ccct
cact
gtcc
tct
1780
2964686
___
NO
tctg
agca
cacc
tgtc
caga
gagc
tc1781
4004181
DMD
YES
ccttc
agaa
ccgg
aggc
aaca
gttg
aatg
aaat
gtta
aagg
attc
aaca
caat
ggct
ggaa
gcta
agga
agaa
gctg
agca
ggtc
ttagg
acag
gcca
gag
ccaa
gcttg
agtc
atgg
aagg
aggg
tccc
tata
cagt
agat
gcaa
tc
1782
2432028
PDE4DI
YES
atgt
ctaa
tgga
tatc
gcac
tctg
tccc
agP
1783
2443375
F5
YES
tact
caag
atca
agaa
gata
acgg
caat
tata
acac
aggg
ctgc
aagt
ctct
gtcc
tctg
aaat
gtat
gtaa
agag
ctat
acca
tcca
ctac
agtg
agca
ggg
agtg
gaat
ggaa
acca
taca
ggct
gaaa
tcct
c
1784
2736714
NO
ttcca
gagt
ttgca
ggga
cagt
taga
gtca
atat
gaat
tctc
agaa
taag
cact
gcca
gata
gtcc
ggaa
gccc
tttac
aagg
aaag
tgac
atttt
tgct
gtgt
tttg
aata
gtga
acaa
gaat
gtgc
agga
agag
aatg
aaag
gata
tcac
tagc
agaa
gaaa
tgcc
tcttt
gtaa
aacc
atag
aaaa
taac
tctg
agttg
ccat
atag
gaca
tatg
agaa
tttca
gtgt
tact
gaaa
caaa
agct
tttgg
aaaa
ggta
tggt
gatta
aattg
ggta
taca
atct
gaaa
ggta
aattg
agga
caac
ttgtg
tgc
cagt
ttgca
1785
2969491
NO
ttgta
gctg
ttaga
aggg
atag
ttcta
caag
gtat
ccaa
atttc
ctaa
gtat
atttc
tttaa
atttt
ccttt
tattg
aattt
tctg
tttac
tggc
atta
gtct
cata
tcag
ttttg
gcta
ccttt
agct
gcttt
tgttg
aattc
tattt
aata
aagg
cact
gcaa
gtat
aaaa
ttatta
aaaa
taga
caag
caca
atcc
tttct
ctgc
taaa
caag
ctta
tga
aaaa
cctg
gtga
tacc
caaa
gagc
taca
aagg
aaac
tctc
acac
ttg
1786
3417650
RBMS2
NO
ctgc
acttg
tgga
acat
caca
tggc
aaaa
acag
gagt
tttttc
gcta
gact
tttttt
ttcttt
ttaac
ctta
ttaaa
aatg
agat
tggt
ccta
a1787
2586174
___
NO
ctttg
ctta
taat
gttc
tgta
gtgc
tgat
ggac
tgcc
tgat
tgta
ga1788
3777613
—NO
aatg
ctgc
aggc
atca
gtgg
gaga
g1789
3888166
CSElL
YES
tgac
atcc
cgtc
ttcct
atat
ggcc
ttattt
cctc
atct
ccttc
agcc
agtg
ctttg
ggaa
agaa
cagg
aaat
attc
ctgc
tcta
gtga
ggct
tcttc
aagc
attc
ttag
aacg
cggt
tcaa
acac
aata
gcaa
gtg
1790
2332
174
CT
PSY
ES
tgaa
ttctc
aaga
aacg
tgct
ggga
tggc
aag
1791
2684
865
VG
LL
3N
Oag
aacc
aact
acag
tcac
ctct
gcta
cctc
agca
tggg
ctgg
agcc
tttca
tgga
acag
taga
cata
gtgc
ccag
cgtg
ggat
tcga
taca
g17
9227
5642
6—
NO
aggc
aatg
gcga
tttta
ggct
ctcc
aggt
gatta
caat
atgc
aacc
atgc
tccc
aaat
gtct
gctg
taaa
ccaa
catc
tttcg
gagg
acca
gttg
aaaa
taat
atttc
tcaa
atta
atgt
gaaa
agtg
tttgc
tgttg
agttg
cggc
ctttc
agtc
ccgc
ctttg
ttctc
tact
cacc
actg
ttgag
ttgtg
gcct
ttcag
tccc
gcct
gtgt
tct
ctac
tcac
ctct
actg
cttg
ttttg
ctcc
tgat
tcaa
acca
gttc
caca
cata
ctaa
gccc
actg
tgct
aggt
ggcc
tgac
cgtg
gtga
atat
tattg
tgga
ggaa
ggac
tttgc
tgta
agaa
attg
cattc
ccca
aaac
tgaa
acca
tgat
attta
ctca
actg
aggt
aaaa
aatg
aaag
acta
aggg
ggac
tccc
aagg
gtca
ggg
caag
aata
aata
ccttg
gaat
atta
atac
ccat
ctca
tgat
gcct
gagt
gtaa
atgc
tcc
1793
3309131
C10orf4
NO
ggaa
atgc
taat
ttgag
cttc
attc
atag
ggga
acct
acta
tata
ttgca
tccc
tgct
ggttg
gaaa
ttatc
ttcat
ctct
ggac
tgca
ttgttt
agaa
aaat
gtta
at6
ggct
taca
attc
tgag
aact
ttattg
tgtg
gctc
tg
1794
3435688
ARL6IP
YES
ctga
cgga
tgag
caga
agtc
ccga
atcc
aggc
cat
4
1795
3544550
LOC731
YYEESS
tgga
aagc
tttga
agac
ggcg
gttc
tggg
223
1796
3737293
KIAA16
YES
tgaa
ggca
ttgtc
tgca
tttcc
aaga
agca
ccta
gata
aata
cattc
ctta
caag
tacg
tcat
ttata
atgg
ggaa
tcttt
tgag
tatg
agttc
attta
caag
cacc
18
agca
gaag
aagg
gcga
gtac
gtca
acc
1797
2769855
NO
ttagt
ttggg
tctg
atct
ttgttt
c1798
2914869
—_
NO
cttc
cttg
tgga
agttg
gcttg
atg
1799
2754682
NO
acag
agcc
cgag
ctttt
gtcc
tgca
ggct
caag
cttc
taga
ttcgt
cttc
tcgt
taat
gcgg
ggac
ggac
1800
2353146
—_
NO
ttcca
gcag
gcac
cgaa
aaag
ccct
gtga
gcttc
tctc
tgag
cggt
ggag
aact
gcac
atgt
atgg
atgt
tgct
gcttc
c1801
2685210
—NO
atat
ttgag
tgtc
ttattg
gact
tgga
acttc
cacc
agta
tctc
caga
atta
ttgtc
tgtc
tgca
g1802
2774664
—NO
agttt
tggt
tccg
acaa
gctg
cacc
attc
caac
tgcc
1803
2517907
—NO
taca
caaa
ttgaa
cgcg
gtag
ggtg
gggg
agga
agta
ggga
gata
aagc
ctat
gctg
ctga
ttcct
caat
tata
ggag
cagt
ctct
aaaa
gccc
tcgt
caat
ctag
tgat
gtgt
1804
3156168
NO
gtgc
aagc
tgtg
tagt
ggat
cattg
agtg
t1805
3359979
NUP98
YES
ggac
tcttt
ggaa
ccac
aaat
acca
cctc
taat
ccttt
tggc
agca
catc
tggc
tccc
tcttt
gggc
caag
tagt
tttac
agct
gctc
ctac
tggg
acta
ctat
ta1806
3701355
—NO
tcaa
taaa
agttg
tattg
agat
gtga
ttcac
atac
tgta
caat
ttatc
ccttt
aaaa
tatg
caat
tcgg
tagt
tttta
gtgc
attc
acag
agttg
tgca
gcca
ttatc
gtaa
tcag
tt
1807
2329396
HMGB4
YES
gtca
gcta
gaaa
ccgg
tgca
gagg
gaaa
ag1808
3290788
CCDC6
NO
tcca
tgct
caag
agcc
attg
taag
agat
taag
gggt
ttcta
ggttt
ttggt
gattt
tttgt
ttgttt
ttttc
tttgt
ttttta
gggt
tttttt
ttcttc
tttaa
tttttt
gatta
aaa
cata
caca
cagc
tgtta
gcat
aaag
tcgt
gggg
ggca
ttttc
tgga
atgc
tcag
cagt
tctg
atta
actg
ccaa
gccc
aggt
tgcc
tctc
atga
ggca
actg
aaa
aaat
cctg
tgtc
ttgat
agca
tggg
tgc
1809
3449856
—NO
ctttg
actc
cattg
tctg
ctttg
gct
1810
3573894
DIO2
YES
aggg
gaac
caga
gcgc
acaa
ggga
a1811
3670649
—NO
ccac
gttg
tcaa
gcac
agaa
ctat
aaaa
acaa
tgga
ttcag
tggg
tgaa
ggta
ggag
aagg
ggttc
aaga
gattc
agac
ttaca
gtag
gaaa
gtta
tttttc
caa
tctc
agta
agttt
tttag
ggtta
tgaa
gattg
agca
ctga
cgac
gctg
aaat
acca
cagt
gcaa
atgc
ttccg
taga
ttccc
tggg
ctct
gcac
tcac
caga
tcc
atttc
tatg
ataa
agca
caga
gctc
ctca
aaca
gaac
agtc
gagt
tgca
ggct
cagc
agcc
tcct
tgtc
agtg
tcct
gctc
aact
cagc
1812
3851066
NO
tgca
gtgc
ctgg
atca
taac
tctg
gatc
atca
ctga
aacc
t1813
3912092
SYCP2
YES
tcct
tggg
agac
ctgg
caaa
atga
at1814
2626159
RPP
14
NO
tcta
ctttt
aaat
cagg
cgtg
gcag
c1815
3448099
BHLHE
NO
cacc
agct
gtaa
aaga
tcct
atgc
gaaa
gaca
ctgg
ctct
tttttt
taat
cccc
caaa
taaa
ttttg
cccc
ctttt
aggc
catg
ttcca
ttatc
tctta
aaat
tgga
a41
ccta
attc
gaga
ggaa
gtaa
gaag
ggtc
tgttc
tgtg
gctg
agct
aggt
gaac
cccg
gggt
aggg
gaaa
gatg
ttaac
acct
ttgac
gtct
ttgga
gttg
acat
ggaa
cagc
aggt
agttg
ttatg
taga
gcta
gttc
tcaa
agct
gccc
tgcc
tgttt
tagg
aggc
gttc
caca
aaca
gattg
aggc
tcttt
taga
attg
aattt
actc
ttcag
tattt
tcta
atgt
tcag
ctttc
taaa
aggc
atat
atttt
tcaa
agaa
gtga
ggat
gcag
tttct
cacg
ttgca
acct
attc
tgaa
gtgg
tttaa
atgg
tatc
tct
tagt
aact
tgca
ctcg
ttaaa
gaaa
cacg
gagc
tggg
ccat
cgtc
agaa
ctaa
gtca
ggga
agga
gatg
gatg
agaa
ggcc
agaa
tcat
tcct
agta
cattt
gcta
acac
tttat
tgag
aaat
tgac
catg
aatta
atgg
actc
atct
taat
ttcttc
taag
tcca
tata
taga
taga
tatc
tatc
tgta
caga
tttct
attta
tcca
tag
atag
gtat
ctat
acat
acac
atct
caag
tgca
tcta
ttccc
actc
tcat
taat
ccat
catg
ttcct
aaat
ttttg
taat
ctta
ctgt
aaaa
aaaa
gtgc
actg
aact
tca
aaac
aaaa
caaa
aaac
aaca
acaa
caaa
aaac
aagt
ccaa
actg
atat
atcc
tata
ttctg
ttaaa
attc
aaaa
gtga
acga
aagc
attta
actg
gcca
gtttt
gattg
caaa
tgct
gtaa
agat
atag
aatg
aagt
cctg
tgag
gcct
tcct
atct
ccaa
gtct
atgt
atttt
ctgg
agac
caaa
ccag
atac
caga
taat
caca
aag
aaag
ctttt
ttaat
aagg
ctta
aacc
aaga
ccttg
tcta
gata
ttttta
gtttg
ttgcc
aagg
tag
1816
3872
452
ZN
F552
YE
Sag
cacg
agag
actg
ctcc
ctac
agaa
gaac
cttc
tgtg
tggt
gtga
atgt
ggga
aatc
ctct
agca
aata
tgac
agct
tcag
taat
catc
aagg
agttc
acac
taga
gaaa
aacc
ttata
cgtg
tggg
atat
gtgg
gaaa
ttattt
aaca
gtaa
gtcc
cacc
tcct
tgta
cacc
agag
aattc
acac
tgga
gaga
agcc
atat
gagt
gtga
ggttt
gtca
gaaa
tttttt
aggc
acaa
gtac
cacc
tcat
tgca
cacc
agag
agttc
acac
tgga
gaaa
ggcc
atat
gaat
gcag
tgat
tgtg
ggaa
gtc
attta
ccca
cagc
tcta
cattc
cgtg
ttcat
aaga
gagt
tcac
actg
gtca
gaag
cctta
tgag
tgca
gtga
atgt
ggga
aatc
ttttg
ccga
aagc
tcca
gtct
cact
aaac
acag
gaga
gttc
acac
tgga
gaaa
agcc
ttacg
ggtg
cagt
gaat
gtga
aaaa
aaat
ttagg
caaa
tctc
ttcac
ttcgt
catc
1817
2489
627
POL
E4
NO
gtgt
tctg
cata
agtg
gcttc
ctga
1818
2934
111
—N
Otc
aaga
ttata
tatg
gcag
atga
ggat
aa18
1931
8129
0T
MO
Dl
NO
gtaa
tggc
ccag
ctta
gaga
cttc
agct
actg
atct
catc
actta
ttaga
caaa
ttgct
gctg
acct
tacg
cctg
tata
ttaag
cctc
cgca
ggat
gccg
gaca
atg
gtga
agaa
actc
caga
tatc
aagg
aattg
ggaa
atcc
tggc
caaa
ccac
ccca
agat
gatta
cact
gaaa
tgta
gtat
tagt
actg
ctgc
caga
tctc
ttttt
aaca
tcat
gtgc
gtct
cttg
ggat
ccag
caaa
agtg
ttaag
ccac
aatg
ccct
tgtg
ccttt
taat
atac
caca
gtgc
cagt
taaa
ctaa
tattt
ttgttt
gttg
cttt
tggg
agtta
ttttc
atta
gtga
tttca
gcaa
atct
catg
ataa
agga
caag
gtca
agaa
ctcc
agag
cact
gagc
agag
aggc
tggt
gatg
aaaa
ggtg
aag
gcct
gcgc
actg
aact
gtaa
1820
3487
237
AK
APl
YE
Saa
tgttc
cctg
tgcc
aagt
tcac
aagt
g1
1821
4009
127
JAR
IDl
YE
Stg
agtg
tccc
ccca
cagt
agtg
gtga
agga
ggag
ttagg
tggg
gatg
tgaa
ggtg
gagt
caac
atcg
ccta
agac
cttc
ctgg
agag
caag
gagg
agct
Cga
gtca
cagc
ccag
aacc
ctgc
acca
agat
gacc
atga
ggct
acgg
a
1822
2750
647
—N
Otg
agta
cagc
ccac
tgat
tgac
attc
aaga
ccca
ttgga
aaaa
tcag
gaga
caca
agag
tggg
aaga
gtgc
agat
tgga
gcag
ctat
ccaa
aaat
aca
1823
2916
198
ZN
F292
YE
Sgg
gctg
gact
tgct
acct
gtat
agaa
ctgt
gtgt
aaag
gctc
ttcgc
ttgga
gtct
acag
aaaa
tact
gaag
tgaa
aata
tcta
tttgc
aaga
ccat
ttcat
gttt
gttg
cctg
atga
tctg
gaag
ttaaa
cgtg
cttg
tcaa
ctga
gtga
atttc
ttattg
agcc
taca
gtag
atgc
gtat
tatg
ctgt
ggaa
atgt
tgta
taat
cagc
cag
acca
gaaa
tatg
atga
agag
aatc
ttcca
atac
caaa
ttcttt
acgc
tgtg
agct
gtta
cttg
tattg
aaaa
ctca
atgg
ccct
ttgat
ccag
aattc
tggg
attg
gaaa
acct
tgaa
acga
caat
gtct
tgca
ttaat
ggga
gaag
aagc
atcc
attg
tgtc
ttcaa
taga
tgaa
ctaa
atga
cagt
gaag
tata
tgaa
aaag
tggt
aga
ctac
caag
aaga
gagt
aaag
aaac
ttcta
tgaa
tggg
ctttc
tggt
ggag
ttggt
gcta
attc
tggc
cttc
ttaaa
gaca
ttggt
gatg
aaaa
gcag
aaga
agag
agag
ataa
aaca
gtta
agag
agag
ggga
tttat
atct
gctc
ggttt
agga
attg
gcaa
gcct
acat
gcag
tattg
tgtg
ttgtg
tgac
aaag
aattc
ctt
ggtc
acag
aata
gtac
gaca
tgct
caga
aaca
ttaca
aaga
tgga
attta
tagt
tgcc
ccat
atgt
gcaa
agaa
cttta
attc
taaa
gaaa
ctttt
gtcc
ctca
tgtc
acac
tgca
tgtta
aaca
atct
agta
aaga
gaga
ctag
cagc
tatg
aaac
catta
agaa
gattg
ggaa
ggcc
tcca
aaga
tcac
aact
acca
atga
aaat
cag
aaga
ctaa
tact
gtgg
ctaa
acag
gagc
agcg
acct
ataa
aaaa
gaat
agtc
tcta
ttcaa
caga
tttta
tagt
gttta
atga
caat
gatg
gttc
agat
gatg
agaa
tgat
gaca
aaga
taaa
tcct
atga
gcca
gaag
tgat
tcca
gtcc
agaa
acca
gtac
ctgt
taat
gaat
ttaat
tgcc
c
1824
2984
592
SFT
2D1
YE
Stg
gcttc
cggg
cggc
ataa
agct
ttttg
cagt
gtttt
atac
cctc
ggca
atct
tgct
gcgt
tagc
cag
1825
3123
564
—N
Oca
gttc
attc
cgttt
ccac
ctgg
cagc
tgct
ccct
ctga
cacc
caag
gact
cgca
ggga
gtgg
ccgt
tgga
cctg
caga
cc18
2631
6587
9T
EK
NO
gtag
cagc
cagt
cccg
tttca
tttag
tcat
gtga
ccac
tctg
tcttg
tgttt
ccac
agcc
tgca
agtc
agtc
cagg
atgc
taac
atct
aaaa
atag
actta
aatc
tca
ttgct
taca
agcc
taag
aatc
tttag
agaa
gtat
acat
aagt
ttagg
ataa
aata
atgg
gattt
tcttt
tcttt
tctc
tggt
aata
ttgac
ttgta
tattt
taag
aaat
aaca
gaaa
gcct
gggt
gaca
tttgg
gaga
catg
tgac
attta
tata
ttgaa
ttaat
atcc
ctac
atgt
attg
caca
ttgta
aaaa
gtttt
agttt
tgat
gagt
tgtg
agt
ttacc
ttgta
tact
gtag
gcac
acttt
gcac
tgat
atat
catg
1827
3887
555
...N
Otc
cattc
cagg
attg
gtgc
tgaa
gcac
atgt
gtcc
tgaa
gttc
agga
cgtc
aggg
taat
tgac
acag
aaga
aa18
2823
5674
0—
NO
tata
atct
agaa
acat
tcttc
agcg
tttttt
ttcttt
catta
catg
aaga
gtca
agac
cagt
ggttt
tgtg
gaat
gcat
gata
atct
gggt
tctta
aaaa
tcat
ttcttc
ttga
ttaga
tttcc
cattg
tcttg
aattg
actc
agtc
agcc
tttcc
tcac
cgct
actc
cagg
ttcac
taat
g
1829
2665
536
C3o
rf48
YE
Sga
gaat
ggat
aggc
tttta
ggtc
ttgga
agaa
aaga
agtg
tcca
gggt
tcaa
tgga
gtgg
ctgc
tct
1830
2833
553
._-
NO
ttttg
ggga
tgac
attg
tgga
aagt
ttacc
ctca
1831
3242
698
-_-
NO
tgta
atac
ctca
tcat
agat
atttg
gggt
gccc
ttca
1832
3911
222
PME
PAN
Oga
agttc
tagc
cact
cgag
ctca
tgca
tgtg
aaac
gtgt
gcttt
acga
aggt
ggca
gctg
acag
acgt
gggc
tctg
catg
ccgc
cagc
ctag
taga
aagt
tc1
tcgt
tcat
tggc
aaca
gcag
aacc
tgcc
tctc
cgtg
aagt
cgtc
agcc
taaa
atttg
tttct
ctct
tgaa
gagg
attc
tttga
aaag
gtcc
tgca
gaga
aatc
agta
cagg
ttatc
ccga
aagg
ta
1833
2676501
—NO
attg
aaga
accg
ctgc
tgga
gaac
t1834
2839345
___
NO
gcag
cctta
ttcag
caac
gctg
ggga
tgac
aaca
tg1835
3239038
___
NO
gaat
aagc
agaa
acgg
ctgg
gcgt
g1836
3475695
ZCCHC
YES
cctc
ggaa
tgct
gctc
gaat
aagt
gaaa
agag
aaaa
gagt
atat
ggat
gcct
gtgg
agaa
gcaa
acaa
tcag
aattt
ccag
cagc
gata
ccac
gcag
aag
Q Oaa
gta
1837
3563672
NO
gttc
tgag
gccc
agca
actc
tgag
a1838
2333149
CDC20
YES
acct
gaac
cttg
tgga
ttgga
gttc
tggg
aatg
tact
ggcc
gtgg
cact
ggac
aaca
gtgt
gtac
ctgt
ggag
tgca
agct
ctgg
tgac
atcc
tgca
gcttt
tgc
aaat
ggag
cagc
ctgg
ggaa
tata
tatc
ctct
gtgg
cctg
1839
3740914
NO
ctgg
tgag
gagc
cgca
actg
ggca
tctc
c1840
3949524
___
NO
ataa
aaca
tcttt
gacg
tggg
agct
gcag
tact
g1841
2598329
FNl
YES
cattg
tctc
cacc
aaca
aact
tgca
tctg
gag
1842
2354732
___
NO
gggt
gtca
tgga
atct
tagg
agcc
ctgc
attc
caat
tgcc
cagg
ctt
1843
2534399
—NO
caga
atga
tgga
gtttg
catg
gcac
agt
1844
2677654
C3orf63
NO
cagt
acct
gcgc
aatc
cttg
attta
atga
aatta
aattt
tttat
ttcaa
aaga
tagg
cttc
tgttt
atca
aagg
cgttg
aaca
atttg
atttt
taaa
ttagt
tttag
caaa
o OO
atgg
aagt
ataa
tggg
aata
tattt
ttgag
gtgg
tctta
gtag
taat
tacc
atttg
ttgaa
caca
ttaat
gcta
tact
agac
agat
acag
tgga
gata
gcttt
caaa
cccg
tatc
tattc
ctaa
ctac
tgcc
tcta
atac
taat
tact
aact
tgtta
ctaa
tgct
aact
tttaa
aaat
gtttt
tgaa
aattg
caat
ttcta
caaa
taaa
tgtc
atag
tgca
aata
aatg
tcag
gattt
agag
aagg
agcc
taaa
ctga
atttg
tgtta
tttca
gtaa
aagt
tatg
gagt
tatg
ggtc
taga
cgtg
ttaat
aagt
taga
caga
act
ttggc
acttt
agaa
caaa
tgca
tgag
tggt
atttc
agttc
ctaa
gtta
cata
aaaa
gtgt
gaaa
gaac
actg
tgag
gtcc
cagc
aacg
ccag
acta
tatta
agg
taag
taga
aagt
gtttt
tata
gggc
ttcaa
tacc
cagg
tggt
gaca
gagc
agaa
gaac
cggt
tttttt
tttttt
tttttt
acta
ctaa
gcttt
taca
aaag
aagt
atct
gttt
tattg
tata
gaac
aagt
acag
cattt
tact
acat
agta
taca
agttt
ttaat
gagc
attta
aaaa
aata
agta
aatc
tagg
ctat
ttgaa
aaat
acag
ttgag
cca
gtga
gcac
atttt
ataa
tttgg
aaga
caca
aatc
aaat
gtga
agga
tttga
ttttc
taca
tttaa
aatg
aaga
acca
aaac
tctc
ttcttg
atttt
cagc
taaa
ggca
ggag
acta
ctttc
caac
tcct
tttgc
ttctg
gaga
aggc
cctg
aaat
caca
ttgat
atat
gtttg
ttagt
aaaa
tgtg
caca
cctg
acttg
caat
gttg
tgtta
aact
agaa
ttata
ttcac
ttgga
aaac
atat
cgct
tagg
ataa
aattt
tgtc
a
1845
2495347
—NO
agaa
agtc
ttaat
ggga
gagt
gccc
atgg
tata
t1846
3436411
—NO
cacc
atcc
ttgag
gggt
cgag
ccag
gact
ggaa
cctc
agtc
cggc
cact
acaa
cttc
ataa
atac
catc
gccc
tgag
atgc
tgaa
atcc
cagg
ggcc
ccc
cacc
ctca
gacc
aggt
gccg
gtat
tg
1847
3630941
—NO
tccc
tggt
atgt
acac
gcat
tcct
gtgt
tttgt
gaaa
aacc
gaca
ccat
gctc
ctcc
ctca
ctac
atgt
aaaa
cact
tttat
tcat
taaa
aaga
aaac
tgac
tggc
ttg
gacc
taca
aatta
gtttc
atta
tttgt
taat
gtttg
aaag
ccat
taaa
agat
gaat
atta
aggt
ttcttt
atac
tcaa
tact
tgta
gtttt
gtttg
gggg
aatg
agag
gatg
ccct
tggt
acct
ttgtg
aggc
ctct
ccac
tgag
ggtc
aatc
atga
cttc
tgttt
taaa
ccag
ccca
tccc
atct
tctc
cagc
tgct
ctcc
ttatg
tcttg
cttc
tct
cccc
tcca
acct
tctc
agca
taag
gact
caat
ccta
ggct
ccta
cccc
agac
gggt
gcct
tcca
acgt
tcct
ggtg
c
1848
3886257
—NO
cact
tggg
ttgga
aact
ctgg
cttc
tttcc
cccc
atgg
cctg
tttgg
ctca
cagg
tgcc
acca
gcct
cacc
ttgat
ccac
tgtc
acttg
tgct
tact
cggt
gtta
gct
gcttt
gcct
ccttt
cagt
ccct
cgaa
ccct
gagc
tcttc
tcag
agct
ccta
gtgc
tcct
ctgt
cggc
ctgc
agtg
cact
cccg
cttg
tgcc
tattg
tgta
1849
4004292
—NO
agga
tgaa
ttctc
acca
acca
tata
ttattt
tccg
ataa
cagt
gtttg
gtag
tgag
gggg
atct
attg
ctag
gtaa
aata
cgat
ggat
tcaa
ggag
ctta
ctgt
ttgat
gata
gggc
aaga
tgtg
ttcat
ggat
acta
gattg
aga
1850
2960481
—NO
ttgga
gcca
gtca
ggct
tttgt
ttaga
cattt
taac
tttttc
ttgct
ttcct
tgca
aact
cctc
agcc
ttcag
actg
gttg
gaaa
gtaa
atgt
acaa
tctta
cata
aattt
tcag
gtaa
tagc
atttc
agct
ttttc
ccca
agat
tttttg
cttg
ggag
gaga
caga
ttaga
ctgg
attc
ggag
tcttg
atttt
gcaa
aggt
aaca
aaag
acat
gttt
tttta
taag
acttt
tcat
cata
agttt
atttt
attc
aaca
gaag
caaa
atct
aata
taat
ggaa
aaaa
taaa
gatc
tgtg
ataa
atct
gatc
tgtg
tgga
taaa
caca
atta
gaaa
gact
taaa
gatta
agta
ttgaa
acaa
acta
ccaa
aata
tttta
atac
tgat
ttgta
aaaa
tttca
gtac
atttt
tcttc
tttgc
ttaat
tcta
ctgg
gtcc
tgtt
t1851
2977359
—NO
tagt
cattg
gtga
agca
ggat
tcca
a1852
3199214
—NO
tcag
gcttc
tgttg
caag
ccaa
tgta
ggga
acca
1853
2743060
—NO
ttcct
gagt
tcaa
gata
tagt
taag
tttaa
ataa
tagg
acttg
tgta
tgaa
ttgga
tggt
gtgg
gagg
gaaa
tcaa
aagt
tctg
ggcc
gggc
gcag
tggc
tcac
gcct
gtaa
tccc
agca
ctttg
ggag
tctg
aggc
gggc
agat
caca
aggt
cagg
agat
tgag
acca
tcct
ggct
aaca
cgct
gaaa
cccc
atct
ctac
taaa
gata
caac
aaaa
tccg
ggcg
tgg
O1854
2811122
NO
agga
gttc
tcca
cata
ctct
gccc
accc
tgat
agca
actg
aggt
aggc
caga
aact
tctc
atgc
actg
gtta
cctc
ctga
gcca
tcca
caga
aagg
gtca
O1855
3560070
—NO
gtgt
gatc
aaaa
aagt
ggtc
agtg
tgaa
1856
2597541
—NO
acat
tgaa
attc
aaga
tgca
ggca
gaga
aaga
ggac
atag
acca
gat
1857
3747622
___
NO
gcgc
gtgc
tcaa
cttg
gatg
ggac
acac
ctag
tcttt
cttc
aagt
ccct
ggat
tcaa
agag
cttc
aacc
gttc
ttgca
ccgt
cagg
c1858
2332216
—NO
ttgat
tcat
ccta
tccg
tgag
cgtg
g1859
3029849
—NO
gctg
gcac
ccca
gaag
aagc
tagc
tcag
catg
tgcg
ggtc
1860
3090304
ADAM
YES
atac
attg
atct
ctat
ttggt
gctg
gaDECl
1861
3441298
___
NO
acta
tctg
aaaa
tggg
cagg
tggt
gca
1862
3584491
NO
tgcg
atct
ccgc
tcaa
aaag
atac
ttttg
aatc
ctttt
caaa
agta
atttg
tgttt
cattt
tttaa
ttagt
atga
taag
cact
tcaa
gacc
cttta
caat
tctg
gcta
gat
ttggt
tgcc
atat
ttaat
agca
tttta
taaa
taat
ggag
aaaa
ggga
aaca
ttatg
catg
cttg
atta
cggt
catg
agct
gcat
taac
aattc
cgttc
aaca
acag
act
gcat
atgt
caga
gcag
tccc
ttaag
atta
caaa
1863
2775809
SEC31A
YES
gtaa
acttt
gagg
atga
ttctc
gtgg
aaaa
tacc
ttgaa
cttc
tagg
atac
a1864
3380586
SHANK
YES
tttcg
atac
aaga
agcg
ggtg
tata
aaca
agcc
agtc
tcga
tgag
aaac
agttg
gcca
agct
ccac
acga
ag2
1865
3410510
NO
tgtg
tccc
gcag
ctca
atca
gact
tctg
gatg
ccat
caag
gagg
tctg
catg
acct
aagg
atca
ttgttc
tttga
aagg
aaaa
caag
ttcat
taat
ctta
ctgg
ctg
ccag
ggtta
ggat
gcaa
agtta
atca
atta
ttagt
gact
accg
tcta
ccga
atga
ctac
gtgc
aagc
agtc
atgc
atgg
agga
agaa
aagg
acaa
agag
aaag
gaaa
ataa
gtaa
aaaa
caaa
cgttt
aatg
atttt
gata
atat
aggc
tcag
ttaca
tcac
caca
aaca
1866
3266758
NO
gttc
tgga
actc
ttcac
acct
gtga
cctg
cat
1867
3878506
SEC23B
YES
gaaa
ctgg
agca
ccca
tcct
aact
gatg
atgt
1868
3189392
—NO
tgca
agat
tcttg
gatc
ctcg
tccg
gtac
atttt
ggct
gcat
tttat
gtga
cgttt
gtga
gggg
gtta
gaaa
tact
ttaaa
atttc
aggg
aact
tgaa
taac
aatg
aaa
gatg
tgca
gcta
gtcc
aaac
agat
ggac
ttcag
tagc
aattc
cagg
ggcg
tttc
1869
3737624
KIAA
13
YES
ctcc
ctgc
agaa
gcgt
cccg
acgg
ccac
atcg
tgag
tgtg
ag03
1870
2587558
SP3
NO
tagc
cgct
ctcg
aaaa
ccta
cgct
gcca
cggc
cgct
cattg
tctc
tccc
cttc
cacc
cggg
ggca
aaca
ggaa
gcgc
gccg
cctg
gcag
accg
acgg
acag
gcgc
ctgg
acca
atga
gcac
agcc
gaca
aaga
gcac
ggcg
gcga
atga
1871
2636399
BOC
YES
gccc
tacg
tggt
gtcg
ggct
acag
cggt
cgcg
tgta
cgag
aggc
ccgt
ggca
ggtc
ctta
tatc
acct
tcac
ggat
gcgg
tcaa
tgag
acca
ccat
catg
ctc
aagt
gg1872
3383004
RSFl
YES
aact
gcca
gtca
tagt
gaag
ctag
aaaa
acct
ttgcc
agaa
aatg
aaga
aaaa
aaga
ttatc
aaag
aaga
aagt
gattc
cttc
aagg
aaaa
tgtc
aaac
cca
Otta
aagt
tgag
gtga
agga
atgt
agag
caga
tcct
aaag
atac
caaa
agta
gcat
ggag
aagc
cagt
ggca
cagg
agcc
tgaa
agga
tcga
atttg
gtgg
Oca
atat
taaa
tcttc
tcac
gaaa
ttact
gaga
aatc
tact
gaag
aaac
tgag
aaac
ttaaa
aatg
acca
gcag
gcca
agat
acca
ctaa
aaaa
acga
gaaa
ttaa
actg
agtg
atga
ttttg
acag
tcca
gtca
aggg
acct
ttgtg
taaa
tcag
ttact
ccaa
caaa
agag
tttttg
aaag
atga
aata
aaac
aaga
ggaa
gaga
cttg
taaa
agga
tctc
taca
atca
ctgc
tttgg
gtca
tgaa
ggga
aa
1873
3959450
—NO
tgttt
ttatc
acag
ggcc
gtcg
ttgaa
gcat
gtat
gact
atgt
gtgt
gaaa
atgt
tctg
caaa
ccaa
aaag
ggct
cagg
tgct
ataa
tt1874
3984482
SRPX2
YES
gatg
agat
gcca
cgca
ctac
cattc
atca
ctag
tggc
actta
cacc
tgca
caaa
tgga
gtgc
ttcttg
actc
tcgc
tgtg
acta
1875
2333884
___
NO
ttgta
gagt
atga
acac
tcct
ttact
ttcag
caac
gtcg
gcat
ctcc
aaat
aa1876
3392870
—NO
atgc
atga
gggc
acgc
ctcc
tact
ttc1877
2395894
CLSTN
NO
gccg
cgcg
gtttc
ccttc
gcag
atgt
gtat
1
1878
2701311
NO
attg
agcc
tgcc
catc
atgc
attg
caac
agga
aggt
ggat
ttgaa
aagg
gaaa
agaa
ttaaa
acat
ccca
tcac
acgt
gtaa
t1879
2976681
—NO
caga
tacg
ggtg
tgta
cttg
aatc
aagg
agag
gcag
taga
gcag
cctta
tgttt
tgttc
ccca
aaga
gtga
gatg
tgta
ccca
attc
ttgca
cata
gtga
aga
agct
gttc
tcttt
tttgt
tgttt
gtca
catc
tgat
tattc
atgg
agaa
agttg
cagc
ttggg
gcta
gtaa
aattt
tttttt
aaca
ccgt
tctc
atat
tattt
gctc
tcttc
tttt
aata
aagg
acag
atag
gcct
tgat
gtgg
ctgt
aa
1880
3398768
NO
gcaa
ctttc
agtg
gagg
cgct
gaga
gcaa
attg
cagc
agag
atgc
gctc
aggg
tgca
gtaa
caag
gaag
gacc
cgct
ctcc
gtgt
cagt
ggcc
tgca
gggg
aagt
gaga
accg
ggct
catg
ctgg
gctc
ctct
acttt
gcag
ggttt
aagg
ccg
1881
2376
766
SRG
AP
YYEE
SSac
taca
gcat
gaag
gaat
tttcc
gggt
gtca
ggat
ccca
218
8226
4835
1...
NO
ggag
cagc
ttcaa
tcaa
aagg
cggg
gctt
1883
2855
376
FLJ3
225
NO
ccttt
tctg
gggg
atcc
tagg
caga
agcg
atgg
cagg
acaa
518
8437
1558
5...
NO
aatc
acca
tgac
aggc
agag
tgtg
ggca
1885
2432
021
...N
Oag
tttag
ttctc
ttacg
tgtg
gcta
gcca
at
1886
2527
209
RPL
37A
NO
ttttta
taat
gtgt
ctag
gctg
ggt
1887
3753
258
—N
Ottg
agct
gccc
aagc
cacc
taga
gg
1888
2607
727
...N
Ogc
tgtg
cgaa
tgga
ctca
gctg
cccg
aact
caca
gaat
atca
gtaa
cagc
accg
aaac
ttcac
a
1889
2970
557
HD
AC
2Y
ES
gctg
tcaa
ttttc
caat
gaga
gatg
gtat
agat
gatg
agtc
atat
gggc
agat
attta
agcc
t
1890
3233
715
LO
C39
9N
Occ
ttctg
caaa
tggg
gatg
agtc
ctc
715
1891
2555
212
PUSl
OYYE
ESS
catg
ctgc
atgg
ttgct
ggta
aaac
a
1892
2601
671
DO
CK
lYYE
ESS
tatc
ctgg
tgga
gcag
ctat
acat
gtgt
gtgg
agttt
ctct
ggaa
gtct
gagc
gata
tgaa
ctca
ttgct
gatg
tcaa
caag
ccca
tcat
tgct
gtct
ttgag
OU
1893
3630
677
CA
LM
LN
Otg
ggcc
tgaa
aact
tgga
gcaa
ttaat
tttttt
taaa
aagt
gttc
ttttc
acttg
ggag
agat
ggca
aaca
cagt
ggca
agac
aaca
ttacc
caac
tata
gaag
a4
gagg
ctaa
ctag
caac
aata
atag
atga
tttca
gcca
tggt
atga
gtag
atct
tta
1894
2677
724
AR
HG
EN
Ogg
cgcc
atca
gcca
ctttt
agaa
gcca
tcag
ccag
tgtg
ttggg
aaaa
gagg
tttgt
caag
tgttg
gcct
atgg
gaag
gtgg
tcaa
tgaa
tgttt
tga
F3
1895
2898
386
—N
Otg
gagg
cattt
aatg
gcga
tgtg
cagg
tgca
ag
1896
3412
766
AR
ID2
NO
catc
cacc
tcga
aagc
tggg
catta
acga
tattg
aagg
acag
cggg
tact
tcag
attg
cagt
gattt
tgag
aaat
ctttc
ctttg
agga
gggc
aatg
ttaag
ct
cttg
gcag
ctaa
tcgt
acct
gtc
1897
3975
856
NO
tgga
atgt
acct
ggaa
tagg
gaag
tgga
cagt
gagg
gc
1898
2466
297
...N
Occ
aggt
cgca
catc
agct
gcag
gcac
agcc
agttc
atgg
gtag
acag
ccac
atcc
tgcc
ccag
gagg
aggg
cagc
cagc
atgg
gcga
tgca
agct
cccc
tcgt
ctgg
aacc
agca
caac
ctca
ggca
ctcg
agcc
ttggc
ccct
caca
agca
acgt
cctc
aatg
ggtg
gcgt
tcgt
tcat
1899
3240
817
...N
Otc
catc
gattg
actg
cgag
gggc
tagg
gttc
acag
caag
caga
acgt
cctg
ggttt
ctct
tcaa
tacc
acct
cctc
ctct
cctg
ttcag
atgc
tgag
aaac
atc
tggc
gcag
taga
gatg
ctgc
ctct
gaca
tttct
atga
caaa
acgc
gtgc
ctgt
gcta
gggt
ttgca
cagg
caca
agtta
tcac
aacc
tcgg
gatg
gggc
tg
1900
3454
311
LA
SS5
YE
Sca
tcac
ttggt
cacc
attg
ggct
tatc
tcct
tctc
ctac
atca
acaa
tatg
gttc
gagt
ggga
actc
tgat
catg
tgtc
taca
1901
2437
092
DPM
3N
Ogg
gagt
gggt
ctca
aata
taga
gggg
gtag
caag
gtgg
aagg
ggcc
cttc
cacc
ttttg
tggg
cact
caag
ttcgg
acct
ctgg
gatc
ggcg
attc
cc
1902
2683
293
NO
catc
tgtg
cttg
acgc
agag
gcct
ccgg
ggtc
gcga
gcttc
ggag
gtgg
aggt
gccc
cgtc
ttcgg
gttc
ccac
gccg
ccct
ggcg
acct
gggg
ctcc
agc
ccca
ccgc
ggac
tccg
gtgg
gacg
tggg
tggc
gcaa
gcac
acct
tgcc
cctg
tgac
tcag
cttg
aggg
ggcc
caag
ctgt
ccca
ctga
gcac
gcgc
ccag
cagg
ccgt
cgcg
ctgc
gggt
cctg
gtcc
tcct
gtca
tttac
tggg
gtgc
ggag
gcct
ccga
ggag
tctg
aggg
tggg
acag
tccc
acttc
cgga
gga
gcta
gggc
aggg
aaca
caaa
gtcc
cgag
tgcc
gggg
ca
1903
3701415
—NO
ccat
aaaa
ttcgc
atgg
tgaa
atgc
acgt
aaca
taaa
attta
ccat
ctta
acca
tttaa
atgc
acag
ttcag
tggt
atca
gata
catta
ata
1904
2590322
—NO
ctag
aatta
agat
atgc
tgat
gagt
tgct
tctg
cagt
tcat
1905
2973489
—NO
tttgg
cgttt
gcta
caga
atgg
tgaa
tttat
gtat
ttctc
ttcca
ctttg
acat
ggac
attg
cctc
attc
ttcag
tagt
gtgc
atat
tcac
ctta
attta
ttcaa
aagc
tat
tttac
ctttt
tcaa
taac
acca
catc
aatta
cctta
actta
ttcac
ttaca
tcag
tcttc
cc
1906
3020769
NO
ggat
gcag
gact
gagt
catg
ccac
t1907
2807755
—NO
gagt
agat
gtaa
ctaa
gagg
tttga
gaaa
ggaa
tttca
gcag
agtta
ttgaa
ggag
aagt
caag
gaga
ggag
aagg
agag
gagc
agtg
aatg
ttgtc
gtg
aaga
ggtg
ttgtc
ttgaa
atac
tg
1908
2918704
PRDMl
YE<
ctct
ggag
tgga
tagg
gtta
atcc
gggc
agcc
agaa
actc
ccag
3
1909
3024096
___
NO
taac
aata
tgtg
cagg
cagg
accg
aaaa
O1910
3270274
—NO
gggc
gcttt
ccca
ctgt
tcta
gacg
tagg
agag
aaag
cact
tctc
agat
ggtg
agct
tttat
cttc
caat
cttg
tcaa
ccga
ggag
acct
ggaa
gttc
ccttt
caac
ctgg
aatg
tgag
ccct
gcttg
ctgg
gcac
gaga
taag
gacc
aagg
attg
accc
agga
caat
ttcag
agag
caca
gggg
tggc
atat
cctc
atgt
cagg
gcgt
ggag
ctaa
tattc
aga
1911
3356025
NO
atgg
ccca
gaat
gggt
tgtc
agtg
ctcc
ag1912
3913713
YTHDF
NO
ggac
cgttg
agct
cact
acca
cctg
gagt
ttgag
ttgaa
gcat
gaaa
atgg
tgcc
catg
cctg
acgc
tcca
gcgc
ctgg
atct
gcac
gtgc
ccttg
taga
gg1
atcc
ttacc
gtcc
taga
gagc
agac
gcttt
ctga
aaac
tact
tgct
ccaa
aaga
ccct
ctga
gtta
acgt
ttcag
ctgt
atca
ttaga
cttg
tattt
agag
cgtg
tcac
ttcct
ctga
actg
tta
1913
3506761
NO
tgta
gtgt
tgct
gtat
cttc
catc
gtac
agtg
gaat
ggca
tctta
caga
aggg
gtttc
ggtc
ctat
ggtta
gctta
atgt
tgaa
gtca
ggga
aagt
caaa
atca
ccat
tgga
actc
ccttc
aact
gtct
ataa
aata
cact
atgc
acag
tacc
ttggt
ttacg
tata
tttgc
ataa
atac
atac
tttgt
atag
tcac
atgt
tgta
tgac
aact
tac
tgag
ttgttc
gatg
aaca
actc
tatg
tgct
aata
ataa
ttttg
atgt
atta
aaat
ggga
aagc
atttg
ctta
actg
agtg
gaga
ggca
cagg
ttgct
gagt
caga
cgt
atct
cctg
ttgca
cacg
tacc
ttcag
gatg
g
1914
3888725
PTPNl
YES
gcag
atcg
acaa
gtcc
ggga
gctg
gg1915
3357756
NO
tctg
tcca
gagt
gcaa
ggcc
gaac
caga
ggac
acac
atga
tcat
ttcta
tttta
gaaa
cacg
cctg
gggc
agag
agga
cagc
aggt
gaaa
gaga
ctga
gag
agaa
cagt
caga
ggct
agta
ctgg
ccaa
gaca
ggag
agtg
aggg
agca
accg
taat
gagg
atgg
atga
ataa
tttga
gaga
gattt
aggg
gtaa
cga
acca
gact
cact
c
1916
2371244
NO
atga
gctc
acct
tgcc
tgca
actg
tctc
t
1917
2660674
NO
tgcc
cattt
ttcta
cggc
ttgttg
atttc
ttggg
agttc
cttg
taag
cttta
tagt
ctct
gtaa
ttaat
gagt
tcca
aaca
tttttc
ctag
tttgt
ctttt
gtct
tttgt
ttttg
gtg
tggt
gtgt
caaa
caga
actg
tttttt
tgttg
ttgttc
ttaca
tagt
ttaat
tttta
atgc
ttctg
tattc
taaa
tcat
agcc
aaaa
atga
cgttc
ccat
gcaa
agga
atgt
ctca
tgttt
tcttt
ggat
acttg
ctgt
ggttt
catta
ctttt
ccgt
ggaa
agct
ctga
tcct
tgtg
cagt
tgat
cctg
atat
aa
1918
3199144
NO
taga
atgt
ttgta
tattg
ctgc
ctgg
c1919
2411720
___
NO
agat
ccaa
ttaga
gcca
gggt
cagg
1920
2682075
—NO
tgat
aata
ggga
cgttg
tcag
accc
ttaac
taag
ccac
acac
tgct
ggga
gcct
ccgg
aata
acttt
aatg
tagt
tgta
ggga
aggg
agtc
gttg
gcag
gga
tgag
gatc
cctg
cttg
actg
ctct
ctttc
ctga
cgag
gaag
aaaa
ggac
agtg
atca
agca
cagg
gtct
caaa
ggag
gtct
ta
1921
2709855
NO
ggac
agca
ttacg
gtgg
acag
tgaa
1922
3016026
MUC3B
NO
ctgc
aaaa
cggg
taca
gcat
tcct
gtat
gata
gctc
acgc
cgtc
gttg
tgaa
aacc
acat
agac
ttggt
caat
tctc
ggtc
ctac
tctg
ccct
cccg
tctc
agc
III
cctc
gtgt
tgcc
attg
cMUC3A
1923
3550736
LOC730
NNOO
acca
ctac
ttcat
gcag
ggca
gtgt
gcc
217
1924
3059100
—NO
ggaa
gcta
ccaa
ggag
acta
ttaaa
gaaa
ttcag
gatg
gtga
agac
agct
taaa
ggga
aaca
gttc
tgca
aata
gagg
cgat
ggga
gata
tagg
aaag
a
Ogg
gact
atgt
gtag
gtct
tgttg
cgtc
gctg
aag
1925
3379723
MRPL2
YEJ
tggt
tggg
gcag
acaa
cttc
acgc
tgct
tggc
aagc
cact
1
1926
3432355
NO
cgtc
gtgg
atgg
atca
gtttt
gctc
gata
gagg
gaca
tgttt
ttctg
tggc
aaca
ggag
ggca
aaag
gaga
aggt
ggcc
acag
atgc
cggt
agat
gagc
tgag
agtg
attg
tattc
ccta
tcct
ctcg
gaag
cttg
aggc
aagg
ccat
caac
agac
aatc
agag
ggaa
taag
aaga
gata
gaat
atat
gaag
aaag
ggag
aaa
agat
gaaa
tcgt
aattg
tgta
gcag
ggca
agaa
gtcc
agaa
atttc
tgtg
ctgt
gcca
agttc
ccag
ttgag
gcgg
tga
1927
3511352
—NO
ctga
cggt
ttaag
atgc
tgca
tttta
agta
atgc
tgag
ttgta
ttttta
tttca
gtca
ttcat
ttcta
ctta
ttttg
agca
ccca
ttata
tgtta
ggttc
catg
ctag
gcac
tgta
aaga
taaa
gatg
agaa
agac
tcag
ccct
tacc
taaa
ggag
tata
gagc
ctag
ttggg
gaga
cagg
aggc
agat
agtta
cagc
atgg
agag
caaa
ttct
ctga
ggga
ggaa
tgaa
gtga
gata
agag
caca
gaag
aaag
accc
gggg
atca
gagc
agac
ttcct
ggag
gagg
caat
cattg
acttg
1928
2627501
—NO
tcct
gcttg
tgaa
caga
ttcct
atttc
cgtg
agga
ggtg
acct
cgaa
gaag
agca
gcttc
tattc
tgct
ggga
ggtg
tact
gcct
tctg
gtat
cagg
cttc
tgca
tccc
ggga
aagc
tcgg
gata
acct
gagg
cacc
1929
3608077
NO
agac
caca
atga
ggga
catg
gcaa
gttc
ct1930
2393810
___
NO
ttgag
gaaa
attg
ctct
gtcc
ctat
1931
2506539
___
NO
tcca
tttaa
atga
gagg
gtgg
agct
gtcc
ttgca
aacc
aatg
atgc
agct
gatg
ccat
gcag
tgga
tcttc
cttc
cccc
taaa
1932
3068053
—NO
tttcc
caga
gctc
tacc
ttgttg
tcag
cact
ccct
gcaa
tag
1933
3762439
NO
ctaa
gaga
gtta
ctga
ccct
aact
cccc
taaa
acct
aact
atca
aaag
ccta
ccac
tgac
tgga
agcc
ttaca
ataa
cact
aaca
gttg
g
1934
3896
232
—N
Otc
agtc
tgcg
atga
aggt
gaac
ccat
ctta
taaa
gcag
agct
tact
taca
ttctg
cagg
atttt
ggtg
tgga
tgca
taga
aggc
ttacc
tggt
tagt
aagc
ctcc
attc
ctcc
gacc
taca
gaag
gcaa
ccct
tctg
cagc
tcca
agca
ggga
tttct
agaa
aaga
ccag
gtgt
acaa
aaca
tcta
agct
gtgc
ctcc
cacc
acag
gtgg
ggaa
gtcc
cctg
atttg
gtca
gaca
caga
tgaa
tgtc
tgtg
tctg
actg
agcc
caga
aaca
gttc
aattg
atgg
gggc
tgga
acca
ccaa
gaaa
aca
ctcc
tgac
ctgg
gcaa
cact
gtct
tcca
gatta
t
1935
3916
144
C21
orf7
NO
tgtg
tgta
cctg
agat
aaga
tgat
gtct
tgtc
caat
gctg
gtttc
cacc
ttgtg
ccct
gagc
tgttg
ggag
aggc
tcct
gcga
cctg
tgac
419
3623
8957
8—
NO
ggct
tacg
cagc
agat
gcac
gttc
agca
ttgag
aacc
cggt
ccga
cagg
aaag
acac
acca
tttttc
cgcc
agaa
actc
cagt
gttc
aaaa
taca
aact
gcca
acttt
tcag
gcaa
taat
catg
agct
tcca
ttaga
aaaa
gcca
tcag
ctca
ccag
acga
cgaa
atat
gctc
tcgg
gatta
agaa
ctttc
tcat
cctc
cctg
agcc
ctttc
ctttc
aggg
gaag
tgca
agttt
g
1937
3290
995
AN
K3
YE
Sag
cggg
ctaa
cacc
actg
catg
tagc
tgca
catta
cgat
aatc
agaa
agtg
gccc
ttctg
ctttt
ggac
caag
gagc
ctca
cctc
acgc
agcc
gcaa
a19
3834
0315
3E
MG
lN
Otg
tcac
atcc
tttga
ccct
ggtc
tgag
ctga
ctgc
tgga
agat
gatc
tttct
gcac
tgag
actg
tgga
gtttg
ggga
agcc
aagg
ctgt
acat
ttgct
atttg
ttta
tcct
atga
atac
tgttc
ttgca
aacc
tggt
tgttt
tggg
gttc
ctaa
agta
tcca
gtgg
tgta
aaac
tgttt
gttc
cccg
ggac
ttcag
ggac
agat
agga
ggtta
caga
gtttg
cagt
ttggt
tcca
tgct
ttgaa
ggca
ggct
ttagc
tccc
agat
tccc
atgt
gcta
aagg
agag
aacc
ctga
tgat
ggag
aaga
actg
tgaa
aga
gagc
agtc
agga
atgc
tagt
ggtg
aaaa
actg
aaca
aaca
gaag
tgat
tttat
ctaa
taca
gttc
caag
gtag
aaaa
agtg
gagc
aggc
aggg
ccttg
cac
ccct
ctcc
accc
cccc
atgg
gggg
ggtg
gtgg
tagc
ggca
cata
caca
atc
O
1939
3439875
—NO
acct
ggct
ttggt
tattg
atga
gaag
agta
atta
atag
aaca
ttccc
tttgc
tgtg
atac
ccta
ccca
aaat
agct
cggt
tgttg
gttc
acttt
cttc
agat
acct
ccttg
tctc
atgg
accg
ccta
ctta
ct
1940
3568358
—NO
tcat
gggt
ttcca
aata
tggg
atgc
aattt
caag
ttctta
acac
caaa
tata
accc
tact
tgttt
tagt
ttcct
ggca
ttcat
ccta
tcat
cccc
agttt
actg
cacc
tcag
gtta
tcaa
ggat
gtgg
gagg
gattc
taca
aggg
tatg
ttccc
ccag
agct
ttttg
gtac
cctg
caat
ccca
gaag
ttgttg
atgc
tagg
atgt
ctgg
agag
cac
tggc
cattt
atta
ctga
1941
3701322
CDYL2
YES
tgac
gtga
atca
cgct
acac
tggc
ggag
aacg
ggct
c1942
2975832
—NO
gcca
tgat
cgta
gcac
tata
ctct
agcc
1943
3072788
—NO
gtct
aattc
agaa
cccg
gtgg
gctc
tcca
tgca
ttattg
gcag
cact
tttat
ccag
gggc
acag
acag
ggaa
tagg
gtta
ctgg
aaac
aggg
agtg
tagc
tcaa
ggaa
ttgttc
tctc
tctg
gagt
ggat
aagt
tgttt
gacc
tact
gact
cattt
cata
ggaa
ttagg
aaaa
caac
ctag
tgtg
gtttt
tattg
ttagt
cctg
gagg
attc
caaa
atct
gtgt
gcct
gcaa
ataa
gatg
agct
gggg
aaag
ggag
cacc
agca
tttgg
tgttt
ctac
ttgcc
tcttt
tcct
gatg
gaat
ccaa
tcct
ctgc
ttc
1944
3380
982
NO
gctg
tttca
gtcc
acac
cttg
ttcac
agta
agat
attc
aattc
aattc
agca
aaca
tttgt
tgag
tgct
tgct
gctc
tgtg
cctg
acca
gaag
aaat
gaag
ccaa
gat
acaa
ataa
taga
accc
tgtc
atga
gcac
agaa
agac
ctgc
tgag
tttct
attg
attg
tggt
aaac
cagc
ctgc
atcc
agttc
caag
taac
agct
gccc
actt
agtg
aata
ctct
ctca
tgttt
caac
actc
agttc
tagt
gtag
cctc
ttcta
taaa
gcct
ttcttt
tcca
gata
aaat
tggt
taat
gtgt
cctg
tgtta
ggct
gattg
tttt
gatg
atcc
ctat
tcat
ggtc
tccc
ctgt
atcc
atct
ctttt
tgcc
tata
acat
tgta
gtcc
catc
tcac
tctg
atgc
tggg
acca
gaca
tgtg
acttg
tttta
gcca
atga
gatg
tcga
taaa
catg
aagc
aaac
agaa
gcttt
aaag
gagg
cctg
ccct
cttg
ctct
ttgcc
agtg
ccat
gaga
acat
ggct
gatc
agct
ggag
ggtta
tga
gatg
tgta
ggtg
agag
acat
gcaa
aaga
gcca
gaga
tacc
agct
gaag
ccat
ccta
gact
agcc
caca
gtca
gcct
accc
ccag
tcaa
gatc
agga
gag
tcac
ccaa
ccca
caga
tgac
tgcc
gaca
catg
agta
agcc
1945
3902033
—NO
attc
cacg
cgtc
aaat
tgtg
ggca
gtga
1946
3952107
___
NO
acag
gcac
aggg
gagt
caag
ctta
g1947
2678585
___
NO
agca
ccca
gtct
ctgt
tctc
agag
g1948
3463862
PPFIA2
YES
tctc
ttggg
gatc
acga
gtgg
aata
gaac
tcaa
caga
ttgga
gtac
taag
cagc
cacc
ctttt
gaaa
gtga
cact
gaaa
tgtc
tgat
attg
atga
tgat
gaca
gag
aaac
aattt
ttagc
tcaa
tgga
tcttc
tctc
tcca
agtg
gtca
ttccg
atgc
ccag
acgc
ta
1949
3727958
...
NO
cacc
atct
tctc
ggat
ggat
ggca
aag
1950
3974043
...
NO
agca
agcc
agtg
cagt
ccttg
gtca
cagt
cact
caca
g
O1951
3982214
—NO
ccaa
aacc
aatg
tgtc
taag
gtgc
ggca
tcttg
1952
2685843
...
NO
gcgg
tggt
ccaa
tcat
agttc
acta
t1953
2686522
ABI3BP
NO
ccag
acat
gcca
ccaa
ctaa
atcc
g1954
4014220
POFlB
NO
cttc
ctct
gtag
ctgc
caga
cctg
c1955
2838069
TTCl
YES
aatg
gcca
tcaa
tgac
tgca
gcaa
ag1956
3701416
...
NO
cgca
tctg
gcct
atac
cagt
gtat
gtttt
cttta
ggaa
aaaa
taag
gcag
aaac
caga
aaac
taac
acttc
acat
aaat
gtag
gtgc
agaa
tattg
gtac
gata
ttg
gata
actta
aattg
ctga
actg
tatg
aaac
ttaat
ttttc
taag
gcaa
aaat
tgat
atca
ttgga
agga
tcttg
gagt
cagt
ggca
ggca
aaca
aaat
ttaaa
cttg
gctg
cattt
tcttt
tttcg
tttca
aatg
aata
ctta
acta
ggtg
caat
ggct
ttattt
gtgg
tcttc
1957
3756107
...
NO
tgaa
ttcac
agtg
ttgta
tccc
atgt
aggg
aaaa
ataa
gact
aattt
tccc
ccca
cact
ctaa
caca
ctgt
aattt
tgca
cttg
gaac
tttgc
taat
ctac
tgtc
caa
cctc
cttc
tcaa
caaa
gtca
aatc
aaag
gaga
atta
acct
agaa
gagg
ggac
aata
aaat
aaga
ggtc
ctca
ggga
gagg
gaga
acca
agga
aata
gaa
tcta
ctta
aatc
tggc
caga
tagc
acct
cgtg
gatt
1958
2382147
CAPN2
YES
atga
cagg
ctgc
ccac
caag
gacg
ggga
gctg
ctct
ttgtg
cattc
agcc
gaag
ggag
cgag
ttct
1959
2590022
ZNF385
NO
agag
aaac
acat
cact
agat
tgac
gagg
gcttt
taga
gact
gctc
ctat
aattg
agga
atca
gcag
cata
ggct
ttatc
tccc
tact
tcB
1960
3831701
...
NO
gtat
tggc
gact
gcct
gtgc
atac
ac1961
2792492
NO
ctga
ctgt
gaat
taga
gtttg
cgct
attg
gaag
gtgg
aacc
cact
gtgc
tcat
tgta
ta
1962
3289301
NO
gcat
ttact
agtg
ctgt
gtgc
ctcg
ttgct
tttct
gggt
agtg
tttgg
tgat
gggc
atag
tgtg
gagt
ggtg
gtag
agta
caca
gtgt
gattc
tgtc
tggt
aagg
aag
ctgt
atgt
tgat
gttc
actg
atgt
gaat
aggt
tcat
acag
tccc
ttagg
cattt
ggct
ggag
gata
aaga
acag
ccat
tgac
cagc
agag
tact
tttttt
gaag
tttat
gcaa
gaca
gcgt
gcta
ggta
ctac
tgcc
agct
tga
1963
3686344
XPO6
YES
gtcc
tttct
ccag
cccg
acat
ccac
ctttt
taaa
caaa
atct
cttc
tact
tgga
gact
ctca
acac
caag
caga
1964
2596800
—NO
tgac
taaa
atttt
gggt
ctgg
gctg
ttata
tcac
tttgt
gtta
agtta
tttgt
agta
gcta
atgt
gcta
actta
tata
tgtg
ttcttt
ctgt
tttttg
gattt
tacc
ttttg
tttt
ggca
taac
ttatc
tata
tttttg
tttac
atta
tttta
aaaa
tatc
agta
actg
taat
aata
taaa
attg
aagt
tgta
ttgttg
acaa
tacc
tcta
gtttt
taag
gtca
tctg
tca
cata
attta
agtg
cacc
agttc
ctaa
tgta
taaa
g
1965
2614940
NO
ctgc
tcca
ttgga
gact
tcct
gagg
gcaa
caga
caaa
ctat
ttgat
agct
gaat
1966
3988991
NDUFA
YES
tcat
tttgg
gtat
cact
ggag
tctg
atgg
aaag
agat
aggc
gcat
ctct
ggag
ttgat
cgtta
ctat
gtgt
caaa
g
1967
3068834
NO
tcat
ataa
taga
cagt
gcgg
ttgca
ccttg
tgttc
atgt
actg
atag
ctgt
taag
ggtg
cagt
gatg
aacc
aaca
gagc
tc1968
3116499
___
NO
gcct
ctct
gttc
tcat
gggc
taca
tg1969
3227229
FNBPl
YES
ccaa
cctg
aacg
aaat
gaat
gatta
cgca
gggc
agca
tgaa
gtta
tctc
cgag
aaca
tggc
atca
caga
tcat
tgtg
gact
tggc
acgc
tatg
ttcag
gaa
1970
3777767
—NO
tgtg
tttgg
ggtg
atta
gagc
ttcttg
caag
ctta
gaag
tagg
aagc
taat
g
O1971
3950135
___
NO
cact
ggca
cagg
acga
ccct
gaag
g1972
3425446
—NO
gcat
gtct
tgct
acta
cttc
atag
atgt
cata
agac
ccgt
ttatg
gatta
gagt
tggt
gtag
aaag
ccat
tgcc
ttgag
ttgaa
aaat
gaat
actg
taaa
agct
caat
acaa
atcc
tcca
aagc
acat
gcac
tt
1973
2359042
NO
ccaa
gttc
tcat
cttc
agca
tcgc
agttt
cgttc
tcca
cc1974
2437704
GON4L
NO
ttcca
ttcta
ttact
ctta
ggttt
gtat
ctca
tgga
attc
acttt
aacc
tctg
cctc
gtgt
tattt
gcttt
agct
gtac
atta
tgct
ctta
1975
2778373
___
NO
ccca
aaac
atag
atcc
gaac
acca
gcttt
gccc
1976
2954503
___
NO
atttc
ctac
actg
ttgat
ggtg
gcaa
tgac
aaaa
tcaa
ttgca
gtg
1977
2441001
___
NO
ctgg
agaa
atgg
ctat
gggt
gact
g1978
3162378
___
NO
gaat
tagt
tcct
cagc
ccag
tgta
agac
tgga
ttaaa
aatc
gtgc
atgg
atgt
gcag
gg1979
3199012
MPDZ
YES
cacc
cagt
cagt
caga
gtca
gagc
caga
gaag
gctc
cattg
tgca
g1980
3449682
___
NO
aagg
gtgt
ggca
cattg
ctgg
tgttt
tcag
ggaa
gaaa
gcag
aacg
tctg
tggg
caag
cagt
gg1981
3687337
NO
ggag
atcc
ttgtg
tagc
ggca
gaaa
gttta
gcaa
cact
gtct
cctg
catta
gtgt
ggag
ggta
ggaa
atat
acct
aagg
aacg
gatg
gtct
agct
aagg
agc
tttcc
agac
agaa
tgttg
aagg
tgcc
acct
ggct
tcttg
ctgc
ttata
gtaa
aact
gaag
agga
gaga
gtaa
gtta
aaga
aaaa
aaac
actta
aatg
aaaa
agag
cctg
aaat
tgttg
ggttt
gaaa
attc
ccag
ccttt
ccag
ataa
caag
tgat
gtta
aaat
taac
aaat
ggct
tccg
tgca
aaga
ttaaa
taca
aggc
atta
aata
cagg
gaaa
tgtg
gtct
aaag
gtca
agct
gagg
atgt
gatg
aaaa
agtc
ctttg
ttaag
acct
caga
aata
ccaa
caag
atgc
ctca
gagg
ggac
ttcca
gtc
gaat
gaaa
taat
aatc
acgc
ttctg
agaa
ggtta
aagg
tgttg
tcttt
cagc
gtct
cagc
acaa
gccc
aaag
taga
gaag
ggct
tacc
ttgca
gaga
ttttc
1982
2333554
ST3GA
YES
ccta
ccct
tggc
agtg
tggc
agtg
aL3
1983
2570354
NO
gtgt
tact
tcca
ccat
ggtg
ccac
acac
ttaat
atcc
attc
c1984
2873144
NO
gggg
cttc
tcta
tgcc
cgga
aact
gcct
catc
aact
gaaa
caaa
tcct
gaat
catc
aaag
ggcc
aaac
ccac
tgcc
acct
ttaga
cgga
gcag
gaga
gtta
ccct
ggct
ctgt
cact
gcca
ctaa
gtct
a
1985
3409485
NO
tgaa
tagg
cagc
tagt
acat
gaac
ctgg
catta
gggg
tttgg
ggct
ggca
tact
taaa
catg
tggc
taga
tgtg
atca
ccttc
agag
taaa
tgta
aata
gtat
cat
ccct
cctta
tctg
tatc
1986
3476542
NO
gtgt
gttc
cagg
cgat
ggca
ggag
gga
1987
3620299
NO
atga
cagt
ttttc
tcta
ggct
aata
actc
aatg
agca
gaaa
ggaa
agtc
ctcc
ctttt
aaat
gtat
atga
atgt
atat
gtag
ttcac
actta
cact
ctga
caca
gat
ctca
ttgtg
tcca
atga
tgtg
tgtg
tgtg
tgtg
tgtg
tgtg
tgtc
tcca
agac
tgag
acac
atgt
acta
aaga
gtga
tact
ggcc
acat
ctac
tctg
ggct
tgga
ggg
taac
ttcag
caa
1988
3626913
NO
tgca
acct
cgac
ctgc
ttggc
ttaag
caa
1989
2699759
NO
ccat
catg
agct
gacc
aaca
atttc
taaa
ttcct
actc
agaa
acac
agca
aggg
gaat
gcag
tcaa
gcac
aaag
ggct
taaa
tgac
caga
taag
gctg
ggc
tgga
tgaa
taaa
gagg
aata
tcct
atga
atgt
gact
gtgt
agttt
taga
gtat
cccg
attc
aact
gtta
tattt
gact
tgtg
gcct
tgta
gaca
gtgt
gtta
1990
2737582
NO
tagc
caag
tgca
gagg
gtgt
ctcc
gggt
gtgt
gcgt
ggcc
caca
agag
ggca
agaa
gaga
gaag
gcgg
ggat
agga
ggga
agag
ggtg
gggg
tga
gagg
cagc
cctg
agag
ggcg
gagg
aagg
aaga
ggct
ggga
gctc
ggcg
cccg
cggc
gcag
ctgc
cgtc
gctg
ccgc
agct
gtcc
aggg
agga
tcg
ccaa
aacg
gcga
cgaa
taaa
caac
ttacc
ttgcg
gaga
agag
ctac
gact
gcga
tgc
1991
2964449
MDNl
YES
cccg
atgg
aata
tgca
ggct
ctgg
acat
gatta
gaaa
tttga
tgga
ctttg
accc
acaa
acgg
acca
gcct
ga1992
3695220
DYNCl
YES
ggag
ctgc
cttg
attta
caca
tcag
LI2
1993
2939888
LYRM4
YES
taaa
gaat
atct
cctc
cagg
tgtg
gt1994
3632627
—NO
atgg
agat
ctac
actg
acac
cctc
agtc
ctag
aggc
agag
aaga
ggttc
agat
ctgg
cctg
1995
3695228
DYNCl
YES
cacg
cgct
gcaa
cgtg
tgga
ttctg
gatg
gaga
cttg
tacc
acaa
aggc
ctgc
tgaa
atttg
cagt
ttctg
ctga
atcc
ttgcc
agag
accc
tcgt
cattt
ttgt
LI2
tgca
g
1996
3854016
—NO
taga
tctc
gtcg
ctgg
tatg
cttg
gttc
cgct
cctc
gtttc
ttagt
tgct
tattc
tacc
cgtta
ttcac
tgat
gtta
1997
3980369
FAM15
NO
tcca
tcac
agac
agca
gagc
cggg
cagc
tttct
tatg
ccat
tttct
acac
tgtg
cttc
atga
gtag
gact
ttcttg
cact
agttc
ctat
gact
gagt
ctcc
aaac
t5B
ggttt
ccta
gtag
tccc
ccat
ccct
tcct
ccct
tacc
cagc
tatg
attc
agttg
tctc
tgcc
ctcc
ctct
tacc
ctgc
ctct
gtgt
ttcgg
tgag
agtc
1998
3799919
—NO
tgta
ttaca
caga
gact
accg
caca
aaac
acgg
accc
aatg
ccag
gcaa
gcac
gcgc
ccag
cctg
aacc
tcca
agaa
ctgt
ctt
1999
3032425
—NO
caca
cctg
taag
ctga
ggtg
aagg
agtt
2000
2475245
PPPlCB
YES
aaag
aaag
ctaa
atac
cagt
atgg
tgga
ctga
attc
tgga
cgtc
ctgt
cact
ccac
ctcg
aaca
gcta
atcc
gccg
aaga
aaa
2001
3487
169
DG
KH
YE
Stc
tggg
gata
ccga
aagt
gggt
catg
tgaa
gcga
attc
tcca
g20
0239
5515
5—
NO
aatg
acat
ccct
tttga
gctg
tgga
tggt
g20
0328
2716
0PH
AX
YE
Sca
cggc
aact
gcat
gtgc
acca
gtat
caca
ttatc
gagc
tgttg
aaag
tgtg
gattc
aagt
gaag
aaag
tttttc
tgat
tcag
atga
tgat
agct
gtct
ttgga
aac
gcaa
acga
caga
aatg
tttta
accc
tcct
ccca
aacc
agag
ccttt
tcag
tttgg
ccag
agca
gtca
gaaa
ccac
ctgt
tgct
ggag
gaaa
gaag
atta
aca
acat
atgg
ggtg
ctgt
gctg
cagg
aaca
gaat
caag
atgc
agtg
gcca
ctga
acttg
gtat
cttg
ggaa
tgga
gggc
acta
ttgac
agaa
gcag
acaa
tcc
gaga
ccta
caat
ta
2004
3186
892
—N
Oat
agat
aacg
tttgt
gtgg
cact
cccc
aaga
ctca
ggca
ttgtg
ataa
ctaa
cact
ttcca
tgaa
taat
gtca
taga
attc
tcac
aaac
atcc
tgtg
agat
agac
acta
ttattt
tcaa
cccc
atgt
acat
atga
gaat
actg
agag
ttgga
gaga
ctga
gtaa
catg
ccct
agat
gact
cagc
tagc
aagg
aatg
aaaa
gcac
attc
taa
ctcc
accc
acat
atat
agttc
cagg
atcc
catc
tctc
tacc
tcac
atta
atta
gttc
tata
ccag
cttta
tgttt
ccta
ctag
aatg
tgca
tgttt
taaa
ggct
gag
ctttc
2005
3195536
—NO
tggc
cttc
tgga
cagt
aggt
aggc
atgt
gatc
actg
ttgtc
acta
aacc
tggg
aaat
gattc
ctgg
gtca
gggt
tcat
taat
tgc
2006
3238305
—NO
gtat
tgaa
attc
ctcg
agcc
gctg
ctttt
ctca
ctcc
ataa
ttctg
gcca
gaat
ttggt
actta
aaat
atttt
gtct
aaaa
tatta
caat
agct
actta
agtc
atct
ccc
tgac
tcca
ctct
gttg
tcttt
cagg
gcgt
cgtc
caca
ctgt
agcc
aaag
tgat
ctta
taaa
aaca
taat
tcta
atca
tggc
actc
ttctg
ctta
aaaa
tgttt
taat
ggc
tttcc
gtta
ggtta
aaat
ttaaa
agtc
ctttg
tagc
ctgt
gaga
2007
3744218
VAMP2
NO
acttg
ctgg
aaaa
cggg
gatg
cttg
cccc
tctc
cagg
acta
ttgag
ccca
gaga
gagc
tgtc
ctct
cattg
ggtg
aact
gattg
agga
aggg
tcta
ttgtc
ttttta
aatg
gcac
aattt
taag
ggttt
gagg
gtac
agtc
cctta
acct
gcca
cggg
aggg
ggcc
ccca
aact
ttcttc
cccc
caca
cttc
tggt
tttct
gtgt
ggag
gggg
agca
ggga
tatc
taag
ctgt
ggtg
tgaa
aggg
tagg
agag
atgc
tgga
ggtg
gggg
tgct
gtgt
tttag
accc
ccca
tatta
tccc
agtg
tccc
ctg
cccc
cctc
ttccc
ccac
ccca
tgcc
ccca
attc
tgtg
gcgc
atcc
agat
tgtg
aaaa
tgta
2008
3810033
—NO
gtga
atat
atct
aggt
ccag
cagg
gtct
caga
agca
gatg
ccaa
cctg
aggg
gaat
attta
ccag
atga
2009
3943786
—NO
cagc
agct
gcta
tcca
tcgt
catc
acca
gtgt
catc
atcg
ccat
cccc
atca
tcac
cacc
aaca
ccgc
cacc
cccc
cagc
gaca
cact
agct
gtgg
acac
atc
cttta
cgcc
cttg
aacc
tgag
tttct
acag
ctat
gaag
caag
tccc
atgg
aattt
acag
acca
aacg
ctaa
gtgt
aggg
ctcc
ctgg
gagc
tggt
ctta
tgg
agca
cact
ggca
2010
2321472
—NO
cagt
ccat
gagg
tcca
cttg
tcta
aata
tgtc
acttg
aagt
gcaa
ggta
ccaa
ttgct
cagt
ggct
caaa
atga
ttttc
tggt
tctg
ttgtc
atttc
aaga
gcttc
tcttg
aggg
ttgag
agag
tctg
ttttc
ctaa
gaat
ctgg
ttctc
tcca
tcag
tctc
tgttt
ccac
ctta
tcttc
ctgg
gaag
gtgt
gcttt
ctttg
aggt
gaga
tgtg
aagc
ctgc
ccac
gtgc
agttt
tatg
tcga
attc
caag
tcag
atct
taac
ttggt
actc
ccgg
agcc
tgttg
gaag
tctta
2011
2361272
—NO
gtgc
aaca
tcac
tttga
cttg
atta
ttcttg
ggtc
tgttt
tattt
cccg
ctttt
atttt
gcttt
tgaa
atct
ttttc
cttg
gtgg
atttg
tacg
tgtc
ttcac
taga
tgcc
tca
aatta
agtc
tgac
caca
atcc
tact
ctac
tt
2012
2447714
—NO
tgcc
aact
tctc
ttcac
caca
caat
ctgc
agaa
gc
2013
3175
671
VPS
13A
YE
Sta
tgat
gatg
ccta
taga
tttgg
ggga
aaag
acaa
tata
tttag
tttca
ttcttt
gaag
gttta
caac
gcat
tattt
tattc
actg
aaga
tcca
aggg
tattt
aaag
taac
atat
gaaa
gtga
gaaa
gcag
agtta
gcag
agca
agaa
attg
cagt
ggca
ttaca
agat
gttg
gaat
ttctc
ttgtc
aaca
atta
cacg
aagc
aaga
agta
gcct
atat
a
2014
3517776
—NO
tata
agga
cttc
acag
atag
ctgc
cgtg
gatg
ctg
2015
3626941
___
NO
taat
ttcac
aatg
ggtg
tggg
aggt
tggt
gagt
agat
ggac
taga
ttcaa
aagt
agat
acct
cgct
ctca
ccta
tagc
ctag
cagt
tt2016
2467805
TTC
15
NO
ctcc
aaca
cact
acgt
caga
agga
cccg
g2017
3701421
___
NO
ttgct
ttgtc
ttgtg
atcg
cagc
tcac
ggca
gcct
tgga
ctcg
tggg
cacg
ag2018
3717066
NFl
YES
caga
aagt
gctg
caat
tgcc
tgtg
tca
2019
2476534
___
NO
agat
ggta
caaa
ggtc
gtga
tagg
gctg
2020
2750682
___
NO
ccaa
gatta
ggcc
actg
ccct
gtaa
c2021
2412205
—NO
gggt
taag
ctgg
gaac
catg
tctc
ctgg
ggca
aaat
tttaa
ttcttc
tagc
cact
tgtta
tgtg
atct
caag
caag
gtac
atac
gtct
ctgg
gtct
tttttt
cttta
tct
aaaa
tatg
aaga
gatta
agtg
ttttc
ttaat
tttct
ttcag
ctct
gata
ttatg
tgat
tctg
tgat
taaa
tgac
ctgg
atgt
tgat
aata
catg
cc
2022
3019902
—NO
ctttg
gagt
tttcg
ccca
gatc
agaa
tttta
ttgcc
ttttg
ccgt
agac
tgct
gcaa
gattt
ccag
agca
aaga
gaag
gtat
agtg
ttcct
taat
gtga
gtta
tgaa
gatc
aaaa
taca
gcaa
agag
aaga
tttca
gaga
tttaa
aagt
gata
atttt
tact
gaaa
tttgg
aaat
atgt
ctcc
atat
tttaa
cttta
tttta
taaa
aaag
aatta
ct
Oct
gtttt
ttgaa
atgg
ccaa
gcat
taag
taat
atat
gacc
gtga
atttc
ttgag
ggtg
ggaa
atgt
gact
ggat
tttat
atttc
gattg
tatc
acat
gcag
cctc
2023
3750928
KIAAOl
YES
acac
tatg
tggc
attg
tgct
ttgga
gaag
tgcg
tatc
agaa
cgga
ccta
caga
aagt
ttctg
acct
gtct
gccc
cattc
00
2024
2511032
___
NO
atga
ctga
gtct
tata
gcgg
tgttg
ggca
tagc
a2025
2568632
___
NO
atcc
tttgt
cccg
attg
ggtc
ccag
atac
acaa
aa2026
3039827
___
NO
gagt
ccttc
ctgg
atag
cagc
acca
aag
2027
2437284
Clorf2
YES
cagc
tgcc
accg
gttg
gagg
gctg
g2028
3076362
NO
gcat
acaa
gtct
gccc
agtc
ccag
gaag
aaag
agga
gaga
ccct
gaat
tctg
acct
tttga
tggt
cagg
catg
atgg
aaag
aaac
tgct
gcta
cagc
ttgg
gaga
tttgc
tatg
gaaa
gtct
gcca
gtca
acttt
gccc
ttcta
acca
ccag
atca
atttg
tggc
tgat
catc
tgat
gggg
cagt
ttcaa
tcac
caag
catc
gttc
tct
ttcct
gttc
tgga
atttt
gtttt
ggag
ctct
ttccc
ctag
tgac
cacc
agtta
gtttc
tgag
ggat
ggaa
caaa
aatg
cagc
ttgcc
ctttc
tatg
tggt
gcgt
gtt
cagg
ccttg
acag
atttt
atca
aaag
gaaa
ctat
tttat
ttaaa
tgga
ggct
gagt
ggtg
agta
gatg
tgtc
ttggt
atgg
agga
aaag
ggca
tgct
gcat
cttc
ttcct
gacc
tccg
gggt
ctct
ggcc
ttttg
tttcc
ttgct
cact
gagg
ggtc
tgtc
taac
caag
cagg
cta
2029
3527927
FLJ1035
NO
tgag
tcag
catc
ccca
atttc
tacc
acat
ccag
cact
ccta
ccat
cttg
catg
tacc
ttcct
tcct
gtcc
ccgg
ctac
ttggg
agtc
agct
tcttc
cctc
tccc
aac
7tc
atct
ccct
cttc
t
2030
3889598
...
NO
cagg
tctta
gcgg
tcat
cagg
atga
aggc
ggtg
ggaa
gtgg
agca
acag
agat
ccaa
atat
ggaa
agag
cacg
tgct
gata
ccag
aggc
actg
gaaa
taat
agca
atat
ccga
aagt
ctgc
tcac
cctg
cgtc
attc
cttc
agac
agta
gtcg
ggct
ctat
tc
2031
3735624
MFSDl
NO
ttaaa
atct
gcaa
tact
ggcc
aata
ttcttt
tatc
aaac
agga
gacc
gcag
cttta
aagg
ggga
aaat
gcag
acgt
tgga
taaa
aaca
gcaa
gaaa
tagt
cat
1ttt
catta
atag
gtct
caaa
cagt
ttacg
aaac
agcc
atta
ttatc
2032
2739508
___
NO
ggca
tgtg
tggg
taat
tagt
tctc
acta
tggt
taga
ctcc
tgaa
gtga
gtgg
tcca
agttg
ttagc
gtgc
ttctg
ctct
gtga
ggcc
aggc
cagt
atcc
tgtc
tg2033
3819517
RABIl
YYEE
Sga
catc
gcca
agca
cctg
acct
atga
gaac
gtgg
agcg
ctgg
ctga
agga
gctg
cggg
acca
cgca
gaca
gcaa
catc
B2034
3330082
PTPRJ
NO
gggg
tcag
ctat
gcag
ccca
tcac
gtgt
gtttt
tcat
ctgg
gatg
aaaa
agcc
tggt
tctc
ttttg
aaat
gcttg
attg
tact
tattg
agct
aaac
aagt
cttg
gtg
actg
ttgttg
atttg
cctc
aaaa
gtttt
aagt
cctg
ggttt
tcag
acta
ctgt
gta
2035
3378442
SPTBN
NO
ggcc
acat
tctc
ctaa
tagc
atga
aaca
gtca
gctc
acttt
ctgc
ctcc
tcct
ctta
cact
tccc
tgct
gtcc
actg
cggc
caac
tcag
caca
gtgt
ccttg
aag
2ct
gattg
aggg
tc
2036
2473789
—NO
gcgc
gggc
tttcg
agca
cata
caaa
cctg
atta
caaa
agtc
agat
ttcttt
atttc
gtct
tggg
cacg
tcat
tttaa
cccc
gctc
agcc
tctc
ttttg
tgta
tgaa
act
gaaa
gtaa
tagc
ttctg
ctgt
gcaa
aatg
atta
gagg
aattt
gtga
agcg
cttg
gcaa
cata
ctag
gacc
c
2037
3115627
—NO
ttgag
attta
tgtg
tgtg
ccag
acac
acac
acac
acac
atat
gtac
acaa
gcaa
tttttg
tgtg
tgtg
gctc
aagc
cagg
caga
gaaa
gaca
aata
ctaa
atg
atct
cact
cata
tgtg
aaat
ctaa
aaaa
gttg
atct
caaa
gaag
taga
aagg
agcc
tggt
ggtta
tcag
agtc
tggg
gagg
gggt
tggg
gaga
tattg
gcca
aagg
atat
acaa
tttca
gtta
ggta
ggag
gaat
aagt
tcaa
gaag
tcga
ttttg
acta
tagt
taat
aatg
tgttg
tattc
atga
acaa
ggct
aaga
gagt
ggat
gt
IN)
aaag
tgttc
ttacc
acaa
aaag
gata
acta
catg
aggt
aatg
caca
tgtta
atta
gcta
gattt
agta
attc
cata
atgt
atcc
actg
°2038
2745189
___
NO
tcca
tttct
tttta
ctgc
gtcc
gtcc
ctga
caat
acct
gctg
tttac
tcag
ccct
taac
gttg
2039
3450260
___
NO
aaag
tgat
acat
ttgtg
aggg
agga
gg2040
2656876
—NO
caga
cctta
tctc
atag
ggtg
atta
tgag
catta
gatg
aaat
cgtg
ggtg
aaag
gctc
agta
cagt
gcca
ggta
taca
agaa
gcat
tcag
ttaag
taat
agct
gtta
ttgtta
taac
tgag
gtttt
caca
agcc
gcac
act
2041
2952697
NO
gcta
gtttc
aaag
actg
cgac
cctg
ttttg
aattg
gcttg
tttat
gttg
cgac
ggtg
gctg
gaag
ccag
gcca
accc
aggg
tcag
ctttc
ccaa
atca
gtcg
ccaa
ggcc
cacg
agaa
cagt
agttt
caga
actc
ccag
aaat
aggt
tcaa
agat
gcct
gtcc
tgta
ccag
tcct
gccc
agcc
tcgt
ttttc
ctta
ccac
cctg
actg
cagt
ctta
actc
ctgc
aaac
taat
ttgat
cgct
ctaa
gttta
accg
cgcc
caga
atttc
acttt
tgct
tgct
tggc
ctttg
ctgt
gggt
caca
acag
ccct
tcttg
c
2042
3579
549
WA
RS
YE
Scc
tggc
ctct
gtaa
gcct
gtgt
atgt
tatc
aata
ctgt
ttcttc
ctgt
gagt
tcca
ttattt
ctat
ctct
tatg
ggca
aagc
attg
tggg
taat
tggt
gctg
gcta
acat
tgca
tggt
cgga
taga
gaag
tcca
gctg
tgag
tctc
tccc
caaa
gcag
cccc
acag
tgga
gcct
ttggc
tgga
agtc
catg
ggcc
accc
tgttc
ttgtc
cat
ggag
gact
ccga
gggt
tcca
agta
tact
ctta
agac
ccac
tctg
tttaa
aaat
atat
attc
tatg
tatg
cgta
tatg
gaat
tgaa
atgt
catta
ttgta
acct
agaa
agtg
ctttg
aaat
attg
atgt
gggg
aggt
ttattg
agca
caag
atgt
atttc
agcc
catg
cccc
ctcc
caaa
aaga
aattg
ataa
gtaa
aagc
ttcgt
tata
catt
tgac
taag
aaat
cacc
cagc
tttaa
agct
gcttt
taac
aatg
aaga
ttgaa
caga
gttc
agca
atttt
gatta
aatta
agac
ttggg
ggtg
aaac
tttcc
agttt
actg
aact
ccag
acca
tgca
tgta
gtcc
actc
caga
aatc
atgc
tcgc
ttccc
ttggc
acac
cagt
gttc
tcct
gcca
aatg
accc
taga
ccct
ctgt
cctg
caga
gtca
gggt
ggct
tttcc
cctg
actg
tgtc
cgat
gcca
agga
gtcc
tggc
ctcc
gcag
atgc
ttcat
tttga
ccct
tggc
tgca
gtgg
aagt
cagc
acag
agca
gtgc
cctg
gctg
tgtc
cctg
gacg
ggtg
gact
tagc
tagg
gaga
aagt
cgag
gcag
cagc
cctc
gagg
ccct
caca
gatg
tcta
ggca
ggcc
tca
2043
3703
035
—N
Ocg
atgg
acac
tgat
gccc
acac
aggt
g20
4424
4456
2R
C3H
1N
Otc
caga
tgga
tcgt
tggc
ctaa
atttt
caca
cttc
tccc
ctgt
tcat
ccttt
ttcct
cttc
cctg
cttc
ctgg
gaat
aaaa
ggaa
ctttt
ttaaa
aaaa
ttaat
tagt
ccac
aggt
ctca
ttatc
tttct
ttatg
atta
atct
atga
ctttt
tggt
acaa
gaac
aatg
gaaa
aagt
gaat
taag
gtaa
tgaa
caaa
acct
ttcac
ccac
ttaaa
cattt
tcca
gtttt
gaga
ttcct
cttc
gtgt
ttgtg
gtgt
cttc
ccct
tgtta
cccc
ttctg
ccct
ttttc
tctg
acta
tggt
aattt
ggtc
tttag
gctc
atat
cagt
ctcc
ccga
gaca
ttctg
cagt
catta
tcac
ctttt
tggg
tgga
tttta
ttttg
tttta
ttttg
tttttt
ttaaa
aaaa
taac
ttttta
acat
tggt
gcat
atttg
cttg
ggat
agag
cttg
tgta
attta
ccaa
tcgt
attg
attg
taag
tgat
tgtg
ccct
gcag
aggt
atat
ttaac
aaga
caaa
aata
atct
tggt
taat
aaag
gagc
ccat
gaga
tttga
gtca
ggt
tgta
agtg
aaat
cact
taca
ctttt
ggat
agaa
tttat
actc
ctgc
tctta
taaa
tcag
tggt
agac
ttacc
atttt
ttaaa
gtttt
cttg
cattt
ttttg
tttttt
tattg
cca
cagc
tccc
tattc
tttct
tgcc
tgcc
tcca
cccc
cctg
ttcag
gaaa
aaaa
aaaa
ttgag
cctta
aagt
gaca
gctg
atttt
ttaat
tgct
gaat
tttgt
gaaa
tttta
ctttt
tcca
agtg
tttcc
aact
ttaaa
aaga
gaag
tgaa
gaca
aata
ggttg
gaat
ggtg
aaga
caaa
tgga
ttgga
atttc
acag
gctg
tgaa
taat
tcct
tagg
atct
ggca
aacc
gtga
agtc
ttattt
gaag
acct
tatc
tcct
gaga
gttc
ttttg
gagt
agga
aaaa
gaac
ccta
tttga
aata
gacc
gtttt
tctc
ttgttt
ttaa
tctg
tttaa
tattt
ctga
ttttta
agca
gcttt
2045
3816284
DOTlL
YES
ccag
gtcg
tgct
ccag
gttg
ctgc
tgcc
acca
actg
caaa
catc
acta
tggc
gtcg
agaa
agca
gaca
tccc
ggcc
aagt
atgc
gga
2046
3231694
—NO
atcc
acca
agca
gtgc
gtcc
aggt
g2047
3760111
—NO
gaat
gaac
gcag
agag
tgtc
agtg
ctga
ca2048
3960937
—NO
cagg
caca
ccgc
cttg
aggt
gggc
agtg
ccc
2049
2519659
COL3A
NO
gaaa
gattc
attg
gcat
gcca
cagg
ggat
tctc
ctcc
1
2050
3380965
NO
tcta
accc
ttgag
acac
tgcc
aaca
tccc
tcct
agta
caag
acttt
tcttg
gtgc
ttgct
gtttt
taga
ggag
ttttg
gccc
ttcag
tgttt
gtaa
agct
ctcg
gatc
ctca
gaaa
aaaa
ttcttt
agtg
atta
atttt
gaaa
ttgga
gctg
tttga
ttattt
tgat
taag
aagc
tgat
tttcg
gcct
ggca
cg
2051
3690096
DNAJA
YES
agat
ggga
atga
tttgc
acat
gaca
tata
aaat
agga
cttg
ttgaa
gctc
tatg
tgga
tttca
gttc
acat
ttaag
cacc
ttgat
ggac
gtca
gattg
tggt
gaaa
2ta
ccc
2052
3777917
—YES
agga
tcct
gccg
aaag
gtgc
tcag
tggc
ct2053
3811929
—NO
atgt
atca
ttctc
cata
cacc
tgtg
cctt
2054
2436400
NO
ccag
ccac
gcgt
ccag
cagg
tcag
ggat
t2055
3868616
SHANK
YES
taca
tcat
taag
gaga
agac
agtc
ttgct
gcag
aaga
agga
cagt
gagg
ggttt
gg1
2056
2833315
NO
atct
tttga
ttgct
agcc
cagc
tgct
ttcct
tgtta
tcac
tttgt
ggag
cagg
ctgg
acat
tgac
aatg
agttc
tgag
actg
agtg
gaat
ggga
gacc
cctc
cca
gctg
gtgg
tcga
gctg
ccta
gagc
atgg
tcca
tctg
tttgg
gact
aggg
ttgag
ctcc
2057
2886091
NO
tgac
gaat
ttcgt
ctttc
ttccc
tacg
gtgg
agtc
agtg
ttaaa
cata
caac
aagg
agttc
tggg
gcac
aggc
2058
3146585
RNF19
YES
atgt
ccac
gatg
tgct
gctta
tata
ataa
agat
gaat
gatg
ggag
ctgc
aatc
acat
gaca
tgtg
cA
2059
3271059
—NO
gtga
gata
cttg
acat
gggg
acct
ttac
2060
3589673
___
NO
ttcaa
gaag
ttctg
ggga
caga
ggg
2061
3971336
MBTPS
YES
tcct
tgga
aaaa
cgct
gatg
caga
ctttg
gcac
aaat
gatg
gctg
actc
tccc
tcttc
ttattc
ttcct
cctc
ttcttc
ctct
tcct
cctc
ttcttc
ctct
tcct
cttc
ttc2
atct
tcttc
ctct
tcct
cgct
tcac
aatg
aaca
ggtg
ttaca
agttg
tg
2062
3304025
—NO
gtat
gtga
taag
tattt
ggat
ttgag
agta
gacc
tttca
tagt
gagg
ctta
tcag
gttg
aggg
gagg
tgat
gact
ggaa
atat
attta
tttat
tttaa
atcc
cact
ttggc
tttag
taaa
atct
gaat
atgt
gtgc
atttc
tagt
ttctta
cacc
aaat
gtag
gttc
cgta
agttc
atgc
aagt
acag
tggg
tc
2063
3350758
PAFAH
YES
agac
aaag
agcc
tgat
gtac
tgttc
gtgg
gaga
ctcc
atgg
tgca
gtta
atgc
agca
at1B2
2064
3921733
—NO
cagt
gtcg
ctac
aggg
gatc
tctc
aggc
tcac
aacg
ggcc
actc
ctct
aggg
aagt
tctg
gtct
catc
atga
tcct
tgttt
ggtc
tcac
tccc
catg
tcct
tctc
tgt
ccct
cctc
caac
tgcc
attta
tttat
ttaac
tgaa
aaag
tacc
aatc
accc
acat
aggc
atga
2065
2361637
NO
acag
gagc
caat
gccc
aatc
atag
c2066
2625412
—NO
acca
cctg
ggta
ttgaa
gccc
tata
aaaa
cact
ttcta
ctta
cctta
atat
agtc
tggc
gttg
ctgg
gacc
tcac
agtg
ttcttt
caca
ctttt
tatg
taac
ttagg
aac
tgaa
atac
cact
catg
cattt
gttc
taaa
g
2067
2737704
NO
gattg
ctat
ggaa
gtac
tgag
tccc
tgaa
ggag
agac
aagg
tgca
ggaa
gccc
caat
cgcc
aggc
catc
gatg
aaat
tgtg
gagg
gcat
cgca
gagc
gtta
tcat
ccag
gcaa
tcgt
ccct
att
2068
2795252
___
NO
gttc
ggat
gattt
cacg
ggag
cagc
2069
3203541
AQP7
YES
gctg
tgac
ctttg
ctaa
ctgt
gcgc
tggg
ccgc
g2070
3510917
—NO
aaga
ccag
ggta
atca
taga
actg
ccaa
gcag
aaag
aaat
gtag
aatg
tagc
agtg
gtgt
ggag
accc
gtgg
ctga
cagg
tggt
ggtc
ttcat
atag
gct
gagg
gaca
ggct
agat
gcat
gtca
agtg
gc
2071
3670551
—NO
gagc
tcca
ggga
tatc
tcttc
tgca
cattc
ttcca
gagc
agtg
catc
tctc
ttctg
gttg
tttgt
acac
ccttc
tttta
ctct
cctta
ttatc
ctct
ccca
tgttc
attc
agt
acat
actta
gtga
gtgt
tcac
caga
tgct
tagc
acct
gggg
tata
c
2072
3729159
DHX40
NO
gcct
atgg
cagg
attc
tttct
tgaa
ttaat
atta
atcc
ttaaa
ttgat
ttttc
tggg
atta
taca
aattc
ctttt
tata
taaa
agta
tattg
tttaa
aaca
gtag
ctat
agc
catta
acca
aagg
acag
atga
tata
tata
tata
tgat
atat
atat
atat
ataa
gttc
tttttt
agct
gtac
ctac
gtac
ttata
tcag
cacc
atgt
atgt
aggt
gtga
tagt
acttt
caaa
cagc
gcct
ccac
ctgg
ccta
ctct
gtta
tttcc
acct
gtttg
ggta
gggc
cattt
aact
tcca
ttatg
ccaa
acttg
ggat
ggga
ttttc
gaag
caga
caac
acta
tttca
tcgt
gtttc
aaat
tgga
acct
tgag
gcta
gtta
gtat
caca
ct
2073
2969
482
CD
C2L
YE
Stg
tattt
gccg
gctg
ccag
attc
cata
cccc
aaac
gaga
attc
ctta
atga
agat
620
7430
4011
6SN
X13
YE
Stg
gcaa
cctta
tttta
ctac
acgc
attg
taga
tgac
tttgg
caca
cact
tacg
agta
ttcag
aaag
gctc
aaca
2075
3819
122
LO
ClO
OY
ES
gagg
tggc
taga
aaac
ggca
agaa
c12
9391
2076
2359763
—N
Oac
ctgc
gtca
cacc
ctgg
ctag
tgac
ccag
tcag
acgt
gctg
ggga
ccct
ggct
gacc
acct
gtac
cagg
ggca
agga
agga
agga
gagg
ggag
catt
gtga
agca
gggc
agga
tgac
tcttg
aagg
tgga
gaca
ggcc
ctgg
acag
ctct
ggga
ccct
taat
ggtg
gcag
c
2077
2969302
WASFl
YE
Sgg
tttga
taga
aaat
cgcc
ctca
gtca
ccag
ctac
aggc
agaa
cacc
tgtg
tttgt
2078
2330410
Clorfll
NO
gaag
ctgg
tccc
acgg
aaag
tggt
atct
cggg
aatc
ag
3
2079
3265
461
FAM
16N
Otc
tagg
tggg
gaac
tgac
tgat
aacc
cttg
gcag
caat
caaa
gtgc
cagt
ggct
cctc
gatg
tttac
atttt
tttct
atttt
gttc
agtc
ttttg
tttta
aatg
attc
taOBl
aaga
gatta
aaga
aaac
agag
tttta
aatg
tcct
attta
catg
ttaaa
ggat
ttggg
gaaa
ttggg
tatg
tatg
tgaa
tggg
tgta
catg
tagg
aacc
tgta
gttc
agca
aagc
tgcg
ctgg
gcac
agca
tgct
tgta
cttg
attg
acaa
aatc
gtgt
ttgcc
agtc
cact
ttcta
tttttc
cttta
agtg
atgc
tgat
cact
caaa
taat
gct
tttaa
gcta
ttgttt
gtttt
tattt
gaca
tgtta
agcc
gcag
cact
ttcttc
ttcat
ctttc
cattt
actg
atat
ttggg
ggaa
cagg
ctat
caaa
ggttc
cggc
ttgaa
ggg
aact
gtca
ctac
2080
3735
093
LO
C64
3N
Ogc
cacc
gact
tcgt
gcag
gaga
tgcg
c
008
2081
2402
500
PAFA
HYYE
ESS
ggca
acat
tgac
atga
gccg
tgtg
gctg
tgat
ggga
cattc
atttg
gagg
ggcc
acag
ctat
t2
2082
3552
152
ME
G3
YE
Stc
ttcaa
ccca
ctgc
ttcct
gact
cgct
ctac
tccg
tgga
agca
cgct
caca
2083
2736
402
NO
atta
taag
gtat
gata
tggc
ttccc
aattg
tcaa
agaa
ttaaa
atgc
tcat
tagg
agag
gagt
gact
gcttc
tccc
tttag
taga
aaaa
caat
tcaa
actc
aggt
tac
caat
taca
atac
acaa
acta
aaac
agtta
tgttg
agcc
ggga
aacc
gtct
tcttg
tagg
ctat
cttc
tagc
acca
aata
atgc
ccac
atga
tttct
ggct
gaa
ggcc
a
2084
2751
960
NO
gcttc
cttc
gtga
attc
taac
cgta
ccgt
tact
gtgt
ttgag
gatc
actg
gcag
ccga
gctc
tgag
tggg
gaac
tgag
ataa
gaaa
ggtg
ttgtc
atga
taac
a
cctg
gaca
gctg
catc
ccac
agac
tatg
ggag
tgcc
agga
aact
ggct
acct
caca
gcag
agta
gaga
aaat
ggaa
ataa
acac
aaat
ataa
gcttt
tcag
tgag
ctgt
gggt
caga
gaag
gcca
caag
tggt
ctgg
caga
tcat
ggtg
cagc
tcttt
gtttc
atga
gcta
cgtg
cagc
agta
gtttt
tatg
agct
ctttt
tgta
gct
ttcca
gccc
tagg
attc
agcc
atgg
taa
2085
3024231
—N
Otg
ccca
gaaa
cgac
cattt
tgga
caat
ctca
tcca
gtttt
acgc
ttgttt
tattc
agaa
agag
aatc
acct
atcc
tcttc
acat
c
2086
2928814
—N
Ota
agga
aattg
ggttg
tcac
cact
ggct
ttgag
taat
tcgt
tttgt
ctat
gcaa
gctg
tcttc
atac
gaaa
aagg
ttttc
atac
tata
tcca
gtcc
c
2087
3166955
NFXl
YE
Sgt
tgaa
gtcg
aaac
atcc
cact
ggac
at
2088
3829651
KIAA03
YE
Scc
tgac
tgtg
gtgc
aagt
ccat
ttcca
gtttt
tgac
tcat
gcgt
taca
gaag
gtcc
agcc
ggtg
gctc
actc
ttgct
ttgct
gagg
tcat
cgtg
cc
55
2089
3709186
JMJD3
YE
Sgt
cagg
cagc
tgta
agcg
gcga
caga
ag
2090
2947628
—N
Oct
gttg
acttc
agag
tcga
gtgt
gag
2091
3820638
SLC44A
NO
ccca
gtga
tggc
cacg
tcttg
tttct
agaa
tccc
ttcct
ttgca
caag
ccat
attc
tgaa
ctct
taat
tcct
tccc
ttaac
ttcaa
tccc
tatg
tctc
ctgt
ccta
cctc
2at
ggttt
tcct
ggcc
ccac
acct
gatg
ctca
gagc
ccac
tgtg
aacc
cctg
gtgc
ctct
ttttg
gact
cggt
tc
2092
3823
661
APl
Ml
NO
cagc
ctca
gact
ttttc
ccac
tgag
ggtc
caga
gagc
gggg
ccac
gtgt
cacc
cacg
tctg
cgct
tggt
cacc
cgtc
ctcc
ccac
cctg
tgtg
tgttt
atgt
cat
agtta
catta
a
2093
4038647
—NO
aggt
ctcc
gcttt
cctg
gaaa
tcca
gacc
c2094
2543897
—_
NO
agct
ctgg
gtga
gact
tgtc
ttccg
gggg
ctg
2095
2353902
—NO
ctct
gatta
ggtc
ttgtg
ccca
ttgttg
aacc
aaat
gctg
tgct
tagg
agga
tggg
acat
gttc
tttttg
ttttg
ctta
gtta
gggc
ctgg
ctct
ggag
ccag
gaga
gggg
agga
ggaa
atgt
tgct
ggat
gtaa
gtta
gcac
tgat
acta
atta
ttcta
acaa
tgat
taat
agtta
gaat
ggct
gggt
gcag
tggc
acat
agtc
ccag
ctcc
gcag
gagg
ttgag
atgg
gagg
atcg
cctg
gggc
cagg
agtc
caag
gcta
tggg
gcac
catg
atca
tga
2096
3190953
PPP2R4
YES
cact
ttgtg
gatg
agaa
ggcc
gtga
atga
gaac
caca
agga
ct2097
3922999
PKNOX
YES
gcgc
ccgg
ccga
gaat
gaca
tctta
a1
2098
3701615
NO
atgg
acag
cttc
aaag
ggca
gtttg
gggg
tttct
gttta
ttttc
ctgt
atgt
tctg
gtca
cctc
aatc
atct
aacc
agaa
gaac
tggc
ccaa
aatta
ggca
caag
aaaa
aagg
aaaa
agtc
tagc
cctg
ccac
ataa
ctcc
tgga
actg
gttg
caga
aact
gaac
2099
2528729
NO
ggga
agac
acga
aggg
aggg
cgct
g2100
3188305
.._
NO
ggat
agct
tcta
ttgtg
gacg
tgga
atcc
2101
3578798
...
NO
agat
ctac
tcct
ggat
gcgg
gtac
agga
cagc
atcc
ccag
actg
tttgt
aaag
tctc
atgt
gata
catg
gagg
gcag
gcct
2102
3810041
—NO
agca
gaag
caaa
cggg
ggtg
tctc
agga
gggg
gccc
cgttt
ccttc
ttgta
aagc
tgct
ccca
ccca
cagc
caca
ctcc
attc
agtc
aggt
ttatg
tgct
gtt
cttc
cttg
tccc
ttatg
aggt
agaa
ccat
cacc
ttgcg
tgca
aagc
agct
ta
2103
2520084
MGC13
NO
ggtg
tctg
tcgg
tcttt
gcttg
tcag
ctgt
057
2104
2615285
_-_
NO
gtgt
ttgtg
ctca
atgt
aaga
gcaa
ggaa
ggtg
cggg
aaga
tgag
gggg
atga
ggcg
cagg
agta
cagg
gctg
ttattc
a2105
2676383
NEK4
YES
tggt
agag
tggt
ttgta
caga
tcgc
catg
gcttt
gcag
2106
3337536
.-_
NO
aggc
ttcttg
caca
ggca
gaag
gggc
t2107
3670582
—NO
tacg
tgcc
caag
aatc
aatc
caat
ccaa
ctga
taac
atttt
ctct
gaag
atag
ctgc
ctct
tcaa
atac
tggt
acag
agat
tcaa
gaag
ctgt
acca
caga
cca
tcag
tt
2108
2359853
NO
ccca
tttttc
atta
ggtc
cagg
tgta
tcca
aatg
ttgcc
a2109
3349998
___
NO
tgat
gtgg
cccc
gattg
caga
attg
agttg
gagg
ctga
agca
tgtg
agct
ctga
tcct
tgcc
2110
3431977
NO
cttc
cctc
tgga
agac
taac
atgt
actg
ctga
agtc
ttgttc
agtg
tcct
tcta
gccc
caga
ctttg
cagc
ctac
caac
atgg
actg
agac
aggg
aaag
cgtg
ggca
agtg
agttc
cata
ctct
aattt
tcct
acag
aagc
atcc
taag
tggt
gtgt
gggt
tctc
agat
caag
tcac
aaac
acat
tcac
agcc
catg
atat
cgta
caa
cact
acttt
ttagt
tcaa
atta
atta
tgag
taac
aaca
gact
atct
gcca
aaat
aggg
agct
gca
2111
2482
372
AC
YP2
YE
Sag
ctag
gaaa
atag
gagt
ggttg
gctg
2112
2619
332
SS18
L2
YE
Sca
ttgca
gatg
ccag
tcca
acca
gcac
2113
3810
037
—N
Occ
cgtg
gctc
taaa
agta
ttatc
cgaa
atca
acca
gtca
cccc
tttca
ccttc
tctc
agttt
aaaa
atgg
atcc
agtg
cata
ccca
atag
2114
2944
959
—N
Ogt
agga
atca
aaca
tagc
gcca
tcta
tctg
ctttt
tata
ttatc
ctac
acta
tttta
aaaa
ctgc
tcaa
cagt
ctta
taca
gaaa
tcttt
aaaa
gata
gaca
ggat
aaca
tgct
atat
taac
ccca
ccat
tgaa
ataa
tcca
acac
catc
acga
ttccg
atta
agag
aaga
aaaa
aatc
tttttt
ttttc
tttttt
tttct
tttttt
ctttt
tttttt
tccg
aaa
ccac
tcgc
cctc
cact
gact
gccc
ctgt
acca
catc
aaac
agtc
tcct
ctcc
tcca
cgcc
tccg
gggt
ctgg
gaag
tctc
acct
cact
gattt
c
Table 4 . Differentially expressed RNA transcripts used to plot hierarchical clustering and
expression matrix ('heat map') in Figure IB. The 526 RNA transcripts represent a subset of the
differentially expressed transcripts (Table 3) between patients in the three clinical status groups
(i.e., 'SYS', 'PSA' and 'NED') disease using linear regression and a p-value cut-off of p<0.01 .
Weighting factors were from the regression coeffecient values; positive and negative values
indicated transcripts correlated to increased expression in 'SYS' and 'NED' disease, respectively
with intermediate expression values in the 'PSA' disease group. Weighting factors were used to
derive 526-metagene values in Figure 2 .
SEQ SEQ SEQ SEQ SEQ
ID Weights ID Weights ID Weights ID Weights ID Weights
N o N o N o N o N o
1 -6.08 57 -3.6 113 -3.3 169 -3.06 925 4.36
2 -5.71 58 -3.6 114 -3.3 170 -3.06 926 4.36
3 -5.68 59 -3.59 115 -3.27 171 -3.06 927 4.33
4 -5.39 60 -3.58 116 -3.27 172 -3.05 928 4.27
5 -5.26 6 1 -3.58 117 -3.26 173 -3.05 929 4.27
6 -4.84 62 -3.58 118 -3.26 174 -3.05 930 4.18
7 -4.7 63 -3.58 119 -3.26 175 -3.05 931 4.17
8 -4.68 64 -3.58 120 -3.25 176 -3.04 932 4.13
9 -4.66 65 -3.57 121 -3.25 177 -3.03 933 4.12
10 -4.55 66 -3.57 122 -3.24 178 -3.03 934 4.08
1 1 -4.53 67 -3.57 123 -3.24 179 -3.03 935 4.08
12 -4.47 68 -3.55 124 -3.23 180 -3.02 936 4.07
13 -4.4 69 -3.52 125 -3.23 181 -3.02 937 4.07
14 -4.37 70 -3.51 126 -3.22 182 -3.02 938 4.04
15 -4.32 7 1 -3.5 127 -3.22 183 -3.01 939 4.03
16 -4.27 72 -3.5 128 -3.22 184 -3.01 940 4.03
17 -4.23 73 -3.49 129 -3.22 185 -3 941 4.02
18 -4.2 74 -3.48 130 -3.22 186 -3 942 4.02
19 -4.18 75 -3.48 131 -3.21 187 -2.96 943 4.01
20 -4.1 76 -3.47 132 -3.21 188 -2.96 944 3.97
2 1 -4.09 77 -3.47 133 -3.21 189 -2.96 945 3.95
22 -4.06 7 8 -3.47 134 -3.2 190 -2.95 946 3.95
23 -4.03 79 -3.47 135 -3.19 191 -2.95 947 3.95
24 -4.02 80 -3.46 136 -3.19 192 -2.95 948 3.95
25 -4.01 8 1 -3.45 137 -3.19 193 -2.94 949 3.92
26 -4 82 -3.45 138 -3.19 194 -2.94 950 3.92
27 -3.96 83 -3.44 139 -3.18 195 -2.94 951 3.9
28 -3.95 84 -3.43 140 -3.17 196 -2.93 952 3.89
29 -3.95 85 -3.43 141 -3.17 197 -2 92 953 3.88
30 -3.95 86 -3.43 142 -3.16 198 -2 92 954 3.84
3 1 -3.95 87 -3.43 143 -3.16 199 -2 92 955 3.84
32 -3.92 88 -3.42 144 -3.15 200 -2 92 956 3.83
33 -3.91 89 -3.41 145 -3.14 201 -2.91 957 3.8
34 -3.88 90 -3.41 146 -3.14 202 -2.91 958 3.8
35 -3.86 9 1 -3.4 147 -3.13 203 -2.9 959 3.8
SEQ SEQ SEQ SEQ SEQ
ID Weights ID Weights ID Weights ID Weights ID Weights
N o N o N o N o N o
36 -3.85 92 -3.4 148 -3.13 204 -2.89 960 3.79
37 -3.8 93 -3.39 149 -3.12 205 -2.89 961 3.77
38 -3.8 94 -3.38 150 -3.12 206 -2.88 962 3.76
39 -3.79 95 -3.36 151 -3.11 207 -2.88 963 3.75
40 -3.74 96 -3.36 152 -3.11 208 -2.87 964 3.74
4 1 -3.73 97 -3.36 153 -3.11 209 -2.87 965 3.72
42 -3.73 98 -3.36 154 -3.11 210 -2.87 966 3.71
43 -3.71 99 -3.35 155 -3.1 211 -2.86 967 3.71
44 -3.7 100 -3.35 156 -3.1 212 -2.85 968 3.71
45 -3.7 101 -3.35 157 -3.09 213 -2.85 969 3.7
46 -3.7 102 -3.34 158 -3.09 914 5.32 970 3.7
47 -3.68 103 -3.34 159 -3.09 915 5.27 971 3.69
48 -3.67 104 -3.34 160 -3.08 916 4.82 972 3.69
49 -3.66 105 -3.33 161 -3.08 917 4.64 973 3.68
50 -3.66 106 -3.33 162 -3.08 918 4.59 974 3.67
5 1 -3.65 107 -3.32 163 -3.07 919 4.54 975 3.66
52 -3.65 108 -3.32 164 -3.07 920 4.49 976 3.65
53 -3.64 109 -3.31 165 -3.07 921 4.45 977 3.64
54 -3.64 110 -3.31 166 -3.07 922 4.43 978 3.63
55 -3.63 111 -3.3 167 -3.07 923 4.38 979 3.62
56 -3.62 112 -3.3 168 -3.07 924 4.38 980 3.62
981 3.61 1037 3.39 1093 3.18 149 3.04 1205 2.9
982 3.61 1038 3.39 1094 3.18 150 3.04 1206 2.89
983 3.6 1039 3.39 1095 3.18 151 3.04 1207 2.89
984 3.6 1040 3.38 1096 3.18 152 3.03 1208 2.89
985 3.59 1041 3.38 1097 3.18 153 3.03 1209 2.89
986 3.59 1042 3.38 1098 3.17 154 3.03 1210 2.88
987 3.58 1043 3.38 1099 3.17 155 3.03 1211 2.88
988 3.58 1044 3.37 1100 3.17 156 3.02 1212 2.88
989 3.57 1045 3.37 1101 3.16 157 3.02 1213 2.87
990 3.57 1046 3.37 1102 3.16 158 3.02 1214 2.86
991 3.56 1047 3.36 1103 3.16 159 3.01 1215 2.86
992 3.56 1048 3.35 1104 3.16 160 3.01 1216 2.86
993 3.55 1049 3.35 1105 3.15 161 3.01 1217 2.86
994 3.55 1050 3.35 1106 3.15 162 3.01 1218 2.86
995 3.55 1051 3.34 1107 3.15 163 3.01 1219 2.85
996 3.55 1052 3.34 1108 3.15 164 3.01 1220 2.85
997 3.54 1053 3.33 1109 3.14 165 3 1221 2.85
998 3.54 1054 3.33 1 10 3.14 166 3 1222 2.85
999 3.54 1055 3.33 1 1 1 3.14 167 3 1223 2.85
1000 3.54 1056 3.32 1 12 3.14 168 3 1224 2.85
1001 3.54 1057 3.31 1 13 3.13 169 3 1225 2.85
1002 3.53 1058 3.31 1 14 3.13 170 2.99 1226 2.85
1003 3.53 1059 3.31 1 15 3.12 171 2.99
1004 3.53 1060 3.31 1 16 3.12 172 2.99
1005 3.53 1061 3.3 1 17 3.12 173 2.99
1006 3.53 1062 3.3 1 18 3.11 174 2.99
1007 3.53 1063 3.3 1 19 3.11 175 2.99
1008 3.52 1064 3.29 1 20 3.11 176 2.99
SEQ SEQ SEQ >EQID Weights ID Weights ID Weights ID Weights
N o N o N o No1009 3.52 1065 3.29 1121 3.11 177 2.98
1010 3.51 1066 3.29 1122 3.1 178 2.98
1011 3.51 1067 3.28 1123 3.1 179 2.97
1012 3.5 1068 3.27 1124 3.1 180 2.97
1013 3.49 1069 3.27 1125 3.09 181 2.97
1014 3.49 1070 3.27 1126 3.09 182 2.97
1015 3.48 1071 3.27 1127 3.09 183 2.96
1016 3.48 1072 3.26 1128 3.09 184 2.96
1017 3.48 1073 3.26 1129 3.09 185 2.96
1018 3.48 1074 3.25 1130 3.08 186 2.96
1019 3.47 1075 3.24 1131 3.08 187 2.96
1020 3.46 1076 3.24 1132 3.08 188 2.96
1021 3.45 1077 3.24 1133 3.08 189 2.95
1022 3.45 1078 3.22 1134 3.08 190 2.94
1023 3.44 1079 3.22 1135 3.07 191 2.94
1024 3.44 1080 3.22 1136 3.07 192 2.94
1025 3.44 1081 3.21 1137 3.07 193 2.94
1026 3.43 1082 3.21 1138 3.06 194 2.93
1027 3.43 1083 3.2 1139 3.06 195 2.93
1028 3.42 1084 3.2 1140 3.06 196 2.93
1029 3.41 1085 3.2 1141 3.06 197 2.93
1030 3.41 1086 3.2 1142 3.05 198 2.93
1031 3.41 1087 3.2 1143 3.05 199 2.92
1032 3.41 1088 3.19 1144 3.05 200 2.92
1033 3.4 1089 3.19 1145 3.05 201 2.91
1034 3.4 1090 3.19 1146 3.05 202 2.91
1035 3.4 1091 3.19 1147 3.04 203 2.91
1036 3.39 1092 3.19 1148 3.04 204 2.91
Table 5 . Differentially expressed RNA transcripts used to plot hierarchical clustering and
expression matrix ('heat map') in Figure 1C. The 148 RNA transcripts represent a subset of the
most differentially expressed transcripts between patients with clinically significant 'recurrent'
(i.e., 'SYS') and 'non-recurrent' (i.e., 'PSA' and 'NED') disease. Weighting factors were from
the test statistic values; positive and negative values indicated transcripts correlated to increased
expression in recurrent and non-recurrent disease, respectively. Weighting factors were used to
derive 148-metagene values, which were converted by scaling and normalizing into 'POP' scores
depicted in Figure 7 .
SEQ Weights SEQ Weights SEQ WeightsID No ID No ID No
1 -4.1 925 3.04 1291 2.99
2 -4.21 926 4.61 1299 3.74
3 -5.48 927 3.79 1300 3.33
4 -3.04 933 3.24 1304 3.14
6 -3.73 934 3.65 1311 3.66
8 -3.88 935 3.64 1314 3.95
14 -3.78 939 4.2 1318 3.9
20 -3.14 941 4.14 1320 3.45
32 -3.87 944 3.72 1324 3.31
33 -3.75 945 4.03 1330 3.02
36 -4.93 947 4.44 1335 4.35
42 -3.46 949 3.77 1341 3.7
45 -3.1 954 4.34 1344 2.93
46 -4.14 960 3.54 1346 3.51
60 -5.72 968 3.84 1357 3.82
63 -4.79 970 3.5 1367 3.35
65 -3.82 971 4.68 1369 3.75
66 -4.02 974 3.63 1372 3.07
67 -3.37 978 5.27 1375 3.17
69 -3.25 986 2.59 1383 3.67
79 -3.1 999 4.74 1390 3.49
86 -3.57 1004 4.5 1395 3.19
88 -4.54 1005 3.62 1402 3.45
96 -3.68 1014 4.86 1416 3.29
100 -3.63 1022 6.29 1425 3.73
104 -2.93 1023 4.08 1443 2.97
115 -3.29 1031 3.75 1453 3.01
129 -3.91 1032 3.49 1469 3.02
130 -3.32 1039 3.44 1474 3.52
181 -3.27 1045 3.41 1489 3.66
182 -3.37 1052 3.28 1503 3.19
187 -3.56 1060 4.48 1527 3.25
189 -3.44 1062 4.1 1551 3.41
194 -2.09 1080 3.97 1598 3.26
217 -4 1093 4.02 1624 2.91
225 -4.01 1095 3.77 1689 2.82
SEQ Weights SEQ Weights SEQ WeightsID No ID No ID No241 -4.16 101 3.55
265 -3.86 108 3.39
280 -4.02 117 3.97
293 -3.36 123 4.03
295 -3.57 124 3.45
334 -3.36 126 3.25
355 -3.44 132 3.51
387 -3.73 146 3.78
400 -3.17 147 3.37
437 -4.02 153 3.71
445 -3.14 164 3.54
460 -3.36 167 3.6
468 -3 194 3.2
536 -3.65 208 3.35
592 -2.7 218 3.56
596 -2.79 219 2.96
684 -2.75 233 3.44
915 3.78 234 3.86
920 4.36 248 3.56
923 3.23 261 3.19
Table 6 . Genes identified in a literature search as being correlated to clinical outcome or
prognosis in prostate cancer patients. Indicated are the gene name and HNGC gene symbol.
Gene Name SymbolAminoadipate-semialdehyde dehydrogenase AASDHATP-binding cassette, sub-family A (ABCl), member 5 ABCA5ATP-binding cassette, sub-family B (MDR/TAP), member 1 ABCBlATP-binding cassette, sub-family C (CFTR/MRP), member
ABCC22ATP-binding cassette, sub-family C (CFTR/MRP), member
ABCC44ATP-binding cassette, sub-family C (CFTR/MRP), member
ABCC55ATP-binding cassette, sub-family G (WHITE), member 2 ABCG2V-abl Abelson murine leukemia viral oncogene homolog 1 ABLlAcetyl-Coenzyme A carboxylase alpha ACACAAcyl-Coenzyme A oxidase 1, palmitoyl ACOXlAcid phosphatase 2, lysosomal ACP2Acid phosphatase, prostate ACPPActin, gamma 2, smooth muscle, enteric ACTG2Aspartoacylase (aminocyclase) 3 ACY3AF4/FMR2 family, member 3 AFF3AF4/FMR2 family, member 4 AFF4Aryl hydrocarbon receptor AHRAbsent in melanoma 2 AIM2V-akt murine thymoma viral oncogene homolog 2 AKT2Aminolevulinate, delta-, synthase 1 ALASlActivated leukocyte cell adhesion molecule ALCAMAldehyde dehydrogenase 1family, member A2 ALDH 1A2Anaplastic lymphoma kinase (Ki- 1) ALKArachidonate 12-lipoxygenase ALOX 12Arachidonate 15-lipoxygenase, type B ALOX 15BAlpha-methylacyl-CoA racemase AMACRAlanyl (membrane) aminopeptidase (aminopeptidase N,aminopeptidase M, microsomal aminopeptidase, CD 13, ANPEPp 150)Anthrax toxin receptor 2 ANTXR2Annexin A2 ANXA2APEX nuclease (multifunctional DNA repair enzyme) 1 APEXlAndrogen receptor (dihydrotestosterone receptor; testicularfeminization; spinal and bulbar muscular atrophy; Kennedy ARdisease)V-raf murine sarcoma 361 1 viral oncogene homolog ARAFAmphiregulin (schwannoma-derived growth factor) AREGAT rich interactive domain 4A (RBPl -like) ARID4AArmadillo repeat containing 5 ARMC5Aryl hydrocarbon receptor nuclear translocator ARNTN-acylsphingosine amidohydrolase (acid ceramidase)-like ASAHLATPase family, AAA domain containing 2 ATAD2Activating transcription factor 1 ATFl
Gene Name SymbolAtaxia telangiectasia mutated ATMAtrophin 1 ATNlATP synthase, H+ transporting, mitochondrial F l complex,
ATP5Ddelta subunitATP synthase, H+ transporting, mitochondrial F O complex,
ATP5Jsubunit F6Aurora kinase A AURKAAurora kinase B AURKBAXL receptor tyrosine kinase AXLBeta-2-microglobulin B2MBCL2-antagonist of cell death BADBAIl -associated protein 2-like 2 BAIAP2L2BCL2-antagonist/killer 1 BAKlBRCAl associated protein- 1 (ubiquitin carboxy-terminal
BAPlhydrolase)BRCAl associated RING domain 1 BARDlB-cell CLL/lymphoma 2 BCL2BCL2-like 1 BCL2L1B-cell CLL/lymphoma 3 BCL3Breakpoint cluster region BCRBCSl -like (yeast) BCSlLBiglycan BGNBaculoviral IAP repeat-containing 2 BIRC2Baculoviral IAP repeat-containing 3 BIRC3Baculoviral IAP repeat-containing 5 (survivin) BIRC5Baculoviral IAP repeat-containing 7 (livin) BIRC7Bloom syndrome BLMBMIl polycomb ring finger oncogene BMIlBone morphogenetic protein 4 BMP4BoIA homolog 2 (E. coli) BOLA2V-raf murine sarcoma viral oncogene homolog B 1 BRAFBreast cancer 1, early onset BRCAlBreast cancer 2, early onset BRCA2BTB (POZ) domain containing 14B BTBD 14BBruton agammaglobulinemia tyrosine kinase BTKBUB 1budding uninhibited by benzimidazoles 1homolog
BUBl(yeast)Chromosome 15 open reading frame 33 C15orf33Chromosome 17 open reading frame 56 C17orf56Chromosome 17 open reading frame 57 C17orf57Chromosome 1open reading frame 115 Clorfl l5Chromosome 1open reading frame 77 Clorf77Chromosome 2 open reading frame 33 C2orf33Chromosome 2 open reading frame 37 C2orf37Chromosome 3 open reading frame 14 C3orfl4Chromosome 8 open reading frame 32 C8orf32Chromosome 8 open reading frame 53 C8orf53Chromosome 8 open reading frame 76 C8orf76Calcium channel, voltage-dependent, beta 4 subunit CACNB4Calmodulin binding transcription activator 1 CAMTAl
Gene Name Symbol
Caspase 2, apoptosis-related cysteine peptidase (neuralprecursor cell expressed, developmentally down-regulated CASP22)Caspase 3, apoptosis-related cysteine peptidase CASP3Caspase 8, apoptosis-related cysteine peptidase CASP8Caveolin 1, caveolae protein, 22kDa CAVlCas-Br-M (murine) ecotropic retroviral transforming
CBLsequenceCholecystokinin CCKCyclin A2 CCNA2Cyclin B l CCNBlCyclin C CCNCCyclin D l CCNDlCyclin E l CCNElCyclin H CCNHCD34 molecule CD34CD38 molecule CD38CD40 molecule, TNF receptor superfamily member 5 CD40CD44 molecule (Indian blood group) CD44CD59 molecule, complement regulatory protein CD59CD69 molecule CD69CD9 molecule CD9Cell division cycle 2, G l to S and G2 to M CDC2Cell division cycle 25 homolog A (S. pombe) CDC25ACell division cycle 25 homolog B (S. pombe) CDC25BCell division cycle 25 homolog C (S. pombe) CDC25CCDC42 effector protein (Rho GTPase binding) 5 CDC42EP5Cadherin 1, type 1, E-cadherin (epithelial) CDHlCadherin 11, type 2, OB-cadherin (osteoblast) CDHI lCadherin 13, H-cadherin (heart) CDH 13
Cyclin-dependent kinase 10 CDKlOCyclin-dependent kinase 2 CDK2Cyclin-dependent kinase 4 CDK4Cyclin-dependent kinase 6 CDK6Cyclin-dependent kinase 7 CDK7Cyclin-dependent kinase 9 CDK9Cyclin-dependent kinase inhibitor IA (p21, Cipl) CDKNlACyclin-dependent kinase inhibitor IB (p27, Kipl) CDKNlB
CDKN2A interacting protein N-terminal like CDKN2AIPNL
Cyclin-dependent kinase inhibitor 2C (pi 8, inhibits CDK4) CDKN2CCyclin-dependent kinase inhibitor 3 (CDK2-associated dual
CDKN3specificity phosphatase)CCAAT/enhancer binding protein (C/EBP), alpha CEBPACentrosomal protein 135kDa CEP 135Centrosomal protein 7OkDa CEP70Chromatin assembly factor 1, subunit A (pi 50) CHAFlACHKl checkpoint homolog (S. pombe) CHEKlChromogranin A (parathyroid secretory protein 1) CHGAChromatin accessibility complex 1 CHRACl
Gene Name SymbolCeroid-lipofuscinosis, neuronal 5 CLN5Clusterin CLUCalponin 1, basic, smooth muscle CNNlCannabinoid receptor 1 (brain) CNRlCollagen, type XVIII, alpha 1 COLl 8AlCollagen, type I, alpha 1 COLlAlCollagen, type IV, alpha 3 (Goodpasture antigen) COL4A3COMM domain containing 5 C0MMD5Catechol-O-methyltransferase COMTCoatomer protein complex, subunit beta 2 (beta prime) COPB2COP9 constitutive photomorphogenic homolog subunit 5
COPS5(Arabidopsis)Cytoplasmic polyadenylation element binding protein 3 CPEB3Cysteine-rich secretory protein 3 CRISP3V-crk sarcoma virus CTlO oncogene homolog (avian) CRKV-crk sarcoma virus CTlO oncogene homolog (avian)-like CRKLColony stimulating factor 1 receptor, formerly McDonough
CSFlRfeline sarcoma viral (v-fms) oncogene homologColony stimulating factor 2 (granulocyte-macrophage) CSF2Colony stimulating factor 3 receptor (granulocyte) CSF3RC-src tyrosine kinase CSKCystatin B (stefin B) CSTBConnective tissue growth factor CTGFCollagen triple helix repeat containing 1 CTHRClCatenin (cadherin-associated protein), alpha 1, 102kDa CTNNAlCatenin (cadherin-associated protein), beta 1, 88kDa CTNNB 1Cathepsin B CTSBCathepsin L l CTSLlCortactin CTTNCullin 2 CUL2Chemokine (C-X-C motif) ligand 14 CXCL 14Chemokine (C-X-C motif) ligand 9 CXCL9Chromosome X open reading frame 4 1 CXorf41Cytochrome b5 type A (microsomal) CYB5ACytoplasmic FMRl interacting protein 1 CYFIPlCytochrome P450, family 27, subfamily A, polypeptide 1 CYP27A1Cytochrome P450, family 2, subfamily C, polypeptide 9 CYP2C9Cytochrome P450, family 3, subfamily A, polypeptide 5 CYP3A5Disabled homolog 2, mitogen-responsive phosphoprotein
DAB2(Drosophila)Death associated protein 3 DAP3Death-associated protein kinase 1 DAPKlDeleted in colorectal carcinoma DCCDodecenoyl-Coenzyme A delta isomerase (3,2 trans-enoyl-
DCICoenzyme A isomerase)Decorin DCNDynactin 2 (p50) DCTN2Damage-specific DNA binding protein 2, 48kDa DDB2Dopa decarboxylase (aromatic L-amino acid decarboxylase) DDCDevelopment and differentiation enhancing factor 1 DDEFlDNA-damage-inducible transcript 3 DDIT3
Gene Name SymbolDEAD (Asp-Glu- Ala-Asp) box polypeptide 6 DDX62,4-dienoyl CoA reductase 2, peroxisomal DECR2DEK oncogene (DNA binding) DEKDENN/MADD domain containing 3 DENND3DEP domain containing 1B DEPDClBDEP domain containing 6 DEPDC6Diacylglycerol kinase, alpha 8OkDa DGKADEAH (Asp-Glu-Ala-His) box polypeptide 9 DHX9Deiodinase, iodothyronine, type II DIO2DIRAS family, GTP-binding RAS-like 3 DIRAS3Dyskeratosis congenita 1, dyskerin DKCl
DKFZP564O0823 protein DKFZP564O0823
Discs, large homolog 3 (neuroendocrine-dlg, Drosophila) DLG3Discs, large (Drosophila) homolog-associated protein 1 DLGAPlDeleted in malignant brain tumors 1 DMBTlDedicator of cytokinesis 5 D0CK5Desmoplakin DSPE2F transcription factor 1 E2F1E2F transcription factor 2 E2F2E2F transcription factor 3 E2F3E2F transcription factor 4, p 107/p 130-binding E2F4Endothelial differentiation, lysophosphatidic acid G-protein-
EDG7coupled receptor, 7Endothelin receptor type B EDNRBEukaryotic translation elongation factor 1 alpha 1 EEFlAlEmbryonal Fyn-associated substrate EFSEpidermal growth factor (beta-urogastrone) EGFEpidermal growth factor receptor (erythroblastic leukemia
EGFRviral (v-erb-b) oncogene homolog, avian)Early growth response 2 (Krox-20 homolog, Drosophila) EGR2Early growth response 3 EGR3Euchromatic histone-lysine N-methyltransferase 1 EHMTlEuchromatic histone-lysine N-methyltransferase 2 EHMT2Eukaryotic translation initiation factor 2-alpha kinase 2 EIF2AK2Eukaryotic translation initiation factor 3, subunit H EIF3HELKl , member of ETS oncogene family ELKlELK3, ETS-domain protein (SRF accessory protein 2) ELK3Elongation factor RNA polymerase II ELLElongation factor RNA polymerase II-like 3 ELL3Epithelial membrane protein 2 EMP2Ectonucleotide pyrophosphatase/phosphodiesterase 2
ENPP2(autotaxin)Enhancer of yellow 2 homolog (Drosophila) ENY2EPH receptor A l EPHAlEPH receptor B4 EPHB4Epsin 1 EPNlErythropoietin EPOEpidermal growth factor receptor pathway substrate 15 EPS 15
Epidermal growth factor receptor pathway substrate 8 EPS 8
Gene Name SymbolV-erb-b2 erythroblastic leukemia viral oncogene homolog 2,
ERBB2neuro/glioblastoma derived oncogene homolog (avian)V-erb-b2 erythroblastic leukemia viral oncogene homolog 3
ERBB3(avian)V-erb-a erythroblastic leukemia viral oncogene homolog 4
ERBB4(avian)Excision repair cross-complementing rodent repairdeficiency, complementation group 1 (includes overlapping ERCClantisense sequence)Excision repair cross-complementing rodent repairdeficiency, complementation group 2 (xeroderma ERCC2pigmentosum D)Excision repair cross-complementing rodent repairdeficiency, complementation group 3 (xeroderma ERCC3pigmentosum group B complementing)Excision repair cross-complementing rodent repair
ERCC4deficiency, complementation group 4Excision repair cross-complementing rodent repairdeficiency, complementation group 5 (xeroderma
ERCC5pigmentosum, complementation group G (Cockaynesyndrome))Excision repair cross-complementing rodent repair
ERCC6deficiency, complementation group 6V-ets erythroblastosis virus E26 oncogene homolog (avian) ERGEndoplasmic reticulum to nucleus signaling 2 ERN2Estrogen receptor 1 ESRlEstrogen receptor 2 (ER beta) ESR2V-ets erythroblastosis virus E26 oncogene homolog 1
ETS l(avian)V-ets erythroblastosis virus E26 oncogene homolog 2
ETS2(avian)Ets variant gene 1 ETVlEts variant gene 4 (ElA enhancer binding protein, ElAF) ETV4Ets variant gene 6 (TEL oncogene) ETV6Even-skipped homeobox 1 EVXlExocyst complex component 2 EXOC2Exostoses (multiple) 1 EXTlExostoses (multiple) 2 EXT2Enhancer of zeste homolog 2 (Drosophila) EZH2Ezrin EZRCoagulation factor II (thrombin) receptor F2RCoagulation factor V (proaccelerin, labile factor) F5Family with sequence similarity 114, member A l FAMl 14AlFamily with sequence similarity 13, member C l FAM 13ClFamily with sequence similarity 49, member B FAM49BFamily with sequence similarity 84, member B FAM84BFamily with sequence similarity 8, member A l FAM8A1Fanconi anemia, complementation group A FANCAFanconi anemia, complementation group G FANCGFas (TNF receptor superfamily, member 6) FASFas ligand (TNF superfamily, member 6) FASLG
Gene Name SymbolFatty acid synthase FASNFibulin 1 FBLNlF-box protein 32 FBXO32F-box and WD repeat domain containing 11 FBXWI lFames yl diphosphate synthase (farnesyl pyrophosphatesynthetase, dimethylallyltranstransferase, FDPSgeranyltranstransferase)Fer (fps/fes related) tyrosine kinase (phosphoprotein
FERNCP94)Feline sarcoma oncogene FESFEV (ETS oncogene family) FEVFibroblast growth factor 12 FGF 12
Fibroblast growth factor 3 (murine mammary tumor virusFGF3
integration site (v-int-2) oncogene homolog)Fibroblast growth factor 5 FGF5Fibroblast growth factor 8 (androgen-induced) FGF8Fibroblast growth factor 9 (glia-activating factor) FGF9Fibroblast growth factor receptor 1 (fms-related tyrosine
FGFRlkinase 2, Pfeiffer syndrome)Fibroblast growth factor receptor 2 (bacteria-expressedkinase, keratinocyte growth factor receptor, craniofacial
FGFR2dysostosis 1, Crouzon syndrome, Pfeiffer syndrome,Jackson-Weiss syndrome)Fibroblast growth factor receptor 4 FGFR4Fragile histidine triad gene FHITFolliculin FLCNFriend leukemia virus integration 1 FLIlHypothetical protein FLJ90709 FLJ90709Fms-related tyrosine kinase 1 (vascular endothelial growth
FLTlfactor/vascular permeability factor receptor)Fms-related tyrosine kinase 4 FLT4Flavin containing monooxygenase 5 FM05Fibromodulin FMODFolate hydrolase (prostate-specific membrane antigen) 1 FOLHlFolate receptor 1 (adult) FOLRlV-fos FBJ murine osteosarcoma viral oncogene homolog FOSFK506 binding protein 12-rapamycin associated protein 1 FRAPlFrizzled-related protein FRZBFYN oncogene related to SRC, FGR, YES FYNFrizzled homolog 7 (Drosophila) FZD7Gamma-aminobutyric acid (GABA) A receptor, gamma 2 GABRG2Growth arrest and DNA-damage-inducible, alpha GADD45AG protein beta subunit-like GBLGastrulation brain homeobox 2 GBX2Ganglioside-induced differentiation-associated protein 1 GDAPlGrowth differentiation factor 15 GDF 15
Glioma-associated oncogene homolog 1 (zinc finger protein) GLIlGlutaredoxin 2 GLRX2GPI anchored molecule like protein GMLGeminin, DNA replication inhibitor GMNNGuanine nucleotide binding protein (G protein), alpha 15 GNA 15
Gene Name SymbolGNAS complex locus GNASGuanine nucleotide binding protein (G protein), beta
GNBlpolypeptide 1Glucosamine (UDP-N-acetyl)-2-epimerase/N-
GNEacetylmannosamine kinaseN-acetylglucosamine- 1-phosphate transferase, alpha and
GNPTABbeta subunitsGolgi membrane protein 1 GOLMlGolgi-localized protein GOLSYNG protein-coupled receptor 137B GPR137BGrowth factor receptor-bound protein 2 GRB2Growth factor receptor-bound protein 7 GRB7Gremlin 2, cysteine knot superfamily, homolog (Xenopus
GREM2laevis)Gastrin-releasing peptide receptor GRPRGlycogen synthase kinase 3 alpha GSK3AGlutathione S-transferase pi GSTPlGlucuronidase, beta GUSBH l histone family, member X HlFXHeparin-binding EGF-like growth factor HBEGFHCCA2 protein HCCA2Host cell factor C 1 (VP 16-accessory protein) HCFClHemopoietic cell kinase HCKHistone deacetylase 1 HDAClHistone deacetylase 7A HDAC7AHepatoma-derived growth factor (high-mobility group
HDGFprotein 1-like)HECT, C2 and WW domain containing E3 ubiquitin protein
HECW2ligase 2Hypoxia-inducible factor 1, alpha subunit (basic helix-loop-
HIFlAhelix transcription factor)Hydroxymethylbilane synthase HMBS3-hydroxy-3-methylglutaryl-Coenzyme A reductase HMGCRHyaluronan-mediated motility receptor (RHAMM) HMMRHook homolog 1 (Drosophila) HOOKlHomeobox A9 H0XA9Homeobox C4 H0XC4Hepsin (transmembrane protease, serine 1) HPNHypoxanthine phosphoribosyltransferase 1 (Lesch-Nyhan
HPRTlsyndrome)V-Ha-ras Harvey rat sarcoma viral oncogene homolog HRASHydroxysteroid (17-beta) dehydrogenase 4 HSD17B4Hydroxysteroid (17-beta) dehydrogenase 6 homolog
HSD17B6(mouse)Heat shock transcription factor 4 HSF4Heat shock 27kDa protein 1 HSPBlIntercellular adhesion molecule 1 (CD54), human rhinovirus
ICAMlreceptorImmediate early response 3 IER3Interferon, gamma IFNGInterferon gamma receptor 1 IFNGRl
Gene Name SymbolInsulin-like growth factor 1 (somatomedin C) IGFlInsulin-like growth factor 1 receptor IGFlRInsulin-like growth factor 2 receptor IGF2RInsulin-like growth factor binding protein 1 IGFBPlInsulin-like growth factor binding protein 2, 36kDa IGFBP2Insulin-like growth factor binding protein 3 IGFBP3Insulin-like growth factor binding protein 6 IGFBP6IKAROS family zinc finger 1 (Ikaros) IKZFlInterleukin 11 ILI lInterleukin 12A (natural killer cell stimulatory factor 1,
IL12Acytotoxic lymphocyte maturation factor 1, p35)Interleukin 12B (natural killer cell stimulatory factor 2,
IL12Bcytotoxic lymphocyte maturation factor 2, p40)Interleukin 13 IL 13
Interleukin 1, beta ILlBInterleukin 2 IL2Interleukin 3 (colony-stimulating factor, multiple) IL3Interleukin 4 IL4Interleukin 6 (interferon, beta 2) IL6Interleukin 6 receptor IL6RInterleukin 8 IL8Integrin- linked kinase ILKInner membrane protein, mitochondrial (mitofilin) IMMTInhibin, alpha INHAInhibin, beta A INHBAInterferon regulatory factor 1 IRFlInsulin receptor substrate 2 IRS2ISL LIM homeobox 1 ISLlIntegrin, alpha V (vitronectin receptor, alpha polypeptide,
ITGAVantigen CD5 1)Integrin, beta 1 (fibronectin receptor, beta polypeptide,
ITGBlantigen CD29 includes MDF2, MSK 12)Integrin, beta 3 (platelet glycoprotein Ilia, antigen CD61) ITGB3Integrin, beta 4 ITGB4Inositol 1,4,5-trisphosphate 3-kinase A ITPKAInositol 1,4,5-triphosphate receptor, type 1 ITPRlIsovaleryl Coenzyme A dehydrogenase IVDJanus kinase 2 (a protein tyrosine kinase) JAK2Jumonji, AT rich interactive domain IA JARIDlAJumonji domain containing 2B JMJD2BJun oncogene JUNJun B proto-oncogene JUNBJun D proto-oncogene JUNDPotassium channel regulator KCNRGKinase insert domain receptor (a type III receptor tyrosine
KDRkinase)KH domain containing, RNA binding, signal transduction
KHDRBS3associated 3KIAAO 196 KIAAO 196KIAA0922 KIAA0922KIAA 13 24 KIAA 1324
Gene Name SymbolKinesin family member C2 KIFC2V-kit Hardy-Zuckerman 4 feline sarcoma viral oncogene
KIThomologKruppel-like factor 6 KLF6Kelch domain containing 4 KLHDC4Kallikrein-related peptidase 2 KLK2Kallikrein-related peptidase 3 KLK3Kallikrein-related peptidase 4 KLK4Karyopherin (importin) beta 1 KPNBlKeratin 15 KRT 15
Keratin 5 (epidermolysis bullosa simplex, Dowling-KRT5
Meara/Kobner/Weber-Cockayne types)L l cell adhesion molecule LlCAMLymphocyte- specific protein tyrosine kinase LCKLipocalin 2 LCN2Leprecan-like 1 LEPRELlLeucine-rich repeat-containing G protein-coupled receptor 4 LGR4Ligase I, DNA, ATP-dependent LIG lLigase III, DNA, ATP-dependent LIG3LIM domain only 1 (rhombotin 1) LMOlLIM domain only 2 (rhombotin-like 1) LM02Poly (ADP-ribose) polymerase family, member 1 LOC649459Lactotransferrin LOC728320Hypothetical protein BC008326 LOC89944Lysyl oxidase LOXLeucine rich repeat containing 2 LRRC2Limbic system-associated membrane protein LSAMPLatent transforming growth factor beta binding protein 2 LTBP2MaI, T-cell differentiation protein-like MALLMucosa associated lymphoid tissue lymphoma translocation
MALTlgene 1Monoamine oxidase B MAOBMitogen-activated protein kinase kinase 6 MAP2K6Mitogen-activated protein kinase kinase kinase 8 MAP3K8Mitogen-activated protein kinase 1 MAPKlMitogen-activated protein kinase 10 MAPKlOMitogen-activated protein kinase 14 MAPK 14
MARCKS-like 1 MARCKSLlMARVEL domain containing 3 MARVELD3MAS 1oncogene MASlMegakaryocyte-associated tyrosine kinase MATKMethyl-CpG binding domain protein 2 MBD2Melanoma cell adhesion molecule MCAMMutated in colorectal cancers MCCMCF.2 cell line derived transforming sequence MCF2Myeloid cell leukemia sequence 1 (BCL2-related) MCLlMinichromosome maintenance complex component 7 MCM7Microcephalin 1 MCPHlMdm4, transformed 3T3 cell double minute 4, p53 binding
MDM4protein (mouse)Mediator complex subunit 30 MED30
Gene Name SymbolMyocyte enhancer factor 2C MEF2CMeis homeobox 2 MEIS2Multiple endocrine neoplasia I MENlMet proto-oncogene (hepatocyte growth factor receptor) METMethyltransferase 10 domain containing METHODHypothetical protein MGC 15523 MGC 15523Antigen identified by monoclonal antibody Ki-67 MKI67Myeloid leukemia factor 1 MLFlMyeloid leukemia factor 2 MLF2MutL homolog 1, colon cancer, nonpolyposis type 2 (E.
MLHlcoli)Myeloid/lymphoid or mixed-lineage leukemia (trithorax
MLLT3homolog, Drosophila); translocated to, 3Myeloid/lymphoid or mixed-lineage leukemia (trithorax
MLLT4homolog, Drosophila); translocated to, 4Myeloid/lymphoid or mixed-lineage leukemia (trithorax
MLLT6homolog, Drosophila); translocated to, 6Matrix metallopeptidase 1 (interstitial collagenase) MMPlMatrix metallopeptidase 10 (stromelysin 2) MMPlOMatrix metallopeptidase 14 (membrane-inserted) MMP 14
Matrix metallopeptidase 2 (gelatinase A, 72kDa gelatinase,MMP2
72kDa type IV collagenase)Matrix metallopeptidase 3 (stromelysin 1, progelatinase) MMP3Matrix metallopeptidase 7 (matrilysin, uterine) MMP7Matrix metallopeptidase 9 (gelatinase B, 92kDa gelatinase,
MMP992kDa type IV collagenase)V-mos Moloney murine sarcoma viral oncogene homolog MOSMembrane protein, palmitoylated 7 (MAGUK p55
MPP7subfamily member 7)Mitochondrial ribosomal protein L l3 MRPL 13
MutS homolog 2, colon cancer, nonpolyposis type 1 (E.MSH2
coli)MutS homolog 3 (E. coli) MSH3MutS homolog 6 (E. coli) MSH6Microseminoprotein, beta- MSMBMacrophage scavenger receptor 1 MSRlMacrophage stimulating 1receptor (c-met-related tyrosine
MSTlRkinase)Metastasis associated 1 MTAl5,10-methylenetetrahydrofolate reductase (NADPH) MTHFRMyotrophin MTPN5-methyltetrahydrofolate-homocysteine methyltransferase MTRMetastasis suppressor 1 MTSSlMucin 1, cell surface associated MUClMAX dimerization protein 1 MXDlMAX interactor 1 MXIlV-myb myeloblastosis viral oncogene homolog (avian) MYBV-myb myeloblastosis viral oncogene homolog (avian)-like
MYBL22Myosin binding protein C, slow type MYBPC 1V-myc myelocytomatosis viral oncogene homolog (avian) MYC
Gene Name SymbolV-myc myelocytomatosis viral related oncogene,
MYCNneuroblastoma derived (avian)Myosin, heavy chain 11, smooth muscle MYHI lMyosin, light chain 9, regulatory MYL9Myosin, light chain kinase MYLKN-acetyltransferase 2 (arylamine N-acetyltransferase) NAT2Neuroblastoma, suppression of tumorigenicity 1 NBLlNibrin NBNNon-SMC condensin II complex, subunit D3 NCAPD3N-myc downstream regulated gene 1 NDRGlNADH dehydrogenase (ubiquinone) 1beta subcomplex, 9,
NDUFB922kDaNeurofilament, heavy polypeptide 20OkDa NEFHNeogenin homolog 1 (chicken) NEOlNeuropilin (NRP) and tolloid (TLL)-like 2 NETO2Neurofibromin 1 (neurofibromatosis, von Recklinghausendisease, Watson disease)
NFl
Nuclear factor of kappa light polypeptide gene enhancer inB-cells 1 (pi 05)
NFKBl
Nuclear factor of kappa light polypeptide gene enhancer inB-cells 2 (p49/p 100)
NFKB2
Nuclear factor of kappa light polypeptide gene enhancer inNFKBIA
B-cells inhibitor, alphaNitric oxide synthase 3 (endothelial cell) NOS3Notch homolog 1, translocation-associated (Drosophila) NOTCHlNotch homolog 2 (Drosophila) N0TCH2Notch homolog 4 (Drosophila) N0TCH4Nephroblastoma overexpressed gene NOVNADPH oxidase 4 N0X4Aminopeptidase-like 1 NPEPLlNAD(P)H dehydrogenase, quinone 1 NQOlNuclear receptor subfamily 4, group A, member 1 NR4A1Neuroblastoma RAS viral (v-ras) oncogene homolog NRASNeuropilin 1 NRPlNeurotrophic tyrosine kinase, receptor, type 1 NTRKlNeurotrophic tyrosine kinase, receptor, type 2 NTRK2Neurotrophic tyrosine kinase, receptor, type 3 NTRK3Nuclear mitotic apparatus protein 1 NUMAlNucleoporin 98kDa NUP98Ornithine decarboxylase antizyme 2 0AZ2Oxysterol binding protein-like 9 OSBPL9P antigen family, member 4 (prostate associated) PAGE4PAP associated domain containing 1 PAPDlPar-3 partitioning defective 3 homolog (C. elegans) PARD3PAS domain containing serine/threonine kinase PASKPre-B-cell leukemia homeobox 1 PBXlProliferating cell nuclear antigen PCNAPCTAIRE protein kinase 1 PCTKlPlatelet-derived growth factor alpha polypeptide PDGFAPlatelet-derived growth factor receptor, alpha polypeptide PDGFRAPlatelet-derived growth factor receptor, beta polypeptide PDGFRB
Gene Name SymbolProtein disulfide isomerase family A, member 5 PDIA5PDZ and LIM domain 5 PDLIM5Phosphatidylethanolamine-binding protein 4 PEBP4Phosphatidylethanolamine N-methyltransferase PEMTPlacental growth factor, vascular endothelial growth factor-
PGFrelated proteinPhosphoglycerate kinase 1 PGKlProgesterone receptor PGRPhosphatase and actin regulator 2 PHACTR2PHD finger protein 20-like 1 PHF20L1PHD finger protein 8 PHF8Phytanoyl-CoA 2-hydroxylase interacting protein-like PHYHIPLProtein inhibitor of activated STAT, 2 PIAS2Phosphoinositide-3-kinase, catalytic, alpha polypeptide PIK3CAPhosphoinositide-3-kinase, catalytic, delta polypeptide PIK3CDPolycystic kidney and hepatic disease 1 (autosomal
PKHDlLlrecessive)-like 1Phospholipase A2, group HA (platelets, synovial fluid) PLA2G2APhospholipase A2, group VII (platelet-activating factor
PLA2G7acetylhydrolase, plasma)Pleomorphic adenoma gene 1 PLAGlPlasminogen activator, tissue PLATPlasminogen activator, urokinase receptor PLAURPlasminogen PLGPlexin domain containing 1 PLXDC 1Promyelocyte leukemia PMLPMSl postmeiotic segregation increased 1 (S. cerevisiae) PMSlPolymerase (RNA) I polypeptide C, 3OkDa POLRlCPeriostin, osteoblast specific factor POSTNPOU class 2 homeobox 1 P0U2F1Peroxisome proliferator-activated receptor delta PPARDPeroxisome proliferator-activated receptor gamma PPARGProtein phosphatase 2 (formerly 2A), regulatory subunit A,
PPP2R1Bbeta isoformPapillary renal cell carcinoma (translocation-associated) PRCCPeroxisomal proliferator-activated receptor A interacting
PRIC285complex 285Protein kinase, cAMP-dependent, regulatory, type I, alpha
PRKARlA(tissue specific extinguisher 1)
Protease, serine, 8 PRSS8Prostate stem cell antigen PSCAProteasome (prosome, macropain) 26S subunit, non-
PSMDlATPase, 1Patched homolog 1 (Drosophila) PTCHlPatched homolog 2 (Drosophila) PTCH2Prostaglandin E receptor 3 (subtype EP3) PTGER3Prostaglandin-endoperoxide synthase 1 (prostaglandin G/H
PTGS lsynthase and cyclooxygenase)Prostaglandin-endoperoxide synthase 2 (prostaglandin G/H
PTGS2synthase and cyclooxygenase)Parathyroid hormone-like hormone PTHLH
Gene Name SymbolPTK2 protein tyrosine kinase 2 PTK2PTK7 protein tyrosine kinase 7 PTK7Pleiotrophin (heparin binding growth factor 8, neurite
PTNgrowth-promoting factor 1)Protein tyrosine phosphatase type IVA, member 3 PTP4A3Protein tyrosine phosphatase-like (proline instead of
PTPLBcatalytic arginine), member bProtein tyrosine phosphatase, receptor type, F PTPRFProtein tyrosine phosphatase, receptor type, G PTPRGProtein tyrosine phosphatase, receptor type, H PTPRHProtein tyrosine phosphatase, receptor type, N polypeptide 2 PTPRN2PoIy-U binding splicing factor 60KDa PUF60Purine-rich element binding protein A PURAPaxillin PXNPyrroline-5-carboxylate reductase 1 PYCRlPyrroline-5-carboxylate reductase-like PYCRLGlutaminyl-tRNA synthetase QARSRAB32, member RAS oncogene family RAB32RAB8A, member RAS oncogene family RAB8ARabaptin, RAB GTPase binding effector protein 2 RABEP2RAD21 homolog (S. pombe) RAD21RAD23 homolog A (S. cerevisiae) RAD23ARAD50 homolog (S. cerevisiae) RAD50RAD54 homolog B (S. cerevisiae) RAD54BV-raf- 1 murine leukemia viral oncogene homolog 1 RAFlV-ral simian leukemia viral oncogene homolog B (ras
RALBrelated; GTP binding protein)RAPl, GTP-GDP dissociation stimulator 1 RAPlGDS lRAP2A, member of RAS oncogene family RAP2ARetinoic acid receptor, alpha RARARAS p2 1protein activator (GTPase activating protein) 1 RASAlRetinoblastoma 1 (including osteosarcoma) RBlRetinoblastoma binding protein 6 RBBP6Retinoblastoma-like 2 (pi 30) RBL2Retinol dehydrogenase 11 (all-trans/9-cis/l 1-cis) RDHI lRecQ protein-like (DNA helicase Q 1-like) RECQLRecQ protein-like 4 RECQL4V-rel reticuloendotheliosis viral oncogene homolog (avian) RELV-rel reticuloendotheliosis viral oncogene homolog A,nuclear factor of kappa light polypeptide gene enhancer in RELAB-cells 3, p65 (avian)Ret proto-oncogene RETRas homolog gene family, member A RHOARas homolog gene family, member H RHOHReceptor (TNFRSF)-interacting serine-threonine kinase 1 RIPKlRelaxin 1 RLNlRing finger protein 139 RNF 139Ring finger protein 185 RNF 185V-ros UR2 sarcoma virus oncogene homolog 1 (avian) ROS lReplication protein Al, 7OkDa RPAlRas-related GTP binding C RRAGC
Gene Name SymbolRelated RAS viral (r-ras) oncogene homolog RRASRhabdoid tumor deletion region gene 1 RTDRlS 100 calcium binding protein A4 S 100A4Sterile alpha motif domain containing 12 SAMD 12Stearoyl-CoA desaturase 5 SCD5Sodium channel and clathrin linker 1 SCLTlSodium channel, nonvoltage-gated 1 alpha SCNNlAScribbled homolog (Drosophila) SCRIBSyndecan 2 SDC2Succinate dehydrogenase complex, subunit B, iron sulfur
SDHBdp)Succinate dehydrogenase complex, subunit C, integral
SDHCmembrane protein, 15kDaSEC14-like 1 (S. cerevisiae) SEC 14LlSema domain, immunoglobulin domain (Ig), short basic
SEMA3Fdomain, secreted, (semaphorin) 3FSerpin peptidase inhibitor, clade B (ovalbumin), member 5 SERPINB5Serpin peptidase inhibitor, clade I (neuroserpin), member 1 SERPINIlSplicing factor 1 SFlSecreted frizzled-related protein 4 SFRP4SH3-domain binding protein 2 SH3BP2SH3 domain containing ring finger 2 SH3RF2Sonic hedgehog homolog (Drosophila) SHHSeven in absentia homolog 1 (Drosophila) SIAHlV-ski sarcoma viral oncogene homolog (avian) SKISKI-like oncogene SKILSolute carrier family 14 (urea transporter), member 1 (Kidd
SLC 14Alblood group)Solute carrier family 20 (phosphate transporter), member 1 SLC20A1Solute carrier family 22 (extraneuronal monoamine
SLC22A3transporter), member 3Solute carrier family 25, member 42 SLC25A42Solute carrier family 44, member 1 SLC44A1Solute carrier family 45, member 3 SLC45A3SMAD family member 4 SMAD4SWI/SNF related, matrix associated, actin dependent
SMARCB 1regulator of chromatin, subfamily b, member 1SWI/SNF related, matrix associated, actin dependent
SMARCC 1regulator of chromatin, subfamily c, member 1Sphingomyelin phosphodiesterase, acid-like 3B SMPDL3BSmall nuclear ribonucleoprotein polypeptide E SNRPESyntrophin, beta 1 (dystrophin-associated protein Al,
SNTBl59kDa, basic component 1)
Syntrophin, beta 2 (dystrophin-associated protein Al,SNTB2
59kDa, basic component 2)Syntrophin, gamma 1 SNTGlSuppressor of cytokine signaling 7 SOCS7Superoxide dismutase 1, soluble (amyotrophic lateral
SODlsclerosis 1 (adult))Secreted protein, acidic, cysteine-rich (osteonectin) SPARCSAM pointed domain containing ets transcription factor SPDEF
Gene Name SymbolSpleen focus forming virus (SFFV) proviral integration
SPIloncogene spi 1Secreted phosphoprotein 1 (osteopontin, bone sialoprotein I,
SPPlearly T-lymphocyte activation 1)
Squalene epoxidase SQLESulfide quinone reductase-like (yeast) SQRDLV-src sarcoma (Schmidt-Ruppin A-2) viral oncogene
SRChomolog (avian)Steroid-5-alpha-reductase, alpha polypeptide 2 (3-oxo-5
SRD5A2alpha- steroid delta 4-dehydrogenase alpha 2)ST3 beta-galactoside alpha-2,3-sialyltransferase 1 ST3GAL1ST3 beta-galactoside alpha-2,3-sialyltransferase 5 ST3GAL5ST6 beta-galactosamide alpha-2,6-sialyltranferase 1 ST6GAL1Suppression of tumorigenicity 7 ST7Signal transducer and activator of transcription 1, 9IkDa STATlSignal transducer and activator of transcription 3 (acute-
STAT3phase response factor)Signal transducer and activator of transcription 5B STAT5BSix transmembrane epithelial antigen of the prostate 2 STEAP2Stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing
STIPlprotein)Serine/threonine kinase 11 STKI lSpleen tyrosine kinase SYKSynapsin I SYNlSynapsin III SYN3Tumor-associated calcium signal transducer 1 TACSTDlTATA box binding protein (TBP)-associated factor, RNA
TAFlCpolymerase I, C, 11OkDaTAF2 RNA polymerase II, TATA box binding protein
TAF2(TBP)-associated factor, 15OkDaT-cell acute lymphocytic leukemia 1 TALITax 1 (human T-cell leukemia virus type I) binding protein 1 TAXlBPlTATA box binding protein TBPTranscription factor 7-like 2 (T-cell specific, HMG-box) TCF7L2TEK tyrosine kinase, endothelial (venous malformations,
TEKmultiple cutaneous and mucosal)Telomerase reverse transcriptase TERTTranscription factor AP-2 gamma (activating enhancer
TFAP2Cbinding protein 2 gamma)Transcription factor Dp- 1 TFDPlTranscription factor binding to IGHM enhancer 3 TFE3Trefoil factor 1 TFFlTRK-fused gene TFGTransferrin receptor (p90, CD7 1) TFRCTransforming growth factor, alpha TGFATransforming growth factor, beta 1 TGFBlTransforming growth factor, beta 2 TGFB2Transforming growth factor, beta 3 TGFB3Transforming growth factor, beta-induced, 68kDa TGFBITransforming growth factor, beta receptor I (activin A
TGFBRlreceptor type II-like kinase, 53kDa)
Gene Name Symbol
Transforming growth factor, beta receptor II (70/8OkDa) TGFBR2Transforming growth factor, beta receptor III TGFBR3Transglutaminase 2 (C polypeptide, protein-glutamine-
TGM2gamma-glutamyltransferase)Thrombospondin 1 THBSlThrombospondin 2 THBS2Thrombopoietin (myeloproliferative leukemia virusoncogene ligand, megakaryocyte growth and development THPOfactor)T-cell lymphoma invasion and metastasis 1 TIAMlTIMP metallopeptidase inhibitor 2 TIMP2Thymidine kinase 1, soluble TKlTransmembrane protein with EGF-like and two follistatin-
TMEFF2like domains 2Transmembrane protein 134 TMEM 134Transmembrane protein 45B TMEM45BTransmembrane protein 65 TMEM65Transmembrane protein 7 1 TMEM7 1Transmembrane protease, serine 2 TMPRSS2Tumor necrosis factor receptor superfamily, member 10a TNFRSFlOATumor necrosis factor receptor superfamily, member 10b TNFRSFlOBTumor necrosis factor receptor superfamily, member 1Ia,
TNFRSFl IANFKB activatorTumor necrosis factor receptor superfamily, member 1Ib
TNFRSFI lB(osteoprotegerin)Tumor necrosis factor receptor superfamily, member IA TNFRSFlATumor necrosis factor (ligand) superfamily, member 10 TNFSFlOTumor necrosis factor (ligand) superfamily, member 8 TNFSF8Topoisomerase (DNA) I TOPlTopoisomerase (DNA) II alpha 17OkDa TOP2ATumor protein p53 (Li-Fraumeni syndrome) TP53Tumor protein p53 inducible protein 11 TP53I1 1Tumor protein p73 TP73Translocated promoter region (to activated MET oncogene) TPRTPX2, microtubule-associated, homolog (Xenopus laevis) TPX2Tripartite motif-containing 38 TRIM38TRNA methyltransferase 12 homolog (S. cerevisiae) TRMT 12Transient receptor potential cation channel, subfamily M,
TRPM8member 8Trichorhinophalangeal syndrome I TRPSlTuberous sclerosis 1 TSClTuberous sclerosis 2 TSC2Tetraspanin 13 TSPAN 13
Tetraspanin 14 TSPAN 14
Tissue specific transplantation antigen P35B TSTA3Tetratricopeptide repeat domain 29 TTC29Thymidylate synthetase TYMSTYRO3 protein tyrosine kinase TYR03Ubiquitin-conjugating enzyme E2, J2 (UBC6 homolog,
UBE2J2yeast)
Gene Name SymbolUBX domain containing 3 UBXD3Vesicle-associated membrane protein 2 (synaptobrevin 2) VAMP2Vav 1 guanine nucleotide exchange factor VAVlVav 2 guanine nucleotide exchange factor VAV2Versican VCANVascular endothelial growth factor A VEGFAVestigial like 3 (Drosophila) VGLL3Wiskott-Aldrich syndrome (eczema-thrombocytopenia) WASWD repeat domain 67 WDR67WEEl homolog (S. pombe) WEElWNTl inducible signaling pathway protein 1 WISPlWingless-type MMTV integration site family, member K)B WNTlOBWingless-type MMTV integration site family member 2 WNT2Wingless-type MMTV integration site family, member 2B WNT2BWingless-type MMTV integration site family, member 5A WNT5AWingless-type MMTV integration site family, member 8B WNT8BWerner syndrome WRNWilms tumor 1 WTlXanthine dehydrogenase XDHXeroderma pigmentosum, complementation group A XPAXeroderma pigmentosum, complementation group C XPCX-ray repair complementing defective repair in Chinese
XRCClhamster cells 1X-ray repair complementing defective repair in Chinese
XRCC4hamster cells 4X-ray repair complementing defective repair in Chinesehamster cells 5 (double-strand-break rejoining; Ku XRCC5autoantigen, 8OkDa)X-ray repair complementing defective repair in Chinese
XRCC6hamster cells 6 (Ku autoantigen, 7OkDa)V-yes-1 Yamaguchi sarcoma viral oncogene homolog 1 YESlYipl domain family, member 1 YIPFlTyrosine 3-monooxygenase/tryptophan 5-monooxygenase
YWHABactivation protein, beta polypeptideZinc finger protein 36, C3H type, homolog (mouse) ZFP36Zinc finger protein 3 13 ZNF3 13
Zinc finger protein 34 ZNF34Zinc finger protein 5 11 ZNF5 11Zinc finger protein 7 ZNF7
Table 7 . RNA transcripts used to derive metagene values for 18-RNA metagene depicted in
Figure 3 . The 6-RNA metagene is a subset of the sequences listed in Table 7, also depicted in
Figure 3 . 18-RNA metagene scores were scaled and normalized to generate 'POP' scores
depicted in Figure 4 . Weighting factors were from the linear regression coefficient values;
positive and negative values indicated transcripts correlated to increased expression in 'SYS' and
'NED' disease, respectively with intermediate expression values in the 'PSA' disease group.
SEQ ID No Weights1 -6.08
2 -5.71
3 -5.68
4 -5.39
5 -5.26
6 -4.84
7 -4.7
8 -4.68
9 -4.66
10 -4.55
1 1 -4.53
914 5.32
915 5.27
916 4.82
917 4.64
918 4.59
919 4.54
920 4.49
Table 8 . RNA transcripts used to derive metagene values for 20-RNA metagene depicted in
Figure 3 . Weighting factors were from the linear regression coefficient values; positive and
negative values indicated transcripts correlated to increased expression in 'SYS' and 'NED'
disease, respectively with intermediate expression values in the 'PSA' disease group.
SEQ ID No Weights1 -6.08
4 -5.39
6 -4.84
9 -4.66
14 -4.37
15 -4.32
16 -4.27
18 -4.2
19 -4.18
20 -4.1
2 1 -4.09
915 5.27
916 4.82
917 4.64
920 4.49
922 4.43
928 4.27
929 4.27
931 4.17
935 4.08
936 4.07
Table 9 . RNA transcripts used to derive K)-RNA metagene values, which were converted by
scaling and normalizing into 'POP' scores depicted in Figure 5 . RNA transcripts were identified
using Nearest Shrunken Centroids algorithm with leave- 1-out cross-validation to distinguish
'recurrent' (i.e., 'SYS') and 'non-recurrent' (i.e., 'PSA' and 'NED') disease from Table 3 RNA
transcripts. Weighting factors were from the test statistic values; positive and negative values
indicated transcripts correlated to increased expression in 'recurrent' and 'non-recurrent' disease,
respectively.
Seq ID Weights3 -5.48
36 -4.93
60 -5.72
63 -4.79
926 4.61
971 4.68
978 5.27
999 4.74
1014 4.86
1022 6.29
Table 10. RNA transcripts used to derive 4 1-RNA metagene values, which were converted by
scaling and normalizing into 'POP' scores depicted in Figure 6 . RNA transcripts were identified
using Nearest Shrunken Centroids algorithm with leave- 1-out cross-validation to distinguish
'recurrent' (i.e., 'SYS') and 'non-recurrent' (i.e., 'PSA' and 'NED') disease from Table 3 RNA
transcripts. Weighting factors were from the test statistic values; positive and negative values
indicated transcripts correlated to increased expression in 'recurrent' and 'non-recurrent' disease,
respectively.
SeqID Weights SeqID Weights
1 -4.1 947 4.44
2 -4.21 954 4.34
3 -5.48 971 4.68
32 -3.87 978 5.27
33 -3.75 999 4.74
36 -4.93 1004 4.5
46 -4.14 1014 4.86
60 -5.72 1022 6.29
63 -4.79 1023 4.08
66 -4.02 1032 3.49
69 -3.25 1080 3.97
88 -4.54 1093 4.02
100 -3.63 1101 3.55
241 -4.16 1164 3.54
265 -3.86 1248 3.56
334 -3.36 1304 3.14
437 -4.02 1311 3.66
920 4.36 1330 3.02
925 3.04 1402 3.45
934 3.65 1425 3.73
945 4.03
Although the invention has been described with reference to certain specific embodiments,
various modifications thereof will be apparent to those skilled in the art without departing from
the spirit and scope of the invention. All such modifications as would be apparent to one skilled
in the art are intended to be included within the scope of the following claims.
THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE PROPERTY
OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS:
1. A system for expression-based assessment of prostate cancer recurrence risk, said system
comprising one or more polynucleotides, each of said polynucleotides capable of specifically
hybridizing to a RNA transcript of a gene selected from the group of genes set forth in Table 3
and/or Table 6 .
2 . The system of claim 1, wherein said one or more polynucleotides comprise one or more
polynucleotide probes for the detection of the respective transcript.
3 . The system of claim 1, wherein said one or more polynucleotides comprise one or more
primer pairs, each of said primer pairs capable of amplifying a portion of a sequence
corresponding to the respective transcript.
4 . The system of any one of claims 1, 2 and 3, wherein said transcript comprises a sequence
corresponding to one or more of the sequences set forth in SEQ ID NOs: 1-21 14.
5 . The system of any one of claims 1, 2 and 3, wherein said transcript comprises a sequence
corresponding to one or more of the sequences set forth ina SEQ ID NOs: 914-21 14.
6 . The system according to any one of claims 1 to 5, wherein said system comprises at least 5
polynucleotides.
7 . The system according to any one of claims 1 to 5, wherein said system comprises at least 10
polynucleotides.
8 . The system according to any one of claims 1 to 5, wherein said system comprises at least 100
polynucleotides.
9 . The system of claim 1, wherein each of said one or more polynucleotides comprises a
sequence corresponding to, or complementary to, one or more of the sequences set forth in Table
4 or a fragment thereof.
10. The system of claim 1, wherein each of said one or more polynucleotides comprises a
sequence corresponding to one or more nucleic acid molecules selected from the group
consisting of:
(a) a nucleic acid depicted in any one of SEQ ID NOs: 1-21 14;
(b) an RNA form of any one of the nucleic acids depicted in SEQ ID NOs: 1-21 14;
(c) a peptide nucleic acid form of any one of the nucleic acids depicted in SEQ ID NOs:
1-21 14;
(d) a nucleic acid comprising at least 20 consecutive bases of any of (a-c);
(e) a nucleic acid comprising at least 25 consecutive bases having at least 90% sequence
identity to any of (a-c); and
(f) a complement to any of (a-e).
11. The system of claim 1, wherein said transcript comprises one or more of the sequences as set
forth in SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21 915-917, 920, 922, 928, 929, 931, 935 and 936.
12 . The system of claim 1, wherein said transcript comprises one or more of the sequences as
set forth in SEQ ID NOs: 3, 36, 60, 63, 926, 971, 978, 999, 1014 and 1022.
13. The system of claim 1, wherein said transcript comprises one or more of the sequences as
set forth in SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88, 100, 241, 265, 334, 437, 920,
925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022, 1023, 1032, 1080, 1093, 1101, 1164,
1248, 1304, 131 1, 1330, 1402, 1425.
14 . Use of the system according to any one of claims 1, 2 and 4 to 11 for the preparation of a
nucleic acid array
15. A nucleic acid array for expression-based assessment of prostate cancer recurrence risk,
said array comprising at least ten probes immobilized on a solid support, each of said probes
being between about 15 and about 500 nucleotides in length, each of said probes being derived
from a sequence corresponding to, or complementary to, a transcript of a gene selected from the
group of genes set forth in Table 3, or a portion of said transcript.
16. The nucleic acid array according to claim 15, wherein each of said probes comprises a
sequence as set forth in any one of SEQ ID NOs: 1-21 14 or complement thereof.
17. A method for expression-based assessment of prostate cancer recurrence risk, said method
comprising: (a) determining the expression level of one or more transcripts of one or more genes
in a test sample obtained from said subject to provide an expression pattern profile, said one or
more genes selected from the group of genes set forth in Table 3, and (c) comparing said
expression pattern profile with a reference expression pattern profile.
18. The method of claim 17, wherein an increased relative level of expression of one or more
transcripts, a decreased relative level of expression of one or more transcripts, or a combination
thereof is used to classify a prostate cancer as recurrent.
19. The method of claim 17, wherein an increased relative level of expression of one or more
transcripts, a decreased relative level of expression of one or more transcripts, or a combination
thereof is used to classify a prostate cancer as non-recurrent.
20. The method of claim 17; wherein said transcripts comprise a sequence as set forth in any
one of SEQ ID NOs: 1-21 14 or complement thereof.
21. The method of claim 17, wherein said one or more transcripts comprise one or more
sequences as set forth in any one of SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21 915-917, 920, 922,
928, 929, 93 1, 935 and 936.
22. The method of claim 17, wherein said transcript comprises one or more of the sequences
as set forth in SEQ ID NOs: 3, 36, 60, 63, 926, 971, 978, 999, 1014 and 1022.
23. The method of claim 17, wherein said transcript comprises one or more of the sequences
as set forth in SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88, 100, 241, 265, 334, 437, 920,
925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022, 1023, 1032, 1080, 1093, 1101, 1164,
1248, 1304, 131 1, 1330, 1402, 1425.
24. The method of claim 17, wherein the expression level of the one or more target sequences is
determined by a method selected from the group consisting of RT-PCR, Northern blotting, ligase
chain reaction, array hybridization, and a combination thereof.
25. The method of any one of claims 17 to 24, further comprising measuring the expression
level of at least one control nucleic acid in the sample.
26. The method of any one of claims 17 to 25, wherein the sample is a fine needle aspirate
biopsy [FNAB], cytology smear, cytology pellet, or a bulk tissue preparation.
27. The method of any one of claims 17 to 26, wherein the sample is fresh-frozen or fixed.
28. The method of any one of claims 17 to 27, wherein measuring the expression level utilizes a
pattern recognition method.
29. The method of claim 28, wherein the pattern recognition method comprises a linear
combination of expression levels of the target sequences.
30. The method of claim 28 or 29, wherein the pattern recognition method comprises a nonlinear
combination of expression levels of the target sequences.
3 1. A kit for characterizing the expression of one or more nucleic acid sequences depicted in
SEQ ID NOs: 1-21 14 comprising one or more nucleic acids selected from:
(a) a nucleic acid depicted in any of SEQ ID NOs: 1-21 14;
(b) an RNA form of any of the nucleic acids depicted in SEQ ID NOs: 1-21 14;
(c) a peptide nucleic acid form of any of the nucleic acids depicted in SEQ ID NOs: 1-
2 114;
(d) a nucleic acid comprising at least 20 consecutive bases of any of (a-c);
(e) a nucleic acid comprising at least 25 consecutive bases having at least 90% sequence
identity to any of (a-c); or
(f) a complement to any of (a-e); and
optionally instructions for correlating the expression level of said one or more nucleic
acid sequences with the disease state of prostate cancer tissue.
32. The kit of claim 3 1, wherein said one or more nucleic acids comprises one or more
sequences as set forth in any one of SEQ ID NOs: 1, 4, 6, 9, 14-16, 18-21 915-917, 920, 922,
928, 929, 931, 935 and 936.
33. The kit of claim 3 1, wherein said one or more nucleic acids comprises one or more of the
sequences as set forth in SEQ ID NOs: 3, 36, 60, 63, 926, 971, 978, 999, 1014 and 1022.
34. The kit of claim 3 1, wherein said one or more nucleic acids comprise s one or
more of the sequences as set forth in SEQ ID NOs 1-3, 32, 33, 36, 46, 60, 63, 66, 69, 88, 100,
241, 265, 334, 437, 920, 925, 934, 945, 947, 954, 971, 978, 999, 1004, 1014, 1022, 1023, 1032,
1080, 1093, 1101, 1164, 1248, 1304, 13 11, 1330, 1402, 1425.
35. The kit of any one of claims 3 1 to 34, wherein said one or more nucleic acids
comprises labeled nucleic acids.
36. The kit of any one of claims 3 1 to 35, wherein the kit comprises one or more primers
or primer pairs that specifically amplify at least a portion of said nucleic acids.
37. The kit of any of claims 3 1 to 36, wherein the kit further comprises one or more
control samples comprising expressed RNA, an amplification product produced therefrom or a
surrogate therefore, said control one or more control samples selected from:
a sample from non-recurrent prostate cancer; and
a sample from recurrent prostate cancer.
38. An array of probe nucleic acids certified for use in expression-based assessment of
prostate cancer recurrence risk, wherein said array comprises at least two different probe nucleic
acids that specifically hybridize to corresponding different target nucleic acids depicted in one of
SEQ ID NOs: 1-21 14, an RNA form thereof, or a complement to either thereof.
39. The array of claim 38, further comprising one or more probe nucleic acids that
specifically hybridize to house keeping genes.
40. A device for classifying a biological sample from a prostate cancer as recurrent or
non-recurrent, the device comprising:
means for measuring the expression level of one or more transcripts of one or more genes
selected from the group of genes set forth in Table 3 and/or 6;
means for correlating the expression level with a classification of prostate cancer status;
and
means for outputting the prostate cancer status.
4 1. The device of claim 40, wherein the machine utilizes an algorithm to characterize
said expression level.
42. A computer-readable medium comprising one or more digitally-encoded expression
pattern profiles representative of the level of expression of one or more transcripts of one or
more genes selected from the group of genes set forth in Table 3 and/or 6, each of said one or
more expression pattern profiles being associated with a value wherein each of said values is
correlated with the presence of recurrent or non-recurrent prostate cancer.
INTERNATIONAL SEARCH REPORT International application No
PCT/CA2009/000694
A CLASSIFICATION OF SUBJECT MATTERIPC C40B 40/06 (2006 01) , C12Q 1/68 (2006 01) , C40B 30/04 (2006 01) , C07H 21/00 (2006 01)According to International Patent Classification (IPC) or to both national classification and IPC
B FIELDS SEARCHED
Minimum documentation searched (classification system followed by classification symbols)
IPC C40B 40/06 (2006 01) , C12Q 1/68 (2006 01) , C40B 30/04 (2006 01) , C07H 21/00 (2006 01)
Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched
Electronic database(s) consulted durmg the international search (name of database(s) and, where practicable, search terms used)GenomeQuest, Delphion, Esp@cenet, Canadian Patent Database and NCBI, Authors Daviciom E and Bibikova M, Keywords prostate,cancer, biomarkers, expression and genes listed m Table 3 andr Table 6
C DOCUMENTS CONSIDERED TO BE RELEVANT
Category* Citation of document, with indication, where appropriate, of the relevant passages Relevant to claim No
X WO 01/60860 A2 (SCHLEGEL, R et al ) 23 August 2001 (23-08-2001) 1-1 1, 13-21, 23-32 and 34-42*see entire document*
X BIBIKOVA,M et al Expression signatures that correlated with Gleason score 1-3, 6, 14 and 40-42and relapse in prostate cancer Genomics 24 April 2007, VoI 89, No 6, pages666-672, ISSN 1089-8646*see entire document*
A COOPER, C S et al Mechanisms of Disease biomarkers and molecular targets 1-42from microarray gene expression studies in prostate cancer Nat CIm PractUrol December 2007, VoI 4, No 12, pages 677-687, ISSN 1743^289*see entire document*
[X] Further documents are listed in the continuation of Box C [X ] See patent family annex
Special categories of cited documents T later document published after the international films date or prioritydate and not m conflict with the application but cited to understand
A document defining the general state of the art which is not considered the principle or theory underlying the inventionto be of particular relevance
X document of particular relevance the claimed invention cannot beE earlier application or patent but published on or after the international considered novel or cannot be considered to involve an inventive
filing date step when the document is taken alone
L document which may throw doubts on priority claim(s) or which is Y document of particular relevance the claimed invention cannot becited to establish the publication date of another citation or other considered to involve an inventive step when the document isspecial reason (as specified) combined with one or more other such documents such combination
being obvious to a person skilled m the artO document referring to an oral disclosure use exhibition or other means
& document member of the same patent familyP document published prior to the international filing date but later than
the priority date claimed
Date of the actual completion of the international search Date of mailing of the international search report
7 October 2009 (07-10-2009) 14 October 2009 (14-10-2009)
Name and mailing address of the ISA/CA Authorized officerCanadian Intellectual Property OfficePlace du Portage I, Cl 14 - 1st Floor, Box PCT Sudeep Acharya (819) 994-424150 Victoria StreetGatmeau, Quebec KlA 0C9Facsimile No 001-819-953-2476
Form PCT/ISA/210 (second sheet ) (My 2009) Page 3 of 5
INTERNATIONAL SEARCH REPORT International application No
PCT/CA2009/000694
C (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT
Category* Citation of document, with indication, where appropriate, of the relevant passages Relevant to claim No
A MENDIRATTA, P and FEBBO, P G Genomic signatures associated with the 1-42development, progression, and outcome of prostate cancer Cancer 2007, VoI 11,
No 6, pages 345-354, ISSN 1177-1062*see entire document*
Form PCT/ISA/210 (continuation of second sheet) (July 2009) Page 4 of 5
INTERNATIONAL SEARCH REPORT International application No
Information on patent family members PCT/CA2009/000694
Patent Document Publication Patent Family PublicationCited in Search Report Date Member(s) Date
WO 0 160860A2 23-08-2001 AU 4154101A 27-08-2001US 2004259086A1 23-12-2004WO 0160860A3 13-06-2002
Form PC17ISA/210 (patent family annex ) (July 2009) Page 5 of 5