JBC

HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Originally published In Press as doi:10.1074/jbc.M000169200 on March 9, 2000

J. Biol. Chem., Vol. 275, Issue 25, 18946-18961, June 23, 2000
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
275/25/18946    most recent
M000169200v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mummidi, S.
Right arrow Articles by Ahuja, S. K.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Mummidi, S.
Right arrow Articles by Ahuja, S. K.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Evolution of Human and Non-human Primate CC Chemokine Receptor 5 Gene and mRNA

POTENTIAL ROLES FOR HAPLOTYPE AND mRNA DIVERSITY, DIFFERENTIAL HAPLOTYPE-SPECIFIC TRANSCRIPTIONAL ACTIVITY, AND ALTERED TRANSCRIPTION FACTOR BINDING TO POLYMORPHIC NUCLEOTIDES IN THE PATHOGENESIS OF HIV-1 AND SIMIAN IMMUNODEFICIENCY VIRUS*,

Srinivas MummidiDagger §, Mike Bamshad§, Seema S. AhujaDagger ||||||, Enrique GonzalezDagger , Pablo M. FeuilletDagger , Kazi BegumDagger , M. Cristina GalvisDagger , Vannessa KosteckiDagger , Anthony J. ValenteDagger , Krishna K. Murthy**, Luis HaroDagger Dagger , Matthew J. Dolan§§, Jonathan S. Allan**, and Sunil K. AhujaDagger ||¶¶

From the Dagger  Departments of Medicine, University of Texas Health Science Center at San Antonio and South Texas Veterans Health Care System, Audie L. Murphy Division, San Antonio, Texas 78229, the  Department of Pediatrics, Eccles Institute of Human Genetics, University of Utah, Salt Lake City, Utah 84112, the ** Department of Virology and Immunology, Southwest Foundation for Biomedical Research, San Antonio, Texas 78228, the Dagger Dagger  Department of Life Sciences, University of Texas, San Antonio, Texas 78249, and the §§ Infectious Diseases Service, Wilford Hall Medical Center, Lackland Air Force Base, San Antonio, Texas 78236

Received for publication, July 22, 1999, and in revised form, January 10, 2000

    ABSTRACT
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Polymorphisms in CC chemokine receptor 5 (CCR5), the major coreceptor of human immunodeficiency virus 1 (HIV-1) and simian immunodeficiency virus (SIV), have a major influence on HIV-1 transmission and disease progression. The effects of these polymorphisms may, in part, account for the differential pathogenesis of HIV-1 (immunosuppression) and SIV (natural resistance) in humans and non-human primates, respectively. Thus, understanding the genetic basis underlying species-specific responses to HIV-1 and SIV could reveal new anti-HIV-1 therapeutic strategies for humans. To this end, we compared CCR5 structure/evolution and regulation among humans, apes, Old World Monkeys, and New World Monkeys. The evolution of the CCR5 cis-regulatory region versus the open reading frame as well as among different domains of the open reading frame differed from one another. CCR5 cis-regulatory region sequence variation in humans was substantially higher than anticipated. Based on this variation, CCR5 haplotypes could be organized into seven evolutionarily distinct human haplogroups (HH) that we designated HHA, -B, -C, -D, -E, -F, and -G. HHA haplotypes were defined as ancestral to all other haplotypes by comparison to the CCR5 haplotypes of non-human primates. Different human and non-human primate CCR5 haplotypes were associated with differential transcriptional regulation, and various polymorphisms resulted in modified DNA-nuclear protein interactions, including altered binding of members of the NF-kappa B family of transcription factors. We identified novel CCR5 untranslated mRNA sequences that were conserved in human and non-human primates. In some primates, mutations at exon-intron boundaries caused loss of expression of selected CCR5 mRNA isoforms or production of novel mRNA isoforms. Collectively, these findings suggest that the response to HIV-1 and SIV infection in primates may have been driven, in part, by evolution of the elements controlling CCR5 transcription and translation.

    INTRODUCTION
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Simian immunodeficiency viruses (SIVs)1 comprise a large and genetically diverse group of lentiviruses that originated in sub-Saharan Africa (1-3). SIVs isolated from chimpanzees and mangabeys are very similar to human immunodeficiency virus (HIV)-1 and HIV-2, respectively (3-7). This similarity suggests that HIVs arose via cross-species transmission from non-human primate viral reservoirs. Yet, despite their common ancestry and close similarity, HIVs and SIVs differ significantly with regard to clinical disease and pathogenesis. Human infection with HIVs results in a progressive immunodeficiency syndrome, whereas African apes and monkeys (e.g. African Green monkeys (AGM)) infected with SIV exhibit no evidence of disease (7-9). These differences in pathogenicity may be due, in part, to primate species-specific variation in the genes controlling the host response or expression of host HIV/SIV entry factors (10). Thus, understanding the evolution and factors that control the expression of these genes in primates will be an important step towards identifying the molecular mechanisms underlying the response of primates to infections with SIVs and HIVs. In turn, this may illuminate potential strategies that could be used to mitigate or prevent infection with HIV-1.

An important host genetic determinant of HIV-1 pathogenesis is polymorphisms in the open reading frame (ORF) and cis-regulatory region of CC chemokine receptor 5 (CCR5), a major coreceptor for the entry of HIV and SIV (reviewed in Refs. 10 and 11). These polymorphisms could potentially influence cell-surface density of CCR5 and thus have an impact on HIV/SIV pathogenesis. For example, homozygosity for a 32-bp deletion in CCR5 ORF leads to loss of surface expression and profound resistance against HIV-1 infection (12). Similarly, a 24-bp deletion in the CCR5 ORF that was discovered in selected non-human primates might influence SIV pathogenesis (13). More recently, we and others (14-17) have shown that polymorphisms in the cis-regulatory region of human CCR5 are also associated with altered rates of disease progression and transmission. Thus, due to this close interaction with the lentiviral life cycle, CCR5 is an excellent candidate for exploring whether differences in its gene/RNA structure and regulation account, in part, for the differential pathogenesis of HIV and SIV. Additional interest in analyzing host-specific differences in CCR5 regulation at the gene and RNA level is spurred by the observation that despite having nearly identical CCR5 coding regions, Asian macaques (e.g. Pig-tailed macaques (PTM)) but not AGMs infected experimentally with SIV become symptomatic (reviewed in Refs. 1, 18, and 19). Given that the differential pathogenicity of AGMs and PTMs to SIV is very reminiscent of the differential pathogenicity of HIV and SIV, studies aimed at dissecting the genetic basis for this differential pathogenicity might provide additional insights into the varied susceptibility to HIV-1 infection among humans.

The gene and RNA structure of human CCR5 is complex. We have demonstrated that alternative splicing in the 5'-untranslated regions (UTR) of CCR5 generates several distinct mRNA isoforms that are under the control of at least two distinct promoters (20). Furthermore, the 5'-UTR of CCR5 is encompassed within these cis-regulatory regions that contain several single nucleotide polymorphisms (SNPs) associated with altered rates of HIV-1 disease progression (14-17). Thus, polymorphisms in the noncoding region of CCR5 could influence not only cis-trans interactions that impact on gene expression but also CCR5 mRNA stability and/or the efficiency of translation.

Given the multiple levels at which CCR5 expression could be regulated, we performed a comprehensive analysis of the ORF, mRNA structure, and transcriptional regulatory units of CCR5 relative to four important events in human evolution (21): the divergence of humans from chimpanzees 6 million years ago, from the Orang-Utan lineage 15 million years ago, from the cercopithecoids (Old World monkeys (OWM)) ~35 million years ago, and from New World monkeys (NWM) 50 million years ago. Results from these analyses enabled us to build the evolutionary framework needed to define the relationships among human CCR5 haplotypes that influence HIV-1 pathogenesis. This evolutionary framework facilitated greatly our ability to determine the influence of CCR5 haplotypes on HIV-1 transmission and disease progression (16).2 Additionally, we were able to test directly the hypothesis that polymorphisms in the human and non-human primate cis-regulatory region of CCR5 confer differences in transcriptional efficiencies and/or interact with different trans-acting factors or to the same transcription factor but with varying avidity.

    EXPERIMENTAL PROCEDURES
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Primate CCR5 ORFs-- The CCR5 ORF was PCR-amplified with primers that flanked the human CCR5 ORF (gcg gcc gct tat gca cag ggt gga aca ag (forward) and tct aga cca ctt gag tcc gtg tca (reverse)), cloned, and sequenced on both strands from the following species (common names are in parentheses): Chlorocebus (Cercopithecus) aethiops sabaeus (sabaeus monkey); Hylobates agilis unko (Agile gibbon); Callithrix jacchus (marmoset); Ateles geoffroyi (black-handed spider monkey); and Lagothrix lagothricha (woolly monkey). In addition, the following primate sequences available in GenBankTM (common names and GenBankTM accession numbers are in parentheses) were used to construct the CCR5 ORF network (Fig. 1a): Homo sapiens (human, X91492); Pan troglodytes (chimpanzee, AF035214); Gorilla gorilla (gorilla, AF005659); Pongo pygmaeus (Orang-Utan, AF075446); Hylobates leucogenys (white-cheeked gibbon, AF075451); Cercocebus torquatus atys (sooty mangabey, AF051905); Cercocebus aterrimus (black mangabey, AF081578); Cercocebus galeritus (Tana River mangabey, AF035215); Macaca fascicularis (crab-eating macaque, AF005660); Macaca mulatta (Rhesus macaque; AF005662); Macaca assamensis (Assamese macaque, AF075449); Macaca arctoides (stump-tailed macaque, AF075450); Macaca nemestrina (pig-tailed macaque; AF105282) Papio hamadryas hamadryas (baboon, AF005658); Papio hamadryas anubis (Olive baboon, AF023452); Colobus guereza (black and white colobus, AF141639); Cercopithecus neglectus (De Brazza's guenon, AF035218); Cercopithecus nictitans (greater spot-nosed guenon, AF035219); Cercopithecus lhoesti (l'Hoest's monkey; AF081579); Cercopithecus cephus (mustached guenon; AF035217); Cercopithecus ascanius (red-tailed guenon, AF035216); Pygathrix nemeaus (red-shanked douc langur, AF075448); Pygathrix bieti (black snub-nosed monkey, AF075445); Rhinopithecus avunculus (Tonkin snub-nosed monkey, AF075447); Rhinopithecus roxellana (Golden snub-nosed monkey, AF075444); Trachypithecus francoisi (Francois langur, AF075442); Erythrocebus patas (Patas monkey, AF035220); Chlorocebus aethiops (grivet; AB015944); Chlorocebus aethiops sabaeus (AF035221); Chlorocebus aethiops pygerythrus (vervet; AF035222); Chlorocebus aethiops tantalus (tantalus monkey; AF081577); and Presbytis phayrei (Phayre's leaf monkey, AF075443). All PCR amplifications were performed with the following conditions: initial denaturation of the DNA template for 90 s, followed by 30 cycles of a profile consisting of 94 °C for 10 s, 55 °C for 30 s, and 72 °C for 1 min. To reduce the potential for errors, a proofreading polymerase, Pfu (Stratagene, La Jolla, CA), was included with the Taq DNA polymerase (Life Technologies, Inc.) at a 15:1 ratio during the PCR amplification.

CCR5 Numbering System, RNA and Promoter Nomenclature-- The CCR5 numbering system that we used previously (15, 16, 20) was based on the sequences deposited in GenBankTM (accession numbers AF031236 and AF031237) and considered the first nucleotide of the 5'-most UTR sequence as +1. However, since we had identified additional 5'-UTR sequences (Fig. 2), and to maintain uniformity in the numbering systems used by different investigators, it was proposed at the CCR5-AIDS symposium3 to designate the first nucleotide of the CCR5 translational start site as +1 and the nucleotide immediately upstream as -1 (22). It was also proposed at this meeting to change the numbering system for CCR5 noncoding exons that we had identified previously (20). Exons 2 and 3 are two noncoding exons that are not interrupted by an intron and are now designated as exons 2A and 2B; exon 4 is now designated as exon 3. Finally, to standardize the nomenclature of the CCR5 promoters that we had identified previously (20), in this paper we have designated the downstream CCR5 promoter as promoter 1 and the upstream promoter as promoter 2. A similar nomenclature has been used by other investigators for genes that have multiple promoters (23, 24).

Primate CCR5 cis-Regulatory Region-- The region corresponding to human CCR5 -2761 to -1835 was PCR-amplified, cloned, and sequenced on both strands from the following primates (the number of different members of the given non-human species that were sequenced are shown in parentheses): P. troglodytes (n = 4); G. gorilla; P. pygmaeus; P. hamadryas anubis (n = 3); M. mulatta (n = 2); M. fascicularis; M. nemestrina; C. torquatus torquatus (red-capped mangabey); C. galeritus chrysogaster (gold-bellied mangabey); C. guereza; C. guereza kikuyuensis (Kikuyu colobus); Cercopithecus petaurista (lesser spot-nosed guenon); C. neglectus; C. diana (Diana guenon); C. l'hoesti; C. (Miopithecus) talapoin (Talapoin); C. (Erythrocebus) patas; C. aethiops (grivet; n = 3); C. sabaeus (n = 8); C. pygerythrus (n = 3); Presbytis (Trachypithecus) francoisi; Saguinus oedipus (cotton-topped tamarin); C. jacchus; Aotus trivirgatus (owl monkey); A. geoffroyi; and L. lagotricha. A single allele per non-human primate was sequenced. For Homo sapiens, 60 alleles were sequenced, and they were derived from individuals with different genotypes, including those who were homozygous or heterozygous for -2733A or -2733G, -2135C or -2135T, -1835T or -1835C (15). The CCR5 promoter region from non-human primates was PCR-amplified using the following primers: cat aaa gaa cct gaa ctt gac c (forward) and tag aat ttc taa tat aaa att cta tta aca tac tcg tga acc aca aac ggt cta (reverse). The PCR protocol used to amplify the human and non-human primate CCR5 cis-regions and the ORF was identical. Notably, several alleles from a given species had identical sequences, and repeat amplification/sequencing of the identical allele gave identical results.

Genotype Analysis of Non-human Primates-- Genotyping methods for the CCR5 polymorphisms were described previously (15, 16).

5'-RACE and Reverse Transcription-PCR (RT-PCR)-- Total RNA from human and non-human primate peripheral blood mononuclear cells (PBMC) and human leukocyte subsets was extracted using Trizol reagent (Life Technologies, Inc.). 5'-Rapid amplification of cDNA ends (RACE) was performed on a human leukocyte cDNA library (CLONTECH, Palo Alto, CA) using an exon 2B-specific primer (ggg aac gga tgt ctc agc tct tct) according to the manufacturer's protocols. For RT-PCR, RNA was reverse-transcribed using a CCR5 exon 3-specific oligonucleotide (acc aaa gat gaa cac cag tga gta gag), and the resulting cDNA was amplified using a forward primer derived from the newly identified sequence of CCR5 exon 1 (tgt ctt ctc agc tct gct gac; Fig. 2) and a reverse primer derived from CCR5 exon 3 (gct ccg atg tat aat aat tga tgt). The specificity of the products obtained from the PCR was confirmed further by performing a nested PCR. The sequences of the primers used in the nested PCR were aat act tga gat ttt cag atg (forward) and aga ttg gac ttg aca ctt gat aat cca t (reverse). All RT-PCR reactions were run with a negative control that included no cDNA template.

Promoter Analysis-- To study the differences between the CCR5 promoter activity of sabaeus AGM and that of humans, we constructed a series of firefly luciferase-sabaeus (S1 to S5) and human (H1 to H5) CCR5 promoter constructs in the promoterless pGL3Basic vector (Promega, Madison, WI). A single sabaeus haplotype and a haplotype representative of CCR5 human haplogroup (HH)-A (16) were used to construct the reporter plasmids. The constructs were transfected into human embryonic kidney (HEK), human erythroleukemia (K562), and AGM kidney (COS) cell lines and tested for luciferase reporter activity, as described previously (20). To study differences in promoter activity exhibited by the cis-regulatory regions of human CCR5 haplotypes, the genomic region spanning -2761 to -1814 was PCR-amplified from haplotypes corresponding to HHA, HHC, HHE, HHF, or HHG (16) and cloned into the pGL3Basic vector. Transfection into K562 and Jurkat cell lines and dual luciferase assays were as described previously (20). For all promoter activity analyses, at least two different plasmid preparations were used, and the DNA in each plasmid preparation was quantified spectrophotometrically twice. The Wilcoxon signed-ranks test was used to compare the mean luciferase activity between homologous sabaeus and human promoter constructs. Statistical analysis to determine the differences in the mean luciferase activity among human CCR5 promoter alleles was by one-way analysis of variance followed by the Scheffe's post hoc test.

Electrophoretic Mobility Shift Assay (EMSA)-- All cell lines were obtained from American Type Culture Collection and were maintained as described previously (20). Nuclear extracts were prepared from K562, THP-1 (human monocyte), Jurkat (human T-cells), and COS cell lines, according to standard protocols (25). Nuclear proteins from PHA-stimulated human PBMCs were extracted according to the procedure of Schreiber et al. (26). Prior to nuclear protein extraction, human PBMCs were stimulated for 3 h with phorbol myristic acid (25 ng/ml) and ionomycin (1 µg/ml). PBMCs from SIV-uninfected sabaeus AGM and vervet AGM were stimulated with 10 µg/ml PHA for 2 days and were maintained in culture for 11 days in the presence of 10% human IL-2 (Advanced Biotechnologies Inc., Columbia, MD) before extracting the nuclear proteins. Nuclear extracts from similarly PHA/IL-2-treated human PBMCs served as controls for the EMSA experiments that used sabaeus/vervet PHA-blast nuclear extracts. The nucleotide sequences of the oligonucleotides used in EMSAs are shown in Table I. For competition experiments, unlabeled competitor oligonucleotides were incubated with the nuclear extracts for 10 min on ice prior to adding the labeled probe. The specificity of the binding reactions was confirmed by using nonspecific double-stranded oligonucleotide competitors. For supershift experiments, the indicated antibodies (Santa Cruz Biotechnology, Santa Cruz, CA) were incubated with the nuclear extracts and the radiolabeled probes for 45 min at room temperature prior to electrophoretic resolution of the complexes. Densitometry analysis of the EMSA gels was performed with the NIH Image software package (version 1.61).

                              
View this table:
[in this window]
[in a new window]
 
Table I
Nucleotide sequences of the oligonucleotides used in EMSAs
Only the sequence of the sense oligonucleotides is shown. The nucleotide differences between the oligonucleotides used to determine differential nuclear factor binding at human CCR5 SNPs are in capital letters and are in parentheses (see "Experimental Procedures").

Phylogenetic Analysis-- Sequences were aligned using the SEQUENCHER software package. Descriptive statistics were obtained using ARLEQUIN software.4 Mean nucleotide diversity within populations was estimated using the equation, pi  = (n/n - 1)Sigma xixjpi ij; where n is the number of DNA sequences examined, xi and xj are the population frequencies of the ith and jth type of DNA sequences, and pi ij is the proportion of nucleotides that differ between the ith and jth types of DNA sequence. Genetic distances between sequences were estimated using DNADIST of the PHYLIP software package using Kimura's two-parameter model.5 The transition to transversion ratio was varied from 2:1 to 10,000:1 but had no substantial impact on the results. Distances between populations were estimated from distances between individuals using NEIDIST (27). Relationships between lineages and/or populations were depicted as neighbor-joining networks (28), using NEIGHBOR. Inferred branch lengths with negative values were converted to branches of length zero. Parsimony networks were constructed using DNAPARS. The robustness of branches was assessed by using 100 bootstrap data sets obtained using SEQBOOT. Neighbor-joining and parsimony trees were produced using mouse sequence as an outgroup, and CONSENSE was used to find the consensus tree. Networks were visualized using TREETOOL. Estimates of the rates of nonsynonymous (dN) and synonymous (dS) substitutions for all pairwise comparisons were calculated using the method of Nei and Gojobori (29) as implemented in the PAML package (30).

    RESULTS
TOP
ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
REFERENCES

Molecular Evolution of the CCR5 ORF in Primates-- Comparison of the complete CCR5 ORF from 37 different primates revealed that the nucleotide sequence and amino acid identity of CCR5 were highly conserved (alignment is shown in Supplemental Figs. 1 and 2; species list is provided under "Experimental Procedures"). Of the variable sites, 184 were SNPs, including 148 transitions and 55 transversions. To quantify the amount of sequence variation found among primates, we estimated the nucleotide diversity of each group. Nucleotide diversity is equivalent to the number of nucleotide differences per site between a random pair of chromosomes drawn from a population. For all primates, the mean nucleotide diversity of the CCR5 ORF was 0.025 (~1 variant in every 40 bp). Levels of total nucleotide diversity differed substantially among hominoids (humans and apes), OWM, and NWM. Nucleotide diversity in hominoids (0.009) and OWM (0.0013) was substantially lower than that found within the total primate group, whereas nucleotide diversity in NWM was the highest of all primate groups (0.032).

In coding regions, mutation and selection are expected to have different effects on nonsynonymous (dN) and synonymous (dS) nucleotide substitutions. Consequently, comparisons of the rate of dN to dS substitutions (dN/dS) can be utilized to explore molecular sequence evolution (31). The neutral theory predicts that despite varying mutation rates between lineages, dN/dS should remain constant among lineages. Thus, variation of dN/dS among lineages is considered evidence against neutrality, whereas dN/dS ratios >1.0 are evidence for positive selection (32).

Pairwise maximum likelihood estimates of dN/dS among primate CCR5 ORFs were consistently <1.0. However, estimation of dN/dS for each of the domains of CCR5 (e.g. NH2 terminus, extracellular loops, and intracellular tail) revealed an interesting trend. Pairwise estimates of dN/dS among hominoids and NWM, for the sequence encoding the NH2 terminus, were consistently >1.0. Pairwise estimates of dN/dS for the second extracellular loop among hominoids and two of three NWM (spider or woolly monkey) were >1.0 (dN/dS between hominoids and marmosets was always <1.0). These findings suggested that the effects of natural selection might vary among specific domains of CCR5 ORF. Moreover, these results indicated that substitutions in the NH2 terminus and second extracellular loop may underlie a selective response to the pathogens after the NWM and Catarrhine split. This was consistent with the finding that the bulk of polymorphisms in the human and non-human CCR5 ORF have been found primarily in the NH2 terminus, and only a few are found elsewhere (33, 34).

Phylogenetic reconstruction of the genetic affinities among hominoids, OWM, and NWM demonstrated that NWM were substantially more divergent from either hominoids or OWM (Fig. 1a). That is the genetic distance between NWM and hominoids (0.054) or NWM and OWM (0.055) was much larger than the genetic distance between hominoids and OWM (0.014). These findings were consistent with estimates of genetic divergence among these groups based upon analysis of morphological and neutral genetic markers (35). Thus, it is noteworthy that despite the different roles that CCR5 may have played in mediating responses to pathogens (e.g. SIV and HIV-1) among African OWM, Asian OWM, and hominoids, the pattern of variation observed among primate CCR5 ORFs is similar to that observed for neutral markers. Overall these data suggest that the expression/function of CCR5 among OWM and hominoids is more likely to be controlled by factors that regulate CCR5 transcription, mRNA processing, and/or translation than selection for different ORF variants. For this reason, we explored the nature of variation in the mRNA structure and cis-regulatory region of CCR5 in NWM, OWM, and hominoids.


View larger version (38K):
[in this window]
[in a new window]
 
Fig. 1.   Phylogenetic relationship of and genetic variation in the coding and a cis-regulatory region of primate CCR5. a, phylogenetic networks for primate CCR5 OR5 and promoter 1 (downstream promoter, old nomenclature (20)). The higher primates included in the analysis are shown in red, OWM in blue, and NWM in purple. The number of alleles from a particular species that was included in the analysis is shown in parentheses. The bootstrap supports for each branch is denoted at the branch point (bootstrapping is a commonly used procedure for estimating the statistical significance of individual branches within a network). Estimates of bootstrap support for both phylogenetic networks were obtained using the homologous mouse sequences to root the consensus trees. The sequence alignment of the CCR5 ORF sequences and cis-regulatory region used in these analyses is shown in Supplemental Figs. 1 and 3, respectively. b, comparison of a CCR5 cis-regulatory region of primates (-2761 to -1835). The region corresponding to human CCR5 -2761 to -1835 was sequenced from genomic DNA of 60 humans and 43 non-human primates. The sequence from -2762 to -2903 was obtained by sequencing human and non-human primate CCR5 mRNA. The complete sequence alignment of the non-human primates is shown in Supplemental Fig. 3. The numbering system is based on the first nucleotide of the CCR5 ORF as +1. The sequence of two representative great apes and OWMs and a single NWM is shown. Dots, sequences not shown; dashes, deletions/gaps found (relative to human); asterisks, regions upstream of CCR5 -2761 that were not determined. -2761 denotes +1 in the old numbering system (20). Arrows demarcate the boundaries of human CCR5 exons 1, 2A, and 2B and introns 1 and 2. -2761 and -2903 correspond to the 5'-most end of the longest human CCR5 mRNA isoform characterized previously (20) and in this study (see Fig. 2), respectively. Exon sequences are in uppercase and represent the CCR5 mRNA identified in this study from humans, chimpanzee, rhesus, and sabaeus monkey. Intron sequence is in lowercase. Orang-Utan and woolly monkey CCR5 mRNA structure was not determined, and hence we cannot predict the exon-intron boundaries; the exon-intron boundaries shown are based on the sequence homology of the splice acceptor-donor sites between these two species and humans. The seven common polymorphic nucleotides identified in the CCR5 cis-regulatory region spanning from -2761 to -1835 are indicated on the top row in blue (human polymorphisms: -2733, -2554, -2459, -2135, -2132, -2086, and -1835). The sequence corresponding to these seven polymorphic sites in CCR5 human haplogroup A (HHA) and non-human primates is shown in red. Green residues indicate where chimpanzee sequence differs from humans. The gaps (relative to human sequence) present in the vast majority of OWMs (gaps 1-6) and the dinucleotide gap present in chimpanzees are boxed. Note that the nucleotide sequence differences between humans and Orang-Utan, rhesus, sabaeus, or woolly monkey are not highlighted. The polymorphic nucleotides/gaps (relative to human sequence) in chimpanzees and OWMs that are present between -2903 and -2761 are not shown. c-i, intra- and interspecies polymorphisms in non-human primates. The unique CCR5 haplotypes identified in four chimpanzees, three baboons, three vervets, two rhesus macaques, three grivets, one cynomolgus macaque, one PTM, and eight sabaeus AGMs are shown. Dots, sequences not shown; dashes, gaps found. The numbering system is identical to that in b. Exons and intron sequences are in upper and lowercase, respectively. The polymorphisms that are found at the same position in several species are in green (e.g. the polymorphism at a position corresponding to human CCR5 -2719 is common to rhesus and sabaeus). Intra-species polymorphisms are shown in red.

Evolution of a cis-Regulatory Region of CCR5 in Non-human Primates-- Alignment of the nucleotide sequence of a cis-regulatory regions of CCR5 from non-human primates revealed high sequence conservation (Fig. 1b and Supplemental Fig. 3). Nevertheless, substantial intra- and interspecies sequence variation was observed (Fig. 1, c-i). Compared with the human sequence, one gap was required to align the sequence of the chimpanzee CCR5 cis-regulatory region, and six gaps were inserted to align the OWM sequences. No gaps were required to align the gorilla and Orang-Utan CCR5 promoter sequences; however, several gaps were required to align the 44 primate sequences (Supplemental Fig. 3).

Compared with the CCR5 ORF, a cis-regulatory region of CCR5 demonstrated similar levels of nucleotide sequence diversity. Of the polymorphic sites, 237 were SNPs, including 177 transitions and 68 transversions. For all primates, the mean nucleotide diversity of the cis-regulatory region of CCR5 was 0.022, which is approximately 1 variant in every 45 bp. Mean nucleotide diversity was 0.007, 0.007, and 0.028 in hominoids, OWM, and NWM, respectively. The cis-regulatory regions of CCR5 in chimpanzee and human differed at 41 sites, including 8 fixed and 33 variable sites, suggesting that the CCR5 cis-regulatory regions of chimpanzees and humans were considerably more different from one another than their CCR5 ORFs.

Genetic distances estimated from the cis-regulatory region of CCR5 of hominoids, OWM, and NWM indicated that hominoids were nearly equally divergent from OWM and NWM (Fig. 1a), i.e. the genetic distance between hominoids and OWM (0.058) was comparable to the genetic distance between hominoids and NWM (0.067). This was in contrast to the closer affinity of hominoids and OWM, as estimated from analysis of the CCR5 ORF. In other words, the genetic distance between OWM and NWM was similar regardless of whether the CCR5 ORF or cis-regulatory regions were compared, whereas the genetic distances between OWM and hominoids estimated from the cis-regulatory region of CCR5 was 4 times greater than estimates of the genetic distance from the CCR5 ORF. These data suggested that the CCR5 cis-regulatory region of hominoids was substantially more divergent from OWM than is the CCR5 ORF. This underscores the potential role that different evolutionary forces (e.g. natural selection) may have played in shaping the genetic variation of the cis-regulatory region of CCR5 in hominoids versus OWM.

CCR5 mRNA Splicing Patterns in Primates: mRNA Isoform Diversity-- We designed experiments to investigate whether the complex mRNA structure of CCR5 (20) is conserved across different primate species. Furthermore, because the cis-regulatory region of CCR5 encompasses the 5'-UTR region of human CCR5 (Fig. 2a and b), we determined whether the genetic variation we observed in this region in non-human primates would influence the nature of the mRNA isoforms produced in different primates.


View larger version (45K):
[in this window]
[in a new window]
 
Fig. 2.   Conservation of a complex CCR5 mRNA structure in primates. a, genomic organization of human CCR5. Numbered boxes, exons; slashed lines, gaps; Promoter 1 and Promoter 2 indicate the previously designated downstream (Pd) and upstream (Pu) promoter, respectively (20). 5'-RACE extended the previously described exon 1 by an additional 142 bp (-2762 to -2903 which is equivalent to -1 to -142 of GenBankTM accession numbers AF031237 and AF031237). Arrows under the open boxes indicate the location of the sense and antisense primers used in RT-PCR experiments. b, novel CCR5 mRNA sequence (-2903 to -2762) and genomic sequence upstream of human CCR5 exon 1. Exon and intron/ cis-regulatory sequences are in uppercase and lowercase, respectively. The novel mRNA sequences identified in this paper are highlighted in red, and the previously identified exon 1 mRNA sequences (20) are in black uppercase. Arrows indicate the start of exon 1 and intron 1. Putative transcription factor-binding sites upstream of exon 1 are overlined with a straight arrow. c, schema of alternative CCR5 mRNA splicing patterns in primates. mRNA isoforms that contain exon 1 are arbitrarily designated as full-length transcripts (CCR5A and CCR5B), and those lacking it are designated as truncated transcripts and are not assigned a name (20). Tick marks indicate exon boundaries, and the angled dashed lines indicate the splicing patterns. gt and ag indicate the splice donor and acceptor sites, respectively. The mutations in the splice acceptor site are denoted in red. These mutations prevent the production of the CCR5 mRNA isoforms that are marked by an X (e.g. CCR5B was not found in cynomologous, rhesus or sabaeus AGM). In a sabaeus monkey, a CCR5 mRNA species was found whose structure includes a truncated exon 1 (lacks the region corresponding to human CCR5 -2752 to -2705) and exons 2A, 2B, and 3. The non-consensus splice donor site of the truncated sabaeus exon 1 is shown in green. d, expression of novel exon 1 sequence in primates. An ethidium bromide-stained gel of the RT-PCR products is shown. The sources for the RNA used in RT-PCR are indicated on the top, and abbreviations used are as follows: P, human peripheral blood mononuclear cells; L, human lymphocytes; AC, human activated CD4+ cells; and SAB, peripheral blood mononuclear cells from a sabaeus AGM. The location of the final set of primers used in the RT-PCR is shown in a. The length of the products amplified from human cells is indicated on the left. In sabaeus, the isoform homologous to CCR5A is shorter because of a deletion (48 nucleotides) in the 3' end of the region corresponding to human CCR5 exon 1.

We have shown previously that two "full-length" CCR5 mRNA transcripts (CCR5A and CCR5B) arise by alternative splicing (20) (Fig. 2c). Several "truncated" transcripts can also originate in either exon 2A (old nomenclature exon 2) or exon 2B (old nomenclature exon 3) of CCR5 (Fig. 2c). By using 5'-RACE on a human leukocyte cDNA library, we extended the known CCR5 mRNA sequence common to both CCR5A and CCR5B by 142 additional nucleotides (-2903 to -2762; Fig. 2, a and b). Analysis of the genomic DNA sequence immediately upstream of the newly identified mRNA sequence in exon 1 indicated the presence of several potential transcription factor binding sites (Fig. 2b). This new exon 1 sequence was subsequently found in different human leukocyte subsets as well as in mononuclear cells of several non-human primate species (Fig. 2, c and d, and data not shown).

Comparison of the genomic DNA sequence extending from exon 1 to exon 2B among non-human primates (Fig. 1b and Supplemental Fig. 3) and RNA transcripts in mononuclear cells derived from chimpanzees, rhesus macaque, cynomolgus macaque, and sabaeus AGM revealed the following (Fig. 2, c and d). The exon-intron splice donor and acceptor sites were conserved between humans and Orang-Utan, gorilla, langur, and NWM. Mutations in the exon-intron splice acceptor donor sites lead to loss of expression of selective CCR5 mRNA isoforms in different non-human primates. Alternatively, usage of a non-canonical splice donor site in exon 1 of sabaeus resulted in the expression of a novel mRNA isoform. Despite these differences, it appears that the overall mRNA structure of CCR5 has been conserved for at least 35 million years, suggesting that the retention of this complicated RNA organization may have afforded a selective advantage.

Evolution of a cis-Regulatory Region of CCR5 in Humans-- Sequence analysis of the cis-regulatory region (-2761 to -1835) of 60 human CCR5 alleles revealed a total of 32 variable sites that define 27 unique human haplotypes (Fig. 3). An additional unique CCR5 haplotype was found by sequencing a genomic clone (GenBankTM accession number AF009962; deposited by Dr. N. L. Michael's group). Sequencing of the homologous region from the 43 non-human primates (Fig. 1b and Supplemental Fig. 3) and genotypic data from 40 additional non-human primates, including 23 chimpanzees, enabled us to define the CCR5 haplotype ancestral to humans. That is we determined the polarity (the ancestral-descendant relationship) of each nucleotide variant in the cis-regulatory region of human CCR5. In previous studies (15, 20), we found seven common polymorphic sites in the region between CCR5 -2761 to -1835 (Fig. 3a). -2733A, -2554G, -2459G, -2135T, -2132C, -2086A, and -1835C represented the ancestral state for these variable sites in human CCR5 (Figs. 1b and 3c). The nucleotide identity at each of these positions was invariant among great apes (except Gorilla which had a CCR5 -2132T) and OWM (Fig. 1b). This ancestral CCR5 haplotype was used to root a phylogenetic network depicting the possible evolutionary relationships among unique human CCR5 haplotypes (Fig. 3b).


View larger version (33K):
[in this window]
[in a new window]
 
Fig. 3.   CCR5 gene map and phylogenetic network of CCR5 haplotypes and haplogroups. a, schema of CCR2 and CCR5 loci on chromosome 3 (not to scale; hatched marks denote gaps). The CCR5 exons (open boxes) and two introns are numbered; the ORF is in exon 3. Downward-pointing arrows indicate the common polymorphisms found in CCR5 ORF, CCR2 ORF, and in a cis-regulatory region spanning from CCR5 -2761 to -1835. The arrow above the gene map denotes the CCR5 promoter 1, previously referred as downstream promoter (20). b, a phylogenetic tree depicting the relationships among the seven CCR5 human haplogroups (HHA-HHG). A chimpanzee CCR5 haplotype was used as an outgroup. The sequences of 28 unique CCR5 haplotypes were used to generate the phylogenetic tree. Each haplotype was assigned a number (1-28), which is displayed at the tips of the branches. The CCR5 haplotypes that have a common evolutionary history are clustered together and are boxed. Each cluster of CCR5 haplotypes therefore defined a unique CCR5 haplogroup, and all haplotypes within a haplogroup share several distinct genetic features. The CCR5 cis-regulatory polymorphism(s) that define a haplogroup and the bootstrap support for each branch are denoted at the branch point. The subset of haplotypes within HHF and HHG that are in linkage disequilibrium with the CCR2-64I and CCR5-Delta 32 polymorphisms, respectively, are indicated by a suffix following their identification number. The CCR2-64I and CCR5-Delta 32 polymorphisms were genotyped as described previously (15, 16). c, a schematic representation of the nucleotide sequences of the unique human CCR5 haplotypes (-2761 to -1835). The sequences of human CCR5 haplotypes were compared with those found in the homologous region of chimpanzee CCR5. The numbers at the bottom of the panel correspond to human CCR5 sequence. The sequences found at the corresponding nucleotide positions in chimpanzee CCR5 are shown and are in red. Dashes represent gaps, and dots denote identity between human and chimpanzee CCR5 sequences for the indicated nucleotide position. Each row is numbered serially (1-28) and represents the sequence for the 28 haplotypes displayed in the phylogenetic tree. CCR5 SNPs common to several human haplotypes are boxed, whereas those that are unique to individual haplotypes are not boxed and are in black. Polymorphisms unique to chimpanzee CCR5 are not shown. CCR5 haplotypes that form a haplogroup are bracketed. d, classification of CCR5 human haplogroups. Each haplotype within a haplogroup is characterized by the constellation of invariant polymorphisms indicated but differ from each other by additional SNPs. This cassette of nucleotide sequences is designated by a 7-letter SNP signature motif. The sequences within a SNP signature motif that are common to those found in the ancestral CCR5 haplotype, designated as HHA, are in red. HHF*2 and HHG*2 designate the subset of haplotypes within HHF and HHG that are in linkage disequilibrium with the CCR2-64I and CCR5-Delta 32 polymorphisms, respectively. The 7-letter SNP signature motifs for HHF*2 and HHG*2 have the prefix, 64I, and the suffix, Delta 32, respectively. The sequence for the haplotype representing HHG*2 is derived from a CCR5 genomic DNA clone (GenBankTM accession number AF009962). The HHB haplotype was found by genotyping over 2,000 individuals (16) and confirmed by sequencing. e, a model illustrating the potential evolution of human CCR5 haplogroups. HHB, HHC, and HHD differ from HHA by having a -2554T mutation. However, unlike HHC or HHD, HHB is not mutated at either CCR5 -2132 or -2086. HHB may therefore be ancestral to HHC and HHD. HHG*1 and HHF*1 are likely to be ancestral to HHG*2 and HHF*2, respectively.

A phylogenetic network of unique CCR5 haplotypes provided the evolutionary framework for defining seven biologically distinct clusters of haplotypes that we designated as CCR5 human haplogroups (HH)-A, -B, -C, -D, -E, -F, and -G (Fig. 3, b-d). HHA represented the ancestral CCR5 haplogroup. The haplogroups, HHC through HHG, were defined by at least one SNP, i.e. SNPs -2086G, -2132T, -1835T, and -2733G distinguish CCR5 HHC, HHD, HHF, and HHG, respectively. HHB haplotypes had a -2554T mutation but lacked the -2132T and -2086G SNPs. An HHB haplotype is likely to be ancestral to HHC and HHD (Fig. 3e). SNPs -2459A and -2135C were in complete linkage disequilibrium. Haplotypes with -2459A and -2135C but lacking -2733G and -1835T defined HHE. The polymorphisms CCR5 -2733G, Delta 32, -1835T, and CCR2-64I defined the haplotypes that are descendants of ancestral haplotypes in HHE (Fig. 3e). The CCR2-64I and CCR5-Delta 32 polymorphisms were found only on CCR5 haplotypes in haplogroups F (HHF*2) and G (HHG*2), respectively. To assess the robustness of each of the branches that define, in part, human CCR5 haplogroups, a bootstrap analysis was performed. Each branch was observed in 60% or more of the networks generated (Fig. 3b). Collectively, these findings demonstrate that SNPs in CCR5 may have arisen by a nested mutational process and that this locus represents a complex multi-allelic system. A clear understanding of the extent and organization of genetic variation in human CCR5, including the classification system we developed herein, permitted us to investigate the role of CCR5 haplotypes in vertical/horizontal transmission and disease progression in infected HIV-1 adults and children (16).2

Functional Effects of Variation in a cis-Regulatory Region of Human CCR5: Haplotype-specific Differences in Transcriptional Activity-- Because of the influential effect that CCR5 haplotypes had on HIV-1 transmission/disease progression (14-17), we next initiated studies to determine the potential mechanisms underlying their effects. To test the hypothesis that polymorphisms in the cis-regions of human CCR5 haplotypes confer haplotype-specific differences in transcriptional activity, we cloned CCR5 -2761 to -1814 derived from haplotypes representing several of the CCR5 haplogroups upstream of the luciferase ORF. These constructs represent haplotype-specific CCR5 promoter constructs because they included the cis-region of CCR5 that encompassed the polymorphic sites that demarcated the major haplogroups. We found significant differences among the luciferase activity of the five haplotype-specific promoter constructs tested, with the HHA-specific promoter construct demonstrating the least promoter activity (Fig. 4a).


View larger version (34K):
[in this window]
[in a new window]
 
Fig. 4.   Haplotype-specific promoter efficiency and differential nuclear factor binding associated with CCR5 SNPs. a, mean (±S.E.) luciferase activity of haplotype-specific promoter constructs made from the region spanning CCR5 -2761 to -1814 is shown. The number of transfections in K562 cells performed with each haplotype-specific construct is included within each bar graph. *, p < 0.01 for the difference between HHA versus HHE, HHF, or HHG; dagger , for the difference between HHC versus HHF. Similar results were obtained in the Jurkat T-cell line (data not shown). b, EMSAs were performed with K562 (K), THP-1 (T), Jurkat (J), or no (N) nuclear extracts and radiolabeled oligonucleotides designated as -2733A and -2733G or -1835C and -1835T (Table I). These oligonucleotides span the region encompassing the SNPs at CCR5 -2733 and -1835, respectively, and differ by the indicated single nucleotide (A/G or C/T; Table I). Lanes 1-4, 5-8, 17-20, and 21-24 illustrate the interaction of the radiolabeled oligonucleotides -2733A, -2733G, -1835C, and -1835T with nuclear proteins, respectively. The competition with the homologous (lanes 9-11 and 25-27), heterologous (lanes 12-14 and 28-30), or nonspecific (lanes 15 and 16, and 31 and 32) oligonucleotides is also shown. The triangles above lanes 9-11, 12-14, 25-27, and 28-30 denote increasing concentrations (4, 40, and 400 ×) of the unlabeled oligonucleotide indicated. The competition-binding assays for -2733A/G and -1835C/T were with nuclear extracts from THP-1 and K562 cells, respectively. For -1835C/T oligonucleotides, results similar to K562 were obtained using Jurkat nuclear extracts. NF, nuclear factor; NS, nonspecific; oligo, oligonucleotide.

Functional Effects of Variation in a cis-Regulatory Region of Human CCR5: Altered DNA/Nuclear Factor Binding-- Another potential mechanism by which polymorphisms in the cis-region of CCR5 haplotypes could influence CCR5 expression is by binding to distinct nuclear factors or altering the avidity of binding to a transcription factor. This might also provide a reason for the haplotype-specific differences in transcriptional activity. Computer algorithms suggested that the cis-regions surrounding the SNPs that delimit the major human haplogroups could potentially bind to distinct transcription factors (Table II). We used EMSA and supershift assays to determine whether polymorphisms at -2733, -2554, -2459, -2135, -2132, -2086, or -1835 bound nuclear proteins with differing avidity or whether a particular polymorphism bound a novel transcription factor. Table III summarizes the findings of these studies.

                              
View this table:
[in this window]
[in a new window]
 
Table II
Transcription factors predicted to bind cis-regions that encompass human CCR5 SNPs
Two computer algorithms (TESS and MatInspector program) were used to predict the transcription factors that could potentially bind to the SNPs in a cis-regulatory region (-2761 to -1835) of human CCR5.

                              
View this table:
[in this window]
[in a new window]
 
Table III
Summary of EMSA findings for cis-regions spanning SNPs at CCR5 -2733, -2554, -2459, -2135, -2132, -2086, and -1835
Oligonucleotides that encompassed different SNPs (Table I) in a cis-regulatory region of CCR5 (-2761 to -1835) were incubated with nuclear extracts derived from different cell lines or PHA-stimulated human PBMCs (PHA-blasts). Whether a SNP resulted in the differential (diff.) nuclear factor binding and/or altered affinity of nuclear factor binding is indicated. Results of supershift assays and the competition experiments with oligonucleotides containing the consensus binding sequences for the indicated nuclear factors are also shown. The single letters within the parentheses indicate the cells from which the nuclear extracts were prepared and used in EMSAs (J, Jurkat; K, K562; T, THP-1; and P, PHA blast). + indicates that competition or supershift was present; - indicates that competition or a supershift was absent. NT indicates not tested. "Identical binding" indicates that the nuclear factor binding pattern of the two polymorphic oligomers to nuclear factors in a given cell line or in PHA-blasts was similar. Note, the binding pattern among the cell lines or between the cell lines and the PHA-blasts was not necessarily identical.

We found that SNPs in human CCR5 cis-regulatory sequences were associated with three different kinds of alterations in DNA/nuclear factor binding. First, gain-of-nuclear factor binding: radiolabeled -2733G but not -2733A oligomers (Table I) bound to a novel nuclear factor (NF1) in three cell human cell lines (Fig. 4b and Table III). When using -2733G, two additional bands were found below NF1 in Jurkat cells, and it was difficult to exclude the possibility that one of these bands corresponded to that observed in EMSAs with -2733A (Fig. 4b). These two bands were also found in EMSAs with the -2733G oligomer in K562 or THP1 cells; however, the intensity of these bands was significantly lower than that found in Jurkat cells (Fig. 4b, compare lane 8 with lanes 6 and 7). Only NF1 is highlighted in Fig. 4b because this gain-of-nuclear factor binding was consistently present in the three cell lines tested.

Second, loss-of-nuclear factor binding: radiolabeled -1835C oligomer bound specifically to two novel nuclear factors (NF2 and NF3), but in contrast the radiolabeled -1835T oligomer bound to only one of these factors (NF2; Fig. 4b and Table III). The radiolabeled -1835C and -1835T oligomers also bound an additional nuclear factor that appears as a band below NF3, and this band is competed weakly by unlabeled homologous or heterologous oligomers but not by a nonspecific oligomer (Fig. 4b). Notably, the altered DNA/NF1-NF3 binding was observed in human cell lines but not in PHA-blast cells, suggesting that this effect may be cell type- and/or cell activation-specific.

Finally, a SNP in human CCR5 was associated with altered avidity of nuclear factor binding. This observation was illustrated by the altered binding pattern of several members of the NF-kappa B family of transcription factors to the cis-region that encompasses the human CCR5 G-2554T polymorphism (Fig. 5a). Radiolabeled -2554G and -2554T oligomers each bound to four nuclear factor complexes present in nuclear proteins derived from human PHA blasts (NF4-NF7; Fig. 5b). In homologous competition assays, unlabeled -2554G and -2554T oligomers competed for the NF4-NF7 bound to the radiolabeled -2554G and -2554T oligomers, respectively, and the efficiency for competition was NF6 = NF7 > NF4 = NF5 (Fig. 5b, compare intensity of bands in lanes 2 and 3; and data not shown for -2554G competition). By densitometric analysis, the intensities of the NF5 and NF4 complexes that bound to the -2554T probe were 198 ± 28.13 and 371 ± 42% greater than the intensities of these complexes that bound to the -2554G probe, respectively (n = 5 separate experiments using nuclear extracts from the same donor). Notably, this altered DNA/NF-kappa B binding was observed in nuclear extracts from human PHA blasts but not from unstimulated Jurkat T-cells, again highlighting that these altered DNA/nuclear protein interactions to human CCR5 SNPs are likely to be cell type- and/or cell activation state-specific.


View larger version (41K):
[in this window]
[in a new window]
 
Fig. 5.   Differential binding of NF-kappa B/Rel family members and two as yet unidentified nuclear factors to polymorphisms at human CCR5 -2554. a, the cis-region spanning human CCR5 -2554 and the homologous region in non-human primates share sequence similarities with the consensus binding site for members of the NF-kappa B/Rel family of transcription factors. CCR5 -2554G is found in HH-A, -E, -F, and -G and non-human primates. -2554T is found in HH-B, -C, and -D. Boxes highlight differences among the different primates. b, EMSA and competition experiments were performed with nuclear extracts from PHA-stimulated human PBMCs with radiolabeled oligomers that contain either -2554G or -2554T (Table I). Both the radiolabeled -2554G (lane 1) and -2554T (lane 2) oligomers bound to four distinct nuclear factor complexes (NF4-NF7); however, the -2554T oligomer bound to these factors with greater avidity than the -2554G oligomer (compare intensity of bands in lanes 1 and 2; see "Results" for densitometric analysis). The unlabeled -2554T oligomer but not an unrelated oligomer competed efficiently and specifically for the binding of NF4-NF7 bound to the radiolabeled -2554T oligomer (lane 3, and data not shown). The unlabeled -2554G oligomer also competed efficiently and specifically for the binding of radiolabeled -2554G oligomer (data not shown). Lanes 4 and 5 show the competition with excess of the unlabeled consensus NF-kappa B-binding oligomer. c, supershift experiments to determine the identity of the nuclear proteins bound by the -2554T oligomer in PHA-stimulated human PBMCs. Radiolabeled -2554T or consensus NF-kappa B oligomers (Table I) were incubated with extracts from human PHA-blasts. The complexes were then electrophoretically resolved after addition of the antibodies to the proteins indicated. No interaction between the nuclear protein complexes and an isotype control antibody was observed (data not shown). Note, complexes equivalent to NF6 and NF7 did not bind to the radiolabeled NF-kappa B oligomer, and the intensity of the NF4 and NF5 bands is greater in lane 10 versus lane 1. d, efficiency of competition of unlabeled -2554T and -2554G oligomers for NF-kappa B family of transcription factors. The nuclear proteins in PHA blasts that bound a radiolabeled consensus NF-kappa B-binding oligomer was competed with excess unlabeled NF-kappa B (lanes 2-5), -2554G (lanes 6-9), or -2554T (lanes 10-13) oligomers. An equimolar amount of unlabeled -2554T oligomer was much more efficient in competing NF-kappa B/Rel family of proteins than the -2554G oligomer (compare lanes 6-9 versus 10-13). However, compared with the unlabeled consensus NF-kappa B oligomer (lanes 2-5), both the -2554G (lanes 6-9) and -2554T (lanes 10-13) oligomers were significantly less efficient in competing for NF-kappa B proteins.

Because the region encompassing human CCR5 G-2554T was predicted to bind to members of the NF-kappa B family of transcription factors (Table II and Fig. 5a), we examined the ability of an unlabeled oligomer that contains a canonical NF-kappa B-binding site to compete for the NF4-NF7 bound to the labeled -2554G or -2554T oligomers. The unlabeled consensus NF-kappa B-binding oligomer competed for the NF4-NF7 bound to the labeled -2554G or -2554T oligomers, but the efficiency of competition was different than that for unlabeled -2554G/-2554T and was NF5 > NF4 = NF7 > NF6 (Fig. 5b; compare intensity of bands in lanes 4 and 5 to lanes 2 and 3, and data not shown for competition to -2554G).

It was not clear from these EMSA competition assays (Fig. 5b) whether each or only a subset of the nuclear complexes that bound to the labeled -2554G and -2554T oligomers, i.e. NF4-NF7, contained components of the NF-kappa B/c-Rel family of nuclear factors. Thus, it was important to compare directly the nuclear protein binding pattern of radiolabeled -2554T or -2554G to that of the radiolabeled consensus NF-kappa B-binding oligomer (Table I; Fig. 5c). The radiolabeled consensus NF-kappa B-binding oligomer bound to a complex of factors that appeared to include all or portions of NF4 and NF5 but not NF6 or NF7 (Fig. 5c, compare lanes 1 and 10). In contrast, both radiolabeled -2554T oligomer (Fig. 5c) and -2554G (data similar to Fig. 5b, lane 1) bound NF4-NF7. Notably, the consensus NF-kappa B-binding oligomer bound to NF-kappa B family members with greater avidity than the -2554T oligomer (Fig. 5c, compare intensity of bands in lanes 1 and 10). These findings suggested that the cis-regions surrounding human CCR5 G-2554T bound to various members of the NF-kappa B proteins as well as to two nuclear proteins (NF6-NF7) that are not bound by an oligomer that contains a consensus NF-kappa B-binding site.

To confirm that NF4 and NF5 but not NF6 or NF7 contain members of the NF-kappa B family of transcription factors, we conducted supershift assays with antibodies to p65, p50, c-Rel, RelB, and p52. By supershift assays, only p50 was detected in the NF4 complex bound to radiolabeled -2554T oligomer, whereas parts of NF5 were supershifted by antibodies to p65, p50, and c-Rel (Fig. 5c, lanes 1-9). Note that the complex supershifted by c-Rel antibody comigrates with NF7. Overall, the pattern of the supershifted bands indicated that NF4 and NF5 bound to the radiolabeled -2554T oligomer contained p65 homodimers, p50/p65 heterodimers, p50 homodimers, and c-Rel/p50 heterodimers (Fig. 5c, lanes 1-9). No supershift was observed with antibodies to p52 or RelB (Fig. 5c, and data not shown). Notably, antibodies to the NF-kappa B family members did not induce a supershift of NF6 or NF7 bound to the labeled -2554G (data not shown) or the -2554T oligomers (Fig. 5c, lanes 1-9). In control experiments, the identical antibodies were directed against the nuclear proteins that bound to the radiolabeled consensus NF-kappa B-binding oligomer, and a similar supershift pattern was observed (Fig. 5c, lanes 10-18).

Direct evidence that -2554T and -2554G bound to the NF-kappa B family of transcription factor