Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters

  1. Qiang Wu1,
  2. Theresa Zhang2,
  3. Jan-Fang Cheng3,
  4. Youngwook Kim1,
  5. Jane Grimwood4,
  6. Jeremy Schmutz4,
  7. Mark Dickson4,
  8. James P. Noonan4,
  9. Michael Q. Zhang2,
  10. Richard M. Myers4, and
  11. Tom Maniatis1,5
  1. 1Department of Molecular and Cellular Biology, Harvard University, Cambridge, Massachusetts 02138, USA; 2Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA; 3Genome Sciences Department, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; 4Department of Genetics and The Stanford Human Genome Center, Stanford University School of Medicine, Stanford, California 94305, USA

Abstract

The genomic organization of the human protocadherin α, β, and γ gene clusters (designated Pcdhα [gene symbol PCDHA], Pcdhβ [PCDHB], and Pcdhγ [PCDHG]) is remarkably similar to that of immunoglobulin and T-cell receptor genes. The extracellular and transmembrane domains of each protocadherin protein are encoded by an unusually large “variable” region exon, while the intracellular domains are encoded by three small “constant” region exons located downstream from a tandem array of variable region exons. Here we report the results of a comparative DNA sequence analysis of the orthologous human (750 kb) and mouse (900 kb) protocadherin gene clusters. The organization of Pcdhα andPcdhγ gene clusters in the two species is virtually identical, whereas the mouse Pcdhβ gene cluster is larger and contains more genes than the human Pcdhβ gene cluster. We identified conserved DNA sequences upstream of the variable region exons, and found that these sequences are more conserved between orthologs than between paralogs. Within this region, there is a highly conserved DNA sequence motif located at about the same position upstream of the translation start codon of each variable region exon. In addition, the variable region of each gene cluster contains a rich array of CpG islands, whose location corresponds to the position of each variable region exon. These observations are consistent with the proposal that the expression of each variable region exon is regulated by a distinct promoter, which is highly conserved between orthologous variable region exons in mouse and human.

[The sequence data described in this paper have been submitted to the GenBank/EMBL/DDBJ data library under accession nos. AY013756AY013813,AY013873AY013878, AF332005, and AF332006.]

Footnotes

  • 5 Corresponding author.

  • E-MAIL maniatis{at}biohp.harvard.edu; FAX (617) 495 3537.

  • Article and publication are at www.genome.org/cgi/doi/10.1101/gr.167301.

    • Received October 16, 2000.
    • Accepted January 9, 2001.
| Table of Contents

Preprint Server