|
|
||||||||
Regular Article |
/ß-Like Factor Is Encoded by a Germ Cell-Specific Gene Whose Expression Is Up-Regulated with Other General Transcription Factors During Spermatogenesis in the Mouse1
a The University of Texas at Dallas, Department of Molecular and Cell Biology, Richardson, Texas 75080
b The University of Texas Southwestern Medical Center, Departments of Internal Medicine and Pharmacology, Dallas, Texas 75235
ABSTRACT
TFIIA
/ß-like factor (ALF) is a testis-specific counterpart of the large subunit of human general transcription factor TFIIA. Northern analysis shows that ALF mRNA first appears in mouse testis at Postnatal Day 14. Similarly, expression of the general transcription factors TBP, TRF2, TFIIA
/ß, TFIIA
, and TFIIIB90 is also increased beginning at Postnatal Day 14, suggesting that there is a coordinated induction of many general transcription factors during male germ cell differentiation. Analysis of male germ cells separated by Staput sedimentation shows that ALF is present in pachytene spermatocytes and haploid spermatids. In addition, in situ hybridization experiments with adult mouse testis shows that ALF is present in haploid spermatids. Searches of the human genome sequence database using the basic local alignment search tool reveal that the ALF and TFIIA
/ß (GTF2A1) genes are both composed of nine exons, whereas the TFIIA
(GTF2A2) gene is composed of five exons. Furthermore, nucleotide and amino acid comparisons among human and mouse ALF, TFIIA
/ß, and TFIIA
cDNA sequences show that ALF has diverged more rapidly than either TFIIA
/ß or TFIIA
. Finally, the ALF and SBLF (Stoned B-Like Factor) sequences present in the chimeric SALF cDNA are both present on human chromosome 2, and an analysis of the corresponding genes suggests a model for the formation of SALF.
developmental biology, gametogenesis, gene regulation, meiosis, spermatid, testes
INTRODUCTION
Accurate promoter recognition and initiation of mRNA synthesis require the assembly of a preinitiation complex (PIC) composed of the RNA polymerase II holoenzyme and a set of general transcription factors (GTFs; TFIIA, -B, -D, -E, -F, and -H) at the core promoters of class II genes [1, 2]. Recent studies have revealed several tissue-specific GTFs, raising the idea that PICs with distinct core promoter specificity, activator specificity, or both, may be utilized at some promoters.
The mRNAs for several GTFs and GTF-related factors display unique patterns of expression in testis. For instance, the TFIIA
/ß-like factor, ALF, is expressed only in this tissue, while TFIIA
/ß, TFIIA
, and TFIIB are expressed at higher levels [35]. In addition, the TATA-binding protein (TBP) is up-regulated during the early pachytene stage of meiosis, and its mRNA is estimated to be up to 1000-fold higher in haploid spermatids than in other cell types [3, 6, 7]. A TBP-related factor, TRF1, is preferentially expressed in the gonads and central nervous system (CNS) of adult Drosophila, and mutations in this gene result in male sterility [8, 9]. Another TBP-related factor, TRF2 (also called TLF, TRF, TRP, and TLP), has been identified in human, mouse, chicken, Drosophila, and C. elegans [1016]. Like TBP, TRF2 transcripts are also present at substantially higher levels in human and mouse testis [13, 15, 16]. Finally, genes for two transcription elongation factors, SII-T1 and Elongin A2, are expressed only in testis [17, 18]. Together, these observations suggest a unique role for the RNA polymerase II transcription machinery in testis, perhaps for germ cell-specific gene regulation.
In this paper we extend our characterization of the ALF gene, focusing on its structure and regulation. Previous studies have shown that the recombinant ALF protein, in conjunction with the small (
) subunit of TFIIA, can stabilize TBP-DNA interactions and can support basal and activated transcription by RNA polymerase II in vitro [4, 5]. The ALF sequence is also present as part of a less abundant transcript, SALF [4], which contains an upstream sequence from a gene that is similar to the membrane trafficking factor, Stoned B [19], and to the µ-chains of clathrin-associated adaptor complexes [20]. In this report we address three main questions: Do other organisms possess a gene for a testis-specific TFIIA large subunit, and what are the evolutionary relationships among such genes? Is the expression of ALF and other GTFs in testis localized within the germ cells, as is the case for TBP? Finally, what is the molecular basis for the formation of the chimeric human SALF transcript?
MATERIALS AND METHODS
Complementary DNA Cloning
Mouse ALF The 3' end of mouse ALF was isolated in polymerase chain reactions (PCRs) using 4 µl of mouse testis "Marathon" cDNA library (Clontech, Palo Alto, CA) with 10 pmol of primer 2a219 (5'-GACTTGAAGCAGCTCTGGGAAACCAAGG-3') and the library-specific adaptor primer, AP-1 (5'-CCATCCTAATACGACTCACTATAGGGC-3'; Clontech). The 1662-base pair (bp) product was subcloned into pGEM-T Easy (Promega, Madison, WI) and sequenced. The 5' end of mouse ALF was isolated in PCR reactions using the MALF-1 (5'-GACTTTGTAAAGGTGACCAGATG-3') and AP-1 primers, followed by reamplification with the nested primers, MALF-2 (5'-GTGGATCGGATAGCCAGCAAA-3') and AP-2 (5'-ACTCACTATAGGGCTCGAGCGGC-3'). The product, which was about 400 bp, was subcloned into pGEM-T Easy (Promega) for sequencing. These overlapping cDNAs were combined to form the composite mouse ALF sequence. Subsequent PCR reactions using EmALF1 (5'-ATTATGCTAGCGCCTTCATCAACCTGGTGCCCA-3') and EmALF2 (5'-TAGACGAGCTCTTACCACTCAGCTTCACCAATG-3') primers that span the open reading frame verified the existence of the full-length transcript.
Mouse TFIIA
/ß
Mouse TFIIA
/ß was isolated in PCR reactions using 3 µl of mouse testis "Marathon" cDNA library (Clontech) with primers mTFIIAL-1 (5'-ATGTCCCAAGTTGCAAACCGTCC-3') and mTFIIAL-2 (5'-GTGTTGTGTGTGGAAATGGCGAAC-3') based on partial mouse TFIIA
/ß expressed sequence tags (ESTs; GenBank accession numbers AA536742 and AA119905).
Mouse TFIIA
A full-length mouse TFIIA
EST clone (GenBank accession number AA060082) was identified using human TFIIA
as the query.
Rat TFIIA
/ß
Rat TFIIA
/ß was cloned using a combination of reverse transcription-PCR (RT-PCR) and library screening. First-strand cDNA was synthesized from poly(A)+ RNA (Clontech) using a random hexamer p(dN)6 (Roche, Indianapolis, IN). This cDNA was used as a template for PCR with degenerate, inosine-containing primers rIIA-1 (5'-GAIGAIGTIATIAAIGAIGTICGIGAIIKITTYYTIGAIGAIGGIGTIGAIGAICA-3') and rIIA-2 (5'-TTCATIATICCRTCYTTIAIITIRAAYTTCCA-3'). The product of this reaction was subcloned into pCR II (Invitrogen, Carlsbad, CA) and used to probe a rat liver 5'-Stretch Plus
gt11 cDNA library (Clontech). A full-length clone was isolated, subcloned into pBluescript II KS+, and sequenced.
Rat TFIIA
Rat TFIIA
was cloned using the insert of pRSEThp12 [21] as a probe to directly screen a rat liver 5'-Stretch Plus
gt11 cDNA library.
Human TRF2 A sequence related to TBP (GenBank accession number W07871) was identified by a basic local alignment search tool (BLAST) search using human TBP as the query. This clone was obtained from Research Genetics (Huntsville, AL) and sequenced. The clone is 1286 bp long, including a 36 nt poly(A) tail, and encodes a 186-amino acid protein that is about 70% similar to human TBP. The clone is identical to TRF2 [13, 15, 16].
Genomic PCR
Human genomic sequences flanking the junction sites in the ALF and SALF cDNAs were isolated using genomic PCR as described in the GenomeWalker system (Clontech). PCRs were performed with the appropriate combination of adaptor primers (Clontech) AP-1 (5'-GTAATACGACTCACTATAGGGC-3') and AP-2 (5'-ACTATAGGGCACGCGTGGT-3'), ALF-specific primers ALF-1 (5'-GCTGGAGGTGCTGTCATGGC-3') and ALF-4 (5'-GTGCTGTCATGGCCTGCCTCAA-3'), and SBLF-specific primers hSALF-1 (5'-GTGATGTCCAGCCACAGAAACA1-3'), hSALF-2 (5'-GGTCAGGTCTCTGGGAGTGGAG-3'), hSALF-3 (5'-CCTTCAATTACATCTTCAATTACA-3'), and hSALF-4 (5'-AGCAAATAGATTCCGAACTCCTTC-3'). PCRs were typically performed using Elongase (Life Technologies, Grand Island, NY) with 5 ng of genomic library DNA, and products were subcloned into pGEM-T Easy (Promega) for sequencing.
The human chromosomes on which ALF, SBLF, and TFIIA
/ß sequences are located were determined by genomic PCR of polychromosomal somatic cell hybrid DNAs (Q Biogene, Carlsbad, CA) according to the manufacturer's instructions. Primers hALF-6 (5'-TCCACTCAAACCCGCTCCATCT-3') and hALF-7 (5'-ACAAAAGGTGCAGGGTTCGGCT-3') were used to amplify a 270-bp ALF-specific product. Primers hSALF-1 and hSALF-5 (5'-TCTAGGGCAAATCTCACCACAC-3') were used to amplify a 173-bp SBLF-specific product. Primers hTFIIAL-5 (5'-TGGATTTCATTCAGAAGAGCAGCAGCT-3') and hTFIIAL-8 (5'-GATGCATTCATATGCTGTATCAACTTAGAATCTG-3') were used to amplify a 2.5-kilobase (kb) TFIIA
/ß-specific product that spans a 2.2-kb intron. DNA templates that produced PCR products were then compared with the chromosome complement of the somatic cell hybrid panel.
Genome Database Search and Analysis
Human ALF gene A search of the high throughput genomic sequence (htgs) database at the National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/) revealed a bacterial artificial chromosome (BAC) clone from the Washington University Genome Sequencing Center that contains unordered human genomic sequences (chromosome 2) that encodes the ALF gene (GenBank accession number AC073082).
Human SBLF gene A search of the htgs database revealed that the BAC clone (AC073082) also contains a complete Stoned B-Like Factor (SBLF) gene.
Human TFIIA
/ß gene
A search of the htgs database revealed a BAC from the Genoscope Sequencing Center that contains unordered human genomic sequences (chromosome 14) that encodes the TFIIA
/ß gene (GenBank accession number AL136040).
Human TFIIA
gene
A search of the htgs database revealed a BAC from the Washington University Genome Sequencing Center that contains unordered human genomic sequences (chromosome 15) that encodes the TFIIA
gene (GenBank accession number AC022480). A putative processed TFIIA
pseudogene was also identified in sequences from chromosome 8 (GenBank accession number AF252825). This TFIIA
-like sequence contains an Alu element in the 5'-UTR, and the open reading frame is not continuous.
Northern Analysis
Multiple tissue Northern blots containing polyA+ RNAs from six different tissues of rat and mouse were obtained from Origene. RNAs from rat reproductive tissues (Zivic-Miller, Zelienople, PA), and from the testes of CD-1 mice (Charles River Laboratories, Wilmington, MA) isolated at 6, 10, 12, 14, 16, 18, 20, 22, 24, 2730, and 40 days after birth were prepared using the method of Chomczynski and Sacchi [22].
Agarose gels (1.5%) for Northern blots were prepared in 1x MEA (20 mM MOPS, 8 mM Na acetate, 1 mM EDTA), and 2.2 M formaldehyde. RNAs (15 µg) were transferred overnight onto Zeta-probe nylon membranes (BioRad, Hercules, CA) and irradiated for 55 sec at 100 mJ/cm2 using an ultraviolet cross-linker (Fisher, Pittsburgh, PA). Filters were prehybridized in 5 ml/cm2 ExpressHyb solution (Clontech) at 68°C for 30 min and hybridized with 10 x 106 cpm/ml of [
-32P]dCTP-labeled DNA probe at 68°C for 1 h, or with 10 x 106 cpm/ml of [
-32P]ATP-labeled oligonucleotides. Probes were prepared by random primer labeling of DNA fragments using the Ready-to-Go system (Amersham Pharmacia Biotech, Uppsala, Sweden) or, for oligonucleotides, by T4 kinase labeling with [
-32P]ATP. Following hybridization, membranes were washed in 2x SSC and 1.0% SDS followed by 0.1x SSC and 0.5% SDS, and were exposed for several hours to a PhosphorImager screen (Molecular Dynamics, Sunnyvale, CA) or for 17 days with XAR-5 film (Kodak, Rochester, NY). For developmental expression studies, several identical blots were prepared so that probes could be used in parallel, and some were stripped and reprobed.
The probes for Northern experiments were as follows: hALF, a 447-bp HincII-HindIII fragment or an 895-bp HincII-BglII fragment; mALF, a 345-bp PstI fragment or a 504-bp PstI-EcoRI fragment; mTFIIA
/ß, a 315-bp PstI fragment; TFIIA
, a 355-bp NdeI-BamHI fragment; TRF2, a 607-bp EcoRI-NotI fragment; TFIIIB90, (5'-ATACTTCATGGTTCTTCTCCCCAAACTCCAGCAGATGAGCAAAGCGTGGAATGTA-3'); TBP, a 197-bp PvuII fragment; Protamine-1, (5'-GAAGTCTGGTAAAATTCTCACGCAGGAGTTTTGATGGACTTGCTATTCTGTGCAT-3'); and a partial ß-actin cDNA.
Staput Cell Separation
The EDTA-Trypsin method for the isolation of male germ cells by Staput sedimentation was used with minor modifications [2325]. Testes from two adult CD-1 mice were minced in 10 ml of PBS using a single razor blade. Cells were filtered through an 80-mesh screen and incubated with 0.25% EDTA-trypsin (Life Technologies) and 20 µg/ml of DNase I (Sigma, St. Louis, MO) for 10 min at room temperature. Cells were then centrifuged at 500 x g for 20 min and resuspended at 2 x 106 cells/ml in PBS containing 20 µg/ml of DNase, 0.002% purified soybean trypsin inhibitor (Sigma), and 0.5% BSA (Fisher). The final volume was between 20 and 30 ml.
Cells were loaded into a 12.5-cm-diameter Staput chamber (Kimble/Kontes, Vineland, NJ) and separated over an 800 ml linear gradient of BSA (1%4%) for 3-4 h. Fractions (2040 ml) were collected over a period of 1 h. Aliquots from each fraction were fixed with Bouins fixative, stained with periodic acid-Schiff base, counterstained with hematoxylin, and counted. Fractions 4, 9, and 20 were enriched in pachytene spermatocytes, round spermatids, and elongating spermatids, respectively, and were used to prepare RNA.
Reverse Transcription-Polymerase Chain Reaction
Total RNA from isolated germ cell fractions was analyzed using the Advantage RT-for-PCR kit (Clontech). First-strand cDNA was synthesized from total RNA using MMLV reverse transcriptase at 42°C for 1 h. PCRs (22-30 cycles) were performed with first-strand cDNA that had been normalized for the production of similar levels of a 944-bp ß-actin product using the mActn-11 (5'-AGGACTCCTATGTGGGTGACG-3') and mActn-12 (5'-CTCATCGTACTCCTGCTTGCT-3') primers. Primers for a mouse TFIIA
/ß product (654 bp) were mTFIIAL-1 (5'-GTGTTGTGTGTGGAAATGGCGAAC-3') and TFIIAL-4 (5'-TCCAGGAAGTGGGGCTAAGA-3'). Primers for a mouse ALF product (904 bp) were mALF-3 (5'-GTTTTACGCCGGAAGACCTGA-3') and mALF-5 (5'-GTCCTCGTTGTCGCTGCTA-3'). Primers for a protamine-1 product (136 bp) were mPrtm-2 (5'-AGATACCGATGCTGCCGCAGC-3') and mPrtm-4 (5'-CACCTTATGGTGTATGAGCGGC-3').
In Situ Hybridization
In situ hybridizations were performed using [
-35S] uridine 5'-triphosphate (UTP)-labeled antisense and sense RNA probes prepared from human or mouse ALF cDNA constructs. A 447-bp HincII-HindIII fragment from human ALF was subcloned into pBluescript II KS+; the antisense probe was prepared using T7 polymerase and the sense probe was prepared with T3 polymerase. A 345-bp PstI fragment from mouse ALF was subcloned into pBluescript II KS+; the antisense probe was prepared using T3 polymerase and the sense probe was prepared with the T7 polymerase.
Testes of adult mice were perfused with 4% paraformaldehyde, embedded in paraffin, and sectioned at 7 µm. Sections were deparaffinized and hybridized with the labeled sense or antisense transcripts using reagents in the SureSite II kit (Novagen, Milwaukee, WI) according to the recommended protocol. Exposure times were generally between 2 to 4 wk. Counterstaining was with methyl green or hematoxylin/eosin.
RESULTS
Expression of a Testis-Specific ALF Gene in Rodents
To determine whether rodents express ALF we examined RNA from various mouse and rat tissues. Northern analysis revealed a 1.8-kb mRNA species present in mouse testis (Fig. 1A, lane 5), but not in brain, heart, kidney, lung, or skin (lanes 14, 6). Likewise, rat testis RNA contained a 1.8-kb mRNA species (Fig. 1B, lane 5) that was not present in other tissues tested (lanes 14, 6). We also examined whether ALF was expressed in other male reproductive tissues. The results (Fig. 1B) show that ALF is not present in RNA from rat seminal vesicles (lane 7), vas deferens (lane 8), or epididymis (lane 9). Thus, rodents possess an ALF gene that expresses a testis-specific mRNA similar to that observed in humans. A partial ALF cDNA sequence has also been identified from pig (GenBank accession number AW416121), suggesting that ALF is present in other higher organisms. In contrast, a TFIIA
/ß-like factor was not identified in database searches of the Saccharomyces cerevisiae, C. elegans, or D. melanogaster genomes (data not shown). It is currently unknown whether TFIIA
/ß or ALF is orthologous to the single TFIIA large subunit present in these species.
|
Cloning of Rodent ALF, TFIIA
/ß, and TFIIA
cDNAs
Using a mouse testis cDNA library, we performed PCRs to isolate overlapping cDNAs that encode mouse ALF. The full-length composite sequence is 1614 bp, including a 27-bp poly(A) tail. The 1407-bp open reading frame encodes a 468-amino acid protein with a predicted mass of 51.5 kDa. A rodent counterpart of the chimeric human SALF cDNA was not detected in Northern analysis or in preliminary PCR experiments (data not shown).
We also isolated TFIIA
/ß subunit cDNAs from both mouse and rat. A mouse TFIIA
/ß cDNA was isolated from a mouse testis cDNA library using PCR. The full-length, 1227-bp clone predicts a protein of 378 amino acids with a mass of 41.5 kDa. A rat TFIIA
/ß cDNA was isolated using both RT-PCR and direct hybridization of a rat liver cDNA library. This 1212-bp clone predicts a protein of 377 amino acids with a mass of 41.6 kDa.
Mouse and rat TFIIA
cDNAs were also isolated. A full-length mouse TFIIA
was identified in EST database searches using human TFIIA
as the query. This clone predicts a 109-amino acid protein with a mass of 12.5 kDa. The rat TFIIA
cDNA was isolated by direct hybridization of a rat liver cDNA library. This 659-bp cDNA encodes a polypeptide identical to that of mouse TFIIA
. In contrast, human TFIIA
has a single amino acid variation at position 46 (alanine vs. serine).
Comparison of ALF and TFIIA
/ß Amino Acid Sequences
An alignment between the amino acid sequence of human ALF with mouse ALF cloned in this report reveals an overall similarity of 77% (Fig. 2A and Table 1). However, the N-terminal 60 amino acids (region I as defined in [29]) and C-terminal 62 residues (region IV) are 92% and 100% similar, respectively (region III is mainly conserved with respect to overall negative charge and is not included in this analysis). The conservation of regions I and IV reflects the importance of these domains for subunit-subunit interactions and TFIIA activity.
|
|
Region II from human and mouse ALF, defined here as residues 61339, is only 68% similar. Moreover, of the 100 amino acid substitutions within region II, only 13 were conservative changes, suggesting this sequence is under weak selective pressure. Related, alanine scanning mutations and deletions within region II of the yeast toa1 gene had little effect on TFIIA activity in vitro or in vivo [26]. These observations support the idea that region II is a flexible spacer, and imply that it is the highly conserved TFIIA
subunit that dictates the proximity and disposition of the N- and C-terminal ends of the large subunit [27, 28]. It is still possible, however, that region II could have a gene- or activator-specific role that may be related to its divergent sequence.
An alignment between the amino acid sequence of human TFIIA
/ß [29, 30] with the rat and mouse TFIIA
/ß subunit cDNAs cloned in this report is shown in Figure 2B. The comparison shows an overall similarity of 97%, with regions I and IV between 97100% similar. In contrast to ALF, region II from human and mouse TFIIA
/ß (residues 61273) is also very similar (95%). An alignment between the amino acid sequence of human TFIIA
[21, 31, 32] with the mouse and rat TFIIA
subunits reported here is shown in Figure 2C. These sequences are 99% similar (Table 1). Finally, a comparison between human and mouse TBP shows 98% similarity, while human TRF2 is identical to mouse TRF2 (Table 1).
Rapid Evolution of the Human and Mouse ALF Genes
To determine the rate of synonymous substitutions (those that do not result in an amino acid replacement) per synonymous site (KS) and nonsynonymous substitutions (those that do result in an amino acid replacement) per nonsynonymous site (KA) between the human and mouse ALF, TFIIA
/ß, and TFIIA
sequences, we used the MEGA program [33]. This analysis is presented in Table 1.
With respect to KS we find the following patterns: 1) Of the three sequence pairs examined (ALF, TFIIA
/ß, and TFIIA
), the highest substitution rate was seen with ALF (KS = 0.99 ± 0.10) and the lowest rate with TFIIA
(KS = 0.24 ± 0.064). 2) There is an apparent decrease in KS within the ALF sequence itself in the order of region I > region II > region IV, with a high value of 1.42 ± 0.51 (region I) and a low value of 0.52 ± 0.16 (region IV). Although the standard deviation in some of these calculations is relatively large due to the short sequence lengths and high degree of divergence, the data indicate that the rate of synonymous substitutions (KS) for ALF is, on average, twofold higher than the rate observed with TFIIA
/ß.
With respect to KA we find the following patterns: 1) Of the three sequences, the highest substitution rate was again seen with ALF (KA = 0.16 ± 0.013) and the lowest rate with TFIIA
(KA = 0.004 ± 0.004). 2) There is a decrease in KA within the ALF sequence itself in the order of region II > region I > region IV, with a high value of 0.23 ± 0.022 (region II) and a low value of 0.021 ± 0.012 (region IV). The KA/KS ratio is much higher in region II of ALF (0.24) compared with region I (0.035) and region IV (0.040), or with TFIIA
/ß (0.033). The relatively high KA value in region II of ALF suggests relatively weak selection against nucleotide substitutions that cause amino acid changes.
In a comparison of 47 human and mouse genes [34] an average rate of 3.51 x 10-9 synonymous substitutions per site per year (rS) was calculated (assuming a divergence time of 80 million years), and the values for individual genes ranged from 1.57 x 10-9 to 6.39 x 10-9. An average rate of 0.74 x 10-9 nonsynonymous substitutions per site per year (rA) was also calculated and individual values ranged from 0.00 x 10-9 to 3.06 x 10-9. In comparison, the rates calculated for TFIIA
/ß were about average (rS = 3.3 x 10-9 and rA = 0.11 x 10-9), whereas the rates for ALF were slightly higher (rS = 6.2 x 10-9 and rA = 1.0 x 10-9).
These data show that the human and mouse ALF sequences, and region II in particular, are more divergent than TFIIA
/ß, TFIIA
, TBP, or TRF2. Although the physiological significance of this observation is currently unknown, the results indicate that the ALF gene has experienced a more rapid mutation rate, a reduced selection pressure, or both. Interestingly, other genes whose products are related to male reproductive function also diverge rapidly (see Discussion).
Genomic Structures of Human ALF, SBLF, TFIIA
/ß, and TFIIA
Using ALF as a query of the htgs database at NCBI, a BAC clone that contained a complete ALF gene was identified. This gene is composed of nine exons (exon 1, 5'-UTR and aa 17; exon 2, aa 841; exon 3, aa 4282; exon 4, aa 83101; exon 5, aa 102129; exon 6, aa 130326; exon 7, aa 327413; exon 8, aa 414443; and exon 9, aa 444478 and 3'-UTR; Fig. 2A).
Surprisingly, the clone noted above also contains a human Stoned B/µ-chain-like gene composed of four exons (exon 1, 5'-UTR; exon 2, aa 1643; exon 3, aa 644711; and exon 4, aa 712736 and 3'-UTR). We denote this gene as Stoned B-Like Factor, or SBLF. The first three exons of the SBLF gene are identical to those present at the 5' end of SALF. However, several lines of evidence suggest that the normal transcript from this gene is not SALF, but is a 6.5-kb transcript reported earlier (RNA 6.5) [4]. Partial versions of this transcript have been identified in previous cloning experiments [4], by BLAST searches of human ESTs (e.g., GenBank accession number AI734225), and by PCR isolation of mouse SBLF clones from a mouse embryonic cDNA library (data not shown). Together with an examination of the SBLF-containing BAC clone, the results of these experiments show that SBLF encodes a 736-amino acid protein whose C-terminus extends 24 amino acids (VEIEKKWIKIDGEDPDKIGDCITQ) beyond the point at which SBLF and ALF are joined in SALF.
A gene for TFIIA
/ß was also identified in the htgs database. Like ALF, this gene also contains nine exons (exon 1, 5'-UTR and aa 110; exon 2, aa 1144; exon 3, aa 45112; exon 4, aa 113134; exon 5, aa 135159; exon 6, aa 160204; exon 7, aa 205311; exon 8, aa 312341; and exon 9, aa 342376 and 3'-UTR; Fig. 2B). Several exons divide the conserved regions of the ALF and TFIIA
/ß proteins at similar points; namely, exons 12, 23, 78, and 89 (Fig. 2, A and B). It is also important to note that the 5'- and 3'-UTRs have not been defined in the SBLF and TFIIA
/ß genes because cDNA clones for the full-length (6.07.0 kb) SBLF and TFIIA
/ß mRNAs have not been isolated. Thus the numbering of exons in these two genes is tentative.
A complete gene for TFIIA
was also identified, and is composed of five exons (exon 1, 5'-UTR; exon 2, 5'-UTR and aa 124; exon 3, aa 2559; exon 4, aa 60101; and exon 5, aa 102109 and 3'-UTR; Fig. 2C). The intron-exon junctions identified in all four genes are flanked by splice donor (GT) or splice acceptor (AG) sequences, as expected (data not shown).
Up-Regulation of GTF Expression in Mouse Testis
In mammals, particular types of male germ cells initially appear at specific times during postnatal development [35]. In mice, for example, spermatogonia appear at postnatal Day 6, primary spermatocytes between postnatal Days 10 to 18, secondary spermatocytes at Postnatal Day 18, and haploid round spermatids at postnatal Day 20, followed by a 2-wk maturation process that yields mature spermatozoa. To determine whether ALF is expressed during germ cell differentiation, and to compare its pattern of expression with other GTFs, we examined testis RNA prepared from mice that were between 6 and 40 days old.
Northern blotting experiments using an ALF-specific probe showed no expression up to and including Day 12 (Fig. 3, lanes 13). This result suggests that ALF is not expressed in spermatogonia or in somatic cells of the testis. Rather, ALF expression was first detected on Postnatal Day 14 (lane 4), and mRNA levels continued to increase through Day 40 (compare lane 4 to lane 11). Expression at Day 14 coincides with the appearance of primary spermatocytes in the pachytene stage of meiosis [35], and the increase in RNA at later time points suggests continued expression in haploid spermatids.
|
We tested several other GTFs in this assay, including TFIIA
/ß, TFIIA
, TBP, and TRF2. These genes, unlike ALF, are expressed as early as Day 6 (Fig. 3A, lanes 13), probably because their gene products are required in all somatic cells, including those of the testis. However, the RNA level for each of these genes was distinctly elevated on Postnatal Day 14 (lane 4), and increased thereafter (lanes 5-11). In the case of TFIIA
/ß, two transcripts were detected (6.5 and 2.0 kb), possibly due to testis-specific transcription initiation or RNA processing. TBP shows expression during Days 6-12 (Fig. 3A, lanes 13) and is increased dramatically from Day 14 onward (lanes 411), in agreement with previous reports [3, 6]. Two TBP-related RNA species were observed (1.8 and 1.7 kb), possibly due to the use of alternative initiation sites [36]. In addition, we observed that an RNA polymerase III factor, TFIIIB90, was expressed during Days 6-12 (lanes 13) and was also increased from Day 14 onward (lanes 411). RT-PCR experiments revealed that the RNA polymerase I factor, TAFI 48, was also up-regulated between Days 12 and 14, whereas UBF was constant from Day 6 to Day 40 (data not shown). Overall, these results reveal a simultaneous and apparently coordinated increase in many GTFs during male germ cell differentiation, and include components from all three eukaryotic RNA polymerase systems.
To show that similar amounts of RNA were loaded onto these gels we normalized the blots using ß-actin (Fig. 3B). The results show equivalent levels of a 2.1-kb transcript in all lanes, with a smaller 2.0-kb band appearing from Day 24 onward (lanes 9-11). We also used a protamine-1 probe to demonstrate regulation of a gene expressed later in germ cell differentiation [37]. The results show a characteristic increase in expression of a 0.6-kb transcript from Days 22 to 24 through Day 40 (Fig. 3B, lanes 811).
Expression of ALF and TFIIA
/ß in Male Germ Cells
Northern blotting analysis of ALF expression during postnatal testis development suggested that ALF is a germ cell-specific counterpart of TFIIA
/ß. It remained possible, however, that ALF was actually expressed in somatic cells of the testis in a temporal pattern that paralleled the development of haploid germ cells. To identify specific cell types that express ALF and TFIIA
/ß in adult testis, we analyzed expression in germ cell fractions separated by Staput cell sedimentation [2325]. The amount of first-strand cDNAs used in these experiments was normalized based on the results of RT-PCRs using ß-actin-specific primers (Fig. 3C, lower panel, lanes 15). PCRs with ALF-specific primers were then used to detect ALF expression in adult whole testis (Fig. 3C, lane 1) pachytene spermatocytes (lane 3), round spermatids (lane 4), and elongating spermatids (lane 5), but not in whole testis isolated at Postnatal Day 6 (lane 2). Likewise, PCRs with TFIIA
/ß-specific primers showed that TFIIA
/ß is also expressed in pachytene spermatocytes and spermatids (lanes 35), a result that suggests that ALF and TFIIA
/ß could be present in the same cells. Finally, reactions with protamine-1-specific primers were used to verify the characteristic expression of this gene in round and elongating spermatids (lanes 4 and 5).
In Situ Hybridization Analysis of ALF Expression in Mouse Testis
In situ hybridization was used to directly visualize the sites of ALF expression in testis. In these experiments, ALF mRNA was not detected in somatic cells of the testis or in spermatogonia. Instead, the antisense probe gave a strong signal in elongated spermatids of the seminiferous tubules at stages IV to VIII (Fig. 4). Certain tubules gave no signal (yellow arrowheads), whereas tubules containing round and elongating spermatids gave a more punctate signal (white arrows). This is presumably because tubules within a given section contain germ cells at distinct stages of development. No signal was detected in testis harvested at postnatal Day 14 (data not shown), suggesting that the level of expression at this time, although detectable by Northern analysis (Fig. 3), is too low to be seen by in situ hybridization. Neither the human nor the mouse sense probes detected ALF expression (Fig. 4 and data not shown). Collectively, these studies indicate that ALF and TFIIA
/ß mRNAs are present in meiotic and postmeiotic germ cells in the testis.
|
Chromosomal Location of ALF, SBLF, and TFIIA
/ß
To determine the chromosomal location of ALF, SBLF, and TFIIA
/ß, genomic PCR was performed on a panel of human-hamster and human-mouse chromosomal hybrids. The ALF-specific primers amplified a 270-bp product in the SM852 cell line (Fig. 5A, top panel, lane 15), while the SBLF-specific primers amplified a 173-bp product (Fig. 5A middle panel, lane 15). As SM852 contains only human chromosome 2, both genes must be on this chromosome. These results are consistent with the assignment of ALF to the short arm of chromosome 2 between markers D2S119 and D2S337 (Locus ID 11036), and with our identification of the ALF and SBLF genes in a human chromosome 2 database.
|
Experiments performed with TFIIA
/ß-specific primers produced a 2.7-kb product derived from hamster DNA and a 2.5-kb human-specific fragment that appeared only in somatic cell hybrids that contained human chromosome 14 (Fig. 5A, lower panel, lanes 9, 11, 12, 16, 17, and 18). Again, this result is consistent with our identification of the TFIIA
/ß gene in a human chromosome 14 database.
Model for the Formation of SALF
SALF is a chimeric human transcript [4] that contains sequences that are identical to the SBLF and ALF genes. Because of its unusual chimeric structure, we wanted to evaluate how the SALF transcript might be formed. To address this question, we cloned genomic DNA adjacent to each of the three adjoining cDNA sequences shown in lines 1 and 2 of Figure 5B. As shown in lines 35, there is a splice donor (GT; lines 3 and 5) or a splice acceptor (AG) (line 4) site at the point at which the genomic and cDNA sequences meet. These results indicate that the junctions in the ALF and SALF cDNAs are defined by the boundaries of corresponding exons (lines 35). The sequences identified in these experiments match the genomic sequences identified in database searches. Interestingly, the intron sequence near the junction in line 4 resembles an Alu-type repetitive element, and contains a (TAAA)10 repeat [38].
Based on these results, together with the chromosomal localization and genomic sequence analysis of SBLF and ALF, we propose a model for SALF formation. According to the model, splicing of a putative pre-mRNA transcript that contains both genes forms the chimeric SALF cDNA (Fig. 5C and Discussion).
DISCUSSION
Germ Cell-Specific Expression of Transcription Factors
In this study we show that ALF is a male germ cell-specific counterpart of TFIIA
/ß. In particular, expression of ALF begins in testis at Postnatal Day 14, corresponding to the pachytene stage of meiosis. Furthermore, transcript levels increase through Day 40, suggesting that haploid spermatids continue to express this gene. These conclusions are supported by RT-PCR experiments using RNA from isolated germ cells, and by in situ hybridization studies that show ALF mRNA is present in elongated spermatids within the seminiferous tubules. Similarly, TBP was also up-regulated at Postnatal Day 14, consistent with earlier reports showing expression in early pachytene cells [3, 6] and with the immunological detection of the TBP protein in haploid spermatids [3].
We also show that the RNA polymerase II factors TFIIA
/ß, TFIIA
and TRF2, as well as the RNA polymerase III factor TFIIIB90, are up-regulated during germ cell differentiation in mouse testis. Because these genes are expressed in a pattern that is very similar to that observed with ALF and TBP, we predict they are also enriched in meiotic and haploid germ cells. The current data therefore suggest a widespread and apparently coordinated up-regulation of GTF genes in male germ cells. It seems likely that this up-regulation will be directed by one or more pathways or processes that are activated only in male germ cells.
The transcription factors tested here and elsewhere fit into at least two categories. The first category includes those expressed in all somatic cells, including those of the testis, and which are also present in male germ cells. Examples include TBP, TRF2, TFIIA
/ß, TFIIA
, TFIIB [3], and TFIIIB90. The second category consists of genes whose expression is primarily restricted to male germ cells. Examples include ALF, the elongation factor SII-T1 [17] and Elongin A2 [18] genes, and perhaps the gonad- and CNS-specific Drosophila factor, TRF1 [8]. Factors whose expression is decreased or off in male germ cells have not been identified here, but would fall into a hypothetical third category.
A key question to be answered is the relationship between GTF expression and the biological functions of testis (reviewed in [39, 40]). It is possible that high levels of GTFs are required in order to express sufficient levels of genes needed for meiosis and spermatogenesis. Alternatively, the observation that some testis RNAs, for instance TBP [36], initiate at promoters that are not used in somatic cells raises the idea that transcription is somewhat indiscriminate in this tissue. This characteristic may be due to increased GTF expression, or to increased accessibility to genomic DNA as germ cell chromatin is remodeled for packaging into spermatozoa.
A hypothetical role for testis-specific GTF-related factors such as ALF is to regulate a particular subset of genes. In the case of the Drosophila TBP-related factor TRF1, this idea is supported by localization of the protein to distinct sites on polytene chromosomes [9]. In addition, mutations in the trf gene cause neurological defects and male sterility, demonstrating that TRF1 is important for CNS and gonadal function [8]. Likewise, TBP-associated factors (TAFs) in TFIID, as well as TFIIA, have been implicated in the regulation of genes that control cell cycle progression [4144]. Similarly, ALF may have a specialized role in the expression of genes that control progression through meiosis, or those that control germ cell development.
In some cases the level of GTF mRNA in testis is much higher than the level of the corresponding protein. For instance, the TBP protein is between twofold to 11-fold higher in testis compared with other tissues, whereas its mRNA is estimated to be between 40- and 1000-fold higher [3, 6, 7]. Likewise, transcripts for the TBP-related factor, TRF2, are expressed at high levels in mouse testis [11], but immunoblotting data show that the level of TRF2 protein is comparable to or lower than in other tissues [45]. A partial explanation for this discrepancy is suggested from the observation that testis-specific transcripts from the TBP gene are underrepresented among polyribosomal RNAs [7]. Thus, perhaps some RNA produced from the GTF genes is packaged into ribonucleoprotein complexes that repress or delay translation [46]. Potentially related, the protein levels for the
subunit of TFIIE, the large subunit of RNA polymerase II, and the 30-, 55-, 100-, and 135-kDa TAFII subunits show only a modest increase in testis [3, 7, 45].
Evolution of ALF and TFIIA
/ß
A comparison between human and mouse counterparts of TBP, TRF2, TFIIA
, and TFIIA
/ß shows that these sequences are 95%100% similar. In contrast, the testis-specific human and mouse ALF sequences are much more divergent (77% similarity), mainly due to differences in region II (68% similarity). Rapid evolution among genes involved in male reproductive physiology such as the sex-determining SRY gene [47, 48], the Pem homeobox transcription factor gene [49], and others [5052], has previously been noted. In addition, gene families that contain members expressed in distinct somatic and germ-cell patterns (like ALF and TFIIA
/ß) often show a similar pattern. For example, the human somatic transcription elongation factor, TFIIS, is 96% identical to its mouse counterpart, whereas the testis specific form (SII-T1) is 89% identical [53]. Likewise, the human somatic phosphoglycerate kinase (PGK1) gene is 98% similar to its mouse counterpart, whereas the testis-specific isoform (PGK2) is 86% similar [54]. Finally, the human lactate dehydrogenase (LDH) A isoform is 95% similar to its mouse counterpart, while the testis-specific LDH C isoform is only 73% similar [55]. The biochemical mechanisms that underlie the greater rate of evolution for these and other male reproductive genes are not understood, and it is unclear whether such divergence confers some reproductive advantage.
Structure of SALF
In this report we also sought to evaluate possible models that might explain the formation of the chimeric SALF transcript. One such possibility is that SALF is a product of two separate but adjacent human genes. Evidence for this idea is based on the fact that the 5' and 3' ends of SALF are unrelated, and on the observation that a probe from the 5' end of SALF detects a 6.5-kb mRNA in all tissues while a probe from the 3' end of SALF detects a 1.8-kb mRNA in testis (ALF) [4]. We have shown here that genomic sequences that border the junction within the ALF and SALF cDNAs contain putative splice donor (GT) and splice acceptor (AG) signals (Fig. 5A). Furthermore, PCR analysis of polychromosomal hybrid DNAs shows that the ALF and SBLF genes are both on chromosome 2. These results are substantiated by the intriguing discovery that complete ALF and SBLF genes are present on a single BAC clone.
Based on these data we propose that SALF is formed from a pre-mRNA transcript that initiates at an upstream SBLF gene and continues into the downstream ALF gene (Fig. 5C). The third exon of the SBLF gene is then spliced to the second exon of ALF to produce SALF. The model predicts that expression of SALF depends on the same regulatory elements that control expression of SBLF. This idea is consistent with the observation that a rare transcript detected with an ALF probe in nontestis tissues varies in proportion to the levels of the 6.5-kb SBLF transcript [4]. If correct, the model provides an unusual example of a eukaryotic pre-mRNA transcript that spans two independent genes. However, the model does not address whether the 1182-amino acid SALF polypeptide is physiologically important.
Conclusion
The results reported here demonstrate that ALF is a male germ cell-specific counterpart of TFIIA
/ß, and that an increase in the expression of many GTFs occurs during male germ cell development. It will be of interest to elucidate the factors and mechanisms that direct this expression, and to determine the relationship or relationships between GTF function and the cellular and chromosomal changes that occur during spermatogenesis.
ACKNOWLEDGMENTS
The sequences reported in this paper have been submitted to GenBank under the following accession numbers: rat TFIIA
(AF000944), rat TFIIA
/ß (AF000943), mouse ALF (AF250835), and mouse TFIIA
/ß (AF250834). The authors thank the Washington University and Genoscope sequencing centers for making human genomic sequence available prior to publication.
FOOTNOTES
1 This work was supported by grants from the American Cancer Society and the Welch Foundation (J.D.), and by NIH grant DK54480 (K.L.P.) ![]()
2 Correspondence: Jeff DeJong, The University of Texas at Dallas, Dept. of Molecular and Cell Biology, 2601 N. Floyd Rd., Richardson, TX 75080. FAX: 972 883 2409; dejong{at}utdallas.edu ![]()
Accepted: September 6, 2000.
Received: May 11, 2000.
REFERENCES
/ß homolog selectively expressed in testis. J Biol Chem 1999; 274:1804018048.
) stimulates TATA-binding protein-DNA binding and transcription activation. J Biol Chem 2000; 275:122128.
, encodes both the p35 and p19 subunits of human TFIIA. Genes Dev 1993; 7:22202234.
) subunit of human TFIIA reveals functions critical for activated transcription. Genes Dev 1994; 8:23242335.This article has been cited by other articles:
![]() |
M. Kim, D. Li, Y. Cui, K. Mueller, W. C. Chears, and J. DeJong Regulatory Factor Interactions and Somatic Silencing of the Germ Cell-specific ALF Gene J. Biol. Chem., November 10, 2006; 281(45): 34288 - 34298. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Wang, J. T. San Agustin, G. B. Witman, and D. L. Kilpatrick Novel Role for a Sterol Response Element Binding Protein in Directing Spermatogenic Cell-Specific Gene Expression Mol. Cell. Biol., December 15, 2004; 24(24): 10681 - 10688. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Han, W. Xie, S. H. Kim, L. Yue, and J. DeJong A Short Core Promoter Drives Expression of the ALF Transcription Factor in Reproductive Tissues of Male and Female Mice Biol Reprod, September 1, 2004; 71(3): 933 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. De Cesare, G. M. Fimia, S. Brancorsini, M. Parvinen, and P. Sassone-Corsi Transcriptional Control in Male Germ Cells: General Factor TFIIA Participates in CREM-Dependent Gene Activation Mol. Endocrinol., December 1, 2003; 17(12): 2554 - 2565. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Han, W. Xie, S. R. Hammes, and J. DeJong Expression of the Germ Cell-specific Transcription Factor ALF in Xenopus Oocytes Compensates for Translational Inactivation of the Somatic Factor TFIIA J. Biol. Chem., November 14, 2003; 278(46): 45586 - 45593. [Abstract] [Full Text] [PDF] |
||||
![]() |
S.-i. Kashiwabara, J. Noguchi, T. Zhuang, K. Ohmura, A. Honda, S. Sugiura, K. Miyamoto, S. Takahashi, K. Inoue, A. Ogura, et al. Regulation of Spermatogenesis by Testis-Specific, Cytoplasmic Poly(A) Polymerase TPAP Science, December 6, 2002; 298(5600): 1999 - 2002. |