| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
From the Human Papillomavirus Program, Centers for Disease Control and Prevention, Atlanta, Georgia
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
The purpose of this study was to select appropriate references for normalizing qRT-PCR assays of gene expression in exfoliated cervical cells. We have been investigating the gene expression of exfoliated cervical cells using cDNA microarrays to identify differential expression associated with cervical disease.7 The terminal differentiation of exfoliated cells, partial RNA degradation,8 and presence of neoplastic lesions must be accommodated in determining references for gene expression. No data are available on the stability of common reference genes in this type of sample. We examined total RNA content, DNA content, and the expression of 13 candidate housekeeping genes as references for qRT-PCR of exfoliated cells.
Total RNA mass determined either by UV photospectrometry or by densitometric assessment has been widely used as a reference to normalize the amount of RNA template. Its accuracy and suitability as a reference for highly sensitive real-time PCR technology is nonetheless questionable because mRNA makes up less than 1% of the total RNA, and the approach assumes that mRNA is a constant proportion of RNA in all cells. In addition, differences in the efficiency of the enzymatic reactions of qRT-PCR are not accounted for (ie, presence of inhibitors will not be detected).
Specimen cell number is seen as a universal reference for samples. However, the impracticality of quantitative assessment of cell number in solid tissues or other complex samples has limited its use.1, 9 Because the cellular DNA content is generally constant, DNA quantitation using qRT-PCR could be used to determine cell number. DNA was explored for normalization of microarray gene expression studies in bacteria, but no data are available for human gene expression profiling by PCR.10 Potential limitations of this approach include variation in the DNA-to-RNA ratio due to differentiation or disease and alterations in cellular DNA content due to polyploidy or aneuploidy. As with total RNA mass quantitation methods, sample-specific differences in tissue-borne inhibitors potentially affecting the enzymatic downstream processes are not measured.
Gene expression variation in cervical exfoliated cells might be indicative of neoplasia, and its detection holds promise as early cancer detection assays.7 To empirically evaluate references for qRT-PCR in this specimen, we used total nucleic acid extracts (TNAs; both DNA and RNA) from 30 samples, representing the full spectrum of preinvasive cervical neoplasia. We determined the number of cells by measuring the DNA content by PCR for the single-copy gene ß-globin quantitative and RNA content by using quantitative image analysis of ribosomal bands in denaturing agarose gels. In addition, we performed qRT-PCR for 13 genes selected as housekeeping genes in other studies. We used the Microsoft Excel add-ins geNorm1 and Norm-Finder5 to identify the best combination of housekeeping genes as references and correlated their expression with DNA content (ie, cell number) and gel representation of ribosomal RNA in the same samples.
| Materials and Methods |
|---|
|
|
|---|
Cervical disease status was determined based on the summary results of cytology, colposcopy, and biopsy examination. The 30 specimens used in this study were selected to represent women without abnormalities (n = 8) and three grades of cervical intraepithelial neoplasia (CIN): CIN1 (n = 9), CIN2 (n = 7), and CIN3 (n = 6). These samples also represented the spectrum of human papillomavirus (HPV) infection (HPV16 positive, other types of HPV, and no HPV detected), age (18 to 58 years), and ethnicity (black, white, Hispanic, and other) in the study.
Quantification of Total RNA
The quality of the isolated TNAs was evaluated with spectrophotometry and ethidium bromide-stained denaturing agarose electrophoresis. Quantitation of the total RNA extracted from each sample was assessed by densitometric measurement (FluorChem Digital Imaging System; Alpha Innotech, Inc., San Leandro, CA) of the ribosomal bands visualized on ethidium bromide-stained denaturing agarose gels in comparison with a standard 28Sand 18Scontrol marker. The standard was highly purified total RNA prepared from cultured Caski cells, quantitated using UV spectrophotometry.
cDNA Synthesis
In 0.2-ml thin-wall PCR tubes (Robbins Scientific Corp., Sunnyvale, CA), equal volumes (2.5 µl) of each sample were treated with 5 U of DNase I (GenHunter Corp., Nashville, TN) in a 10-µL reaction with 1x RT buffer (Invitrogen Corp., Carlsbad, CA) for 30 minutes at 37°C. We removed 1 µl to be tested for residual DNA (no-RT control). A master mix of primers and exogenous plant gene spike chlorophyll A-B binding protein (CAB) (Stratagene, La Jolla, CA) was prepared. Aliquots were added to each sample (final concentrations per sample were 300 ng of random primer, 50 ng of oligo-T1218 [Invitrogen], and 0.1 pg of CAB mRNA), and the mixtures were heated at 65°C for 5 minutes and then transferred to ice. After 1 minute on ice, the reaction mix was added (2 µl of RT buffer, 2 µl of dithiothreitol [100 mmol/L], 2 µl of dNTPs [10 mmol/L], 1 µl of dH2O, and 1 µl of Superscript III reverse transcriptase [Invitrogen]) to the final reaction volume of 20 µl. Samples were incubated at 25°C for 5 minutes, 50°C for 50 minutes, and 70°C for 15 minutes. All incubations were performed in a thermocycler. The product was diluted 1:5 in diethylpyrocarbonate-treated water and stored at 20°C in 25-µl aliquots until further processing.
Quantitative Real-Time RT-PCR
We developed quantitative SYBR green PCR assays for the 13 endogenous "housekeeping" genes shown in Table 1
, along with assays for spiked CAB and ß-globin DNA. We generated gene-specific primer sequences with the Primer Select application of Lasergene software (DNA Star, Madison, WI) (Table 2)
. We tested the specificity of the amplification conditions for each primer pair by melting curve analysis and by verifying the size of the amplicon on gel electrophoresis. We used cDNA from universal human reference RNA (Stratagene) in a four-step, 10-fold dilution series to calculate the PCR efficiency for each assay over a 1000-fold range of dilution (Table 2)
.
|
|
Rn = 0.1 and exported as tab-delimitated text files.
DNA Quantification (Cell Number)
We determined the DNA content in the same TNA samples using quantitative PCR for the single-copy ß-globin gene (HBB). Because DNA content is context independent and with few exceptions is the same in every cell, its Ct values directly correlate with the number of cells that were processed from the original specimens. One microliter of TNAs was amplified with HBBspecific primers (Table 2)
, applying the SYBR green PCR assay and data acquisition as described above.
Analysis
We required the coefficient variation (CV) of the Ct values for the spiked CAB among all samples to be less then 2% to control the experimental variability during cDNA synthesis. Duplicate Ct values for each candidate reference gene were averaged for each of the 30 samples, and the CVs had to be below 1% for a sample to be included in the analysis. The PCR efficiencies were calculated as e = 101/slope for each primer pair, with the slope determined by a linear regression model over log10-transformed Ct values of the template dilution series described above.
To evaluate the reference genes that are most suitable for RT-PCR normalization, we applied two previously published Microsoft Excel-based applications: 1) geNorm1 calculates a gene stability measure as the SD of the log2-transformed expression ratios of each housekeeping gene with all others tested throughout the samples. 2) Norm-Finder5 uses a model-based approach to estimate expression stability based on intra- and intergroup variations for candidate housekeeping genes. We applied a disease model comprised of three groups with different degrees of abnormality: no disease (CIN0; n = 8), mild dysplasia (CIN1; n = 9), and moderate to severe dysplasia (CIN2/CIN3; n = 13). Quantitative relationships between RNA transcripts and DNA content in the samples were analyzed via Pearson correlation in Microsoft Excel.
| Results |
|---|
|
|
|---|
We found the expression level of the TBP transcript to be close to the background of cervical exfoliated cells. This transcript could not be reliably amplified in all sample cDNAs, even with additional template (1:2 dilu tion instead of 1:5). Results from the remaining 12 genes and DNA content were of sufficient quality, and the raw Ct values were distributed over comparable ranges for most genes (Figure 1)
.
|
|
Table 4
presents the correlation coefficients between DNA content and the Ct values of the 30 samples for each housekeeping gene transcript. There was a wide range in these correlations, but the transcripts identified as most stable with either analysis also showed the highest correlation with DNA content. MBNL2appeared to be completely independent from ß-globin values. The geometric means of the three most stable genes by GeNorm (ACTB, PGK1, and RPLP0) had a correlation coefficient of 0.86 with ß-globin DNA (Figure 2A)
. No consistent relationship was observed with RNA amounts measured by gel densitometry (Figure 2B)
.
|
|
| Discussion |
|---|
|
|
|---|
GeNorm determines an internal stability measure for each gene calculated strictly from ratios. This approach seems suitable to identify a qRT-PCR reference without presumptions and resulted in the selection of ACTB, RPLP0, and PGK1. The model-based approach of Norm-Finder is advantageous if subpopulations with differential gene expression exist. This is especially significant if differences between these groups are being interrogated in the experiment. Accordingly, for analysis of genes indicative of the degree of CIN, a normalization factor generated from the Norm-Finder selection (RPL4and PGK1) might be preferable, because the pre-existing differences from varying CIN grades were used in the model.
The choice of references for qRT-PCR normalization is crucial for accurate comparisons of gene expression. However, the selection remains problematic for several reasons. Because of variations in the source of the sample and the biological variability encountered in the study, references must be evaluated empirically. To avoid a circular argument over the sample concentration, we made no attempt to equalize template concentration before each assay and used strictly equal input volumes from all samples. Although the problems of sample heterogeneity (contributing to variation in RNA content per cell or DNA unit) and RNA preservation (affecting observed relation between rRNA and mRNA) may well be greater for exfoliated cells than other kinds of samples, we believe our findings are relevant to other systems and indicate the importance of empirical validation of reference genes, regardless of the sample and experimental question.
Interestingly, the ranking of reference genes by GeNorm was paralleled by correlation with the DNA content individually (Table 3)
, and the geometric means of the three top genes (ACTB, RPLP0, and PGK1) showed a correlation coefficient of 0.86 with ß-globin (Figure 2A)
. This relationship might indicate a certain consistency of these gene expression levels in populations of exfoliated cells.
DNA content representing the cell number rationally appears as the most robust standard, and its good correlation with a number of housekeeping genes indicates a relative robust representation of mRNA levels. Its stability was nonetheless low when the CIN model was applied through Norm-Finder. An explanation could be given by haplotype variations that frequently occur in neoplastic lesions.12 Furthermore, the gene stability was calculated in relation to the other transcript levels in the study, and differences of global RNA expression relative to DNA might actually be a variable factor in different disease states. However, DNA could provide a valuable absolute reference in some types of experiments. Use of an additional external RNA control, like the plant CAB transcript we used here, is strongly advisable to monitor sample-specific differences that occur during the cDNA synthesis.
Ribosomal 18SRNA was moderately stable in the qRT-PCR assays and ranked fourth by geNorm and third by Norm-Finder, whereas 28SrRNA was ranked low by both algorithms (Table 3)
. Correlation of 28Sto DNA content was also insignificant compared with 18S. These results seem to underline the general supposition that 18Sexpression can be relatively stable and had in fact been suggested for normalization in other cell types and conditions.9, 13, 14
Fluctuation of rRNA levels especially of 28S are nonetheless apparent in exfoliated cells, influence total RNA levels accordingly, and might therefore not adequately represent mRNA activities. The finding that densitometric estimation of total RNA mass by the intensity of rRNA bands was completely independent from all PCR quantificationsDNA and RNAmeasured in the sample could originate from different degrees of partial degradation that influences UV absorbance of ribosomal bands but not the abundance of short PCR-amplified target templates. The relatively low precision of photometric technologies might further distort the true quantity. According to our results, the use of rRNA can generally not be recommended as a standard for cervical exfoliated cells.
No standard is likely to account perfectly for all aspects of the complexity and dynamic of the transcriptome, and mRNA quantification must therefore be seen as relative to a subjective reference. To normalize for sample-specific differences and accumulative experimental errors alike, we suggest the use of ACTB(ß-actin) as an acceptable standard for qRT-PCR studies in cervical exfoliated cells. The geometric means of RPL4and PGK1are recommended if special attention to intraepithelial lesions is appropriate.
| Acknowledgments |
|---|
| Footnotes |
|---|
Supported in part by the Early Detection Research Network Interagency Agreement of the National Cancer Institute (grant Y1-CN-0101-01).
The findings and conclusions in this report are those of the authors and do not necessarily represent the views of the funding agency.
Accepted for publication September 13, 2005.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. Steinau, M. S. Rajeevan, D. R. Lee, M. T. Ruffin, I. R. Horowitz, L. C. Flowers, T. Tadros, G. Birdsong, M. Husain, D. C. Kmak, et al. Evaluation of RNA Markers for Early Detection of Cervical Neoplasia in Exfoliated Cervical Cells Cancer Epidemiol. Biomarkers Prev., February 1, 2007; 16(2): 295 - 301. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |