| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |






From the Division of Graduate Medical Sciences
*
and the Departments of Medicine,
Genetics and Genomics,
and Pathology and Laboratory Medicine,
Boston University School of Medicine and Boston Medical Center, Boston, Massachusetts
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
In addition to material from fine needle aspirates and primary cell culture, many in vivo clinical samples are acquired by microdissection, particularly laser capture microdissection (LCM), which is a widely available technique used to harvest homogeneous cell populations within complex tissues.13
Although the use of LCM is now standard, few studies have tested the reliability and reproducibility of amplified LCM-generated RNA hybridizations on oligonucleotide arrays. Ohyama and colleagues7
have shown that LCM-captured cells can generate sufficient quantity of aRNA (after T7-based amplification) for oligonucleotide arrays. Luzzi and colleagues11
demonstrated that LCM-captured cell-amplified RNA could provide interpretable hybridization results, assessed using
1800 genes. This group also compared two independent 30-ng samples of LCM-captured cells-amplified RNA and found
4.3% variability between them (fold change >2).6
These studies are highly promising. However, limited information is available about several important issues, including: 1) the reproducibility of gene expression measurements made using LCM-captured cells-amplified RNA and oligonucleotide arrays; 2) the comparability of standard and LCM-captured cell gene expression measurements using these arrays; 3) whether the technical variability introduced into gene expression measurements by the processes of LCM, RNA amplification, and array hybridization is large enough to obscure differences in gene expression between biologically distinct samples. This information is very important because most studies of in vivo tissue will use microdissected, small-sample RNAs.
To address these issues, we performed two series of oligonucleotide array hybridizations. The first was designed to examine the reproducibility and reliability of the technique, and used as starting material RNA from a single primary human breast specimen. Microgram quantities of this RNA were examined by microarray analysis as standard samples, and compared with small-sample (100 ng) quantities that were obtained in a series of independent isolations (with and without LCM) and amplifications. The second experiment was designed to determine whether the technical variability associated with LCM and the small sample protocol would interfere with the identification of differences in gene expression because of biological variation. The starting material for this experiment was small samples of microdissected, paired normal epithelium, and ductal carcinoma in situ (DCIS) from three independent breast specimens.
| Materials and Methods |
|---|
|
|
|---|
Study Design
Figure 1
summarizes sample preparation and methods. As shown in Figure 1A
, a normal human breast specimen (breast 1), containing a mixture of epithelium, stroma, vasculature, adipocytes, lymphocytes, and possibly other cell types, but no tumor, was snap-frozen. From one piece, total RNA was extracted. Two 5-µg aliquots were processed using the standard Affymetrix protocol of one round of RNA amplification and labeling, and the resulting biotinylated aRNA was hybridized to U133A GeneChip arrays (standard A, standard B). In parallel, two 100-ng aliquots of the same total RNA underwent two rounds of amplification followed by labeling and hybridization (small A, small B). A piece of breast 1 was also embedded in optimal cutting temperature (O.C.T.), sectioned, and microdissected. One set of microdissections used no LCM caps: instead, after laser firing, all tissue elements were scraped from the slides and RNA extracted (small C, small D). The other set of microdissections used caps and RNA was extracted from all tissue elements on the caps (small E, small F). One hundred ng of RNA from each collection method was amplified using the two-round protocol for small samples. For small C and D, a first round amplification was performed on a single RNA aliquot that was then split for two independent second round amplifications. In sum, these samples were analyzed by four independent RNA isolations, five independent RNA amplifications, and eight independent biotin-labelings and hybridizations. As shown in Figure 1B
, samples from three different cancer-containing human breast specimens (breast 2, 3, 4) were snap-frozen and embedded in O.C.T. From each specimen, normal epithelium and DCIS were separately microdissected using caps, ie, the protocol used with small E and F. Equal amounts of RNA from each specimens normal and cancer pair (breast 2, 43 ng; breast 3, 100 ng; breast 4, 81 ng) were isolated, amplified using the two-round protocol, labeled, and hybridized as described below.
|
Microdissected RNA
O.C.T.-embedded tissue was sectioned at 12-µm intervals, mounted on uncoated slides, and immediately returned to 80°C. Subsequently, these sections were fixed in 70% ethanol for 1 minute, lightly stained with dilute Mayers hematoxylin and eosin (50% concentration hematoxylin and 10% concentration eosin), and dehydrated using two dips each of 70%, 95%, and 100% ethanol and two 2-minute xylene rinses, followed by air-drying for 2 minutes. In breast 1, all tissue elements on the slides were microdissected (PixCell; Arcturus Engineering, Mountain View, CA). For small E and F, microdissection was performed using the standard technique using caps. For small C and D, microdissection was performed without a cap: the laser was fired onto the tissue, without a cap in place, and all tissue elements were then scraped from the slide. All other specimens (breast 2, 3, 4) were microdissected using caps, as with small E and F. Slides awaiting microdissection were stored briefly under desiccation in a slide box at room temperature. Total RNA was extracted from each sample as described above.
Target Preparation
Standard RNA
Double-stranded cDNA was synthesized from 5 µg of total RNA using a SuperScript double-stranded cDNA synthesis kit (Invitrogen, Carlsbad, CA) and a (dT)24-T7 promoter primer. The products were purified by phenol/chloroform extraction using PLG Heavy (Brinkmann Instruments, Westbury, NY). Biotin-labeled aRNA was generated with a Bioarray High Yield RNA transcription kit (Enzo Diagnostics, Farmingdale, NY), purified on RNeasy affinity columns (Qiagen, Valencia, CA), and quantified on a Nanodrop ND-1000 spectrophotometer. To assess aRNA quality, 200 to 500 ng was electrophoresed on a 1% agarose gel stained with SYBR Gold.
Small Sample RNA
Breast 1:
The first round amplification was performed on 100 ng of total RNA using the MessageAmp aRNA kit (Ambion Inc., Austin, TX) following the manufacturers protocol. The in vitro transcription was performed at 37°C for 10 hours and 30 minutes. Between 300 ng to 2 µg aRNA was used in the second round amplification, whose first step uses random-priming rather than oligo-dT. After second round double-stranded cDNA synthesis and purification (MessageAmp aRNA kit, Ambion), in vitro transcription for small C and E was performed using the same reagents, supplemented with biotin-11-CTP and biotin-16-UTP (Enzo Diagnostics) according to the manufacturers protocol. For the other four small samples (small A, B, D, F), in vitro transcription was performed using the Bioarray High Yield RNA transcription kit (Enzo Diagnostics).
Breasts 2, 3, 4:
For first round amplification, the MessageAmp aRNA kit was used in breast 2 and the RiboAmp OA RNA amplification kit (Arcturus) was used in breast 3 and 4. All second round amplifications were performed using the MessageAmp aRNA kit (double-stranded cDNA) and Enzo Diagnostics reagents (in vitro transcription), see above section for details.
Hybridization
For each hybridization, 20 µg of biotin-labeled aRNA was fragmented to an average size of 35 to 200 bases by incubating in 40 mmol/L Tris-acetate, pH 8.1, 100 mmol/L KOAc, 30 mmol/L MgOAc, for 35 minutes at 94°C. Ten µg of fragmented RNA, along with hybridization controls supplied by Affymetrix (Santa Clara, CA), was hybridized to each U133A GeneChip array containing probes for 22,283 human genes (Affymetrix). The arrays were hybridized for 16 hours at 45°C and 60 rpm, and then washed and stained according to the standard antibody amplification for eukaryotic targets protocol (Affymetrix). The stained arrays were scanned at 488 nm using a G2500 Scanner (Agilent, Palo Alto, CA).
Data Analysis
The images from the scanned chips were quantified and scaled by using Affymetrix Microarray Suite 5.0 (for detailed information see the statistical algorithms description document at http://www.affymetrix.com/support/technical/whitepapers/sadd_whitepaper.pdf; Affymetrix). Signal intensities from the 22 probes for each gene were used to determine an overall expression level for that gene as well as a detection confidence score. The gene expression levels were linearly scaled to an average of 500 units on each chip. We used the detection confidence score, which is a measure of the sequence specificity of the hybridization intensities, to eliminate genes that are not expressed or not detected in any of the samples and this reduced our dataset from 22,283 genes to 14,377. Studentized extreme-value test statistics, correlation coefficients (r) and
2-test statistics (X2) were calculated with Excel (Microsoft Corp., Redmond, WA). Principal components analysis of gene z-scores was performed with DecisionSite (Spotfire, Inc., Somerville, MA).
| Results and Discussion |
|---|
|
|
|---|
We compared several parameters of RNA and hybridization quality among our 14 samples, as shown in Table 1
. The yield of biotinylated aRNA was, in all cases, sufficient to hybridize to the oligonucleotide array. The ratios of the hybridization intensity from the 3' and 5' ends of the glyceraldehyde-3-phosphate dehydrogenase (GAPDH) transcript were generally low, although higher in small samples compared with standard. This is consistent with the shorter biotinylated aRNAs from small samples seen on the agarose gels, and is caused by the amplification creating targets that are skewed to the 3' end of the transcript. We examined raw Q, scaling factor, and percent present call (% present) to evaluate whether the small sample procedure produces high-quality aRNA.14
As seen in Table 1
, these measures were similar among standard and small samples, and all indicated good quality targets and hybridizations. None of the yields or scaling factor values were significant outliers (P > 0.05, Studentized extreme-value test), although the 3'/5' ratio from breast 2N were significantly higher than the other samples (P < 0.05, Studentized extreme-value test). As expected,
50% of probesets were called present in most samples. Although the percentage of genes reliably detected in the samples obtained via LCM without a cap (small C and D) are somewhat lower than the other samples, neither of these values are significant outliers relative to the other 12 samples (P > 0.05, Studentized extreme-value test). The percent present probesets appears to vary slightly between specimens and between tissue types. This may reflect biological difference between samples or variable quality of input RNA. In sum, these results indicate that multiple, independently obtained, laser-captured, small sample RNAs can reliably produce sufficient satisfactory aRNA and generate good quality oligonucleotide hybridizations.
|
|
Overall, the variability between all six small samples of breast 1 is similar to the variability between replicates, suggesting that any small sample is as similar to another as each small sample is to its replicate. Thus, the process of LCM, use of independent RNA isolations, and of different kits to amplify and label the RNA do not appear to alter the quality of the isolated RNA in a way that affects the gene expression measurements.
Comparison of Standard versus Small Samples (Breast 1)
To determine the fidelity of hybridization intensity measurements between small sample and standard sample RNA, we compared the hybridization intensity for each probeset from the standard samples with the mean hybridization intensity of the same probeset from all small samples. The results are shown in Figure 2 e and i
. The standard and small samples are less correlated (r
0.84) and
31% of genes show more than twofold difference in hybridization intensity. The differences between the standard samples and the small samples appear fairly evenly spread out across the intensity spectrum. These data are consistent with that of others4, 8, 15, 16
and suggest that gene expression measurements obtained from standard and small-sample material cannot be directly compared.
The results from breast 1 lead us to propose that much of the sample-to-sample variation among the small samples is the result of amplification failure among low-abundance transcripts. From our data we cannot determine whether the variability between standard sample replicates also results from amplification failure of low-abundance transcripts. However, because replicate-to-replicate variability is similar between standard and small samples, it is reasonable to suggest that both arise via the same mechanism. The compound effect of amplification failures during two rounds of amplification is likely responsible for a fraction of the probesets that have more than twofold lower hybridization intensity in small compared to standard samples. Additional differences in hybridization intensity between the small and standard samples (Figure 2 e and i)
, may be because of sequence-specific differences in amplification efficiency resulting from the small sample protocols incorporation of a random-primed reverse transcription step (see Materials and Methods).
Comparison of Technical and Biological Variability
Encouraged by the high degree of reproducibility from microdissected and amplified small samples, we wished to determine whether this level of reproducibility is sufficient to permit detection of variation in gene expression because of biological differences between samples. We performed six hybridizations using pairs of normal epithelium and DCIS from three independent specimens (breasts 2, 3, 4). These six samples were obtained using the same methods as small E and F. Typical of in vivo tissues, lesion size and total RNA quantity were limited, so replicates could not be performed.
We used principal components analysis to identify the major axes of variability among all 14 arrays. Of the total variation observed among the samples 50.2% is accounted for in the first two principal components of variation. As shown in Figure 3
, all six small samples from breast 1 are similar to each other and dissimilar from the two standard samples (which are also similar to each other). The tight clustering of the six small samples from breast 1 indicates that the different sample preparation protocols result in similar measurements of gene expression. The tight clustering is consistent with the low number of differences and high degree of correlation seen between all six small samples (Figure 2; f to h and j to l)
. The separation between the six small samples and the two standard samples is consistent with the large number of hybridization intensity differences and low degree of correlation seen between the standard and small samples (Figure 2 e and i)
.
|
Consistent with these results, the variation in gene expression within the three DCIS samples also appears larger than what would be expected because of technical variability. The distribution of the DCIS samples relative to each other along the two major axes of variation is similar to the distribution of matched normal epithelial samples from these specimens. This suggests that the specimen effect is consistent between normal epithelium and DCIS. In addition to this specimen effect, a cancer effect distinguishes the normal epithelial samples from the DCIS samples. With enough cases, cancer-specific differences in gene expression could be detected with a paired-sample t-test. These findings are consistent with those of McClintick and colleagues,17 who found biological variability to be greater than technical variability when using the standard protocol in rat liver.
In sum, these findings suggest that technical variability associated with independent LCM preparations, sample processing, and array hybridizations is small relative to differences because of biological variability between specimens and disease states. Thus, oligonucleotide array experiments comparing small sample aRNA between different samples within a specimen, or similar samples between specimens, will likely generate useful, functionally important data that reflect the biological differences between samples.
Summary
The purpose of this study is to test the reliability, reproducibility, and potential utility of gene expression measurements made with oligonucleotide microarrays using small sample amplified RNA obtained from microdissected clinical specimens. In a series of eight oligonucleotide arrays hybridized with RNA isolated from a single human breast specimen, we found that independent microdissections, RNA isolations, and amplifications reliably produced reproducible gene expression measurements. The results obtained with these highly processed independent samples correlated with each other as well as results obtained using the standard protocol that requires 50 times more RNA as starting material. As expected, gene expression measurements from small sample RNA do not mirror measurements from standard sample RNA. Both amplification failure among low-abundance transcripts and sequence-specific differences in amplification efficiency may account for the differences. In a second series of six hybridizations using three normal/cancer pairs, we found specimen-specific and disease state-specific differences in gene expression that appear larger than differences because of technical variability. These findings suggest that the technical variability introduced by LCM, small sample amplification, and array hybridization is smaller than the biological variability between human breast samples. These results indicate that it will be possible to use LCM-isolated primary tissues with oligonucleotide microarrays to uncover gene expression differences associated with homogeneous cell populations in the breast in vivo.
| Footnotes |
|---|
Supported by NIH PHS CA081078, and the Department of Defense Breast Cancer Research Program, DAMD17-01-1- 0159.
C.K. and N.G. contributed equally to this work.
Accepted for publication April 22, 2004.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
E.-K. Lee, S.-G. Yi, and T. Park arrayQCplot: software for checking the quality of microarray data Bioinformatics, September 15, 2006; 22(18): 2305 - 2307. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wilhelm, J. P. Muyal, J. Best, G. Kwapiszewska, M. M. Stein, W. Seeger, R. M. Bohle, and L. Fink Systematic Comparison of the T7-IVT and SMART-Based RNA Preamplification Techniques for DNA Microarray Experiments Clin. Chem., June 1, 2006; 52(6): 1161 - 1167. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |