| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Published online before print December 28, 2007
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



From the Robert H. Lurie Comprehensive Cancer Center,
*and the Departments of Preventive Medicine,
Pathology,
and Surgery,
Feinberg School of Medicine, Northwestern University, Chicago, Illinois
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
DNA methylation is an attractive paradigm for cancer detection in that differential methylation of multiple genes in normal versus tumor tissue is well-established.3, 4, 5 Identical modification of DNA in multiple sites allows testing of multiple biomarker candidates by the same technique. Although analysis of each separate biomarker may not be adequate for diagnosis, combinations of biomarkers can produce accurate assays for cancer detection. Such assays together with the presence of abnormally methylated DNA in the blood of cancer patients6, 7 create a possibility for a minimally invasive diagnostic test.
We have developed a platform for multiplex detection of DNA methylation at multiple genomic sites8 and tested its performance in DNA from fixed human tissues.9 Here, we present proof-of-principle data on selection of informative methylated or unmethylated promoter sequences for cancer detection using DNA from gross sections of formalin-fixed, paraffin-embedded clinical specimens. Our approach allows detection of pathological changes via an observer-independent assay, which has obvious advantages for clinical practice.
| Materials and Methods |
|---|
|
|
|---|
|
DNA Isolation
After xylene deparaffination and ethanol precipitation, the tissue pellet was processed using a DNeasy Tissue kit (Qiagen, Valencia, CA). Purified DNA was dissolved in 10 mmol/L Tris, pH 7.8, 0.5 mmol/L EDTA.
Microarray-Mediated Methylation Assay: Overall Approach
In the microarray-mediated methylation assay (M3-assay), one portion of each genomic DNA sample was digested with a methylation-sensitive restriction enzyme, whereas another portion of the same sample served as an undigested control. Selected regions of the genomic DNA from each of the digested and undigested DNA samples were amplified by PCR using gene-specific primers that flank restriction sites. For the amplified product from the digested portion, only fragments with methylated sites were capable to serve as templates, whereas in the undigested (control) portion, all fragments were amplified. Comparison between the two sets of PCR products was done by gel electrophoresis (MSRE-PCR)8
or by competitive hybridization with custom-designed microarrays (M3-assay). Fluorescent signals of hybridized fragments in the M3-assay were separately scored, and the ratio between the signals from control and digested DNAs was calculated. This ratio was used to assign "methylated" or "unmethylated" calls to the targeted regions. The data were statistically assessed to select groups of informative fragments, which were then analyzed together as a composite biomarker. Details of the method are presented below.
Microarray-Mediated Methylation Assay
DNA Digestion
Hin6I (Fermentas, Hanover, MD) was used to digest one-half of each purified genomic DNA sample as described previously.8
The second half of each DNA sample was incubated in the digestion buffer without the enzyme and served as the control.
PCR Amplification
Nested PCR was performed as described previously.8
KlenTaq112
(DNA Polymerase Technology, St. Louis, MO) was used at 8 U per 30-µl reaction. Betain and dNTPs (Sigma, St. Louis, MO) were added to the PCR buffer to 1.5 mol/L and 0.25 mmol/L, respectively. The PCR reaction was assembled on ice, the tubes were placed into a thermocycler (ABI 9600; Applied Biosystems, Foster City, CA) and incubated at 95°C for 5 minutes, and KlenTaq1 was added. After 25 cycles (95°C for 45 seconds; 62°C for 1 minute; 72°C for 1 minute), the products were precipitated and dissolved in 10 mmol/L Tris, pH 7.8, 0.5 mmol/L EDTA, and 1.5 ng was used for the second PCR, assembled with aminoallyl-dUTP (Biotium, Hayward, CA) and dTTP (3:1), and performed as the first. PCR products were precipitated and dissolved for labeling in 20 µl of 100 mmol/L NaHCO3 buffer (pH 9.0).
DNA Labeling
Five microliters of Cy3 or Cy5 (Monoreactive Dye Pack, Amersham, Piscataway, NJ) in dimethyl sulfoxide were dried in a vacuum, and PCR products were added for 2 hours at room temperature. Unreacted dyes were quenched by 10 µl of 4 mol/L hydroxylamine, and the products were precipitated. The PCR products from undigested (control) DNA were labeled with Cy5, whereas Cy3 was used to label the PCR products from Hin6I-digested DNA.
Hybridization and Signal Detection
Custom-designed arrays (MWG Bioinformatics, High Point, NC) containing 60-mer probes for each amplified product were printed in triplicate on aminosilane-modified glass by Microarrays, Inc. (Nashville, TN). The slides were prehybridized for 1 hour at 42°C in 5x SSC, 0.1% SDS, and 1% BSA; rinsed with deionized water; and dried. Labeled DNA was dissolved in the hybridization buffer (100 µl; Ocimum Biosolutions, Indianapolis, IN), denatured (2 minutes at 95°C), and quenched on ice. Microarray GeneFrames (AbGene, Rochester, NY) were used to create space between the slide and the coverslip. Denatured DNA was added, the coverslip was sealed, and the slides were incubated 18 hours at 42°C. The GeneFrame and the coverslip were removed, and the slides were washed at 42°C for 5 minutes in 1x SSC, 0.1% SDS, and twice for 5 minutes in 0.1x SSC and 0.1% SDS. Slides were scanned using ScanArray XL4000 (sensitivity,
0.1 molecule/µm2; Perkin Elmer, Boston, MA) with ScanArray software. Intensity of each fluorophor was measured for each spot, and the background values were subtracted. Ratios of Cy5-to-Cy3 fluorescence were calculated to compare the yields of PCR products from control and Hin6I-digested DNA.
Statistical Analysis
Methylation calls were made independently for each spot, and final gene-specific calls were made according to the majority call from the triplicate spots for that gene. Nonspecific filtering removed uninformative spots; informative genes were selected by Fishers exact test for differential methylation in each pairwise analysis. Naïve Bayes classification with uninformative prior was used to classify samples assuming that methylation was independent for each of the analyzed sites. The predictive ability of the naïve Bayes classifier for all four pairwise comparisons (cancer versus normal, IDC versus normal, DCIS versus normal, and ADH versus normal) was evaluated using fivefold cross-validation. The data were partitioned into five sets with equal distribution of each type of specimens. Each set then served as a test set based on training of the naïve Bayes classifier with the other four sets. The number of misclassifications was counted over all five runs and over 25 random partitions of the data into five groups. Gene selection and classifier parameter estimation were performed anew with each round of cross-validation.
Assessment of Assay Variability
Methylation profiling of genomic DNA of MCF-7 was repeated five times. Forty-nine spots were unambiguously detected, and their methylation calls were independently established for each experiment, creating 49 groups (the number of fragments) of five calls each (five repeats). All calls different from the majority were counted; the number of these calls divided by the total number of calls was used as a measure of the assays variability.
| Results |
|---|
|
|
|---|
|
Age distribution was similar within each group (Table 1)
. The mean age was lower for reduction mammaplasty (normal) group (P < 0.001 using an analysis of variance model). The age difference was significant between the normal and other groups (adjusted P values <0.001 in pairwise comparisons with Bonferroni adjusted P values). Data on the expression of estrogen and progesterone receptors and p53 were not available for ADH and normal samples. In DCIS, the fraction of estrogen receptor-positive tumors (100%) was higher than reported (P < 0.001), but the fraction of progesterone-positive tumors (75%) was similar.13
In IDC, the fraction of tumors expressing estrogen and progesterone receptors was consistent with reported values.13
The percentage of p53-positive tumors was close to reported for both DCIS14
and IDC15
groups.
M3-Assay
DNA methylation analysis was performed as shown in Figure 1
. Fifty-six promoter fragments were interrogated (Figure 2)
in each experiment. Negative control fragments included coding sequences of three genes (asterisk in Figure 2
) and heterologous DNA from Arabidopsis thaliana. Each probe on the array was designed to detect corresponding PCR product. Each microarray contained three identical subarrays, so that every hybridization signal was confirmed in triplicate. Unreliable hybridization signals with intensities comparable with or less than background were excluded, and background was subtracted. The threshold for methylation was determined experimentally using "self-self" hybridizations16
; ie, PCR products from control (undigested) DNA were divided into two equal aliquots, labeled with either Cy3 or Cy5, mixed, and hybridized to the array; the average Cy5-to-Cy3 ratio was recorded. This "self-self" design assured equal representation of Cy3- and Cy5-labeled fragments as would be expected from samples of methylated DNA. This average ratio of intensities was used as a threshold to define methylation [standard methylation call (SMC)]. SMCs were used to assign calls for each gene: methylated to genes with Cy5/Cy3
SMC and unmethylated to genes with Cy5/Cy3>SMC. An example of data are shown in Table 2
. If no call could be assigned, the gene was scored as nonapplicable.
|
|
No attempt was made to correlate the results of the M3-assay and expression profile of analyzed samples. By its design, the M3-assay assessed methylation only in a few CpG sites in each promoter, so a rigorous correlation between gene expression and methylation results could not be expected.
Reproducibility of the M3-assay was evaluated using genomic DNA from MCF-7 cells. The assay was repeated five times, and the readout was evaluated for each fragment as described in Materials and Methods. Six of 245 total data points were variable (2.4%), suggesting a variability of less than 3% for the assay.
We also evaluated the link between the Cy5-to-Cy3 ratio and the level of methylation in heterogeneous samples. Control samples were prepared using a mixture of genomic DNA from MCF-7 and TD47D cells so that each sample contained a predetermined percentage of methylated and unmethylated genes. Cy5-to-Cy3 ratios below SMC were observed for samples with up to 50% unmethylated DNA (Figure 3)
. Samples with greater than 50% unmethylated genomic DNA fragments caused gradual increases of the Cy5-to-Cy3 ratio (Figure 3)
. These results indicate that the efficient detection of methylated fragments incorporated in the MSRE-PCR procedure8
was preserved in the M3-assay.
|
Classification of Samples
Each subarray contained 61 fragments and three empty spots (Figure 2)
producing 192 spots on the array, 183 of which contained probes. Methylation calls were made in a blinded manner and independently for each spot. The majority call for the three spots for each gene was assigned as a final gene-specific methylation call. If there was no majority, the final call was nonapplicable. In a total of 8418 calls made for 61 genes in each of 138 samples, 4725 were methylated (56.1%), 2045 were unmethylated (24.3%), and 1648 (19.6%) were nonapplicable.
Similar to expression microarray analysis,17 nonspecific filtering was used to eliminate uninformative genes with detectable calls in less than two-thirds of the samples or less than 10% differential methylation across the entire sample set (eg, 90% methylated and 10% unmethylated). Nonspecific filtering steps were repeated for four pairwise analyses, but only a few genes were eliminated, and more than 45 genes were selected for each comparison: DCIS versus normal, 46 genes; IDC versus normal, 48 genes; DCIS/IDC versus normal, 48 genes; and ADH versus normal, 49 genes. Informative features for classifiers were selected with Fishers exact test using P < 0.10. The moderate P value of 0.10 was chosen to narrow the set of genes but to include informative genes with occasionally inflated P values.
The apparent independence of methylation sites18 suggested selection of the naïve Bayes classifier,19 which performed surprisingly well even when independence was not satisfied.20 Naïve Bayes classifiers were constructed using the e1071 R (R Development Core Team, 2005) package,21 using an uninformative prior with probabilities of 0.5 for each group in the pairwise classification schemes.
Sensitivity and specificity of the assay and overall classification accuracy were determined (Table 3)
. Besides DCIS and IDC groups, a combined Cancer group was created, which contained both DCIS and IDC samples.
|
|
| Discussion |
|---|
|
|
|---|
Use of bisulfite degrades the target DNA (up to 95%)25 and hence may reduce amplifiable DNA.26 Biased amplification of remaining DNA (sequence-, strand-, and level of methylation-dependent bias) has been reported.27 Although these problems may not be significant for homogeneous or ample specimens, they can be critical for heterogeneous clinical specimens and may produce inaccurate results, especially if DNA degradation is specific to certain sequences. In addition, degradation of the major part of a limited clinical sample may prevent its comprehensive analysis, which will be also reflected in reduced analytical sensitivity. With this in mind, we have compared bisulfite-based techniques (methylation-specific PCR and bisulfite sequencing) with MSRE-PCR using homogeneous specimens from cultured cells where these problems are less likely to produce biased results.8 The inherent flaws in the bisulfite technique suggest that an alternative procedure for detection of methylated DNA in clinical samples is needed.
The M3-assay is similar to MSRE-PCR8 but relies on microarray-based rather than gel-based signal detection. As in many other DNA methylation techniques, the M3-assay evaluates methylation in a selected number of sites in each gene that may or may not correlate with sites critical for gene expression; this feature makes direct comparison of methylation and expression tenuous.8 The M3-assay is designed to efficiently detect methylated DNA fragments that can serve as templates for PCR in a heterogeneous sample. In the heterogeneous sample, any component can provide such a fragment, making it impossible to explicitly assign methylation to a specific part of the sample, eg, to neoplastic cells. The absence of PCR product, on the other hand, indicates that no tissue within the sample contains methylated fragments, so the absence of methylation in neoplastic tissue can be unequivocally established. This feature of the M3-assay makes the detection of unmethylated genes informative for specimen classification, whereas detection of methylated genes is uninformative.
Assignment of "methylated" and "unmethylated" calls in the M3-assay depends on the ratio of fluorescence produced by undigested and digested DNA, which in theory can only assume two values: 1/1 = 1, if the fragment is methylated and digestion has no effect, or 1/0 = infinity, if the fragment is unmethylated and no signal from digested DNA is detected. This type of ideal distribution is rarely seen even in cell lines.8
Quantitative measurement of signals expressed as Cy5-to-Cy3 ratio can produce significant discrepancies due to variability of experimental conditions and sampling differences. To manage experimental variability (eg, the dye bias), SMC is used to define a threshold for methylation (a "self-self" hybridization16
). This approach reduces numerical microarray data to a binary readout (Table 2)
, simplifies downstream analysis, and reduces the influence of sampling errors. As with the MSRE-PCR, the M3-assay efficiently detects methylated genes. For example, a sample containing equal amounts of methylated and unmethylated fragments (50% unmethylated) produces a "methylated" readout (Figure 3)
. Further increase of the share of the unmethylated fragment drives the Cy5-to-Cy3 ratio above the SMC level, so these fragments are scored as "unmethylated." Interestingly, the increase in the Cy5-to-Cy3 ratio is different for analyzed genes, suggesting certain influence of nucleotide composition on dye incorporation; for PAX5 even 10% of methylated fragments keep the Cy5-to-Cy3 ratio rather low (Figure 3)
.
Importantly, the M3-assay is not intended for quantitative assessment of methylation: it is designed for analysis of heterogeneous clinical samples where quantitative differences in methylation can depend on many reasons, including variations in tumor-to-stroma ratio and presence or absence of inflammation. These variations can be reduced by careful selection of samples but at the cost of their subjective evaluation.
Another feature of the M3-assay is the internal control for each spot provided by undigested DNA. This control is essential when damaged DNA (eg, DNA from formalin-fixed, paraffin-embedded samples) is used to ensure that a specific fragment is present. Data processing ignores all spots where hybridization signals for control (undigested) DNA are not detected.
Because of technical challenges of microarray-based techniques, the M3-assay is not intended for immediate clinical use; rather, the M3-assay provides the screening tool for selection of informative genes for a specific disease. Once such genes are identified, other, less demanding techniques can be applied to design the final clinical test.
Classifier Genes
The Classifier for Cancer is a combination of DCIS and IDC classifiers (Table 4)
. For example, TP73 and MSH2 are components of the DCIS but not of the IDC classifier, indicating differences important only to ductal carcinoma in situ. Conversely, PGR, THBS1, and FABP3 are not informative for DCIS classification but contribute to IDC classification, suggesting that disparities in their methylation status are significant only in invasive cancer.
Most of the promoters that define the Cancer classifier (six of nine) are also components of the ADH classifier, a result consistent with previously reported data that cancer-defining methylation changes appear very early in the process,28
extending these findings to unmethylated genes. Presence of PRKCDBP within the ADH and DCIS classifiers may indicate methylation changes that are informative during early stages of breast cancer but not during IDC. Unmethylation calls for each gene and P values from Fishers exact test for all pairwise comparisons are shown (Table 4)
. Blank cells indicate that the gene was not selected for the biomarker.
It is important that a useful biomarker for cancer contains unmethylated rather than methylated genes, because in a heterogeneous tissue, a methylated fragment may be amplified from any part of the sample, so the methylation signal is not necessarily produced by the tumor. Absence of methylation, however, explicitly indicates that the fragment is unmethylated everywhere in the sample, including tumor cells, so the difference in unmethylated genes between healthy tissue and cancer specimen can be used to identify tumors. It is expected that genes that are unmethylated in tumor but methylated in healthy tissue can be related to tumor growth, de-differentiation, and invasiveness. At least some of the genes found in our study meet these criteria (eg, EP300,29 TP73,30 THBS1,31 and FABP332 ).
The larger number of informative promoters identified for the ADH classifier (Table 4)
is reflected in a higher accuracy of the ADH classifier (Table 3)
, suggesting a systematic difference. The most consistent difference is the source of specimens in that all samples of ADH are from core biopsies, whereas other specimens are from gross sections of surgically removed tissues. These gross sections have not been enriched for tumor cells and contain variable amounts of stroma and tumor cells. Compared with gross sections, core biopsies of ADH are by far the most homogeneous.
The similarities in sets of informative genes found for the different stages of breast cancer indicate that no substantial difference can be detected and that differentiation of these stages is currently impossible. These observations raise two distinct possibilities, either that the current set of genes is insufficient to define specific biomarkers for each stage or that progression of breast cancer from ADH to IDC does not involve molecular differences, at least at the level of DNA methylation. Although there is no data to test either hypothesis, we believe that inclusion of additional genes will create a larger analytical space and will provide new biomarkers specific for each stage of breast cancer.
Results of this study may be affected by the age difference in the control and other groups (Table 1)
because DNA methylation increases with age.33
However, informative genes are chosen for their reduced methylation in abnormal samples, so it is unlikely that age-dependent increase of methylation has significantly influenced the results.
Although abnormal promoter methylation is an established feature of breast cancer cells,34 a diagnostic test based on DNA methylation has yet to be developed. One of the problems is the variability of methylation for each individual fragment. This variability indicates that analysis of a single gene may not provide sufficient accuracy for cancer detection. In the last 2 years, several groups reported multigene DNA methylation profiles for detection and classification of breast cancer,35, 36, 37, 38, 39 so the need for multigene profiles is widely recognized. The M3-assay is designed to quickly generate such profiles, facilitating selection of informative genes that can become targets for a clinical test.
Importantly, the M3-assay produces an integral methylation profile, where the signal from tumor cells is merged with signal from other tissues. As a result, the methylated call can be produced by any or all parts of the sample, so the informative value of the methylated calls is much lower than that of the unmethylated, which indicates that the fragment is unmethylated in all parts of the sample. Low informative value of the methylated calls explains why the composite biomarker contains only the unmethylated calls. This feature complicates direct comparison with data from other studies, where hypermethylation of a specific promoter is informative. Results of Fackler et al39 demonstrate this difference: All hypermethylated (and thus informative) promoters of their study tested in our project, are scored as methylated (and thus uninformative) by the M3-assay.
This study shows that complex and heterogeneous samples can be classified if methylation in multiple sites within the same specimen is evaluated. The current version of the assay is still insufficiently accurate and too complex for clinical application; however, it provides the platform for selection of informative genes that can produce a composite biomarker. Furthermore, tissue analysis has only limited clinical utility and serves only as a proof-of-principle that a combined analysis of multiple informative genes in heterogeneous samples is feasible and may lead to development of an accurate composite biomarker. It is possible that using the same assay with cell-free circulating DNA may provide a useful approach for cancer detection.
| Acknowledgments |
|---|
| Footnotes |
|---|
Supported by the Susan G. Komen Breast Cancer Foundation (grant BCTR 0402760 to V.V.L.).
Current address of E.L.W.: Division of Surgical Pathology, University of Illinois Medical Center, Chicago, IL.
Accepted for publication October 2, 2007.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. R. Sepulveda, D. Jones, S. Ogino, W. Samowitz, M. L. Gulley, R. Edwards, V. Levenson, V. M. Pratt, B. Yang, K. Nafa, et al. CpG Methylation Analysis--Current Status of Clinical Assays and Potential Applications in Molecular Diagnostics: A Report of the Association for Molecular Pathology J. Mol. Diagn., July 1, 2009; 11(4): 266 - 278. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Novak, T. J. Jensen, J. C. Garbe, M. R. Stampfer, and B. W. Futscher Stepwise DNA Methylation Changes Are Linked to Escape from Defined Proliferation Barriers and Mammary Epithelial Cell Immortalization Cancer Res., June 15, 2009; 69(12): 5251 - 5258. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Melnikov, D. Scholtens, A. Godwin, and V. Levenson Differential Methylation Profile of Ovarian Cancer in Tissues and Plasma J. Mol. Diagn., January 1, 2009; 11(1): 60 - 65. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |