Cytosine methylation profiling of cancer cell lines
- Mathias Ehrich*,†,
- Julia Turner*,
- Peter Gibbs‡,
- Lara Lipton‡,
- Mara Giovanneti‡,
- Charles Cantor*,†, and
- Dirk van den Boom*
- *Sequenom, Inc., 3595 John Hopkins Court, San Diego, CA 92109; and
- ‡Ludwig Institute for Cancer Research, P.O. Box 2008, Royal Melbourne Hospital, Parkville, Victoria 3050, Australia
-
Contributed by Charles Cantor, December 27, 2007 (received for review December 14, 2007)
-
Fig. 1.
Descriptive analysis of methylation data for normal and tumor cell-line samples. (a) A scatterplot depicts the results from a replicate analysis of ERBB2 by using two different primer designs. The quantitative measurements are highly concordant. (b) Relationship between CpG density and mean methylation levels in cancer and normal samples. CpG density of amplicons was calculated as the fraction of CpG nucleotides within the total amplicon sequence. The mean methylation value for each amplicon was generated by using all individual CpG sites' methylation values. Amplicons with >10% CpG content are likely to have lower methylation values in normal tissues. In cancer cell lines, DNA methylation is observed more frequently in these amplicons. (c) Amplicons were binned based on their average methylation values. Each bin contained amplicons within a 5% range of methylation values. Bins from 15% to 85% average methylation contain more amplicons in the set of cancer cell lines. (d) Histogram of methylation differences. For each amplicon, we calculated the difference in mean methylation between the group of normal samples and the group of tumor cell lines. Positive values translate to hypermethylation in cancer cell lines, whereas negative values indicate that the mean methylation was higher in the group of normal samples. The distribution of methylation differences is skewed toward hypermethylation. (e) DNA methylation in relation to the closest 5′ UTR. The distance from the 5′ UTR was calculated for every individual CpG site. Each data point contains 1,770 individual methylation values for the cancer cell lines and 180 values for the normal samples. It is necessary to adjust the number of data points in each group because of the difference in sample numbers in the two sets. A window of 1 kbp around the 5′ UTR shows low methylation values in the normal and the cancer cell-line samples. Methylation values in the cancer cell-line samples are generally elevated.
-
Fig. 2.
Two-way hierarchical cluster analysis of 59 tumor cell-line samples and 6 samples from normal tissues (rows) and DNA-methylation of CpG Units in 531 promoter regions (columns). DNA-methylation values are depicted in this false-color image on a continuous scale from red (nonmethylated) to yellow (100% methylated). Poor quality data are in gray. Samples are color-coded according to their cell-line tissue origin (legend, upper left) to simplify identification of potential sample clusters. Strong sample-cluster formation is observed for the group of normal samples, the group of colon cancer samples (brown), melanoma samples (green), and CNS tumors (yellow). Less-dominant clustering is observed in lung cancers (black), renal carcinoma (orange), and ovarian cancer (blue). The cell-line samples derived from breast cancer (pink), leukemia (red), and prostate cancer (gray) do not form obvious clusters. The normal samples are characterized by consistent low methylation levels. The cancer cell-line samples show more-variable methylation patterns. Sample annotations for the final branches of the tree are shown in SI Fig. 10.
-
Fig. 3.
Two-way hierarchical cluster analysis of 48 colon cancer samples, 48 adjacent normal colon tissue samples, 7 colon cancer cell lines, and 6 normal DNA samples (rows) and DNA-methylation of CpG Units in 64 promoter regions (columns). DNA-methylation values are depicted in this false-color image on a continuous scale from red (nonmethylated) to yellow (100% methylated). Poor quality data are annotated in gray. Samples are color-coded according to their origin (dark blue, colon cancer cell line; dark orange, normal control DNA; light blue, colon cancer tissue; light orange, normal colon tissue) to simplify identification of potential sample clusters. The hierarchical cluster algorithm separates colon cancer samples from normal tissue samples. Although they are clustered tightly together in a terminal branch of the dendrogram, all colon cancer cell lines are found among colon cancer tissue samples. The same applies to normal tissue samples within the group of normal colon tissue samples. Some (n = 10) colon cancer tissues are located in the group of normal tissue samples. Note that methylation differences for colon cancer cell lines tend to be higher than for colon cancer tissues.
-
Fig. 4.
This network diagram illustrates the relationship between significantly differentially methylated genes and the cell-line tumor types. Genes are shown as colored ellipses (yellow, PRC2 targets; gray, no PRC2 binding site) and cell-line types are shown as blue rectangles. A connection is shown between a cell-line tumor type and a gene when a statistically significant methylation difference was identified between the tumor type and the normal samples. Genes located on the outside are connected to a single tumor type. Genes located between tumor types are connected to at least two different tumor types. Most highly connected genes tend to be PRC2 targets, whereas genes connected to a single tumor type are less likely to be a target of PRC2. Fifty-eight percent of all genes connected to more than one tumor type are PCR2 targets, whereas only 30% of genes connected to a single tumor type are PRC2 targets.
Footnotes
- †To whom correspondence may be addressed. E-mail: mehrich{at}sequenom.com or ccantor{at}sequenom.com
- © 2008 by The National Academy of Sciences of the USA









