Description
Acute lymphoblastic leukemia (ALL), the commonest childhood malignancy, is characterized by recurring gross and submicroscopic structural genetic alterations that contribute to leukemogenesis. Disordered epigenetic regulation is a hallmark of many tumors, and while analysis of DNA methylation of limited numbers of genes or ALL samples suggests epigenetic alterations may also be important, a large-scale integrative genome-wide analysis evaluating DNA methylation in ALL has not been performed. Here, we report an integrated epigenomic, transcriptional and genetic analysis of 167 childhood ALL cases, comprising B-progenitor ALL with hyperdiploidy (N=26), ETV6-RUNX1 (N=27), TCF3-PBX1 (N=9), BCR-ABL1 (N=19), rearrangement of MLL (MLLr) (N=20), rearrangement of CRLF2 (N=11, CRLF2r), deletion of ERG (N=11), miscellaneous or normal karyotype (N=14), and T-lineage ALL (N=30), including 4 MLLr cases and 7 cases with early T-cell precursor immunophenotype. Genome-wide profiling of structural DNA alterations was performed for all cases using Affymetrix 500K and SNP 6.0 arrays. Affymetrix U133A gene expression profiling data was available for 154 cases. Genome-wide methylation profiling was performed using the HELP microarray assay, which measures methylation at approximately 50,000 CpGs distributed among 22,722 Refseq promoters. Methylation data was compared to that of normal pro-B (CD34+CD19+sIg-), pre-B (CD34-CD19+sIg-) and mature B (CD34-CD19+sIg+) cells FACS-sorted from bone marrow of 6 healthy individuals. Unsupervised hierarchical clustering of the top 4043 most variable methylation probesets identified 9 B-ALL clusters with significant correlation to specific genetic lesions including ETV6-RUNX1, MLLr, BCR-ABL1, CRLF2r, TCF3-PBX1 and ERG deletion. T-ALLs and hyperdiploid B-ALLs also defined specific DNA methylation clusters. Supervised analysis including limma and ANOVA identified distinct DNA methylation signatures for each subtype. Notably, the strength of these signatures was subtype dependent, with more differentially methylated genes observed in ALL cases with genetic alterations targeting transcriptional regulators (e.g. ETV6-RUNX1 and MLLr) and fewer genes in cases with alterations deregulating cytokine receptor signaling (e.g. CRLF2r). Aberrant DNA methylation affected specific and distinct biological processes in the various leukemia subtypes implicating epigenetic regulation of these pathways in the pathogenesis of these different forms of ALL (e.g. TGFB and TNF in ERG deleted leukemias; telomere and centriole regulation in BCR-ABL1 ALL). Aberrantly methylated genes were also enriched for binding sites of known or suspected oncogenic transcription factors that might represent cooperative influences in establishing the phenotype of the various B-ALL subtypes. Most importantly, an integrated analysis of methylation and gene expression of these ALL subtypes demonstrated striking inversely correlated expression of the corresponding gene transcripts. The methylation signatures of each subtype exhibited only partial overlap with those of normal B cells, indicating that the signatures do not simply reflect stage of lymphoid maturation. In a separate approach, we discovered that 81 genes showed consistent aberrant methylation across all ALL subtypes, including the tumor suppressor PDZD2, HOXA5, HOXA6 and MSH2. Inverse correlation with expression was confirmed in 66% of these genes. These data suggest the existence of a common epigenetic pathway underlying the malignant transformation of lymphoid precursor cells. Integrative genetic and epigenetic analysis revealed hypermethylation of genes on trisomic chromosomes that do not show increased expression, suggesting that epigenetic silencing may control genes within amplified regions and explain why only selected genes are overexpressed. Finally, analysis of individual genes targeted by recurring copy number alterations in ALL revealed a subset of genes also targeted by abnormal methylation, with corresponding changes in gene expression (e.g. ERG, GAB1), suggesting that such genes are inactivated far more frequently than suggested by genetic analyses alone. Collectively, the data support a key role of epigenetic gene regulation in the pathogenesis of ALL, and point towards a scenario where genetic and epigenetic lesions cooperatively determine disease phenotype.