Description
Pediatric acute lymphoblastic leukemia (ALL) contains cytogenetically distinct subtypes that respond differently to cytotoxic drugs. Subtype classification can be also achieved through gene expression profiling. However, how to apply such classifiers to a single patient and correctly diagnose the disease subtype in an independent patient group has not been addressed. Furthermore, the underlying regulatory mechanisms responsible for the subtype-specific gene expression patterns are still largely unknown. Here, by combining three published microarray datasets (PMIDs: 12086872, 12730115, 17002788) on 535 Caucasian samples and generating a new dataset on 100 Chinese children ALL samples, we were able to 1) identify a 62-gene classifier with 97.6% accuracy from the Caucasian samples and validated it on the completely independent set of 100 Chinese samples, 2) to uncover potential regulatory networks of ALL subtypes. The classifier we identified was so far the only one that could be applied directly to a single sample and sustained validation in a large independent patient group. Our results also suggest that the etiology of ALL is largely the same among different ethnic groups, and that the transcription factor hubs in the predicted regulatory network might play important roles in regulating gene expression and development of ALL.