Description
Reprogramming human somatic cells into induced pluripotent stem cells (iPSC) has been suspected of causing de novo copy number variations (CNVs). To explore this issue, we performed a whole-genome and transcriptome analysis of 20 human iPSC lines derived from primary skin fibroblasts of 7 individuals using next-generation sequencing. We find that, on average, an iPSC line manifests two CNVs not apparent in the fibroblasts from which the iPSC was derived. Using qPCR, PCR, and digital droplet PCR (ddPCR) to amplify across the CNVs'' breakpoints, we show that at least 50% of those CNVs are present as low frequency somatic genomic variants in parental fibroblasts and are manifested in iPSC colonies due to their clonal origin. Hence, reprogramming does not necessarily lead to de novo CNVs in iPSC, since most of line-manifested CNVs reflect somatic mosaicism in the human skin. Moreover, our findings demonstrate that clonal expansion, and iPSC lines in particular, can be used as a discovery tool to reliably detect low frequency CNVs in the tissue of origin. Overall, we estimate that approximately 30% of the fibroblast cells have somatic CNVs, suggesting widespread somatic mosaicism in the human body. Our study paves the way to understanding the fundamental question of the extent to which cells of the human body normally acquire structural alterations in their DNA post-zygotically. Overall design: We have generated and characterized hiPSC lines derived from skin fibroblasts collected from seven members of two families, which were competent to be differentiated into neuronal progenitors and neurons