Description
Protein-RNA interactions are fundamental to core biological processes, such as mRNA splicing, localization, degradation and translation. We developed a photoreactive nucleotide-enhanced UV crosslinking and oligo(dT) purification approach to identify the mRNA-bound proteome using quantitative proteomics and to display the protein occupancy on mRNA transcripts by next-generation sequencing. Application to a human embryonic kidney cell line identified close to 800 proteins. Close to one third of these proteins, were neither previously annotated nor could be functionally predicted to bind RNA. Protein occupancy profiling provides a transcriptome-wide catalog of potential cis-regulatory regions on mammalian mRNAs and showed that large stretches in 3'' UTRs can be contacted by the mRNA-bound proteome, with numerous putative binding sites in regions harboring disease-associated nucleotide polymorphisms. Our observations indicate the presence of a large number of unexpected mRNA-binders with novel molecular functions participating in combinatorial post-transcriptional gene-expression networks. Overall design: We generated protein occupancy cDNA libraries for two biological replicates. Briefly, we crosslinked 4SU-labeled cells and purified protein-mRNA complexes using oligo(dT)-beads. The precipitate was treated with RNAse I to reduce the protein-crosslinked RNA fragments to a length of about 30-60 nt. To remove non-crosslinked RNA, protein-RNA complexes were precipitated with ammonium sulfate and blotted onto nitrocellulose. The RNA was recovered by Proteinase K treatment, ligated to cloning adapters, and reverse transcribed. The resulting cDNA libraries were PCR-amplified and next-generation sequenced