• Keine Ergebnisse gefunden

Upstream open reading frames (uORFs) and start codons (uAUGs)

1.3 Global and gene-specific translational regulation

1.3.5 Upstream open reading frames (uORFs) and start codons (uAUGs)

Mechanisms of uORF-mediated translational control are not only complex and diverse, they are also not entirely understood yet, although uORFs are prevailing regulatory elements within mRNAs of higher eukaryotes: approximately ~50 % of human and ~45 % of mouse transcripts contain at least one uORF [82]–[86]. uORFs can be differentially defined which is mostly because uORFs can be subdivided in either being located completely upstream of the main ORF (a start codon followed by an in frame stop codon located within the 5’UTR) or overlapping with the CDS of the main ORF (a start codon within the 5’UTR followed by an in frame stop codon located downstream of the start codon of the main ORF). The latter is sometimes referred to as overlapping ORF (oORF) [85] or as uAUG [314]. Here, I will use the term uORF regardless of its relative location to the main ORF.

Albeit uORFs are widespread among mRNAs, they seem to be particularly abundant in transcripts of genes that play important roles in cell differentiation, cell cycle and stress response,

55

including also two-thirds of common oncogenes [18], [315], [316]. Usually, their presence within transcripts reduces gene expression levels by decreasing the number of scanning ribosomes that initiate at the main ORF or by decreasing transcript stability through activation of decay pathways like nonsense-mediated decay (NMD) [85], [317]. A large scale study on matched mRNA and protein datasets estimated that uORF-mediated translational repression typically results in a decrease of protein expression by 30-80 % [83]. In the developing zebrafish, the presence of uORFs had similar repressive effects on mRNA translation as miR-430-mediated translational inhibition during maternal-zygotic transition [85], demonstrating that uORFs are potent regulatory elements in post-transcriptional gene expression.

Moreover, uORFs are evolutionary conserved, though different studies claimed varying levels of conservation. While analyses of stringently selected sets of orthologous transcripts between human and mouse [318] and rat [314] indicated that uORF occurrence is rather a species-specific event, another study of global translation initiation sequencing in MEFs and HEK293 cells found that 64-85 % of upstream translation initiation sites were conserved between both cell lines [96]. Nevertheless, the repressiveness of uORFs is ubiquitous [82], [83], [319] and the conservation of functionally validated and confidently translated uORFs is high among vertebrates [85], [314]. As uORFs are less frequently present within 5’UTRs than expected, it seems that purifying selection triggered the conservation of functionally active uORFs, whereas non-functional AUG codons were eliminated from upstream sequences [314], [320]–[322].

Besides, upstream translation initiation can take place at near-cognate start codons. In fact, recent methodological advances that allowed for global mapping of translation events at nucleotide resolution revealed that not more than a quarter of upstream translation initiation events occurs at AUG codons meaning that the vast majority of uORF translation is indeed initiated at non-AUG codons with CUG (~30 %) being the most prominent one [91], [96].

The regulatory impact of an uORF depends largely on its position within the 5’UTR, adjacent secondary and tertiary mRNA structures, specific features of its encoded peptide or of the 5’UTR, as well as alternative promoter usage, alternative splicing and further cell type-specific trans acting factors [86]. Together these features determine a variety of mechanisms like leaky scanning, reinitiation, stalling and potentially shunting of ribosomes that shape the translation efficiency of authentic downstream ORFs. Leaky scanning through uORFs for example, is greatly influenced by the context of the surrounding Kozak sequence. Especially the -3 and +4 position (A of AUG = +1) of the Kozak sequence have been shown to be substantial in defining the strength of translation initiation at the start codon [17], [323]. Plenty of studies showed that the start codon context of uORFs is overall weaker than the start codon context of main ORFs and

56

that a stronger Kozak sequence context of an uORF correlates with higher levels of translational repression of the main ORF [18], [83]–[85], [96], [314], [318], [319], [324], [325]. Hence, leaky scanning at uORFs is controlled by the Kozak context which thereby controls the translational efficiency of main ORFs.

Reinitiation of ribosomes after uORF translation is mainly dependent on the uORF location within the 5’UTR, uORF length and downstream secondary and tertiary mRNA structures. It was shown that reinitiation is more efficient, when the intercistronic distance is long and the length of the uORF is short [101], [102], [109], [326]. The theory behind is that elongating ribosomes are turning over crucial reinitiation factors which need to reassociate in order to recycle post-termination ribosomes for downstream translation.

Classical examples how reinitiation after uORF translation controls downstream gene expression in response to cellular stresses are GCN4 in yeast and ATF4 in humans [327] [328].

The ATF4 mRNA contains two uORFs of which the 5’-proximal one encodes a peptide of three amino acids, while the downstream one is overlapping with the ATF4 CDS [328]. After translating the first uORF, ribosomes reinitiate at the second uORF which greatly represses translation of the overlapping ATF4 CDS [328]. However, stress conditions induce the phosphorylation of eIF2α which abrogates eIF2-GTP-Met-tRNAMeti complex formation. Under such conditions, reinitiation of ribosomes after uORF1 translation is delayed as loading of eIF2-GTP-Met-tRNAMeti complex takes longer resulting in scanning through uORF2 and reinitiation at the ATF4 CDS instead [328].

Translation efficiency of a downstream gene can also be controlled by ribosome stalling at an uORF. One well-known example is the mammalian AdoMetDC, an important gene of the polyamine biosynthesis pathway. AdoMetDC translation rates are linked to cellular polyamine concentrations by a regulatory feedback loop that involves ribosome stalling at the termination site of a six codon inhibitory uORF [316]. Ribosome stalling thereby depends on the uORF encoded peptide, which interacts in cis with terminating ribosomes [316], [329]. Another example, where an uORF encoded peptide represses main ORF translation in trans, is AS. The AS mRNA shows tissue-specific 5’-UTR isoform expression due to alternative transcription start site usage [316]. Longer isoforms contain a uORF encoding a 44 aa peptide that represses endogenous AS translation in bovine endothelial cells in a dosage dependent manner upon transfection of a AS-uORF construct [25].

Lately, it was shown that uORF-mediated gene expression cannot only be controlled by uORF encoded peptides but also by interacting RBPs. One example is the translational control of msl-2 mRNA in Drosophila, which is repressed by SXL protein. SXL binding downstream of a short uORF promotes recognition and translation of the uAUG, while main ORF translation is

57

inhibited presumably via SXL interference with ribosomal factors inducing conformational changes of the preinitiation complex needed for subsequent elongation [331]. The second example comes from AXIIR, which is very lowly expressed in most cell types [332]. AXIIR translational regulation is complex and involves four uORFs, two hnRNPs and HuR protein which cooperate to build a multiple fail-safe regulation system ensuring AXIIR expression levels [332]. In this system, the RBP binding most likely alleviate ribosome leaky scanning at uORFs with moderately strong Kozak sequence [332].

Of particular interest are reports on uORFs within IRESs that coordinate main ORF expression through functional and structural interdependency. One example is VEGF gene regulation which involves in addition to a short 3 codon uORF and two putative IRESs also an internal promoter, alternative CUG start codons, alternative splicing, miRNA binding sites, a riboswitch, alternative polyadenylation, a G-quadruplex structure and RBP binding to AU-rich elements [333], [334]. Translation initiation at the VEGF 121 isoform takes places at CUG codons only, whereas larger mRNA isoforms are efficiently translated at both CUG and AUG initiation sites [334]. The mechanism of translation inhibition at the AUG initiation site of VEGF 121 isoform involves the exon sequence and parts of the 5’UTR, which contains a uORF within an putative IRES, while the 3’UTR is not part of the regulatory module [333]. Transient transfections of mutation constructs in HeLa cells suggest, that the exon sequence of the VEGF 121 isoform promotes uORF translation and inhibits reinitiation at the main ORF, while longer VEGF isoforms stimulate the translation initiation at the main ORF and permit reinitiation after uORF translation [333]. The mode of translation initiation at the uORF is eIF4E-independent [333]. Probably, ITAFs interact with longer VEGF isoforms that substitute post-termination ribosomes after IRES-mediated uORF translation with initiation factors needed for reinitiation at the main ORF. On the other hand, it is possible that ITAFs interact with the shorter VEGF 121 isoform to block ribosome recycling after uORF translation. Interestingly, it was found that reinitiation after IRES-mediated uORF translation can only happen, when the eIF4F complex, or at least a fragment of eIF4G, is involved in primary initiation, suggesting that their interaction needs to be maintained for ribosome reinitiation [335]. This suggests that VEGF IRES-mediated uORF translation requires eIF4F family members except for eIF4E.

One more proposed example of uORF-IRES coordinated gene expression is the human GARS mRNA. GARS is expressed in two mRNA isoforms of which the longer one contains a 32 codon long uORF and cap-independent translation activity in in vitro translation assays [336].

The uORF overlaps with the 5’ proximal start codon of the mitochondrial GARS ORF so that mitochondrial GARS expression is efficiently blocked. However, translation of the cytosolic GARS ORF from a start codon downstream of the uORF is unrepressed so that the long mRNA

58

isoform exclusively expresses cytosolic GARS [336]. Subcellular fractionation revealed that both the short and the long mRNA isoform associate with the ER but only the long mRNA isoform is locally translated [336]. Although it still needs to be verified that cap-independent translation of the longer GARS isoform is indeed mediated by an IRES (supplementary promoterless control experiments reveal cryptic promoter activity of bicistronic GARS reporters), this is still an interesting combination of relaxed cap-dependency, a repressive uORF and localized translation on a single transcript. It will be exciting to see how the interplay of individual regulatory elements works in selective GARS isoform expression.

In conclusion, uORF-mediated translational control of gene expression is highly diverse including various modes of repression that can be dynamically regulated in response to altering cellular conditions. Progress in genome wide analyses enabled detection and quantification of uORF translation together with gene translation efficiencies, elucidating the wide range of uORF activities. Of special interest for future analyses are uORF and IRES mediated upregulation of gene expression during global translational repression and eIF2α phosphorylation. Insights in these mechanisms are needed to understand and overcome similar events happening in cell transformation during tumorigenesis. Further, a number of single nucleotide polymorphisms (SNPs) have been identified to disturb uORF-mediated translational control of gene expression resulting in human diseases [83], [315]. However, the impact of uORF translation on pathogenic gene expression is likely still underestimated and desires further in-depth analyses.