Analyses of whole genome suggest that many GTG start codons do not have strong SD sequences in Escherichia coli


Abstract

Two major signals on mRNA for translation initiation in E.coli are known, i.e. start codon, and Shine-Dalgarno sequence. E.coli has several kinds of start codons, i.e. AUG, GUG, and UUG. However, when start codons are GUG or UUG, translational efficiency is much less. In this case, strong Shine-Dalgarno sequence may restore the translational efficiency. By using unbiased start codons in the whole genome of E.coli, we have conducted the computational analysis of translation initiation sites to investigate whether Shine-Dalgarno sequences will be strong in case where the start codons are GTG. We have applied information theory for the accurate analysis. According to our results, there are many GTG start sequences with weak Shine-Dalgarno sequences.


Value of (B,L) in matrix Riw is calculated by the following formula.

By applying each sequence j to this matrix with the following formula, we obtain individual information Ri(j) according to the model of that matrix.

where s(B,L,j) = 1 if and only if there is base B at position L in the sequence j, otherwise s(B,L,j) = 0. (If you want to know more about this theory, look at Tom Schneider's page.)


Above:Distribution of Ri according to ATG model

Below:Transformed into Gaussian distribution


Above:Distribution of Ri according to GTG model

Below:Transformed into Gaussian distribution


Go back to my research page