Codon usage for tomato (L. esculentum) and potato (S.
tuberosum) genes and comparison with codon usage in other
organisms.
An analysis of codon usage in L. esculentum and S.
tuberosum was performed based on a dataset of 150000 tomato EST
sequences and 40000 potato EST sequences. The following methodology
was used:
For tomato:
- 1500000 EST sequences were assembled into a unigene set,
resulting in 27000 unigenes.
- Each unigene was translated in 6 frames, and the sequence was
selected if it had one frame containing no stop codons.
- The resulting set of 5575 sequences was trimmed up to the first
"ATG" codon.
- A codon usage table based on these sequences was
generated.
For potato:
- 400000 EST sequences were assembled into a unigene set,
resulting in 8000 unigenes.
- Each unigene was translated in 6 frames, and the sequence was
selected if it had one frame containing no stop codons.
- The resulting set of 2874 sequences was trimmed up to the first
"ATG" codon.
- A codon usage table based on these sequences was
generated.
The second step for each organism was done in order to identify
a subset of sequences with an obvious correct frame, and the third
step was used to eliminate the possibility of using 5' UTR while
building the codon usage tables.
The codon usage patterns in tomato and potato were compared with
each other, as well as A. thaliana, Z. mays, O.
sativa, S. cerevisiae, C. elegans and H.
sapiens. The published codon usage tables for these organisms
were retrieved from the Codon Usage Database.
The link below will display a histogram of the frequency of each
codon in the organisms mentioned above, grouped by the aminoacid
for which they code. Clicking on the name of an organism in the
histogram legend will display the codon usage table for that
organism.
Histogram comparison of codon
usage among organisms
You can also display idividual codon usage tables using the
links below:
If you have any questions regarding this data, please send email to
sgn-feedback@cornell.edu. |