Tandem repetitions in transcriptomes of some Solanaceae species
Abstract
Characterization of occurrence, density and motif sequence of tandem repeats in the transcribed regions is helpful in understanding the functional significance of these repeats in the modern genomes. We analyzed tandem repeats present in expressed sequences of thirteen species belonging to genera Capsicum, Nicotiana, Petunia and Solanum of family Solanaceae and the genus Coffea of Rubiaceae to investigate the propagation and evolutionary sustenance of these repeats. Tandem repeat containing sequences constituted 1.58% to 7.46% of sequences analyzed. Tandem repetitions of size 2, 15, 18 and 21 bp motifs were more frequent. Repeats with unit sizes 21 and 22 bp were also abundant in genomic sequences of potato and tomato. While mutations occurring in these repeats may alter the repeat number, genomes adjust to these changes by keeping the translated products unaffected. Surprisingly, in majority of the species under study, tandem repeat motif length did not exceed 228 bp. Conserved tandem repeat motifs of sizes 180, 192 and 204 bp were also abundant in the genomic sequences. Our observations lead us to propose that these tandem repeats are actually remnants of ancestral megasatellite repeats, which have split into multiple repeats due to frequent insertions over the course of evolution.