The large number of organisms and of genes sequenced at the present time permits now to study molecular evolution in such lower clades as genera, species, and subspecies. Here, we focus our attention on the genus Mycobacterium, in which we examined codon and aminoacid usage in 13 species, and in 12 subspecies for a total of 8,836,513 codons from 26,755 sequences. Within the genus Mycobacterium, frequencies of codon and aminoacid usage correlate between species and between subspecies. In the groups studied, aminoacid molecular weight and codon degeneracy influence correlations between frequencies, while GC content is the main factor influencing the effective number of codons. The coding GC, which is highly correlated with total genomic GC content, seems to be the main factor influencing present synonymous codon usage in the genus. In particular, the GC content at the 3rd base position seems to shape heavily the effective number of codons, giving indication that here mutational bias dominates over translational selection. Evolutionary trees based on codon and aminoacid usage are consistent with traditional phylogenies of species within the genus.

Codon and Aminoacid Usage Patterns in Mycobacteria

SCAPOLI, Chiara;DE LORENZI, Sonia;CARRIERI, Alberto;SALVATORELLI, Germano;BARRAI, Italo Enrico
2009

Abstract

The large number of organisms and of genes sequenced at the present time permits now to study molecular evolution in such lower clades as genera, species, and subspecies. Here, we focus our attention on the genus Mycobacterium, in which we examined codon and aminoacid usage in 13 species, and in 12 subspecies for a total of 8,836,513 codons from 26,755 sequences. Within the genus Mycobacterium, frequencies of codon and aminoacid usage correlate between species and between subspecies. In the groups studied, aminoacid molecular weight and codon degeneracy influence correlations between frequencies, while GC content is the main factor influencing the effective number of codons. The coding GC, which is highly correlated with total genomic GC content, seems to be the main factor influencing present synonymous codon usage in the genus. In particular, the GC content at the 3rd base position seems to shape heavily the effective number of codons, giving indication that here mutational bias dominates over translational selection. Evolutionary trees based on codon and aminoacid usage are consistent with traditional phylogenies of species within the genus.
Scapoli, Chiara; Bartolomei, E.; DE LORENZI, Sonia; Carrieri, Alberto; Salvatorelli, Germano; Rodriguez Larralde, A.; Barrai, Italo Enrico
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11392/533949
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 2
social impact