marginale subspecies centrale (Israel strain). The data revealed that all msp2 and msp3 differences with <90% identity were accurately detected ( Table 1). We then compared the msp2 and msp3 pseudogenes in all 10 U.S. strains of A. marginale and A. marginale subspecies centrale, by the same method ( Table 2). The results showed selleck kinase inhibitor that no msp2 or msp3 pseudogene
from any of these strains of A. marginale from the United States was shared with A. marginale subspecies centrale. Indeed, there was substantial variation in the repertoire of the msp2 and msp3 pseudogenes even within U.S. A. marginale strains, with no msp2 or msp3 copy shared between Oklahoma and St. Maries, Idaho strains and only one of each shared between Oklahoma and Florida strains. Interestingly, there was substantial variation mTOR inhibitor even between strains from the same state, with no msp3 pseudogene shared between the two strains from Idaho and only two msp3 pseudogenes shared between the two strains from Florida (Okeechobee and Florida). In contrast, there was no variation detected between Florida and Florida relapse strains, suggesting that the differences observed reflected evolutionary changes rather than, for example, continuous variation by gene conversion among pseudogenes. It is known from previous analyses that msp2 and msp3 expression site sequences are different in Florida and Florida-relapse strains [10] and [11].
The most conserved msp2 or msp3 pseudogene was AM1250, absent in only 2/10 strains examined (WA-O and OK). We examined
whether the diversity observed in msp2 and msp3 genes was also reflected in differences in SNP profiles across the genome. High confidence differences between the genomes obtained using Roche/454 gsMapper software are shown in Table 3. Again, few differences were detected between the the previous Sanger and current Roche/454 data. Only 38 differences (at 100% frequency) were detected in the Florida strain genome and 84 in the St. Maries, Idaho genome by the two sequencing strategies. Similarly, there were few differences in the Florida relapse strain compared to Florida. Therefore, pyrosequencing data correlated well with the previously reported sequences from traditional Sanger sequencing. Comparison of pyrosequencing of the Florida strain with the previously reported sequence (CP001079) shows high confidence differences, possibly due to true SNPs or error, of one base per 31,643 nucleotides (at 100% frequency), while comparison of pyrosequencing of the St. Maries strain with the previously reported genome sequence (CP000030) yields a difference of one base per 14,258 nucleotides (at 100% frequency). As seen in previous strain comparisons [27], the number of single nucleotide polymorphisms (SNPs) between U.S. strains of A. marginale is variable, from 0.20% to 0.58% of the genome. However, all strains of A. marginale sensu stricto have significantly increased numbers of SNPs when compared to the A. marginale subsp. centrale strain, ranging from 1.