Krishna, R, Xia, D, Sanderson, S, Shanmugasundram, A, Vermont, S, Bernal, A, Daniel-Naguib, G, Ghali, F, Brunk, BP, Roos, DS, Wastling, JM and Jones, AR (2015) A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondii and Neospora caninum. Proteomics, 15 (15). 2618 -2628.

A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondii and Neospora caninum.pdf - Published Version
Available under License Creative Commons Attribution.

Download (853kB) | Preview


Proteomics data can supplement genome annotation efforts, for example being used to confirm gene models or correct gene annotation errors. Here, we present a large-scale proteogenomics study of two important apicomplexan pathogens: Toxoplasma gondii and Neospora caninum. We queried proteomics data against a panel of official and alternate gene models generated directly from RNASeq data, using several newly generated and some previously published MS datasets for this meta-analysis. We identified a total of 201 996 and 39 953 peptide-spectrum matches for T. gondii and N. caninum, respectively, at a 1% peptide FDR threshold. This equated to the identification of 30 494 distinct peptide sequences and 2921 proteins (matches to official gene models) for T. gondii, and 8911 peptides/1273 proteins for N. caninum following stringent protein-level thresholding. We have also identified 289 and 140 loci for T. gondii and N. caninum, respectively, which mapped to RNA-Seq-derived gene models used in our analysis and apparently absent from the official annotation (release 10 from EuPathDB) of these species. We present several examples in our study where the RNA-Seq evidence can help in correction of the current gene model and can help in discovery of potential new genes. The findings of this study have been integrated into the EuPathDB. The data have been deposited to the ProteomeXchange with identifiers PXD000297and PXD000298.

Item Type: Article
Additional Information: Krishna, R. et al., 2015. A large-scale proteogenomics study of apicomplexan pathogens-Toxoplasma gondiiandNeospora caninum. PROTEOMICS, 15(15), pp.2618–2628. Published with Open Access and available from
Uncontrolled Keywords: Gene annotation, MS/MS, Microbiolgy, N. Caninum, Proteogenomics, T. gondii, Amino Acid Sequence, Apicomplexa, Databases, Genetic, Genes, Protozoan, Genomics, Molecular Sequence Annotation, Molecular Sequence Data, Neospora, Peptides, Proteome, Proteomics, Protozoan Proteins, Sequence Analysis, RNA, Sequence Homology, Amino Acid, Tandem Mass Spectrometry, Toxoplasma
Subjects: Q Science > QR Microbiology
Divisions: Faculty of Natural Sciences > School of Life Sciences
Related URLs:
Depositing User: Symplectic
Date Deposited: 20 Oct 2016 08:26
Last Modified: 30 Jun 2017 08:45

Actions (login required)

View Item View Item