Guillardia theta CCMP2712 Assembly and Gene Annotation
About Guillardia theta
Guillardia theta is a cryptomonad alga and an example of a cell-within-a-cell: a flagellate host cell, complete with mitochondria and nucleus, surrounds a plastid lying within a reduced cytoplasmic compartment that contains a vestigial nucleus (or nucleomorph). Nucleomorphs are the highly reduced nuclei of 'endosymbiotic' algal cells.
Assembly
G. theta AF041468 strain CCMP2712 has an 87 Mbp nuclear genome. Its nucleomorph genome is organised on 3 chromosomes and is one of the smallest nuclear genomes sequenced to date - 0.55 million base pairs. G. theta genomes are gene rich with around 21,000 predicted protein genes. The assembly and annotation were imported from European Nucleotide Archive (ENA) accession GCA_000315625.
Annotation
The protein-coding gene annotation of the Guillardia theta genome was generated by the DOE Joint Genome Institute (JGI).
Non-coding RNA genes have been annotated using tRNAscan-SE (Lowe, T.M. and Eddy, S.R. 1997), Rfam (Griffiths-Jones et al. 2005), and RNAmmer (Lagesen, K. et al. 2007); additional analysis tools have also been applied.
References
- Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs.
Curtis BA, Tanifuji G, Burki F, Gruber A, Irimia M, Maruyama S, Arias MC, Ball SG, Gile GH, Hirakawa Y et al. 2012. Nature. 492:59-65.
Picture credit: Courtesy Dr. Geoff McFadden, University of Melbourne, Australia.
More information
General information about this species can be found in Wikipedia.
Statistics
Summary
Assembly | Guith1, INSDC Assembly GCA_000315625.1, Dec 2012 |
Database version | 113.1 |
Golden Path Length | 87,266,873 |
Genebuild by | JGI |
Genebuild method | Import |
Data source | Joint Genome Institute |
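The summary rows above follow a simple "label | value |" layout, so they can be read into a lookup table with a few lines of code. The following is a minimal sketch in Python; the function name parse_stats and the variable names are illustrative assumptions, not part of any Ensembl tool.

```python
def parse_stats(lines):
    """Parse 'label | value |' rows into a dict (hypothetical helper)."""
    stats = {}
    for line in lines:
        # Split on the pipe and trim whitespace; the trailing pipe
        # leaves an empty final field, which is ignored.
        parts = [p.strip() for p in line.split("|")]
        if len(parts) < 2 or not parts[0]:
            continue  # skip headings or blank lines
        stats[parts[0]] = parts[1]
    return stats


# Rows copied from the Summary section of this page.
summary_rows = [
    "Assembly | Guith1, INSDC Assembly GCA_000315625.1, Dec 2012 |",
    "Database version | 113.1 |",
    "Golden Path Length | 87,266,873 |",
    "Genebuild by | JGI |",
    "Genebuild method | Import |",
    "Data source | Joint Genome Institute |",
]

summary = parse_stats(summary_rows)

# The Golden Path Length uses thousands separators; strip them to get an int.
golden_path_bp = int(summary["Golden Path Length"].replace(",", ""))
print(golden_path_bp)
```

Keeping the values as strings until they are needed avoids guessing at types for fields such as the assembly description, which contains commas of its own.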
Gene counts
Coding genes | 24,945 |
Non coding genes | 279 |
Small non coding genes | 277 |
Long non coding genes | 1 |
Misc non coding genes | 1 |
Pseudogenes | 1 |
Gene transcripts | 25,249 |
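The counts above can be cross-checked with simple arithmetic. The sketch below, in Python, confirms that the three non-coding subcategories add up to the non-coding total, then derives two rough ratios. Treating coding genes, non-coding genes and pseudogenes together as the gene total is an assumption made here for illustration, as are all the variable names.

```python
# Figures copied from the Summary and Gene counts sections of this page.
golden_path_bp = 87_266_873
gene_counts = {
    "coding": 24_945,
    "non_coding": 279,
    "small_non_coding": 277,
    "long_non_coding": 1,
    "misc_non_coding": 1,
    "pseudogenes": 1,
    "transcripts": 25_249,
}

# The small, long and misc categories should sum to the non-coding total.
non_coding_subtotal = (
    gene_counts["small_non_coding"]
    + gene_counts["long_non_coding"]
    + gene_counts["misc_non_coding"]
)
counts_consistent = non_coding_subtotal == gene_counts["non_coding"]

# Assumption: every gene falls into exactly one of these three classes.
total_genes = (
    gene_counts["coding"]
    + gene_counts["non_coding"]
    + gene_counts["pseudogenes"]
)

# Average transcripts per gene and average genome span per coding gene.
transcripts_per_gene = gene_counts["transcripts"] / total_genes
bp_per_coding_gene = golden_path_bp / gene_counts["coding"]

print(f"non-coding categories consistent: {counts_consistent}")
print(f"total genes (assumed definition): {total_genes}")
print(f"transcripts per gene: {transcripts_per_gene:.3f}")
print(f"bp per coding gene: {bp_per_coding_gene:.0f}")
```

Under these assumptions the annotation has close to one transcript per gene and roughly one coding gene per 3.5 kbp of assembled sequence.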