Guillardia theta CCMP2712 (Guith1)

Guillardia theta CCMP2712 Assembly and Gene Annotation

About Guillardia theta

Guillardia theta is a cryptomonad alga, it is an example of a cell-within-a-cell, being composed of a flagellate host cell, complete with mitochondria and nucleus, surrounding a plastid lying within a reduced cytoplasmic compartment that contains a vestigial nucleus (or nucleomorph). Nucleomorphs are the highly reduced nuclei of 'endosymbiotic' algal cells.


G. theta AF041468 strain CCMP2712, 87Mbp genome is organised on 3 chromosomes and has one of the smallest nuclei sequenced to date - 0.55 million base pairs. G. theta genomes are gene rich with around 21,000 predicted protein genes. he assembly and annotation were imported from European Nucleotide Archive (ENA) accession GCA_000315625.


The protein coding gene annotation of the Guillardia theta genome was sequenced by DOE Joint genome Institute.

Non-coding RNA genes have been annotated using tRNAScan-SE (Lowe, T.M. and Eddy, S.R. 1997), RFAM (Griffiths-Jones et al 2005), and RNAmmer (Lagesen K.,et al 2007); additional analysis tools have also been applied.


  1. Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs.
    Curtis BA, Tanifuji G, Burki F, Gruber A, Irimia M, Maruyama S, Arias MC, Ball SG, Gile GH, Hirakawa Y et al. 2012. Nature. 492:59-65.

Picture credit: Courtesy Dr. Geoff McFadden, University of Melbourne, Australia.

More information

General information about this species can be found in Wikipedia.



AssemblyGuith1, INSDC Assembly GCA_000315625.1, Dec 2012
Database version111.1
Golden Path Length87,266,873
Genebuild byJGI
Genebuild methodImport
Data sourceJoint Genome Institute

Gene counts

Coding genes24,945
Non coding genes279
Small non coding genes277
Long non coding genes1
Misc non coding genes1
Gene transcripts25,249