About Guillardia theta

Guillardia theta is a cryptomonad alga and an example of a cell-within-a-cell: a flagellate host cell, complete with mitochondria and nucleus, surrounds a plastid lying within a reduced cytoplasmic compartment that contains a vestigial nucleus (the nucleomorph). Nucleomorphs are the highly reduced nuclei of 'endosymbiotic' algal cells.


G. theta (AF041468, strain CCMP2712) has a nuclear genome of about 87 Mbp. Its nucleomorph genome is organised on 3 chromosomes and, at 0.55 million base pairs, is one of the smallest nuclear genomes sequenced to date. The G. theta genome is gene rich, with around 21,000 predicted protein-coding genes. The assembly and annotation were imported from European Nucleotide Archive (ENA) accession GCA_000315625.


The Guillardia theta genome was sequenced, and its protein-coding genes annotated, by the DOE Joint Genome Institute.

Non-coding RNA genes have been annotated using tRNAscan-SE (Lowe, T.M. and Eddy, S.R. 1997), Rfam (Griffiths-Jones et al. 2005), and RNAmmer (Lagesen, K. et al. 2007); additional analysis tools have also been applied.


Picture credit: Courtesy Dr. Geoff McFadden, University of Melbourne, Australia.

Assembly: Guith1, INSDC Assembly GCA_000315625.1, Dec 2012
Database version: 111.1
Golden Path Length: 87,266,873 bp
Genebuild by: JGI
Genebuild method: Import
Data source: Joint Genome Institute

Gene counts

Coding genes: 24,945
Non coding genes: 279
Small non coding genes: 277
Long non coding genes: 1
Misc non coding genes: 1
Gene transcripts: 25,249
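
The figures listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python that copies the numbers from this page; the constant and function names are hypothetical and chosen only for illustration. It checks that the three non-coding subcategories add up to the non-coding total, and derives two summary ratios: annotated genes per Mbp and transcripts per gene.

```python
# Figures copied from the statistics on this page.
GOLDEN_PATH_LENGTH_BP = 87_266_873
GENE_COUNTS = {
    "coding": 24_945,
    "non_coding": 279,
    "small_non_coding": 277,
    "long_non_coding": 1,
    "misc_non_coding": 1,
}
GENE_TRANSCRIPTS = 25_249


def non_coding_subtotal(counts):
    """Sum the small, long and misc non-coding subcategories."""
    return (
        counts["small_non_coding"]
        + counts["long_non_coding"]
        + counts["misc_non_coding"]
    )


def genes_per_mbp(counts, length_bp):
    """Annotated genes (coding + non-coding) per million base pairs."""
    total_genes = counts["coding"] + counts["non_coding"]
    return total_genes / (length_bp / 1_000_000)


def transcripts_per_gene(counts, transcripts):
    """Average number of transcripts per annotated gene."""
    total_genes = counts["coding"] + counts["non_coding"]
    return transcripts / total_genes


if __name__ == "__main__":
    subtotal = non_coding_subtotal(GENE_COUNTS)
    print("Non-coding subtotal matches total:", subtotal == GENE_COUNTS["non_coding"])
    print("Genes per Mbp:", round(genes_per_mbp(GENE_COUNTS, GOLDEN_PATH_LENGTH_BP), 1))
    print("Transcripts per gene:", round(transcripts_per_gene(GENE_COUNTS, GENE_TRANSCRIPTS), 3))
```

With these numbers the average comes out only just above one transcript per gene, so most annotated genes have a single transcript.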