Guillardia theta CCMP2712 Assembly and Gene Annotation
About Guillardia theta
Guillardia theta is a cryptomonad alga and an example of a cell-within-a-cell: a flagellate host cell, complete with mitochondria and nucleus, surrounds a plastid lying within a reduced cytoplasmic compartment that contains a vestigial nucleus (the nucleomorph). Nucleomorphs are the highly reduced nuclei of endosymbiotic algal cells.
G. theta AF041468 strain CCMP2712 has an 87 Mbp nuclear genome. Its nucleomorph genome is organised on 3 chromosomes and, at 0.55 million base pairs, is one of the smallest nuclear genomes sequenced to date. G. theta genomes are gene rich, with around 21,000 predicted protein-coding genes. The assembly and annotation were imported from the European Nucleotide Archive (ENA), accession GCA_000315625.
The Guillardia theta genome was sequenced by the DOE Joint Genome Institute, which also provided the protein-coding gene annotation.
Non-coding RNA genes have been annotated using tRNAscan-SE (Lowe and Eddy 1997), Rfam (Griffiths-Jones et al. 2005), and RNAmmer (Lagesen et al. 2007); additional analysis tools have also been applied.
- Algal genomes reveal evolutionary mosaicism and the fate of
Curtis BA, Tanifuji G, Burki F, Gruber A, Irimia M, Maruyama S, Arias MC, Ball SG, Gile GH, Hirakawa Y et al. 2012. Nature. 492:59-65.
Picture credit: Courtesy Dr. Geoff McFadden, University of Melbourne, Australia.
General information about this species can be found in Wikipedia.
|Assembly||Guith1, INSDC Assembly GCA_000315625.1, Dec 2012|
|Golden Path Length (bp)||87,266,873|
|Data source||Joint Genome Institute|
|Non coding genes||279|
|Small non coding genes||277|
|Long non coding genes||1|
|Misc non coding genes||1|
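
The non-coding gene counts in the table can be cross-checked with a few lines of Python. This is only a minimal sketch: it assumes the small, long and misc categories together make up the "Non coding genes" total, which the page does not state explicitly, and `parse_rows` is a hypothetical helper written for this illustration rather than part of any Ensembl tooling.

```python
# Rows copied from the statistics table above, keeping its "||" separator.
ROWS = """\
|Non coding genes||279|
|Small non coding genes||277|
|Long non coding genes||1|
|Misc non coding genes||1|
"""


def parse_rows(text):
    """Return {label: integer value} for each '|label||value|' line."""
    stats = {}
    for line in text.splitlines():
        line = line.strip().strip("|")
        if not line:
            continue
        label, value = line.split("||", 1)
        # Remove thousands separators before converting to int.
        stats[label] = int(value.replace(",", ""))
    return stats


stats = parse_rows(ROWS)

# Assumption: the three sub-categories partition the non-coding total.
subtotal = (
    stats["Small non coding genes"]
    + stats["Long non coding genes"]
    + stats["Misc non coding genes"]
)
print(subtotal == stats["Non coding genes"])
```

With the values listed, the three categories sum to 279, matching the reported total.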