
Guillardia theta CCMP2712 Assembly and Gene Annotation
About Guillardia theta CCMP2712
Taxonomy ID 905079
Assembly
The G. theta AF041468 strain CCMP2712 nuclear genome is about 87 Mbp. The cell also retains a nucleomorph, the remnant nucleus of its algal endosymbiont, whose genome is organised on 3 chromosomes and is one of the smallest nuclear genomes sequenced to date, at 0.55 million base pairs. The G. theta genome is gene rich, with around 21,000 predicted protein-coding genes. The assembly and annotation were imported from European Nucleotide Archive (ENA) accession GCA_000315625.
Annotation
The Guillardia theta genome was sequenced, and its protein-coding genes annotated, by the DOE Joint Genome Institute (JGI).
Non-coding RNA genes have been annotated using tRNAscan-SE (Lowe, T.M. and Eddy, S.R. 1997), Rfam (Griffiths-Jones et al. 2005), and RNAmmer (Lagesen, K. et al. 2007); additional analysis tools have also been applied.
References
- Algal genomes reveal evolutionary mosaicism and the fate of nucleomorphs.
Curtis BA, Tanifuji G, Burki F, Gruber A, Irimia M, Maruyama S, Arias MC, Ball SG, Gile GH, Hirakawa Y et al. 2012. Nature. 492:59-65.
Picture credit: Courtesy Dr. Geoff McFadden, University of Melbourne, Australia.
More information
General information about this species can be found in Wikipedia.
Statistics
Summary
Assembly | Guith1, INSDC Assembly GCA_000315625.1, Dec 2012 |
Database version | 115.1 |
Golden Path Length | 87,266,873 |
Genebuild by | JGI |
Genebuild method | Import |
Data source | Joint Genome Institute |
Gene counts
Coding genes | 24,945 |
Non coding genes | 279 |
Small non coding genes | 277 |
Long non coding genes | 1 |
Misc non coding genes | 1 |
Pseudogenes | 1 |
Gene transcripts | 25,249 |
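
The figures in the Summary and Gene counts tables can be cross-checked with a little arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, the values are copied by hand from the tables above, and "coding genes per Mbp" is simply the coding gene count divided by the Golden Path Length, not a statistic reported by the source.

```python
# Values copied from the Summary and Gene counts tables above.
GOLDEN_PATH_LENGTH_BP = 87_266_873
CODING_GENES = 24_945
NON_CODING_TOTAL = 279
# Breakdown of non-coding genes as listed in the table.
NON_CODING_BY_CLASS = {"small": 277, "long": 1, "misc": 1}


def genes_per_mbp(gene_count: int, length_bp: int) -> float:
    """Return the number of genes per million base pairs."""
    return gene_count / (length_bp / 1_000_000)


def summarise() -> dict:
    """Check the non-coding breakdown and compute coding gene density."""
    return {
        # True when the three non-coding classes add up to the stated total.
        "non_coding_consistent": sum(NON_CODING_BY_CLASS.values()) == NON_CODING_TOTAL,
        # Illustrative density, rounded to one decimal place.
        "coding_genes_per_mbp": round(
            genes_per_mbp(CODING_GENES, GOLDEN_PATH_LENGTH_BP), 1
        ),
    }


if __name__ == "__main__":
    for key, value in summarise().items():
        print(f"{key}: {value}")
```

With the table values, the small, long and misc non-coding classes sum to the stated total of 279, and the coding gene density works out to roughly 286 genes per Mbp.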