Summary of assembly and annotation
 
 Scientific name   JCM no.
 Cystobasidium pallidum   3780
Genome assembly:
ALLPATHS-LG (ver. 52155)
Genome assembly ver. 001
Estimated genome size   23,349,888
Sequence coverage   200
# of contigs   257
Total length (contigs)   21,258,753
N50 (contigs)   208,756
# of scaffolds   69
Total scaffold length (with gaps)   21,721,408
% gaps   2.1
N50 (scaffold)   1,290,484
GC%   49.7
Gene prediction:
  MAKER annotation pipeline (release 2.31.8)
    Gene predictors used: Augustus (3.0.3), SNAP (2013-02-16), GeneMark-ES (Suite 4.21)
    Reference used: Rhodosporidium toruloides
    tRNAscan-SE (version 1.23)
  Infernal cmscan (version 1.1.1)
    Database used: Rfam (Release 12.0)
Functional annotaion:
  Sma3S: 2013-09-01
    Database used: Uniprot-TrEMBL (release 2015_11), Uniprot-sprot (release 2015_11)
Gene prediction & funcational annotation ver. 001
# of predicted gene models   7,365
# of predicted transcript models   7,365
Average transcript length (bp)   1,997
Average coding length (bp)   1,584
Average protein length (aa)   528
Average number of exons per gene model   5.4
Average exon size (bp)   295
Average intron size (bp)   93
# of non-coding RNA   388
# of functionally annotated genes trembl   4,278
# of functionally annotated genes sprot   1,779
CEGMA complete (%)   95.97
CEGMA partial (%)   96.77