|
|
|
Genome assembly: |
ALLPATHS-LG (ver. 52155) |
|
|
Genome assembly ver. 001 |
Estimated genome size |
|
21,341,943 |
Sequence coverage |
|
116 |
# of contigs |
|
145 |
Total length (contigs) |
|
19,128,883 |
N50 (contigs) |
|
415,963 |
# of scaffolds |
|
32 |
Total scaffold length (with gaps) |
|
19,377,211 |
% gaps |
|
1.3 |
N50 (scaffold) |
|
3,291,297 |
GC% |
|
30.4 |
|
|
Gene prediction: |
|
MAKER annotation pipeline (release 2.31.8) |
|
|
Gene predictors used: Augustus (3.0.3), SNAP (2013-02-16), GeneMark-ES (Suite 4.21) |
|
|
Reference used: Hansenula polymorpha |
|
|
tRNAscan-SE (version 1.23) |
|
Infernal cmscan (version 1.1.1) |
|
|
Database used: Rfam (Release 12.0) |
Functional annotaion: |
|
Sma3S: 2013-09-01 |
|
|
Database used: Uniprot-TrEMBL (release 2015_11), Uniprot-sprot (release 2015_11) |
|
|
Gene prediction & funcational annotation ver. 001 |
# of predicted gene models |
|
6,053 |
# of predicted transcript models |
|
6,053 |
Average transcript length (bp) |
|
1,882 |
Average coding length (bp) |
|
1,629 |
Average protein length (aa) |
|
543 |
Average number of exons per gene model |
|
1.5 |
Average exon size (bp) |
|
1,073 |
Average intron size (bp) |
|
481 |
# of non-coding RNA |
|
750 |
# of functionally annotated genes trembl |
|
4,212 |
# of functionally annotated genes sprot |
|
2,645 |
CEGMA complete (%) |
|
95.16 |
CEGMA partial (%) |
|
96.37 |
|