|
|
|
Genome assembly: |
ALLPATHS-LG (ver. 52155) |
|
|
Genome assembly ver. 001 |
Estimated genome size |
|
29,169,862 |
Sequence coverage |
|
156 |
# of contigs |
|
173 |
Total length (contigs) |
|
27,611,045 |
N50 (contigs) |
|
380,761 |
# of scaffolds |
|
63 |
Total scaffold length (with gaps) |
|
27,756,302 |
% gaps |
|
0.5 |
N50 (scaffold) |
|
1,182,044 |
GC% |
|
43.2 |
|
|
Gene prediction: |
|
MAKER annotation pipeline (release 2.31.8) |
|
|
Gene predictors used: Augustus (3.0.3), SNAP (2013-02-16), GeneMark-ES (Suite 4.21) |
|
|
Reference used: Malassezia globosa |
|
|
tRNAscan-SE (version 1.23) |
|
Infernal cmscan (version 1.1.1) |
|
|
Database used: Rfam (Release 12.0) |
Functional annotaion: |
|
Sma3S: 2013-09-01 |
|
|
Database used: Uniprot-TrEMBL (release 2015_11), Uniprot-sprot (release 2015_11) |
|
|
Gene prediction & funcational annotation ver. 001 |
# of predicted gene models |
|
9,297 |
# of predicted transcript models |
|
9,297 |
Average transcript length (bp) |
|
2,178 |
Average coding length (bp) |
|
1,670 |
Average protein length (aa) |
|
556 |
Average number of exons per gene model |
|
4.2 |
Average exon size (bp) |
|
393 |
Average intron size (bp) |
|
155 |
# of non-coding RNA |
|
1,034 |
# of functionally annotated genes trembl |
|
5,200 |
# of functionally annotated genes sprot |
|
1,886 |
CEGMA complete (%) |
|
93.95 |
CEGMA partial (%) |
|
94.35 |
|