|
|
|
Genome assembly: |
ALLPATHS-LG (ver. 52155) |
|
|
Genome assembly ver. 001 |
Estimated genome size |
|
23,349,888 |
Sequence coverage |
|
200 |
# of contigs |
|
257 |
Total length (contigs) |
|
21,258,753 |
N50 (contigs) |
|
208,756 |
# of scaffolds |
|
69 |
Total scaffold length (with gaps) |
|
21,721,408 |
% gaps |
|
2.1 |
N50 (scaffold) |
|
1,290,484 |
GC% |
|
49.7 |
|
|
Gene prediction: |
|
MAKER annotation pipeline (release 2.31.8) |
|
|
Gene predictors used: Augustus (3.0.3), SNAP (2013-02-16), GeneMark-ES (Suite 4.21) |
|
|
Reference used: Rhodosporidium toruloides |
|
|
tRNAscan-SE (version 1.23) |
|
Infernal cmscan (version 1.1.1) |
|
|
Database used: Rfam (Release 12.0) |
Functional annotaion: |
|
Sma3S: 2013-09-01 |
|
|
Database used: Uniprot-TrEMBL (release 2015_11), Uniprot-sprot (release 2015_11) |
|
|
Gene prediction & funcational annotation ver. 001 |
# of predicted gene models |
|
7,365 |
# of predicted transcript models |
|
7,365 |
Average transcript length (bp) |
|
1,997 |
Average coding length (bp) |
|
1,584 |
Average protein length (aa) |
|
528 |
Average number of exons per gene model |
|
5.4 |
Average exon size (bp) |
|
295 |
Average intron size (bp) |
|
93 |
# of non-coding RNA |
|
388 |
# of functionally annotated genes trembl |
|
4,278 |
# of functionally annotated genes sprot |
|
1,779 |
CEGMA complete (%) |
|
95.97 |
CEGMA partial (%) |
|
96.77 |
|