NCBI Austrofundulus limnaeus Annotation Release 100

The RefSeq genome records for Austrofundulus limnaeus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results.

The annotation products are available in the sequence databases and on the FTP site.

This report provides:

Annotation Release information: The name of the release, important dates, the software version
Assemblies: A brief description of the annotated assembly(ies)
Gene and feature statistics: The counts and characteristics of the annotated features
Alignment of the annotated proteins to a set of high-quality proteins: The number of annotated proteins with hits to a set of high-quality proteins
Masking of genomic sequence: How much of the genome was masked
Transcript and protein alignments: The number and type of evidence retrieved from public databases and used for gene prediction

For more information on the annotation process, please visit the NCBI Eukaryotic Genome Annotation Pipeline page.

Annotation Release information

This annotation should be referred to as NCBI Austrofundulus limnaeus Annotation Release 100

Annotation release ID: 100
Date of Entrez queries for transcripts and proteins: Sep 2 2015
Date of submission of annotation to the public databases: Sep 17 2015
Software version: 6.4

Assemblies

The following assemblies were included in this annotation run:

Assembly name	Assembly accession	Submitter	Assembly date	Reference/Alternate	Assembly content
Austrofundulus_limnaeus-1.0	GCF_001266775.1	Center for Life in Extreme Environments at Portland State University, Portland, OR	07-28-2015	Reference	unplaced scaffolds

Gene and feature statistics

Counts and length of annotated features are provided below for each assembly.

Feature counts

Feature	Austrofundulus_limnaeus-1.0
Genes and pseudogenes	26,712
protein-coding	23,844
non-coding	2,313
pseudogenes	555
genes with variants	6,136
mRNAs	35,329
fully-supported	33,945
with > 5% ab initio	552
partial	11,187
with filled gap(s)	10,926
known RefSeq (NM_)	0
model RefSeq (XM_)	35,329
Other RNAs	3,629
fully-supported	3,188
with > 5% ab initio	0
partial	36
with filled gap(s)	36
known RefSeq (NR_)	0
model RefSeq (XR_)	3,188
CDSs	35,368
fully-supported	33,945
with > 5% ab initio	639
partial	9,918
with major correction(s)	3,568
known RefSeq (NP_)	0
model RefSeq (XP_)	35,329

Detailed reports

Feature lengths

Feature	Count	Mean length (bp)	Median length (bp)	Min length (bp)	Max length (bp)
Genes	26,157	18,385	8,286	71	638,625
All transcripts	38,958	2,868	2,288	71	88,923
mRNA	35,329	3,063	2,454	183	88,923
misc_RNA	370	2,747	2,371	115	13,579
tRNA	441	74	73	71	87
lncRNA	2,818	884	676	89	6,645
Single-exon transcripts	1,638	1,516	1,291	183	7,265
coding transcripts (NM_/XM_ )	1,638	1,516	1,291	183	7,265
CDSs	35,329	2,011	1,434	96	87,576
Exons	246,940	264	137	2	17,283
in coding transcripts (NM_/XM_ )	239,150	262	136	2	17,283
in non-coding transcripts (NR_/XR_ )	10,381	281	140	2	7,351
Introns	215,079	2,084	353	8	369,543
in coding transcripts (NM_/XM_ )	210,044	2,037	351	8	369,543
in non-coding transcripts (NR_/XR_ )	7,596	3,392	463	30	180,124

Transcripts per gene, exons per transcript

	Mean	Median	Min	Max
Number of transcripts per gene	1.48	1	1	33
Number of exons per transcript	11	8	1	210

Alignment of the annotated proteins to a set of high-quality proteins

The final set of annotated proteins was searched with BLASTP against the prot-swissprot-euk, using the annotated proteins as the query and the high-quality proteins as the target. Out of 23805 coding genes, 21892 genes had a protein with an alignment covering 50% or more of the query and 10523 had an alignment covering 95% or more of the query.

Definition of query and target coverage. The query coverage is the percentage of the annotated protein length that is included in the alignment. The target coverage is the percentage of the target length that is included in the alignment.

Below is a cumulative graph displaying the number of genes with alignments above a given query or target coverage threshold. For comparison, corresponding statistics for other organisms annotated by the NCBI eukaryotic annotation pipeline were added to the graph.

Query: annotated proteins
Target: prot-swissprot-euk

Masking of genomic sequence

Transcript and protein alignments are performed on the repeat-masked genome. Below are the percentages of genomic sequence masked by WindowMasker and RepeatMasker for each assembly. RepeatMasker results are only used for organisms for which a comprehensive repeat library is available.

For this annotation run, transcripts and proteins were aligned to the genome masked with WindowMasker only.

Assembly name	Assembly accession	% Masked with RepeatMasker	% Masked with WindowMasker
Austrofundulus_limnaeus-1.0	GCF_001266775.1	2.12%	27.83%

Transcript and protein alignments

The annotation pipeline relies heavily on alignments of experimental evidence for gene prediction. Below are the sets of transcripts and proteins that were retrieved from Entrez, aligned to the genome by Splign or ProSplign and passed to Gnomon, NCBI's gene prediction software.

Depending on the other evidence available, long 454 reads (with average length above 250 nt) may be aligned as traditional evidence and reported in the Transcript alignments section or aligned with short reads and reported in the Short read transcript alignments section.

Transcript alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by Splign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Same-species EST	91	76 (83.52%)	63 (69.23%)	98.11%	98.03%

Short read transcript alignments

The following short reads (RNA-Seq) from the Sequence Read Archive were also used for gene prediction:

Hide alignments statistics, by sample (SAME, SAMN, SAMD, DRS)

Sample Id	Track name	Number of reads	Percent aligned reads	Percent spliced reads	Number of introns
All	Aggregate of all aligned samples	9,297,509,096	73%	20%	289,674
SAMN03610086	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610086)	65,482,658	63%	19%	92,998
SAMN03610087	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610087)	27,781,008	69%	23%	100,829
SAMN03610088	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610088)	80,461,834	83%	27%	143,566
SAMN03610089	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610089)	104,292,042	41%	12%	43,099
SAMN03610090	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610090)	98,800,042	64%	19%	90,811
SAMN03610091	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610091)	36,106,306	74%	22%	96,004
SAMN03610092	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610092)	34,812,044	77%	23%	81,084
SAMN03610093	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610093)	169,794,668	84%	28%	162,909
SAMN03610094	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610094)	57,047,140	83%	28%	131,874
SAMN03610095	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610095)	60,816,106	77%	25%	134,021
SAMN03610096	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610096)	233,634,042	70%	23%	123,418
SAMN03610097	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610097)	63,133,448	46%	15%	70,348
SAMN03610098	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610098)	68,727,284	81%	26%	109,230
SAMN03610099	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610099)	66,459,684	73%	23%	118,314
SAMN03610100	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610100)	163,202,322	55%	14%	37,348
SAMN03610101	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610101)	199,286,050	68%	18%	91,938
SAMN03610102	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610102)	131,091,500	70%	17%	91,826
SAMN03610103	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610103)	59,400,706	58%	11%	71,127
SAMN03610104	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610104)	67,256,334	68%	17%	120,882
SAMN03610105	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610105)	63,528,700	71%	19%	95,426
SAMN03610106	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610106)	34,349,674	71%	17%	58,273
SAMN03610107	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610107)	28,652,986	18%	1%	13,798
SAMN03610108	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610108)	27,716,546	42%	6%	43,004
SAMN03610109	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610109)	144,412,980	67%	18%	84,997
SAMN03610110	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610110)	64,929,310	60%	17%	35,027
SAMN03610111	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610111)	85,778,948	75%	21%	51,075
SAMN03610112	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610112)	57,940,924	47%	8%	64,285
SAMN03610113	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610113)	21,345,874	57%	14%	82,340
SAMN03610114	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610114)	38,045,142	9%	1%	25,972
SAMN03610115	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610115)	69,336,558	81%	22%	116,992
SAMN03610116	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610116)	93,772,654	81%	22%	157,011
SAMN03610117	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610117)	29,168,394	24%	4%	58,048
SAMN03610118	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610118)	95,343,360	77%	17%	112,161
SAMN03610119	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610119)	90,582,976	70%	20%	76,361
SAMN03610120	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610120)	121,305,688	75%	22%	167,104
SAMN03610121	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610121)	85,055,452	66%	16%	92,916
SAMN03610122	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610122)	107,454,260	75%	20%	110,095
SAMN03610123	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610123)	68,019,586	72%	18%	72,284
SAMN03610124	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610124)	60,404,380	41%	11%	82,849
SAMN03610125	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610125)	61,300,536	71%	19%	93,211
SAMN03610126	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610126)	59,249,360	72%	20%	134,118
SAMN03610127	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610127)	37,023,220	71%	15%	81,029
SAMN03610128	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610128)	29,477,244	49%	6%	60,326
SAMN03610129	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610129)	21,891,492	57%	15%	67,037
SAMN03610130	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610130)	109,483,322	76%	23%	100,551
SAMN03610131	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610131)	115,736,512	78%	21%	176,582
SAMN03610132	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610132)	80,006,994	71%	19%	67,173
SAMN03610133	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610133)	92,398,930	66%	19%	83,186
SAMN03610134	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610134)	19,854,232	65%	17%	93,967
SAMN03610135	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610135)	16,185,344	68%	18%	100,626
SAMN03610136	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610136)	112,272,176	81%	25%	181,275
SAMN03610137	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610137)	101,418,638	72%	20%	104,217
SAMN03610138	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610138)	42,293,650	34%	4%	40,182
SAMN03610139	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610139)	92,014,838	77%	24%	104,541
SAMN03610140	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610140)	130,577,092	81%	24%	158,637
SAMN03610141	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610141)	90,028,412	78%	23%	121,700
SAMN03610142	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610142)	70,210,446	48%	14%	180,861
SAMN03610143	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610143)	59,835,452	77%	22%	201,690
SAMN03610144	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610144)	64,311,724	79%	23%	203,431
SAMN03610145	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610145)	67,018,760	76%	22%	179,501
SAMN03610146	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610146)	59,184,032	81%	17%	139,302
SAMN03610147	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610147)	74,543,106	80%	3%	133,475
SAMN03610148	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610148)	58,775,514	74%	14%	91,720
SAMN03610149	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610149)	63,465,510	81%	20%	114,563
SAMN03610150	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610150)	69,387,632	73%	18%	145,760
SAMN03610151	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610151)	63,565,500	78%	20%	194,711
SAMN03610152	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610152)	71,305,514	80%	23%	199,366
SAMN03610153	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610153)	63,640,304	77%	21%	201,022
SAMN03610154	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610154)	58,746,908	80%	24%	205,392
SAMN03610155	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610155)	64,950,304	80%	21%	196,989
SAMN03610156	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610156)	73,241,526	80%	23%	191,417
SAMN03610157	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610157)	54,314,082	74%	22%	196,411
SAMN03610158	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610158)	64,106,198	78%	21%	168,815
SAMN03610159	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610159)	79,295,794	84%	4%	151,903
SAMN03610160	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610160)	60,838,212	81%	23%	196,168
SAMN03610161	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610161)	68,154,416	75%	22%	196,565
SAMN03610162	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610162)	67,033,662	79%	23%	190,657
SAMN03610163	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610163)	60,959,360	76%	20%	71,255
SAMN03610164	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610164)	61,698,948	81%	22%	181,928
SAMN03610165	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610165)	56,639,320	82%	23%	148,646
SAMN03610166	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610166)	59,341,026	76%	19%	139,163
SAMN03610167	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610167)	36,804,230	34%	6%	66,225
SAMN03610168	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610168)	75,217,912	82%	2%	119,475
SAMN03610169	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610169)	61,349,372	78%	21%	206,084
SAMN03610170	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610170)	69,046,344	80%	21%	158,871
SAMN03610171	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610171)	45,772,928	78%	18%	80,850
SAMN03610172	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610172)	125,268,064	80%	22%	195,090
SAMN03610173	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610173)	97,194,302	83%	20%	122,814
SAMN03610174	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610174)	75,454,724	84%	24%	191,862
SAMN03610175	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610175)	108,927,944	83%	22%	165,607
SAMN03610176	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610176)	114,527,434	83%	24%	182,048
SAMN03610177	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610177)	90,677,552	82%	23%	159,189
SAMN03610178	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610178)	119,442,778	69%	19%	134,443
SAMN03610179	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610179)	104,056,448	71%	20%	178,808
SAMN03610180	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610180)	126,606,308	51%	11%	82,536
SAMN03610181	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610181)	89,968,556	82%	24%	208,877
SAMN03610182	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610182)	94,413,338	81%	24%	202,609
SAMN03610183	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610183)	114,232,262	82%	25%	207,963
SAMN03610184	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610184)	88,758,100	82%	26%	218,725
SAMN03610185	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610185)	120,705,852	83%	25%	220,985
SAMN03610186	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610186)	153,625,546	84%	26%	228,467
SAMN03610187	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610187)	96,726,212	80%	24%	220,804
SAMN03610188	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610188)	99,235,558	66%	19%	198,941
SAMN03610189	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610189)	112,398,512	79%	22%	210,017
SAMN03610190	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610190)	99,499,326	82%	26%	220,441
SAMN03610191	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610191)	110,456,360	82%	25%	225,702
SAMN03610192	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610192)	94,891,942	82%	26%	191,324
SAMN03610193	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610193)	95,243,254	67%	20%	145,872
SAMN03610194	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610194)	104,694,732	81%	25%	184,632
SAMN03610195	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610195)	99,163,340	79%	25%	195,763
SAMN03610196	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610196)	101,876,908	78%	22%	152,611
SAMN03610197	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610197)	117,025,768	60%	18%	125,999
SAMN03610198	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610198)	132,599,150	82%	26%	184,824
SAMN03610199	Whole Embryo (Austrofundulus limnaeus, not collected, SAMN03610199)	121,143,188	82%	26%	178,502

Show alignments statistics, by run (ERR, SRR, DRR)

Run	Experiment	Project	Sample	Number of reads	Percent aligned reads	Percent spliced reads
SRR2032203	SRX1032647	SRP058458	SAMN03610086	65,482,658	63%	19%
SRR2032204	SRX1032648	SRP058458	SAMN03610087	27,781,008	69%	23%
SRR2032205	SRX1032649	SRP058458	SAMN03610088	80,461,834	83%	27%
SRR2032208	SRX1032652	SRP058458	SAMN03610089	104,292,042	41%	12%
SRR2032209	SRX1032653	SRP058458	SAMN03610090	98,800,042	64%	19%
SRR2032218	SRX1032662	SRP058458	SAMN03610091	36,106,306	74%	22%
SRR2032219	SRX1032663	SRP058458	SAMN03610092	34,812,044	77%	23%
SRR2032206	SRX1032650	SRP058458	SAMN03610093	169,794,668	84%	28%
SRR2032207	SRX1032651	SRP058458	SAMN03610094	57,047,140	83%	28%
SRR2032210	SRX1032654	SRP058458	SAMN03610095	60,816,106	77%	25%
SRR2032212	SRX1032656	SRP058458	SAMN03610096	233,634,042	70%	23%
SRR2032215	SRX1032659	SRP058458	SAMN03610097	63,133,448	46%	15%
SRR2032216	SRX1032660	SRP058458	SAMN03610098	68,727,284	81%	26%
SRR2032217	SRX1032661	SRP058458	SAMN03610099	66,459,684	73%	23%
SRR2032253	SRX1032697	SRP058458	SAMN03610100	163,202,322	55%	14%
SRR2032254	SRX1032698	SRP058458	SAMN03610101	199,286,050	68%	18%
SRR2032255	SRX1032699	SRP058458	SAMN03610102	131,091,500	70%	17%
SRR2032211	SRX1032655	SRP058458	SAMN03610103	59,400,706	58%	11%
SRR2032213	SRX1032657	SRP058458	SAMN03610104	67,256,334	68%	17%
SRR2032214	SRX1032658	SRP058458	SAMN03610105	63,528,700	71%	19%
SRR2032220	SRX1032664	SRP058458	SAMN03610106	34,349,674	71%	17%
SRR2032221	SRX1032665	SRP058458	SAMN03610107	28,652,986	18%	1%
SRR2032222	SRX1032666	SRP058458	SAMN03610108	27,716,546	42%	6%
SRR2032256	SRX1032700	SRP058458	SAMN03610109	144,412,980	67%	18%
SRR2032237	SRX1032681	SRP058458	SAMN03610110	64,929,310	60%	17%
SRR2032239	SRX1032682	SRP058458	SAMN03610111	85,778,948	75%	21%
SRR2032223	SRX1032667	SRP058458	SAMN03610112	57,940,924	47%	8%
SRR2032224	SRX1032668	SRP058458	SAMN03610113	21,345,874	57%	14%
SRR2032225	SRX1032669	SRP058458	SAMN03610114	38,045,142	9%	1%
SRR2032238	SRX1032683	SRP058458	SAMN03610115	69,336,558	81%	22%
SRR2032240	SRX1032684	SRP058458	SAMN03610116	93,772,654	81%	22%
SRR2032226	SRX1032670	SRP058458	SAMN03610117	29,168,394	24%	4%
SRR2032241	SRX1032685	SRP058458	SAMN03610118	95,343,360	77%	17%
SRR2032242	SRX1032686	SRP058458	SAMN03610119	90,582,976	70%	20%
SRR2032243	SRX1032687	SRP058458	SAMN03610120	121,305,688	75%	22%
SRR2032244	SRX1032688	SRP058458	SAMN03610121	85,055,452	66%	16%
SRR2032245	SRX1032689	SRP058458	SAMN03610122	107,454,260	75%	20%
SRR2032246	SRX1032690	SRP058458	SAMN03610123	68,019,586	72%	18%
SRR2032227	SRX1032671	SRP058458	SAMN03610124	60,404,380	41%	11%
SRR2032228	SRX1032672	SRP058458	SAMN03610125	61,300,536	71%	19%
SRR2032229	SRX1032673	SRP058458	SAMN03610126	59,249,360	72%	20%
SRR2032230	SRX1032674	SRP058458	SAMN03610127	37,023,220	71%	15%
SRR2032231	SRX1032675	SRP058458	SAMN03610128	29,477,244	49%	6%
SRR2032232	SRX1032676	SRP058458	SAMN03610129	21,891,492	57%	15%
SRR2032247	SRX1032691	SRP058458	SAMN03610130	109,483,322	76%	23%
SRR2032248	SRX1032692	SRP058458	SAMN03610131	115,736,512	78%	21%
SRR2032249	SRX1032693	SRP058458	SAMN03610132	80,006,994	71%	19%
SRR2032233	SRX1032677	SRP058458	SAMN03610133	92,398,930	66%	19%
SRR2032234	SRX1032678	SRP058458	SAMN03610134	19,854,232	65%	17%
SRR2032235	SRX1032679	SRP058458	SAMN03610135	16,185,344	68%	18%
SRR2032257	SRX1032701	SRP058458	SAMN03610136	112,272,176	81%	25%
SRR2032250	SRX1032694	SRP058458	SAMN03610137	101,418,638	72%	20%
SRR2032236	SRX1032680	SRP058458	SAMN03610138	42,293,650	34%	4%
SRR2032258	SRX1032702	SRP058458	SAMN03610139	92,014,838	77%	24%
SRR2032251	SRX1032695	SRP058458	SAMN03610140	130,577,092	81%	24%
SRR2032252	SRX1032696	SRP058458	SAMN03610141	90,028,412	78%	23%
SRR2032173	SRX1032617	SRP058458	SAMN03610142	70,210,446	48%	14%
SRR2032182	SRX1032628	SRP058458	SAMN03610143	59,835,452	77%	22%
SRR2032195	SRX1032639	SRP058458	SAMN03610144	64,311,724	79%	23%
SRR2032197	SRX1032641	SRP058458	SAMN03610145	67,018,760	76%	22%
SRR2032198	SRX1032642	SRP058458	SAMN03610146	59,184,032	81%	17%
SRR2032199	SRX1032643	SRP058458	SAMN03610147	74,543,106	80%	3%
SRR2032200	SRX1032644	SRP058458	SAMN03610148	58,775,514	74%	14%
SRR2032201	SRX1032645	SRP058458	SAMN03610149	63,465,510	81%	20%
SRR2032202	SRX1032646	SRP058458	SAMN03610150	69,387,632	73%	18%
SRR2032174	SRX1032618	SRP058458	SAMN03610151	63,565,500	78%	20%
SRR2032175	SRX1032619	SRP058458	SAMN03610152	71,305,514	80%	23%
SRR2032176	SRX1032620	SRP058458	SAMN03610153	63,640,304	77%	21%
SRR2032177	SRX1032621	SRP058458	SAMN03610154	58,746,908	80%	24%
SRR2032178	SRX1032622	SRP058458	SAMN03610155	64,950,304	80%	21%
SRR2032179	SRX1032623	SRP058458	SAMN03610156	73,241,526	80%	23%
SRR2032180	SRX1032624	SRP058458	SAMN03610157	54,314,082	74%	22%
SRR2032181	SRX1032625	SRP058458	SAMN03610158	64,106,198	78%	21%
SRR2032183	SRX1032626	SRP058458	SAMN03610159	79,295,794	84%	4%
SRR2032184	SRX1032627	SRP058458	SAMN03610160	60,838,212	81%	23%
SRR2032185	SRX1032629	SRP058458	SAMN03610161	68,154,416	75%	22%
SRR2032186	SRX1032630	SRP058458	SAMN03610162	67,033,662	79%	23%
SRR2032187	SRX1032631	SRP058458	SAMN03610163	60,959,360	76%	20%
SRR2032188	SRX1032632	SRP058458	SAMN03610164	61,698,948	81%	22%
SRR2032189	SRX1032633	SRP058458	SAMN03610165	56,639,320	82%	23%
SRR2032190	SRX1032634	SRP058458	SAMN03610166	59,341,026	76%	19%
SRR2032191	SRX1032635	SRP058458	SAMN03610167	36,804,230	34%	6%
SRR2032192	SRX1032636	SRP058458	SAMN03610168	75,217,912	82%	2%
SRR2032193	SRX1032637	SRP058458	SAMN03610169	61,349,372	78%	21%
SRR2032194	SRX1032638	SRP058458	SAMN03610170	69,046,344	80%	21%
SRR2032196	SRX1032640	SRP058458	SAMN03610171	45,772,928	78%	18%
SRR2032263	SRX1032707	SRP058458	SAMN03610172	125,268,064	80%	22%
SRR2032264	SRX1032708	SRP058458	SAMN03610173	97,194,302	83%	20%
SRR2032265	SRX1032709	SRP058458	SAMN03610174	75,454,724	84%	24%
SRR2032266	SRX1032710	SRP058458	SAMN03610175	108,927,944	83%	22%
SRR2032267	SRX1032711	SRP058458	SAMN03610176	114,527,434	83%	24%
SRR2032268	SRX1032712	SRP058458	SAMN03610177	90,677,552	82%	23%
SRR2032269	SRX1032713	SRP058458	SAMN03610178	119,442,778	69%	19%
SRR2032270	SRX1032714	SRP058458	SAMN03610179	104,056,448	71%	20%
SRR2032271	SRX1032715	SRP058458	SAMN03610180	126,606,308	51%	11%
SRR2032272	SRX1032716	SRP058458	SAMN03610181	89,968,556	82%	24%
SRR2032273	SRX1032717	SRP058458	SAMN03610182	94,413,338	81%	24%
SRR2032274	SRX1032718	SRP058458	SAMN03610183	114,232,262	82%	25%
SRR2032275	SRX1032719	SRP058458	SAMN03610184	88,758,100	82%	26%
SRR2032276	SRX1032720	SRP058458	SAMN03610185	120,705,852	83%	25%
SRR2032277	SRX1032721	SRP058458	SAMN03610186	153,625,546	84%	26%
SRR2032278	SRX1032722	SRP058458	SAMN03610187	96,726,212	80%	24%
SRR2032279	SRX1032723	SRP058458	SAMN03610188	99,235,558	66%	19%
SRR2032280	SRX1032724	SRP058458	SAMN03610189	112,398,512	79%	22%
SRR2032281	SRX1032725	SRP058458	SAMN03610190	99,499,326	82%	26%
SRR2032282	SRX1032726	SRP058458	SAMN03610191	110,456,360	82%	25%
SRR2032283	SRX1032727	SRP058458	SAMN03610192	94,891,942	82%	26%
SRR2032284	SRX1032728	SRP058458	SAMN03610193	95,243,254	67%	20%
SRR2032259	SRX1032703	SRP058458	SAMN03610194	104,694,732	81%	25%
SRR2032260	SRX1032704	SRP058458	SAMN03610195	99,163,340	79%	25%
SRR2032285	SRX1032729	SRP058458	SAMN03610196	101,876,908	78%	22%
SRR2032286	SRX1032730	SRP058458	SAMN03610197	117,025,768	60%	18%
SRR2032261	SRX1032705	SRP058458	SAMN03610198	132,599,150	82%	26%
SRR2032262	SRX1032706	SRP058458	SAMN03610199	121,143,188	82%	26%

Protein alignments

Source	Number of sequences retrieved from Entrez	Number (%) of sequences aligned by ProSplign	Number (%) of sequences passed to Gnomon	Average % identity	Average % coverage
Actinopteri GenBank	73,181	67,594 (92.37%)	67,594 (92.37%)	69.24%	75.56%
Actinopteri known RefSeq (NP_)	23,701	22,508 (94.97%)	22,508 (94.97%)	68.37%	73.54%
Homo sapiens GenBank	125,391	99,088 (79.02%)	99,088 (79.02%)	65.68%	64.52%
Homo sapiens known RefSeq (NP_)	39,276	32,770 (83.44%)	32,770 (83.44%)	65.88%	63.31%

References

RefSeq: Pruitt KD, Brown GR, Hiatt SM, Thibaud-Nissen F, Astashyn A, Ermolaeva O, Farrell CM, Hart J, Landrum MJ, McGarvey KM, Murphy MR, O'Leary NA, Pujar S, Rajput B, Rangwala SH, Riddick LD, Shkeda A, Sun H, Tamez P, Tully RE, Wallin C, Webb D, Weber J, Wu W, Dicuccio M, Kitts P, Maglott DR, Murphy TD, Ostell JM. Nucleic Acids Research 2014, 42(Database issue):D756-63
RepeatMasker: Smit AFA, Hubley R, Green P. RepeatMasker Open-3.0. 1996–2004. http://www.repeatmasker.org
WindowMasker: Morgulis A, Gertz EM, Schäffer AA, Agarwala R. Bioinformatics 2006, 2:134-41
Splign: Kapustin Y, Souvorov A, Tatusova T, Lipman D. Biology Direct 2008, 3:20

RefSeq

Integrated reference sequences