fBetSpl5.4

Organism name:
Betta splendens (Siamese fighting fish)
BioSample:
SAMEA104381735
BioProject:
PRJEB30365
Submitter:
Wellcome Sanger Institute
Date:
2023/03/10
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_900634795.4 (latest)
RefSeq assembly accession:
GCF_900634795.4 (latest)
RefSeq assembly and GenBank assembly identical:
no
  • Only in RefSeq: chromosome MT
  • Data displayed for RefSeq version
WGS Project:
CAAAFW04
Assembly method:
various
Genome coverage:
48x
Sequencing technology:
PacBio,Illumina
Linked assembly:
GCA_900651605.2 (alternate pseudohaplotype of diploid)

IDs: 16215421 [UID] 41561838 [GenBank] 42248668 [RefSeq]
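The GenBank and RefSeq accessions above share the same numeric part and differ only in prefix (GCA versus GCF), while the linked alternate pseudohaplotype has a different number. A minimal sketch of splitting such accessions into their parts follows. The helper names `parse_accession` and `is_paired` are hypothetical, and the pattern assumes the usual `GCA_`/`GCF_` prefix, nine digits, and a dot-separated version. Versions of paired assemblies need not match in general, so they are not compared here.

```python
import re
from typing import NamedTuple


class AssemblyAccession(NamedTuple):
    prefix: str   # "GCA" (GenBank) or "GCF" (RefSeq)
    number: str   # nine-digit identifier
    version: int  # integer after the dot


# Assumed layout: prefix, underscore, nine digits, dot, version.
_ACCESSION_RE = re.compile(r"^(GCA|GCF)_(\d{9})\.(\d+)$")


def parse_accession(text: str) -> AssemblyAccession:
    """Split an assembly accession such as 'GCF_900634795.4' into its parts."""
    match = _ACCESSION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not an assembly accession: {text!r}")
    return AssemblyAccession(match.group(1), match.group(2), int(match.group(3)))


def is_paired(a: str, b: str) -> bool:
    """True if one accession is GenBank, the other RefSeq, and the numbers match."""
    x, y = parse_accession(a), parse_accession(b)
    return x.prefix != y.prefix and x.number == y.number


if __name__ == "__main__":
    print(parse_accession("GCA_900634795.4"))
    print(is_paired("GCA_900634795.4", "GCF_900634795.4"))  # same number, GCA vs GCF
    print(is_paired("GCA_900651605.2", "GCF_900634795.4"))  # linked assembly, different number
```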

There are 5 assemblies for this organism


Comment

The assembly fBetSpl5.4 is based on 48x PacBio Sequel data and 83x coverage Illumina HiSeqX data from a single individual (fBetSpl5, sample SAMEA104381735). Scaffolding and curation also integrated 183x Illumina HiSeqX data generated from a 10X Genomics Chromium library ... and single-enzyme BioNano Irys data, both from a separate individual (fBetSpl1, sample SAMEA104381745). All data were generated at the Wellcome Sanger Institute. An initial PacBio assembly of fBetSpl5 was made using Falcon-unzip, and retained haplotigs were identified using purge_haplotigs. The primary contigs were then scaffolded using the 10X data from fBetSpl1 with scaff10x. After using the PacBio data to gap fill with PBJelly and polish with Arrow, the assembly was polished again using the fBetSpl5 Illumina data and freebayes. Finally, the assembly was manually improved using gEVAL to correct mis-joins and improve concordance with the BioNano data. By aligning with the existing Betta splendens assembly (GCA_003650155.1) and medaka, we were able to order and orient our scaffolds and place them onto chromosomes. Chromosomes are named by synteny to medaka.

Global statistics

Total sequence length:                          427,001,245
Total ungapped length:                          426,969,779
Gaps between scaffolds:                         0
Number of scaffolds:                            36
Scaffold N50:                                   19,821,211
Scaffold L50:                                   9
Number of contigs:                              352
Contig N50:                                     2,574,972
Contig L50:                                     46
Total number of chromosomes and plasmids:       22
Number of component sequences (WGS or clone):   36
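The difference between the total and ungapped lengths gives the number of gap bases in the assembly. The short sketch below works this out from the two figures reported above; the function name `gap_summary` is a hypothetical helper, not part of any NCBI tool.

```python
def gap_summary(total_length: int, ungapped_length: int) -> tuple[int, float]:
    """Return (gap bases, gap fraction of total length)."""
    if ungapped_length > total_length:
        raise ValueError("ungapped length cannot exceed total length")
    gap_bases = total_length - ungapped_length
    return gap_bases, gap_bases / total_length


if __name__ == "__main__":
    # Values copied from the Global statistics section.
    gaps, fraction = gap_summary(427_001_245, 426_969_779)
    print(f"gap bases: {gaps:,}")           # gap bases: 31,466
    print(f"gap fraction: {fraction:.4%}")
```

With these figures, gaps account for well under 0.01% of the total sequence length.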

Global assembly definition

Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCF_900634794.4)
Molecule name   GenBank sequence        RefSeq sequence   Unlocalized sequences count
Chromosome 1    LR132024.4         =    NC_040881.3       0
Chromosome 2    LR132025.3         =    NC_040882.2       0
Chromosome 3    LR132008.3         =    NC_040883.2       0
Chromosome 4    LR132005.3         =    NC_040884.2       0
Chromosome 5    LR132018.3         =    NC_040885.2       0
Chromosome 6    LR132013.3         =    NC_040886.2       0
Chromosome 7    LR132010.3         =    NC_040887.2       0
Chromosome 8    LR132020.3         =    NC_040888.2       0
Chromosome 9    LR132023.3         =    NC_040889.2       0
Chromosome 10   LR132007.3         =    NC_040890.2       0
Chromosome 11   LR132019.3         =    NC_040891.2       0
Chromosome 12   LR132017.3         =    NC_040892.2       0
Chromosome 13   LR132006.3         =    NC_040893.2       0
Chromosome 14   LR132016.3         =    NC_040894.2       0
Chromosome 15   LR132021.3         =    NC_040895.2       0
Chromosome 16   LR132009.3         =    NC_040896.2       0
Chromosome 17   LR132015.3         =    NC_040897.2       0
Chromosome 19   LR132011.3         =    NC_040898.2       0
Chromosome 21   LR132022.2         =    NC_040899.1       0
Chromosome 22   LR132012.3         =    NC_040900.2       0
Chromosome 24   LR132014.3         =    NC_040901.2       0
unplaced        n/a                n/a  n/a               14

Assembly statistics

Molecule        Total Length   Scaffold Count   Ungapped Length   Scaffold N50   Spanned Gaps   Unspanned Gaps
All             426,984,265    35               426,952,799       19,821,211     321            0
Chromosome 1    21,851,952     1                21,850,543        21,851,952     17             0
Chromosome 2    28,833,069     1                28,830,360        28,833,069     28             0
Chromosome 3    19,821,211     1                19,820,073        19,821,211     12             0
Chromosome 4    34,127,283     1                34,124,634        34,127,283     26             0
Chromosome 5    20,509,678     1                20,507,344        20,509,678     24             0
Chromosome 6    19,740,300     1                19,739,305        19,740,300     8              0
Chromosome 7    18,144,457     1                18,143,174        18,144,457     13             0
Chromosome 8    16,195,137     1                16,194,268        16,195,137     10             0
Chromosome 9    31,232,242     1                31,230,137        31,232,242     22             0
Chromosome 10   20,913,381     1                20,912,514        20,913,381     8              0
Chromosome 11   16,509,136     1                16,508,066        16,509,136     13             0
Chromosome 12   14,432,533     1                14,432,235        14,432,533     3              0
Chromosome 13   21,179,781     1                21,177,770        21,179,781     15             0
Chromosome 14   17,214,289     1                17,213,145        17,214,289     11             0
Chromosome 15   19,301,601     1                19,299,980        19,301,601     19             0
Chromosome 16   21,615,379     1                21,613,827        21,615,379     14             0
Chromosome 17   17,259,823     1                17,257,466        17,259,823     21             0
Chromosome 19   16,490,133     1                16,488,501        16,490,133     16             0
Chromosome 21   18,688,203     1                18,687,228        18,688,203     13             0
Chromosome 22   16,663,441     1                16,662,362        16,663,441     11             0
Chromosome 24   14,741,104     1                14,739,960        14,741,104     11             0
unplaced        1,520,132      14               1,519,907         814,136        6              0

Molecule           Total Length
Mitochondrion MT   16,980
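
The reported Scaffold N50 and L50 can be cross-checked against the per-molecule lengths in the Assembly statistics. The sketch below is an illustration, not NCBI's own implementation; `n50_l50` is a hypothetical helper. It assumes only the 21 chromosome lengths are needed: the individual unplaced scaffold lengths are not listed, but together they total 1,520,132 bp, so each is shorter than the smallest chromosome, as is the 16,980 bp mitochondrion. The full assembly length is passed in separately so that the 50% threshold still reflects every sequence.

```python
from typing import Iterable, Optional, Tuple


def n50_l50(lengths: Iterable[int], total: Optional[int] = None) -> Tuple[int, int]:
    """Return (N50, L50) for the given sequence lengths.

    If `total` is given, it is used as the assembly size instead of
    sum(lengths); this is valid only when every omitted sequence is
    shorter than the ones supplied.
    """
    ordered = sorted(lengths, reverse=True)
    if total is None:
        total = sum(ordered)
    half = total / 2
    running = 0
    for rank, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, rank
    raise ValueError("supplied lengths do not reach half of the stated total")


# Chromosome total lengths copied from the Assembly statistics table.
CHROMOSOME_LENGTHS = [
    21_851_952, 28_833_069, 19_821_211, 34_127_283, 20_509_678,
    19_740_300, 18_144_457, 16_195_137, 31_232_242, 20_913_381,
    16_509_136, 14_432_533, 21_179_781, 17_214_289, 19_301_601,
    21_615_379, 17_259_823, 16_490_133, 18_688_203, 16_663_441,
    14_741_104,
]

if __name__ == "__main__":
    n50, l50 = n50_l50(CHROMOSOME_LENGTHS, total=427_001_245)
    print(f"Scaffold N50: {n50:,}")  # Scaffold N50: 19,821,211
    print(f"Scaffold L50: {l50}")    # Scaffold L50: 9
```

The nine longest chromosomes sum to 220,083,976 bp, just over half of the 427,001,245 bp total, which matches the reported Scaffold L50 of 9 and N50 of 19,821,211 (chromosome 3).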