U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



mEriEur2.1

Organism name:
Erinaceus europaeus (western European hedgehog)
BioSample:
SAMEA13207416
BioProject:
PRJEB61758
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2023/05/01
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_950295315.1 (latest)
RefSeq assembly accession:
GCF_950295315.1 (latest)
RefSeq assembly and GenBank assembly identical:
no (hide details)
  • Only in GenBank: chromosome MT
  • Data displayed for RefSeq version
WGS Project:
CATNUA01
Assembly method:
various
Genome coverage:
31x
Sequencing technology:
PacBio,Arima2
Linked assembly:
GCA_950295305.1 (alternate pseudohaplotype of diploid)

IDs: 16610581 [UID] 42735488 [GenBank] 43684938 [RefSeq]

See Genome Information for Erinaceus europaeus

There are 4 assemblies for this organism

See more

History (Show revision history)

Comment

The assembly mEriEur2.1 is based on 31x PacBio data and Arima2 Hi-C data generated by the Darwin Tree of Life Project
(https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation with Hifiasm, retained haplotig separation ... with purge_dups, and Hi-C based scaffolding with YaHS. The mitochondrial genome was assembled using MitoHiFi. Finally, the primary assembly was analysed and manually improved using TreeVal. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size. X chromosome identified based on alignment with Iberian mole (GCA_014898055.3;
https://www.science.org/doi/10.1126/science.aaz2582) and reduced HiC background signal (HiC from male sample, PacBio used for de novo assembly from female sample). The order and orientation of the contigs in the following regions are uncertain: SUPER_2, 164.5 Mb to 176.5 Mb; SUPER_9, 67.5 Mb to 72.5 Mb; SUPER_15, 88.5 Mb to 93.5 Mb; SUPER_17, 42 Mb to 53.5 Mb.  more

Global statistics

Total sequence length2,720,683,831
Total ungapped length2,719,931,831
Gaps between scaffolds0
Number of scaffolds1,174
Scaffold N50126,757,761
Scaffold L509
Number of contigs4,934
Contig N50999,919
Contig L50818
Total number of chromosomes and plasmids24
Number of component sequences (WGS or clone)1,174

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
Assembly Unit: Primary Assembly (GCF_950295314.1)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome 1OX485810.1=NC_080162.10
Chromosome 2OX485811.1=NC_080163.10
Chromosome 3OX485812.1=NC_080164.10
Chromosome 4OX485813.1=NC_080165.10
Chromosome 5OX485814.1=NC_080166.12
Chromosome 6OX485815.1=NC_080167.176
Chromosome 7OX485816.1=NC_080168.10
Chromosome 8OX485818.1=NC_080169.10
Chromosome 9OX485819.1=NC_080170.10
Chromosome 10OX485820.1=NC_080171.10
Chromosome 11OX485821.1=NC_080172.10
Chromosome 12OX485822.1=NC_080173.10
Chromosome 13OX485823.1=NC_080174.10
Chromosome 14OX485824.1=NC_080175.10
Chromosome 15OX485825.1=NC_080176.10
Chromosome 16OX485826.1=NC_080177.10
Chromosome 17OX485827.1=NC_080178.10
Chromosome 18OX485828.1=NC_080179.10
Chromosome 19OX485829.1=NC_080180.10
Chromosome 20OX485830.1=NC_080181.10
Chromosome 21OX485831.1=NC_080182.10
Chromosome 22OX485832.1=NC_080183.10
Chromosome 23OX485833.1=NC_080184.10
Chromosome XOX485817.1=NC_080185.10
unplacedn/an/an/a1,072

Assembly statistics

MoleculeSequence RoleTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
AllAssembled molecule2,720,683,8311,1742,719,931,831126,757,7613,7600
Chromosome 1Assembled molecule210,889,7231210,832,123210,889,7232880
Chromosome 2Assembled molecule204,397,8181204,335,218204,397,8183130
Chromosome 3Assembled molecule185,697,0971185,642,697185,697,0972720
Chromosome 4Assembled molecule154,223,5761154,181,776154,223,5762090
Chromosome 5AllAssembled moleculeUnlocalized scaffolds138,102,163137,178,045924,118312138,063,563137,139,445924,118137,178,045137,178,045709,8701931930000
Chromosome 6AllAssembled moleculeUnlocalized scaffolds136,983,86277,877,76859,106,09477176136,961,46277,855,56859,105,89477,877,76877,877,768977,7321121111000
Chromosome 7Assembled molecule134,025,5291133,988,529134,025,5291850
Chromosome 8Assembled molecule127,350,0281127,310,028127,350,0282000
Chromosome 9Assembled molecule126,757,7611126,722,361126,757,7611770
Chromosome 10Assembled molecule125,528,4141125,491,614125,528,4141840
Chromosome 11Assembled molecule120,429,2271120,395,227120,429,2271700
Chromosome 12Assembled molecule102,567,7011102,535,901102,567,7011590
Chromosome 13Assembled molecule101,763,0571101,734,057101,763,0571450
Chromosome 14Assembled molecule101,497,9591101,466,159101,497,9591590
Chromosome 15Assembled molecule95,287,971195,257,17195,287,9711540
Chromosome 16Assembled molecule86,865,962186,838,36286,865,9621380
Chromosome 17Assembled molecule82,367,746182,341,74682,367,7461300
Chromosome 18Assembled molecule78,598,273178,577,27378,598,2731050
Chromosome 19Assembled molecule67,790,477167,772,27767,790,477910
Chromosome 20Assembled molecule58,519,840158,502,84058,519,840850
Chromosome 21Assembled molecule45,976,610145,963,81045,976,610640
Chromosome 22Assembled molecule18,871,047118,865,04718,871,047300
Chromosome 23Assembled molecule15,527,650115,521,25015,527,650320
Chromosome XAssembled molecule128,539,6041128,506,604128,539,6041650
unplacedAssembled molecule72,124,7361,07272,124,73682,65700