U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



ASM48841v1

Organism name:
Escherichia coli 907710 (E. coli)
Taxonomy check:
OK
Infraspecific name:
Strain: 907710
BioSample:
SAMN02436877
BioProject:
PRJNA183808
Submitter:
Washington University
Date:
2013/10/29
Assembly type:
na
Assembly level:
Scaffold
Genome representation:
full
GenBank assembly accession:
GCA_000488415.1 (latest)
RefSeq assembly accession:
GCF_000488415.1 (latest)
RefSeq assembly and GenBank assembly identical:
yes
WGS Project:
AXTH01
Assembly method:
Velvet v. 1.1.06
Genome coverage:
83x
Sequencing technology:
Illumina

IDs: 75801 [UID] 841248 [GenBank] 1331898 [RefSeq]

See Genome Information for Escherichia coli

There are 270135 assemblies for this organism

See more

History (Show revision history)

Comment

Bacteria provided by David Creely and William Dunne (BioMerieux, Inc., 595 Anglum Road, Hazelwood, MO 63042).

Coding sequences were predicted using GeneMark and Glimmer3. Intergenic regions not spanned by GeneMark and Glimmer3 were blasted against NCBI's non-redundant (NR) database and ... predictions generated based on protein alignments. tRNA genes were determined using tRNAscan-SE and non-coding RNA genes by RNAmmer and Rfam. The final gene set is processed through several programs such as Kegg, psortB and Interproscan to determine possible function. Gene product names are determined by BER. Gene names are generated at the contig level and may not necessarily reflect any known order or orientation between contigs.

This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. This work was funded by the National Human Genome Research Institute (NHGRI)/National Institutes of Health (NIH) grant 5U54HG00496804 for characterization of this genome.  more

Global statistics

Total sequence length4,780,859
Total ungapped length4,767,159
Gaps between scaffolds0
Number of scaffolds125
Scaffold N50193,518
Scaffold L508
Number of contigs262
Contig N5045,535
Contig L5032
Total number of chromosomes and plasmids0
Number of component sequences (WGS or clone)262

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
unplaced4,780,8591254,767,159193,5181370

Assembly QA

Taxonomy Check Data

Declared organism

Organism nameSpecies name
Escherichia coli 907710Escherichia coli

Best-matching type-strain assembly for declared species

AssemblyOrganism nameType category
GCA_000010385.1Escherichia coli SE11claderef

Best-matching type-strain assembly

AssemblySpecies nameType category
GCA_000010385.1Escherichia colicladeref

Average Nucleotide Identity (ANI) data

ANIQuery coverageSubject coverage
Declared type99.2692.3485.39
Best-match type99.2692.3485.39

ANI result

Taxonomy check statusBest match statusComment
OKspecies-matchna