ASM48845v1

Organism name: Escherichia coli 907889 (E. coli)
Taxonomy check: OK
Infraspecific name: Strain: 907889
BioSample: SAMN02436742
BioProject: PRJNA183812
Submitter: Washington University
Date: 2013/10/29
Assembly type: na
Assembly level: Scaffold
Genome representation: full
GenBank assembly accession: GCA_000488455.1 (latest)
RefSeq assembly accession: GCF_000488455.1 (latest)
RefSeq assembly and GenBank assembly identical: yes
WGS Project: AXTJ01
Assembly method: Velvet v. 1.1.06
Genome coverage: 74x
Sequencing technology: Illumina

IDs: 75821 [UID] 841288 [GenBank] 861008 [RefSeq]

There are 270135 assemblies for this organism

Comment

Bacteria provided by David Creely and William Dunne (BioMerieux, Inc., 595 Anglum Road, Hazelwood, MO 63042).

Coding sequences were predicted using GeneMark and Glimmer3. Intergenic regions not spanned by GeneMark and Glimmer3 predictions were searched with BLAST against NCBI's non-redundant (NR) database and ... predictions generated based on protein alignments. tRNA genes were identified using tRNAscan-SE, and non-coding RNA genes using RNAmmer and Rfam. The final gene set was processed through several programs, such as KEGG, PSORTb and InterProScan, to determine possible function. Gene product names were determined by BER. Gene names are generated at the contig level and may not reflect any known order or orientation between contigs.

This is a reference genome for the Human Microbiome Project. This project is co-owned with the Human Microbiome Project DACC. This work was funded by the National Human Genome Research Institute (NHGRI)/National Institutes of Health (NIH) grant 5U54HG00496804 for characterization of this genome.

Global statistics

Total sequence length: 5,321,900
Total ungapped length: 5,312,700
Gaps between scaffolds: 0
Number of scaffolds: 273
Scaffold N50: 101,494
Scaffold L50: 17
Number of contigs: 365
Contig N50: 70,422
Contig L50: 24
Total number of chromosomes and plasmids: 0
Number of component sequences (WGS or clone): 365
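
For readers unfamiliar with the N50 and L50 figures above, the following is a minimal sketch of how such values are commonly computed from a list of sequence lengths. The function name `n50_l50` and the toy lengths are hypothetical and chosen for illustration; they are not the scaffold lengths of this assembly, which are only available in the full sequence report.

```python
def n50_l50(lengths):
    """Return (N50, L50) for a list of sequence lengths.

    N50: the length of the shortest sequence in the smallest set of
         longest sequences that together cover at least half the total.
    L50: the number of sequences in that set.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half_total:
            return length, count
    raise ValueError("lengths must be non-empty")


# Hypothetical toy data, not taken from this assembly.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
toy_n50, toy_l50 = n50_l50(toy_lengths)
print(toy_n50, toy_l50)
```

Read this way, a scaffold N50 of 101,494 with a scaffold L50 of 17 means that the 17 longest scaffolds, each at least 101,494 bases long, account for at least half of the total sequence length.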

Global assembly definition

Assembly Unit Name: Primary Assembly
The primary assembly unit does not have any assembled chromosomes or linkage groups.
Please download the full sequence report for information on the scaffolds.

Assembly statistics

Molecule: unplaced
Total length: 5,321,900
Scaffold count: 273
Ungapped length: 5,312,700
Scaffold N50: 101,494
Spanned gaps: 92
Unspanned gaps: 0
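
The figures reported for this assembly can be cross-checked with simple arithmetic. The sketch below assumes that every spanned gap joins two contigs inside one scaffold, so the number of spanned gaps should equal the contig count (365, from the global statistics) minus the scaffold count. The variable names are illustrative only.

```python
# Values as reported on this page.
total_length = 5_321_900
ungapped_length = 5_312_700
scaffolds = 273
contigs = 365
spanned_gaps = 92

# Bases that fall inside gaps.
gap_bases = total_length - ungapped_length

# Gaps implied by joining contigs into scaffolds.
implied_gaps = contigs - scaffolds

# Mean gap length across the spanned gaps.
mean_gap_length = gap_bases / spanned_gaps

print(gap_bases, implied_gaps, mean_gap_length)
```

The implied gap count matches the reported 92 spanned gaps, and the gapped bases total 9,200, or 100 bases per gap on average.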

Assembly QA

Taxonomy Check Data

Declared organism

Organism name: Escherichia coli 907889
Species name: Escherichia coli

Best-matching type-strain assembly for declared species

Assembly: GCA_000350825.1
Organism name: Escherichia coli KTE26
Type category: claderef

Best-matching type-strain assembly

Assembly: GCA_000350825.1
Species name: Escherichia coli
Type category: claderef

Average Nucleotide Identity (ANI) data

                 ANI    Query coverage  Subject coverage
Declared type    98.33  87.17           87.92
Best-match type  98.33  87.17           87.92

ANI result

Taxonomy check status: OK
Best match status: species-match
Comment: na