IDs: 836731 [UID] 3564188 [GenBank] 3615188 [RefSeq]
The WUSC is a large strain collection isolated from clinical samples. Each sample is associated with metadata, including source, isolation site, 16S rRNA, metabolic, and other phenotypic information. The goal is to sample an adequate number of important, yet ... minor species, further adding to the catalogue of sequenced bacterial genomes and improving the diversity of the genomes available to the public. WGS will be performed on approximately 550 isolates. Samples were selected based on RDP analysis at the genus level. This project is co-owned with the Human Microbiome Project DACC. Coding sequences were predicted using GeneMark v3.3 and Glimmer3 v3.02. Intergenic regions not spanned by GeneMark and Glimmer3 predictions were searched with BLAST against NCBI's non-redundant (NR) database, and predictions were generated from the resulting protein alignments. tRNA genes were identified using tRNAscan-SE 1.23, and non-coding RNA genes using RNAmmer-1.2 and Rfam v8.1. The final gene set was processed through several programs, including KEGG (Release 56), PSORTb (Version 3.0.3), and InterProScan (Version 4.7), to determine possible function. Gene product names were determined by BER (Version 2.5). Gene names are generated at the contig level and may not reflect any known order or orientation between contigs. Annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (released 2013). Information about the Pipeline can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
##Genome-Annotation-Data-START##
Annotation Provider :: NCBI
Annotation Date :: 04/22/2016 15:35:57
Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline
Annotation Method :: Best-placed reference protein set; GeneMarkS+
Annotation Software revision :: 3.1
Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region
Genes (total) :: 5,254
CDS (total) :: 5,176
Genes (coding) :: 5,104
CDS (coding) :: 5,104
Genes (RNA) :: 78
rRNAs :: 2, 2, 2 (5S, 16S, 23S)
complete rRNAs :: 2 (5S)
partial rRNAs :: 2, 2 (16S, 23S)
tRNAs :: 62
ncRNAs :: 10
Pseudo Genes (total) :: 72
Pseudo Genes (ambiguous residues) :: 0 of 72
Pseudo Genes (frameshifted) :: 17 of 72
Pseudo Genes (incomplete) :: 51 of 72
Pseudo Genes (internal stop) :: 11 of 72
Pseudo Genes (multiple problems) :: 7 of 72
##Genome-Annotation-Data-END##
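The Genome-Annotation-Data block is a set of `key::value` pairs, so its counts can be read programmatically and cross-checked. The following is a minimal Python sketch, not part of the record: it assumes the block has been laid out with one pair per line, and the helper names `parse_annotation_block` and `leading_int` are hypothetical. The two arithmetic checks are relationships that the figures in this record happen to satisfy. They are not a documented guarantee of the annotation pipeline.

```python
import re

START = "##Genome-Annotation-Data-START##"
END = "##Genome-Annotation-Data-END##"


def parse_annotation_block(text):
    """Return a dict of the key::value pairs found between the START and END markers."""
    body = text.split(START, 1)[1].split(END, 1)[0]
    fields = {}
    for line in body.splitlines():
        if "::" not in line:
            continue  # skip blank lines
        key, value = line.split("::", 1)
        fields[key.strip()] = value.strip()
    return fields


def leading_int(value):
    """Parse the first integer in a value such as '5,254' or '17 of 72'."""
    match = re.match(r"[\d,]+", value)
    return int(match.group().replace(",", ""))


# A subset of the fields from the record, one pair per line (assumed layout).
sample = """##Genome-Annotation-Data-START##
Genes (total) :: 5,254
Genes (coding) :: 5,104
Genes (RNA) :: 78
rRNAs :: 2, 2, 2 (5S, 16S, 23S)
tRNAs :: 62
ncRNAs :: 10
Pseudo Genes (total) :: 72
##Genome-Annotation-Data-END##"""

fields = parse_annotation_block(sample)

total = leading_int(fields["Genes (total)"])
coding = leading_int(fields["Genes (coding)"])
rna = leading_int(fields["Genes (RNA)"])
pseudo = leading_int(fields["Pseudo Genes (total)"])
trna = leading_int(fields["tRNAs"])
ncrna = leading_int(fields["ncRNAs"])

# The rRNA value lists one count per subunit before the parenthesis: "2, 2, 2 (5S, 16S, 23S)".
rrna = sum(int(n) for n in fields["rRNAs"].split("(")[0].split(","))

# Consistency checks that hold for the numbers in this record.
genes_add_up = total == coding + rna + pseudo
rna_adds_up = rna == rrna + trna + ncrna
print(genes_add_up, rna_adds_up)
```

Splitting on the first `::` only keeps values that themselves contain punctuation, such as the annotation date or the semicolon-separated feature list, intact.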