U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

SRX14498589: GSM5962237: sample1_sub11_unfilteredCCS; Homo sapiens; OTHER
1 PACBIO_SMRT (Sequel) run: 9,251 spots, 12.4M bases, 3.1Mb downloads

External Id: GSM5962237_r1
Submitted by: Dementia Research Institute, Cardiff University
Study: Repeat Detector: accurate, efficient, and flexible sizing of expanded CAG/CTG repeats from targeted DNA sequencing
show Abstracthide Abstract
Targeted DNA sequencing approaches will improve how the size of short tandem repeats is measured for diagnostic tests and pre-clinical studies. The expansion of these sequences causes dozens of disorders, with longer tracts generally leading to a more severe disease. In addition, interruptions are sometimes present within repeats and can alter disease manifestation. Despite advances in methodologies, determining repeat size and identifying interruptions in targeted sequencing datasets remains a major challenge. This is because standard alignment tools are ill-suited for the repetitive nature of these sequences. To address this, we have developed Repeat Detector (RD), a deterministic profile weighting algorithm for counting repeats in targeted sequencing data. We tested RD using blood-derived DNA samples from Huntington's disease (HD) and Fuchs endothelial corneal dystrophy patients sequenced using either Illumina MiSeq or Pacific Biosciences single-molecule, real-time sequencing platforms. RD was highly accurate in determining repeat sizes of 609 HD blood-derived samples and did not require prior knowledge of the flanking sequences or their polymorphisms within the patient population. We demonstrate that RD can be used to identify individuals with repeat interruptions and may provide a measure of repeat instability within an individual. RD is therefore highly versatile and may find applications in the diagnosis of expanded repeat disorders and the development of novel therapies. Overall design: 22 DNA samples from Huntington's disease individuals obtained from Coriell.org. 4 Samples from HEK-derived GFP(CAG)x lines with different repeat sizes.
Sample: sample1_sub11_unfilteredCCS
SAMN26809433 • SRS12296846 • All experiments • All runs
Organism: Homo sapiens
Library:
Name: GSM5962237
Instrument: Sequel
Strategy: OTHER
Source: GENOMIC
Selection: other
Layout: SINGLE
Construction protocol: The HD LBCs and GFP(CAG)x datasets were generated by first isolating DNA using the Macherey-Nagel Nucleospin™ Tissue Mini kit. PCR products were generated from samples using barcoded primers as listed in Table S1 and Thermo™ Phusion II High Fidelity polymerase. To obtain sufficient quantities of PCR product to proceed with library preparation, multiple identical PCRs were pooled and purified using Macherey-Nagel™ Gel and PCR Clean-up kit columns. The library was generated using the SMRTbell Template Prep Kit (1.0-SPv3) according to manufacturer's instructions. Samples to be sequenced on the same flowcell were combined in equimolar pools. We loaded between 10 and 12 pM. SMRT sequencing was done using a Sequel at Cardiff University School of Medicine. CCSs were generated from the resulting sequences and processed using SMRT Link. SMRTbell Template Prep Kit (1.0-SPv3)
Runs: 1 run, 9,251 spots, 12.4M bases, 3.1Mb
Run# of Spots# of BasesSizePublished
SRR183640379,25112.4M3.1Mb2022-03-22

ID:
20695564

Supplemental Content

Search details

See more...

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...