Our view of genetic polymorphism is shaped by methods that provide a limited and reference-biased picture.
More...Our view of genetic polymorphism is shaped by methods that provide a limited and reference-biased picture. Although long-read sequencing technologies hold promise in delivering nearly comprehensive genome sequences for population samples, the challenge lies in characterizing and interpreting non-SNP variations even with flawless sequence data. In this study, we analyze 27 genomes of Arabidopsis thaliana in an attempt to address these issues, and illustrate how to visualize polymorphism in an unbiased manner. This project collects the genomic PacBio CLR, PCR-free short reads, and the corresponding de novo genome assemblies of 27 Arabidopsis thaliana accessions chosen to cover the global genetic diversity of the species.
NOTE: For accession KBS-Mac-74 (ecotype ID: 1741), one SMRTcell of PacBio CLR and PCR-free short-read data have been previously released under Run Accession IDs ERR2173371 and ERR2173372, respectively. However, the assembly presented in this study is new and improved.
Less...Accession | PRJEB73474 |
Scope | Monoisolate |
Submission | Registration date: 8-May-2024 max planck institute for biology tuebingen |
Project Data:
No public data is linked to this project. Any recently released data that cites this project will be linked to it within a few days.