Skip to Content
Merck

Challenges and Approaches to Genotyping Repetitive DNA.

G3 (Bethesda, Md.) (2019-11-24)
Elizabeth A Morton, Ashley N Hall, Elizabeth Kwan, Calvin Mok, Konstantin Queitsch, Vivek Nandakumar, John Stamatoyannopoulos, Bonita J Brewer, Robert Waterston, Christine Queitsch
ABSTRACT

Individuals within a species can exhibit vast variation in copy number of repetitive DNA elements. This variation may contribute to complex traits such as lifespan and disease, yet it is only infrequently considered in genotype-phenotype associations. Although the possible importance of copy number variation is widely recognized, accurate copy number quantification remains challenging. Here, we assess the technical reproducibility of several major methods for copy number estimation as they apply to the large repetitive ribosomal DNA array (rDNA). rDNA encodes the ribosomal RNAs and exists as a tandem gene array in all eukaryotes. Repeat units of rDNA are kilobases in size, often with several hundred units comprising the array, making rDNA particularly intractable to common quantification techniques. We evaluate pulsed-field gel electrophoresis, droplet digital PCR, and Nextera-based whole genome sequencing as approaches to copy number estimation, comparing techniques across model organisms and spanning wide ranges of copy numbers. Nextera-based whole genome sequencing, though commonly used in recent literature, produced high error. We explore possible causes for this error and provide recommendations for best practices in rDNA copy number estimation. We present a resource of high-confidence rDNA copy number estimates for a set of S. cerevisiae and C. elegans strains for future use. We furthermore explore the possibility for FISH-based copy number estimation, an alternative that could potentially characterize copy number on a cellular level.

MATERIALS
Product Number
Brand
Product Description

Sigma-Aldrich
Protease from Streptomyces griseus, BioReagent, DNase, RNase, and nickase, none detected (No RNase.)
Sigma-Aldrich
Poly-L-lysine hydrobromide, mol wt 150,000-300,000
Sigma-Aldrich
Proteinase K from Tritirachium album, buffered aqueous glycerol solution, for molecular biology, ≥800 units/mL
Sigma-Aldrich
Chitinase from Streptomyces griseus, lyophilized powder (essentially salt free), ≥200 units/g solid