Abstract
Transcriptome-wide targeted genotyping is highly attractive for evolutionary and ecological studies but, until recently, accomplishing this goal presented a major technical barrier for the study of non-model organisms. Our group has recently developed a high-throughput targeted genotyping approach (called HD-Marker) based on the high specificity and accuracy of oligo extension-ligation assays that facilitates the design of assays tailored to meet specific genotyping needs. HD-Marker allows for targeted genotyping of over 10 000 genes in a single tube, with strikingly high capture rate (98%-99%) and genotyping accuracy (97%-99%). With the remarkable advantages of cost-effectiveness and flexibility, we envision that HD-Marker has broad application potential in evolutionary and ecological studies.
Genetic variation profiling is at the heart of modern biology and enables the study of important evolutionary and ecological questions by characterizing mutations, genetic drift, natural selection, and other processes of evolution. 1 The central goal of evolutionary genetics is to identify genetic changes and molecular mechanisms underlying phenotypic diversity and to understand the evolutionary pressures under which phenotypic diversity evolves.2,3 The advent of next-generation sequencing (NGS) technologies has made it possible to cost-efficiently profile a large amount of genetic variations in less-studied non-model organisms and has boosted the rapid development of various high-throughput genotyping-by-sequencing (GBS) methods.4,5 Unfortunately, most of these GBS methods use restriction enzymes for reduction of genomic complexity and therefore are mostly suited for random/novel marker discovery and genotyping, but not for targeting specific genomic regions/loci that are of particular interest to researchers.
The use of transcriptome-wide targeted genotyping is broadly desirable in evolutionary and ecological studies. Gene-related markers are derived from the transcribed genomic regions and such ‘functional’ markers are advantageous for pinning down causal genes underlying phenotypic/physiological traits or adaptive evolution.6–8 Although transcriptome sequencing enables the efficient discovery of gene-related markers, cost-effective, large-scale targeted genotyping of these markers in non-model organisms is still a great challenge. For example, array-based genotyping is a popular and viable option for human and model organisms, but building custom arrays remains highly expensive to be applied to non-model organisms, and fixed arrays also lack the flexibility for rearrangement of targeted loci. 9 Sequence capture-based approaches are more suited for targeting genomic regions rather than specific loci of interest and therefore not practical for the genotyping of specific loci. 10
To address this issue, our group has recently developed a sequencing-based GoldenGate approach (called HD-Marker), which allows high-throughput targeted genotyping of user-defined markers with high flexibility in choice of multiplex levels and marker types. 11 Different from the original GoldenGate technology that is built on the BeadArray platform with low-to-moderate marker multiplexity, 12 our HD-Marker approach is specifically designed for high-throughput sequencing platforms. For HD-Marker, genotyping a locus requires the design of two locus-specific probes (LSPs), each with a 3′ locus-specific region that targets the flanking sequence of the locus of interest and a 5′ portion with a universal polymerase chain reaction (PCR) primer-binding sequence (Figure 1). For a small probe panel, probes can be column-synthesized, whereas array-synthesized probes allow remarkably high multiplexity (over 12 000 loci) at a very low cost of ∼US$0.001 per base. The locus under study (eg single-nucleotide polymorphisms [SNPs], microsatellites, or insertions/deletions [indels]) is placed in between the two LSPs, using DNA polymerase to fill in the gap and ligating the extended LSP to the downstream LSP, creating an amplifiable molecule. The preparation of an HD-Marker library starts with the attachment of genomic DNA to magnetic beads followed by the hybridization of two LSPs to the immobilized DNA via highly specific extension, ligation, and amplification. Libraries prepared from different samples can be pooled and sequenced on the NGS platform of choice. The whole procedure can be finished within 3 days, with all steps accomplished in a single tube.

Schematic overview of the HD-Marker approach.
We tested the applicability of HD-Marker using two non-model scallop species (
Our HD-Marker approach has several major advantages, making it appealing for evolutionary and ecological studies. First, HD-Marker is built on the high-throughput NGS platform, which is much more cost-effective (as low as US$0.002 per genotype) than array-based platforms. Second, the use of NGS platform enhances the capacity and flexibility of HD-Marker, by removing the technical limitations of array formats. Third, a high-quality reference genome is not a prerequisite for applying HD-Marker and, for many non-model organisms, the widely accessible transcriptomic datasets and associated SNPs are already sufficient for the design of probes in HD-Marker. Fourth, HD-Marker allows the use of the same probe panel for cross-platform application. Fifth, library preparation of HD-Marker can be performed practically in any laboratory, as no costly instruments are required. Finally, the flexibility of HD-Marker allows the use of a variety of markers, such as SNPs, microsatellites, or indels, and of sequencing platforms, and is therefore easily adaptable to specific needs.
Transcriptome-wide targeted genotyping has long been appealing for evolutionary and ecological studies; however, for researchers studying non-model organisms, high cost and technical complexity make it unappealing. The HD-Marker approach retains the benefits of the GoldenGate assay but eliminates its array-based limitations by adopting the NGS platform. HD-Marker enables researchers to create tailored assays to meet their specific genotyping needs, for example, focusing on targeted regions, candidate genes, or pathways. With the remarkable advantages of cost-effectiveness (as low as US$0.002 per genotype) and flexible choice of multiplex levels and marker types, we envision that HD-Marker would become a highly attractive tool with broad application potential in evolutionary and ecological studies.
Footnotes
Funding:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was supported by the National Key Research and Development Program of China (2018YFC0310802), National Natural Science Foundation of China (U1706203), Major Basic Research Projects of Shandong Natural Science Foundation (ZR2018ZA0748), and Blue Life Breakthrough Program of LMBB of Qingdao National Laboratory for Marine Science and Technology (MS2018NO01).
Declaration of conflicting interests:
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The authors had filed a patent application based on the described technique.
Author Contributions
All authors contributed to the discussion and manuscript writing. XZ and JW contributed equally to this work.
