Published online:
14 November 2017

Table 1. Comparison of important characteristics of the most commonly used molecular markers.

Hybridization-based markers (RFLP)

RFLP was the first molecular marker technique and is the only marker system based on hybridization. Individuals of the same species exhibit polymorphism as a result of insertions/deletions (known as InDels), point mutations, translocations, duplications and inversions. Isolation of pure DNA is the first step in the RFLP methodology. This DNA is digested with restriction enzymes, which are isolated from bacteria and cut DNA at particular sequences (known as recognition sites). Digestion yields a large number of fragments of different lengths. These fragments are separated by agarose or polyacrylamide gel electrophoresis (PAGE), which produces a series of bands, each band representing fragments of a particular length. Base-pair deletions, mutations, inversions, translocations and transpositions are the main causes of the variation seen in the RFLP pattern. These variations lead to the gain or loss of recognition sites, resulting in fragments of different lengths and hence in polymorphism. A restriction enzyme will not cut at a recognition site in which even a single base pair has changed. If such a point mutation occurs in one chromosome but not in its homologue, the individual is heterozygous for the marker, as the bands of both alleles are present [12 Madhumati B. Potential and application of molecular markers techniques for plant genome analysis. Int J Pure App Biosci. 2014;2(1):169–88.].
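The effect of a lost recognition site on the banding pattern can be illustrated with a small sketch. The sequences below are invented for illustration, EcoRI's GAATTC site is used only as an example, and the cut is placed at the start of the site for simplicity rather than at the enzyme's true cleavage position.

```python
def digest(seq, site):
    """Return fragment lengths after cutting a linear sequence at every site.

    Simplification: the cut is placed at the first base of the recognition
    site, not at the enzyme's real cleavage position.
    """
    cuts = []
    i = seq.find(site)
    while i != -1:
        cuts.append(i)
        i = seq.find(site, i + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]

SITE = "GAATTC"  # EcoRI recognition sequence, used here only as an example

# Hypothetical alleles: allele_b carries a point mutation in the first site.
allele_a = "ATGC" * 5 + SITE + "TTAGC" * 4 + SITE + "CCGTA" * 3
allele_b = allele_a.replace(SITE, "GAATTT", 1)

frags_a = digest(allele_a, SITE)   # [20, 26, 21]
frags_b = digest(allele_b, SITE)   # [46, 21]

# A heterozygote carries both alleles, so the bands of both are present.
heterozygote_bands = sorted(set(frags_a) | set(frags_b))  # [20, 21, 26, 46]
```

The mutated allele loses one cut, so two short fragments merge into one longer fragment; this length difference is what appears as a polymorphic band.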

PCR-based markers

The PCR technique was developed by Kary Mullis in 1983 as a technique that can amplify a small quantity of DNA without the use of any living organism [13 Mullis K, Faloona F, Scharf S, et al. Specific enzymatic amplification of DNA in vitro: the polymerase chain reaction. Cold Spring Harb Symp Quant Biol. 1986;51:263–273.]. Denaturation, annealing and extension are the three main steps of each PCR cycle. For more information about PCR and its protocol, see the article of Joshi and Deshpande [14 Joshi M, Deshpande JD. Polymerase chain reaction: methods, principles and application. Int J Biomed Res. 2011;2(1):81–97.].
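As a rough illustration of why a small quantity of DNA is sufficient, the following sketch uses the idealized model in which each cycle of denaturation, annealing and extension multiplies the number of target copies by (1 + efficiency). The function name and the efficiency parameter are assumptions made for this example and do not come from the cited protocols.

```python
def pcr_copies(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after a given number of PCR cycles.

    efficiency = 1.0 means perfect doubling in every cycle; real reactions
    fall below this and eventually plateau, which this model ignores.
    """
    return initial_copies * (1.0 + efficiency) ** cycles

perfect = pcr_copies(10, 30)          # 10 templates, 30 cycles, perfect doubling
realistic = pcr_copies(10, 30, 0.9)   # same reaction at an assumed 90% efficiency
```

Under perfect doubling, ten template molecules give about 10^10 copies after 30 cycles, and even a modest drop in per-cycle efficiency reduces the yield several-fold.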

PCR primers

A primer is a short stretch of DNA or RNA from which DNA synthesis starts. The efficiency of a primer plays a vital role in the sensitivity and efficiency of PCR [15 He Q, Marjamäki M, Soini H, et al. Primers are decisive for sensitivity of PCR. Biotechniques. 1994;17(1):82–84.]. Primer efficiency depends on the following main factors: (1) primer–template duplex association and dissociation at the annealing and extension temperatures; (2) stability of the duplex in the presence of mismatched nucleotides; (3) efficiency with which the polymerase recognizes and extends a mismatched duplex. Primer length, GC%, melting and annealing temperature, 3' end specificity and 5' end stability are the features that most influence the efficiency of a primer. Primer design is one of the most crucial parameters for a successful PCR: if everything else is balanced but the primer is poor, the reaction will fail or give false products. Primer length is also critical, and primers of 18–30 nucleotides are normally considered best. Melting temperatures (Tm) in the range of 52–58 °C provide good results. The GC content is a major factor affecting the efficiency of a primer; 45%–60% is the optimum GC% for a good primer [16 Dieffenbach CW, Lowe TM, Dveksler GS. General concepts for PCR primer design. PCR Methods Appl. 1993;3(3):30–37.].
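The length, GC% and Tm ranges given above can be checked with a few lines of code. This is a minimal sketch: the Tm estimate uses the Wallace rule (2 °C per A/T and 4 °C per G/C), which is an assumption of this example and only a rough approximation for short oligonucleotides, and the primer sequences are hypothetical.

```python
def gc_percent(primer):
    """Percentage of G and C bases in the primer."""
    primer = primer.upper()
    return 100.0 * (primer.count("G") + primer.count("C")) / len(primer)

def wallace_tm(primer):
    """Rough melting temperature in °C by the Wallace rule (an approximation)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc

def check_primer(primer):
    """Compare a primer with the ranges quoted in the text."""
    return {
        "length_ok": 18 <= len(primer) <= 30,        # 18–30 nucleotides
        "gc_ok": 45.0 <= gc_percent(primer) <= 60.0,  # 45%–60% GC
        "tm_ok": 52 <= wallace_tm(primer) <= 58,      # Tm of 52–58 °C
    }

good = check_primer("ATGCGATCGATCGTACGA")  # hypothetical 18-mer, 50% GC, Tm 54
poor = check_primer("ATATATATAT")          # too short, no GC, low Tm
```

Dedicated primer-design tools use nearest-neighbour thermodynamics and also test for hairpins and primer dimers, none of which this sketch attempts.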

Randomly amplified polymorphic DNA (RAPD)

This technique was developed independently by Williams et al. [17 Williams JG, Kubelik AR, Livak KJ, et al. DNA polymorphisms amplified by arbitrary primers are useful as genetic markers. Nucleic Acids Res. 1990;18(22):6531–6535.] and Welsh and McClelland [18 Welsh J, McClelland M. Fingerprinting genomes using PCR with arbitrary primers. Nucleic Acids Res. 1990;18(24):7213–7218.]. Genomic DNA is amplified by PCR using a single, short (10-nucleotide) random primer. Amplification takes place when two priming sites lie on opposite strands, in opposite orientation, and close enough to each other to be amplified. The amplified fragments depend entirely on the length and size of both the target genome and the primer [5 Jiang GL. Molecular markers and marker-assisted breeding in plants. In: Andersen SB, editor. Plant breeding from laboratories to fields. Rijeka: InTech; 2013. p. 45–83.]. The selected primer should have a GC content of at least 40%, as a primer with less than 40% GC will probably not withstand the extension temperature (72 °C) at which DNA polymerase elongates the DNA [17]. For visualization, the PCR product is separated on an agarose gel stained with ethidium bromide [19 Jones CJ, Edwards KJ, Castaglione S, et al. Reproducibility testing of RAPD, AFLP and SSR markers in plants by a network of European laboratories. Mol Breed. 1997;3(5):381–390.]. Polymorphism at or between primer binding sites is detected in the electrophoresis as the presence or absence of specific bands [5]. The quantity and quality of DNA, the PCR buffer, the magnesium chloride concentration, the annealing temperature and the Taq DNA polymerase are some important factors affecting the reproducibility of randomly amplified polymorphic DNA (RAPD) markers [20 Wolff K, Schoen ED, Peters-Van Rijn J. Optimizing the generation of random amplified polymorphic DNAs in chrysanthemum. Theor Appl Genet. 1993;86(8):1033–1037.].
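The single-primer logic of RAPD can be sketched as follows. The primer, the template and the 3000-bp size limit are hypothetical, and only exact matches are considered, whereas real RAPD reactions tolerate some mismatches at low annealing stringency.

```python
def revcomp(seq):
    """Reverse complement of a DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_all(seq, sub):
    """Start positions of every (possibly overlapping) occurrence of sub."""
    hits, i = [], seq.find(sub)
    while i != -1:
        hits.append(i)
        i = seq.find(sub, i + 1)
    return hits

def rapd_amplicons(template, primer, max_len=3000):
    """Sizes of products from one arbitrary primer on the given strand.

    A product forms when the primer sequence occurs on this strand and its
    reverse complement occurs downstream, i.e. the two priming sites face
    each other within an amplifiable distance (max_len is an assumed limit).
    """
    sizes = []
    for i in find_all(template, primer):
        for j in find_all(template, revcomp(primer)):
            size = j + len(primer) - i
            if j > i and size <= max_len:
                sizes.append(size)
    return sorted(sizes)

primer = "GGTGCGGGAA"  # hypothetical 10-mer
template = "AT" * 10 + primer + "ACGT" * 25 + revcomp(primer) + "TA" * 10
mutant = template.replace(revcomp(primer), "TTCCCGCACA")  # one base changed

bands_wild = rapd_amplicons(template, primer)   # [120]
bands_mutant = rapd_amplicons(mutant, primer)   # []
```

A single base change in one priming site removes the band altogether, which is why RAPD polymorphisms are scored as presence or absence.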

AFLP

The limitations of the RAPD and RFLP techniques were overcome through the development of AFLP markers [21 Vos P, Hogers R, Bleeker M, et al. AFLP: a new technique for DNA fingerprinting. Nucleic Acids Res. 1995;23(21):4407–4414.]. AFLP markers combine RFLP and PCR technology: the DNA is first digested and PCR is then performed [22 Lynch M, Walsh B. Genetics and analysis of quantitative traits. Sunderland (MA): Sinauer; 1998.]. AFLP markers are cost effective and require no prior sequence information. Both good-quality and partly degraded DNA can be used; however, the DNA should be free of inhibitors of restriction enzymes and of PCR. For more information, see previous studies [23 Ridout CJ, Donini P. Use of AFLP in cereals research. Trends Plant Sci. 1999;4(2):76–79.; 24 Blears MJ, De Grandis SA, Lee H, et al. Amplified fragment length polymorphism (AFLP): a review of the procedure and its applications. J Ind Microbiol Biotechnol. 1998;21(3):99–114.]. In AFLP, two restriction enzymes (a frequent cutter and a rare cutter) are used to cut the DNA. An oligonucleotide adapter is then ligated to each end of the resulting fragments. Oligonucleotides are short nucleic acid fragments; those used here serve for ligation before PCR [12]. One adapter is specific for the end produced by the rare cutter (6-bp recognition site) and the other for the end produced by the frequent cutter (4-bp recognition site). As a result, only those fragments that have been cut by both enzymes are amplified. The known sequences of the adapters are used to design the primers. Adapters are short, enzyme-specific DNA sequences generally used for fishing out an unknown DNA sequence [21]. After PCR, the products are visualized on either agarose or polyacrylamide gels, stained with AgNO3 or detected by autoradiography [12].
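The selection of fragments that carry one rare-cutter end and one frequent-cutter end can be illustrated with a simplified double digest. The enzyme sites (GAATTC and TTAA, the sites of EcoRI and MseI) are used as examples, the sequence is invented, and cuts are placed at the start of each site; adapter ligation and selective nucleotides are not modelled.

```python
def cut_sites(seq, site, label):
    """List of (position, enzyme label) for every occurrence of a site."""
    out, i = [], seq.find(site)
    while i != -1:
        out.append((i, label))
        i = seq.find(site, i + 1)
    return out

def aflp_fragments(seq, rare="GAATTC", frequent="TTAA"):
    """Lengths of fragments bounded by one rare and one frequent cut.

    Simplification: cuts are placed at the first base of each site.
    """
    cuts = sorted(cut_sites(seq, rare, "rare") + cut_sites(seq, frequent, "frequent"))
    selected = []
    for (p1, e1), (p2, e2) in zip(cuts, cuts[1:]):
        if {e1, e2} == {"rare", "frequent"}:   # one end from each enzyme
            selected.append(p2 - p1)
    return selected

# Hypothetical sequence with cuts at 12 (rare), 38, 58 (frequent) and 70 (rare).
seq = ("CGCG" * 3 + "GAATTC" + "CGCG" * 5 + "TTAA" + "GCGC" * 4
       + "TTAA" + "CGCG" * 2 + "GAATTC" + "GCGC")

amplifiable = aflp_fragments(seq)   # [26, 12]; the 20-bp TTAA–TTAA fragment is skipped
```

Restricting amplification to this subset of fragments is what keeps the number of bands in an AFLP fingerprint manageable.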

SSRs or microsatellites

Microsatellites [25 Litt M, Luty JA. A hypervariable microsatellite revealed by in vitro amplification of a dinucleotide repeat within the cardiac muscle actin gene. Am J Hum Genet. 1989;44(3):397–401.] are also called SSRs [26 Tautz D. Hypervariability of simple sequences as a general source for polymorphic DNA markers. Nucleic Acids Res. 1989;17(16):6463–6471.], short tandem repeats or simple sequence length polymorphisms [27 Schlötterer C, Amos B, Tautz D. Conservation of polymorphic simple sequence loci in cetacean species. Nature. 1991;354(6348):63–65.]. SSRs are tandem repeat motifs of 1–6 nucleotides that are abundant in the genomes of various taxa [28 Beckmann JS, Weber JL. Survey of human and rat microsatellites. Genomics. 1992;12(4):627–631.]. Microsatellites can be mononucleotide (A), dinucleotide (GT), trinucleotide (ATT), tetranucleotide (ATCG), pentanucleotide (TAATC) or hexanucleotide (TGTGCA) repeats [29 Weber JL. Informativeness of human (dC-dA)n·(dG-dT)n polymorphisms. Genomics. 1990;7(4):524–530.]. Microsatellites are distributed throughout the nuclear genome, and they are also present in the chloroplast [30 Provan J, Powell W, Hollingsworth PM. Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol. 2001;16(3):142–147.] and mitochondrial genomes [31 Rajendrakumar P, Biswal AK, Balachandran SM, et al. Simple sequence repeats in organellar genomes of rice: frequency and distribution in genic and intergenic regions. Bioinformatics. 2007;23(1):1–4.]. Studies have also confirmed the presence of SSRs in protein-coding genes and expressed sequence tags (ESTs) [32 Morgante M, Hanafey M, Powell W. Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002;30(2):194–200.]. SSRs have a relatively low degree of repetition per locus combined with a high level of polymorphism [33 Zane L, Bargelloni L, Patarnello T. Strategies for microsatellite isolation: a review. Mol Ecol. 2002;11(1):1–16.]. This high polymorphism is due to variation in the number of repeats in microsatellite regions and can easily be detected by PCR [34 Kalia RK, Rai MK, Kalia S, et al. Microsatellite markers: an overview of the recent progress in plants. Euphytica. 2011;177(3):309–334.]. SSRs may arise through slippage of single-strand DNA, recombination of double-strand DNA, transfer of mobile elements (retrotransposons) and mismatches. Common motifs in SSRs are mono: A, T; di: AT, GA; tri: AGG; tetra: AAAC. The sequences flanking SSRs are mostly conserved and are used to design primers. These primers are obtained by constructing a genomic library and sequencing a segment of the studied genome. For more information on the development of SSRs, see the review by Kalia et al. [34]. The development of SSR markers involves construction of an SSR library and detection of specific microsatellites, followed by identification of favourable regions for primer design and then PCR. The banding patterns are then interpreted and evaluated, and the PCR products are assessed for polymorphism [35 Röder MS, Korzun V, Wendehake K, et al. A microsatellite map of wheat. Genetics. 1998;149(4):2007–2023.]. SSR markers are considered a marker of choice, as they are co-dominant, highly reproducible and abundant in the genome, and they can be used efficiently in plant mapping studies [26,34].
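Detecting microsatellites in a sequenced segment amounts to searching for short motifs of 1–6 nucleotides repeated in tandem. The sketch below does this with a regular expression; the minimum of four repeats and the test sequence are assumptions chosen for illustration, and published SSR-mining tools usually apply motif-specific thresholds.

```python
import re

def find_ssrs(seq, min_repeats=4, max_motif=6):
    """Return (start, motif, repeat count) for each perfect tandem repeat.

    The lazy quantifier makes the regex try the shortest motif first, so
    GTGTGTGT is reported as (GT)4 rather than (GTGT)2.
    """
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1))
    hits = []
    for m in pattern.finditer(seq):
        motif = m.group(1)
        hits.append((m.start(), motif, len(m.group(0)) // len(motif)))
    return hits

# Hypothetical sequence containing a (GT)6 and an (ATT)5 microsatellite.
seq = "CCGA" + "GT" * 6 + "CATC" + "ATT" * 5 + "GC"
ssrs = find_ssrs(seq)   # [(4, 'GT', 6), (20, 'ATT', 5)]
```

Once a repeat is located, primers are designed in the conserved flanking sequence, and alleles that differ in repeat number give PCR products of different length.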

Chloroplast microsatellites (cpSSRs)

The low mutation rate that characterizes the chloroplast genome makes it difficult to detect sufficient sequence variation. In contrast, cpSSRs provide higher levels of polymorphism and are easy to genotype, which has made them a very handy and popular marker for population genetic studies [30]. cpSSRs typically consist of mononucleotide motifs repeated 8–15 times. The level of polymorphism in cpSSRs varies considerably across species and loci. Two important features distinguish cpSSRs from nuclear microsatellites: (i) chloroplasts are inherited uniparentally and (ii) the chloroplast chromosome is a non-recombinant molecule, so all cpSSR loci are linked [36 Navascues M, Emerson BC. Chloroplast microsatellites: measures of genetic diversity and the effect of homoplasy. Mol Ecol. 2005;14(5):1333–1341.]. cpSSRs have been successfully applied in agriculture and basic plant sciences [37 Ebert D, Peakall RO. Chloroplast simple sequence repeats (cpSSRs): technical resources and recommendations for expanding cpSSR discovery and applications to a wide array of plant species. Mol Ecol Resour. 2009;9(3):673–690.].

Mitochondrial microsatellites

Plant mitochondrial DNA (mtDNA) is very dynamic; it is the largest and has the lowest gene density among eukaryotic mitochondrial genomes. Its size ranges between 200 and 2500 kb and it contains various repeated elements and introns [38 Liu Y, Xue JY, Wang B, et al. The mitochondrial genomes of the early land plants Treubia lacunosa and Anomodon rugelii: dynamic and conservative evolution. PLoS One. 2011;6(10):e25836.]. Plant mtDNA is larger than animal mtDNA and is characterized by molecular heterogeneity, seen as groups of circular chromosomes that differ in size and abundance. The evolution rate of mtDNA markers is slow, which is unfortunate for plant population biologists, so these markers have very limited application in population genetics. mtDNA markers have several limitations: they represent only a single locus, missing links among mitochondrial haplotypes increase the uncertainty of genealogical analysis, and genetic diversity may be underestimated [39 Zhang DX, Hewitt GM. Nuclear DNA analyses in genetic studies of populations: practice, problems and prospects. Mol Ecol. 2003;12(3):563–584.].

RAMP (Randomly amplified microsatellite polymorphisms)

Microsatellite markers reveal a high level of polymorphism but have the drawback of being labour intensive. RAPD markers are cost effective compared with microsatellites, but they detect less polymorphism. To overcome the shortcomings of these two methods, randomly amplified microsatellite polymorphism (RAMP) markers were developed [40 Wu KS, Jones R, Danneberger L, Scolnik PA. Detection of microsatellite polymorphisms without cloning. Nucleic Acids Res. 1994;22(15):3257–3258.]. This marker system uses an SSR primer to amplify genomic DNA in the presence or absence of RAPD primers. The SSR primers are radiolabelled and consist of a 5′ anchor and 3′ repeats. The resulting products are resolved using submarine agarose electrophoresis [41 Salazar JA, Rasouli M, Moghaddam RF, et al. Low-cost strategies for development of molecular markers linked to agronomic traits in Prunus. Agric Sci. 2014;5(05):430–439.]. The melting temperature of the anchored primers is kept 10–15 °C higher than that of the RAPD primers, which helps the anchored primer to anneal efficiently [40]. RAMP markers are cost effective, reveal high polymorphism and are widely distributed in the genome. They have been successfully applied in various plants for molecular characterization [40,41].

Sequence-related amplified polymorphism (SRAP)

Li and Quiros [42 Li G, Quiros CF. Sequence-related amplified polymorphism (SRAP), a new marker system based on a simple PCR reaction: its application to mapping and gene tagging in Brassica. Theor Appl Genet. 2001;103(2–3):455–461.] developed this method mainly for the amplification of open reading frames (ORFs). The marker system is based on amplification with two primers, each 17–18 nucleotides long. The forward primer contains the sequence CCGG and the reverse primer AATT. During PCR, the annealing temperature is set at 35 °C for the first five cycles, and the remaining 35 cycles are run at an annealing temperature of 50 °C. The amplified product is then separated by gel electrophoresis and the DNA bands are visualized through autoradiography. Sequence-related amplified polymorphisms (SRAPs) are dominant markers, so DNA fragments are scored by the presence or absence of a band. This simple and efficient marker system is widely used in a range of fields, including map construction and genomic and cDNA fingerprinting [41]. It has been successfully applied to investigate genetic variation in different taxa [43 Uzun A, Yesiloglu T, Aka-Kacar Y, et al. Genetic diversity and relationships within citrus and related genera based on sequence related amplified polymorphism markers (SRAPs). Sci Hort. 2009;121(3):306–312.].
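The two-stage annealing profile and the presence/absence scoring described above can be written down compactly. This is only a sketch: the function names, sample names and band sizes are hypothetical, and the denaturation and extension steps of each cycle are omitted.

```python
def srap_annealing_schedule(first_cycles=5, first_temp=35,
                            later_cycles=35, later_temp=50):
    """Annealing temperature (°C) for each cycle of the SRAP programme."""
    return [first_temp] * first_cycles + [later_temp] * later_cycles

def score_bands(samples):
    """Convert band sizes per sample into a dominant 1/0 scoring matrix.

    samples maps a sample name to the set of band sizes (bp) observed.
    Returns the sorted list of all band sizes and, per sample, a list with
    1 where the band is present and 0 where it is absent.
    """
    sizes = sorted(set().union(*samples.values()))
    matrix = {name: [1 if s in bands else 0 for s in sizes]
              for name, bands in samples.items()}
    return sizes, matrix

schedule = srap_annealing_schedule()   # 5 cycles at 35 °C, then 35 at 50 °C
sizes, matrix = score_bands({"cv1": {250, 400, 610}, "cv2": {250, 610}})
# sizes  -> [250, 400, 610]
# matrix -> {"cv1": [1, 1, 1], "cv2": [1, 0, 1]}
```

Because the marker is dominant, a score of 1 does not distinguish a homozygote from a heterozygote for the band.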

Inter simple sequence repeat (ISSR)

This technique was developed by Zietkiewicz et al. [44 Zietkiewicz E, Rafalski A, Labuda D. Genome fingerprinting by simple sequence repeat (SSR)-anchored polymerase chain reaction amplification. Genomics. 1994;20(2):176–183.]. It is based on amplification of DNA segments located between two identical but oppositely oriented microsatellite repeat regions that lie at an amplifiable distance from each other. The primers used in this technique are themselves microsatellite sequences and may consist of di-, tri-, tetra- or penta-nucleotide repeats. Normally, long primers of 15–30 bases are used. The primers used in inter simple sequence repeat (ISSR) analysis may be unanchored [45 Gupta M, Chyi YS, Romero-Severson J, et al. Amplification of DNA markers from evolutionarily diverse genomes using single primers of simple-sequence repeats. Theor Appl Genet. 1994;89(7–8):998–1006.] or, more typically, anchored at the 3′ or 5′ end with 1 to 4 degenerate bases that extend into the flanking sequences. ISSR allows the use of high annealing temperatures (about 45–60 °C); the amplified products are 200–2000 bp long and can be visualized by agarose gel electrophoresis or PAGE [46 Fang DQ, Roose ML. Identification of closely related citrus cultivars with inter-simple sequence repeat markers. Theor Appl Genet. 1997;95(3):408–417.; 47 Moreno S, Martín JP, Ortiz JM. Inter-simple sequence repeats PCR for characterization of closely related grapevine germplasm. Euphytica. 1998;101(1):117–25.]. ISSRs segregate according to simple Mendelian laws of inheritance and are characterized as dominant markers [44; 48 Tsumura Y, Ohba K, Strauss SH. Diversity and inheritance of inter-simple sequence repeat polymorphisms in Douglas-fir (Pseudotsuga menziesii) and sugi (Cryptomeria japonica). Theor Appl Genet. 1996;92(1):40–45.]; however, they can also be used in the development of co-dominant markers [49 Ng WL, Tan SG. Inter-simple sequence repeat (ISSR) markers: are we doing it right? ASM Sci J. 2015;9:30–39.]. ISSRs are simple and easy to understand compared with RAPD, and no prior knowledge of DNA sequences is needed [50 Chatterjee SN, Vijayan K, Roy GC, et al. ISSR profiling of genetic variability in the ecotypes of Antheraea mylitta Drury, the tropical tasar silkworm. Russ J Genet. 2004;40(2):152–159.; 51 Kar PK, Vijayan K, Mohandas TP, et al. Genetic variability and genetic structure of wild and semi-domestic populations of tasar silkworm (Antheraea mylitta) ecorace Daba as revealed through ISSR markers. Genetica. 2005;125(2–3):173–183.]. However, they are dominant markers, their reproducibility can be low, and the homology of co-migrating amplification products is uncertain [11 Semagn K, Bjørnstad Å, Ndjiondjop MN. An overview of molecular marker methods for plants. Afr J Biotechnol. 2006;5(25):2540–2568.].
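An anchored ISSR primer is a microsatellite core plus a short anchor that may contain degenerate bases. The sketch below expands such a primer into the concrete sequences it represents, using the standard IUPAC nucleotide codes; the (CA)8 core and the RG anchor are hypothetical choices for illustration.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def anchored_issr_primers(motif, repeats, anchor, end="3"):
    """Expand an anchored ISSR primer into all concrete sequences.

    motif * repeats is the microsatellite core; anchor holds 1 to 4
    (possibly degenerate) bases placed at the 3' end (end="3") or at the
    5' end (end="5") of the core.
    """
    core = motif * repeats
    variants = ["".join(p) for p in product(*(IUPAC[b] for b in anchor))]
    if end == "3":
        return [core + v for v in variants]
    return [v + core for v in variants]

primers = anchored_issr_primers("CA", 8, "RG")
# -> ['CACACACACACACACAAG', 'CACACACACACACACAGG'], each 18 bases long
```

The anchor forces the primer to anneal at the edge of the repeat rather than anywhere within it, which is what makes anchored primers the more typical choice.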

Retrotransposons

Transposons are mobile genetic elements capable of changing their location in the genome. Transposable elements were discovered in maize almost 60 years ago [52 Finnegan DJ. Eukaryotic transposable elements and genome evolution. Trends Genet. 1989;5:103–107.; 53 Grzebelus D. Transposon insertion polymorphism as a new source of molecular markers. J Fruit Ornam Plant Res. 2006;14(Suppl 1):21–29.]. There are two classes of transposable elements. Class I comprises the retro-elements, such as retrotransposons; it also includes short and long interspersed nuclear elements, and all of these elements move via an RNA copy. In this class, a new copy of the transposon is produced at each transposition event, while the original copy remains intact at the donor site. Class II contains the DNA transposons, which change their location in the genome by a cut-and-paste mechanism [53]. Retrotransposons are an important class of repetitive DNA, constituting 40%–60% of the entire plant genome [53; 54 Kumar A, Bennetzen JL. Plant retrotransposons. Annu Rev Genet. 1999;33(1):479–532.]. Retrotransposons belong to class I of transposable elements and transpose through an RNA intermediate, which is not involved in the transposition of class II elements [52]. Retrotransposons are grouped into two subclasses on the basis of their structure and transposition cycle: long terminal repeat (LTR) retrotransposons and non-LTR retrotransposons, the latter including long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). These two subclasses are differentiated by the presence or absence of LTRs at their ends [55 Kalendar R, Vicient CM, Peleg O, et al. Large retrotransposon derivatives: abundant, conserved but nonautonomous retroelements of barley and related genomes. Genetics. 2004;166(3):1437–1450.]. LTR retrotransposons are widely distributed in plant genomes, and in many crop plants they make up nearly 40%–70% of the DNA [56 Shirasu K, Schulman AH, Lahaye T, et al. A contiguous 66-kb barley DNA sequence provides evidence for reversible genome expansion. Genome Res. 2000;10(7):908–915.; 57 Pearce SR, Pich U, Harrison G, et al. The Ty1-copia group retrotransposons of Allium cepa are distributed throughout the chromosomes but are enriched in the terminal heterochromatin. Chromosome Res. 1996;4(5):357–364.]. On integration, LTR retrotransposons often produce target site duplications of 4–6 bp. LTR retrotransposons contain the ORFs GAG and POL and are widely distributed within plant genomes [58 Voytas DF, Boeke JD. Ty1 and Ty5 of Saccharomyces cerevisiae. In: Craigie R, Gellert M, Lambowitz A, editors. Mobile DNA II. Washington (DC): ASM Press; 2002. p. 631–662.]. LTR retrotransposons are further divided into Ty1/copia and Ty3/gypsy retrotransposons on the basis of the order of the encoded genes [59 Roy NS, Choi JY, Lee SI, et al. Marker utility of transposable elements for plant genetics, breeding, and ecology: a review. Genes Genom. 2015;37(2):141–151.]. Class II transposable elements are further divided into terminal inverted repeat (TIR) and non-TIR subclasses [60 Schulman AH, Wicker T. A field guide to transposable elements. In: Fedoroff NV, editor. Vol. 2, Plant transposons and genome dynamics in evolution. Oxford: Wiley-Blackwell; 2013. p. 15–40.]. As transposable elements are highly abundant and widely dispersed in the genome, they are an ideal source for the development of molecular markers [61 Kalendar R, Flavell AJ, Ellis TH, et al. Analysis of plant diversity with retrotransposon-based molecular markers. Heredity. 2011;106(4):520–530.]. The following are some important retrotransposon-based molecular markers.
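The difference between the two transposition modes can be made concrete with a toy model in which a genome is an ordered list of named features. The function and the feature names are hypothetical; the model captures only whether the donor copy is retained, not the underlying biochemistry.

```python
def transpose(genome, element, target_index, mechanism):
    """Return a new genome after one transposition event.

    "copy-and-paste" (class I): a new copy is inserted and the original
    stays at the donor site. "cut-and-paste" (class II): the element is
    excised from the donor site and reinserted at the target.
    target_index refers to a position in the genome before the event.
    """
    genome = list(genome)
    source = genome.index(element)
    if mechanism == "cut-and-paste":
        genome.pop(source)
        if source < target_index:
            target_index -= 1   # excision shifts later positions left
    elif mechanism != "copy-and-paste":
        raise ValueError("unknown mechanism: %s" % mechanism)
    genome.insert(target_index, element)
    return genome

genome = ["geneA", "TE", "geneB", "geneC"]
class_i = transpose(genome, "TE", 3, "copy-and-paste")
# -> ['geneA', 'TE', 'geneB', 'TE', 'geneC']   (copy number increases)
class_ii = transpose(genome, "TE", 3, "cut-and-paste")
# -> ['geneA', 'geneB', 'TE', 'geneC']         (copy number unchanged)
```

The accumulation of copies under the class I mode is consistent with the large genome fraction occupied by retrotransposons and with their usefulness as insertion markers.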

IRAP

Inter-retrotransposon amplified polymorphism (IRAP) is a retrotransposon-based marker developed by Kalendar et al. [62 Kalendar R, Grob T, Regina M, et al. IRAP and REMAP: two new retrotransposon-based DNA fingerprinting techniques. Theor Appl Genet. 1999;98(5):704–711.]. The IRAP system amplifies the sequences between two adjacent LTR retrotransposons using primers complementary to the 3' end of the LTR sequence. The orientation of these LTR sequences can be (1) tail–tail, (2) head–head or (3) head–tail [63 Poczai P, Varga I, Laos M, et al. Advances in plant gene-targeted and functional markers: a review. Plant Methods. 2013;9(1):6.]. In the head-to-head arrangement, identical sequences lie on different strands, separated by small inter-genic distances, and are transcribed away from each other. In the tail-to-tail orientation the arrangement is the opposite, and the sequences are transcribed towards each other. Both 5' and 3' primers are used for head-to-tail LTRs, while a single primer is sufficient for the head-to-head or tail-to-tail arrangement. Agarose gel is used to resolve the IRAP products [64 Kalendar R, Schulman AH. IRAP and REMAP for retrotransposon-based genotyping and fingerprinting. Nat Protoc. 2006;1(5):2478–2484.]. A single IRAP reaction can produce many amplicons of different sizes, ranging between 300 and 3000 bp [65 Fan F, Cui B, Zhang T, et al. LTR-retrotransposon activation, IRAP marker development and its potential in genetic diversity assessment of masson pine (Pinus massoniana). Tree Genet Genomes. 2014;10(1):213–222.].

REMAP

Retrotransposon microsatellite amplification polymorphism (REMAP) is an important retrotransposon-based marker commonly used to analyse genetic diversity. The REMAP protocol is similar to IRAP; however, in REMAP, SSR (microsatellite) primers are used in conjunction with LTR-specific primers during PCR [62,64]. The microsatellite primers selected for REMAP PCR consist of a repeat motif with an anchoring nucleotide at the 3' end, which prevents the primer from slipping between individual SSR motifs [59].

Retrotransposon-based insertion polymorphism (RBIP)

This technique was developed by Flavell et al. [66 Flavell AJ, Knox MR, Pearce SR, et al. Retrotransposon-based insertion polymorphisms (RBIP) for high throughput marker analysis. Plant J. 1998;16(5):643–650.]. It investigates the presence or absence of retrotransposon sequences at a given site, which can be used as a molecular marker. DNA is amplified using primers designed from the 3' and 5' regions flanking the retrotransposon insertion site. The presence of the insertion is detected using a primer designed from the LTR. This technique requires sequence information about the regions flanking the retrotransposon insertion site and, unlike other retrotransposon-based markers, it types a single locus [65]. Agarose gel electrophoresis is used for the detection of polymorphism [67 Agarwal M, Shrivastava N, Padh H. Advances in molecular marker techniques and their applications in plant sciences. Plant Cell Rep. 2008;27(4):617–631.]. The tagged microarray marker method, which is based on fluorescent microarray scoring, is used for high-throughput retrotransposon-based insertion polymorphism (RBIP) analysis [68 Jing R, Bolshakov V, Flavell AJ. The tagged microarray marker (TAM) method for high-throughput detection of single nucleotide and indel polymorphisms. Nat Protoc. 2007;2(1):168–177.].
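The scoring logic implied by the two kinds of primer can be sketched as a small decision function. It assumes, for illustration, that one reaction pairs the two flanking primers (product expected only from the empty site) and another pairs an LTR primer with a flanking primer (product expected only when the insertion is present); the function name and the wording of the outcomes are hypothetical.

```python
def rbip_genotype(flank_flank_product, ltr_flank_product):
    """Interpret RBIP results at one locus from two PCR outcomes.

    flank_flank_product: True if the flanking primers gave a product
                         (expected from the empty site).
    ltr_flank_product:   True if the LTR primer plus a flanking primer
                         gave a product (expected from the occupied site).
    """
    if flank_flank_product and ltr_flank_product:
        return "heterozygous (insertion on one homologue)"
    if ltr_flank_product:
        return "insertion present"
    if flank_flank_product:
        return "insertion absent (empty site)"
    return "no amplification (failed reaction or null allele)"

calls = [rbip_genotype(False, True),
         rbip_genotype(True, False),
         rbip_genotype(True, True)]
```

Because both states of the locus give a positive signal, a failed reaction can be told apart from a true absence of the insertion.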

Inter-primer binding site (iPBS)

A major limitation of the retrotransposon-based markers described above is that they require prior knowledge of the LTR sequence, which is usually obtained by cloning and sequencing the LTR. The iPBS technique avoids this requirement by using the primer binding sites (PBSs) of retrotransposons. All LTR retrotransposons and retroviruses carry a PBS, a highly conserved sequence adjacent to the 5' LTR that is complementary to the 3' terminal sequence of a tRNA. Reverse transcription starts when the tRNA binds to the PBS through its 3' terminal sequence. iPBS primers are designed to anneal to this conserved region, so that diverse sequences can be amplified. Retrotransposons are mostly mixed, inverted, nested or truncated in the chromosomal sequences, and they can be amplified by using a conserved PBS primer. Fragments of retrotransposons that retain an LTR and part of the internal region are often located near other retrotransposons, so that PBS sequences occur close to each other. iPBS is a universal method, as PBSs occur in all LTR retrotransposon sequences [69Kalendar R, Antonius K, Smýkal P, et al. iPBS: a universal method for DNA fingerprinting and retrotransposon isolation. Theor Appl Genet. 2010;121(8):14191430.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Recently, inter-primer binding site (iPBS) markers have emerged as an important and universal method for the identification of genetic diversity and relationships in various plants [61Kalendar R, Flavell AJ, Ellis TH, et al. Analysis of plant diversity with retrotransposon-based molecular markers. Heredity. 2011;106(4):520530.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],69Kalendar R, Antonius K, Smýkal P, et al. iPBS: a universal method for DNA fingerprinting and retrotransposon isolation. Theor Appl Genet. 2010;121(8):14191430.[Crossref], [PubMed], [Web of Science ®], [Google Scholar], 70Baloch FS, Alsaleh A, de Miera LE, et al. DNA based iPBS-retrotransposon markers for investigating the population structure of pea (Pisum sativum) germplasm from Turkey. Biochem Syst Ecol. 2015;61:244252.[Crossref], [Web of Science ®], [Google Scholar]].

Cleaved amplified polymorphic sequences (CAPS)

Cleaved amplified polymorphic sequence (CAPS) markers were originally named PCR–RFLP markers because they combine PCR and RFLP [71Maeda M, Uryu N, Murayama N, et al. A simple and rapid method for HLA-DP genotyping by digestion of PCR-amplified DNA with allele-specific restriction endonucleases. Hum Immunol. 1990;27(2):111121.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. In this technique, target DNA is amplified by PCR and then digested with restriction enzymes [72Jarvis P, Lister C, Szabo V, et al. Integration of CAPS markers into the RFLP map generated using recombinant inbred lines of Arabidopsis thaliana. Plant Mol Biol. 1994;24(4):685687.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],73Michaels SD, Amasino RM. A robust method for detecting single-nucleotide changes as polymorphic markers by PCR. Plant J. 1998;14(3):381385.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. CAPS products are visualized on agarose or acrylamide gels. The primers used in this technique are designed from sequence information available in genomic databases, cloned RAPD bands or cDNA sequences. CAPS markers are versatile, and the chance of finding DNA polymorphism can be increased by combining CAPS with single-strand conformational polymorphism, SCAR, AFLP or RAPD [67Agarwal M, Shrivastava N, Padh H. Advances in molecular marker techniques and their applications in plant sciences. Plant Cell Rep. 2008;27(4):617631.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. CAPS markers are co-dominant and have been used in genotyping, map-based cloning and molecular identification studies [74Spaniolas S, May ST, Bennett MJ, et al. Authentication of coffee by means of PCR-RFLP analysis and lab-on-a-chip capillary electrophoresis. J Agric Food Chem. 2006;54(20):74667470.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],75Weiland JJ, Yu MH. A cleaved amplified polymorphic sequence (CAPS) marker associated with root-knot nematode resistance in sugarbeet. Crop Sci. 2003;43(5):1814188.[Crossref], [Web of Science ®], [Google Scholar]].
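The amplify-then-digest logic of a CAPS assay can be illustrated with a small in-silico sketch. It is not a published protocol: the two allele sequences are invented for illustration, the function names `digest` and `caps_bands` are hypothetical, and only the top-strand cut position of a single enzyme (EcoRI, G^AATTC) is modelled.

```python
def digest(amplicon: str, site: str, cut_offset: int) -> list[int]:
    """Return fragment lengths after cutting `amplicon` at every `site`.

    `cut_offset` is the position inside the recognition site where the
    enzyme cuts the top strand (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = amplicon.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = amplicon.find(site, pos + 1)
    fragments.append(len(amplicon) - start)
    return fragments


def caps_bands(allele1: str, allele2: str, site: str, cut_offset: int) -> list[int]:
    """Band sizes expected on a gel for a diploid carrying two alleles."""
    bands = set(digest(allele1, site, cut_offset)) | set(digest(allele2, site, cut_offset))
    return sorted(bands, reverse=True)


# Hypothetical 46-bp amplicons: allele A carries an EcoRI site, allele B
# has a single-base change (A -> C) that destroys it.
ALLELE_A = "ATGC" * 5 + "GAATTC" + "ATGC" * 5
ALLELE_B = "ATGC" * 5 + "GACTTC" + "ATGC" * 5

print(caps_bands(ALLELE_A, ALLELE_A, "GAATTC", 1))  # homozygous cut allele
print(caps_bands(ALLELE_B, ALLELE_B, "GAATTC", 1))  # homozygous uncut allele
print(caps_bands(ALLELE_A, ALLELE_B, "GAATTC", 1))  # heterozygote
```

In this toy case the heterozygote shows both the uncut band and the two digestion products, which is why CAPS markers can be scored as co-dominant.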

SCAR (Sequence-characterized amplified regions)

Sequence-characterized amplified region (SCAR) markers were first developed in 1993 by Paran and Michelmore in lettuce for downy mildew resistance genes [76Paran I, Michelmore RW. Development of reliable PCR-based markers linked to downy mildew resistance genes in lettuce. Theor Appl Genet. 1993;85(8):985993.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. SCAR markers are more specific and more reproducible as compared to RAPD [77Yang L, Fu S, Khan A, et al. Molecular cloning and development of RAPD-SCAR markers for Dimocarpus longan variety authentication. Springer Plus. 2013;2:501.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. SCAR markers are co-dominant and mono-locus markers and are mostly applied for physical mapping [78Bhagyawant SS. RAPD-SCAR markers: an interface tool for authentication of traits. J Biosci Med. 2016;4:19.[Google Scholar]]. The procedure for the development of SCAR markers includes purification of PCR fragments followed by designing of SCAR primer [76–79Paran I, Michelmore RW. Development of reliable PCR-based markers linked to downy mildew resistance genes in lettuce. Theor Appl Genet. 1993;85(8):985993.
Yang L, Fu S, Khan A, et al. Molecular cloning and development of RAPD-SCAR markers for Dimocarpus longan variety authentication. Springer Plus. 2013;2:501.
Bhagyawant SS. RAPD-SCAR markers: an interface tool for authentication of traits. J Biosci Med. 2016;4:19.
Kiran U, Khan S, Mirza KJ, et al. SCAR markers: a potential tool for authentication of herbal drugs. Fitoterapia. 2010;81(8):969976.
]. Polymorphic bands are detected on agarose gel and the nucleotide sequence of the selected DNA fragment is then determined. This sequence is compared with the known DNA sequences available in the NCBI (National Center for Biotechnology Information) database to check its uniqueness. The nucleotide sequence of the polymorphic DNA is then used to design specific SCAR primers [78Bhagyawant SS. RAPD-SCAR markers: an interface tool for authentication of traits. J Biosci Med. 2016;4:19.[Google Scholar]].

Sequence-based markers

Sequencing is a technique in which the nucleotide bases and their order along the DNA strand are identified [80Franca LT, Carrilho E, Kist TB. A review of DNA sequencing techniques. Q Rev Biophys. 2002;35(2):169200.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], and molecular markers which are based on the identification of a particular sequence of DNA in a pool of unknown DNA are known as sequence-based markers. This technology was developed because hybridization-based markers are less reliable and less polymorphic. The advent of sequencing technologies such as next-generation sequencing (NGS) and genotyping by sequencing (GBS) revolutionized plant breeding through the development of SNPs, which provide high polymorphism [81Davey JW, Hohenlohe PA, Etter PD, et al. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12(7):499510.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Various types of sequencing technologies have been developed so far and have been reviewed by Heather and Chain [82Heather JM, Chain B. The sequence of sequencers: the history of sequencing DNA. Genomics. 2016;107(1):18.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Some important sequencing methods are described below.

Sanger method of sequencing

The plus-and-minus method, introduced by Sanger and Coulson, was one of the first methods of DNA sequencing [83Sanger F, Coulson AR. A rapid method for determining sequences in DNA by primed synthesis with DNA polymerase. J Mol Biol. 1975;94(3):441448.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The basic principle of this method is that single-stranded DNA molecules differing in length by a single nucleotide can be separated from each other by PAGE. During the early studies, bacteriophage T4 DNA polymerase and DNA polymerase I from Escherichia coli were used in this method [84Englund PT. Analysis of nucleotide sequences at 3′termini of duplex deoxyribonucleic acid with the use of the T4 deoxyribonucleic acid polymerase. J Biol Chem. 1971;246(10):32693276.[PubMed], [Web of Science ®], [Google Scholar],85Englund PT. The 3′-terminal nucleotide sequences of T7 DNA. J Mol Biol. 1972;66(2):209224.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The resultant polymerization products were loaded onto acrylamide gels and resolved through ionophoresis. However, this method had many limitations; hence, two years later, Sanger and his team introduced a new technique in which enzymatic DNA synthesis is stopped by chain-terminating inhibitors [86Sanger F, Nicklen S, Coulson AR. DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci. 1977;74(12):54635467.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. This technique is known as Sanger's dideoxy sequencing method, as it uses modified bases known as dideoxy nucleotides [82Heather JM, Chain B. The sequence of sequencers: the history of sequencing DNA. Genomics. 2016;107(1):18.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. This method facilitates the maximum measurement of variations. It has high reproducibility and requires a low quantity of DNA. However, it is costly and time consuming, gives low genome coverage and detects little polymorphism below the species level [80Franca LT, Carrilho E, Kist TB. A review of DNA sequencing techniques. Q Rev Biophys. 2002;35(2):169200.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
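The principle that fragments differing by a single nucleotide can be resolved by size, and that each dideoxy nucleotide terminates synthesis at its own base, can be sketched as a toy simulation. The function names and the example sequence are hypothetical, and the sketch ignores primer length, incomplete termination and gel resolution limits.

```python
def chain_termination_fragments(synthesized: str) -> dict[str, list[int]]:
    """Simulate four dideoxy reactions (one per base).

    For each base, return the lengths of the fragments that would end
    with the corresponding dideoxy nucleotide, i.e. every position at
    which that base occurs in the newly synthesized strand.
    """
    lanes = {base: [] for base in "ACGT"}
    for length, base in enumerate(synthesized, start=1):
        lanes[base].append(length)
    return lanes


def read_gel(lanes: dict[str, list[int]]) -> str:
    """Read the sequence from shortest to longest fragment across lanes."""
    base_by_length = {
        length: base for base, lengths in lanes.items() for length in lengths
    }
    return "".join(base_by_length[length] for length in sorted(base_by_length))


lanes = chain_termination_fragments("GATTACA")
print(lanes)            # fragment lengths per ddNTP lane
print(read_gel(lanes))  # sequence recovered from the simulated gel
```

Reading the lanes in order of increasing fragment length recovers the synthesized strand, which is the logic behind reading a sequencing gel from bottom to top.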

Pyrosequencing

Pyrosequencing is a sequencing-by-synthesis technique [87Hyman ED. A new method of sequencing DNA. Anal Biochem. 1988;174(2):423436.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] in which the pyrophosphate released during the synthesis of DNA is detected [88Ronaghi M. Pyrosequencing sheds light on DNA sequencing. Genome Res. 2001;11(1):311.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. In this method, the sequencing primer is hybridized to a biotin-labelled single-stranded DNA template and combined with specific enzymes [89Gharizadeh B, Herman ZS, Eason RG, et al. Large-scale Pyrosequencing of synthetic DNA: a comparison with results from Sanger dideoxy sequencing. Electrophoresis. 2006;27(15):30423047.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The four deoxynucleoside triphosphates are added to the reaction mixture separately, one at a time, in repeated cycles. Each time a nucleotide is incorporated by the polymerase, inorganic pyrophosphate (PPi) is released, and the amount of PPi released is equimolar to the amount of nucleotide incorporated. Initially, this technique was used to monitor the activity of DNA polymerase. Solid phase sequencing [90Ronaghi M, Karamohamed S, Pettersson B, et al. Real-time DNA sequencing using detection of pyrophosphate release. Anal Biochem. 1996;242(1):8489.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] and liquid phase sequencing [91Ronaghi M, Uhlén M, Nyren P. A sequencing method based on real-time pyrophosphate. Science. 1998;281(5375):363365.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] are the two types of pyrosequencing.
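Because the amount of PPi released is proportional to the number of nucleotides incorporated, the read-out of a pyrosequencing run can be thought of as a list of signal heights, one per nucleotide dispensation. The sketch below is a simplified model under stated assumptions: the dispensation order `ACGT`, the function names and the example sequence are illustrative, and the signal is taken as an exact integer count with no noise.

```python
def flowgram(to_synthesize: str, order: str = "ACGT") -> list[tuple[str, int]]:
    """Simulate cyclic dispensation of single nucleotides.

    For each dispensed nucleotide, the signal equals the number of
    consecutive bases incorporated (0 if the nucleotide does not match
    the next base of the strand being synthesized).
    """
    if set(to_synthesize) - set(order):
        raise ValueError("sequence contains bases that are never dispensed")
    signals = []
    pos = 0
    while pos < len(to_synthesize):
        for nucleotide in order:
            run = 0
            while pos < len(to_synthesize) and to_synthesize[pos] == nucleotide:
                run += 1
                pos += 1
            signals.append((nucleotide, run))
            if pos == len(to_synthesize):
                break
    return signals


def decode(signals: list[tuple[str, int]]) -> str:
    """Rebuild the synthesized sequence from the signal heights."""
    return "".join(nucleotide * height for nucleotide, height in signals)


signals = flowgram("GGATC")
print(signals)
print(decode(signals))
```

A homopolymer such as GG gives a single dispensation with a signal of 2, which is why signal height, not just presence or absence, has to be measured.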

Next-generation sequencing (NGS)

Developments in sequencing techniques increased the demand for high-throughput sequencing at a low cost. This demand led to the development of NGS, which currently produces millions of sequences. This technique has the ability to produce several hundreds of millions to several hundreds of billions of DNA bases per run [92Shendure J, Ji H. Next-generation DNA sequencing. Nat Biotechnol. 2008;26(10):11351145.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Several companies have developed this technique successfully and offer it commercially on platforms such as Illumina MiSeq and HiSeq 2500 [93Bentley DR, Balasubramanian S, Swerdlow HP, et al. Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008;456(7218):5359.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], Roche 454 FLX Titanium [94Thudi M, Li Y, Jackson SA, et al. Current state-of-art of sequencing technologies for plant genomics research. Brief Funct Genomics. 2012;11(1):311.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] and Ion Torrent PGM [95Rothberg JM, Hinz W, Rearick TM, et al. An integrated semiconductor device enabling non-optical genome sequencing. Nature. 2011;475(7356):348352.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. These NGS platforms have lowered sequencing costs while covering the whole genome more precisely [96Deschamps S, Llaca V, May GD. Genotyping-by-sequencing in plants. Biology. 2012;1(3):460483.[Crossref], [PubMed], [Google Scholar]]. All NGS techniques use a similar methodology for the preparation of template DNA, in which DNA is randomly sheared into fragments that are ligated at both ends with universal adapters. Sequencing is performed in a constant channel, where one or more nucleotides are incorporated, resulting in the release of a signal that is detected by the sequencer [97Metzker ML. Sequencing technologies—the next generation. Nat Rev Genet. 2010;11(1):3146.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Some advantages of NGS are as follows: (1) NGS is more accurate than older sequencing methods. (2) It is low in cost with high throughput. (3) It is now used in whole genome sequencing to identify the maximum number of SNPs, to assess the diversity present within a species, to construct linkage/haplotype maps and in genome-wide association studies (GWAS) [98Elshire RJ, Glaubitz JC, Sun Q, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PloS one. 2011;6(5):e19379.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. (4) NGS is also used to sequence ancient DNA samples and has strengthened the field of metagenomics [99Mardis ER. Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387402.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Genotyping by sequencing (GBS)

GBS is a simple and highly multiplexed technique that is now widely used. It was developed in the Buckler lab for the Illumina NGS platform. Advances in NGS have lowered sequencing costs, enabling the successful application of GBS to large-genome species with high levels of diversity [98Elshire RJ, Glaubitz JC, Sun Q, et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PloS one. 2011;6(5):e19379.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. On the basis of ion PGM system usage, there are two types of GBS techniques: (1) Restriction enzyme digestion: this method is mainly used in marker-assisted selection (MAS) programmes for the identification of new markers, and no particular SNPs are targeted. In this method, DNA is digested with one or two specific restriction enzymes prior to the ligation of adapters. (2) Multiplex enrichment PCR: in this technique, specific PCR primers are selected for the amplification of regions of interest. In contrast to the restriction enzyme digestion method, a complete set of SNPs is identified for a genome section. GBS was originally developed for high-resolution association studies in maize and is now used for many other species with complex genomes. The main advantages of GBS are as follows: (I) It costs less than other techniques, which has made GBS a novel technique for the identification of SNPs in different species and crops. (II) It provides satisfactory results in the characterization of germplasm, population studies and breeding of diverse crops [100Poland JA, Rife TW. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome. 2012;5(3):92102.[Crossref], [Web of Science ®], [Google Scholar]]. (III) GBS produces a large number of SNPs which are used in genotyping and genetic analysis [101Beissinger TM, Hirsch CN, Sekhon RS, et al. Marker density and read depth for genotyping populations using genotyping-by-sequencing. Genetics. 2013;193(4):10731081.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. (IV) It reduces the handling of samples and (V) involves fewer PCR and purification steps [81Davey JW, Hohenlohe PA, Etter PD, et al. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12(7):499510.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

On the basis of the sequencing techniques outlined above, the following sequence-based markers have been developed.

Single-nucleotide polymorphism (SNP)

Single base-pair changes present in the genome sequence of an individual are known as SNPs. On the basis of the nucleotide substitution, SNPs may be transitions (C/T or G/A) or transversions (C/G, A/T, C/A or T/G). In practice, single-base changes found in mRNA sequences, including single-base insertions/deletions (InDels), are also regarded as SNPs. A single-nucleotide base is the smallest unit of inheritance, and SNPs can provide the simplest and the maximum number of markers. SNPs are abundant in plants and animals, and the SNP frequency in plants is about one SNP in every 100–300 bp [102Xu Y. Molecular plant breeding. Wallingford: CABI; 2010.[Crossref], [Google Scholar]]. SNPs are widely distributed within the genome and can be found in coding or non-coding regions of genes or between two genes (intergenic region) with different frequencies [102Xu Y. Molecular plant breeding. Wallingford: CABI; 2010.[Crossref], [Google Scholar]]. A large number of methods for SNP genotyping have been developed based on different techniques of allelic discrimination and detection platforms. Among these, RFLP (SNP–RFLP) is the simplest and easiest method, and the CAPS marker technique can also be applied to SNP detection. If a recognition site for a restriction enzyme is present in one allele but absent in the other, digestion will result in fragments of different lengths. SNPs can also be identified through the analysis of sequence data stored in databases. Different kinds of SNP genotyping assays have been developed based on different molecular mechanisms. Among them, primer extension, invasive cleavage, oligonucleotide ligation and allele-specific hybridization are the most important [103Sobrino B, Brión M, Carracedo A. SNPs in forensic genetics: a review on SNP typing methodologies. Forensic Sci Int. 2005;154(2):181194.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Various recent high-throughput genotyping methods such as NGS, GBS, chip-based NGS and allele-specific PCR make SNPs the most attractive markers for genotyping [67Agarwal M, Shrivastava N, Padh H. Advances in molecular marker techniques and their applications in plant sciences. Plant Cell Rep. 2008;27(4):617631.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
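The distinction between transitions and transversions follows directly from base chemistry: a transition exchanges a purine for a purine (A/G) or a pyrimidine for a pyrimidine (C/T), while a transversion exchanges a purine for a pyrimidine or vice versa. A minimal sketch of SNP discovery between two pre-aligned sequences is given below; the function names and the example sequences are hypothetical, and alignment gaps and InDels are deliberately not handled.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_snp(ref: str, alt: str) -> str:
    """Classify a single-base substitution as transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    valid = PURINES | PYRIMIDINES
    if ref not in valid or alt not in valid or ref == alt:
        raise ValueError(f"not a single-base substitution: {ref!r} -> {alt!r}")
    same_class = ({ref, alt} <= PURINES) or ({ref, alt} <= PYRIMIDINES)
    return "transition" if same_class else "transversion"


def find_snps(seq1: str, seq2: str) -> list[tuple[int, str, str, str]]:
    """List (1-based position, base in seq1, base in seq2, type) for each mismatch.

    Assumes the two sequences are already aligned and of equal length.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    snps = []
    for position, (a, b) in enumerate(zip(seq1.upper(), seq2.upper()), start=1):
        if a != b:
            snps.append((position, a, b, classify_snp(a, b)))
    return snps


for snp in find_snps("ACGTAC", "ATGTCC"):
    print(snp)
```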

Diversity array technology (DArT Seq)

DArT is a technique that allows the genotyping of several hundred to several thousand polymorphic loci distributed over the genome. It is a highly reproducible microarray hybridization technology. No prior sequence information is needed for the detection of loci for a trait of interest [104Jaccoud D, Peng K, Feinstein D, et al. Diversity arrays: a solid state technology for sequence information independent genotyping. Nucleic Acids Res. 2001;29(4):E25.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],105Wenzl P, Carling J, Kudrna D, et al. Diversity Arrays Technology (DArT) for whole-genome profiling of barley. Proc Natl Acad Sci U S A. 2004;101(26):99159920.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The most important benefit of this technique is that it is high-throughput and very economical. A single-reaction assay can genotype several thousand genomic loci to discover polymorphic markers. As little as 50–100 ng of genomic DNA is sufficient for genotyping. The same platform is used for both the discovery and the scoring of markers. After the discovery of a marker, no specific assays are needed for genotyping, apart from the initial assembly of the polymorphic markers into a single genotyping array. The polymorphic markers within these genotyping arrays are then routinely used for genotyping [106Huttner E, Wenzl P, Akbari M, et al. Diversity arrays technology: a novel tool for harnessing the genetic potential of orphan crops. In: Serageldin I, Persley GJ, editors. Discovery to delivery: BioVision Alexandria 2004; Proceedings of the 2004 Conference of the World Biological Forum; 2004 Apr 3–6; Alexandria, Egypt. Wallingford: CABI; 2005. p. 145155.[Google Scholar]]. The advantages and disadvantages of different genetic markers are described in Table 2.

Table 2. Advantages and disadvantages of different genetic markers.

Uses of molecular markers in plant sciences

Evolution and phylogeny

In the past, studies of evolution depended entirely on geographical and morphological differences among organisms. Advances in molecular biology techniques now offer much more detailed information on genetic structure [107Slatkin M. Gene flow and the geographic structure of natural populations. Science. 1987;236:787793.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Molecular markers are now used on a large scale to reconstruct genetic maps and to obtain detailed information about phylogeny and evolution [108–111Wei C, Wang L, Yang Y, et al. Identification of an S5n allele in Oryza rufipogon Griff. and its effect on embryo sac fertility. Chinese Sci Bull. 2010;55(13):12551262.
Tong J, Li Y, Yang Y, et al. Molecular evolution of rice S5n allele and functional comparison among different sequences. Chinese Sci Bull. 2011;56(19):20162024.
Peng H, Shahid MQ, Li YH, et al. Molecular evolution of S5 locus and large differences in its coding region revealed insignificant effect on indica× japonica embryo sac fertility. Plant Syst Evol. 2015;301(2):639655.
Wang Y, Ghouri F, Shahid MQ, et al. The genetic diversity and population structure of wild soybean evaluated by chloroplast and nuclear gene sequences. Biochem Syst Ecol. 2017;71:170178.
]. Molecular studies related to phylogeny are largely dependent on chloroplast genome sequence data due to their simple and stable genetic nature, making them ideal markers in the evaluation of plant phylogeny [112Dong W, Liu J, Yu J, et al. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS One. 2012;7(4):e35071.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],113Wang Y, Shahid MQ, Baloch FS. Phylogeographical studies of Glycine soja: implicating the refugium during the quaternary glacial period and large-scale expansion after the last glacial maximum. Turk J Agric For. 2016;40(6):825838.[Crossref], [Web of Science ®], [Google Scholar]].

Investigation of heterosis

Heterosis describes the greater performance of progeny (F1) over the mean of the two crossed parents. If the effect in F1 is greater than that in its parents, such heterosis is known as positive heterosis; while where the effect in F1 is lower than in its parents, such type of heterosis is known as negative heterosis [114Comings DE, MacMurray JP. Molecular heterosis: a review. Mol Gen Metab. 2000;71(1):1931.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Various studies have been conducted by using molecular markers in various crop plants such as wheat [115Martin JM, Talbert LE, Lanning SP, et al. Hybrid performance in wheat as related to parental diversity. Crop Sci. 1995;35(1):104108.[Crossref], [Web of Science ®], [Google Scholar]], maize [116Betran FJ, Ribaut JM, Beck D, et al. Genetic diversity, specific combining ability, and heterosis in tropical maize under stress and nonstress environments. Crop Sci. 2003;43(3):797806.[Crossref], [Web of Science ®], [Google Scholar]] and rape seed [117Yu CY, Hu SW, Zhao HX, et al. Genetic distances revealed by morphological characters, isozymes, proteins and RAPD markers and their relationships with hybrid performance in oilseed rape (Brassica napus L.). Theor Appl Genet. 2005;110(3):511518.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], to investigate the genetic diversity and heterosis. Molecular markers like SSRs have been used in the investigation of diversity and heterosis in rice [118Wu JW, Hu CY, Shahid MQ, et al. Analysis on genetic diversification and heterosis in autotetraploid rice. Springer Plus. 2013;2(1):439.[Crossref], [PubMed], [Google Scholar]]. Recently, SSR markers were applied in order to investigate the heterotic groups and patterns in rice [119Xie F, He Z, Esguerra MQ, et al. Determination of heterotic groups for tropical Indica hybrid rice germplasm. Theor Appl Genet. 2014;127(2):407417.[Crossref], [Web of Science ®], [Google Scholar]]. 
Some studies have used transcriptome analysis to analyse the genes involved in heterosis [120Guo H, Mendrikahy JN, Xie L, et al. Transcriptome analysis of neo-tetraploid rice reveals specific differential gene expressions associated with fertility and heterosis. Sci Rep. 2017;7:40139.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],121Li X, Shahid MQ, Xia J, et al. Analysis of small RNAs revealed differential expressions during pollen and embryo sac development in autotetraploid rice. BMC Genomics. 2017;18(1):129.[Crossref], [PubMed], [Google Scholar]].
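The definition of heterosis as the performance of the F1 relative to the mean of its two parents can be written as a percentage, commonly called mid-parent heterosis: (F1 - MP) / MP x 100, where MP is the mean of the two parental values. The sketch below applies this formula; the function names and the trait values in the example are invented for illustration.

```python
def mid_parent_heterosis(f1: float, parent1: float, parent2: float) -> float:
    """Return mid-parent heterosis as a percentage of the mid-parent value."""
    mid_parent = (parent1 + parent2) / 2
    if mid_parent == 0:
        raise ValueError("mid-parent value is zero; percentage is undefined")
    return (f1 - mid_parent) / mid_parent * 100


def heterosis_direction(f1: float, parent1: float, parent2: float) -> str:
    """Label heterosis as positive, negative or absent."""
    value = mid_parent_heterosis(f1, parent1, parent2)
    if value > 0:
        return "positive"
    if value < 0:
        return "negative"
    return "none"


# Hypothetical grain yield values (arbitrary units) for an F1 and its parents.
print(round(mid_parent_heterosis(12.0, 8.0, 10.0), 2), heterosis_direction(12.0, 8.0, 10.0))
print(round(mid_parent_heterosis(6.0, 8.0, 10.0), 2), heterosis_direction(6.0, 8.0, 10.0))
```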

Identification of haploid/diploid plants and cultivars genotyping

Haploids are plants having the gametophytic chromosome number, i.e. a single set of chromosomes, and diploids are plants with two copies of each homologous chromosome [122Kasha KJ, Maluszynski M. Production of doubled haploids in crop plants. An introduction. In: Maluszynski M, Kasha KJ, Forster BP, Szarejko I, editors. Doubled haploid production in crop plants. Dordrecht: Springer; 2003. p. 14.[Crossref], [Google Scholar]]. Haploid and doubled-haploid (DH) plants are very important because they are used as mapping populations for quantitative trait loci (QTL) mapping [123Khush GS, Virmani SS. Haploids in plant breeding. In: Jain SM, Sopory SK, Veilleux RE, editors. In vitro haploid production in higher plants. Current plant science and biotechnology in agriculture. Dordrecht: Springer; 1996. p. 1133.[Crossref], [Google Scholar]] and in other breeding and genetic studies. DH plants are very important in the integration of physical and genetic maps and thus allow the accurate detection of candidate genes of interest [124Künzel G, Korzun L, Meister A. Cytologically integrated physical restriction fragment length polymorphism maps for the barley genome based on translocation breakpoints. Genetics. 2000;154(1):397412.[PubMed], [Web of Science ®], [Google Scholar],125Belicuas PR, Guimarães CT, Paiva LV, et al. Androgenetic haploids and SSR markers as tools for the development of tropical maize hybrids. Euphytica. 2007;156(1–2):95102.[Crossref], [Web of Science ®], [Google Scholar]]. The R1-nj (Navajo) anthocyanin colour marker has been successfully applied for the identification of haploids [126Melchinger AE, Winter M, Mi X, et al. Controlling misclassification rates in identification of haploid seeds from induction crosses in maize with high-oil inducers. Crop Sci. 2015;55(3):10761086.[Crossref], [Web of Science ®], [Google Scholar]]. Similarly, SSR and SNP markers have been applied to detect DH plants and to genotype isogenic lines and hybrids [127–129Tang F, Tao Y, Zhao T, et al. In vitro production of haploid and doubled haploid plants from pollinated ovaries of maize (Zea mays). Plant Cell Tissue Organ Cult. 2006;84(2):233237.
Shahid MQ, Chen FY, Li HY, et al. Double-neutral genes, and, for pollen fertility in rice to overcome× hybrid sterility. Crop Sci. 2013;53(1):164176.
Wu J, Shahid MQ, Chen L, et al. Polyploidy enhances F1 pollen sterility loci interactions that increase meiosis abnormalities and pollen sterility in autotetraploid rice. Plant Physiol. 2015;169(4):27002717.
].

Genetic diversity assessment

Recent advancements in molecular markers and genome sequencing offer great opportunity to investigate the genetic diversity in a very big germplasm [111Wang Y, Ghouri F, Shahid MQ, et al. The genetic diversity and population structure of wild soybean evaluated by chloroplast and nuclear gene sequences. Biochem Syst Ecol. 2017;71:170178.[Crossref], [Web of Science ®], [Google Scholar],130Nawaz MA, Baloch FS, Rehman HM, et al. Development of a competent and trouble free DNA isolation protocol for downstream genetic analysis in glycine species. Turk J Agri Food Sci Tech. 2016;4(8):700705.[Google Scholar],131Nawaz MA, Sadia B, Awan FS, et al. 2013. Genetic diversity in hyper glucose oxidase producing Aspergillus niger UAF mutants by using molecular markers. Int J Agric Biol. 2013;15(2):362366.[Web of Science ®], [Google Scholar]]. Genetic diversity assessment is very helpful in the study of the evolution of plants and their comparative genomics, helping to understand the structure of different populations [132–134Nawaz MA, Yang SH, Rehman HM, et al. Genetic diversity and population structure of Korean wild soybean (Glycine soja Sieb. and Zucc.) inferred from microsatellite markers. Biochem Syst Ecol. 2017;71:8796.
Nawaz MA, Rehman HM, Baloch FS, et al. Genome and transcriptome-wide analysis of cellulose synthase gene superfamily in soybean. J Plant Physiol. 2017;215:163175.
Liu W, Shahid MQ, Bai L, et al. Evaluation of genetic diversity and development of a core collection of wild rice (Oryza rufipogon Griff.) populations in China. PloS One. 2015;10(12):e0145990.
]. Genetic markers have been successfully applied in the determination of genetic diversity and the classification of genetic material [135–137Wang Y, Shahid MQ, Ghouri F, et al. Evaluation of the geographical pattern of genetic diversity of glycine soja and glycine max based on four single copy nuclear gene loci: for conservation of soybean germplasm. Biochem Syst Ecol. 2015;62:229235.
Tiwari JK, Singh BP, Gopal J, et al. Molecular characterization of the Indian Andigena potato core collection using microsatellite markers. Afr J Biotechnol. 2013;12(10).10251033.
Naeem M, Ghouri F, Shahid MQ, et al. Genetic diversity in mutated and non-mutated rice varieties. Genet Mol Res. 2015;14(4):1710917123.
]. DArT markers and SNPs markers are the most commonly used markers for the determination of genetic diversity in various crops [138Baloch FS, Alsaleh A, Shahid MQ, et al. A whole genome DArTseq and SNP analysis for genetic diversity assessment in durum wheat from Central Fertile Crescent. PloS One. 2017;12(1):e0167821.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Utilization of molecular markers in backcrossing for a gene of interest

Introgression is a technique in which genes of interest are transferred from plant genetic resources (PGR) to crop varieties. In this technique, desired traits are selected from exotic germplasm and transferred into crop plants by backcrossing [139Simmonds NW. Introgression and incorporation. Strategies for the use of crop genetic resources. Biol Rev. 1993;68(4):539562.[Crossref], [Web of Science ®], [Google Scholar]]. MAS has played an important role in the use of wild genes and their transfer into crop plants. Many genes of interest from wild plants have been transferred into nearly all economically important cultivated plants. SSR markers are mainly used for this purpose. Two SSR markers have been successfully used to transfer the Lgc-1 locus, which is related to low glutelin content, in japonica rice with 93–97% selection efficiency [140Wang YH, Liu SJ, Ji SL, et al. Fine mapping and marker-assisted selection (MAS) of a low glutelin content gene in rice. Cell Res. 2005;15(8):622630.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Barley yellow mosaic virus causes an important disease in barley; rym4 and rym5 are genes conferring resistance to this disease, and a variety of markers have been developed for the selection of these genes [141You-Xin Y, Yan-Hong L, Jing-Fei T. Wide-compatibility gene exploited by functional molecular markers and its effect on fertility of intersubspecific rice hybrids. Crop Sci. 2012;52(2):669675.[Crossref], [Web of Science ®], [Google Scholar]].

Genetic mapping

Genetic mapping employs methods for identification of the locus of a gene as well as for determination of the distance between two genes. Gene mapping is considered as the major area of research in which molecular markers are used today. The principle of genetic mapping is chromosomal recombination during meiosis which results in the segregation of genes. Markers present close to the gene of interest on the same chromosome are known as linked markers.

QTL mapping

Most agricultural traits of economic interest are polygenic and quantitative in nature and are controlled by many genes on the same or different chromosomes. The chromosomal regions containing genes for these quantitative traits are referred to as QTL. QTL mapping is a method in which molecular markers are used to locate the genes that affect the traits of interest. Traits are divided into two groups, qualitative and quantitative: qualitative traits show discontinuous variation, while quantitative traits show continuous variation. Molecular markers are an ideal tool for QTL studies and can be used for MAS as well [142Angaji SA. QTL mapping: a few key points. Int J Appl Res Nat Prod. 2009;2(2):13.[Google Scholar]]. The first step in QTL mapping is the selection of two diverse parents carrying allelic variation that affects the trait of interest. After the mapping population has been phenotyped, polymorphic markers are used to obtain the genetic data. The genetic map is then constructed and statistical programs are applied to identify molecular markers linked with the trait of interest [4Collard BC, Jahufer MZ, Brouwer JB, et al. An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005;142(1–2):169196.[Crossref], [Web of Science ®], [Google Scholar]]. The QTL mapping methodology is described in Figure 1.

QTL mapping populations

For QTL mapping, two diverse parents should be selected and they should be diverse enough to exhibit an adequate level of polymorphism. Near-isogenic lines (NILs), DHs, backcrosses (BCs), recombinant inbred lines (RILs) and F2 populations can be used as the mapping population [143Paterson AH. Making genetic maps. In: Paterson AH, editor. Genome mapping in plants. Austin (TX): R.G. Landes Company; 1996. p. 2339.[Google Scholar]]. Practically 50–250 individuals are selected in a mapping population but for high-resolution and fine mapping, a larger size of the mapping population is required [144Mohan M, Nair S, Bhagwat A, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol Breed. 1997;3(2):87103.[Crossref], [Web of Science ®], [Google Scholar],145Dhingani RM, Umrania VV, Tomar RS, et al. Introduction to QTL mapping in plants. Ann Plant Sci. 2015;4(04):10721079.[Google Scholar]].

Selection of markers for QTL mapping

Different types of markers like RFLP, AFLP, ISSR, SSR, ESTs, DArT and SNPs have been commonly used for the construction of linkage maps in several plants [11Semagn K, Bjørnstad Å, Ndjiondjop MN. An overview of molecular marker methods for plants. Afr J Biotechnol. 2006;(2540):2568.[Google Scholar]]. Normally, for genetic mapping studies, 100–200 markers have been used for linkage maps construction [144Mohan M, Nair S, Bhagwat A, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol Breed. 1997;3(2):87103.[Crossref], [Web of Science ®], [Google Scholar]]. The marker number, however, varies according to the studies and directly depends on the species genome size, as larger genome sized species require a larger number of markers. However, with the advent of NGS, several thousands of DNA markers are now utilized for high-resolution genetic mapping [145Dhingani RM, Umrania VV, Tomar RS, et al. Introduction to QTL mapping in plants. Ann Plant Sci. 2015;4(04):10721079.[Google Scholar],146Bernardo A, Wang S, Amand PS, et al. Using next generation sequencing for multiplexed trait-linked markers in wheat. PloS One. 2015;10(12):e0143890.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Genetic/linkage map construction

The linkage map is a road map that describes the position and relative genetic distance between markers [143Paterson AH. Making genetic maps. In: Paterson AH, editor. Genome mapping in plants. Austin (TX): R.G. Landes Company; 1996. p. 2339.[Google Scholar]]. QTL mapping is based on marker segregation via chromosome recombination during meiosis: markers that are tightly linked are inherited together more often than markers that lie far apart. The frequency of recombinant progeny is used to estimate the recombination fraction between markers. Through segregation analysis, the genetic distance and relative order of markers can be calculated [4Collard BC, Jahufer MZ, Brouwer JB, et al. An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005;142(1–2):169196.[Crossref], [Web of Science ®], [Google Scholar]]. Odds ratios (the ratio of the likelihood of linkage to the likelihood of no linkage) are used to test for linkage between markers. The logarithm of this ratio is called the LOD (logarithm of odds) score [147Risch N. Genetic linkage: interpreting LOD scores. Science. 1992;255(5046):803805.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. For the construction of linkage maps, LOD values of >3 are considered ideal [4Collard BC, Jahufer MZ, Brouwer JB, et al. An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005;142(1–2):169196.[Crossref], [Web of Science ®], [Google Scholar]].
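The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the procedure of any particular mapping program: it assumes phase-known backcross data in which every plant can be scored as recombinant or parental for a marker pair, and the function names are hypothetical. The text does not name a mapping function, so the Haldane function used here to convert a recombination fraction into centimorgans is an assumption.

```python
import math


def recombination_fraction(recombinants, total):
    """Share of progeny that carry a recombinant marker combination."""
    if total <= 0 or not 0 <= recombinants <= total:
        raise ValueError("need total > 0 and 0 <= recombinants <= total")
    return recombinants / total


def lod_score(recombinants, total):
    """log10 of the likelihood under linkage over the likelihood at theta = 0.5."""
    # The linkage estimate is capped at 0.5, the value expected without linkage.
    theta = min(recombination_fraction(recombinants, total), 0.5)
    parentals = total - recombinants
    log_linked = 0.0
    if recombinants:
        log_linked += recombinants * math.log10(theta)
    if parentals:
        log_linked += parentals * math.log10(1.0 - theta)
    log_unlinked = total * math.log10(0.5)
    return log_linked - log_unlinked


def haldane_cm(theta):
    """Map distance in cM under the Haldane function (an assumed choice)."""
    if not 0 <= theta < 0.5:
        raise ValueError("theta must lie in [0, 0.5)")
    return -50.0 * math.log(1.0 - 2.0 * theta)


if __name__ == "__main__":
    theta = recombination_fraction(10, 100)
    print(f"theta = {theta:.2f}, LOD = {lod_score(10, 100):.2f}, "
          f"distance = {haldane_cm(theta):.1f} cM")
```

With 10 recombinants among 100 plants, the LOD score is far above the threshold of 3 mentioned above, so the two markers would be treated as linked; with 50 recombinants the score drops to zero.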

QTL detection

The most important methods developed for QTL detection are single-marker analysis (reviewed in [148Tanksley SD. Mapping polygenes. Annu Rev Genet. 1993;27(1):205233.[Crossref], [PubMed], [Google Scholar]]), simple interval analysis (reviewed in [149Lander ES, Botstein D. Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics. 1989;121(1):185199.[PubMed], [Web of Science ®], [Google Scholar]]) and composite interval analysis [150Silva LD, Wang S, Zeng ZB. Composite interval mapping and multiple interval mapping: procedures and guidelines for using Windows QTL cartographer. Methods Mol Biol. 2012;871:75119.[Crossref], [PubMed], [Google Scholar]]. Some of the most important statistical programs commonly used in QTL mapping are R/qtl [151Broman KW, Wu H, Sen Ś, et al. R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003;19(7):889890.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], QTLNetwork [152Yang J, Hu C, Hu H, et al. QTLNetwork: mapping and visualizing genetic architecture of complex traits in experimental populations. Bioinformatics. 2008;24(5):721723.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], PLABQTL [153Utz HF, Melchinger AE. PLABQTL: a program for composite interval mapping of QTL. J Quant Trait Loci. 1996;2(1):15.[Google Scholar]], QGENE [154Nelson JC. QGENE: software for marker-based genomic analysis and breeding. Mol Breed. 1997;3(3):239245.[Crossref], [Web of Science ®], [Google Scholar]] and MapChart [155Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002;93(1):7778.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
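Of the three methods, single-marker analysis is the simplest to illustrate: plants are grouped by their genotype at one marker and the phenotypic means of the groups are compared. The sketch below is only an illustration of that idea and does not reproduce any of the programs listed above. It assumes a population with exactly two genotype classes per marker (for example a backcross or DH population), uses a Welch t statistic as the test, and the function name and data are hypothetical.

```python
import math
from statistics import mean, variance


def single_marker_test(phenotypes, genotypes):
    """Return (mean difference, Welch t statistic) between two genotype classes."""
    groups = {}
    for value, genotype in zip(phenotypes, genotypes):
        groups.setdefault(genotype, []).append(value)
    if len(groups) != 2:
        raise ValueError("expected exactly two genotype classes at the marker")
    # Sort by genotype label so the sign of the difference is reproducible.
    (_, first), (_, second) = sorted(groups.items())
    if len(first) < 2 or len(second) < 2:
        raise ValueError("each genotype class needs at least two plants")
    std_error = math.sqrt(variance(first) / len(first)
                          + variance(second) / len(second))
    if std_error == 0:
        raise ValueError("no phenotypic variation within the genotype classes")
    difference = mean(first) - mean(second)
    return difference, difference / std_error


if __name__ == "__main__":
    # Hypothetical trait values for eight plants scored at one marker.
    trait = [10, 12, 11, 13, 15, 17, 16, 18]
    calls = ["A", "A", "A", "A", "B", "B", "B", "B"]
    print(single_marker_test(trait, calls))
```

A large absolute t value at a marker suggests that a QTL is linked to it. Unlike interval methods, this test uses no map information, so it cannot separate a closely linked QTL of small effect from a distant QTL of large effect.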

Factors affecting the QTL detection

The detection of QTLs in a segregating population is affected by several factors, chiefly the genetic properties of the QTL, environmental factors, experimental errors in phenotyping and the size of the population [146Bernardo A, Wang S, Amand PS, et al. Using next generation sequencing for multiplexed trait-linked markers in wheat. PloS One. 2015;10(12):e0143890.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The environment directly affects the expression of quantitative traits; conducting experiments at the same sites over several seasons helps to detect the effects of the environment on the QTL influencing the traits of interest [156George ML, Prasanna BM, Rathore RS, et al. Identification of QTLs conferring resistance to downy mildews of maize in Asia. Theor Appl Genet. 2003;107(3):544551.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The population size directly influences QTL mapping studies: a larger population results in more precise mapping and also facilitates the detection of QTLs with less pronounced effects [148Tanksley SD. Mapping polygenes. Annu Rev Genet. 1993;27(1):205233.[Crossref], [PubMed], [Google Scholar],157Haley CS, Andersson LE. Linkage mapping of quantitative trait loci in plants and animals. In: Dear PH, editor. Genome mapping: a practical approach. Oxford: IRL Press; 1997. p. 4971.[Google Scholar]]. Experimental errors include those arising from imprecise phenotyping and genotyping. Inaccurate phenotypic data and errors in genotypic data influence the distance between markers [158Hackett CA. Statistical methods for QTL mapping in cereals. Plant Mol Biol. 2002;48(5–6):585599.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Several QTLs have been described in the literature for different traits of interest, but most of these QTLs are false positives caused by phenotyping errors at multiple locations and by the many factors involved in field experiments.

QTL validation

After the QTL detection, it is necessary to validate that particular QTL. For this purpose, diverse populations will be developed by crossing different parents in order to check the presence of a particular QTL in other populations with different genetic background. NILs are commonly used for the confirmation and validation of QTL [4Collard BC, Jahufer MZ, Brouwer JB, et al. An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005;142(1–2):169196.[Crossref], [Web of Science ®], [Google Scholar]]. NILs have been used to precisely evaluate the effect of different pollen sterility loci in rice [129Wu J, Shahid MQ, Chen L, et al. Polyploidy enhances F1 pollen sterility loci interactions that increase meiosis abnormalities and pollen sterility in autotetraploid rice. Plant Physiol. 2015;169(4):27002717.[PubMed], [Web of Science ®], [Google Scholar]]. Confirmation of QTL provides the information about the marker to be used or not for MAS [159Ogbonnaya FC, Subrahmanyam NC, Moullet O, et al. Diagnostic DNA markers for cereal cyst nematode resistance in bread wheat. Crop Pasture Sci. 2001;52(12):13671374.[Crossref], [Google Scholar]].

QTL cloning

Large numbers of QTLs have been isolated in plants, mostly by positional cloning, which is also known as map-based cloning. Map-based cloning is mainly applied to detect the genetic basis of a mutant phenotype [160Lukowitz W, Gillmor CS, Scheible WR. Positional cloning in Arabidopsis. Why it feels good to have a genome initiative working for you. Plant Physiol. 2000;123(3):795806.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. It allows a QTL to be assigned to a very small genetic interval and the corresponding distance on the DNA sequence to be determined: mapping populations are developed, and molecular markers are then used to define the shortest genetic interval and to relate it to the DNA sequence. Advances in sequencing technologies help in the detection of accurate and tightly linked QTLs; these techniques provide precise results and save time compared with map-based cloning. However, these high-throughput techniques require a high initial cost. The first step in positional cloning is the development of NILs, an F2 or a backcross population. These populations are phenotyped and screened with different molecular markers; newly developed methods, i.e. NGS and the MassARRAY System, can help to detect more SNPs faster [81Davey JW, Hohenlohe PA, Etter PD, et al. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12(7):499510.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Fine mapping is then performed to construct a genetic map with polymorphic markers and to locate a very small genetic interval. Larger numbers of individuals are used in fine mapping to increase the number of recombination events captured, which can narrow the interval down to 0.16 cM. Nearly 3000–4000 plants should be used as the mapping population, with 600 plants as the first-pass mapping population, in order to capture a higher number of recombination events [160Lukowitz W, Gillmor CS, Scheible WR. Positional cloning in Arabidopsis. Why it feels good to have a genome initiative working for you. Plant Physiol. 2000;123(3):795806.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Markers that are tightly linked to the target should be selected. A physical map is constructed once the genetic resolution reaches the 0.1-cM level. The genetic map is anchored to the physical map using markers close to the QTL [160Lukowitz W, Gillmor CS, Scheible WR. Positional cloning in Arabidopsis. Why it feels good to have a genome initiative working for you. Plant Physiol. 2000;123(3):795806.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],161Salvi S, Tuberosa R. To clone or not to clone plant QTLs: present and future challenges. Trends Plant Sci. 2005;10(6):297304.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. A candidate gene is selected and sequenced to design specific primers for PCR amplification of the candidate gene [162Gallavotti A, Whipple CJ. Positional cloning in maize (Zea mays subsp. mays, Poaceae). Appl Plant Sci. 2015;3(1):1400092.[Crossref], [Web of Science ®], [Google Scholar]].

Chromosome walking

The interval between a QTL and a marker can be decreased by chromosome walking/genome walking. Chromosome walking is a method of positional cloning mainly performed for the identification, isolation and cloning of a particular allele. It is a very efficient technique for identifying unknown regions flanking a known DNA sequence. During the chromosome walking procedure, large insert libraries are first developed and positive clones are then identified through a series of cloning steps, so that each step walks towards the gene of interest. Different types of chromosome walking techniques have been developed, including inverse PCR [163Ochman H, Gerber AS, Hartl DL. Genetic applications of an inverse polymerase chain reaction. Genetics. 1988;120(3):621623.[PubMed], [Web of Science ®], [Google Scholar]] and vectorette PCR [164Arnold C, Hodgson IJ. Vectorette PCR: a novel approach to genomic walking. Genome Res. 1991;1(1):3942.[Crossref], [Google Scholar]]. In these techniques, restriction enzymes are used to digest genomic DNA and the digested DNA is then ligated. PCR is then performed to amplify the flanking regions, with the ligated product used as the template. Genome walking kits for the detection and isolation of promoter elements are now available on the market [165Rishi AS, Nelson ND, Goyal A. Genome walking of large fragments: an improved method. J Biotechnol. 2004;111(1):915.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The main disadvantage of map-based/positional cloning is that it is a very time-consuming and laborious technique.

Advantages and drawbacks of QTL mapping

QTL mapping is used to detect the genes which control the trait of interest [144Mohan M, Nair S, Bhagwat A, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol Breed. 1997;3(2):87103.[Crossref], [Web of Science ®], [Google Scholar]]. It is very useful for genome-wide scans to detect QTLs in plants. Diseases are a major concern in agriculture, and genes conferring resistance to them can be detected by QTL mapping [166Young ND, Kumar L, Menancio-Hautea D, et al. RFLP mapping of a major bruchid resistance gene in mungbean (Vigna radiata, L. Wilczek). Theor Appl Genet. 1992;84(7-8):839844.[PubMed], [Web of Science ®], [Google Scholar]]. Important drawbacks of QTL mapping include low allelic diversity, a low number of recombination events [167Price AH. Believe it or not, QTLs are accurate! Trends Plant Sci. 2006;11(5):213216.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], the time needed to develop mapping populations [168Neale DB, Savolainen O. Association genetics of complex traits in conifers. Trends Plant Sci. 2004;9(7):325330.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] and the specificity of the detected QTLs to a given population [169Lübberstedt T, Zein I, Andersen JR, et al. Development and application of functional markers in maize. Euphytica. 2005;146(1):101108.[Crossref], [Web of Science ®], [Google Scholar]].

Association mapping (AM)

Association mapping (AM) identifies significant associations between molecular markers and a phenotypic trait. Statistically, AM is the covariance between the polymorphism present in the marker and the trait of interest [170Jannink JL, Walsh B. Association mapping in plant populations. In: Kang MS, editor. Quantitative genetics, genomics and plant breeding. Oxford: CAB International; 2002. p. 5968.[Google Scholar],171Zhang P, Zhong K, Shahid MQ, et al. Association analysis in rice: from application to utilization. Front Plant Sci. 2016;7:1202.[PubMed], [Web of Science ®], [Google Scholar]]. It saves time compared with linkage mapping and provides greater mapping resolution owing to a higher number of recombination events. AM facilitates the identification of a greater number of alleles because more genetic variation from a broader genetic background is available; historically measured phenotypic data can also be used for AM [172Zhang P, Zhong K, Tong H, et al. Association mapping for aluminum tolerance in a core collection of rice landraces. Front Plant Sci. 2016:7:1415.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],173Kraakman AT, Martinez F, Mussiraliev B, et al. Linkage disequilibrium mapping of morphological, resistance, and other agronomically relevant traits in modern spring barley cultivars. Mol Breed. 2006;17(1):4158.[Crossref], [Web of Science ®], [Google Scholar]].

Why association mapping?

Linkage mapping, known as bi-parental mapping, is a classical mapping technique used to study the linkage in several plant species over 20 years [174Holland JB. Genetic architecture of complex traits in plants. Curr Opin Plant Biol. 2007;10(2):156161.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The major limitations of QTL mapping are described above and these limitations could be overcome by the introduction of linkage disequilibrium (LD)-based AM [175Gupta PK, Rustgi S, Kulwal PL. Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Mol Biol. 2005;57(4):461485.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],176Goldstein DB, Weale ME. Population genomics: linkage disequilibrium holds the key. Curr Biol. 2001;11(14):576579.[Crossref], [Web of Science ®], [Google Scholar]].

Linkage disequilibrium (LD)

Non-random association of alleles at different loci is known as LD. LD describes the increased or decreased (non-equal) frequency of haplotypes in a population. LD can be described as $P_{AB} \neq P_A \times P_B$ [177Mackay I, Powell W. Methods for linkage disequilibrium mapping in crops. Trends Plant Sci. 2007;12(2):5763.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], where A and B are two alleles present at different loci; $P_{AB}$ is the frequency of haplotypes carrying both alleles at the two loci; and $P_A$ and $P_B$ are the frequencies of haplotypes carrying the A or the B allele, respectively. LD is also known as gametic phase disequilibrium or gametic disequilibrium [166Young ND, Kumar L, Menancio-Hautea D, et al. RFLP mapping of a major bruchid resistance gene in mungbean (Vignaradiata, L. Wilczek). Theor Appl Genet. 1992;84(7-8):839844.[PubMed], [Web of Science ®], [Google Scholar]]. LD was first defined by Jennings in 1917 and quantified by Lewontin in 1964 (reviewed in [178Abdurakhmonov IY, Abdukarimov A. Application of association mapping to understanding the genetic diversity of plant germplasm resources. Int J Plant Genomics. 2008;2008:574927.[Crossref], [PubMed], [Google Scholar]]). It is necessary to know the LD patterns of the genomic regions of the target organism, and there should also be prior knowledge of how the extent of LD differs between populations. The square of the correlation coefficient ($r^2$) and the standardized disequilibrium coefficient ($D'$) are two widely used statistics for measuring LD. GOLD [179Abecasis GR, Cookson WO. Gold – graphical overview of linkage disequilibrium. Bioinformatics. 2000;16(2):182183.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], R and TASSEL [180Bradbury PJ, Zhang Z, Kroon DE, et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):26332635.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] are the most commonly used software applications to describe the structure and pattern of LD.
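The LD statistics named above follow directly from the haplotype and allele frequencies. The short Python sketch below computes the disequilibrium coefficient D (the difference between the observed haplotype frequency and the product of the allele frequencies), the standardized coefficient D′ and r² for a pair of biallelic loci. It is an illustration only: the function name is hypothetical, D′ is reported as an absolute value (one common convention, assumed here), and the frequencies are taken as known rather than estimated from genotype data as GOLD or TASSEL would do.

```python
def ld_statistics(p_ab, p_a, p_b):
    """Return D, |D'| and r^2 for two biallelic loci.

    p_ab: frequency of haplotypes carrying both allele A and allele B
    p_a, p_b: frequencies of allele A and allele B
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("both loci must be polymorphic (0 < p < 1)")
    d = p_ab - p_a * p_b
    # D is bounded by the allele frequencies; the bound depends on its sign.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max
    r_squared = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return {"D": d, "D_prime": d_prime, "r2": r_squared}


if __name__ == "__main__":
    # Hypothetical frequencies: the AB haplotype is more common than expected.
    print(ld_statistics(p_ab=0.4, p_a=0.5, p_b=0.5))
```

When the observed haplotype frequency equals the product of the allele frequencies, all three statistics are zero, which corresponds to linkage equilibrium.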

Factors affecting LD

Genetic and demographic factors are responsible for generating haplotypic blocks [177Mackay I, Powell W. Methods for linkage disequilibrium mapping in crops. Trends Plant Sci. 2007;12(2):5763.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Recombination and mutation are the main factors that shape LD. New mutations, autogamy, epistasis, genetic isolation, population size, selection, kinship and genomic rearrangements are responsible for an increase in LD. LD decreases with higher rates of mutation, recombination and gene conversion, and with recurrent mutations [177Mackay I, Powell W. Methods for linkage disequilibrium mapping in crops. Trends Plant Sci. 2007;12(2):5763.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

General methodology of association mapping

AM involves the selection of individuals from a natural population with a wide range of genetic diversity. Complete and precise phenotyping is performed for the traits of interest, preferably in different locations and environments over several years. After genotyping with suitable markers, the population structure and kinship are determined. Statistics such as $D$, $D'$ or $r^2$ are calculated to quantify LD [181Sehgal D, Singh R, Rajpal VR. Quantitative trait loci mapping in plants: concepts and approaches. In: Rajpal V, Rao S, Raina S, editors. Vol. 2, Molecular breeding for sustainable crop improvement. Cham: Springer International; 2016. p. 3159. (Sustainable development and biodiversity; Vol. 11).[Crossref], [Google Scholar]]. Finally, the phenotypic and genotypic data are associated using statistical software. TASSEL is the most widely used software for AM [180Bradbury PJ, Zhang Z, Kroon DE, et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007;23(19):26332635.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The methodology of AM is shown in Figure 2.
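The final step, associating phenotypic with genotypic data, can be illustrated with a deliberately simplified sketch. The Python code below regresses the trait on allele dosage (0, 1 or 2 copies of one allele) at each marker and ranks the markers by the share of phenotypic variance they explain. This is not how TASSEL works internally: the population structure and kinship corrections mentioned above are left out, and the function names, marker names and data are hypothetical.

```python
from statistics import mean


def marker_trait_regression(dosages, phenotypes):
    """Return (slope, R^2) of a simple regression of the trait on allele dosage."""
    if len(dosages) != len(phenotypes) or len(dosages) < 3:
        raise ValueError("need matching dosage and phenotype lists of length >= 3")
    x_bar, y_bar = mean(dosages), mean(phenotypes)
    sxx = sum((x - x_bar) ** 2 for x in dosages)
    syy = sum((y - y_bar) ** 2 for y in phenotypes)
    if sxx == 0 or syy == 0:
        raise ValueError("marker or trait shows no variation")
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(dosages, phenotypes))
    return sxy / sxx, sxy * sxy / (sxx * syy)


def rank_markers(marker_dosages, phenotypes):
    """Sort markers by the phenotypic variance explained, highest first."""
    results = []
    for name, dosages in marker_dosages.items():
        slope, r_squared = marker_trait_regression(dosages, phenotypes)
        results.append((name, slope, r_squared))
    return sorted(results, key=lambda item: item[2], reverse=True)


if __name__ == "__main__":
    trait = [10, 11, 12, 13, 14, 15]
    markers = {"m1": [0, 0, 1, 1, 2, 2], "m2": [0, 2, 1, 1, 2, 0]}
    for name, slope, r_squared in rank_markers(markers, trait):
        print(name, round(slope, 2), round(r_squared, 3))
```

In a structured population, a naive scan of this kind tends to produce spurious associations, which is why structure and kinship are estimated before the association step.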

Types of association mapping

Generally, AM is divided into two categories: (i) candidate-gene-based AM and (ii) genome-wide association study.

Candidate-gene-based association mapping

Candidate-gene-based AM is a very useful technique in which the correlation between a trait of interest and the DNA polymorphism present in a gene is studied. Candidate genes are generally genes with known biological functions that have a direct or indirect effect on the trait of interest [181Sehgal D, Singh R, Rajpal VR. Quantitative trait loci mapping in plants: concepts and approaches. In: Rajpal V, Rao S, Raina S, editors. Vol. 2, Molecular breeding for sustainable crop improvement. Cham: Springer International; 2016. p. 3159. (Sustainable development and biodiversity; Vol. 11).[Crossref], [Google Scholar],182Tabor HK, Risch NJ, Myers RM. Candidate-gene approaches for studying complex genetic traits: practical considerations. Nat Rev Genet. 2002;3(5):391397.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Biologically relevant candidates are selected through dissection of the trait and are ordered according to the evolutionary data obtained from physiological, chemical and genetic studies [183Mackay TF. The genetic architecture of quantitative traits. Annu Rev Genet. 2001;35(1):303339.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. This technique requires the detection of SNPs between lines and within specific genes. The simplest way to investigate a candidate gene is to re-sequence amplicons. Exons, promoters, introns and the 5'/3' untranslated regions should all be considered when investigating candidate gene SNPs. The number of SNPs per unit length required to detect a significant association depends on the rate of LD decay at the specific candidate gene locus [184Whitt SR, Buckler ES. Using natural allelic diversity to evaluate gene function. Plant Func Genomics. 2003;236:123139.[Crossref], [Google Scholar]]. The candidate-gene technique has been successfully used for the characterization and cloning of QTLs for many years. It has also been successfully used for the development of many tightly linked genes into functional markers (FMs) [185Lau WC, Rafii MY, Ismail MR, et al. Review of functional markers for improving cooking, eating, and the nutritional qualities of rice. Front Plant Sci. 2015;6:832.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Genome-wide association study (GWAS)

Recent advances in sequencing and genotyping have made GWAS possible in various species. This is a powerful technique mainly used to study the genetics of natural variation and traits of interest. Several organizations have now developed commercial GWAS platforms. Normally, inbred lines are used for GWAS; once these lines have been genotyped, they can be phenotyped multiple times [186Atwell S, Huang YS, Vilhjálmsson BJ, et al. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010;465(7298):627631.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. For the detection of QTLs, a large population (up to tens of thousands of individuals) is used to obtain high resolution. Millions of SNPs are genotyped in GWAS, and the number of SNPs keeps increasing as the technology advances [187Zargar SM, Raatz B, Sonah H, et al. Recent advances in molecular marker techniques: insight into QTL mapping, GWAS and genomic selection in plants. JCSB. 2015;18(5):293308.[Google Scholar]]. This technique offers greater resolution and the ability to investigate small haplotype blocks that are significantly correlated with quantitative trait variation [188Hindorff LA, Sethupathy P, Junkins HA, et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci. 2009;106(23):93629367.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], and it is a very cost-effective, high-throughput method [186Atwell S, Huang YS, Vilhjálmsson BJ, et al. Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010;465(7298):627631.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. GWAS has been performed in nearly all economically important crops, like maize, sorghum, millet and rice [189Jia P, Zhao Z. Network-assisted analysis to prioritize GWAS results: principles, methods and perspectives. Hum Genet. 2014;133(2):125138.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Nested association mapping (NAM)

This technique was first proposed by Yu et al. [190Yu J, Holland JB, McMullen MD, et al. Genetic design and statistical power of nested association mapping in maize. Genetics. 2008;178(1):539551.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. In nested association mapping (NAM), several designed mapping families are connected with each other. NAM combines the effectiveness of both association and linkage mapping and has been applied successfully in the determination of FMs in various plants. The important steps in NAM include phenotyping for the traits of interest, followed by complete sequencing or dense genotyping of the diverse founders/parents. Markers are then applied to both parents and progenies in order to trace the transfer of high-density marker information from parents to progenies. Finally, genome-wide analysis is performed to associate the phenotypic data with the genotypic data [190Yu J, Holland JB, McMullen MD, et al. Genetic design and statistical power of nested association mapping in maize. Genetics. 2008;178(1):539551.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. This technique can be used effectively in the identification of FMs [191McMullen MD, Kresovich S, Villeda HS, et al. Genetic properties of the maize nested association mapping population. Science. 2009;325(5941):737740.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Multi-parent advanced generation inter-cross (MAGIC)

This technique provides higher rate of recombination and enhanced mapping resolution as compared to bi-parental mapping by interrogating several alleles. The main idea behind the development of the MAGIC populations is to enhance the intercrossing level and to increase the genome shuffling [192Cavanagh C, Morell M, Mackay I, et al. From mutations to MAGIC: resources for gene discovery, validation and delivery in crop plants. Curr Opin Plant Biol. 2008;11(2):215221.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Advanced inter-crossed lines are used as populations for MAGIC and are developed through performing random and subsequent inter-crosses in a population, which is developed when two inbred lines are crossed [193Darvasi A, Soller M. Advanced intercross lines, an experimental population for fine genetic mapping. Genetics. 1995;141(3):11991207.[PubMed], [Web of Science ®], [Google Scholar]]. MAGIC populations are very beneficial in different breeding programmes and can be used as permanent mapping populations in the determination of more accurate QTLs as well as directly or indirectly in the development of a variety [194Bandillo N, Raghavan C, Muyco PA, et al. Multi-parent advanced generation inter-cross (MAGIC) populations in rice: progress and potential for genetics research and breeding. Rice. 2013;6(1):11.[Crossref], [PubMed], [Google Scholar]].

Marker-assisted selection (MAS)

MAS is a technique in which phenotypic selection is made on the basis of the genotype of a marker [4Collard BC, Jahufer MZ, Brouwer JB, et al. An introduction to markers, quantitative trait loci (QTL) mapping and marker-assisted selection for crop improvement: the basic concepts. Euphytica. 2005;142(1–2):169196.[Crossref], [Web of Science ®], [Google Scholar]]. MAS is a molecular breeding technique that helps to avoid the difficulties concerned with conventional plant breeding. It has totally changed the standard of selection [144Mohan M, Nair S, Bhagwat A, et al. Genome mapping, molecular markers and marker-assisted selection in crop plants. Mol Breed. 1997;3(2):87103.[Crossref], [Web of Science ®], [Google Scholar],182Tabor HK, Risch NJ, Myers RM. Candidate-gene approaches for studying complex genetic traits: practical considerations. Nat Rev Genet. 2002;3(5):391397.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Plant breeders mostly use MAS for the identification of suitable dominant or recessive alleles across a generation and for the identification of the most favourable individuals across the segregating progeny [195Francia E, Tacconi G, Crosatti C, et al. Marker assisted selection in crop plants. Plant Cell Tiss Org. 2005;82(3):317342.[Crossref], [Web of Science ®], [Google Scholar]]. Some important steps involved in MAS are described in Figure 3.


Important MAS schemes

Important schemes used for MAS are:
(1) marker-assisted backcrossing;
(2) gene pyramiding;
(3) marker-assisted recurrent selection;
(4) genomic selection.

Marker-assisted backcrossing (MABC)

Backcrossing is a long-established technique whose efficiency improved once molecular markers were introduced. MABC is a backcrossing technique in which molecular markers are used [196Holland JB. Implementation of molecular markers for quantitative traits in breeding programs—challenges and opportunities. In: New directions for a diverse planet; Proceedings of the 4th International Crop Science Congress; 2004 Sep 26–Oct 1; Brisbane, Australia. Gosford: Regional Institute; 2004. Available from: www.cropscience.org.au/icsc2004[Google Scholar]]. MABC involves three levels. The first level is known as ‘foreground selection’: markers are used in combination with, or as a substitute for, phenotypic screening for the target gene or QTL [197Charcosset A. Marker-assisted introgression of quantitative trait loci. Genetics. 1997; 147(3):14691485.[PubMed], [Web of Science ®], [Google Scholar]]. The second level is known as ‘recombinant selection’. At this level, backcross progeny that carry the target gene or QTL and show recombination between the target locus and the linked flanking markers are selected. Recombinant selection reduces the size of the donor chromosome segment [198Hospital F. Selection in backcross programmes. Philos Trans Biol Sci. 2005:15031511.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The third level of MABC is known as ‘background selection’. At this level, backcross progeny carrying the largest proportion of the recurrent parent genome are selected using markers that are unlinked to the target locus [199Collard BC, Mackill DJ. Marker-assisted selection: an approach for precision plant breeding in the twenty-first century. Philos Trans R Soc B. 2008;363(1491):557572.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
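
The three levels of MABC can be illustrated with a minimal Python sketch. Everything in it is a hypothetical illustration rather than a published protocol: the plant names, the genotype coding ("A" for homozygous recurrent parent, "H" for heterozygous, i.e. carrying a donor allele) and the function names are assumptions made for the example.

```python
from dataclasses import dataclass

@dataclass
class Plant:
    name: str
    target: str       # genotype at the marker tagging the target gene/QTL
    flank_left: str   # genotype at the left flanking marker
    flank_right: str  # genotype at the right flanking marker
    background: list  # genotypes at markers unlinked to the target locus

def foreground(plants):
    """Level 1: keep plants that carry the donor allele at the target locus."""
    return [p for p in plants if p.target == "H"]

def recombinant(plants):
    """Level 2: keep plants with a recurrent-parent genotype at one or both
    flanking markers, i.e. a recombination between target and flank."""
    return [p for p in plants if "A" in (p.flank_left, p.flank_right)]

def recurrent_fraction(plant):
    """Share of unlinked background markers that are recurrent-parent type."""
    return plant.background.count("A") / len(plant.background)

def background_rank(plants):
    """Level 3: rank plants by recovery of the recurrent parent genome."""
    return sorted(plants, key=recurrent_fraction, reverse=True)

def mabc_select(plants, keep=1):
    """Apply the three levels in order and return the best `keep` plants."""
    return background_rank(recombinant(foreground(plants)))[:keep]

# Hypothetical BC1 progeny
progeny = [
    Plant("BC1-01", "H", "H", "H", list("AHAHAH")),  # target present, no recombination
    Plant("BC1-02", "A", "A", "A", list("AAAAAA")),  # target absent
    Plant("BC1-03", "H", "A", "H", list("AAHAAA")),  # recombinant on the left side
    Plant("BC1-04", "H", "H", "A", list("AHHAAH")),  # recombinant on the right side
]

selected = mabc_select(progeny, keep=1)
print([p.name for p in selected])
```

In this toy data set, only BC1-03 and BC1-04 pass both foreground and recombinant selection, and BC1-03 ranks first because five of its six background markers are of the recurrent-parent type.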

Marker-assisted recurrent selection (MARS)

This is a very useful technique in which molecular markers are applied at each generation in order to target all traits of interest; it was proposed in the 1990s [200Bernardo R, Charcosset A. Usefulness of gene information in marker-assisted recurrent selection: a simulation appraisal. Crop Sci. 2006;46(2):614621.[Crossref], [Web of Science ®], [Google Scholar]]. In this technique, selected individuals are crossed at every crossing and selection cycle. MARS is mainly concerned with the improvement of an F2 population, achieved through one cycle of MAS (based on phenotypic data together with marker scores) followed by 2–3 cycles of marker-based selection (based on marker scores only). It is a simple technique that can be applied without any prior knowledge of QTLs, and selection depends entirely on the marker–trait associations established during the MARS programme [201Eathington SR, Crosbie TM, Edwards MD, et al. Molecular markers in a commercial breeding program. Crop Sci. 2007;47(Suppl 3):S154S163.[Google Scholar]].

Marker-assisted gene pyramiding

This is a technique in which multiple QTLs/genes for a single or multiple traits are transferred into a cultivar which is deficient for these traits. This technique is mainly applied to increase the level of resistance to particular diseases and insects through the selection of two or more genes simultaneously [202Luo Y, Sangha JS, Wang S, et al. Marker-assisted breeding of Xa4, Xa21 and Xa27 in the restorer lines of hybrid rice for broad-spectrum and enhanced disease resistance to bacterial blight. Mol Breed. 2012;30(4):16011610.[Crossref], [Web of Science ®], [Google Scholar]]. MAS has been successfully applied to pyramid many desired genes in various crops [203Gupta PK, Langridge P, Mir RR. Marker-assisted wheat breeding: present status and future possibilities. Mol Breed. 2010;26(2):145161.[Crossref], [Web of Science ®], [Google Scholar],204Ye G, Smith KF. Marker-assisted gene pyramiding for inbred line development: basic principles and practical guidelines. Int J Plant Breed. 2008;2(1):110.[Google Scholar]].
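
A short calculation shows why markers matter when several genes are pyramided. The sketch below rests on simplifying assumptions that are not taken from the text: the loci are unlinked, the population is an F2 derived from an F1 that is heterozygous at every target locus, and the desired plant is homozygous for the favourable allele at each locus (expected frequency 1/4 per locus). The function names are hypothetical.

```python
import math

def target_frequency(n_genes, per_locus=0.25):
    """Expected frequency of plants with the desired genotype at all loci."""
    return per_locus ** n_genes

def min_population_size(n_genes, confidence=0.95, per_locus=0.25):
    """Smallest population in which at least one target plant is expected
    with the given probability: N >= ln(1 - p) / ln(1 - f)."""
    f = target_frequency(n_genes, per_locus)
    return math.ceil(math.log(1 - confidence) / math.log(1 - f))

for n in (1, 2, 3):
    print(n, target_frequency(n), min_population_size(n))
```

Under these assumptions, the required population grows from 11 plants for one gene to 47 for two and 191 for three, which is why marker-based screening of large segregating populations is attractive for pyramiding.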

Functional/diagnostic markers (FMs)

FMs are also known as perfect markers or diagnostic markers. For traits that have FMs, diagnostic/functional molecular markers provide a unique opportunity to screen large germplasm collections for allelic diversity in a short time and with high accuracy. FMs are developed from polymorphic regions within the genome that cause variation in phenotypic traits [133Nawaz MA, Rehman HM, Baloch FS, et al. Genome and transcriptome-wide analysis of cellulose synthase gene superfamily in soybean. J Plant Physiol. 2017;215:163175.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],147Risch N. Genetic linkage: interpreting LOD scores. Science. 1992;255(5046):803805.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],205Andersen JR, Lübberstedt T. Functional markers in plants. Trends Plant Sci. 2003;8(11):554560.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],206Xu Y, McCouch SR, Zhang Q. How can we use genomics to improve cereals with rice as a reference genome? Plant Mol Biol. 2005;59(1):726.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Important advantages of FMs are that any population can be studied with them [168Neale DB, Savolainen O. Association genetics of complex traits in conifers. Trends Plant Sci. 2004;9(7):325330.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] and that they are directly linked with the allele at the locus of interest. As FMs are directly linked with the genes of interest and recombination between gene and marker is absent, false selection and loss of information in marker-assisted breeding are avoided [205Andersen JR, Lübberstedt T. Functional markers in plants. Trends Plant Sci. 2003;8(11):554560.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. FMs can fix alleles in a population more efficiently, and selection is more balanced and controlled. FMs can be used for the construction of linked FM haplotypes and also for the validation of cultivar identity [169Lübberstedt T, Zein I, Andersen JR, et al. Development and application of functional markers in maize. Euphytica. 2005;146(1):101108.[Crossref], [Web of Science ®], [Google Scholar]]. The most important steps involved in the development of FMs are described in Figure 4.

Figure 4. Important steps involved in the development of functional markers.

FMs in plant breeding

FMs have mainly been applied successfully in breeding for agronomic traits, quality traits and disease resistance in crop plants. Many FMs are available for different agronomic traits, allowing plant breeders to select rare recombinants without wasting time and resources on screening large numbers of plants [207Liu Y, He Z, Appels R, et al. Functional markers in wheat: current status and future prospects. Theor Appl Genet. 2012;125(1):110.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. In wheat, 30 genes have been cloned and more than 97 FMs have been developed for various traits of interest, such as disease resistance and processing quality, and these FMs are being used successfully in wheat breeding [208Zhang X, Yang S, Zhou Y, et al. Distribution of the Rht-B1b, Rht-D1b and Rht8 reduced height genes in autumn-sown Chinese wheats detected by molecular markers. Euphytica. 2006;152(1):109116.[Crossref], [Web of Science ®], [Google Scholar]]. FMs have been developed to discriminate the semi-dwarf alleles Rht-B1b and Rht-D1b from the wild-type alleles Rht-B1a and Rht-D1a [209Andeden E, Yediay F, Baloch F, et al. Distribution of vernalization and photoperiod genes (Vrn-A1, Vrn-B1, Vrn-D1, Vrn-B3, Ppd-D1) in Turkish bread wheat cultivars and landraces. Cereal Res Commun. 2011;39(3):352364.[Crossref], [Web of Science ®], [Google Scholar]]. Similarly, Ppd-D1 (photoperiod response gene) and Vrn-A1, Vrn-B1, Vrn-D1 and Vrn-B3 (vernalization genes) have been screened as candidate genes in various Turkish bread wheat cultivars and landraces [208–210Zhang X, Yang S, Zhou Y, et al. Distribution of the Rht-B1b, Rht-D1b and Rht8 reduced height genes in autumn-sown Chinese wheats detected by molecular markers. Euphytica. 2006;152(1):109116. Andeden E, Yediay F, Baloch F, et al. Distribution of vernalization and photoperiod genes (Vrn-A1, Vrn-B1, Vrn-D1, Vrn-B3, Ppd-D1) in Turkish bread wheat cultivars and landraces. Cereal Res Commun. 2011;39(3):352364. Shaaf S, Sharma R, Baloch FS, et al. The grain Hardness locus characterized in a diverse wheat panel (Triticum aestivum L.) adapted to the central part of the Fertile Crescent: genetic diversity, haplotype structure, and phylogeny. Mol Genet Genomics. 2015;291(3):12591275.]. Many candidate genes have been developed into FMs in various crops.

Targeting induced local lesions in genome (TILLING)

Targeting induced local lesions in genome (TILLING) was first developed by McCallum in the late 1990s while characterizing the function of two genes in Arabidopsis. It is a non-transgenic reverse-genetics technique and is applicable in most plants [211Anuradha K, Agarwal S, Rao YV, et al. Mapping QTLs and candidate genes for iron and zinc concentrations in unpolished rice of Madhukar× Swarna RILs. Gene. 2012;508(2):233240.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The first important step in TILLING is the development of a mutated population using a standard mutagen such as ethyl methanesulfonate (EMS) [212McCallum CM, Comai L, Greene EA, et al. Targeting induced local lesions in genomes (TILLING) for plant functional genomics. Plant Physiol. 2000;123(2):439442.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Mutations in the targeted sequence are then identified using methods such as high-performance liquid chromatography, mass spectrometry, array-based technologies and enzymatic mismatch cleavage [213Kurowska M, Daszkowska-Golec A, Gruszka D, et al. TILLING – a shortcut in functional genomics. J Appl Genet. 2011;52(4):371390.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. After this, bioinformatics tools such as PARSESNP (Project Aligned Related Sequences and Evaluate SNPs) are applied for the analysis of these mutants. The technique is applicable to any species and is not affected by genome size or ploidy level. A high rate of point mutations can be achieved through this technique. High-throughput TILLING saves time and provides precise identification of new alleles at lower cost [212McCallum CM, Comai L, Greene EA, et al. Targeting induced local lesions in genomes (TILLING) for plant functional genomics. Plant Physiol. 2000;123(2):439442.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The key steps involved in TILLING are described in Figure 5.
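
The analysis step can be pictured with a small Python sketch that classifies the effect of a point mutation on a single codon. This is not a reimplementation of PARSESNP; it is a minimal illustration that assumes the standard genetic code, and the function name and example codons are hypothetical. The examples use G-to-A changes because EMS is generally reported to induce mainly G/C-to-A/T transitions.

```python
# Standard genetic code, built from the compact TCAG-ordered amino acid string.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def classify_point_mutation(codon, position, new_base):
    """Return 'silent', 'missense', 'nonsense' or 'stop-lost' for a single
    base change at `position` (0, 1 or 2) of a DNA codon."""
    codon = codon.upper()
    mutant = codon[:position] + new_base.upper() + codon[position + 1:]
    before, after = CODON_TABLE[codon], CODON_TABLE[mutant]
    if before == after:
        return "silent"
    if after == "*":
        return "nonsense"   # premature stop codon, likely loss of function
    if before == "*":
        return "stop-lost"
    return "missense"

print(classify_point_mutation("TGG", 2, "A"))  # Trp codon -> TGA
print(classify_point_mutation("GAG", 0, "A"))  # Glu codon -> AAG
print(classify_point_mutation("GGG", 2, "A"))  # Gly codon -> GGA
```

Nonsense and missense changes are usually the mutations of interest in a reverse-genetics screen, because they are the ones most likely to alter protein function.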


Virus-induced gene silencing (VIGS)

Virus-induced gene silencing (VIGS) is a viral vector methodology that exploits an RNA-mediated defence mechanism. When a virus infects a plant cell, the plant activates an RNA-based defence system against the virus. Infection leads to viral RNA replication, which produces a double-stranded RNA (dsRNA) replication intermediate. This dsRNA intermediate is processed into short interfering RNAs (siRNAs) in the infected cell. The siRNAs then guide the RNase complex by base pairing so that it specifically targets single-stranded (ss) RNA that is similar in sequence to the dsRNA [214Comai L, Henikoff S. TILLING: practical single-nucleotide mutation discovery. Plant J. 2006;45(4):684694.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. VIGS is a virus vector technique that utilizes this defence system. The dsRNA replication intermediate is processed so that the siRNAs in the infected cell correspond to parts of the viral vector genome, including any non-viral insert. Thus, when the insert is derived from a host gene, the siRNAs direct the RNase complex to the corresponding host mRNA, and the symptoms in the infected plant reveal the loss of function of the encoded protein [215Gupta B, Saha J, Sengupta A, et al. Recent advances on virus induced gene silencing (VIGS): plant functional genomics. J Plant Biochem Physiol. 2013:1:e116.[Crossref], [Google Scholar]]. In recent years, VIGS has been applied successfully in plant reverse genetics. It is a simple, cost-effective and high-throughput method, used mainly to identify the loss-of-function phenotype of a gene of interest [214Comai L, Henikoff S. TILLING: practical single-nucleotide mutation discovery. Plant J. 2006;45(4):684694.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],216Lu R, Malcuit I, Moffett P, et al. High throughput virus-induced gene silencing implicates heat shock protein 90 in plant disease resistance. Embo J. 2003;22(21):56905699.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. The general methodology of VIGS is shown in Figure 6.


Recent advancements in multiplexed functional/linked markers

Marker technology continues to advance steadily. SNP markers have become the marker of choice since the development of NGS and have been applied in the genotyping of various crops. A large number of linked markers have been converted into FMs and used successfully in MAS programmes in different crops. However, most of these markers are available only in uniplex form. To achieve more effective and precise results from MAS, uniplex assays are shifting to multiplex systems [146Bernardo A, Wang S, Amand PS, et al. Using next generation sequencing for multiplexed trait-linked markers in wheat. PloS One. 2015;10(12):e0143890.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. KASP (Kompetitive Allele Specific PCR) is a recent multiplexed technique used to move from uniplex to multiplex systems by combining several markers in a single assay.

KASP™ was first developed by KBioscience (now LGC Genomics) [217Braae A, Thompson CE, Morgan K. Comparison of custom designed KASP and TaqMan genotyping assays for a rare genetic variant identified through resequencing GWAS loci. Available from: www.lgcgroup.com[Google Scholar]] for in-house genotyping and subsequently developed into a leading genotyping technology worldwide. It is a homogeneous, fluorescence-based genotyping technology. The technique depends on allele-specific oligo extension and uses fluorescence resonance energy transfer for signal generation. Plates with 96, 384 and 1536 wells can be used for KASP genotyping. The success rate of assay design in KASP is 98%–100%, with 93%–94% successful conversion into working assays. KASP saves time and costs less than the GoldenGate® assay [218Kumpatla SP, Abdurakhmonov IY, Mammadov JA, et al. Genomics-assisted plant breeding in the 21st century: technological advances and progress. Rijeka: InTech; 2012.[Google Scholar]]. The KASP assay has been applied successfully mainly in wheat, maize and rice and in a few other crops. It has also been applied with NGS to develop multiplexed trait-linked markers in wheat [219Rasheed A, Wen W, Gao F, et al. Development and validation of KASP assays for genes underpinning key economic traits in bread wheat. Theor Appl Genet. 2016;129(10):18431860.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Recently, 70 KASP assays were developed and validated in wheat and found to be significantly associated with various traits of interest [219Rasheed A, Wen W, Gao F, et al. Development and validation of KASP assays for genes underpinning key economic traits in bread wheat. Theor Appl Genet. 2016;129(10):18431860.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
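
Because KASP genotyping is read out as fluorescence, a genotype call can be sketched as a decision on two normalized dye signals. The Python example below is an illustrative assumption, not the vendor's calling algorithm: the two-dye layout (labelled here as FAM and HEX), the angle thresholds and the minimum-signal cut-off are hypothetical, and production software typically relies on cluster analysis across a whole plate.

```python
import math

def call_genotype(fam, hex_, min_signal=0.2, low_deg=30.0, high_deg=60.0):
    """Call a biallelic genotype from two normalized fluorescence signals.

    fam  : signal of the dye reporting allele 1 (assumed)
    hex_ : signal of the dye reporting allele 2 (assumed)
    """
    # Very weak total signal: failed reaction or no-template control.
    if math.hypot(fam, hex_) < min_signal:
        return "no call"
    # Angle of the data point in the (fam, hex) plane, 0-90 degrees.
    angle = math.degrees(math.atan2(hex_, fam))
    if angle < low_deg:
        return "allele1/allele1"
    if angle > high_deg:
        return "allele2/allele2"
    return "allele1/allele2"

samples = {"S1": (1.0, 0.1), "S2": (0.1, 1.0), "S3": (0.8, 0.8), "NTC": (0.05, 0.05)}
for name, (fam, hex_) in samples.items():
    print(name, call_genotype(fam, hex_))
```

Points near either axis are called homozygous, points near the diagonal heterozygous, and points near the origin are left uncalled.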

Genomic selection (GS): a step forward from MAS

Genomic selection (GS) is an advanced form of marker-assisted selection and was first developed by Meuwissen et al. [220Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):18191829.[PubMed], [Web of Science ®], [Google Scholar]]. It is a technique that has the ability to predict the genetic values of selected candidates depending on the genome-estimated breeding values (GEBVs) predicted from high density of markers that are distributed throughout the genome. GEBV is a prediction model that combines the phenotypic data with marker and pedigree data in order to increase the accuracy of prediction. As compared to MAS, GEBV is dependent on all markers including major and minor marker effects [221Newell MA, Jannink JL. Genomic selection in plant breeding. In: Fleury D, Whitford R, editors. Crop breeding: methods and protocols. New York (NY): Humana Press; 2014. p. 117130. (Methods in Molecular Biology; Vol. 1145).[Crossref], [Google Scholar]]. In this technique, genetic markers having the ability to cover the whole genome are selected and utilized in a way that all QTLs are in LD with at least a single marker [222Goddard ME, Hayes BJ. Genomic selection. J Anim Breed Genet. 2007;124(6):323330.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Genomic selection of complex traits and high-throughput phenotyping have brought a revolution in breeding by enhancing the accuracy level of selection [223Ingvarsson PK, Street NR. Association genetics of complex traits in plants. New Phytol. 2011;189(4):909922.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Important steps in GS are:
(1) Development of a training population using diverse germplasm;
(2) Phenotyping and genotyping of the training population;
(3) Selection of individuals having superior GEBVs on the basis of their genotypic data;
(4) Genotypic data from the progeny of these individuals, which form the testing population, are used as input for the GS model to obtain their GEBVs;
(5) Individuals with maximum GEBVs are again selected;
(6) Selected individuals are used as parents of the next offspring for continuous selection and breeding [220Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):18191829.[PubMed], [Web of Science ®], [Google Scholar],224Lorenz AJ, Chao S, Asoro FG, et al. Chapter 2: genomic selection in plant breeding: knowledge and prospects. In: Sparks DL, editor. Vol. 110, Advances in agronomy. San Diego (CA): Academic Press; 2011; p. 77123.[Google Scholar]].
The general methodology of GS is described in Figure 7.
Figure 7. General methodology of genomic selection (GS).
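
The GS workflow listed above can be sketched in a few lines of Python. The model used here, ridge regression on marker genotypes, is one commonly used choice and is an assumption of this example; the text does not prescribe a specific model. The genotype coding (-1, 0, 1), the ridge parameter, the toy data and all function names are hypothetical, and only the standard library is used so that the sketch is self-contained.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination
    with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def fit_marker_effects(genotypes, phenotypes, ridge=1.0):
    """Steps 1-2: estimate all marker effects jointly from the training
    population by solving (X'X + ridge * I) beta = X'(y - mean)."""
    n_markers = len(genotypes[0])
    mean = sum(phenotypes) / len(phenotypes)
    centred = [y - mean for y in phenotypes]
    xtx = [[sum(g[i] * g[j] for g in genotypes) + (ridge if i == j else 0.0)
            for j in range(n_markers)] for i in range(n_markers)]
    xty = [sum(g[i] * y for g, y in zip(genotypes, centred))
           for i in range(n_markers)]
    return mean, solve(xtx, xty)

def gebv(genotype, mean, effects):
    """Genome-estimated breeding value from genotype data only."""
    return mean + sum(x * b for x, b in zip(genotype, effects))

def select_top(candidates, mean, effects, k):
    """Steps 3-6: rank candidates by GEBV and keep the best k as parents."""
    ranked = sorted(candidates,
                    key=lambda name: gebv(candidates[name], mean, effects),
                    reverse=True)
    return ranked[:k]

# Toy training population: 4 lines, 2 markers, phenotypes = 10 + 2*m1 + 1*m2
training_genotypes = [[1, 1], [1, -1], [-1, 1], [-1, -1]]
training_phenotypes = [13.0, 11.0, 9.0, 7.0]
mean, effects = fit_marker_effects(training_genotypes, training_phenotypes)

# Candidates are genotyped but not phenotyped
candidates = {"C1": [1, 1], "C2": [-1, 1], "C3": [1, -1], "C4": [0, 0]}
parents = select_top(candidates, mean, effects, k=2)
print(effects, parents)
```

The ridge penalty shrinks the estimated effects towards zero (about 1.6 and 0.8 here, against simulated effects of 2 and 1), but the ranking of the candidates is preserved, and ranking is what selection depends on. Real applications use thousands of markers and dedicated statistical software.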

High-throughput phenotyping

With the rapid increase in the world population, the demand for food is also increasing, and high-yielding varieties with greater resistance to biotic and abiotic stress are needed. This requires precise correlation of genotype with phenotype [225Furbank RT, Tester M. Phenomics – technologies to relieve the phenotyping bottleneck. Trends Plant Sci. 2011;16(12):635644.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. High-throughput phenotyping platforms (HTPPs) were introduced for precise phenotyping. HTPPs acquire comprehensive measurements of plant attributes that provide accurate information about the traits of interest [226Finkel E. With ‘phenomics,’ plant scientists hope to shift breeding into overdrive. Science. 2009;325(5939):380381.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Advanced cameras, sensors, robotics and computers are used to collect precise data [227Spalding EP, Miller ND. Image analysis is driving a renaissance in growth measurement. Curr Opin Plant Biol. 2013;16(1):100104.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Continuing development in the field of HTPP is making it possible to obtain precise data for various complex traits [228Montes JM, Technow F, Dhillon BS, et al. High-throughput non-destructive biomass determination during early plant development in maize under field conditions. Field Crop Res. 2011;121(2):268273.[Crossref], [Web of Science ®], [Google Scholar]].

Genomic selection and genome editing together: new way in crop improvement

With advances in genetic engineering, many techniques have evolved to modify a single locus of a target organism. This goal was realized with the development of CRISPR (clustered regularly interspaced short palindromic repeat), a gene-editing technology. Genome editing has revolutionized plant breeding and has been applied successfully in different economically important crops. The technique allows less favourable alleles to be converted directly into more favourable ones. For the production of improved crop varieties, genomic selection and genome editing need to be used together. Genome editing shortens the time required for backcrossing between elite varieties and exotic germplasm. This exotic germplasm serves as an encyclopaedia of ancient alleles that can be drawn on to develop modern varieties with resistance to biotic and abiotic stress. GS is then applied to recombine alleles that are already adapted [229Spindel JE, Begum H, Akdemir D, et al. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity. 2016;116;395408.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Genome editing (CRISPR)

CRISPR is a genome-editing technique applied successfully in various plants [230Feng Z, Zhang B, Ding W, et al. Efficient genome editing in plants using a CRISPR/Cas system. Cell Res. 2013;23(10):12291232.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Cas9 is a recent advancement in genome-editing technology and is becoming the technique of choice owing to its many advantages, such as ease of use, genome-editing versatility and the ability to cleave methylated loci [231Hsu PD, Scott DA, Weinstein JA, et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat Biotechnol. 2013;31(9):827832.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],232Lozano-Juste J, Cutler SR. Plant genome engineering in full bloom. Trends Plant Sci. 2014;19(5):284287.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. CRISPR RNAs and Cas protein are the two most important components of the CRISPR technique. CRISPR RNA (crRNA) and trans-activating CRISPR RNA (tracrRNA) are two short RNAs that direct cleavage of a particular target site by the Cas9 endonuclease (the most explored Cas protein). A single guide RNA (sgRNA) results when crRNA and tracrRNA are fused artificially [233Qi LS, Larson MH, Gilbert LA, et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell. 2013;152(5):11731183.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. When sgRNA is combined with Cas protein, an RNA-guided endonuclease is formed that mediates cleavage at a particular sequence in the genome [234Jinek M, East A, Cheng A et al. RNA-programmed genome editing in human cells. Elife. 2013;2:e00471.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. On the basis of the Cas proteins, CRISPR–Cas systems are grouped into three types: I, II and III. Cas1 and Cas2 are two different proteins that are present in all three types. Type I is present in both archaea and bacteria, type II is present only in bacteria, and type III is most common in archaea but also occurs in some bacteria [235Makarova KS, Haft DH, Barrangou R et al. Evolution and classification of the CRISPR–Cas systems. Nat Rev Microbiol. 2011;9(6):467477.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. Genome editing has been performed successfully in model plants like Nicotiana tabacum [236Shan Q, Wang Y, Li J, et al. Targeted genome modification of crop plants using a CRISPR-Cas system. Nat Biotechnol. 2013;31(8):686688.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]], Arabidopsis [237Ali Z, Abul-faraj A, Piatek M, et al. Activity and specificity of TRV-mediated gene editing in plants. Plant Signal Behav. 2015;10:e1044191.[Taylor & Francis Online], [Web of Science ®], [Google Scholar]] and some economically important crops like maize [238Svitashev S, Young JK, Schwartz C, et al. Targeted mutagenesis, precise gene editing, and site-specific gene insertion in maize using Cas9 and guide RNA. Plant Physiol. 2015;169(2):931945.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]] and wheat [239Ma X, Zhang Q, Zhu Q et al. A robust CRISPR/Cas9 system for convenient, high-efficiency multiplex genome editing in monocot and dicot plants. Mol Plant. 2015;8(8):12741284.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Mechanism

Acquisition, expression and interference are the three steps used by the CRISPR–Cas system to identify and target the genetic material of a pathogen. During acquisition, foreign DNA is identified and integrated into the CRISPR locus as a spacer. During the acquisition of DNA fragments, a short stretch (2–5 nucleotides) of conserved sequence next to the protospacer, the protospacer adjacent motif (PAM), is used as the identification motif. A single copy of a 30-bp spacer is inserted at the AT (adenine–thymine)-rich leader side of the CRISPR array and duplicated [240Garneau JE, Dupuis ME, Villion M, et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature. 2010;468(7320):6771.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. During the expression step, a long pre-crRNA is transcribed from the CRISPR locus, and tracrRNA and Cas proteins (Cas1, Cas2, Cas9 and Cas4/Csn2) are used to process it into crRNAs [241Karvelis T, Gasiunas G, Miksys A et al. crRNA and tracrRNA guide Cas9-mediated DNA interference in Streptococcus thermophilus. RNA Biol. 2013;10(5):841851.[Taylor & Francis Online], [Web of Science ®], [Google Scholar]]. During the interference step, crRNA guides the Cas protein complex to the particular target region of the foreign DNA for cleavage, thus providing immunity against pathogen attack [240Garneau JE, Dupuis ME, Villion M, et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature. 2010;468(7320):6771.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],242Marraffini LA, Sontheimer EJ. CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea. Nat Rev Genet. 2010;11(3):181190.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].
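
The role of the PAM as an identification motif also shapes how target sites are chosen in genome-editing work. The Python sketch below scans one DNA strand for candidate sites. It rests on assumptions that are not stated in the text: a 20-nt protospacer followed directly by an NGG PAM, the motif widely described for the commonly used Streptococcus pyogenes Cas9. The function name and test sequence are hypothetical, and the reverse strand and off-target checks are left out.

```python
import re

def find_target_sites(sequence, protospacer_len=20, pam_pattern="[ACGT]GG"):
    """Return (start, protospacer, pam) for every position on the given
    strand where a protospacer is immediately followed by a PAM."""
    sequence = sequence.upper()
    # A lookahead is used so that overlapping candidate sites are all reported.
    pattern = re.compile(
        r"(?=([ACGT]{%d})(%s))" % (protospacer_len, pam_pattern)
    )
    return [(m.start(), m.group(1), m.group(2))
            for m in pattern.finditer(sequence)]

# Hypothetical sequence: a 20-nt stretch, then TGG, then four more bases
dna = "ACGTACGTACGTACGTACGT" + "TGG" + "AAAA"
for start, protospacer, pam in find_target_sites(dna):
    print(start, protospacer, pam)
```

Here the only candidate starts at position 0, with TGG as the PAM. A practical design tool would also scan the reverse complement and score each candidate for specificity.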

Reasons for the underutilization of molecular markers in crop plants

DNA markers were developed in the 1980s and, after the development of the first PCR-based markers in the 1990s, a large number of markers have been developed and applied for various purposes [243–246Xu Y, Crouch JH. Marker-assisted selection in plant breeding: from publications to practice. Crop Sci. 2008;48(2):391407. Baloch FS, Alsaleh A, Andeden et al. High levels of segregation distortion in the molecular linkage map of bread wheat representing the West Asia and North Africa region. Turk J Agric For 2016;40(3):352364. Baloch FS, Derya M, Andeden EE, et al. Inter-primer binding site retrotransposon and inter-simple sequence repeat diversity among wild Lens species. Biochem Syst Ecol. 2015;58:162168. Andeden EE, Baloch FS, et al. Development, characterization and mapping of microsatellite markers for lentil (Lens culinaris Medik.). Plant Breed. 2015;134(5):589598.]. However, judicious use of these markers has begun only in the last few years [198Hospital F. Selection in backcross programmes. Philos Trans Biol Sci. 2005:15031511.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]]. With the development and advancement of marker technology, a huge number of research papers are published annually. However, a large proportion of these papers have little effect on practical breeding [243Xu Y, Crouch JH. Marker-assisted selection in plant breeding: from publications to practice. Crop Sci. 2008;48(2):391407.[Crossref], [Web of Science ®], [Google Scholar]]. Similarly, QTL mapping generates large numbers of publications reporting newly identified QTLs. These QTLs have been identified in research programmes; the linked markers need to be applied after careful validation, and functional diagnostic markers need to be developed, so that breeding programmes succeed and benefit farmers' fields [198Hospital F. Selection in backcross programmes. Philos Trans Biol Sci. 2005:15031511.[Crossref], [PubMed], [Web of Science ®], [Google Scholar]].

Conclusions

The last 30 years have witnessed continuous development in molecular marker technology, from RFLP to SNPs and a diversity of array-technology-based markers. Advances in sequencing technologies have led to NGS platforms that are low cost and high throughput. Despite these highly advanced molecular genetic techniques, breeding goals are still not being fully achieved, and the main reason lies in inaccurate phenotyping. High-throughput phenotyping techniques address this problem by using light, cameras, sensors, computers and highly modified devices to collect very precise phenotypic data, which is a core requirement for achieving breeding goals. CRISPR technology has revolutionized plant breeding and genetics, and researchers are focusing on editing the genomes of all economically important plants. The coming years are likely to see continued innovation in molecular marker technology, making it more precise, productive and cost-effective for investigating the underlying biology of various traits of interest.

Acknowledgement

The authors are very grateful to TÜBİTAK (The Scientific and Technological Research Council of Turkey) for providing a doctoral fellowship to Muhammad Azhar Nadeem (Project Number: 215O630) and to the Abant İzzet Baysal University Scientific Research Unit (Project Number: 2015.10.07.872).

Disclosure statement

The authors report no conflicts of interest.

Table 1. Comparison of important characteristics of the most commonly used molecular markers.
Characteristics | RFLP | RAPD | AFLP | ISSR | SSR | SNP | DArT | Retrotransposons
Co-dominant/Dominant | Co-dominant | Dominant | Dominant | Dominant | Co-dominant | Co-dominant | Dominant | Dominant
Reproducibility | High | High | Intermediate | Medium–High | High | High | High | High
Polymorphism level | Medium | Very high | High | High | High | High | High | High
Required DNA quality | High | High | High | Low | Low | High | High | High
Required DNA quantity | High | Medium | Low | Low | Low | Low | Low | Low
Marker index | Low | High | Medium | Medium | Medium | High | High | High
Genome abundance | High | Very high | Very high | Medium | Medium | Very high | Very high | High
Cost | High | Less | High | High | High | Variable | Cheapest | Cheapest
Sequencing | Yes | No | No | No | Yes | Yes | Yes | No
Status | Past | Past | Past | Present | Present | Present | Present | Present
PCR requirement | No | Yes | Yes | Yes | Yes | Yes | No | Yes
Visualization | Radioactive | Agarose gel | Agarose gel | Agarose gel | Agarose gel | SNP-VISTA | Microarray | Agarose gel
Required DNA (ng) | 10000 | 20 | 500–1000 | 50 | 50 | 50 | 50–100 | 25–50

Table 2. Advantages and disadvantages of different genetic markers.
Markers | Advantages | Disadvantages | References
Morphological | Easy to use; cheaper; visually characterized | Less polymorphic; influenced by environment; influenced by plant growth stages | [6Eagles HA, Bariana HS, Ogbonnaya FC, et al. Implementation of markers in Australian wheat breeding. Crop Pasture Sci. 2001;52(12):13491356.[Crossref], [Google Scholar]]
Isozymes | No need of specific instrument; easy to use; co-dominant | Less polymorphic; influenced by environmental factors | [10Mondini L, Noorani A, Pagnotta MA. Assessing plant genetic diversity by molecular tools. Diversity. 2009;1(1):1935.[Crossref], [Google Scholar]]
RFLPs | Co-dominant; no need of prior sequence information | Time consuming; high quantity of pure DNA needed; expensive | [12Madhumati B. Potential and application of molecular markers techniques for plant genome analysis. Int J Pure App Biosci. 2014;2(1):16988.[Google Scholar]]
RAPD | Easy to use; less quantity of DNA is required; polymorphic | Dominant; highly purified DNA is required; low reproducibility; not locus-specific | [5Jiang GL. Molecular markers and marker-assisted breeding in plants. In: Andersen SB, editor. Plant breeding from laboratories to fields. Rijeka: InTech; 2013. p. 4583.[Crossref], [Google Scholar],12Madhumati B. Potential and application of molecular markers techniques for plant genome analysis. Int J Pure App Biosci. 2014;2(1):16988.[Google Scholar]]
AFLP | Reliable; high reproducibility; more informative | Dominant marker; highly purified DNA is required; high quantity of pure DNA needed | [12Madhumati B. Potential and application of molecular markers techniques for plant genome analysis. Int J Pure App Biosci. 2014;2(1):16988.[Google Scholar],23Ridout CJ, Donini P. Use of AFLP in cereals research. Trends Plant Sci. 1999;4(2):7679.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],24Blears MJ, De Grandis SA, Lee H, et al. Amplified fragment length polymorphism (AFLP): a review of the procedure and its applications. J Ind Microbiol Biotechnol. 1998;21(3):99114.[Crossref], [Web of Science ®], [Google Scholar]]
SSRs | Co-dominant marker; less quantity of DNA is required; high reproducibility | High developmental cost; presence of more null alleles; occurrence of homoplasy | [30Provan J, Powell W, Hollingsworth PM. Chloroplast microsatellites: new tools for studies in plant ecology and evolution. Trends Ecol Evol. 2001;16(3):142147.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],33Zane L, Bargelloni L, Patarnello T. Strategies for microsatellite isolation: a review. Mol Ecol. 2002;11(1):116.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],34Kalia RK, Rai MK, Kalia S, et al. Microsatellite markers: an overview of the recent progress in plants. Euphytica. 2011;177(3):309334.[Crossref], [Web of Science ®], [Google Scholar]]
ISSRHighly polymorphic
Simple and easy to use
No need of prior sequence information
low reproducibility
Pure DNA is required.
Fragment are not same sized
[44Zietkiewicz E, Rafalski A, Labuda D. Genome fingerprinting by simple sequence repeat (SSR)-anchored polymerase chain reaction amplification. Genomics. 1994;20(2):176183.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],47Moreno S, Martín JP, Ortiz JM. Inter-simple sequence repeats PCR for characterization of closely related grapevine germplasm. Euphytica. 1998;101(1):11725.[Crossref], [Web of Science ®], [Google Scholar],49Ng WL, Tan SG. Inter-simple sequence repeat (ISSR) markers: are we doing it right? ASM Sci J. 2015;9:3039.[Google Scholar]]
SRAPSimple
Reliable
Easy isolation of bands
Dominant marker
Moderate–high throughput ratio
[42Li G, Quiros CF. Sequence-related amplified polymorphism (SRAP), a new marker system based on a simple PCR reaction: its application to mapping and gene tagging in Brassica. Theor Appl Genet. 2001;103(2–3):455461.[Crossref], [Web of Science ®], [Google Scholar],43Uzun A, Yesiloglu T, Aka-Kacar Y, et al. Genetic diversity and relationships within citrus and related genera based on sequence related amplified polymorphism markers (SRAPs). Sci Hort. 2009;121(3):306312.[Crossref], [Web of Science ®], [Google Scholar]]
RetrotransposonsSimple and easy to use
No need of prior sequence information
High reproducibility
Dominant marker[59Roy NS, Choi JY, Lee SI, et al. Marker utility of transposable elements for plant genetics, breeding, and ecology: a review. Genes Genom. 2015;37(2):141151.[Crossref], [Web of Science ®], [Google Scholar],61Kalendar R, Flavell AJ, Ellis TH, et al. Analysis of plant diversity with retrotransposon-based molecular markers. Heredity. 2011;106(4):520530.[Crossref], [PubMed], [Web of Science ®], [Google Scholar],62Kalendar R, Grob T, Regina M, et al. IRAP and REMAP: two new retrotransposon-based DNA fingerprinting techniques. Theor Appl Genet. 1999;98(5):704711.[Crossref], [Web of Science ®], [Google Scholar]]
SNPCost effective
Widely distributed in genome
No need of prior sequence information
High reproducibility
Co-dominant marker
High developmental cost[5Jiang GL. Molecular markers and marker-assisted breeding in plants. In: Andersen SB, editor. Plant breeding from laboratories to fields. Rijeka: InTech; 2013. p. 4583.[Crossref], [Google Scholar],12Madhumati B. Potential and application of molecular markers techniques for plant genome analysis. Int J Pure App Biosci. 2014;2(1):16988.[Google Scholar]]
DArTCost effective
High throughput
Highly polymorphic
Prior sequence information not needed
High reproducibility
Dominant marker
High developmental cost
[104–106Jaccoud D, Peng K, Feinstein D, et al. Diversity arrays: a solid state technology for sequence information independent genotyping. Nucleic Acids Res. 2001;29(4):E25.
Wenzl P, Carling J, Kudrna D, et al. Diversity Arrays Technology (DArT) for whole-genome profiling of barley. Proc Natl Acad Sci U S A. 2004;101(26):99159920.
Huttner E, Wenzl P, Akbari M, et al. Diversity arrays technology: a novel tool for harnessing the genetic potential of orphan crops. In: Serageldin I, Persley GJ, editors. Discovery to delivery: BioVision Alexandria 2004; Proceedings of the 2004 Conference of the World Biological Forum; 2004 Apr 3–6; Alexandria, Egypt. Wallingford: CABI; 2005. p. 145155.
]