In which image will adenine (a) be the next nucleotide to be added to the primer?

How to Design a Primer


Primer Design for PCR

Oligonucleotide primers are necessary when running a PCR reaction. One needs to design primers that are complementary to the template region of DNA. They are synthesized chemically by joining nucleotides together. One must selectively block and unblock repeatedly the reactive groups on a nucleotide when adding a nucleotide one at a time. The main property of primers is that they must correspond to sequences on the template molecule (must be complementary to template strand). However, primers do not need to correspond to the template strand completely; it is essential, however, that the 3’ end of the primer corresponds completely to the template DNA strand so elongation can proceed. Usually a guanine or cytosine is used at the 3’ end, and the 5’ end of the primer usually has stretches of several nucleotides. Also, both of the 3’ ends of the hybridized primers must point toward one another.

The size of the primer is very important as well. Short primers are mainly used for amplifying a small, simple fragment of DNA. On the other hand, a long primer is used to amplify a eukaryotic genomic DNA sample. However, a primer should not be too long (> 30-mer primers) or too short. Short primers produce inaccurate, nonspecific DNA amplification product, and long primers result in a slower hybridizing rate. On average, the DNA fragment that needs to be amplified should be within 1-10 kB in size.

The structure of the primer should be relatively simple and contain no internal secondary structure to avoid internal folding. One also needs to avoid primer-primer annealing which creates primer dimers and disrupts the amplification process. When designing, if unsure about what nucleotide to put at a certain position within the primer, one can include more than one nucleotide at that position termed a mixed site. One can also use a nucleotide-based molecular insert (inosine) instead of a regular nucleotide for broader pairing capabilities.

Taking into consideration the information above, primers should generally have the following properties:

  • Length of 18-24 bases
  • 40-60% G/C content
  • Start and end with 1-2 G/C pairs
  • Melting temperature (Tm) of 50-60°C
  • Primer pairs should have a Tm within 5°C of each other
  • Primer pairs should not have complementary regions

    Note: If you will be including a restriction site at the 5’ end of your primer, note that a 3-6 base pair "clamp" should be added upstream in order for the enzyme to cleave efficiently (e.g. GCGGCG-restriction site-your sequence).

In which image will adenine (a) be the next nucleotide to be added to the primer?

Protocol Video

Watch the protocol video below to learn how to design primers for PCR.

Basics of DNA Biology and Genetics

John M. Butler, in Fundamentals of Forensic DNA Typing, 2010

Base pairing and hybridization of DNA strands

In its natural state in the cell, DNA is actually composed of two strands that are linked together through a process known as hybridization. Individual nucleotides pair up with their ‘complementary base’ through hydrogen bonds that form between the bases. The base-pairing rules are such that adenine can only hybridize to thymine and cytosine can only hybridize to guanine (Figure 2.2). There are two hydrogen bonds between the adenine–thymine base pair and three hydrogen bonds between the guanine–cytosine base pair. Thus, GC base pairs are stuck together a little stronger than AT base pairs. The two DNA strands form a twisted ladder shape or double helix due to this ‘base-pairing’ phenomenon (Figure 2.2).

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 2.2. Base pairing of DNA strands to form double-helix structure.

The two strands of DNA are ‘anti-parallel’; that is, one strand is in the 5′ to 3′ orientation and the other strand lines up in the 3′ to 5′ direction relative to the first strand. By knowing the sequence of one DNA strand, its complementary sequence can easily be determined based on the base-pairing rules of A with T and G with C. These combinations are sometimes referred to as Watson–Crick base pairs for James Watson and Francis Crick who discovered this structural relationship in 1953.

Hybridization of the two strands is a fundamental property of DNA. However, the hydrogen bonds holding the two strands of DNA together through base pairing may be broken by elevated temperature or by chemical treatment, a process known as denaturation. A common method for denaturing double-stranded DNA is to heat it to near boiling temperatures. The DNA double helix can also be denatured by placing it in a salt solution of low ionic strength or by exposing it to chemical denaturants such as urea or formamide, which destabilize DNA by forming hydrogen bonds with the nucleotides and preventing their association with a complementary DNA strand.

Denaturation is a reversible process. If a double-stranded piece of DNA is heated up, it will separate into its two single strands. As the DNA sample cools, the single DNA strands will find their complementary sequence and rehybridize or anneal to each other. The process of the two complementary DNA strands coming back together is referred to as renaturation or reannealing.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780123749994000023

Liquid crystalline DNA

Luciano De Sio, ... Roberto Bartolino, in Hybrid Polymer Composite Materials, Volume 4, 2017

13.1.1 DNA and its liquid crystalline phases

DNA is a semiflexible polymer made of two polynucleotide chains, held together by weak thermodynamic forces (Lander and Weinberg, 2000). The monomer units of DNA are the nucleotides. Each nucleotide consists of a 5-carbon sugar (deoxyribose), a nitrogen containing base attached to the sugar, and a phosphate group (Travers and Muskhelishvili, 2015). There are four different types of nucleotides found in DNA: adenine (A), thymine (T), cytosine (C), and guanine (G). A and T are connected by two hydrogen bonds while G and C are connected by three hydrogen bonds (Fig. 13.1).

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 13.1. DNA chemical structure.

An interesting and comprehensively studied phenomenon is represented by the tendency of flexible polymers in concentrated aqueous solutions to form liquid crystalline phases (Onsager, 1949; Nakata et al., 2007). Indeed, the ability of both long and ultrashort, hydrated, double-stranded DNA molecules to form liquid crystal (LC) phases has been known for more than 50 years and played a key role in the initial deciphering of the molecular DNA. Linear DNA fragments in aqueous solution form multiple LC phases whose nature depends on the polymer concentration: when increasing concentration, the isotropic solution transforms into either blue phase or precholesteric stage and then into a cholesteric phase which turns itself into columnar hexagonal (Strzelecka et al., 1988).

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780081007853000139

Gene–Environment Interplay

Jonathan Daw, Jason D. Boardman, in International Encyclopedia of the Social & Behavioral Sciences (Second Edition), 2015

Genetic Research Terminology

First, we provide some very brief background in the terminology of genetics for those who are not familiar with this field. The human genome consists of an ordered series of paired nucleotides in two matching strings, arrayed in a double helix as deoxyribonucleic acid, or DNA, in 23 chromosomes, or discrete strands of DNA. As shown in Figure 2, cells in the human body contain each of these 23 chromosomes. There are four nucleotides in the human genome: adenine (A), thymine (T), guanine (G), and cytosine (C). These are paired together determinatively, such that A is always paired with T and G with C. A polymorphism is a section of the genome in which variations in this base pair sequence are found between humans; a single nucleotide polymorphism (SNP) occurs when this consists of a single base pair difference. The different proteins are illustrated by the different colors in Figure 2. This variation in the human genome produces alleles, or DNA sequences which vary among humans. The observed combination of alleles at a location in one's genome is known as one's genotype at that location. Genetic researchers frequently seek to link genotypic variation to variation in phenotypes, which is the biological term for an organism's characteristics or traits. Phenotype is a synonym for ‘outcome variable’ in social scientific research.

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 2. Structure of the human genome: the key components. Note: This figure was taken from the collection of figures organized by the National Institute of General Medical Sciences: http://images.nigms.nih.gov/index.cfm?event=doSearch&searchTerm=genetics&typeID=&sizeID.

Behavioral genetics is a field of behavioral science which attempts to summarize the separate contributions of genetic and environmental influences on a given phenotype. Frequently behavioral genetic researchers attempt to decompose this variance into three sources: (1) heritability, which describes the proportion of variance in an outcome that is thought to be explained by all genetic influences; (2) shared environments, which characterizes the environmental influences which make pairs of individuals more phenotypically similar; and (3) unshared environments, which characterizes the environmental influences which make pairs of individuals more phenotypically dissimilar. Behavioral genetics studies are frequently conducted using twin studies (which compares the phenotypic similarity of monozygotic twins, who are genetically identical, and dizygotic twins, who share 50% of their genes on average by descent) and adoption studies (which compares the phenotypic similarity of adopted sibling pairs to that of nonadopted members of their biological family).

When genetic information is available from subjects in a study, researchers can do two types of research. The first is called a candidate gene study in which individuals are classified into groups in which they are comparable to one another across segments of the human genome that are responsible for known physiological functions. These different groups are then compared with one another to see whether they are different with respect to some behavior like smoking, height, etc. There are many different statistical techniques for candidate gene studies, but using the correct method, researchers can conclude that the gene is actually responsible for differences in the trait and accounts for the similarity of family members who are much more likely to share the gene than two unrelated individuals. Genome-wide association studies (GWAS) incorporate data on a large number of SNPs (typically between 100 000 and 2 000 000 at this time) in order to identify loci associated with a phenotype of interest. Whereas the candidate gene method is a deductive exercise in which a specific hypothesis is tested, GWAS methods are entirely inductive and are much better characterized as a discovery science. In essence, GWAS models compare those with a specific illness (e.g., cancer) to those who do not have this illness (in which large efforts are made to provide the most meaningful comparisons) across each SNP in the human genome. Currently, genome-wide genotyping provide nearly 2.5 million SNPs. The strongest associations that emerge from this data mining exercise are then investigated to see whether they have any physiological pathway through which they might actually affect the trait of interest.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B978008097086831025X

Short Tandem Repeat Markers

John M. Butler, in Fundamentals of Forensic DNA Typing, 2010

Genetic Markers and Repeated DNA Sequences

Because more than 99.7% of the human genome is the same from individual to individual, regions that differ need to be found in the remaining 0.3% in order to tell people apart at the genetic level. There are many repeated DNA sequences scattered throughout the human genome. Because these repeat sequences are typically located between genes, they can vary in size from person to person without impacting the genetic health of the individual.

Eukaryotic genomes are full of repeated DNA sequences. These repeated DNA sequences come in all types of sizes and are typically designated by the length of the core repeat unit and the number of contiguous repeat units or the overall length of the repeat region. Long repeat units may contain several hundred to several thousand bases in the core repeat.

These regions are often referred to as satellite DNA and may be found surrounding the chromosomal centromere. The term satellite arose because frequently one or more minor ‘satellite bands’ were seen in early experiments involving equilibrium density gradient centrifugation.

The core repeat unit for a medium-length repeat, sometimes referred to as a minisatellite or a VNTR (variable number of tandem repeats), is in the range of approximately 8 to 100 bases in length. As noted in Chapter 3, the previously used forensic DNA marker D1S80 is a minisatellite with a 16-bp repeat unit.

DNA regions with repeat units that are 2 to 7 bp in length are called microsatellites, simple sequence repeats (SSRs), or most usually short tandem repeats (STRs). STRs have become popular DNA repeat markers because they are easily amplified by the polymerase chain reaction (PCR) without the problems of differential amplification. This is because both alleles from a heterozygous individual are similar in size since the repeat size is small. The number of repeats in STR markers can be highly variable among individuals, which make these STRs effective for human identification purposes.

Literally thousands of polymorphic microsatellites have been characterized in human DNA and there may be more than a million microsatellite loci present depending on how they are counted. Regardless, microsatellites account for approximately 3% of the total human genome. STR markers are scattered throughout the genome and occur on average every 10,000 nucleotides.

Computer searches of the recently available human genome reference sequence have cataloged the number and nature of STR markers in the genome. A large number of STR markers have been characterized by academic and commercial laboratories for use in disease gene location studies. For example, the Marshfield Medical Research Foundation in Marshfield, Wisconsin (http://research.marshfieldclinic.org/genetics) has gathered genotype data on over 8000 STRs that are scattered across the 23 pairs of human chromosomes for the purpose of developing human genetic maps (see Chapter 17).

To perform analysis on STR markers, the invariant flanking regions surrounding the repeats must be determined. Once the flanking sequences are known, then PCR primers can be designed and the repeat region amplified for analysis (Figure 8.1). New STR markers are usually identified in one of two ways: (1) searching DNA sequence databases such as GenBank for regions with more than six or so contiguous repeat units or (2) performing molecular biology isolation methods. The availability of a reference human genome sequence now makes the first option a viable and productive one, and more than 20,000 tetranucleotide STR repeats have been located throughout the human genome. However, when the core STR loci that are widely used today were selected back in the mid-1990s, only a handful of STR loci were known and characterized.

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 8.1. (a) PCR primers anneal to unique, consistent sequences in the flanking regions that bracket the variable STR repeat region. (b) The overall sizes of STR alleles typically differ by the size of the inserted repeat unit.

Types of STR markers

STR repeat sequences are named by the length of the repeat unit. Dinucleotide repeats have two nucleotides repeated next to each other over and over again. Trinucleotides have three nucleotides in the repeat unit, tetranucleotides have four, pentanucleotides have five, and hexanucleotides have six repeat units in the core repeat. Theoretically, there are 4, 16, 64, 256, 1024, and 4096 possible motifs for mono-, di-, tri-, tetra-, penta-, and hexanucleotide repeats, respectively. However, because microsatellites are tandemly repeated, some motifs are actually equivalent to others (D.N.A. Box 8.1). For reasons that will be discussed below, tetranucleotide repeats have become the most popular STR markers for human identification.

D.N.A. Box 8.1

List of Possible Microsatellite Motifs

source:

Jin, L., Zhong, Y., & Chakraborty, R. (1994). The exact numbers of possible microsatellite motifs [letter]. American Journal of Human Genetics, 55, 582–583.

Theoretically, there are 4, 16, 64, 256, 1024, and 4096 possible motifs for mono-, di-, tri-, tetra-, penta-, and hexanucleotide repeats, respectively. However, because microsatellites are tandemly repeated, some motifs are actually equivalent to others. Two rules can be used to identify whether motif A is equivalent to motif B. Motif A is considered equivalent to motif B when (1) motif A is inversely complementary to motif B, or (2) motif A is different from motif B or the inversely complementary sequence of motif B by frameshift. For example, (GAAA)n is equivalent to (AGAA)n, (AAGA)n, (AAAG)n, (TTTC)n, (TTCT)n, (TCTT)n, and (CTTT)n. Note that (AGAG)n is considered a dinucleotide repeat instead of a tetranucleotide motif.

Because of this equivalence in repeat motif structure there are only 2, 4, 10, 33, 102, and 350 possible motifs for mono-, di-, tri-, tetra-, penta-, and hexanucleotide repeats, respectively (see below).

Mononucleotide repeats (2):
A C
Dinucleotide repeats (4):
AC AG AT CG
Trinucleotide repeats (10):
AAC AAG AAT ACC ACG ACT AGC AGG ATC CCG
Tetranucleotide repeats (33):
AAAC AAAG AAAT AACC AACG AACT AAGC AAGG AAGT AATC
AATG AATT ACAG ACAT ACCC ACCG ACCT ACGC ACGG ACGT
ACTC ACTG AGAT AGCC AGCG AGCT AGGC AGGG ATCC ATCG
ATGC CCCG CCGG the AGAT (or GATA) motif is the most common motif for STR loci used by forensic scientists

Penta- (102) and hexanucleotide (350) repeats are not shown due to the sheer number of motifs possible.

STR sequences not only vary in the length of the repeat unit and the number of repeats but also in the rigor with which they conform to an incremental repeat pattern. STRs are often divided into several categories based on the repeat pattern. Simple repeats contain units of identical length and sequence, compound repeats comprise two or more adjacent simple repeats, and complex repeats may contain several repeat blocks of variable unit length as well as variable intervening sequences. Complex hypervariable repeats also exist with numerous nonconsensus alleles that differ in both size and sequence and are therefore challenging to genotype reproducibly. This last category of STR markers is not commonly used in forensic DNA typing due to difficulties with allele nomenclature and measurement variability between laboratories, although two commercial kits now include the complex hypervariable STR locus SE33, sometimes called ACTBP2.

Not all alleles for an STR locus contain complete repeat units. Even simple repeats can contain nonconsensus alleles that fall in between alleles with full repeat units. Microvariants are alleles that contain incomplete repeat units. Perhaps the most common example of a microvariant is the allele 9.3 at the TH01 locus, which contains nine tetranucleotide repeats and one incomplete repeat of three nucleotides because the seventh repeat is missing a single adenine out of the normal AATG repeat unit.

STRs used in forensic DNA typing

For human identification purposes it is important to have DNA markers that exhibit the highest possible variation or a number of less polymorphic markers that can be combined in order to obtain the ability to discriminate between samples. As will be discussed further in Chapter 14, forensic specimens are often challenging to PCR amplify because the DNA in the samples may be severely degraded (i.e., broken up into small pieces). Mixtures are prevalent as well in some forensic samples, such as those obtained from sexual assault cases containing biological material from both the perpetrator and victim.

As noted in Chapter 3, the small size of STR alleles (∼100 to 400 bp) compared to minisatellite VNTR alleles (∼400 to 1000 bp) makes the STR markers better candidates for use in forensic applications where degraded DNA is common. PCR amplification of degraded DNA samples can be better accomplished with smaller product sizes. These reduced-size STR amplicons are often referred to as miniSTRs. Allelic dropout of larger alleles in minisatellite markers caused by preferential amplification of the smaller allele is also a significant problem with minisatellites. There are multiple reasons why the smaller STRs are advantageous compared to the larger minisatellite VNTRs.

Among the various types of STR systems, tetranucleotide repeats have become more popular than di- or trinucleotides. Penta- and hexanucleotide repeats are less common in the human genome but are being examined by some laboratories. As discussed in Chapter 10, a biological phenomenon known as ‘stutter’ results when STR alleles are PCR amplified. Stutter products are amplicons that are typically one repeat unit less in size than the true allele and arise during PCR because of strand slippage. STR product amounts vary depending on the STR locus but are usually less than 15% of the allele product quantity with tetranucleotide repeats. With di- and trinucleotides, the stutter percentage can be much greater (30% or more), making it difficult to interpret sample mixtures. In addition, the four-base spread in alleles with tetranucleotides makes closely spaced heterozygotes easier to resolve with size-based electrophoretic separations compared to alleles that could be two or three bases different in size with dinucleotides and trinucleotide markers, respectively.

STR allele nomenclature

To aid in interlaboratory reproducibility and comparisons of data, a common nomenclature has been developed in the forensic DNA community. DNA results cannot be effectively shared unless all parties are speaking the same language and referring to the same conditions. (It would do little good to describe the recipe for baking a cake in a language that is not understood by both the recipe giver and the chef. For example, if the recipe says to turn the oven on to 450°F and the chef uses 450 Kelvin [∼250°F], the results would be vastly different.)

If one laboratory calls a sample 15 repeats at a particular STR locus and the same sample is designated 16 repeats by another laboratory, a match would not be considered, and the samples would be assumed to come from separate sources. As will be discussed in Chapter 12, the advent of national DNA databases with many laboratories contributing information to those databases has made it crucial to have internationally accepted nomenclature for designating STR alleles.

A repeat sequence is named by the structure (base composition) of the core repeat unit and the number of repeat units. However, because DNA has two strands, either of which may be used to designate the repeat unit for a particular STR marker, more than one choice is available and confusion can arise without a standard format. Also, where an individual starts counting the number of repeats can also make a difference. With double-stranded DNA sequences being read in the 5′ to 3′ direction, the choice of the strand impacts the sequence designation. For example, the ‘top’ strand for an STR marker may be 5′-…(GATA)n…-3′ while the ‘bottom’ strand for the same sequence would be 5′-…(TATC)n…-3′. Depending on the sequence surrounding the repeat region, the core repeat could be shifted relative to the other strand (Figure 8.2).

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 8.2. Example of the DNA sequence in a STR repeat region. Note that using the top strand versus the bottom strand results in different repeat motifs and starting positions. In this example, the top strand has 6 TCTA repeat units while the bottom strand has 6 TGAA repeat units. Under ISFG recommendations, the top strand from GenBank should be used. Thus, this example would be described as having [TCAT] as the repeat motif. Repeat numbering, indicated above and below the sequence, proceeds in the 5′-to-3′ direction as illustrated by the arrows.

Allelic ladders

An allelic ladder is an artificial mixture of the common alleles present in the human population for a particular STR marker. They are generated with the same primers as tested samples and thus provide a reference DNA size for each allele included in the ladder. Allelic ladders have been shown to be important for accurate genotype determinations. These allelic ladders serve as a standard like a measuring stick for each STR locus. They are necessary to adjust for different sizing measurements obtained from different instruments and conditions used by various laboratories.

Allelic ladders are constructed by combining genomic DNA or locus-specific PCR products from multiple individuals in a population, which possess alleles that are representative of the variation for the particular STR marker. The samples are then coamplified to produce an artificial sample containing the common alleles for the STR marker (Figure 8.3). Allele quantities are balanced by adjusting the input amount of each component so that the alleles are fairly equally represented in the ladder. For example, to produce a ladder containing five alleles with 6, 7, 8, 9, and 10 repeats, individual samples with genotypes of (6,8), (7,10), and (9,9) could be combined. Alternatively, the combination of genotypes could be (6,9), (7,8), and (10,10) or (6,6), (7,7), (8,8), (9,9), and (10,10).

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 8.3. Principle of allelic ladder formation. STR alleles from a number of samples are analyzed and compared to one another. Samples representing the common alleles for the locus are combined and reamplified to generate an allelic ladder. Each allele in the allelic ladder is sequenced since it serves as the reference material for STR genotyping. Allelic ladders are included in commercially available STR kits.

Additional quantities of the same allelic ladder (second- and third-generation ladders) may be produced by simply diluting the original ladder 1/1000 to 1/1,000,000 parts with deionized water and then reamplifying it using the same PCR primers. It is imperative that allelic ladders be generated with the same PCR primers as those used to amplify unknown samples so that the allele ‘rungs’ on the ladder will accurately line up with that of the repeat number of the unknown sample when the unknown is compared to the ladder. Commercial manufacturers now provide allelic ladders in their STR typing kits so that individual laboratories do not have to produce their own allelic ladders.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780123749994000084

Genotype and Phenotype

Richard C Lewontin, in International Encyclopedia of the Social & Behavioral Sciences (Second Edition), 2015

Description of Genotypes and Phenotypes

As a result of the discoveries in molecular and cellular biology since Johanssen’s time genotypic and phenotypic descriptions have become more concrete, more detailed and greatly amplified in their scope. Originally genotypes had no independent physical descriptors but were distinguished from each other entirely on the basis of the external phenotypic manifestations and the patterns of similarity in phenotypes of related individuals. ‘Genes’ were simply abstract placeholders of unknown physical composition, inferred entities that could be characterized only by their phenotypic effects.

The modern identification of DNA as the physical material that behaves in heredity in the manner inferred for Mendel’s mysterious ‘factors’ has allowed a precise physical description of the genotype of an organism. It is the complete spatially-ordered sequence of the four alternative nucleotide bases, adenine, cytosine, guanine and thymine contained in the DNA of the organism’s cells. In its most complete form this genotypic description includes not only the DNA in the nucleus of the cell but in certain cellular organelles such as mitochondria or chloroplasts that carry the DNA that is related to their physiological function and that is passed across generational lines. Phenotypic description has also been greatly amplified beyond the external characteristics of individuals. It now includes the gross structure of the various cells of the organism, the physico-chemical description of the various molecules within cells, including their shape and intra-cellular localization, and the network of biochemical reactions in which these molecules participate.

The concepts of phenotype and genotype require a distinction between types and tokens. Phenotype and genotype are, as the words indicate, types, sets of which individual organisms are members, sets delineated by the complete descriptors of those organisms. Any individual member of these type sets is a token of the type. In addition it is necessary to distinguish the descriptors of organisms from the physical object or behavior that is being described. The concepts phenotype and genotype refer to the descriptors. The physical objects that are being described are the phenome and genome of an organism, the actual physical material or behavior of some particular individual and its actual DNA molecules. If we take the definitions literally, then no two organisms belong to the same phenotype because there is always some physical difference in morphology, physiology and behavior between any two organisms, even between identical twins, cloned individuals, or cells that result from asexual reproduction. Moreover, except for the results of twinning, cloning or vegetative reproduction, no two individuals belong to the same genotype, as the process of mutation guarantees some variation in their DNA between any two individuals.

The rate of mutation per nucleotide per cell division is of the order of one in a billion, so for mammals, with over a billion nucleotides of DNA making up their genes, even identical twins are likely to have some small difference in total genotype. There is, in addition, an ambiguity in the assignment of an individual to a genotype because mutations occur during the millions of cell divisions that occur in development, so that all cells in the body do not have identical DNA.

In practice phenotypic and genotypic descriptions are not total but partial, referring to some aspect of the genome and phenome that are of particular interest, while ignoring the rest. The partial phenotypic and genotypic descriptions are partial in two senses. First, only some aspect of the morphology or function of the organism is characterized; say the rate of enzymatic splitting of alcohol, a phenotype, and the DNA sequences that code for the enzyme proteins involved in the reaction, a genotype. Second, a decision must be made about the collection of actual partial phenotypes and genotypes that will be regarded as indistinguishable. At the phenotypic level, given the actual variation in enzymatic rates from individual to individual and from time to time, what range of rates will be regarded as belonging to the same phenotypic class? For any phenotypic variable that is continuous, some boundaries of distinguishable phenotypic classes must be laid down in order to make a mapping between phenotype and genotype because genotypes, by their nature, are discrete classes.

The problem of aggregation exists at the genotypic level as well. The relation of DNA differences to differences in the amino acid sequence of a protein is not one-to-one. Because the DNA triplet code is redundant, substitutions of nucleotides in the third position of the triplet often do not change the amino acid that is specified, so for some purposes many different alternative DNA sequences of a gene count as the same genotype. On the other hand, nucleotide substitutions that do not affect the amino acid sequence of a protein may affect the rate of synthesis of that protein and so may, for some questions, be counted as belonging to different genotypes.

The decision about which aspects of total phenotype and genotype are relevant in creating genotypic and phenotypic classes is a deep question of biology that is largely ignored because everything cannot be considered at once. Sometimes biologists act as if the rest of the organism really were constant, sometimes they recognize the variation but claim that it is causally irrelevant to the phenomenon of interest and sometimes they admit its causal relevance but claim that the variation can be treated as background experimental noise that can be averaged out by repeated observations on a large class of individuals. These decisions are usually made either for the sake of observational convenience without a coherent rationale or from a commitment to a reductionist view of the organism that asserts that an understanding of the whole system can be obtained by breaking it down into ‘obvious’ small components to be studied in isolation.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780080970868810089

Chemistry of hybrid multifunctional and multibranched composites

Sergio D. Garcia Schejtman, ... Miriam C. Strumia, in Hybrid Polymer Composite Materials, Volume 1, 2017

3.4.1 Flat solid surfaces

Gold electrodes. One interesting example is the self-assembled monolayer formation of a series of thiophene dendron thiols with different generations and alkyl chain lengths onto gold surfaces [60]. The purpose of the cited study was to understand the chemisorption behavior of a series of hyperbranched thiols formed onto gold electrodes in terms of parameters such as size, alkyl chain length, and bulk concentration. The chemical structures of these dendron thiols are shown in Fig. 3.6A. A rearrangement adsorption/desorption kinetics to describe their behavior at the interface was elucidated by using surface plasmon resonance, electrochemistry, quartz crystal microbalance and water contact angle measurements [60]. An empirical three-step model explained the observed adsorption kinetics, which involved a two-step rearrangement process including a relatively faster short range and a much slower long range course, followed by an initial fast adsorption/desorption. For the lower generation dendron thiols, much faster adsorption rate constants were found compared to higher generation dendron thiols, due to the additional adherence of the thiophene sulfurs to gold surfaces. Dendrons with the longest alkyl chain showed the most tightly packed monolayer arrangement [60]. Another focally substituted organothiol dendron shown in Fig. 3.6B) also adlayers on gold by chemisorption of the thiol moiety onto this surface [61]. The covalent approach onto gold surfaces is also achieved with sulfides at the focal point as shown by Friggeri et al., who investigated the insertion process of individual dendron-sulfide molecules (Fig. 3.6C) into self-assembled monolayers of 11-mercapto-1-undecanol by atomic force microscopy, wettability, and electrochemical measurements [62]. A mechanism consisting of rapid dissociation of surface thiols followed by slow dendron adsorption was proposed considering the insertion of the individual dendron molecules as the rate-determining step of the process. The immersion of alkanethiol self-assembled layers (SAM) in solutions of increasing concentrations of dendron-sulfide leads to an increase in the number of dendritic molecules inserted into the thiol layer; on the contrary, the initial SAM quality (reached at different incubation times of alkanethiols) is not a determining factor for the dendron insertion process.

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 3.6. Scheme of some examples of (A and B) dendron-thiol and (C) dendron-sulfide molecules. (A) Adapted from Ref. [60] Copyright (2010), with permission from Wiley; (B) Adapted with permission from Ref. [61] Copyright (1998) American Chemical Society; (C) Adapted with permission from Ref. [62] Copyright (2000) American Chemical Society.

Dendrons with a carboxylic acid at the focal point [3,5-bis (3,5-dinitrobenzoylamino) benzoic acid] and nitro groups at the periphery (G1-NO2) were immobilized onto gold surfaces in a noncovalent approach and studied by electrochemistry and scanning tunneling microscopy (STM) [63]. G1-NO2 adsorbs onto gold electrode surfaces spontaneously by dipping the metal surface in dendron solution and also via grafting of cystamine covalently attached to a gold electrode. The reduction of these layers exhibits a well-behaved redox response for the adsorbed nitroso/hydroxylamine couple, useful for the electrocatalysis of reduced nicotine adenine dinucleotide (NADH), an important feature in amperometric sensors design [63].

Dong et al. reported the formation of special surface structures on gold arising from the precisely tailored structure of surface-bound thiol-dendrons with different main structures and peripheral substituents [64]. Among the factors controlling aggregation behavior, chemisorption is fairly similar in these compounds. Thus, intermolecular interaction greatly influences the configuration of dendrons on gold and different results were obtained from SAMs composed of symmetrical and asymmetric structures (Fig. 3.7). The different aggregation behavior observed was probably influenced by peripheral substituents and the coadsorption process, as reflected in the results of both STM and electrochemical measurements [64].

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 3.7. Chemical structure of gold-bonded dendron-thiols and the corresponding STM images.

Adapted from Reference [64], Copyright (2003), with kind permission of Wiley.

A library of compact and monodisperse dendritic scaffolds based on the nontoxic 2,2-bis(methylol)propionic acid (bis-MPA) was explored for binding SAM onto gold surfaces to exploit the design of hydrophilic dendritic structures bearing sulfur-containing core functionalities [65]. The size of the dendritic framework (G1–G3), the nature of the sulfur (whether thiol or disulfide), the functional end group (mannose or hydroxyl), and the distance between the dendritic wedge and the sulfur were key structural elements affecting the packaging densities assembled on the substrates. Surface interactions between multivalently presented motifs and cells in a controlled surface setting were evaluated by the cell scavenging ability of these sensor surfaces for Escherichia coli Ms7fim+ bacteria that revealed 2.5-fold enhanced recognition for G3-mannosylated surfaces compared to G3-hydroxylated SADM surfaces [65].

Dendritic polyglycerol (PG) derivatives with different numbers of amino groups have been attached onto gold substrates via thioctic acid linker and the selective interaction of complementary fluorescently labeled DNA proved the availability of such end groups for biomolecule attachment. These results demonstrate a new way to tailor hyperbranched surfaces by introducing amino moieties which can act as suitable anchoring sites for specific biomolecule interactions, while maintaining the resistant properties against non-specific protein adhesion. The protein-resistant properties of these PG-coated surfaces depend on the amino content, providing a combination of both important characteristics in bioelectronics or in the development of biosensing platforms with improved sensitivity [1,2].

Carbon surfaces. The functionalization of carbon surfaces with dendrons provides controllable properties for the electrode surface due to multifunctional groups of these molecules. For example, the cooperative effect of phenyl rings and the multifunctionality of G1-NO2 and G2-NO2 dendrons (Fig. 3.8) allow a direct, rapid and spontaneous physisorption of these dendrons onto carbon surfaces [66]. The AFM images show a network film with embedded aggregates that completely cover the carbon surfaces after only a few minutes, with average heights suggesting a tilted preferred adsorption in the early stages of the film formation, and highlighting a noticeable increment of this effect with increasing dendron generation. These molecules form a layer covering the whole surface, but do not block the electron transfer reaction of redox probes like Fe(CN)63−/4− or Ru(NH3)63+/2+. This effect, together with the remarkable simplicity of obtaining nitroaryl-ended films, makes these modified electrodes promising for electrocatalysis and biosensing platforms [67].

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 3.8. Chemical structure of G1-NO2 and G2-NO2 dendrons and the corresponding AFM images of these dendrons attached to carbon surfaces.

Boltorn H30 molecules are the third generation of commercial polyhydroxylated hyperbranched polymers, are approximately spherical in shape, have an average diameter larger than 3 nm and inner cavities that can be used for small molecule or ion inclusion. The spontaneous adsorption of Boltorn H30 onto carbon substrates was carried out with the idea of generating a nanostructured layer capable of retaining copper cations (II) inside. The modified electrode was subsequently incubated for a certain length of time (between 1 and 3 h) in CuCl2 solutions, washed in water and taken to an electrochemical cell to reduce the cations and generate the metallic nanoparticles [68]. These platforms are valuable for the electrocatalysis of hydrogen peroxide.

Silicon surfaces (Si wafers in air are covered with a thin SiO2 layer bearing OH groups as final functions). Monodisperse dendrons with urea/malonamide are used as linkages for organic thin-film transistor gate insulators [69]. These dendrons are hydrogen bond-rich and able to form stable self-assembled structures. Due to their strong intermolecular interactions, they have been successfully employed in nonlinear optics. In addition, dendrons with hydrogen bond-rich urea/malonamide linkages and peripheral long alkyl chains would certainly interact with tetracarboxylic diimide derivatives possessing perfluorinated alkyl chains on imide rings [69].

Indium tin oxide (ITO) electrodes (transparent conducting films). Recently, new hyperbranched p-conjugated macromolecules were obtained by the versatile electropolymerization method and exhibit interesting electronic properties that yield thin and stable layers with good electrical conductivity. Mangione et al. reported the synthesis and properties of peripherally carbazole (CBZ) functionalized starburst monomers, featuring the presence or absence of electroactive central core triphenylamine (TPA) moieties connected by conjugated or saturated branches. For this purpose, the dendron monomers were obtained by a convergent strategy and the CBZ residues allowed the formation of hyperbranched polymeric layers over conductive substrates by electrochemical polymerization. The radical cation coupling of oxidized CBZ leads to the growth of the dendritic structures. Thus, two different fully p-conjugated dendritic polymers are formed, with and without an electroactive central core connected to peripherals moieties (Fig. 3.9) [70].

In which image will adenine (a) be the next nucleotide to be added to the primer?

Figure 3.9. Chemical structure of p-conjugated dendrimers.

Reprinted from Ref. [70]. Copyright (2016), with permission from Elsevier.

Read full chapter

URL: https://www.sciencedirect.com/science/article/pii/B9780081007914000033

Takashi Taki, ... Ryo Shimada, in IATSS Research, 2019

3.2 Human DNA that differs among individuals

The majority of human DNA is encapsulated in organelles called cell nuclei; this is called genomic DNA. Genomic DNA consists of four bases (adenine, thymine, guanine, and cytosine), and the arrangement of these bases specifies all the information necessary for human survival. An attempt to decode the whole genomic DNA sequence started in the United States in 1990 as the “Human Genome Project,” [7,8] and the entire sequence was determined in 2003 [9]. Human genomic DNA is composed of 3 billion base pairs, and about 0.1% of the sequence differs among individuals. As a result of detailed investigations on the sequences that differ among individuals in the HapMap project that began in 2003 [10] and in the 1000 genomes project that began in 2008 [11], their positions, sequence type, and their frequencies have become clear. When performing individual identification using DNA, analysts use sequences in which differences are often identified rather than sequences in which individual differences are observed very rarely. These genomic sequence differences with a frequency in the population of 1% or more are called “polymorphisms” and are distinguished from “mutations” with a frequency of 1% or less. The sequence used for individual identification is “polymorphism”.

There are various types of DNA polymorphisms, including single nucleotide polymorphism (SNPs), repetitive sequence polymorphism, insertion deletion polymorphism (Indel), and copy number polymorphism (CNV) [12].

Read full article

URL: https://www.sciencedirect.com/science/article/pii/S0386111219300354

ADHD and criminality: A primer on the genetic, neurobiological, evolutionary, and treatment literature for criminologists

Catrina M. Schilling, ... Ilhong Yun, in Journal of Criminal Justice, 2011

The genetics and neurobiology of ADHD

The precise etiology of ADHD is not known, but numerous family, twin, and adoption studies reveal a strong genetic component. Heritability estimates for ADHD average about h2 = .80 and are consistently found regardless of whether it is considered a categorical or continuous trait (Bobb, Castellanos, Addington, & Rapoport, 2005). Behavioral genetic designs that produce heritability estimates show only that genes are involved in the etiology of ADHD, but not which genes; molecular genetics is required to do that. Molecular genetic studies show ADHD to be highly polygenic, with at least 50 genes with small to moderate effects being involved (Comings et al., 2005). The highly polygenic nature of ADHD probably explains why it is so clinically heterogeneous and why it is linked to many other problems such as CD, ODD, substance abuse, pathological gambling, and antisocial behavior in general.

The hunt for candidate genes implicated in any disorder involves looking at genetic polymorphisms. Polymorphism refers to the differences in allelic combinations (an allele is an alternate form of a gene; one inherited maternally and one paternally) that make us different from one another even though we share the same genes. There are three different types of polymorphisms: single nucleotide polymorphisms (SNPs), minisatellites and microsatellites (Altukhov & Salmenkova, 2002). As the term SNP implies, a difference in just one nucleotide is all that differentiates one allele from another. A nucleotide is one of the four nitrogenous bases (adenine, cytosine, guanine, thymine or uracil) plus a sugar and phosphate backbone that make up DNA and RNA. When we consider that the human genome contains approximately 3 billion base pairs we can appreciate the potential number of polymorphisms it contains. It is estimated that about 85% of genetic causes of most behavioral disorders are attributable to SNPs, the most frequently occurring polymorphism (Plomin et al., 2001). The other types are often lumped together as VNTRs (variable number of tandem repeats) that consist of sections of DNA repeated a different number of times in different alleles. We limit ourselves to the three polymorphisms that meta-analyses (Bobb et al., 2005; Gizer, Ficks, & Waldman, 2009) have shown to be most replicated in ADHD genetic studies and which are also implicated in criminal and other forms of antisocial behavior: the dopamine receptor D4 (DRD4), the dopamine transporter (DAT1), and the serotonin transporter (5-HTTLPR). We emphasize that genes associated with complex disorders such as ADHD are likely to have low penetrance and thus many carriers will not develop the disorder.

Whatever the genes associated with ADHD, their function is to manufacture neurotransmitters, receptors, transporters, or enzymes. These substances are utilized in all parts of the brain, but the most salient part for understanding ADHD is the prefrontal cortex (PFC), “the most uniquely human of all brain structures” (Goldberg, 2001, p. 2). This vital area has extensive connections with other cortical regions as well as with deep structures in the limbic system. Because of its many connections with other brain areas, it is generally considered to play the major integrative, as well as the major supervisory role in the brain, and is vital to the forming of moral judgments, mediating affect, and for social cognition (Romaine & Reynolds, 2005). It is the last part of the brain to mature—to become fully myelinated—which typically does not occur until mid-adolescence-early adulthood (Sowell et al, 2004). Myelin is the fatty substance that coats the neuronal axons and acts as insulation. Myelinated axons allow for speedier transmission of the electric message along the length of the axon. Not coincidentally, it is at this time that many ADHD symptoms especially impulsivity tend to subside. Although it is important to note that about 90% of ADHD individuals continue to exhibit some symptoms into adulthood (Willoughby, 2003).

ADHD can also be understood as a disorder of neurological regulation and underarousal, making the PFC a key focus of neuroscientists researching ADHD. The PFC provides us with knowledge about how others see and think about us, thus moving us to adjust our behavior to consider their needs, concerns, and expectations. These PFC functions are collectively referred to as executive functions (Tripp & Wickens, 2009), and are clearly involved in prosocial behavior if functioning normally. Beaver, Wright, and DeLisi (2007) have shown that deficits in neuropsychological functioning in various areas of the PFC are related to self-control levels and suggest that this revered criminological concept is an executive function. Meta-analyses of brain imaging studies show that reductions in the gray matter volume in the both left and right cortices of the PFC are the most robust and consistent brain region deficits associated with ADHD (Valera et al., 2007). The PFC requires optimal levels of the neurotransmitters dopamine and norepinephrine to function well. Too little dopamine and norepinephrine and we become drowsy, bored, or fatigued; too much and we are stressed, either of which impairs PFC functioning. In short: “Imaging studies have demonstrated that patients with ADHD have alterations in PFC circuits and demonstrate weaker PFC activation while trying to regulate attention and behavior” (Arnsten, 2009, p. 22).

Read full article

URL: https://www.sciencedirect.com/science/article/pii/S0047235210001960

Where will the next nucleotide be added?

DNA is always synthesized in the 5'-to-3' direction, meaning that nucleotides are added only to the 3' end of the growing strand. As shown in Figure 2, the 5'-phosphate group of the new nucleotide binds to the 3'-OH group of the last nucleotide of the growing strand.

What adds nucleotides to the end of the primer?

DNA polymerase adds nucleotides to the 3' end of a primer, extending the new chain in a 5' → 3' direction.

Which enzyme adds nucleotides to the 3 end of primer?

During elongation, an enzyme called DNA polymerase adds DNA nucleotides to the 3′ end of the template. Because DNA polymerase can only add new nucleotides at the end of a backbone, a primer sequence, which provides this starting point, is added with complementary RNA nucleotides.

What starts at the primer adding nucleotides?

The primer primes DNA synthesis, i.e., gets it started. Once the RNA primer is in place, DNA polymerase "extends" it, adding nucleotides one by one to make a new DNA strand that's complementary to the template strand.