10 Molecular Genetics
10.1 Learning Objectives
By the end of this chapter, you should be able to:
- Describe the evidence establishing DNA as the genetic material
- Explain the structure of DNA and how it enables replication and information storage
- Compare different types of mutations and their molecular consequences
- Describe DNA repair mechanisms and their importance
- Explain how genes are organized in prokaryotic and eukaryotic genomes
- Analyze the relationship between genotype and phenotype at the molecular level
- Describe techniques for analyzing DNA and genes
- Apply molecular genetics principles to understand genetic disorders
10.2 Introduction
Molecular genetics bridges classical genetics with biochemistry, providing a physical and chemical understanding of heredity. This chapter explores the molecular basis of genetic phenomena, from the structure of DNA to the mechanisms of mutation and repair. We will examine how genetic information is encoded in DNA molecules, how this information can change through mutation, and how cells maintain genomic integrity. Understanding molecular genetics provides the foundation for modern biotechnology, genetic engineering, and personalized medicine.
10.3 The Genetic Material
10.3.1 Historical Background
Early 20th century: Chromosomes known to carry genetic information, but composition debated
Candidates: Proteins (complex, diverse) vs. nucleic acids (simple, repetitive)
10.3.2 Griffith’s Transformation Experiment (1928)
Organism: Streptococcus pneumoniae
Strains: Smooth (virulent, encapsulated) and rough (avirulent, unencapsulated)
Key experiment: Heat-killed smooth + live rough → live smooth
Conclusion: “Transforming principle” from dead cells could change living cells
10.3.3 Avery, MacLeod, and McCarty (1944)
Question: What is the transforming principle?
Approach: Purify components from smooth bacteria, test transformation
Treatments: Proteases (destroy proteins), RNase (destroy RNA), DNase (destroy DNA)
Result: Only DNase destroyed transforming activity
Conclusion: DNA is the transforming principle (genetic material)
10.3.4 Hershey-Chase Experiment (1952)
Organism: Bacteriophage T2 (infects E. coli)
Labeling: ³²P (DNA) vs. ³⁵S (protein)
Experiment: Infect bacteria, blend to remove phage coats, centrifuge
Result: ³²P (DNA) inside bacteria, ³⁵S (protein) outside
Conclusion: DNA enters bacteria, directs phage reproduction (DNA is genetic material)
10.3.5 Chargaff’s Rules (1950)
Observations:
- DNA composition varies between species
- Within species: A = T, G = C (A+T ≠ G+C)
Significance: Provided clues to DNA structure
10.3.6 Watson, Crick, Franklin, and Wilkins (1953)
X-ray diffraction: Rosalind Franklin’s Photo 51 showed helical structure
Model building: Watson and Crick proposed double helix
Key features: Complementary base pairing, antiparallel strands, right-handed helix
10.4 DNA Structure
10.4.1 Chemical Components
Nucleotides: Building blocks of DNA
- Deoxyribose sugar: 5-carbon sugar (2’-deoxy)
- Phosphate group: Links nucleotides
- Nitrogenous bases: Purines (A, G) and pyrimidines (C, T)
Nucleoside: Base + sugar (no phosphate)
Nucleotide: Base + sugar + phosphate(s)
10.4.2 Double Helix Structure
Key features:
- Two antiparallel strands: 5’→3’ and 3’→5’
- Right-handed helix: B-DNA is most common form
- Base pairing: A-T (2 hydrogen bonds), G-C (3 hydrogen bonds)
- Major and minor grooves: Sites for protein binding
- Diameter: 2 nm
- Helical repeat: ~10 base pairs per turn (3.4 nm)
Stabilizing forces:
- Hydrogen bonds: Between complementary bases
- Base stacking: Hydrophobic interactions between stacked bases
- Electrostatic interactions: Negative phosphate repulsion minimized by cations
10.4.3 DNA Conformations
B-DNA: Most common, right-handed, physiological conditions
A-DNA: Right-handed, shorter and wider, dehydrated conditions
Z-DNA: Left-handed, alternating purine-pyrimidine sequences
Triple helix: H-DNA, formed in polypurine-polypyrimidine regions
10.4.4 Chromosome Organization
Prokaryotes: Circular DNA, supercoiled, nucleoid-associated proteins
Eukaryotes: Linear chromosomes, histone proteins, nucleosomes, higher-order structures
Chromatin:
- Euchromatin: Less condensed, transcriptionally active
- Heterochromatin: Highly condensed, transcriptionally inactive
10.5 DNA Replication (Detailed)
10.5.1 Semiconservative Replication
Meselson-Stahl Experiment (1958):
- Method: Density gradient centrifugation with ¹⁵N (heavy) and ¹⁴N (light)
- Prediction: Conservative (all heavy or all light) vs. semiconservative (hybrid)
- Result: After one generation: all hybrid; after two: half hybrid, half light
- Conclusion: Semiconservative replication
10.5.2 Replication Machinery
Initiation complex:
- Origin recognition complex (ORC): Binds origin in eukaryotes
- Helicase: Unwinds DNA (MCM in eukaryotes)
- Single-strand binding proteins (SSBs): Stabilize single strands
- Topoisomerases: Relieve supercoiling (DNA gyrase in bacteria)
Priming:
- Primase: Synthesizes RNA primer (10-12 nucleotides)
- Primosome: Complex containing primase and helicase
Elongation:
- DNA polymerase III: Main replicase in bacteria (ε and δ in eukaryotes)
- Sliding clamp: PCNA (eukaryotes) or β-clamp (bacteria)
- Clamp loader: RFC (eukaryotes) or γ-complex (bacteria)
Termination:
- Bacteria: Specific termination sites, Tus protein
- Eukaryotes: Telomeres, telomerase in stem/germ cells
10.5.3 Leading vs. Lagging Strand
Leading strand: Continuous synthesis 5’→3’
Lagging strand: Discontinuous synthesis (Okazaki fragments)
- Size: 100-200 nucleotides in eukaryotes, 1000-2000 in bacteria
- Processing: RNA primer removal, gap filling, ligation
10.5.4 Fidelity Mechanisms
- Base pairing specificity: Hydrogen bonding geometry
- 3’→5’ exonuclease proofreading: DNA polymerase activity
- Mismatch repair: Post-replication correction
- Error rate: ~10⁻⁹ per base (1 error per billion bases)
10.5.5 Eukaryotic Features
Multiple origins: 30,000-50,000 in human genome
Replication timing: Euchromatin replicates early, heterochromatin late
Licensing: Pre-replication complex ensures single replication per cycle
Telomere replication: Telomerase adds repeats to chromosome ends
10.6 Mutations
10.6.1 Types of Mutations
Point mutations (single base changes):
- Transition: Purine to purine (A↔︎G) or pyrimidine to pyrimidine (C↔︎T)
- Transversion: Purine to pyrimidine or vice versa
Functional consequences:
Silent mutation: Same amino acid (synonymous)
Missense mutation: Different amino acid
- Conservative: Similar properties
- Non-conservative: Different properties
Nonsense mutation: Stop codon created
Frameshift mutation: Insertion/deletion not multiple of 3
Chromosomal mutations:
- Deletion: Loss of segment
- Duplication: Repeat of segment
- Inversion: Segment reversed
- Translocation: Segment moved to different chromosome
10.6.2 Causes of Mutations
Spontaneous:
- Replication errors: Polymerase mistakes
- Tautomeric shifts: Rare base forms mispair
- Depurination: Loss of purine base (~10,000/cell/day)
- Deamination: C → U (~100/cell/day)
Induced:
Chemical mutagens:
- Base analogs: 5-bromouracil (T analog)
- Alkylating agents: EMS, adds alkyl groups
- Intercalating agents: Ethidium bromide, between bases
Radiation:
- UV light: Pyrimidine dimers (TT, CT, CC)
- Ionizing radiation: Single- and double-strand breaks
10.6.3 Mutation Rates
Variation across organisms:
- RNA viruses: 10⁻³ to 10⁻⁵ per base per replication
- DNA viruses: 10⁻⁶ to 10⁻⁸
- Bacteria: ~10⁻⁹ to 10⁻¹⁰
- Eukaryotes: ~10⁻⁹ to 10⁻¹¹
Factors affecting rate:
- Genome size
- Generation time
- DNA repair efficiency
- Environmental exposure
10.7 DNA Repair Mechanisms
10.7.1 Direct Reversal
Photolyase: Light-dependent repair of pyrimidine dimers
O⁶-methylguanine methyltransferase (MGMT): Removes methyl groups from guanine
10.7.2 Base Excision Repair (BER)
For: Small, non-helix-distorting base lesions
Steps:
- DNA glycosylase removes damaged base
- AP endonuclease nicks backbone
- DNA polymerase fills gap
- DNA ligase seals nick
Examples: Uracil (from deaminated cytosine), oxidized bases
10.7.3 Nucleotide Excision Repair (NER)
For: Bulky, helix-distorting lesions
Steps:
- Recognition of damage
- Excision of oligonucleotide (24-32 bases)
- DNA synthesis to fill gap
- Ligation
Types:
- Global genomic NER: Throughout genome
- Transcription-coupled NER: Active genes
Diseases: Xeroderma pigmentosum (NER defect → skin cancer)
10.7.4 Mismatch Repair (MMR)
For: Replication errors (base mismatches, small insertions/deletions)
Recognition: MutS protein binds mismatch
Key feature: Distinguishes template strand from new strand (methylation in bacteria, nicks in eukaryotes)
Disease: Hereditary nonpolyposis colorectal cancer (HNPCC, Lynch syndrome)
10.7.5 Double-Strand Break Repair
Non-homologous end joining (NHEJ):
- Direct ligation of broken ends
- Error-prone (loss/gain of nucleotides)
- Major pathway in mammalian cells
Homologous recombination (HR):
- Uses sister chromatid as template
- Error-free
- Requires homologous sequence
- Major pathway in yeast
10.7.6 Translesion Synthesis
Purpose: Bypass lesions that block replication
Mechanism: Specialized DNA polymerases (error-prone)
Trade-off: Cell survival vs. mutation
10.8 Gene Organization
10.8.1 Prokaryotic Genes
Operons: Clustered genes with related functions
- Polycistronic mRNA: Single transcript encoding multiple proteins
- Examples: lac operon, trp operon
Gene density: High (~1 gene per kb)
Non-coding DNA: Minimal (regulatory sequences, origins)
10.8.2 Eukaryotic Genes
Exon-intron structure:
- Exons: Coding sequences
- Introns: Intervening non-coding sequences
- Alternative splicing: Different combinations of exons
Gene families: Related genes from duplication events
- Examples: Globin genes, histone genes
Non-coding DNA:
- Regulatory sequences: Promoters, enhancers, silencers
- Repetitive DNA: Tandem repeats, transposable elements
- Pseudogenes: Non-functional copies of genes
10.8.3 Genome Size Paradox (C-value paradox)
Observation: Genome size doesn’t correlate with organismal complexity
Examples:
- Human: 3.2 Gb
- Lungfish: 130 Gb
- Paris japonica (plant): 150 Gb
Explanation: Non-coding DNA varies widely
10.9 Genotype to Phenotype
10.9.1 Biochemical Pathways
One gene-one enzyme hypothesis (Beadle and Tatum, 1941):
- Each gene encodes one enzyme
- Modified to: One gene-one polypeptide
Metabolic pathways: Mutations block steps
Example: Phenylketonuria (PKU)
- Gene: Phenylalanine hydroxylase
- Block: Phenylalanine → tyrosine
- Result: Phenylalanine accumulation, mental retardation
10.9.2 Protein Structure and Function
Sickle cell anemia:
- Mutation: β-globin Glu6Val
- Effect: Hydrophobic patch causes polymerization
- Phenotype: Sickled cells, anemia, pain crises
CFTR in cystic fibrosis:
- Mutation: ΔF508 (deletion of phenylalanine)
- Effect: Misfolding, defective chloride channel
- Phenotype: Thick mucus, lung infections
10.9.3 Genetic Heterogeneity
Locus heterogeneity: Mutations in different genes cause same phenotype
Example: Retinitis pigmentosa (mutations in >60 genes)
Allelic heterogeneity: Different mutations in same gene cause same phenotype
Example: Cystic fibrosis (>2000 CFTR mutations)
10.9.4 Penetrance and Expressivity
Penetrance: Proportion of individuals with genotype who show phenotype
- Complete: All show phenotype
- Incomplete: Some show phenotype
Expressivity: Degree of phenotype expression
Example: Neurofibromatosis type 1 (variable expression)
10.10 Molecular Genetic Techniques
10.10.1 DNA Sequencing
Sanger sequencing (1977): Dideoxy chain termination
Next-generation sequencing (NGS): Massively parallel, high throughput
Third-generation sequencing: Single-molecule, long reads
10.10.2 Polymerase Chain Reaction (PCR)
Inventor: Kary Mullis (1983)
Components: Template DNA, primers, nucleotides, Taq polymerase
Cycles: Denaturation, annealing, extension
Amplification: 2ⁿ copies after n cycles
Applications: Cloning, diagnostics, forensics, ancient DNA
10.10.3 Gel Electrophoresis
Separates: DNA fragments by size
Agarose gels: Larger fragments (0.1-50 kb)
Polyacrylamide gels: Smaller fragments, higher resolution
10.10.4 Southern, Northern, Western Blotting
Southern: DNA detection (Edwin Southern, 1975)
Northern: RNA detection
Western: Protein detection
10.10.5 Genetic Engineering
Recombinant DNA technology: Combining DNA from different sources
Restriction enzymes: Cut DNA at specific sequences
DNA ligase: Joins DNA fragments
Vectors: Plasmids, viruses, artificial chromosomes
Applications: Gene cloning, protein production, transgenic organisms
10.11 Chapter Summary
10.11.1 Key Concepts
- DNA is genetic material: Evidence from transformation, phage, and biochemical experiments
- DNA structure: Double helix with complementary base pairing enables replication and information storage
- DNA replication: Semiconservative, requires multiple enzymes, high fidelity
- Mutations: Changes in DNA sequence, spontaneous and induced, various types
- DNA repair: Multiple mechanisms maintain genomic integrity
- Gene organization: Differs between prokaryotes and eukaryotes
- Genotype-phenotype relationship: Through biochemical pathways and protein function
- Molecular techniques: Enable analysis and manipulation of DNA
10.11.2 Evidence for DNA as Genetic Material
| Experiment | Organism | Key Finding | Year |
|---|---|---|---|
| Griffith | S. pneumoniae | Transforming principle | 1928 |
| Avery et al. | S. pneumoniae | DNA is transforming principle | 1944 |
| Hershey-Chase | T2 phage | DNA enters bacteria | 1952 |
| Chargaff | Various | A=T, G=C | 1950 |
| Watson-Crick | - | Double helix structure | 1953 |
10.11.3 DNA Repair Pathways
| Pathway | Damage Type | Key Enzymes | Diseases if Defective |
|---|---|---|---|
| BER | Small base lesions | Glycosylases, AP endonuclease | - |
| NER | Bulky lesions | XPA-XPG proteins | Xeroderma pigmentosum |
| MMR | Mismatches | MutS, MutL | HNPCC (Lynch syndrome) |
| NHEJ | Double-strand breaks | Ku, DNA-PKcs, XRCC4 | Radiation sensitivity |
| HR | Double-strand breaks | Rad51, BRCA1/2 | Breast/ovarian cancer |
10.11.4 Mutation Types and Effects
| Mutation Type | Molecular Change | Possible Effect |
|---|---|---|
| Silent | Same amino acid | None |
| Missense | Different amino acid | Variable (beneficial to deleterious) |
| Nonsense | Stop codon | Truncated protein, often nonfunctional |
| Frameshift | Insertion/deletion | Completely different protein sequence |
| Splice site | Alters splicing | Altered mRNA, often nonfunctional |
10.12 Review Questions
10.12.1 Level 1: Recall and Understanding
- Summarize the evidence that DNA is the genetic material.
- Describe the structure of the DNA double helix.
- What does “semiconservative replication” mean?
- List three types of DNA repair mechanisms.
- Distinguish between exons and introns.
10.12.2 Level 2: Application and Analysis
- If a DNA molecule is 20% adenine, what are the percentages of thymine, cytosine, and guanine?
- Explain how a point mutation in a gene could have no effect, a small effect, or a large effect on the protein product.
- Why are mismatch repair systems more effective immediately after DNA replication?
- Compare the organization of genes in prokaryotes and eukaryotes.
- How does the one gene-one enzyme hypothesis explain inborn errors of metabolism?
10.12.3 Level 3: Synthesis and Evaluation
- Design an experiment to determine whether a new chemical is mutagenic.
- Evaluate the statement: “DNA repair systems have evolved to balance fidelity with adaptability.”
- How does our understanding of molecular genetics explain why some genetic diseases show variable expression?
- Propose how knowledge of DNA repair mechanisms could lead to new cancer treatments.
10.13 Key Terms
- Transformation: Uptake and expression of foreign DNA
- Double helix: Two antiparallel polynucleotide chains wound around each other
- Complementary base pairing: A with T, G with C
- Semiconservative replication: Each new DNA molecule contains one old and one new strand
- Mutation: Change in DNA sequence
- Point mutation: Change in single base pair
- Frameshift mutation: Insertion or deletion altering reading frame
- DNA repair: Correction of DNA damage
- Exon: Coding region of gene
- Intron: Non-coding intervening sequence
- Operon: Cluster of prokaryotic genes with related functions
- Polymerase chain reaction (PCR): Amplification of specific DNA sequences
- Restriction enzyme: Enzyme cutting DNA at specific sequences
- Recombinant DNA: DNA combined from different sources
10.14 Further Reading
10.14.1 Books
- Watson, J. D., et al. (2014). Molecular Biology of the Gene (7th ed.). Pearson.
- Lewin, B., et al. (2018). Genes XII. Jones & Bartlett.
- Alberts, B., et al. (2022). Molecular Biology of the Cell (7th ed.). W. W. Norton.
10.14.2 Scientific Articles
- Watson, J. D., & Crick, F. H. C. (1953). Molecular structure of nucleic acids: A structure for deoxyribose nucleic acid. Nature, 171(4356), 737-738.
- Meselson, M., & Stahl, F. W. (1958). The replication of DNA in Escherichia coli. PNAS, 44(7), 671-682.
- Avery, O. T., MacLeod, C. M., & McCarty, M. (1944). Studies on the chemical nature of the substance inducing transformation of pneumococcal types. Journal of Experimental Medicine, 79(2), 137-158.
10.14.3 Online Resources
- DNA Learning Center: https://dnalc.cshl.edu
- NCBI Molecular Biology Resources: https://www.ncbi.nlm.nih.gov/guide/molecular-biology/
- Protein Data Bank: https://www.rcsb.org
10.15 Quantitative Problems
- DNA Composition:
- If a DNA molecule contains 18% cytosine, what are the percentages of the other bases?
- If a DNA molecule is 1,000 base pairs long and has 30% adenine, how many hydrogen bonds hold the two strands together?
- Calculate the length (in μm) of this DNA molecule if each base pair extends 0.34 nm.
- Mutation Rates: The human genome is 3.2 × 10⁹ bp. Mutation rate is 1 × 10⁻⁸ per base per generation.
- How many new mutations per genome per generation?
- If 2% of the genome is coding, how many affect protein-coding regions?
- If 90% of these are neutral, how many potentially deleterious mutations occur per generation?
- PCR Amplification: Starting with 1 DNA molecule, after 30 PCR cycles:
- How many copies are theoretically produced?
- If efficiency is 90% per cycle, how many copies are actually produced?
- If each amplicon is 500 bp, what is the total mass of DNA (in ng) if each bp averages 660 Da?
10.16 Case Study: BRCA Genes and DNA Repair
Background: BRCA1 and BRCA2 genes are involved in DNA repair, particularly homologous recombination.
Questions:
- What types of DNA damage do BRCA proteins help repair?
- How do mutations in BRCA genes increase cancer risk?
- Why are breast and ovarian tissues particularly susceptible?
- How does understanding the molecular function of BRCA genes inform cancer treatment strategies?
Data for analysis:
- BRCA1: 1863 amino acids, multiple domains
- BRCA2: 3418 amino acids, binds RAD51
- Mutation carriers: 45-65% lifetime breast cancer risk (vs. 12% general)
- Treatment implications: PARP inhibitors for BRCA-deficient cancers
Next Chapter: Regulation of Gene Expression