7 The Central Dogma in Action
7.1 Learning Objectives
By the end of this chapter, you should be able to:
- Describe the flow of genetic information according to the central dogma of molecular biology
- Explain the processes of DNA replication, transcription, and translation in detail
- Compare and contrast prokaryotic and eukaryotic gene expression
- Describe post-transcriptional and post-translational modifications
- Explain how mutations occur and how cells repair DNA damage
- Analyze how information theory applies to biological information processing
- Describe experimental evidence supporting key concepts in molecular biology
- Apply understanding of the central dogma to explain genetic disorders and biotechnology applications
7.2 Introduction
The central dogma of molecular biology describes the flow of genetic information within biological systems: DNA → RNA → protein. This chapter explores how cells implement this information flow, examining the molecular mechanisms that ensure accurate transmission of genetic information from one generation to the next and its expression within each cell. We will investigate how the abstract concept of biological information, introduced in Chapter 3, manifests in concrete molecular processes that underlie all cellular functions.
7.3 The Central Dogma: Overview
7.3.1 Historical Development
Francis Crick (1958): First articulated the central dogma
Original formulation: Information flows from nucleic acids to proteins, not reverse
Modern understanding: DNA → RNA → protein, with important exceptions
7.3.2 Information Flow Pathways
General transfers (occur in all cells):
- DNA → DNA (replication)
- DNA → RNA (transcription)
- RNA → protein (translation)
Special transfers (occur in specific cases):
- RNA → RNA (replication in RNA viruses)
- RNA → DNA (reverse transcription in retroviruses)
Never observed:
- Protein → protein
- Protein → DNA
- Protein → RNA
7.3.3 Exceptions and Modifications
Prions: Protein → protein conformational change
Epigenetic inheritance: Information transmitted without DNA sequence change
RNA editing: Post-transcriptional alteration of RNA sequences
Alternative splicing: Multiple proteins from single gene
7.4 DNA Replication
7.4.1 Overview
Purpose: Duplicate genetic material for cell division
Requirements: Template DNA, nucleotides, enzymes, energy
Key features: Semiconservative, bidirectional, semidiscontinuous
7.4.2 Molecular Mechanism
Initiation:
- Origin of replication: Specific sequences where replication begins
- Initiation proteins: Bind origin, separate strands
- Replication bubble: Region of separated strands
- Replication forks: Y-shaped regions where synthesis occurs
Elongation:
- DNA helicase: Unwinds double helix
- Single-strand binding proteins: Stabilize separated strands
- Topoisomerase: Relieves torsional strain ahead of fork
- Primase: Synthesizes RNA primers
- DNA polymerase III: Main replicase in bacteria
- Leading strand: Continuous synthesis 5’→3’
- Lagging strand: Discontinuous synthesis (Okazaki fragments)
Termination:
- Termination sequences: Specific sites in bacteria
- Telomeres: Ends of eukaryotic chromosomes
- Telomerase: Adds repeats to telomeres in stem/germ cells
7.4.3 Enzymes and Proteins
Prokaryotic replication:
- DNA polymerase I: Removes primers, fills gaps
- DNA polymerase III: Main replicase
- DNA ligase: Joins Okazaki fragments
- DNA gyrase: Type II topoisomerase
Eukaryotic replication:
- Multiple DNA polymerases (α, δ, ε)
- PCNA: Sliding clamp
- RFC: Clamp loader
7.4.4 Accuracy and Proofreading
Error rate: ~10^-9 per base pair
Mechanisms:
- Base pairing specificity: Hydrogen bonding geometry
- 3’→5’ exonuclease activity: Proofreading by DNA polymerase
- Mismatch repair: Post-replication correction
- Nucleotide excision repair: Removes damaged bases
Fidelity: Combined mechanisms achieve high accuracy
7.5 Transcription
7.5.1 Overview
Purpose: Synthesize RNA from DNA template
Types of RNA:
- mRNA: Messenger RNA (codes for proteins)
- tRNA: Transfer RNA (carries amino acids)
- rRNA: Ribosomal RNA (structural component of ribosomes)
- Other RNAs: snRNA, miRNA, siRNA, lncRNA
7.5.2 Prokaryotic Transcription
Initiation:
- Promoter: -10 (Pribnow box) and -35 regions
- RNA polymerase: Holoenzyme (core + σ factor)
- Closed complex: Polymerase binds promoter
- Open complex: DNA unwinds at start site
Elongation:
- RNA polymerase: Synthesizes RNA 5’→3’
- Transcription bubble: ~17 bp unwound region
- RNA-DNA hybrid: ~8 bp
Termination:
- Rho-dependent: ρ protein binds rut site, terminates
- Rho-independent: Hairpin formation in RNA
7.5.3 Eukaryotic Transcription
RNA polymerases:
- RNA Pol I: rRNA (except 5S)
- RNA Pol II: mRNA, snRNA, miRNA
- RNA Pol III: tRNA, 5S rRNA, other small RNAs
Transcription factors:
- General TFs: Required for all Pol II transcription
- Specific TFs: Regulate specific genes
- Enhancers: Distant regulatory sequences
- Mediator complex: Bridges TFs and polymerase
Initiation complex: >50 proteins at promoter
7.5.4 RNA Processing (Eukaryotes)
5’ capping: 7-methylguanosine cap added immediately
3’ polyadenylation: Poly-A tail added (100-250 A’s)
Splicing: Removal of introns, joining of exons
- Spliceosome: snRNP complex catalyzes splicing
- Alternative splicing: Different exons combined
RNA editing: Base modification (A→I, C→U)
Nuclear export: Mature mRNA exported through nuclear pores
7.6 Translation
7.6.1 Overview
Purpose: Synthesize protein from mRNA template
Components: mRNA, ribosomes, tRNA, amino acids, factors
Genetic code: Triplet code, degenerate, nearly universal
7.6.2 Genetic Code Properties
Triplet code: 3 nucleotides = 1 amino acid
Degenerate: Multiple codons for most amino acids
Non-overlapping: Read in consecutive triplets
Commaless: No gaps between codons
Universal: Same in nearly all organisms (minor variations)
Start codon: AUG (methionine)
Stop codons: UAA, UAG, UGA
7.6.3 Transfer RNA (tRNA)
Structure: Cloverleaf secondary structure, L-shaped tertiary
Anticodon: Base pairs with mRNA codon
Amino acid attachment: Covalent bond to 3’ end
Aminoacyl-tRNA synthetases: Attach correct amino acid to tRNA
- Accuracy: High specificity prevents errors
- Proofreading: Some synthetases have editing sites
7.6.4 Ribosomes
Prokaryotic: 70S (50S + 30S subunits)
Eukaryotic: 80S (60S + 40S subunits)
Components: rRNA and proteins
Functional sites:
- A site: Aminoacyl-tRNA binding
- P site: Peptidyl-tRNA binding
- E site: Exit site
7.6.5 Translation Mechanism
Initiation:
- Prokaryotic: Shine-Dalgarno sequence, initiation factors
- Eukaryotic: 5’ cap scanning, Kozak sequence, more factors
Elongation:
- Aminoacyl-tRNA binding: To A site (EF-Tu in bacteria)
- Peptide bond formation: Catalyzed by peptidyl transferase
- Translocation: Ribosome moves one codon (EF-G in bacteria)
- Rate: ~15-20 amino acids per second
Termination:
- Release factors: Recognize stop codons
- Polypeptide release: From ribosome
- Ribosome recycling: Subunits separate
7.6.6 Post-translational Modifications
Folding: Assisted by chaperones
Cleavage: Removal of signal peptides, propeptides
Chemical modifications: Phosphorylation, glycosylation, acetylation
Disulfide bonds: Stabilize tertiary structure
Proteolytic processing: Activation of zymogens
Targeting: Signals for specific cellular locations
7.7 Regulation of Gene Expression
7.7.1 Prokaryotic Regulation
Operon model (Jacob and Monod):
- Operon: Cluster of genes with related functions
- Promoter: RNA polymerase binding site
- Operator: Repressor binding site
- Structural genes: Code for enzymes
Lac operon: Inducible system for lactose metabolism
- Lactose absent: Repressor bound, no transcription
- Lactose present: Allolactose binds repressor, transcription occurs
- Glucose effect: Catabolite repression via cAMP-CAP
Trp operon: Repressible system for tryptophan synthesis
- Tryptophan absent: No repression, transcription occurs
- Tryptophan present: Corepressor binds repressor, transcription blocked
7.7.2 Eukaryotic Regulation
Transcriptional control:
- Chromatin remodeling: Histone modifications, nucleosome positioning
- DNA methylation: Typically represses transcription
- Transcription factors: Activate or repress transcription
- Enhancers/silencers: Distant regulatory elements
Post-transcriptional control:
- Alternative splicing: Different mRNA isoforms
- RNA stability: Control of mRNA degradation
- RNA interference: miRNA, siRNA regulation
Translational control:
- Initiation factors: Phosphorylation regulates activity
- mRNA secondary structure: Affects ribosome binding
- microRNAs: Block translation or cause degradation
Post-translational control:
- Protein modification: Affects activity, stability, localization
- Protein degradation: Ubiquitin-proteasome pathway
7.8 Mutations and DNA Repair
7.8.1 Types of Mutations
Point mutations:
- Silent: No amino acid change (same codon)
- Missense: Different amino acid
- Nonsense: Stop codon introduced
- Frameshift: Insertion/deletion not multiple of 3
Chromosomal mutations:
- Deletion: Loss of chromosome segment
- Duplication: Repeat of segment
- Inversion: Segment reversed
- Translocation: Segment moved to different chromosome
7.8.2 Causes of Mutations
Spontaneous:
- Replication errors: Polymerase mistakes
- Tautomeric shifts: Rare base forms
- Depurination: Loss of purine base
- Deamination: C → U conversion
Induced:
- Chemical mutagens: Base analogs, alkylating agents
- Radiation: UV light (pyrimidine dimers), ionizing radiation
- Intercalating agents: Insert between bases
7.8.3 DNA Repair Mechanisms
Direct reversal:
- Photolyase: Repairs pyrimidine dimers (light-dependent)
- MGMT: Removes methyl groups from guanine
Excision repair:
- Base excision repair: Removes damaged bases
- Nucleotide excision repair: Removes oligonucleotides (bulky lesions)
- Mismatch repair: Corrects replication errors
Double-strand break repair:
- Non-homologous end joining: Error-prone, direct ligation
- Homologous recombination: Error-free, uses sister chromatid
Translesion synthesis: Error-prone bypass of lesions
7.9 Information Theory Applications
7.9.1 Information Content of DNA
Maximum information: 2 bits per base (4 equally likely bases)
Actual information: ~1.8-1.9 bits/base (non-random distribution)
Coding regions: Lower entropy (conserved sequences)
Non-coding regions: Higher entropy (more variable)
7.9.2 Error Rates and Channel Capacity
DNA replication: Error rate ~10^-9 per base
Transcription: Error rate ~10^-4 per base
Translation: Error rate ~10^-4 per amino acid
Channel capacity: DNA → protein information transmission
- Noise sources: Polymerase errors, environmental damage
- Error correction: Multiple repair mechanisms
- Redundancy: Genetic code degeneracy provides error buffering
7.9.3 Evolutionary Information
Sequence conservation: Indicates functional importance
Information gain: Natural selection increases information about environment
Molecular evolution rates:
- Synonymous substitutions: Higher rate (neutral)
- Nonsynonymous substitutions: Lower rate (subject to selection)
7.10 Experimental Foundations
7.10.1 Key Experiments
Avery-MacLeod-McCarty (1944): DNA is transforming principle
Hershey-Chase (1952): DNA is genetic material of phage
Meselson-Stahl (1958): Semiconservative DNA replication
Nirenberg-Matthaei (1961): Deciphering genetic code
Jacob-Monod (1961): Operon model of gene regulation
7.10.2 Modern Techniques
DNA sequencing: Sanger, next-generation, third-generation
Gene expression analysis: Microarrays, RNA-seq
Protein detection: Western blot, immunofluorescence, mass spectrometry
Gene editing: CRISPR-Cas9, TALENs, zinc finger nucleases
7.11 Chapter Summary
7.11.1 Key Concepts
- Central dogma: DNA → RNA → protein information flow
- DNA replication: Semiconservative, requires multiple enzymes
- Transcription: RNA synthesis from DNA template
- Translation: Protein synthesis from mRNA template
- Genetic code: Triplet, degenerate, nearly universal
- Gene regulation: Controls expression at multiple levels
- Mutations: Changes in DNA sequence, repaired by multiple mechanisms
- Information theory: Applies to accuracy and evolution of biological information
7.11.2 Comparison: Prokaryotes vs. Eukaryotes
| Process | Prokaryotes | Eukaryotes |
|---|---|---|
| Transcription/Translation | Coupled | Separated (nuclear/cytoplasmic) |
| mRNA processing | Minimal | Extensive (capping, polyA, splicing) |
| Gene organization | Operons | Individual genes with introns |
| Regulation | Mainly transcriptional | Multiple levels |
7.11.3 Accuracy of Information Transfer
| Step | Error rate | Correction mechanisms |
|---|---|---|
| DNA replication | ~10^-9/base | Proofreading, mismatch repair |
| Transcription | ~10^-4/base | Lower fidelity acceptable |
| Translation | ~10^-4/amino acid | Proofreading, kinetic selection |
| Overall | Very high | Multiple redundant systems |
7.11.4 Information Flow Summary
DNA (replication) → DNA
↓
RNA (transcription)
↓
Protein (translation)
↓
Cellular function
7.12 Review Questions
7.12.1 Level 1: Recall and Understanding
- State the central dogma of molecular biology and its three general transfers.
- Describe the semiconservative model of DNA replication.
- List the three types of RNA involved in protein synthesis and their functions.
- What are the properties of the genetic code?
- Name three types of DNA repair mechanisms.
7.12.2 Level 2: Application and Analysis
- Compare transcription and translation in prokaryotes and eukaryotes.
- Explain how the lac operon is regulated in response to lactose and glucose.
- A mutation changes a codon from UUU to UUA. What type of mutation is this, and what are its potential effects?
- How does alternative splicing increase proteomic diversity?
- Calculate the information content of a DNA sequence with base frequencies: A=0.25, T=0.25, C=0.30, G=0.20.
7.12.3 Level 3: Synthesis and Evaluation
- Design an experiment to determine whether a newly discovered virus has DNA or RNA as its genetic material.
- Evaluate the statement: “The central dogma needs revision in light of modern discoveries like prions and epigenetics.”
- How does information theory help explain why some DNA sequences evolve faster than others?
- Propose a mechanism for how a cell might coordinate the expression of genes involved in a metabolic pathway.
7.13 Key Terms
- Central dogma: DNA → RNA → protein information flow
- Replication: Process of copying DNA
- Transcription: Synthesis of RNA from DNA template
- Translation: Synthesis of protein from mRNA template
- Codon: Three-nucleotide sequence specifying an amino acid
- Anticodon: Three-nucleotide sequence on tRNA complementary to codon
- Operon: Cluster of prokaryotic genes with related functions
- Promoter: DNA sequence where RNA polymerase binds to initiate transcription
- Intron: Non-coding sequence in eukaryotic genes, removed by splicing
- Exon: Coding sequence in eukaryotic genes, retained in mature mRNA
- Mutation: Change in DNA sequence
- DNA repair: Mechanisms correcting DNA damage
- Genetic code: Correspondence between nucleotide triplets and amino acids
- Ribosome: Cellular structure where translation occurs
- RNA polymerase: Enzyme that synthesizes RNA from DNA template
- Transcription factor: Protein regulating transcription
7.14 Further Reading
7.14.1 Books
- Watson, J. D., et al. (2014). Molecular Biology of the Gene (7th ed.). Pearson.
- Lodish, H., et al. (2021). Molecular Cell Biology (9th ed.). W. H. Freeman.
- Berg, J. M., et al. (2015). Biochemistry (8th ed.). W. H. Freeman.
7.14.2 Scientific Articles
- Crick, F. H. C. (1970). Central dogma of molecular biology. Nature, 227(5258), 561-563.
- Meselson, M., & Stahl, F. W. (1958). The replication of DNA in Escherichia coli. PNAS, 44(7), 671-682.
- Nirenberg, M. W., & Matthaei, J. H. (1961). The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides. PNAS, 47(10), 1588-1602.
7.14.3 Online Resources
- NCBI Molecular Biology Resources: https://www.ncbi.nlm.nih.gov/guide/molecular-biology/
- DNA Learning Center: https://dnalc.cshl.edu
- Protein Data Bank: https://www.rcsb.org
7.15 Quantitative Problems
Information Content:
- Calculate Shannon entropy for a DNA sequence with equal base frequencies.
- For a sequence with A=0.2, T=0.2, C=0.3, G=0.3.
- Which sequence has higher information content? Why?
Mutation Rate Analysis:
The human genome is 3.2 × 10^9 bp. Mutation rate is 10^-8 per base per generation.
- How many new mutations per genome per generation?
- If only 2% is coding DNA, how many affect protein sequences?
- What percentage of these are likely deleterious?
Translation Rate:
A protein has 300 amino acids. Translation rate is 15 amino acids/second.
- How long does synthesis take?
- If ribosome footprint is 30 nucleotides, how many ribosomes can be on an mRNA simultaneously?
- Calculate protein production rate if mRNA half-life is 2 hours.
7.16 Case Study: Thalassemia and Gene Expression Defects
Background: β-thalassemia results from reduced or absent β-globin synthesis.
Questions:
- How could mutations in promoter, splice sites, or coding sequence cause thalassemia?
- Why do some mutations affect RNA processing while others affect translation?
- How might understanding transcription factors help develop therapies?
- Design experiments to determine which step in gene expression is affected by a new thalassemia mutation.
Data for analysis:
- β-globin gene: 3 exons, 2 introns
- Normal hemoglobin: 2 α-globin + 2 β-globin chains
- Mutations: >200 known in β-globin gene
- Symptoms: Anemia, organ damage, bone deformities
Next Chapter: Cell Communication and Signaling