Genomics is the study of all the genes in an organism. It builds on recombinant DNA technology by using high-throughput approaches to analyze larger datasets computationally. Key techniques in genomics include sequencing entire genomes, assembling sequences, understanding gene expression and networks, and applying genomics to medicine through drug development, diagnostics, and personalized healthcare. Microarrays allow simultaneous analysis of thousands of genes and have applications in cancer diagnosis, prognosis, and identifying genes involved in metastasis. DNA sequencing methods like chain termination sequencing use dideoxynucleotides to terminate DNA synthesis at each base. Subcloning propagates DNA fragments using vectors and recombinant DNA techniques.
2. What is Genomics?
Genomcis is the study of all genes present in an
organism
In 1986 mouse geneticist Thomas Roderick used
Genomics for mapping, sequencing and
characterizing genomes
3. Introduction
Genomics built on recombinant-DNA technology
(developed since early 1970s)
Thorough understanding of recombinant-DNA
techniques
Prerequisite for understanding genomics
technologies
Differences between genomics and
recombinant-DNA technology
Genomics is high throughput approaches to allow
more analyses in parallel
Genomics is dependent on computational
analysis due to larger data sets
4. Sequence the entire genome by cutting it into small,
manageable pieces (fragments)
Assemble the entire genome from the pieces
(fragments)
Make sense of the genome
Understand how gene expression takes place?
How life processes are networked?
Understand life??
5. Technical Foundations of
Genomic and cDNA libraries
Genomics
DNA
Hybridization and Northern blots
Subcloning in vectors
Restriction-enzyme mapping
DNA sequencing
PCR amplification
Genomics and Medicine
6. What we hope to gain from
genomics
-Drug, diagnostics, and prognostics development
- Genotyping to predict patient susceptibility to
disease
- Personalized healthcare based on an individuals
genomic features
genome
decision support systems
genotype
molecular profile
patient history
knowledge base
drugs diagnostics prognostics
health
8. How to make a genomic library
ori
total genomic DNA
ampR
genomic
DNA
restriction
enzyme
anneal
and ligate
ori
ampR
ori
plasmid (black)
ampR
ori
ampR
ori
ampR
same
restriction
enzyme
transform E. coli;
select for
Amp resistance
11. Microarrays
Basis of microarrays for determining gene expression
Process by which complementary strands find each
other
AT and CG base pairing
speed and fidelity: dependent on temperature, salt,
sequence, and concentration (High temp and low
salt)
12.
Microarrays permit the simultaneous analysis of the
RNA expression of thousands of genes.
For fully sequenced genomes, microarrays can be
used to analyze the expression of every gene.
Prior to the introduction of microarrays, RNA
abundance was usually analyzed through
hybridization to RNA bound to filters. These Northern
blots normally had no more than 2040 lanes, and no
more than three probes could be used
simultaneously.
In contrast, microarrays can interrogate 30,000
genes at the same time, vastly increasing our ability
to analyze RNA expression.
14. Cross-hybridization
Hybridization to a related, but not identical,
sequence = cross-hybridization
Example: A probe from one member of a gene family
is likely to hybridize to all other members
Problem in microarrays, particularly cDNA arrays
Oligonucleotide arrays prescreened to eliminate
sequences likely to cross-hybridize
15. Improved disease diagnostics from
genomics
Microarray analysis of
gene expression from four
different types of tumors
16. Microarrays and cancer
Histology not always effective tool for prognosis and
diagnosis
Microarrays distinguish cancerous tissues on the basis
of a gene expression profile
Use in diagnosis (presence)
Example: characterizing acute lymphoblastic leukemia. Also
breast cancer.
Use in prognosis
Example: assessing the likelihood of metastasis in
medulloblastoma (brain tumor in children)
17. Microarrays in the prognosis of metastasis
(childhood cancer: medulloblastoma)
Identified 85 genes with
different levels of expression in
metastatic (M+) and nonmetastatic tumors (M_)
59 up and 26 down
72% accuracy in predicting
metastasis
Identified genes induced in
metastasis
Could serve as potential
drug targets for in vitro
experiments
platelet derived growth
factor receptor alpha
(PDGFR留). Antibodies
prevent migration.
M
M+
green =
down
regulated
red = up
regulated
18. Cancer genome projects
One in three people will suffer from cancer in his or her lifetime.
Cancer Genome Anatomy Project (CGAP)
Established 1997 by National Cancer Institute (USA)
Specializes in EST sequencing
Human Cancer Genome Project (HCGP)
Established 1999 by Brazilian research groups
Cancer Genome Project (CGP)
Established 2000 by Wellcome Trust and Sanger Institute
(United Kingdom)
Specializes in genomic mutations leading to cancer
Funding: $15 million to $60 million
20. Affymetrix oligonucleotide arrays
The array elements
are a series of 25mer oligos designed
from known sequence
and synthesized
Directly on the surface
The entire array is
formed by >500,000
cells, each containing
a different oligo
21. subcloning
Propagating fragments
of cloned DNA
Used for sequencing
and protein production
Plasmid vectors
Replicate in bacteria
Resistant to antibiotics
Cloning sites
ORI
Region
into which
DNA can
be inserted
Plasmid
cloning
vector
ampr
22. Subcloning: vector and fragment
Vector and fragment to be
DNA
fragment
inserted must have compatible
ends
Sticky ends anneal
Enzyme ligase makes covalent
cloning
vector
bond between vector and
fragment
Use of recombination instead of
restriction sites
recombinant
plasmid
restriction
enzymes
24. DNA sequencing
Most current sequencing projects use the chain
termination method
Based on action of DNA polymerase
Also known as Sanger sequencing, after its inventor, Fredrick
Sanger
Adds nucleotides to complementary strand
Requires template DNA and primer
28. Sequence detection
To detect products of sequencing
reaction
Include labeled nucleotides
Formerly, radioactive labels were
used
Now fluorescent labels
Use different fluorescent tag for
each nucleotide
Can run all four reactions in
same lane
29. Pyrosequencing
based on the sequencing by synthesis principle.
it relies on the detection of pyrophosphate release on
nucleotide incorporation
The technique was developed by Mostafa Ronaghi
and P奪l Nyr辿n at the Royal Institute of Technology in
Stockholm in 1996
33. Resources
Bioinformatics, Genomics, and Proteomics (Ann Finney Batiza, Ph.D.)
SciencePages
Functional Genomics (Michael Kaufmann and Claudia Klinger
Private Universitt, Witten/Herdecke gGmbH, Witten, Germany)
Introduction to Genomics by Arthur M. Lesk, 2007, Oxford University Press