SlideShare a Scribd company logo
3
Most read
6
Most read
10
Most read
STRUCTURAL
GENOMICS
Presented By
Aqsa Javed
IBI2016001
Introduction to Structural Genomics
• When the human genome was completely sequenced in 2003,
researchers started thinking in how many ways it used.
• One hope was that the genome sequences would lead to a greater
understanding of how genes and their encoded proteins function
• As the number of known gene sequences grew, many scientists realized
they could not catch up simply by determining protein structures one by
one. So a group of scientists embarked on a strategic plan to uncover
the three-dimensional structures of all the proteins that these genes
encode.
• This endeavor is called structural genomics.
Introduction to Structural Genomics
 Structural genomics is a term that refers to high-throughput three-dimensional
structure determination and analysis of biological macromolecules, at this stage
primarily individual protein domains.
 It mainly aims to solve the 3 dimensional structures of proteins at a rapid rate and in
a cost effective manner.
 It uses a genome based approach to describe the 3- dimensional structure of every
protein encoded by a given genome.
 Structural genomics has emerged as one of the most powerful approaches for
defining the structure of proteins .
 Several structural proteomics groups pursue the structures of proteins that are
"unique", generally ones that have less than 30% sequence identity to a protein with
a known structure in the Protein Data Bank.
 Structural genomics is expected to yield a large number of experimental
protein structures (tens of thousands) and an even larger number of
calculated comparative protein structure models (millions). This enormous
body of structural data will be freely available, and promises to accelerate
scientific discovery in all areas of biological science.
 As the protein structure and function are closely linked, the importance of
structural genomics in understanding the function of proteins is paramount.
Structural genomics can also provide insight in dynamic properties such as
protein folding and identify possible targets that may be used for drug
discovery.
Structural Genomics
Differences between Structural Genomics and Traditional
Structural Prediction
 Structural genomics attempts to
determine the structure of every
protein encoded by the genome,
rather than focusing on one
particular protein.
 Structural genomics aims to first
determine the structure of proteins,
and then investigate their function
later.
Traditional Structural Prediction
 Traditional structure prediction
focuses on one particular protein at
a time.
 The function of a protein of interest
is first identified and then its
structure determined by means of
X-ray or NMR experiments.
In general terms the process involves:
1. PCR amplification of the coding sequence from the target genome.
2. Cloning the coding sequence into an appropriate expression vector.
3. Expressing the protein at a sufficiently high level.
4. Sequencing the cloned gene to verify that the coding sequence was
correctly amplified.
Structural Genomics Process
5. Confirming the identity of the expressed protein and
characterizing it as a prelude to NMR or crystallographic studies.
6. Obtaining the protein in sufficient amounts and do purification for
either approach.
7. Defining suitable crystallization or NMR solution conditions.
8. NMR or X-ray measurement .
Structural Genomics Process(Contd.)
9. Determining and refining the experimental structure
10.Calculating comparative protein structure models using this new
template
11.Making functional inferences from the structure
12..Publishing the structure to PDB.
• Failures are anticipated at every step, making the process somewhat
akin to a funnel.
Structural Genomics Process(Contd.)
Structural Genomics Pipeline
Strategic considerations
• Target selection is the most important strategic issue confronting the
structural genomics pilot studies. Their respective performances will
be measured in terms of the number of structures determined, what
fraction contain novel folds, their impact on biology, and the cost per
structure.
• Before selecting as to which structure to target , one needs to be
convinced of the social and medical benefits of the structural
genomics initiative.
• Projects have been made from exhaustive studies of all proteins
found in a model organism (Methanococcus jannaschii,
Mycobacterium tuberculosis) to selectively chosen targets from a
large number of different organisms.
• Another important strategic issue concerns money. To date, no
clear picture has emerged as to how much it will cost to determine
10,000–20,000 experimental structures..
• Time is also an issue because it is not a simple matter to decide
when a structural genomics initiative will be complete.
Strategic considerations(Contd.)
• In the year 2002-
2003,314 structures
resulting from structure
genomics were reported
by TargetDB.
• During the same period,
a total of 3324
structures were
deposited with the PDB
.
• Thus structure
genomics is currently
contributing
approximately 10% of
structures to the field of
structural biology.
 The flood of new genomic sequence information together with technological
innovations in protein structure determination have led to worldwide structural
genomics (SG) initiatives.
 The goals of SG initiatives are to accelerate the process of protein structure
determination, to fill in protein fold space and to provide information about the
function of uncharacterized proteins. In the long-term, these outcomes are
likely to impact on medical biotechnology and drug discovery, leading to a
better understanding of disease as well as the development of new
therapeutics
Structural Genomics Initiatives
Protein Structure Initiative
• Prominent Structural Genomics projects can be roughly lumped into
three groups: the Japan-based program, led by RIKEN, called Protein
3000; the Protein Structure Initiative (PSI) and the efforts from the
European research community.
• The most important among them was Protein Structure
Initiative(PSI).It was a USA based project established in 2000 by the
National Institute of Health (NIH) and National Institute of General
Medical Sciences(NIGMS) that aimed at accelerating discovery in
structural genomics.
• It was the most productive of the initiatives taken for structural genomics,
having solved around 3000 new structures between 2000 and 2008.
• The project was organized into three separate phases.
• The first phase of the Protein Structure Initiative (PSI-1) spanned from
2000 to 2005, and was dedicated to demonstrating the feasibility of high-
throughput structure determination, solving unique protein structures,
and preparing for a the second phase.
• The second phase, PSI-2, focused on implementing the high-throughput
structure determination methods developed in PSI-1, as well as
addressing bottlenecks like modeling membrane proteins.
• The third phase, PSI : Biology, began in 2010 and consisted of networks
of investigators applying high-throughput structure determination to
study a broad range of biological and biomedical problems.
• PSI program ended in 2015
Protein Structure Initiative
Structural Genomics
Tools and Databases for Structural Genomics
Examples of Structural Genomics
1. Thermotogo maritima proteome
• The Protein Structure Initiative (PSI) solved the structures for all the proteins in
Thermotogo maritima, a thermophillic bacterium.
• T. maritima was selected as a structural genomics target based on its relatively small
genome consisting of 1,877 genes and the hypothesis that the proteins expressed
by a thermophilic bacterium would be easier to crystallize.
• E.coli was used to express all the open-reading frames (ORFs) of T. martima. These
proteins were then crystallized and structures were determined for successfully
crystallized proteins using X-ray crystallography.
• Among other structures, this structural genomics approach allowed for the
determination of the structure of the TM0449 protein, which was found to exhibit a
novel fold(a tetramer with four interconnected active sites, each containing a flavin
adenine dinucleotide molecule).
Examples of Structural Genomics(Contd.)
2. Mycobacterium tuberculosis proteome
• The structures of potential drug targets in Mycobacterium tuberculosis, the
bacterium that causes tuberculosis were determined by TB Structural
Genomics Consortium.
• The development of novel drug therapies against tuberculosis are particularly
important given the growing problem of multi-drug-resistant tuberculosis.
• The fully sequenced genome of M. tuberculosis allowed scientists to clone
many of these protein targets into expression vectors for purification and
structure determination by X-ray crystallography.
• So far, structures have been determined for 708 of the proteins encoded by M.
tuberculosis.
Limitations and challenges to Structural Genomics
• As the structural genomics efforts continues to mature and
structure determination for the easy proteins or ‘low hanging
fruit’ is completed, attention will naturally turn to the more
difficult problems.
• The future challenges include expression and purification of
proteins from eukaryotic organisms containing post-
translational modifications, membrane proteins and large
multi-domain protein complexes.
• There are some regions of protein structure space that will not
succumb immediately to either NMR or X-ray methods.
Limitations and Challenges to Structural Genomics(Contd.)
• Novel high throughput, cost-effective expression protocols
will need to be developed .
• Membrane protein crystallization continues to represent a
considerable technical challenge, but advances in protein
purification and crystallization may ease these difficulties.
References
1. Goldsmith-Fischman Sharon,Honig B. 2003 “Structural
genomics: Computational methods for structure analysis”
Protein Sci. 2003 Sep; 12(9): 1813–1821
2. Burley SK “An overview of structural genomics” Nat Struct
Biol. 2000 Nov;7 Suppl:932-4.
3. Bourne PE et al. “The status of structural genomics defined
through the analysis of current targets and structures” Pac
Symp Biocomput. 2004:375-86.
THANK YOU

More Related Content

PPTX
Structural genomics
PPT
Structural genomics
PDF
Genome Assembly
PPTX
Functional genomics, and tools
PPTX
Genomics(functional genomics)
PPTX
Functional genomics
PPT
PPTX
Comparative genomics
Structural genomics
Structural genomics
Genome Assembly
Functional genomics, and tools
Genomics(functional genomics)
Functional genomics
Comparative genomics

What's hot (20)

PPTX
Express sequence tags
PPTX
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
PPTX
Comparative genomics
PDF
Secondary Structure Prediction of proteins
PDF
PPTX
Physical mapping
PPTX
Proteomics
PPTX
GENOMICS AND BIOINFORMATICS
PPTX
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PPTX
PPTX
PPTX
Functional proteomics, methods and tools
PPTX
Techniques in proteomics
PPTX
Proteomics
PPTX
Comparative genomics
PPTX
PROTEIN MICROARRAYS
PPT
COMPARATIVE GENOMICS.ppt
PPTX
Proteomics ppt
PPTX
RFLP ,RAPD ,AFLP, STS, SCAR ,SSCP & QTL
Express sequence tags
STRUCTURAL GENOMICS, FUNCTIONAL GENOMICS, COMPARATIVE GENOMICS
Comparative genomics
Secondary Structure Prediction of proteins
Physical mapping
Proteomics
GENOMICS AND BIOINFORMATICS
PHYSICAL MAPPING STRATEGIES IN GENOMICS
Functional proteomics, methods and tools
Techniques in proteomics
Proteomics
Comparative genomics
PROTEIN MICROARRAYS
COMPARATIVE GENOMICS.ppt
Proteomics ppt
RFLP ,RAPD ,AFLP, STS, SCAR ,SSCP & QTL
Ad

Similar to Structural Genomics (20)

PPTX
GENOMICS
PPTX
genomics proteomics metbolomics.pptx
PPTX
Genomics and proteomics by shreeman
PPTX
INTRODUCTION OF Genes AND GENOMICS .pptx
PPTX
GENOMICS_functional_genomics_INTRODUCTION.pptx
PPTX
BASIC OF BIOINFORMATICS.pptx
PDF
Protein structure analysis
PPTX
Protein engineering
PPTX
Bioinformatics, comparative genemics and proteomics
DOCX
Bioinformatics
PDF
proteomics and bioinformatics lecturesss
PDF
Importance of microbial research
PDF
Proteomics contributes to your microbial research
PDF
Proteomics, techniques, applications.pdf
PDF
omics field soils. Soil Science (2015..pdf
PDF
Role of genomics proteomics, and bioinformatics.
PPTX
genomics, types, applications and importance
PPTX
Topic 5 of the genomics and proteomics.pptx
PPT
Genome data management
PPTX
APLICATION OF BIOINFORMATION ANALYST.pptx
GENOMICS
genomics proteomics metbolomics.pptx
Genomics and proteomics by shreeman
INTRODUCTION OF Genes AND GENOMICS .pptx
GENOMICS_functional_genomics_INTRODUCTION.pptx
BASIC OF BIOINFORMATICS.pptx
Protein structure analysis
Protein engineering
Bioinformatics, comparative genemics and proteomics
Bioinformatics
proteomics and bioinformatics lecturesss
Importance of microbial research
Proteomics contributes to your microbial research
Proteomics, techniques, applications.pdf
omics field soils. Soil Science (2015..pdf
Role of genomics proteomics, and bioinformatics.
genomics, types, applications and importance
Topic 5 of the genomics and proteomics.pptx
Genome data management
APLICATION OF BIOINFORMATION ANALYST.pptx
Ad

Recently uploaded (20)

PPTX
BODY FLUIDS AND CIRCULATION class 11 .pptx
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPTX
A powerpoint on colorectal cancer with brief background
PPTX
TORCH INFECTIONS in pregnancy with toxoplasma
PPT
6.1 High Risk New Born. Padetric health ppt
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPT
Animal tissues, epithelial, muscle, connective, nervous tissue
PPTX
Fluid dynamics vivavoce presentation of prakash
PPTX
BIOMOLECULES PPT........................
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
PPTX
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPTX
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
PPTX
Substance Disorders- part different drugs change body
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPTX
endocrine - management of adrenal incidentaloma.pptx
BODY FLUIDS AND CIRCULATION class 11 .pptx
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
A powerpoint on colorectal cancer with brief background
TORCH INFECTIONS in pregnancy with toxoplasma
6.1 High Risk New Born. Padetric health ppt
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
Animal tissues, epithelial, muscle, connective, nervous tissue
Fluid dynamics vivavoce presentation of prakash
BIOMOLECULES PPT........................
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
Substance Disorders- part different drugs change body
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
endocrine - management of adrenal incidentaloma.pptx

Structural Genomics

  • 2. Introduction to Structural Genomics • When the human genome was completely sequenced in 2003, researchers started thinking in how many ways it used. • One hope was that the genome sequences would lead to a greater understanding of how genes and their encoded proteins function • As the number of known gene sequences grew, many scientists realized they could not catch up simply by determining protein structures one by one. So a group of scientists embarked on a strategic plan to uncover the three-dimensional structures of all the proteins that these genes encode. • This endeavor is called structural genomics.
  • 3. Introduction to Structural Genomics  Structural genomics is a term that refers to high-throughput three-dimensional structure determination and analysis of biological macromolecules, at this stage primarily individual protein domains.  It mainly aims to solve the 3 dimensional structures of proteins at a rapid rate and in a cost effective manner.  It uses a genome based approach to describe the 3- dimensional structure of every protein encoded by a given genome.  Structural genomics has emerged as one of the most powerful approaches for defining the structure of proteins .  Several structural proteomics groups pursue the structures of proteins that are "unique", generally ones that have less than 30% sequence identity to a protein with a known structure in the Protein Data Bank.
  • 4.  Structural genomics is expected to yield a large number of experimental protein structures (tens of thousands) and an even larger number of calculated comparative protein structure models (millions). This enormous body of structural data will be freely available, and promises to accelerate scientific discovery in all areas of biological science.  As the protein structure and function are closely linked, the importance of structural genomics in understanding the function of proteins is paramount. Structural genomics can also provide insight in dynamic properties such as protein folding and identify possible targets that may be used for drug discovery.
  • 5. Structural Genomics Differences between Structural Genomics and Traditional Structural Prediction  Structural genomics attempts to determine the structure of every protein encoded by the genome, rather than focusing on one particular protein.  Structural genomics aims to first determine the structure of proteins, and then investigate their function later. Traditional Structural Prediction  Traditional structure prediction focuses on one particular protein at a time.  The function of a protein of interest is first identified and then its structure determined by means of X-ray or NMR experiments.
  • 6. In general terms the process involves: 1. PCR amplification of the coding sequence from the target genome. 2. Cloning the coding sequence into an appropriate expression vector. 3. Expressing the protein at a sufficiently high level. 4. Sequencing the cloned gene to verify that the coding sequence was correctly amplified. Structural Genomics Process
  • 7. 5. Confirming the identity of the expressed protein and characterizing it as a prelude to NMR or crystallographic studies. 6. Obtaining the protein in sufficient amounts and do purification for either approach. 7. Defining suitable crystallization or NMR solution conditions. 8. NMR or X-ray measurement . Structural Genomics Process(Contd.)
  • 8. 9. Determining and refining the experimental structure 10.Calculating comparative protein structure models using this new template 11.Making functional inferences from the structure 12..Publishing the structure to PDB. • Failures are anticipated at every step, making the process somewhat akin to a funnel. Structural Genomics Process(Contd.)
  • 10. Strategic considerations • Target selection is the most important strategic issue confronting the structural genomics pilot studies. Their respective performances will be measured in terms of the number of structures determined, what fraction contain novel folds, their impact on biology, and the cost per structure. • Before selecting as to which structure to target , one needs to be convinced of the social and medical benefits of the structural genomics initiative.
  • 11. • Projects have been made from exhaustive studies of all proteins found in a model organism (Methanococcus jannaschii, Mycobacterium tuberculosis) to selectively chosen targets from a large number of different organisms. • Another important strategic issue concerns money. To date, no clear picture has emerged as to how much it will cost to determine 10,000–20,000 experimental structures.. • Time is also an issue because it is not a simple matter to decide when a structural genomics initiative will be complete. Strategic considerations(Contd.)
  • 12. • In the year 2002- 2003,314 structures resulting from structure genomics were reported by TargetDB. • During the same period, a total of 3324 structures were deposited with the PDB . • Thus structure genomics is currently contributing approximately 10% of structures to the field of structural biology.
  • 13.  The flood of new genomic sequence information together with technological innovations in protein structure determination have led to worldwide structural genomics (SG) initiatives.  The goals of SG initiatives are to accelerate the process of protein structure determination, to fill in protein fold space and to provide information about the function of uncharacterized proteins. In the long-term, these outcomes are likely to impact on medical biotechnology and drug discovery, leading to a better understanding of disease as well as the development of new therapeutics Structural Genomics Initiatives
  • 14. Protein Structure Initiative • Prominent Structural Genomics projects can be roughly lumped into three groups: the Japan-based program, led by RIKEN, called Protein 3000; the Protein Structure Initiative (PSI) and the efforts from the European research community. • The most important among them was Protein Structure Initiative(PSI).It was a USA based project established in 2000 by the National Institute of Health (NIH) and National Institute of General Medical Sciences(NIGMS) that aimed at accelerating discovery in structural genomics.
  • 15. • It was the most productive of the initiatives taken for structural genomics, having solved around 3000 new structures between 2000 and 2008. • The project was organized into three separate phases. • The first phase of the Protein Structure Initiative (PSI-1) spanned from 2000 to 2005, and was dedicated to demonstrating the feasibility of high- throughput structure determination, solving unique protein structures, and preparing for a the second phase. • The second phase, PSI-2, focused on implementing the high-throughput structure determination methods developed in PSI-1, as well as addressing bottlenecks like modeling membrane proteins. • The third phase, PSI : Biology, began in 2010 and consisted of networks of investigators applying high-throughput structure determination to study a broad range of biological and biomedical problems. • PSI program ended in 2015 Protein Structure Initiative
  • 17. Tools and Databases for Structural Genomics
  • 18. Examples of Structural Genomics 1. Thermotogo maritima proteome • The Protein Structure Initiative (PSI) solved the structures for all the proteins in Thermotogo maritima, a thermophillic bacterium. • T. maritima was selected as a structural genomics target based on its relatively small genome consisting of 1,877 genes and the hypothesis that the proteins expressed by a thermophilic bacterium would be easier to crystallize. • E.coli was used to express all the open-reading frames (ORFs) of T. martima. These proteins were then crystallized and structures were determined for successfully crystallized proteins using X-ray crystallography. • Among other structures, this structural genomics approach allowed for the determination of the structure of the TM0449 protein, which was found to exhibit a novel fold(a tetramer with four interconnected active sites, each containing a flavin adenine dinucleotide molecule).
  • 19. Examples of Structural Genomics(Contd.) 2. Mycobacterium tuberculosis proteome • The structures of potential drug targets in Mycobacterium tuberculosis, the bacterium that causes tuberculosis were determined by TB Structural Genomics Consortium. • The development of novel drug therapies against tuberculosis are particularly important given the growing problem of multi-drug-resistant tuberculosis. • The fully sequenced genome of M. tuberculosis allowed scientists to clone many of these protein targets into expression vectors for purification and structure determination by X-ray crystallography. • So far, structures have been determined for 708 of the proteins encoded by M. tuberculosis.
  • 20. Limitations and challenges to Structural Genomics • As the structural genomics efforts continues to mature and structure determination for the easy proteins or ‘low hanging fruit’ is completed, attention will naturally turn to the more difficult problems. • The future challenges include expression and purification of proteins from eukaryotic organisms containing post- translational modifications, membrane proteins and large multi-domain protein complexes. • There are some regions of protein structure space that will not succumb immediately to either NMR or X-ray methods.
  • 21. Limitations and Challenges to Structural Genomics(Contd.) • Novel high throughput, cost-effective expression protocols will need to be developed . • Membrane protein crystallization continues to represent a considerable technical challenge, but advances in protein purification and crystallization may ease these difficulties.
  • 22. References 1. Goldsmith-Fischman Sharon,Honig B. 2003 “Structural genomics: Computational methods for structure analysis” Protein Sci. 2003 Sep; 12(9): 1813–1821 2. Burley SK “An overview of structural genomics” Nat Struct Biol. 2000 Nov;7 Suppl:932-4. 3. Bourne PE et al. “The status of structural genomics defined through the analysis of current targets and structures” Pac Symp Biocomput. 2004:375-86.

Editor's Notes