Lecture 12 (Structural Bioinformatics) Cbdb30310921cec2c447276bb2d88a8f
Lecture 12 (Structural Bioinformatics) Cbdb30310921cec2c447276bb2d88a8f
Introduction to Structural
bioinformatics
Lecture #12
Anas Abdelrahman Elzein
PhD Biochemistry and Molecular Biology
Zhejiang University
1
February 2025
Structural bioinformatics
Structural bioinformatics is the
sub-discipline of bioinformatics
that focuses on the
representation, storage retrieval,
analysis, and display of structural
information at the atomic and sub-
cellular scales.
2
STRUCTURE → FUNCTION relationship
• X-ray crystallography
• NMR (Nuclear Magnetic Resonance)
3
X-ray Crystallography & NMR
➢ Obtained much more slowly.
➢ Techniques involve elaborate technical procedures
➢ Many proteins fail to crystallize at all and/or
cannot be obtained or dissolved in large enough
quantities for NMR measurements
➢ The size of the protein is also a limiting factor for
NMR
5
Four Levels of Protein Structure
Proteins have four levels of structure: primary,
secondary, tertiary, and quaternary.
7
8
Can we predict structure from
sequence?
GCTCCTCACTGTCTGTGTTTATTCTTTTAGCTTCTTC
AGATCTTTTAGTCTGAGGAAGCCTGGCATGTGCA
AATGAAGTTAACCTAA
9
Structure Prediction
?
GPSRYIVMVVM…
Note:-
Secondary structure prediction is hard,
tertiary structure prediction is even harder. 15
Structure Prediction
It could be determined in silico using
comparative or homology modelling, which
allows the construction of a 3D model of the
"target" protein from:-
• PredictProtein
(https://www.predictprotein.org/)
• …..etc
12
13
PDB
https://www.rcsb.org/
14
PDB
Since structure and function are closely related,
the PDB aims at bridging the gap between the
function of large biomolecules and the role that
they universally play in number of organisms
including bacteria, yeast, plants, mice, and in
healthy as well as diseased humans.
The key to understanding the function of a
molecule is its shape.
VIEWING THE 3D STRUCTURE OF
PROTEINS (AND OTHER BIOLOGICAL
MACROMOLECULES)
16
The PDB ID is a four-character unique
identifier, consisting of numbers and
letters, assigned to a protein or other
biological macromolecule submitted to the
PDB. The PDB is an archive of the structure
of proteins and other biological
macromolecules.
17
• PDB IDs are usually written in uppercase.
Some examples of PDB IDs are 2HHD
(human hemoglobin, deoxy form), 9INS
(pig insulin), and 2VRY (mouse
neuroglobin).
25
Paste and Click Go
26
22
Three-dimensional structure
➢ Helps in the rational design of site-directed
mutations
John C. Kendrew
Today
and his model of
myoglobin
Max F. Perutz
and his first
high resolution
model of
haemoglobin
29
Homology Modeling
• Simplest, reliable approach
• Basis: proteins with similar sequences tend to
fold into similar structures
• Has been observed that even proteins with
25% sequence identity fold into similar
structures
• Does not work for remote homologs (< 25%
pairwise identity)
The homology modelling procedure:
• Template selection; Select those
structures that will be used as templates.
This step could be done by the modelling
software.
32
• Successful homology modelling depends on
the following:
▪ Template quality
▪ Alignment (add biological information)
▪ Modelling program/procedure (use more than one)
29
30