Tutorial Discovery Studio
Tutorial Discovery Studio
Kimmo Mattila
[email protected]
Discovery Studio
Discovery Studio at CSC
Installing Discovery Studio
Structure of Discovery Studio
• Both client and server are normally installed in the
same machine
• separate server machines can be used too
• In most cases you do not need to worry about this
• If you use Hippu1 as your DS server you do not
need to install the whole package but just the client
DS
DS Client DS Server
• Pipe Line Pilot
Default
ports: • Apache
•Interface
9943
•Visualization 9944 • Protocols (BLAST,
CHARMM, Modeler,
•Commands and tools CDOCKER etc. )
Discovery Studio interface
Command menus
Toolbars
Protocols
Tools
3D Window
Hierarchy view (graphics)
Discovery Studio menu commands
File menu: Contains commands for tasks such as opening molecular data
files, saving files to disk, printing, and accessing windows.
Edit menu: Contains commands for tasks such as copying and pasting,
selecting, finding, and setting preferences.
View menu: Contains commands for tasks such as changing the way
objects appear in the various views and for choosing which views should
be shown or hidden.
Chemistry menu: Contains commands for tasks that modify the chemical
makeup of the molecules.
Structure menu: Contains commands for tasks such as adding or
removing labels, adding or removing structure monitors, calculating the
solvent accessibility, cleaning up geometry, and superimposing multiple
molecules.
Sequence menu: Contains submenus and commands to manage protein
sequences and protein sequence alignments.
Window menu: Contains commands that allow you to control the display
of open windows in the current Discovery Studio session.
Help menu: Contains commands to access the Discovery Studio Help
system and the Accelrys website.
DS toolbars
DS tools
“Tools” contain methods to analyze
and modify your molecular model
Tools panel can be made visible
from:
View | Explorers | Tools
The CSC license covers most but
not all the tools
Most of the tools are run within the
client but some require connection to
the DS server
DS Protocols
Hierarchy panel
Data table
Help
Discovery studio contains a large help system. No
WWW or printed manual is available DS interface
Open help form Help | Topics (if search tools are not
visible in the Help window, press CONTROL + s)
Using PDB data in DS
http://www.rcsb.org/
experimentally determined protein structures are
stored into PDB (Protein Data Bank) database
Sources: X-ray diffraction (about 80%), NMR (15 %),
others (5 %)
over 65 000 structures (many of them related and
nearly similar however)
this is much less than the amount of known protein
sequences (UniprotKB contains over 10 million
sequences)
PDBe Database
http://www.ebi.ac.uk/pdbe/
Using PDB data in Discovery Studio
Note that there can be several thighs that may need editing in
your PDB file before you can start to use modeling tools
Checking and fixing a protein
PDB file format
PDB file format
HEADER RETINOL TRANSPORT 27-JUL-92 1BRP 1BRP 2
COMPND RETINOL BINDING PROTEIN (HOLO FORM) 1BRP 3
SOURCE HUMAN (HOMO SAPIENS) PLASMA
…
HELIX 1 1 VAL 6 SER 8 4 ONE SHORT TURN 1BRP 71
HELIX 2 2 PRO 146 GLU 158 1 1BRP 72
SHEET 1 S1 9 GLY 22 LYS 30 0 1BRP 73
…
SEQRES 1 A 182 GLU ARG ASP CYS ARG VAL SER SER PHE ARG VAL LYS GLU
SEQRES 2 A 182 ASN PHE ASP LYS ALA ARG PHE SER GLY THR TRP TYR ALA
SEQRES 3 A 182 MET ALA LYS LYS ASP PRO GLU GLY LEU PHE LEU GLN ASP
…
ATOM 1 N GLU 1 22.826 21.377 -30.151 1.00100.00 1 1BRP 99
ATOM 2 CA GLU 1 23.744 21.686 -29.074 1.00100.00 1 1BRP 100
ATOM 3 C GLU 1 23.395 23.023 -28.464 1.00100.00 1 1BRP 101
ATOM 4 O GLU 1 22.798 23.102 -27.389 1.00100.00 1 1BRP 102
ATOM 5 CB GLU 1 25.225 21.681 -29.508 1.00100.00 1 1BRP 103
ATOM 6 CG GLU 1 26.155 20.992 -28.489 1.00100.00 1 1BRP 104
ATOM 7 CD GLU 1 27.285 21.840 -27.971 1.00100.00 1 1BRP 105
ATOM 8 OE1 GLU 1 28.301 22.075 -28.603 1.00100.00 1 1BRP 106
ATOM 9 OE2 GLU 1 27.087 22.244 -26.741 1.00100.00 1 1BRP 107
ATOM 10 N ARG 2 23.771 24.073 -29.182 1.00100.00 1 1BRP 108
ATOM 11 CA ARG 2 23.485 25.397 -28.690 1.00 86.16 1 1BRP 109
ATOM 12 C ARG 2 22.026 25.784 -28.629 1.00100.00 1 1BRP 110
…
HETATM 1483 O HOH 229 2.848 65.969 -30.833 1.00 53.03 1BRP1581
HETATM 1484 O HOH 230 38.756 38.831 -49.928 1.00 75.69 1BRP1582
Forcefield methods
Force field methods
Force Filed methods….
Force Filed methods….
Advantage Limitations
• large systems in • no atom level
reasonable electrostatics.
calculation time.
• dependent on the
• in some cases FF
can provide results quality and
as accurate as the availability of
highest QM in a parameters.
fraction of the CPU- • The calculated
time. energy is relative.
General form for Forcefield
Example. Functional form of CHARMm force field:
Components of forcefield
Forcefield….
Forcefield functions and parameters
Parameter assignment
Parameter assignment...
Isoleucine:
How to use forcefield ?
The ability to perform a calculation is no guarantee
that results can be trusted !
• unsuitable forcefield gives wrong results.
A common problem: a lack of (good) parameters.
different forcefields cannot usually be merged but
the results can be compared.
forcefield methods are good for predicting properties
for classes of molecules where a lot of information
exists.
Forcefields in DS
https://extras.csc.fi/msimanual/doc/insight2005/ffbs/FF_SimulTOC.html
Applications of forcefield methods
Search strategies
Energy minimization
Three major protocols for minimization
Steepest Descent
Conjugate Gradient
Adopted Bases Newton-Rhapson
Powell
The steepest descents method
Conjugate gradient method
Newton-Rhapson method
Comparison of minimization methods
Strategy in minimization
DS Smart minimizer:
1. steepest descents (max 1000 iterations)
2. conjugate gradient (max 1000 iterations)
Strategy in minimization
Local or global energy minimum?
Local or global energy minimum?
Applications of energy minimization
Molecular dynamics
Molecular dynamics
Using molecular dynamics
Molecular dynamics parameters
Time step
• is limited by the highest frequency in the model
• 0,5-5 fs => millions of simulation steps are needed
to reach nano second time scale.
Molecular dynamics parameters
Temperature
• normally 300 K
• Thermal equilibrium requires temperature control
or careful heating
• Simulated annealing utilizes higher temperatures
Other parameters
• Force field scaling
• dielectricity
Solvent environment
Solvent (water) environment requires more
computing but makes model more realistic
Periodic boundary conditions are used to create
continuous solvent environment
Discovery Studio: Protocol Simulation/Solvation
Analysis of molecular dynamics
Docking
many protein related biological processes are
regulated or enabled by specific binding of small
organic molecules (ligands) to the proteins
• signal transduction
• enzyme activity /inhibition
Docking
Docking
Scoring function
Ligand Preparation in DS
A. Manually
Protein Preparation in DS
Generate ligand
CDOCKER conformations trough
high temperature MD
Full minimization
Output # of refined
ligand poses sorted by
energy
After CDOCKER
You can
Rescore the structures using protocol:
• Calculate Binding Energies
Optimize binding site with the ligand using protocol:
• Ligand Minimization
Study the results with protocol:
• Analyze Ligand Poses
CDOCKER: Papers to read