Biophysical Characterization of Carbohydrate-Lectin Interactions
Inauguraldissertation zur
Erlangung der Würde eines Doktors der Philosophie vorgelegt der
Philosophisch-Naturwissenschaftlichen Fakultät der Universität Basel
von
Pascal Severin Zihlmann
aus Malters, Luzern
Basel, 2018
Originaldokument gespeichert auf dem Dokumentenserver der Universität Basel
edoc.unibas.ch
Genehmigt von der Philosophisch-Naturwissenschaftlichen Fakultät
auf Antrag von:
Prof. Dr. Beat. Ernst
Institut für Molekulare Pharmazie Universität Basel
Prof. Dr. Gerhard Klebe
Institut für Pharmazeutische Chemie Philipps-Universität Marburg, Deutschland
Basel, den 23. Februar 2016
Prof. Dr. Jörg Schibler
Dekan
Acknowledgements
Acknowledgements
First and foremost, I would like to thank Prof. Dr. Beat Ernst for giving me the opportunity to work in his outstanding research group. Your vast experience in drug discovery and your willingness to share your knowledge with me made the last three years an enlightening experience. I highly appreciate all your contributions of time, ideas, encouragement and funding to make my doctorate possible. I truly enjoyed this time.
I further would like to thank Prof. Dr. Gerhard Klebe for accepting to be co-referee on my thesis-committee. This is a real honor for me as your talks and publications have been a great inspiration for my work.
I would like to thank Prof. Dr. Timm Maier for supporting me with your time and knowledge in my crystallization projects. I would like to point out the great achievements of Dr. Roman Peter Jakob. The crystal structures you solved were of inestimable value for my research projects.
A very special thanks goes to Dr. Timothy Sharpe for his invaluable support in biophysics.
There is probably no biophysical question you can’t answer (at least I was never able to pose one). You have been a real support and friend.
Tobias Mühlethaler, Stefan Siegrist, and Michael Misev earn my high appreciation for being the best master students I could have imagined. I’m really grateful for all the work you contributed to my thesis. I wish you all the best for your futures and your scientific careers.
Several people from the IMP deserve special thanks including Anja Sigl and Christoph
Sager for the amazing atmosphere in our “Sunnestübli”. You were precious friends and
inspiring co-workers and I truly enjoyed the stimulating discussions we had. Another
special “thank you” goes to Dr. Said Rabbani for sharing his extensive experience in
molecular biology and for providing me with incredible amounts of FimH protein. I
would like to thank Claudia Huber for reminding me to sign up for courses, for keeping
me awake with coffee, and for being the good soul of the IMP.
Acknowledgements
Furthermore, I would like to thank all former members of the IMP that my work is based on.
Your amazing spadework has made this thesis possible. I’m especially grateful to Dr. Roland Preston, Dr. Katrin Lemme, and Dr. Adam Zalewski who initiated many of those fascinating projects I was involved with.
My time in Basel was made much more enjoyable due to the many friends in the IMP-family. One thing I always loved about the IMP was that every new member was immediately integrated without any exceptions. I remember those memorable trips into the Alps, the vivid discussions in the coffee corner, and the wonderful evenings we spent together during our “summer beer” events. I would like to thank Brigitte, Philipp, Katja, Simon, Jacqueline, Deniz, Marleen, Meike, and Daniela from the biology group. Huge thanks go also to all the chemists, namely Bea, Xiaohua, Giulio, Olly, Priska, Kathi, Mirko, Norbert, Blijke, Lijuan, Fan, Rachel, Hélène, Maja, and Wojtek. The members of the “Molecular Modeling” group from Prof. Dr. Angelo Vedani have been a source of friendships, as well as good advice and collaboration. Thank you, Martin, Sameh, Oya, Joël, and Zhenquan. I hope to spend much more time with you all.
Last but not least, I want to thank my family and friends and my wife for all of their love
and encouragement. Thank you, Maya and Bruno, for believing in me and supporting me
over more than three decades. Thank you, Barbara, for your faithful support during the
last 10 years and for the wonderful relationship we have.
Abstract
Abstract
Improved knowledge of the biological role of lectins has raised the demand for carbohydrate-based therapeutics in recent years. The potential market is estimated to be greater than 20 billion dollars. However, lectins are challenging drug targets due to the unique binding properties of their extensively hydroxylated carbohydrate ligands.
Hydroxyl groups provide directionality and therefore specificity, but are penalized with high desolvation costs. Consequently, monovalent carbohydrate-lectin interactions tend to be rather weak, often in the millimolar range. Moreover, the polar character of carbohydrates creates large obstacles for drug application regarding oral availability and long-lasting plasma levels. The key to the successful development of carbohydrate-based drugs is the simultaneous optimization of carbohydrate lead structures in terms of pharmacodynamics and pharmacokinetics. To further enhance the success rate of carbohydrate-based drug candidates, the understanding of carbohydrate-lectin interactions on a molecular basis has to be improved. For this purpose, we combined structural information (X-ray crystallography and nuclear magnetic resonance spectroscopy), binding data (isothermal titration calorimetry, microscale thermophoresis, and fluorescence polarization assay) and computational methods (quantum mechanical calculations and molecular dynamics simulations) to explore the lectins FimH and E-selectin and their interaction with carbohydrates and mimetics thereof.
FimH is a virulence factor of uropathogenic E. coli located at the tip of the bacterial type 1 pili. It interacts with the mannosylated glycoprotein uroplakin 1a in the urothelial mucosa and thereby mediates adhesion to the bladder wall as the initial step of urinary tract infections (UTI). In manuscript 1 we investigated the energy contribution to binding of the hydroxyl groups mediating the interaction between FimH and the carbohydrate moiety of its ligands. The rigidity of this bacterial lectin was demonstrated in manuscript 2, where we showed that the affinity of a septanose as a mannose mimic is reduced by a factor of 10, mainly due to its flexibility in solution and the consequent conformational restrictions upon binding. In manuscript 3 we analyzed interactions between the tyrosine gate motif of FimH and the aglycones of different ligand classes.
This motif (Tyr48, Tyr137) forms the entrance of the binding pocket and significantly
contributes to binding affinity. In manuscript 4 we explored 2-C-branched mannosides as a
Abstract
novel family of FimH antagonists. In manuscript 5, a pharmacodynamically and pharmacokinetically optimized FimH antagonist was explored by oral application in a mouse model for UTI, resulting in a 1000-fold reduction of the bacterial load in the bladder. Finally, in manuscript 6 we reanalyzed ITC results from our previous studies with the novel analytical tool kinITC, allowing the determination of kinetics in addition to the thermodynamics of binding. The hydrophobic aglycone turned out to be mainly responsible for guiding the antagonist to its binding site whereas the hydrogen bond network between the mannose moiety and the protein had predominantly an impact on the off-rate.
E-selectin is a lectin expressed on the surface of vascular endothelial cells and is involved
in the recruitment of leukocytes to the site of inflammation. By interacting with the
tetrasaccharide epitope sialyl Lewis
x, E-selectin establishes the initial contact and enables
leukocytes to roll along the endothelial surface. Whereas this process is a defense
mechanism in case of infections and injuries, excessive extravasation of leukocytes can
have deleterious consequences in case of numerous diseases with an inflammatory
component, e.g. asthma, psoriasis or stroke. Thus, blocking the interaction of E-selectin
with its physiological ligands is a promising strategy to suppress the inflammatory
response at the beginning of the cascade. For reliable and materially efficient affinity
measurements, we developed and evaluated a novel assay for E-selectin based on
microscale thermophoresis technology in manuscript 7. In the subsequent manuscripts, we
applied the microscale thermophoresis assay. In manuscript 8 we performed a competitive
library screen, whereby four promising small-molecule fragments were identified for
further development towards a non-carbohydrate E-selectin antagonist. Finally, in
manuscript 9 we were able to improve the affinity of a sialyl Lewis
xmimic to E-selectin by
pre-organizing the acid in its bioactive conformation.
Abbreviations
Abbreviations
3-D Three-dimensional space [α]
D20Optical rotation
ABP L-arabinose-binding protein
ABTS 2,2’-azino-bis[3-ethylbenzthiazoline-6-sulfonic acid]
Ac Acetyl
AcOH Acetic acid
AFM Atomic force microscopy
Ala Alanine
AM 1,5-Anhydro-D-mannitol
aq. Aqueous
Ar Aryl
Arg Arginine
Asn Asparagine
Asp Aspartic acid
ATP Adenosine triphosphate AUC Area under the curve
ax Axial
BF 4-Biphenyl-α-D-mannopyranoside Bicine 2-(Bis(2-hydroxyethyl)amino)acetic acid
BisTris 2-[Bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol
Bn Benzyl
Boc tert-butyloxycarbonyl BSA Bovine serum albumin
Bu Butyl
Bz Benzoyl
c Wiseman parameter
C
0Initial concentration
Caco-2 cells Caucasian colon adenocarcinoma cells calcd Calculated
cat. Catalyst / Catalytic amount CD Circular-dichroism
CD22 Cluster of differentiation 22 / Siglec-2 CES Carboxylesterase
CFU Colony forming units
C-HEGA Cyclohexylbutanoyl-N-hydroxyethyl-D-glucamide
CIP Ciprofloxacin
CL
totTotal clearance ConA Concanavalin A conc. Concentration Conf. I. Confidence Interval
COPD Chronic obstructive pulmonary disease
COSY Correlation spectroscopy (NMR)
COX Cyclooxygenase
Abbreviations
ΔC
pChange in heat capacity
CRD Carbohydrate recognition domain Cryo-EM Cryo-electron microscopy
CSP Chemical shift perturbation Cy(c) Cyclohexyl
CYP Cytochrome
Cys Cysteine
δ Chemical shift
d Doublet (NMR)
D Distribution coefficient D Dielectric constant
D
2O Heavy water, deuterium oxide
DAMP Damage-associated molecular pattern DAST Diethylaminosulfur trifluoride
DC-SIGN Dentritic cell-specific ICAM-3 grabbing nonintegrin DCE 1,2-Dichloroethane
DCM Dichloromethane
DL Detection limit
DMAP 4-Dimethylaminopyridine DME 1,2-Dimethoxyethane
DMEM Dulbecco’s modified eagle medium
DMF N,N-Dimethylformamide
DMSO Dimethyl sulfoxide
DMTST Dimethyl(methylthio)sulfonium triflate DNA Deoxyribonucleic acid
DsF FimF donor strand
DsG FimG donor strand
dsRNA Double stranded RNA
DTBMP 2,6-Di-tert-butyl-4-methylpyridine ε
0Vacuum permittivity
E. coli Escherichia coli
e.g. For example
EA Ethyl acetate
EDTA Ethylenediaminetetraacetic acid
ee Enantiomeric excess
EGF Epidermal growth factor
ELAM Endothelial-leukocyte adhesion molecule ELISA Enzyme-linked immunosorbent assay ELLA Enzyme-linked lectin assay
eq Equatorial
equiv Equivalent
ESI Electronspray ionization (-MS)
Abbreviations
EtOAc Ethyl acetate
EtOH Ethanol
FACS Fluorescence-activated cell sorting FCS Fetal calf serum
FimH
LFimH lectin domain FimH
PFimH pilin domain
FITC Fluorescein isothiocyanate FP Fluorescence polarization
Fuc Fucose
ΔG Change in free energy of binding
ΔG°
intChange in intrinsic free energy of binding ΔG° Change in standard free energy of binding ΔG°
solvChange in solvation free energy of binding
Gal Galactose
GdmCl Guadinium chloride GFP Green fluorescent protein
Glc Glucose
GlcNAc N-Acetylglucosamine
Gln Glutamine
Glu Glutamic acid
Gly Glycine
GlyCAM-1 Glycosylation-dependent cell adhesion molecule-1 GMP-140 Granule membrane protein-140
ΔH°
ionChange in ionization enthalpy ΔH° Change in enthalpy
H bond Hydrogen bond
h(rs) Hour(s)
HA Hemagglutination assay hCAII Human carbonic anhydrase II
HEPES 2-[4-(2-Hydroxyethyl)piperazin-1-yl]ethanesulfonic acid HEV High endothelial venule
HIA Hemagglutination inhibition assay
His Histidine
HIV Human immunodeficiency virus HM n-Heptyl α-D-mannopyranoside
HMBC Heteronuclear multiple bond correlation HPLC High performance liquid chromatography HRMS High-resolution mass spectrometry
HSQC Heteronuclear single quantum coherence
Hz Hertz
IBC Intracellular bacterial community
IC
50Half-maximal inhibitory concentration
ICAM-3 Intercellular adhesion molecule-3
IFD Induced fit docking
Abbreviations
IgG Immunoglobulin G
IL-1 Interleukin-1
Ile Isoleucine
IMP Institute of Molecular Pharmacy
iPrOH 2-Propanol
IPTG Isopropyl β-D-1-thiogalactopyranoside ITC Isothermal titration calorimetry
iv Intravenous
K
AEquilibrium association constant K
DEquilibrium dissociation constant K
iInhibitory constant
kinITC Kinetic ITC
k
offDissociation rate constant k
onAssociation rate constant
L Ligand
Lac Lactic acid
LAM Lymphocyte adhesion molecule
LB Luria–Bertani
LC Liquid chromatography (-MS)
LECAM Leukocyte-endothelial cell adhesion molecule LED Light-emitting diode
LF Ligand-free
LPS Lipopolysaccharide
Lys Lysine
m Multiplet (NMR)
m/z Mass-to-charge ratio
MAC
90Minimal concentration to inhibit 90% adhesion MAdCAM-1 Mucosal vascular addressin cell adhesion molecule-1 MAG Myelin-associated glycoprotein / Siglec-4
Man Mannose
MD Molecular dynamics
Me Methyl
MeCN Acetonitrile
MeOH Methanol
MFI Mean fluorescence intensity min(s) Minute(s)
MM-GBSA Molecular Mechanics - Generalized Born Surface Area
mol. Molecular
MOPS 3-Morpholinopropane-1-sulfonic acid
MPLC Medium-pressure liquid chromatography
mRNA Messenger ribonucleic acid
Abbreviations
n.b. Not binding
n.d. Not determined
NADPH Nicotinamide adenine dinucleotide phosphate Neu5Ac N-Acetylneuraminic acid, sialic acid
NHS N-Hydroxysuccinimide
Ni-NTA Nickel-nitrilotriacetic acid
NIS N-Iodosuccinimide
NMR Nuclear magnetic resonance NOE Nuclear Overhauser effect
OD Optical density
OH Hydroxyl
P Partitioning coefficient
P Protein
p-TSOH p-Toluenesulfonic acid
PAA Polyacrylamide
PAMP Pathogen-associated molecular pattern
PAMPA Parallel artificial membrane permeation assay P
appApparent permeability
PBS Phosphate buffered saline PCR Polymerase chain reaction
PD Pharmacodynamics
PDB Protein data bank P
eEffective permeability
PE Petroleum ether
PEG Polyethylene glycol PES Polyethersulfone
PGA-LM Poly- γ-glutamic acid low-molecular-weight polymer
Ph Phenyl
Phe Phenylalanine
PK Pharmacokinetics
PL Protein-ligand complex
po Per os (oral)
PPB Plasma protein binding ppm Parts per million
Pr Propyl
Pro Prolin
PRR Pattern recognition receptor PSGL-1 P-selectin glycoprotein ligand-1 PTFE Poly(1,1,2,2-tetrafluoroethylene)
py Pyridine
q Quartet
QIR Quiescent intracellular reservoir
QM Quantum mechanics
QTOF Quadrupole TOF MS
R Universal gas constant
Abbreviations
R
fRetention factor RLM Rat liver microsome
RMSD Root-mean-square deviation RNA Ribonucleic acid
RP Reversed-phase
rt Room temperature
ΔS° Change in entropy
ΔS°
confChange in conformational entropy ΔS°
solvChange in solvation entropy
ΔS°
trans/rotChange in translational and rotational entropy
s Second or singulet (NMR) SAR Structure-activity relationship SASA Solvent accessible surface area
sat. Saturated
SAXS Small-angle X-ray scattering
sc Subcutaneous
SCR Short consensus repeat
SDS-PAGE Sodium dodecyl sulfate - Polyacrylamide gel electrophoresis sec(s) Second(s)
Ser Serine
Siglec Sialic acid-binding immunoglobulin-like lectins SKR Structure-kinetic relationship
sLe
aSialyl Lewis
asLe
xSialyl Lewis
xSNF Swiss National Science Foundation SPR Surface plasmon resonance
STD Saturation-transfer difference (NMR)
Std. Standard
τ Residence time
t Triplet (NMR)
T Temperature
t
1/2Dissociation half-life, residence time
TAPS N-Tris(hydroxymethyl)methyl-3-aminopropanesulfonic acid TBAB Tetra-n-butylammonium bromide
TBAF Tetra-n-butylammonium fluoride TBAI Tetra-n-butylammonium iodide TBDMS tert-Butyldimethylsilyl
TBDPS tert-Butyldiphenylsilyl
TCDI 1,1'-Thiocarbonyldiimidazole TEER Transepithelial electrical resistance
Tf Triflate
Abbreviations
Thr Threonine
TLC Thin-layer chromatography TMS Trimethylsilyl
TMSOTf Trimethylsilyl trifluoromethanesulfonate TNF-α Tumor necrosis factor alpha
TOCSY Total correlation spectroscopy TOF Time of flight
Tricine N-(Tri(hydroxymethyl)methyl)glycine Tris Tris(hydroxymethyl)aminomethane trNOE Transfer NOE (-NMR)
Trp Tryptophan
TS
‡Transition state
TSP 3-(Trimethylsilyl)propanoic acid
Tyr Tyrosine
Tys Sulfotyrosine
UPEC Uropathogenic E. coli UPK1a Uroplakin 1a protein UTC Urinary tract infection
UV Ultraviolet
Val Valine
WT Wild type
Table of contents
Table of contents
I. Introduction 1
1. Molecular Interaction 3
1.1. Milestones in the research history of molecular interaction 3 1.2. Mathematical description of a molecular binding event 4 1.3. Driving forces of molecular interactions: Thermodynamics 5
1.3.1. Enthalpic driving forces 6
1.3.2. Entropic driving forces 10
1.3.3. Solvent effects 11
2. Evaluating Molecular Interaction 13
2.1. Methods to determine the structure of complexes 13 2.2. Methods to determine the inhibitory constant 14
2.3. Methods to determine the binding constants 15
3. Carbohydrate-Lectin Interactions 19
3.1. Carbohydrates 19
3.2. Lectins 19
3.2.1. Microbial lectins 20
3.2.2. Human lectins 20
3.3. Lectins and carbohydrates in drug discovery 21 3.4. Molecular basis of the regulation of carbohydrate-lectin affinity 22
3.4.1. Affinity regulation: Carbohydrates 23
3.4.2. Affinity regulation: Lectins 24
3.4.3. Multivalency 25
3.5. Thermodynamics of carbohydrate-lectin interactions 25
4. References 28
II. The Bacterial Adhesin FimH 35
1. Introduction 37
1.1. Urinary tract infection 37
1.2. Infection cycle of uropathogenic Escherichia coli 37
1.3. Structure 39
1.3.1. Type 1 pili and fim operon 39
1.3.2. The lectin FimH 40
1.3.3. FimH binding site 44
1.4. FimH antagonists 46
Table of contents
2. Manuscripts 53
2.1. Manuscript 1:
3.1 High-affinity carbohydrate-lectin interactions: How
nature makes it possible 53
2.2. Manuscript 2:
3.2 The price of flexibility – A case study on septanoses as
pyranose mimetics 95
2.3. Manuscript 3:
3.3 The tyrosine gate of the bacterial lectin FimH: A conformational analysis by NMR Spectroscopy and
X-ray crystallography. 107
2.4. Manuscript 4: 2-C-Branched mannosides as a novel family of FimH
antagonists – Synthesis and biological evaluation 121 2.4. Manuscript 5:
3.4 FimH antagonists: Bioisosteres to improve the in vitro
and in vivo PK/PD Profile. 133
2.5. Manuscript 6: KinITC – One method supports both thermodynamic
and kinetic SARs 155
III. E-selectin 203
1. Introduction 205
1.1. A brief overview of the selectin family members 205
1.2. Structural features 205
1.3. Selectin ligands 207
1.4. The role of selectins in inflammation 208
1.4.1. Inflammatory cascade 208
1.4.2. Role of selectins in disease 209
1.5. Structural binding properties 210
1.5.1. E-selectin 211
1.5.2. P-selectin 213
1.5.3. Catch-bond mechanism 214
1.6. Selectin antagonists 215
1.7. References 218
2. Manuscripts 2.1. Manuscript 7:
3.1 223
2.2. Manuscript 8:
3.2 245
2.4. Manuscript 9:
3.4
Microscale thermophoresis: A powerful opportunity for carbohydrate-based drug discovery
Fragment screening towards an orally available
E-selectin antagonist using microscale thermophoresis Is it a hydrophobic clash that pre-organizes sLe
xmimics in the bioactive conformation for the
interaction with E-selectin? 263
Section I.
Introduction
Section I. Introduction – Molecular Interaction
1. Molecular Interaction
1.1. Milestones in the research history of molecular interaction
It has been more than 120 years since Emil Fischer discovered a fundamental underlying principle of molecular interaction.
(1)He observed that methyl-α-D-glucoside but not methyl-β-D-glucoside is hydrolyzed in the presence of the yeast extract invertin. In contrast, methyl-β-D-glucoside but not its α-anomer was hydrolyzed in the presence of the almond extract emulsin. He interpreted his findings with the “lock and key”
principle:
“Um ein Bild zu gebrauchen, will ich sagen, dass Enzym und Glucosid wie Schloss und Schlüssel zu einander passen müssen, um eine chemische Wirkung auf einander ausüben zu können.”
The “lock and key” principal of Fischer and the finding of Paul Ehrlich that agents can only work when they are bound (“corpora non agunt nisi fixata”) provided the basis for modern pharmacology.
(2,3)They allowed medicinal chemists to synthesize “keys” that activate (agonist) or inhibit (antagonist) the function of a given “lock” (protein).
(4)Since then, the number of known “locks” has been steadily growing and nowadays includes proteins of various groups like e.g. enzymes, lectins, hormone receptors, ion channels, transporters, structural proteins or transcription regulators.
However, the static ‘key and lock’ principle could not explain why non-competitive inhibition can occur and why, for example, glucose but not water is phosphorylated in the binding pocket of hexokinase. In consequence, Daniel E. Koshland introduced the dynamic ‘induced-fit model’ in 1958.
(5,6)“a) a precise orientation of catalytic groups is required for enzyme
action, b) the substrate causes an appreciable change in the three-
dimensional relationship of the amino acids at the active site, and
c) the changes in the protein structure caused by the substrate will
bring the catalytic groups into the proper alignment, whereas a
non-substrate will not.”
Section I. Introduction – Molecular Interaction
In recent years, the concept of ‘conformational selection’ introduced from Ruth Nussinov and co-workers
(7)has become more and more popular as an alternative dynamic model.
This model assumes a selection for a conformational subpopulation in a system of dynamically fluctuating protein and ligand species. It is in contrast to the ‘induced-fit model’, where the conformational change is thought to be a consequence of the interaction of protein and ligand. Both models might be combined to an initial conformational selection followed by a conformational adjustment.
(8)1.2. Mathematical description of a molecular binding event
The equations to describe a protein-ligand binding event is the law of mass action deduced by Guldberg and Waage in 1879.
(9)aA + bB ⇌ cC + dD (eq. 1)
K = k
1k
-1= [C]
c[D]
d[A]
a[B]
b(eq. 2)
Equation 2 describes the ratio between the concentrations of the reactants ([A], [B]) and products ([C], [D]) at the equilibrium state of the reaction described in equation 1, where a, b, c, and d are the stoichiometric coefficients. The equilibrium constant K defines the extent of the reaction, whereas the rate is defined by the rate constant of the forward reaction (k
1) and the reverse reaction (k
-1). Equilibrium is reached when the rate of reaction in forward and reverse directions is equal.
The law of mass action can be used analogously to describe a protein-ligand binding interaction (equations 3 and 4).
P + L ⇌ PL (eq. 3)
K
A= k
onk
off= PL P L = 1
K
D(eq. 4)
k1
k-1
kon
koff
Section I. Introduction – Molecular Interaction
The equilibrium constants (binding constants) are called the association constant (K
A) and the dissociation constant (K
D). They describe the concentration in an equilibrium at which half of the protein in a system is complexed by a ligand. The rate constants are called the association rate constant k
onand the dissociation rate constants k
off. From k
offthe half-life (t
1/2) of the protein-ligand complex can be obtained (equation 5).
1.3. Driving forces of molecular interactions: Thermodynamics
A spontaneous interaction between ligand and receptor only takes place when the overall change of free energy of binding (ΔG) is negative. ΔG of a reaction is determined by the nature of the reactants and their concentrations with R is the universal gas constant (8.314 J/mol K) and T being the absolute temperature (equation 6):
(10)∆G = ∆G°+RTln [P][L]
[PL] = ∆G°+RT lnK
D(eq. 6) ΔG° is the change in free energy of binding under standard condition that is a reactant concentration of 1 M in solution. The system is at equilibrium when ΔG = 0. Therefore, the relation between the standard free energy and the equilibrium constant can be derived from equations 7 and 8.
0 = ∆G° + RT lnK
D(eq. 7)
∆G° = RT lnK
D= − RT lnK
A(eq. 8) ΔG° is furthermore determined by the change of enthalpy (ΔH°) and change of entropy (ΔS°) in dependence of the absolute temperature (T) (equation 9):
∆G° = ∆H° − T∆S° (eq. 9)
The change in entropy results from the sum of the changes in solvation entropy,
conformational entropy, and the rigid body motions (translational and rotational
entropy). The change in enthalpy reflects the change in bond interaction energy (covalent
and non-covalent, including short and long-range electrostatics).
Section I. Introduction – Molecular Interaction
In drug discovery often a phenomenon called enthalpy-entropy compensation is observed, where an enthalpic improvement is linearly correlated with a loss in the entropy and vice versa.
(11,12)A newly formed interaction between protein and ligand may lead to an improved enthalpy, but simultaneously constrains the conformational flexibility of protein and ligand, leading to an entropic penalty. It is furthermore accepted that water plays a key role in enthalpy-entropy compensation. When a binding event takes place in aqueous solution (solvent), both ligand and protein (solutes) shed their solvation shell in the contact area. The costs of this desolvation process have to be considered when calculating the binding energy. Hence, the change in free energy of binding is the difference between the solute-solute and the solute-solvent interactions. Depending on the polarity of the binding site and the ligand, this desolvation process is enthalpically more or less costly as hydrogen bonds between solute and solvent are to be broken.
(13)However, this effect might again be partially compensated by a beneficial entropy term as water molecules are released to bulk.
1.3.1. Enthalpic driving forces
Enthalpic driving forces of a binding reaction are characterized by their release or consumption of heat. A negative enthalpy means the release of heat and contributes beneficially to the free energy of binding. The nature of enthalpic interactions is electrostatic.
This means that they are formed between charged or dipolar atoms and are therefore subject to the Coulomb’s law (equation 10):
E = q
1q
24πε
0rD (eq. 10)
E is the electrostatic energy, r is the distance, q
1and q
2are the charges of two interacting
atoms, ε
0is the vacuum permittivity, and D is the dielectric constant of the surrounding
medium. Electrostatic interactions are therefore either attractive (oppositely charged) or
repulsive (same charge). As described in Coulomb’s law, the energy of an interaction
does not only depend on the individual charges of the interacting, but also on their
distance and the solvent it takes place in. The solvent of biological systems (water) has a
Section I. Introduction – Molecular Interaction
to be 1.
(14,15)Hence, the magnitude of the energy of an electrostatic interaction is increased in proximity to the protein due to the lower dielectric constant. Furthermore, electrostatic energies vary according to the nature of the charges and their distance. Between ions with net charges, the energy falls off slowly with distance 1/r (long-range electrostatics), while the energy of randomly oriented dipoles falls off rapidly with distance 1/r
6(short-range electrostatics).
(16)Hydrogen bonds
A hydrogen bond is a non-covalent interaction between a hydrogen bond donor (D
δ-covalently bound to a hydrogen atom H
δ+) and a hydrogen bond acceptor with a free lone pair (:A
δ-). Both donor and acceptor are electronegative atoms, usually nitrogen or oxygen.
D
δ-− H
δ++ :A
δ-⇌ D
δ-− H
δ+··· :A
δ-Hydrogen bonds are electrostatic dipole-dipole interactions with covalent features: the distance between the donor and acceptor is shorter than the sum of the van-der-Waals radii of the interacting atoms (but longer than a covalent bond) and their quality depends on geometrical rules (bond length between 2.5-3-2 Å; bond angle 130-180°). Directionality provides specificity and is therefore crucial for molecular recognition.
(17)According to Jeffrey and Saenger (1991), a hydrogen bond contributes 1-4 kcal/mol to the free energy of binding.
(18)A formal charge on one (charge-assisted hydrogen bond) or both (salt bridge) interaction partner(s) increases the energy of the bond.
(19)However, hydrogen bonds are not restricted to the interaction between protein and ligand, but can also be formed with the solvent (water). Whether the disruption of these interactions during desolvation (desolvation penalty) is compensated by the newly formed interactions between ligand and protein depends on the specific case. However, even if hydrogen bonds do not always contribute significantly to the free energy of binding they might nonetheless be important for the specificity of a molecular binding event.
(17)Hydrogen bond
Section I. Introduction – Molecular Interaction
Halogen bonds
A halogen bond is built between a covalently bound halogen atom (R-X
δ+) (X=halogen atom) and a halogen bond acceptor (A
δ-). The halogen atom is the donor, while the acceptor is usually a lone pair of a nitrogen or an oxygen atom. However, also π electrons of an unsaturated conjugated system or a second halogen atom may act as an acceptor.
(20)R − X
δ++ :A
δ-⇌ R − X
δ+···A
δ-Although often described as entirely negative, the electrons of halogens are distributed anisotropically (Figure 1). The outer lobe of the half-filled p-Orbital is electronically depleted and therefore forms a more negative region along the transaxial plane and a positive region (σ-hole) along the R-X axis. Hence, the optimal bond angle for a halogen bond is 180°.
(21)In theory, all halogens are able to form halogen bonds, but the positive electrostatic potential and the bond energy increases with the polarizability of the halogen atom (F < Cl < Br < I).
(21)However, the electronegativity and a high level of sp-hybridization of fluorine cause an electron influx that neutralizes the σ-hole.
(22)Halogen bond
Section I. Introduction – Molecular Interaction
Like hydrogen bonds, halogen bonds are highly directional, electrostatically driven non- covalent interactions. Halogen bonds are weaker than hydrogen bonds, but nevertheless still of importance in drug discovery because halogens are often used to provide solubility, metabolic stability and bioavailability to molecules.
Van der Waals contacts
Van der Waals (VdW) interactions are non-specific electrostatic interactions formed between dipoles. Although one single contact is weak, a large number of interactions contribute significantly to the overall binding energy.
(24-26)While polar molecules have permanent dipoles, non-polar molecules only have induced dipoles formed by short- termed random fluctuations in the distribution of the electron density. A distinction can be made between the interaction of a permanent dipole with an induced dipole (Debye force), two permanent dipoles (Keesom interaction) or two induced dipoles (London dispersion force). The VdW interactions are mathematically described by the 12-6 Lennard-Jones potential (V) (equation 11),
(27)where A and B are specific molecular constants (based on the VdW radii and the magnitude of attraction) of the interacting atoms and r describes their distance:
V = A r
12− B
r
6(eq. 11)
A/r
12represents the repulsive potential and B/r
6the attractive potential. The closer two atoms approach, the higher the attraction is (negative potential V). Yet, when the atoms come too close together, the repulsion term rapidly increases due to overlapping electron clouds (positive potential V). At V=0, when attraction and repulsion are in balance, the atoms are in VdW contact, which is the sum of the VdW radii of the two interacting atoms.
(28)Stacking with π systems
The amino acids phenylalanine, tryptophan, tyrosine, and histidine as well as many drug
molecules contain π-systems. The conjugation of their p-orbitals enables delocalization of
π electrons. This electron delocalization polarizes the aromatic ring resulting in two
partially negative charged rings above and below the plane of the partially positive
Section I. Introduction – Molecular Interaction
charged atom ring (quadrupole).
(29)Two aromatic systems mainly interact in a parallel or perpendicular configuration. The parallel arrangement is more favored when electron- withdrawing groups are introduced that weaken the quadrupole.
(30)On the contrary, electron donating groups increase the quadrupole charge distribution and therefore promote the perpendicular constellation between two aromatic rings, but also the interaction with other formally positive charged groups (cation-π interaction).
(31)Furthermore, interactions with aliphatic groups and halogens are able to favorably contribute to the free energy of binding.
(32)1.3.2. Entropic driving forces
The entropy of a thermodynamic system is the measure of its disorder (motion). In contrast to the change in enthalpy, there is no direct measure of the change in entropy.
However, since the K
Aand ΔH° can be determined, ΔS° can be calculated according to equations 8 and 9. A positive change in entropy contributes beneficially to the change in free energy of binding. Like ΔH°, ΔS° is an additive measure of different processes taking place during a binding event and it can be further dissected (equation 12):
(33,34)∆S° = ∆S°
solv+ ∆S°
conf+ ∆S°
trans/rot(eq. 12) The change in solvation entropy (ΔS°
solv) depends on the hydration shell that alters upon the binding of a ligand to a protein. Water molecules are released to bulk water where they are assumed to gain motion. The hydration shell “melts” with increasing temperature and it completely passes to bulk water at a temperature of 385 K (T
R).
(35,36)Therefore, determination of the temperature dependence of the thermodynamic constants allows the calculation of ΔS°
solv(equation 13 and 14), where ∆C
pis the change in heat capacity and T is the absolute temperature.
∆C
p= ∂∆H°
∂T (eq. 13)
∆S° = ∆C ln T
(eq. 14)
Section I. Introduction – Molecular Interaction
The change in translational and rotational entropy (ΔS°
trans/rot) is called the cratic term. It is a constant loss of energy due to the loss of degrees of freedom as upon complex formation when two rigid bodies (each being able to translate and rotate) join to one.
Literature values for the cratic term range from -5.4 up to -62 kJ/mol (at 298.15 K).
(37-39)However, the most often used change in entropy at room temperature is -10 kJ/mol calculated from equation 15, where R is the universal gas constant (8.314 J/mol K) and 55.6 in the divisor corresponds to the molarity of 1 liter water.
(39,40)∆S°
trans/rot= R ln 1
55.6 (eq. 15)
The change in conformational entropy (ΔS°
conf) can therefore be calculated according to equation 12. ΔS°
confis the change of conformational degrees of freedom of the bound protein-ligand complex compared to their unbound states in solvent. ΔS°
confis usually unfavorable (negative) as the amino acid side chains involved in binding, as well as the rotational bonds of the ligand, are more restricted in a complex. Conformational changes of the protein (induced fit) may also contribute unfavorably to ΔS°
conf.
1.3.3. Solvent effects
To understand the role of water is crucial to understand binding. It is often neglected, but each water molecule has to be considered as an additional interaction partner when a protein-ligand interaction takes place in an aqueous environment. Therefore, the change in free energy of binding of a protein-ligand binding event contains the energy change of the whole system, including the desolvation of the interaction interfaces, newly formed interactions of the protein-ligand complex and water reorganization.
Liquid water favorably forms approx. three hydrogen bonds while retaining a certain
amount of mobility.
(18)However, water molecules in the first solvation layer with close
contact to the solute might not be able to form three hydrogen bonds (enthalpically
frustrated water) or be limited in their mobility (conformationally trapped water).
(24,41-44)To
release these water molecules to the bulk may be highly beneficial. Dunitz calculated an
entropic gain of up to 2 kcal/mol (at 300 K) for the release of a trapped water
molecule.
(45)In terms of enthalpy, the release of a trapped water molecule might be less
Section I. Introduction – Molecular Interaction
beneficial, as some of them play an important role in mediating interactions between protein and ligand (structural water).
(32,46)Still, by far the most water molecules are loosely associated with the protein and can be displaced with minor effect to the overall free energy of binding. Hence, the solvation of an unbound protein binding site is predetermining the binding characteristics. While some water patterns promote binding, others request high desolvation costs and prevent strong interactions.
Furthermore, the properties of the solute are decisive for solvation, in particular, its
polarity. The highly polar water molecules tend to form hydrogen bonds to hydrophilic
solutes. Breaking these hydrogen bonds upon desolvation is enthalpically expensive, but
is partially compensated by the increased mobility of the water molecules in the bulk. On
the other hand, hydrophobic solutes do not interact via hydrogen bonds with the solvent
but rather form a cavity disrupting the water structure. The concept of the classical
hydrophobic effect proposed by Kauzman assumes a more ordered water structure
surrounding the non-polar solute.
(47-51)Desolvation upon binding is therefore
accompanied by a gain of entropy. However, ITC measurements revealed a non-classical
hydrophobic effect characterized by an enthalpy gain upon desolvation.
(44,52-55)This effect is
explained by the gain of electrostatic interactions between the non-polar solutes as well as
by the optimized formation of hydrogen bonds between the water molecules in the bulk.
Section I. Introduction – Evaluating Molecular Interaction
2. Evaluating Molecular Interaction
2.1. Methods to determine the structure of complexes
A full understanding of a molecular binding interaction requires not only knowledge of
the energetics of binding, but also structural information about the bound and unbound
states of the binding partners. Otherwise, crucial effects such as conformational changes
and oligomerization might not be detected and can lead to a misinterpretation of binding
data. Probably the most important source for high resolution structural information is
X-ray crystallography, where the diffraction pattern of a molecular structure (e.g. lectin-
carbohydrate complex) is gained by the exposure of its crystal to a beam of
monochromatic X-rays.
(56)The X-ray scatters at the atoms in the crystal and the emerging
diffraction pattern can be transformed into 3-dimensional information regarding
individual atom positions. However, information about complex structure from X-ray
crystallography may be compromised by poor resolution or unfavorable crystal contacts
that prevent binding or distort the structure of the bound state. Moreover, an X-ray
structure is a static picture of a dynamic system and it is wise to consider other sources of
structural information like small-angle X-ray scattering (SAXS),
(57)cryo-electron
microscopy (Cryo-EM),
(58)nuclear magnetic resonance (NMR) spectroscopy, atomic
force microscopy (AFM)
(59,60)or homology modeling.
(61)Computational approaches like
molecular dynamics (MD) simulations or quantum mechanical (QM) calculations are
very useful to evaluate questions of motion, conformation, solvation and energy
composition.
(62)The most powerful technique to obtain both structural information and
binding data is NMR spectroscopy and it is, in contrast to X-ray crystallography, measured
in solution. A range of informative NMR data such as chemical shifts and intra- or
intermolecular nuclear Overhauser effects (NOE) can be combined to derive the 3D
structure of a protein or protein-ligand complex.
(63,64)Further experiments can be applied
to elucidate the dynamics of a protein, ranging from fast backbone or side chain
fluctuations to global conformational changes on a slow timescale.
(65,66)Protein-ligand
binding events can be detected and quantified by a broad range of experiments with
either observation of the ligand signals (e.g. saturation transfer difference, transfer NOE
or relaxation-based experiments)
(67-69)or of the protein signals (chemical shift perturbation
experiments, CSP).
(70)CSP experiments are especially suited to the identification of
unknown binding sites on a protein target.
(71)Section I. Introduction – Evaluating Molecular Interaction
2.2. Methods to determine the inhibitory constant
Carbohydrate-lectin interactions are frequently weak, having K
Dvalues in the high µM to mM range. Many techniques that are routinely used to quantify high-affinity interactions (K
Din the µM to pM range) are not applicable or practicable for measuring such lower affinities. Therefore, the next two sections will focus on techniques suitable for the measurement of the carbohydrate-lectin interactions.
Many competitive assay formats have been established to quantify carbohydrate-lectin interactions. The phenomenon of hemagglutination that led to the discovery of the first lectin by Stillmark in the 19
thcentury, is still the most often used method.
(72,73)The hemagglutination assay (HA) takes advantage of the heterogeneous glycosylation of erythrocytes and the multivalent binding properties of most lectins.
(74,75)While the presence of anticoagulants red blood cells would segregate in suspension, the addition of a multivalent lectin cross-links the blood cells and creates a lattice structure. An assay format where the carbohydrate competes with erythrocytes for binding to the lectin allows the determination of a carbohydrate concentration that inhibits 50% of the agglutination (IC
50) (Hemagglutination inhibition assay, HIA). IC
50values are helpful to compare and rank the inhibition potency of different ligands, but they are not suitable to compare inhibition concentrations measured with different methods or in different laboratories (e.g. when differently glycosylated erythrocytes are used). Furthermore, the HIA is subject to the numerous limitations, e.g. irreversible cross-linkage without formation of an equilibrium or multivalent ligands cross-linking the erythrocytes. These drawbacks are eliminated by the enzyme-linked lectin assay (ELLA).
(73)In the ELLA method, a polyvalent ligand is immobilized on a microtiter plate and a lectin-horseradish peroxidase conjugate is incubated with a serial dilution of competing ligand of interest.
After equilibration, the supernatant containing the unbound ligand of interest and lectin
is washed off and replaced by a solution of chromogenic peroxidase substrate. The
protein bound on the microtiter plate can now be quantified using UV spectroscopy. This
method can be modified, e.g. when the lectin is immobilized on a microtiter plate and the
ligand is competing with a spectroscopically detectable glycopolymer (Polymer-binding
Section I. Introduction – Evaluating Molecular Interaction
by Wang.
(77)However, for both assays, HIA and ELLA, the Wang equation cannot be applied as the exact concentration of all participating molecules as well as the dissociation constant (K
D) between the standard competitor and the lectin is not known.
A competitive assay where the Wang equation can be applied is a technique based on fluorescence polarization (FP). Here, the competing ligand (tracer) is linked to a fluorophore.
(78)In steady-state fluorescence polarization measurements, polarized light is used to excite a fluorophore-containing tracer compound, and the degree of polarization of fluorescence emission is determined by measuring fluorescence through polarizers parallel and perpendicular to the axis of excitation polarization. The extent of depolarization of fluorescence emission depends upon the rotation of the fluorophore during its fluorescence lifetime. This depends upon the tumbling rate of the fluorophore, which is influenced by its hydrodynamic radius. Hence, a tracer bound to a protein exhibits slower tumbling compared to a free tracer in solution and has increased fluorescence polarization. Titrating a dilution series of protein against a constant concentration of tracer gives a binding curve from which it is possible to derive the exact K
Dusing a standard single-site binding model (e.g. the function given by Cooper
(79)).
Therefore, all constants are known to use the Wang equation for the competitive assay format.
2.3. Methods to determine the binding constants
More reliable is the direct determination of the K
Dwithout the complication of a
competitive assay format, e.g. using microscale thermophoresis (MST).
(80)We routinely use
MST to determine affinities of E-selectin ligands because the extremely small assay
volume preserves valuable protein and ligand resources (Manuscripts 7-9). The assay is
based on the phenomenon that molecules move along a temperature gradient, a
phenomenon known as thermophoresis. The rate of thermophoresis is sensitive to
changes in size, charge, and solvation of a molecule, which allows discriminating bound
from unbound species.
(81)For some proteins it is possible to follow thermophoresis by
intrinsic tryptophan fluorescence, while for many others it is necessary to label one
interaction partner with a fluorescent dye.
Section I. Introduction – Evaluating Molecular Interaction
Another label-free technique to determine K
D, as well as the association rate constant (k
on) and the dissociation rate constant (k
off) is surface plasmon resonance (SPR).
(82)Either the ligand or the protein is immobilized on a functionalized gold surface. The backside of the gold layer reflects a light source through a prism to a detector. At a certain resonance angle, the light is absorbed by the electrons of the gold layer causing them to resonate.
The angle at which this occurs depends upon the refractive index of the solution at the functionalized gold surface. A binding event at the surface changes that refractive index and is observable as a shift of the minimum in the reflectivity curve. Monitoring the change in resonance over time enables to derive the rate constants k
onand k
off.
The only technique to determine directly the thermodynamics of a binding reaction is isothermal titration calorimetry (ITC).
(83)ITC is the most frequently used technique throughout this thesis and is therefore explained in more detail. With one ITC experiment, it is possible to simultaneously measure the change in enthalpy (ΔH°), the binding stoichiometry (N) and K
A. According to a recent publication by Burnouf et al.
even the rate constants k
onand k
offcan be deduced from ITC measurements.
(84)For a standard ITC experiment, a threefold surplus of ligand is stepwise titrated through a syringe into a cell containing the protein solution (Figure 2A). However, also an inverse experiment is possible.
Figure 2. Illustrated mode of operation of an isothermal titration calorimeter. (A) The small temperature changes