Abstract
Antibody-drug conjugates (ADCs) constitute a category of anticancer targeted therapy that has gathered great interest during the last few years because of their potential to kill cancer cells while causing significantly fewer side effects than traditional chemotherapy. In this paper, a process of computational construction of ADCs is described, using the surface lysines of an antibody and a non-covalent linker molecule, as well as a cytotoxic substance, as files in Protein Data Bank format. Also, aspects related to the function, properties, and development of ADCs are discussed.
Introduction
Cancer is a term used for a group of diseases in which abnormal cells divide uncontrollably, having the ability to invade other tissues and spread to other parts of the body through the blood and the lymph systems. It is one of the dominating causes of death worldwide, being responsible for 8.2 million deaths in 2012, while annual cancer cases are expected to rise from 14 millions in 2012 to 22 millions during the next two decades. There are a number of cancer treatment approaches, with surgery, chemotherapy, and radiotherapy being some of the most common ones. Traditional chemotherapy drugs act against all rapidly dividing cells, including the non-cancerous ones, a disadvantage that led to an increased interest in the development of targeted anticancer therapies (TATs). TAT refers to the systemic administration of drugs or other substances with particular mechanisms that interfere with specific molecules involved in cancer cell growth and survival, and have minimized adverse effects on healthy cells.1–7
Antibody–drug conjugates (ADCs) constitute a class of TAT. An ADC molecule consists of three components: a cytotoxic drug, also called payload; a tumor-targeting monoclonal antibody or antibody fragment; and a molecule that connects the previous two, called linker. Its mechanism of action can be described in general terms as the binding of the antibody to its antigen and the subsequent release of the cytotoxin, which results in the death of the cancer cell. The choice of the antibody depends on the target antigen, usually a molecule present in cancer cells at a much higher concentration than in normal cells. Therefore, the expected performance from an ADC upon its administration to the patient is to remain stable in circulation and deliver the cytotoxic substance selectively to cancer cells, maximizing their exposure to the drug, while minimizing the exposure of healthy cells. Some ADCs are internalized by the cell upon the binding of the antibody to the target, although other ADCs reduce the blood supply of the tumor by targeting endothelial cells within the tumor vasculature. There are three internalization routes: clathrin-mediated endocytosis, caveolae-mediated uptake, and pinocytosis.8–18
The most common antibody form incorporated in ADCs is the full monoclonal antibody, or immunoglobulin (IgG). IgGs have a long half-life in blood, usually spanning from days to weeks, which enables them to travel in the blood vessels for a time long enough to locate the tumor cells.19–22 The second part of the ADC, the linker, is required to be stable during the circulation of the ADC in the bloodstream in order to avoid the premature release of the drug, as well as able to discharge the drug after the binding of the antibody to its antigen. Linkers currently explored can be categorized into cleavable and non-cleavable linkers. Cleavable linkers include lysosomal protease-sensitive linkers, acid-sensitive linkers, and glutathione-sensitive linkers.23–25 Regarding the cytotoxic part of the ADC, there are two main classes of ADC payloads that have been explored, the first one being drugs that disrupt microtubule assembly, such as auristatins and maytansinoids, and the second one being drugs that target DNA structure, such as calicheamicins and duocarmycins.25–29
Since the hypervariable regions of the antibody are expected to bind to the cancer-specific antigen, they are not advisable as conjugation sites. Instead, the conjugation of a linker to an antibody takes place at the more preserved regions of the antibody, at solvent-accessible reactive amino acids.30,31 Two amino acids typical for conjugation with drugs are lysines and cysteines, the latter being exposed after the reduction of the interchain disulfide bonds of the antibody. Even though the number of drugs linked to an antibody is usually 0–8, conjugation can occur at 40 different lysines and at 8 different cysteines per antibody. The numerous possible conjugation sites, in combination with the fact that the drug-to-antibody ratio (DAR) can be larger than 1, means that numerous different ADCs can be generated from the same antibody, drug, and linker. In addition, controlling the site and stoichiometry of drug conjugation to the antibody is not easy and typically results in heterogeneous mixtures of ADCs. Heterogeneous ADCs do not have well-defined
Drug development is a complicated, time-consuming, and expensive process. Bioinformatics is a multidisciplinary scientific field that aids this process, giving the opportunity to model biologically active molecules and make estimations about their properties. For example, computational drug design and molecular mechanics methods, although imperfect, aim to predict whether a given molecule will bind to a target and with what binding affinity. The information obtained through those computational methods, even though it is not exact, accelerates the process of drug discovery. However, even though a number of computational tools for drug design exist, the specialized field of ADC computational design has been explored to a much smaller degree.38–45
The
Methods
Conjugation Process.
In this section, the process of the computational conjugation of the antibody, the linker, and the drug is described in more detail. As previously explained, the goal of the program developed is to produce the PDB file of an ADC molecule, given the PDB files of an antibody, a linker, and a drug. The drug and the linker are reconfigured via rotation and translation in order to be brought in positions appropriate for the hydrogen bonding to occur between the linker and the drug, as well as between the linker and a surface lysine of the antibody. The change in the configuration of the antibody, the linker, and the drug was executed computationally by changing their atomic coordinates. First, the drug was rotated and translated in relation to the linker, while the linker remained stable. The two files were subsequently merged into a linker–drug conjugate PDB file. Second, the linker–drug conjugate was rotated and translated in relation to the selected surface lysine of the antibody, while the antibody remained stable. Similarly, the two files were merged to produce the final PDB file of the antibody–drug conjugate.
In order for the reconfiguration of the molecules to be correct, the changes in their atomic coordinates had to preserve the initial lines and distances between the atoms, which was accomplished with the use of an affine transformation. An affine transformation is any transformation that preserves collinearity, which means that all points placed on a line before the transformation still lie on a line after the transformation. It also conserves the ratios of distances, which means that the midpoint of a line segment remains the midpoint after the application of the transformation.47,48 According to the homogeneous transformation matrix defined in Ref. 49, the change in the position of an atom can be described with the following equations,
In order to rotate a molecule in relation to another molecule, an axis was defined for each of them. The axis of the linker was defined as the line connecting its two most distant non-hydrogen atoms, in order to be representative of the shape of the molecule. For that purpose, the distance between every pair of heavy atoms of the linker was calculated, and the pair with the largest distance was chosen. These two atoms also participated in the connections of the linker with the drug and the antibody, as will be explained in the following paragraph. The axis of the drug was defined as the line connecting the hydrogen atom of the drug that took part in the hydrogen bond with the linker and the electronegative atom of the drug covalently bonded to it. The axis of the linker–drug conjugate was defined as the line connecting the two most distant non-hydrogen atoms of the linker–drug conjugate, similarly to the axis of the linker. Finally, the axis of the lysine was defined as the line connecting the alpha carbon of the lysine and the nitrogen atom of its side chain, since this line is directed toward the exterior of the antibody, which is the desired direction for the linker–drug conjugate.
Similarly, in order to translate a molecule in relation to another molecule, a distance between them had to be defined. The distance between the drug and the linker was defined as the distance between the hydrogen atom of the drug that participates in the hydrogen bond with the linker and the linker atom that participates in the hydrogen bond with the drug, which was selected to be one of the two most distant atoms of the linker. The distance between the linker–drug conjugate and the antibody is defined as the distance between the linker atom that participates in the hydrogen bond with the antibody, which was selected to be one of the two most distant atoms of the linker–drug conjugate, the one that belongs to the linker, and a hydrogen atom covalently bonded to the nitrogen of the lysine side chain.
As a molecule (ie, the drug or the linker–drug conjugate) rotates in relation to a stable molecule (ie, the linker or the antibody, respectively), or equivalently as the values of the variables
According to the Steiner–Saenger definition of the hydrogen bond, a hydrogen bond is any cohesive interaction X—H•••A where H carries a positive and A a negative (partial or full) charge and the charge on x is more negative than on H. The molecule C15N, which represents a common non-cleavable linker, can form hydrogen bonds with a drug as well as a lysine amino acid of an antibody. In particular, every hydrogen atom of the drug that is connected to a heavy atom more electronegative than hydrogen can form a hydrogen bond with a heavy atom of the linker, such as carbon. Similarly, the nitrogen atom of the lysine side chain and the hydrogen atom covalently bonded to it can form a hydrogen bond with a carbon atom of the linker. After a molecule has been rotated and translated in relation to the stable molecule, the hydrogen bond is simulated computationally by deleting the hydrogen atom of the hydrogen donor from the PDB file, before the two files are merged into one PDB file. 50
The final ADC molecule obtains a realistic conformation, as ensured by a final minimization step using the molecular dynamics package Gromacs, and specifically the GROMOS bonding library. 51
Results
The application of the process described in the Methods section resulted in the generation of a number of ADC PDB files. Since the antibody and the drug, as well as the conjugation sites of the antibody and the drug, can vary, the number of different combinations of initial molecules generating distinct ADCs is quite large.
Here, four different ADCs are demonstrated as results of this process. The two conjugates depicted in Figure 2 contain the antibody with PDB id 4GAG. In the left conjugate, the linker has formed a hydrogen bond with the surface lysine with residue sequence number 147, located at the antibody chain with id L, while the drug with sequence number 600 (nci_600.pdb) has formed a hydrogen bond with the linker. In the right conjugate, the linker has been connected with a hydrogen bond to the surface lysine with residue sequence number 115, located at the antibody chain with id H, while the drug with sequence number 450 (nci_450.pdb) has formed a hydrogen bond with the linker. The conjugation areas of these two ADCs are depicted in Figure 3, from a different angle and a smaller distance.

General scheme of the molecules and the bonds forming an ADC, 9 according to the method described in this paper.

ADCs composed by the antibody with pdb id 4GAG, the linker molecule C15N, and a drug from the Open NCI database. Left: the drug with sequence number 600 (nci_600.pdb) has been conjugated through the linker molecule C15N to the surface lysine with residue sequence number 147, at the chain with id L. Right: the drug with sequence number 450 (nci_450.pdb) has been conjugated through the linker molecule C15N to the surface lysine with residue sequence number 115, at the chain with id H.

Zoomed images of the ADCs depicted in Figure 2, from different angles for better view of the conjugation. (
In Figure 4, two different ADCs are illustrated, both containing the antibody with PDB id 4GAJ. In the left conjugate, the linker has formed a hydrogen bond with the surface lysine with residue sequence number 75, located at the antibody chain with id H, while the drug with sequence number 700 (nci_700.pdb) has formed a hydrogen bond with the linker. In the right conjugate, the linker has been connected to the surface lysine with residue sequence number 209, located at the chain with id H, with a hydrogen bond. Also, the drug with sequence number 14 (nci_14.pdb) has formed a hydrogen bond with the linker. The conjugation areas of these two ADCs are depicted in Figure 5, from a different angle and a smaller distance (Figs. 2–6).

ADCs composed by the antibody with pdb id 4GAJ, the linker molecule C15N, and a drug from the Open NCI database. Left: the drug with sequence number 700 (nci_700.pdb) has been conjugated through the linker molecule C15N to the surface lysine with residue sequence number 75, at the chain with id H. Right: the drug with sequence number 14 (nci_14.pdb) has been conjugated through the linker molecule C15N to the surface lysine with residue sequence number 209, at the chain with id H.

Zoomed images of the ADCs depicted in Figure 4, from different angles for better view of the conjugation. (

Discussion
Computational Prospects in the Field of Antibody–Drug Conjugation.
In this paper, a process of computational construction of ADCs has been described, choosing surface lysines as the antibody conjugation sites. For this purpose, drug PDB files from the Open National Cancer Institute Database, antibody PDB files from the RCSB Protein Data Bank, and the PDB file of the non-cleavable linker molecule C15N were used. Between the linker and the drug, as well as between the antibody lysine and the linker–drug conjugate, hydrogen bonds were formed. The change in the configuration of the molecules was accomplished using the affine transformation.
The method described in this paper represents one of the many ways a drug can be conjugated with an antibody. Parameters such as the amino acid participating in the conjugation, the DAR, the type of linker and drug used, as well as the chemical bonds formed between the components of the conjugate can vary. Besides lysine, other natural or synthetic amino acids such as cysteine, selenocysteine, or acetylphenylalanine can be connected with a linker–drug conjugate. In addition, other non-cleavable or cleavable linkers can be used, while maytansinoids, auristatins, calicheamicins, and duocarmycins can be used as cytotoxic substances.
An example of a different conjugation technique is the formation of a covalent bond between an amino acid and a reactive functional group pendent to the linker–drug conjugate. The direct conjugation of a lysine with a linker–drug conjugate through amide bonds is also possible, using an
The need for the development of homogeneous ADCs led to an increasing interest in site-specific conjugation, through which the number of conjugated drugs and the site of conjugation can be controlled. Site-specific conjugation has been achieved with a number of methods, such as cysteine engineering, amino acid insertion, enzymatic conjugation, and glycoengineering. THIOMAB–drug conjugates (TDCs) are a class of engineered antibodies in which cysteines have been introduced into the amino acid sequence by single point mutations. The ideal sites for mutation are identified with phage display techniques, while the engineered antibody is reduced and reoxidized to present the thiols of the mutated cysteines for conjugation. Conjugation using the engineered cysteine site leaves the antigen-binding regions unaffected and is quite homogeneous, with greater than 92% of the engineered antibody (THIOMAB) conjugates containing two drugs. Another approach is the replacement of interchain cysteines with serines in order to reduce the number of potential conjugation sites. Site-specific conjugation has also been accomplished by inserting non-native amino acids into the antibody such as selenocysteine, acetylphenylalanine,
Another aspect that could be examined is the evaluation of the computationally produced ADCs, perhaps taking into account the existing analytical technologies that assess the physicochemical properties of real ADCs. Properties of real ADCs, such as the DAR, drug distribution, aggregation, and fragmentation, are evaluated with a variety of techniques. For instance, UV/VIS spectroscopic analysis and hydrophobic interaction chromatography (HIC) are two methods used to measure the DAR, while technologies such as mass spectroscopic analysis and chromatographic analysis are used to evaluate drug distribution. Therefore, computational techniques could be developed in order to perform the equivalent analytical evaluations on computational ADCs, in order to further aid the design and manufacturing processes of real ADCs.10,61 In addition, computational techniques that make estimations about the pharmacokinetic and therapeutic properties of a given ADC would be beneficial to the process of ADC design, since an ADC could not only be constructed, but also evaluated
In conclusion, ADCs have emerged as a promising anticancer targeted therapy that aims to treat cancer by selectively attacking cancer cells while leaving normal cells unaffected for the most part. In comparison to conventional chemotherapy, this targeted mechanism of action has fewer side effects and enables the use of more potent drugs. The fact that ADCs are formed from three molecules instead of one makes their design and manufacturing even more complex than the already complicated discovery process of standalone drugs. However, it also results in an increased number of possible combinations of antibody conformation, linker, drug, and conjugation method. Therefore, the scientific field of ADCs provides vast research opportunities, not only in biology but also in bioinformatics, since specialized computational methods could benefit considerably the evolution of TAT.
Author Contributions
Conceived and designed the experiments: AF, DV. Analyzed the data: AF, DV. Wrote the first draft of the manuscript: AF. Contributed to the writing of the manuscript: DV, GM, SK. Agree with manuscript results and conclusions: AF, DV, GM, SK. Jointly developed the structure and arguments for the paper: AF, DV. Made critical revisions and approved final version: AF, DV, GM, SK. All authors reviewed and approved of the final manuscript.
