In genetics, crosslinking of DNA occurs when various exogenous or endogenous agents react with two nucleotides of DNA, forming a covalent linkage between them. This crosslink can occur within the same strand (intrastrand) or between opposite strands of double-stranded DNA (interstrand). These adducts interfere with cellular metabolism, such as DNA replication and transcription, triggering cell death. These crosslinks can, however, be repaired through excision or recombination pathways.
DNA crosslinking is also useful in chemotherapy, where it is used to target cancerous cells for apoptosis, and in studying how proteins interact with DNA.
Many characterized crosslinking agents have two independently reactive groups within the same molecule, each of which is able to bind a nucleotide residue of DNA. These agents are classified by their source of origin as either exogenous or endogenous. Exogenous crosslinking agents are chemicals and compounds, both natural and synthetic, that stem from environmental exposures such as pharmaceuticals, cigarette smoke, or automotive exhaust. Endogenous crosslinking agents are compounds and metabolites that arise from cellular or biochemical pathways within a cell or organism.
Nitrogen mustards are exogenous alkylating agents that react with the N7 position of guanine. These compounds have a bis(2-chloroethyl)amine core structure with a variable R-group; the two reactive functional groups alkylate nucleobases and form a crosslink lesion. These agents preferentially form a 1,3 5'-d(GNC) interstrand crosslink. The crosslink slightly bends the DNA duplex to accommodate the agent within the helix. These agents are often administered as pharmaceuticals and are used in cytotoxic chemotherapy.
Cisplatin (cis-diamminedichloroplatinum(II)) and its derivatives mostly act on adjacent guanines at their N7 positions. The planar compound links to nucleobases through water displacement of one or both of its chloride groups, allowing cisplatin to form monoadducts to DNA or RNA, intrastrand DNA crosslinks, interstrand DNA crosslinks, and DNA-protein crosslinks. When cisplatin generates DNA crosslinks, it most frequently forms 1,2-intrastrand crosslinks (5'-GG), but also forms 1,3-intrastrand crosslinks (5'-GNG) at lower percentages. When cisplatin forms interstrand crosslinks (5'-GC), the DNA helix is severely distorted because of the shortened distance between guanines on opposite strands and a cytosine that is flipped out of the helix as a consequence of the GG interaction. Like nitrogen mustards, cisplatin is used frequently in chemotherapy, especially for testicular and ovarian cancers.
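The sequence preferences described for nitrogen mustards and cisplatin can be illustrated with a short Python sketch that scans one strand of a sequence for the motifs named in the text (5'-GNC, 5'-GG, 5'-GNG and 5'-GC). The function name `find_candidate_sites`, the `MOTIFS` table and the example sequence are illustrative assumptions. A motif match only marks a position where such a crosslink could in principle form; it says nothing about whether one actually forms there.

```python
import re

# Motifs quoted in the text, read 5'->3' on one strand (N = any base).
# The dictionary itself is an illustrative assumption, not a published table.
MOTIFS = {
    "nitrogen mustard 1,3-interstrand (5'-GNC)": "G[ACGT]C",
    "cisplatin 1,2-intrastrand (5'-GG)": "GG",
    "cisplatin 1,3-intrastrand (5'-GNG)": "G[ACGT]G",
    "cisplatin interstrand (5'-GC)": "GC",
}


def find_candidate_sites(sequence):
    """Return {motif name: [0-based start positions]} for a single strand."""
    seq = sequence.upper()
    sites = {}
    for name, pattern in MOTIFS.items():
        # A zero-width lookahead reports overlapping matches (e.g. in GGG).
        regex = re.compile(f"(?={pattern})")
        sites[name] = [match.start() for match in regex.finditer(seq)]
    return sites


if __name__ == "__main__":
    # Hypothetical example sequence, chosen only for demonstration.
    for name, positions in find_candidate_sites("ATGGCGTGACGGGC").items():
        print(f"{name}: {positions}")
```

For the interstrand motifs, a match on the scanned strand implies the partner guanine lies on the complementary strand, paired with the C of the motif.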
Chloroethylnitrosoureas (CENUs), such as carmustine (BCNU), are crosslinking agents that are widely used in chemotherapy, particularly for brain tumors. These agents differ from other crosslinkers in that they alkylate the O6 of guanine to form an O6-ethanoguanine. This intermediate then leads to an interstrand crosslink between the bases of a GC base pair. These crosslinking agents cause only small distortions to the DNA helix because of their smaller size.
Psoralens are natural compounds (furocoumarins) present in plants. These compounds intercalate into DNA at 5'-AT sequence sites and form thymidine adducts when activated by ultraviolet-A (UV-A) radiation. These covalent adducts are formed by linking the 3,4 (pyrone) or 4',5' (furan) edge of psoralen to the 5,6 double bond of thymine. Psoralens can form two types of monoadducts and one diadduct (an interstrand crosslink) with thymine. These adducts result in local distortions to DNA at the site of intercalation. Psoralens are used in the medical treatment of skin diseases, such as psoriasis and vitiligo.
Mitomycin C (MMC) belongs to a class of antibiotics that are used broadly in chemotherapy, often for gastrointestinal cancers. Mitomycin C can act as a crosslinker only after its quinone ring has been reduced. Once activated in this way, it can alkylate two dGs, forming a 5'-GC interstrand crosslink through the exocyclic amines of each nucleobase. Mitomycin can also form monoadducts and intrastrand crosslinks with DNA. The interstrand crosslinks of mitomycin C form in the minor groove of DNA, inducing a moderate widening or stretching of the DNA helix to accommodate the molecule between the two strands.
Nitrous acid is formed as a byproduct in the stomach from dietary sources of nitrites and can lead to crosslink lesions in DNA by converting amino groups in DNA to carbonyls. This type of lesion occurs most frequently between two guanosines, with one in four deaminated guanosines resulting in an interstrand crosslink. The crosslinks form at the exocyclic N2 amino group of guanine at 5'-CG sequences. This lesion mildly distorts the double helix.
Bifunctional aldehydes are reactive chemicals that are formed endogenously via lipid peroxidation and prostaglandin biosynthesis. They form etheno adducts that undergo rearrangements to produce crosslinks between opposite strands of DNA. Malondialdehyde is a prototypical example that can crosslink DNA via two exocyclic guanine amino groups. Other aldehydes, such as formaldehyde and acetaldehyde, can introduce interstrand crosslinks and often act as exogenous agents, as they are found in many processed foods. α,β-Unsaturated aldehydes, such as acrolein and crotonaldehyde, are often found in pesticides, tobacco smoke, and automotive exhaust, and are further exogenous agents that may induce DNA crosslinks. Unlike crosslinking by other agents, aldehyde-induced crosslinking is an intrinsically reversible process. NMR structures of interstrand crosslinks formed by these agents show that a 5'-GC adduct results in minor distortion to DNA, whereas a 5'-CG adduct destabilizes the helix and induces a bend and twist in the DNA.
DNA crosslinking lesions can also form under conditions of oxidative stress, in which free oxygen radicals generate reactive intermediates in DNA; these lesions have been implicated in aging and cancer. Tandem DNA lesions are formed at a substantial frequency by ionizing radiation and metal-catalyzed H2O2 reactions. Under anoxic conditions, the predominant double-base lesion is an intrastrand species in which the C8 of guanine is linked to the 5-methyl group of an adjacent 3'-thymine (G[8,5-Me]T).
DNA crosslinks generally cause loss of overlapping sequence information from the two strands of DNA. Therefore, accurate repair of the damage depends on retrieving the lost information from an undamaged homologous chromosome in the same cell. Retrieval can occur by pairing with a sister chromatid produced during a preceding round of replication. In a diploid cell retrieval may also occur by pairing with a non-sister homologous chromosome, as occurs especially during meiosis. Once pairing has occurred, the crosslink can be removed and correct information introduced into the damaged chromosome by homologous recombination.
Cleavage of the bond between a deoxyribose sugar in DNA's sugar-phosphate backbone and its associated nucleobase leaves an abasic site in double-stranded DNA. These abasic sites are often generated as an intermediate and then restored in base excision repair (BER). However, if these sites are allowed to persist, they can inhibit DNA replication and transcription. Abasic sites can react with amine groups on proteins to form DNA-protein crosslinks or with exocyclic amines of other nucleobases to form interstrand crosslinks. To prevent interstrand or DNA-protein crosslinks, enzymes from the BER pathway tightly bind the abasic site and sequester it from nearby reactive groups, as demonstrated in human alkyladenine DNA glycosylase (AAG) and E. coli 3-methyladenine DNA glycosylase II (AlkA).
In vitro evidence has demonstrated that the interstrand crosslink induced by an abasic site (DOB-ICL) is a replication-blocking and highly miscoding lesion. Compared with several other translesion synthesis (TLS) polymerases examined, pol η is likely to contribute to the TLS-mediated repair of the DOB-ICL in vivo. The bypass activity of several DNA polymerases has been investigated using O6-2'-deoxyguanosine-butylene-O6-2'-deoxyguanosine (O6-dG-C4-O6-dG) DNA lesions, which are chemically stable structures. The results demonstrated that pol η exhibited the highest bypass activity; however, 70% of the bypass products were mutagenic, containing substitutions or deletions. An increase in the size of unhooked repair intermediates elevates the frequency of deletion mutations.
Treatment of E. coli with psoralen-plus-UV light (PUVA) produces interstrand crosslinks in the cells' DNA. Cole et al. and Sinden and Cole presented evidence that a homologous recombinational repair process requiring the products of genes uvrA, uvrB, and recA can remove these crosslinks in E. coli. This process appears to be quite efficient. Even though one or two unrepaired crosslinks are sufficient to inactivate a cell, a wild-type bacterial cell can repair and therefore recover from 53 to 71 psoralen crosslinks. Eukaryotic yeast cells are also inactivated by one remaining crosslink, but wild-type yeast cells can recover from 120 to 200 crosslinks.
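The contrast between repair-deficient and wild-type cells can be made concrete with a simple single-hit survival calculation. The sketch below is an illustration only and rests on assumptions not stated in the source: that crosslinks are distributed among cells according to a Poisson distribution, and that the quoted figures (about 1 crosslink for a cell unable to repair, 53 to 71 for wild-type E. coli, 120 to 200 for wild-type yeast) can be treated as the mean number of crosslinks corresponding to one lethal hit. The function name `surviving_fraction` and its parameters are hypothetical.

```python
import math


def surviving_fraction(mean_crosslinks, crosslinks_per_lethal_hit):
    """Fraction of cells with zero lethal hits under a Poisson model.

    Assumption (not from the source): the mean number of lethal hits per
    cell equals mean_crosslinks / crosslinks_per_lethal_hit.
    """
    if mean_crosslinks < 0:
        raise ValueError("mean_crosslinks must be non-negative")
    if crosslinks_per_lethal_hit <= 0:
        raise ValueError("crosslinks_per_lethal_hit must be positive")
    lethal_hits = mean_crosslinks / crosslinks_per_lethal_hit
    # Poisson probability of observing zero events with this mean.
    return math.exp(-lethal_hits)


if __name__ == "__main__":
    mean = 10  # hypothetical average number of crosslinks per cell
    for label, per_hit in [
        ("repair-deficient cell", 1),
        ("wild-type E. coli (lower bound)", 53),
        ("wild-type yeast (lower bound)", 120),
    ]:
        print(f"{label}: {surviving_fraction(mean, per_hit):.3g}")
```

Under these assumptions, the dose at which the mean number of crosslinks equals the tolerated number leaves a surviving fraction of e^-1, or roughly 37%.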
Crosslinking of DNA and protein
Biochemical interaction methods
DNA-protein crosslinking can be caused by a variety of chemical and physical agents, including transition metals, ionizing radiation, and endogenous aldehydes, in addition to chemotherapeutic agents.
Like DNA-DNA crosslinks, DNA-protein crosslinks are lesions that frequently arise in cells damaged by UV radiation. UV exposure can trigger reactions that cause DNA and the proteins in contact with it to crosslink. These crosslinks are very bulky and complex lesions. They primarily occur in areas of the chromosomes that are undergoing DNA replication, and they interfere with cellular processes.
As structure-identification methods have advanced, the ability to measure interactions between DNA and protein has become a requirement for fully understanding the underlying biochemical processes. The structure of DNA-protein complexes can be mapped by photocrosslinking, which is the photoinduced formation of a covalent bond between two macromolecules or between two different parts of one macromolecule. The methodology involves covalently linking a DNA-binding motif of the target sequence-specific DNA-binding protein with a photoactivatable crosslinking agent capable of reacting with DNA nucleotides when exposed to UV. This method provides information on the interaction between the DNA and protein in the crosslink.
DNA repair pathways can result in the formation of tumor cells. Cancer treatments have been engineered using DNA crosslinking agents that interact with the nitrogenous bases of DNA to block DNA replication. These crosslinking agents can act as single-agent therapies by targeting and destroying specific nucleotides in cancerous cells. The result is to stop the cell cycle and growth of cancer cells; because this approach inhibits specific DNA repair pathways, it has the potential advantage of causing fewer side effects.
In humans, the leading cause of cancer deaths worldwide is lung cancer, including non-small cell lung carcinoma (NSCLC), which accounts for 85% of all lung cancer cases in the United States. Individuals with NSCLC are often treated with therapeutic platinum compounds (e.g. cisplatin, carboplatin or oxaliplatin) that cause interstrand DNA crosslinks. Among individuals with NSCLC, low expression of the breast cancer 1 gene (BRCA1) in the primary tumor has correlated with improved survival after platinum-containing chemotherapy. This correlation suggests that low BRCA1 in the cancer, and the consequent low level of DNA repair, makes the cancer vulnerable to treatment with DNA crosslinking agents. High BRCA1 may protect cancer cells by acting in the homologous recombinational repair pathway that removes the damage introduced into DNA by the platinum drugs. The level of BRCA1 expression is potentially an important tool for tailoring chemotherapy in lung cancer management.
Clinical chemotherapeutics can induce enzymatic and non-enzymatic DNA-protein crosslinks. Platinum derivatives, such as cisplatin and oxaliplatin, are one example: they create non-enzymatic DNA-protein crosslinks through non-specific crosslinking of chromatin-interacting proteins to DNA. Other therapeutic agents can induce crosslinks either by stabilizing covalent DNA–protein reaction intermediates or by creating a pseudosubstrate, which traps the enzyme on DNA. Camptothecin derivatives, such as irinotecan and topotecan, target and trap DNA topoisomerase 1 (TOP1) by intercalating within the enzyme–DNA interface. Because the toxicity of these drugs depends on TOP1 trapping, cellular sensitivity to these compounds depends directly on TOP1 expression levels. As a result, these drugs function as enzyme poisons rather than inhibitors. The same approach can be applied to treat tumor cells by using TOP2 enzyme poisons.
Rudd GN, Hartley JA, Souhami RL (1995). "Persistence of cisplatin-induced DNA interstrand crosslinking in peripheral blood mononuclear cells from elderly and young individuals". Cancer Chemother. Pharmacol. 35 (4): 323–6. doi:10.1007/BF00689452. PMID 7828275. S2CID 24036376.
Wu Q, Christensen LA, Legerski RJ, Vasquez KM (2005). "Mismatch repair participates in error-free processing of DNA interstrand crosslinks in human cells". EMBO Reports. 6 (6): 551–557.
Kirchner JJ, Sigurdsson ST, Hopkins PB (1992). "Interstrand cross-linking of duplex DNA by nitrous acid: covalent structure of the dG-to-dG cross-link at the sequence 5'-CG". Journal of the American Chemical Society. 114 (11): 4021–4027. doi:10.1021/ja00037a001. ISSN 0002-7863.
Dooley PA, Zhang M, Korbel GA, Nechev LV, Harris CM, Stone MP, Harris TM (2003). "NMR determination of the conformation of a trimethylene interstrand cross-link in an oligodeoxynucleotide duplex containing a 5'-d(GpC) motif". Journal of the American Chemical Society. 125 (1): 62–72. doi:10.1021/ja0207798. ISSN 0002-7863. PMID 12515507.
Box HC, Budzinski EE, Dawidzik JD, Wallace JC, Evans MS, Gobey JS (1996). "Radiation-Induced Formation of a Crosslink between Base Moieties of Deoxyguanosine and Thymidine in Deoxygenated Solutions of d(CpGpTpA)". Radiation Research. 145 (5): 641–643. Bibcode:1996RadR..145..641B. doi:10.2307/3579285. JSTOR 3579285. PMID 8619032.
Cole RS, Levitan D, Sinden RR (1976). "Removal of psoralen interstrand cross-links from DNA of Escherichia coli: mechanism and genetic control". J. Mol. Biol. 103 (1): 39–59. doi:10.1016/0022-2836(76)90051-6. PMID 785009.