Figures
Abstract
Nucleic acids rich in guanine are able to fold into unique structures known as G-quadruplexes. G-quadruplexes consist of four tracts of guanylates arranged in parallel or antiparallel strands that are aligned in stacked G-quartet planes. The structure is further stabilized by Hoogsteen hydrogen bonds and monovalent cations centered between the planes. RHAU (RNA helicase associated with AU-rich element) is a member of the ATP-dependent DExH/D family of RNA helicases and can bind and resolve G-quadruplexes. RHAU contains a core helicase domain with an N-terminal extension that enables recognition and full binding affinity to RNA and DNA G-quadruplexes. PITX1, a member of the bicoid class of homeobox proteins, is a transcriptional activator active during development of vertebrates, chiefly in the anterior pituitary gland and several other organs. We have previously demonstrated that RHAU regulates PITX1 levels through interaction with G-quadruplexes at the 3’-end of the PITX1 mRNA. To understand the structural basis of G-quadruplex recognition by RHAU, we characterize a purified minimal PITX1 G-quadruplex using a variety of biophysical techniques including electrophoretic mobility shift assays, UV-VIS spectroscopy, circular dichroism, dynamic light scattering, small angle X-ray scattering and nuclear magnetic resonance spectroscopy. Our biophysical analysis provides evidence that the RNA G-quadruplex, but not its DNA counterpart, can adopt a parallel orientation, and that only the RNA can interact with N-terminal domain of RHAU via the tetrad face of the G-quadruplex. This work extends our insight into how the N-terminal region of RHAU recognizes parallel G-quadruplexes.
Citation: Ariyo EO, Booy EP, Patel TR, Dzananovic E, McRae EK, Meier M, et al. (2015) Biophysical Characterization of G-Quadruplex Recognition in the PITX1 mRNA by the Specificity Domain of the Helicase RHAU. PLoS ONE 10(12): e0144510. https://doi.org/10.1371/journal.pone.0144510
Editor: Heidar-Ali Tajmir-Riahi, University of Quebec at Trois-Rivieres, CANADA
Received: October 14, 2015; Accepted: November 19, 2015; Published: December 9, 2015
Copyright: © 2015 Ariyo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Data Availability: All relevant data are within the paper.
Funding: Funded by the Cancer Research Society (Canada) #18206. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
G-quadruplexes (G4) are four-stranded structures of DNA or RNA in which one guanine base from each chain associates via cyclic Hoogsteen [1] hydrogen bonding to form planar quartets. Two or more such quartets hydrophobically stack on top of each other to form the G4 and are stabilized by the presence of a mandatory monovalent cation (typically K+) in the center between the planes [2]. G4s in DNA and RNA can adopt a parallel, anti-parallel, or hybrid (mixture of both parallel and antiparallel) strand orientation [3]. Biophysical studies and high-resolution structures of RNA G4s reveal that they are thermodynamically more stable in vitro than their DNA counterparts under near-physiological conditions because of the 2′-OH on the ribose sugar that permits additional hydrogen bonds to form. As a result, RNA G4s preferentially adopt a parallel conformation over an antiparallel one [4–6].
A survey of the evolutionary conservation of DNA and RNA motifs revealed that G4 motifs are significantly conserved in the genomes of living organisms [7–10], and it was recently demonstrated that G4 formation is regulated dynamically during cell-cycle progression [11, 12]. Accumulating evidence suggests an important role of G4 structures in regulating gene expression [10]. Genome-wide computational analysis has identified more than 300,000 potential intramolecular G4-forming sequences in the human genome [9, 13] and revealed a higher prevalence of these sequences in functional genomic regions such as telomeres, promoters [10, 14], untranslated regions (UTRs) [15, 16] and introns [17]. Taken together, these observations suggest that G4 structures participate in regulating myriad biological processes.
DNA G4 recognition and remodeling by helicases such as Fanconi anaemia group J protein (FANCJ), Bloom syndrome protein (BLM), DNA repair protein (REV1) and Werner’s syndrome protein (WRN) have been reported [18–21]. RNA Helicase Associated with AU-rich element (RHAU, DHX36, G4R1) is a member of the human ATP-dependent DEAH-box family of RNA helicases, although DNA G4 helicase activity has also been observed with this enzyme [22, 23]. RHAU uses a local, non-processive mechanism to unwind G4s, similar to that of eukaryotic initiation factor 4A on double-stranded substrates [24, 25]. RHAU has nanomolar to sub-nanomolar affinity for G4s, and orders of magnitude weaker affinity for other observed nucleic acid conformations [26–31]. Furthermore, RHAU has 100-fold higher affinity for parallel relative to non-parallel G4s [24, 28, 30]. Helicase activity is highly sensitive to G4 stability, with an inverse correlation observed [24]. Based on domain conservation with other helicases, RHAU’s core DEAH-box helicase domain (residues 210–614) is flanked by an N-terminal G4-recognition domain (residues 1–210) and C-terminal helicase associated domains (residues 670–1008) that have yet to be fully characterized [32]. G4 specificity is mediated by the RHAU-specific motif (RSM), a 13-residue stretch (residues 54–66) in an N-terminal subdomain that is necessary, but not sufficient for full G4 binding affinity [27, 33]. A truncation of the full-length protein, RHAU53-105, adopts a defined and extended conformation in solution, orienting the RSM at one end [31]. RHAU53-105 retains both nanomolar affinity for G4s in vitro and the ability to outcompete endogenous RHAU for G4 targets in a cellular context [26, 31]. Investigation of RHAU53-105 in complex with RNA G4 from human telomerase RNA by NMR, SAXS, and complimentary biophysical approaches suggested interaction with the G-tetrad face (as opposed to the sugar-phosphate backbone) as the recognition surface for the RSM [31]. This mode of recognition was partially supported by a recent high-resolution structure of an 18 amino acid peptide (RHAU53-70) in complex with a parallel DNA G4 showing 4 hydrophobic amino acids (G59, I62, G63, A67) mediating interaction on the G-tetrad faces [30]. While the structure also suggests electrostatic interactions with the phosphate backbone, a previous study demonstrated no significant impact of charged amino acid mutations in the RSM [33]. Differences between DNA and RNA G4 binding by RHAU have also previously been reported, likely owing to the conformational restraints imposed by the 2’-OH in RNA [28, 30, 31].
To expand our understanding of biologically relevant RNA G4 recognition by the helicase RHAU, we previously performed an RNA co-immunoprecipitation screen and identified the messenger RNA (mRNA) for the protein Pituitary homeobox 1 (PITX1, P-OTX, backfoot) [26]. PITX1 functions as a transcription factor that plays a pivotal role in the differentiation of the developing pituitary gland, craniofacial structures and hind limbs in early embryonal development [34–37]. Recently, malformations in the lower limbs could be attributed to mutations in the PITX gene [38]. Deletions in PITX1 cause a spectrum of lower-limb malformations including mirror image polydactyly. PITX1 expression is down regulated in a number of tumor types including lung, colorectal, gastric and esophageal cancer and reduced PITX1 expression has been correlated with decreased overall patient survival [39–41]. Most interestingly, the PITX1 mRNA possesses three distinct G4 forming sequences in the 3′-untranslated region (UTR) of its mRNA (Q1: PITX11371-1400, Q2: PITX11901-1930, and Q3: PITX12044-2079). These G4s play roles in the recruitment of RHAU to the PITX1 mRNA and ultimately regulate PITX1 protein translation [26]. In cell lysates and with purified components, both RHAU and RHAU53-105 can interact with Q1, Q2, or Q3. Here, we characterize the Q2RNA/RHAU53-105 complex using a combination of electrophoretic mobility shift assays, UV-VIS spectroscopy, circular dichroism, dynamic light scattering, small angle X-ray scattering (SAXS) and nuclear magnetic resonance spectroscopy. Our integrated approach suggests that the RSM recognizes the planar guanine quartet face of parallel RNA G4s.
Materials and Methods
G4 preparation
All synthetic RNA and DNA were ordered from Integrated DNA Technologies (Coralville, Iowa), and were certified by the manufacturer by mass spectrometry and provided desalted. PITX11901-1930 RNA (Q2RNA) and PITX11901-1930 DNA (Q2DNA) were dissolved in 10 mM Tris (pH 7.5), 100 mM KCl, 1 mM EDTA at a concentration of 5 μM. Samples were heated to 95°C for 5 min and cooled slowly to room temperature to form G4. Nucleic acid in conformations other than G4 were then removed by size exclusion chromatography (SEC) on a HiLoad Superdex 75 26/60 in 10 mM Tris (pH 7.5), 100 mM KCl, and 1 mM EDTA (ÄKTA, GE Healthcare) as described previously (5 mL load volume) [42]. Sample purity was confirmed by both native and denaturing gel electrophoresis separately (15% native or denaturing Tris/Borate/EDTA polyacrylamide gels using urea as a denaturant). Quadruplex-forming nucleic acids were detected with N-methyl mesoporphyrin IX (NMM) staining (Frontier Scientific, Logan, UT, USA). Extinction coefficients (260 nm) were calculated from the sequence using IDT SciTools (OligoAnalyzer3.1, Integrated DNA Technologies) and corrected for hyperchromicity using the absorption spectra at 20°C and 90°C: Q2RNA, 253550 M-1cm-1; Q2DNA 264200 M-1cm-1.
Protein expression and purification
RHAU53-105 and full length RHAU were expressed and purified as described previously [31, 43]. After removal of the hexahistidine affinity tag by thrombin digestion, the protein was further purified by size exclusion chromatography on a HiLoad Superdex 75 26/60 (ÄKTA GE Healthcare, Mississauga, Canada) in 10 mM HEPES (pH 7.5), 150 mM NaCl (5 mL load volume). Isotopically enriched 15N-labelled RHAU53-105 was overexpressed in M9 minimal medium according to the method described previously [31]. The extinction coefficient (7020 M-1cm-1) was used to determine the protein concentration by measuring absorbance at 280 nm, and confirmed by Bradford assay.
G4-protein complex preparation
G4s and RHAU53-105 were diluted to 10 μM in the corresponding buffers described above, mixed in an equimolar ratio, and agitated slowly on a rotator for 15 min at room temperature. Complexes were separated from individual components by SEC on a HiLoad Superdex 75 26/60 column (GE-Healthcare, Mississauga, Canada) for purification in 10 mM Tris (pH 7.5), 100 mM KCl, and 1 mM EDTA. Both protein and nucleic acid components in the complex were confirmed by gel electrophoresis following complex purification. The concentration of the complex was determined by UV absorption using the extinction coefficient ε260nm of Q2 RNA, since the nucleic acid dominates the spectrum.
Electrophoretic mobility shift assays (EMSA)
EMSAs were performed by combining RNA (150 nM) with increasing concentrations of RHAU or RHAU53-105 (0–700 nM) in 50 mM Tris-acetate (pH7.8), 100 mM KCl, 10 mM NaCl, 3 mM MgCl2, 70 mM glycine, 10% glycerol, and incubating them at room temperature for 15 minutes. RNA-protein complexes were resolved by native 15% polyacrylamide gels (29:1 acrylamide:bis ratio) in 0.5x Tris/Borate/EDTA (TBE) at 80 V, 4°C for 2 hours. After electrophoresis, gels were stained with SYBR Gold fluorescent nucleic acid dye (Invitrogen, Burlington, ON), and imaged on a Fluorchem Q imager using Cy2 excitation LEDs and emission filters (ProteinSimple, San Jose, California). Bands were quantified from three independent experiments using AlphaView-FlorChem Q software provided by the manufacturer.
Microscale thermophoresis (MST)
Binding reactions were prepared in 50mM Tris-HCl buffer (pH 7.8) with 150 mM NaCl, 10 mM MgCl2, 0.5% Glycerol and 0.05% Tween to a total volume of 20 μL. RHAU53-105 was diluted 16 times by 2:1 serial dilution to achieve concentrations ranging from 250–0.6nM, and mixed with fluorescent 3’-FAM labeled Q2RNA (purchased from Integrated DNA Technologies, Coralville, Iowa) was held constant at 25 nM. Premium coated capillaries (NanoTemper Technologies, San Francisco, CA) were used for all measurements. Measurements were performed at an LED power 90% and MST-IR power 40% on the Monolith NT.115 instrument under room temperature conditions (21.5°C). For each run the infrared laser was applied for 35 seconds and the reverse T-Jump data signals of the MST-traces were fitted using the law of mass action for 1:1 binding to obtain KD values.
Dynamic Light Scattering (DLS)
DLS data were collected on a Nano-S Dynamic Light Scattering system (Malvern Instruments Ltd., Malvern, UK) as previously reported [44]. Samples were filtered through a 0.1-μm filter (Millipore) and equilibrated for 5 minutes at 20°C before measurements. 15 measurements were made per sample, and for each condition three independent samples were tested.
Thermal difference spectra
UV/VIS spectra were obtained on a dual beam Evolution 260 Bio UV-Visible spectrophotometer (Thermo Scientific). Q2RNA and Q2DNA (2 μM) in 10 mM Tris (pH 7.5), 100 mM KCl, and 1 mM EDTA were measured in triplicate and background corrected against spectra of buffer alone. Thermal difference spectra (TDS) were generated by subtracting buffer-corrected spectra at 20°C from those at 90°C. For direct comparison between Q2RNA and Q2DNA, differences were normalized to the maximum observed absorbance value, as previously suggested by Mergny et. al. [45].
Circular dichroism spectropolarimetry (CD)
All spectra were recorded on a calibrated Alfa Aesar J-810 spectropolarimeter (Jasco Inc., USA) from 200–340 nm in a 1.0 mm cell and a 32 s integration time. Sample concentrations were kept at 20 μM in 10 mM Tris (pH 7.5), 100 mM KCl, 1 mM EDTA. Measurements were performed in triplicate and baseline-corrected by subtraction of the buffer alone. Circular dichroism thermal melting curves were generated in the same buffer, following the ellipticity at 262 nm with spectra normalized by the number of nucleotides (glycosidic bonds) per unit volume.
Small angle X-ray scattering (SAXS)
SAXS data were collected using a Rigaku 3-pinhole camera (S-MAX3000) equipped with a Rigaku MicroMax + 002 microfocus sealed tube (Cu-Kα radiation at 1.54 Å) and Confocal Max-Flux (CMF) optics operating at 40 W as previously reported [46]. Scattering data were collected at the following sample concentrations; Q2RNA and Q2DNA (0.8, 1.1, and 1.7 mg/ml), and Q2RNA/RHAU53-105 complex (1.3 and 1.5 mg/mL) in 10 mM Tris (pH 7.5), 100 mM KCl, 1 mM EDTA. The raw intensity data were integrated with the SAXSGUI software package (JJ X-Ray Systems A/S, LyngBy, Denmark). Buffer subtraction and merging of data of multiple concentrations were performed using the program PRIMUS [47]. The pair distance distribution function plot, root mean square radius of gyration (rG) and the maximum particle dimension (Dmax) were obtained using the program GNOM [48]. Ab initio shape modeling was performed using the program DAMMIF based on a simulated annealing protocol [49, 50]. Twenty models for each entity were then generated, rotated, aligned and averaged using the program DAMAVER [51]. HYDROPRO [52] was used to calculate solution hydrodynamic properties of the averaged-filtered models using a similar approach as outlined previously [46]. Sample quality was confirmed for each sample before and after data collection by gel electrophoresis and DLS.
Nuclear magnetic resonance (NMR) spectroscopy
15N RHAU53-105 in complex with Q2RNA (85 μM) in 10 mM Tris, (pH 7.5), 100 mM KCl, 1 mM EDTA, and 10% deuterium (v/v) was prepared in an identical manner as previously described [31]. All spectra were acquired on a Varian Unity INOVA 600 MHz spectrometer. Data processing and spectrum generation were performed using iNMR (http://www.inmr.net).
Results
Q2RNA and Q2DNA each adopt a single, monomeric conformation
Synthetic Q2RNA was heat denatured, cooled, and purified by size exclusion chromatography (see Materials and Methods). Q2RNA elutes as a compact dominant peak with a shoulder corresponding to larger hydrodynamic volumes (Fig 1) from the HiLoad Superdex 75 26/60 column. The nucleic acid sequences used in this study are presented in (Fig 2A). Native gel electrophoresis confirmed that the dominant peak contains a single RNA conformation (Fig 2B). To understand the potential differences between RNA and DNA G4 recognition, we also investigated the DNA equivalent to Q2RNA (Q2DNA). Using an identical procedure, Q2DNA eluted in a single symmetric peak that contains a single conformation as determined by native gel electrophoresis (Figs 1 and 2B).
Elution profiles obtained on a HiLoad Superdex 75 26/60 column, with each species labeled.
(A) Schematic showing the G4 forming regions of PITX1 mRNA; the RNA and DNA equivalent sequences with the guanylate tracts underlined are also shown. 250 pmol of Q2RNA and Q2DNA were separated by native-Tris-borate EDTA (TBE) polyacrylamide gel electrophoresis, stained with (B) toluidine blue, and (C) G4-specific dye N-methyl mesoporphyrin IX alongside their positive (G4: hTR1-17) and negative (double stranded RNA) controls.
Q2RNA, but not Q2DNA, stains with a dye specific for parallel G4
To determine whether the purified nucleic acids adopt a parallel G4 conformation, we employed native gel electrophoresis in combination with N-methyl mesoporphyrin IX (NMM), a dye specific to parallel G4 conformations. The crystal structure of NMM bound to a parallel DNA G4 demonstrates the selectivity for parallel G4s [53]. To confirm the validity of the approach, non-G4 double-stranded RNA (dsRNA) and a known parallel RNA G4 (hTR1-17) were included as negative and positive controls, respectively. While both stain efficiently with toluidine blue (nucleic acid stain, pH 2.8), the positive, but not negative control stains with NMM (Fig 2B and 2C). Q2RNA stains efficiently with NMM, indicating that it adopts a parallel G4 conformation. Interestingly, the Q2DNA does not stain with the dye, suggesting that either it adopts a non-G4 or non-parallel G4 conformation.
Q2RNA adopts a parallel, while Q2DNA adopts an alternate, G4 conformation
To confirm G4 formation by Q2RNA, thermal difference spectra (TDS) were generated by subtracting the UV absorption spectrum of the folded state (recorded at 20°C) from the spectrum of the partially denatured state (measured at 90°C). Specific nucleic acid conformations result in specific TDS, generally reflecting the conformational change of the molecule in solution due to a disruption in base-stacking interactions. The TDS obtained for Q2RNA demonstrated features characteristic of G4s [45] with a minimum at 297 nm and two maxima at 240 and 276 nm (Fig 3). Interestingly, TDS analysis of Q2DNA showed similar overall features suggestive that it also adopts a G4 structure.
TDS of Q2RNA (circles) and Q2DNA (squares) in 10 mM Tris, pH 7.5, 100 mM KCl, 1 mM EDTA. For details of analysis, see Materials and Methods.
Next, we performed circular dichroism (CD) spectroscopy on the partially denatured and native states at 80°C and 20°C, respectively, in the same buffer as used for TDS analysis. The far-UV CD spectrum of Q2RNA at 20°C presented features consistent with previously characterized parallel G4s, with an ellipticity minimum at 242 nm and maximum at 264 nm [54] (Fig 4A). At 80°C similar overall spectral features were observed, however the intensity was modestly muted (approximately 45% at 264 nm), presumably due to partial unstacking of the G4 structure. Q2DNA has similar overall features to Q2RNA at 20°C with the prominent exception of an additional maxima at 290 nm (Fig 4A). The Q2DNA spectrum is consistent with the features of a group II G4 spectrum that has three parallel and one antiparallel strands [55–57]. At 80°C, the Q2DNA is almost completely denatured. To determine the relative stabilities of the RNA and DNA G4s, CD spectra were collected during the process of thermal melting (Fig 4B). Q2RNA was significantly more resistant to denaturation than its DNA counterpart, but the melting profile is similar to that of previously characterized RNA G4s [31]. We conclude that Q2RNA adopts, as expected, a parallel G4 conformation, whereas Q2DNA assumes a hybrid-type G4 structure with parallel and antiparallel strands.
(A) Far-UV CD spectra of Q2RNA and Q2DNA obtained in 10 mM Tris, pH 7.5, 100 mM KCl, 1 mM EDTA buffer of Q2RNA at 20°C (closed circles) and at 80°C (open circles) as well as of Q2DNA at 20°C (closed triangles) and at 80°C (closed squares). (B) CD melting curves of Q2RNA (——) and Q2DNA (------) monitored by spectropolarimetry at 264 nm in the same buffer.
RHAU interacts with Q2RNA but not its DNA counterpart
Previously, an N-terminal truncation of RHAU (RHAU53-105) containing the RSM has been identified to play a significant role in the recognition of G4s [31, 33]. To confirm the original observation, we performed electrophoretic mobility shift assays (EMSA) between Q2RNA and either RHAU53-105 or full-length RHAU (Fig 5A). Both RHAU53-105 and full-length RHAU shift Q2RNA towards a higher molecular weight species in a concentration dependent manner. We observed a higher affinity with full-length RHAU than with the truncated version (as expected). Interestingly, the DNA counterpart, Q2DNA, did not show any appreciable affinity for RHAU53-105. (Fig 5A). Microscale thermophoresis measurements were used to determine a dissociation constant of 1.7±0.3 nM for the RHAU53-105 complex with fluorescently labeled 3’-FAM-Q2RNA (Fig 5B).
(A) Electrophoretic mobility shift assays (EMSA) were performed using a constant 150 nM concentration of Q2RNA or Q2DNA and a variable concentration from 0–700 nM of RHAU53-105 or full-length RHAU. The 12% native Tris borate-EDTA (TBE) polyacrylamide gels were stained with SYBR Gold for visualization. (B) Microscale thermophoresis measurements performed using 3’-FAM Q2RNA (25 nM) in complex with RHAU53-105 at several concentrations (0.6–250 nM). Reverse T-Jump signals from the traces were fit as described in the Materials & Methods.
To further characterize nucleic acid-protein complexes, we prepared pure RHAU53-105 as well as its complex with Q2RNA or Q2DNA, and subjected them to size exclusion chromatography (Fig 1). RHAU53-105 and its complex with Q2RNA eluted as single peaks, with an expected increase in hydrodynamic size accompanying complex formation. Not surprisingly, no complex formation was observed between Q2DNA and RHAU53-105 (data not shown).
The association of N-terminal RHAU with Q2RNA does not disrupt G4 structure
To determine whether RHAU53-105 binding disrupts G4 structure we performed a CD experiment on the purified Q2RNA/RHAU53-105 complex (Fig 6). No significant differences were observed between CD spectra from Q2RNA/RHAU53-105 and Q2RNA in the region unique to nucleic acids (~250–320 nm), suggesting that the G4 remains intact upon protein binding.
Far-UV CD spectra of Q2RNA (——); and Q2RNA/RHAU53-105 complex (··—··) at 20 μM and 20°C with all the spectra normalized to the number of nucleotides. The G4 features of the RNA in the context of the complex are observed in the region unique to nucleic acids (~250–320 nm).
Solution structures of G4s and their complexes with RHAU53-105
To further understand the recognition of G4s by RHAU53-105, we used SAXS to study Q2RNA, Q2DNA, and the Q2RNA/RHAU53-105 complex purified by size exclusion chromatography. DLS was employed as an initial quality control step to ensure sample monodispersity over the range of concentrations used for SAXS acquisition (Fig 7A). Decreasing hydrodynamic radii (rH) were observed for the molecules in the following order: Q2RNA/RHAU53-105 complex (3.65 nm), Q2RNA (2.01 nm) and Q2DNA (1.65 nm) (Table 1). Samples did not display any significant self-association in the concentration range subsequently used for SAXS analysis, suggesting suitability for further structural studies (Fig 7B).
(A) DLS profiles of Q2RNA, Q2DNA and Q2RNA/RHAU53-105 complex. (B) Concentration dependence of hydrodynamic radii measured by DLS of the molecules in (A). (C) Merged SAXS data of Q2RNA, Q2DNA and Q2RNA/RHAU53-105 complex. (D) The corresponding pair distance distribution functions. (Color code: red, Q2RNA; green, Q2DNA; golden, Q2RNA/RHAU53-105 complex).
Standard deviations are indicated in parentheses, and dashes indicate that the values could not be determined.
SAXS data for Q2RNA, Q2DNA AND Q2RNA/RHAU53-105 complex collected at multiple concentrations were merged to obtain a single scattering profile (Fig 7C). The pair distance distribution function, P(r), which represents a histogram of all observed distances between electron pairs in the molecule was obtained from merged data using program GNOM (Fig 7D). Both Q2RNA and Q2DNA demonstrate a P(r) plot consistent with a globular structure, whereas the Q2RNA/RHAU53-105 complex likely adopts an extended conformation based on the elongated tail at longer distances. From this analysis, the radius of gyration (rG) and maximum particle dimension (Dmax) were determined (Table 1), and used as constraints to generate 20 individual low-resolution models. Individual models, with the chi (χ) values shown in Table 1, were rotated and superimposed to obtain an averaged solution conformation (Fig 8). Excellent superimposition of individually calculated models were confirmed by the normalized spatial discrepancy (NSD) parameter (≤ 0.63) for each molecule ensemble. Both RNA and DNA G4s adopt disc-shaped structures with concave bevels at the top and bottom, while the Q2RNA/RHAU53-105 complex adopts an extended shape. Superposition of the previously determined RHAU53-105 solution structure by SAXS onto the Q2RNA/RHAU53-105 complex suggests that G4 recognition is occurring via one of the termini of the protein.
Color code: Red, Q2RNA; green, Q2DNA; golden, Q2RNA/RHAU53-105 complex. Dmax values are shown beneath each species. The SAXS shape model of RHAU53-105 (blue) was superimposed on the complex.
Amino acids in and adjacent to the RSM mediate recognition of RNA G4
To determine the region of RHAU53-105 involved in mediating the interaction with G4, we expressed and purified isotopically-enriched 15N-RHAU53-105. After equimolar mixture with Q2RNA, we successfully purified Q2RNA/15N-RHAU53-105 complex, and acquired its 15N- HSQC spectrum (Fig 9). To determine the region(s) perturbed by G4 binding, we compared the determined HSQC spectrum to those previously determined for free 15N-RHAU53-105 and 15N-RHAU53-105 in complex with another minimal RNA G4 (hTR1-20) [31]. We observed nearly identical chemical shift perturbations as previously reported, with significant chemical shift perturbations clustering to the RSM and adjacent α-helix (K58, R60, E61, I62, G63, M64, W65, Y66, A67, K68, K69, N74, K75, A77, and E78). Therefore, we conclude that, as previously observed, residues in the RSM and the adjacent helix are responsible for mediating the G4 interaction.
15N-HSQC spectral overlay of RHAU53-105 free (black) and in complex with Q2RNA (orange) or hTR1-20 (blue) with a subset of assigned resonances labeled. Data for RHAU53-105 and RHAU53-105/hTR1-20 were previously obtained, and used for comparison (30).
Discussion
G4s were predicted to play key roles in a number of biological activities including the regulation of gene transcription and translation [10], and evidence for that has accumulated in recent years both in vitro and in vivo [15, 58, 59]. Various proteins interact specifically with G4s, suggesting they fulfill important functions in cellular processes [60]. RHAU has been observed in a number of contexts to interact with RNA G4s, but the mechanism of how it recognizes and unwinds these structures is not well characterized. We have chosen as our model system a specific G4 (Q2RNA) found in the 3’-UTR of the PITX1 mRNA, primarily because its interaction with RHAU in a cellular context is established [26]. Based on several methods, including staining with an orientation-specific dye, TDS, and spectropolarimetry, Q2RNA is a parallel G4, and adopts a compact, disc-shaped conformation in solution that is consistent with another previously determined RNA G4 structure by SAXS [31]. Interestingly, the DNA equivalent (Q2DNA) presented markedly different features, namely in terms of its CD profile and its inability to stain with a parallel G4 dye, despite adopting a similar shape in solution as determined by SAXS and sharing similar hydrodynamic features as Q2RNA. Although a high-resolution structure would unambiguously highlight the differences, our data is consistent with Q2DNA adopting the hybrid-type G4 orientation observed in group II G4s, which has three parallel and one antiparallel strands [55–57]. We anticipate that the ability to accommodate 2′-OH groups in RNA G4s (affecting hydrogen bonding and sugar puckering) is central to the observed conformational differences between Q2RNA and Q2DNA [61].
Previous low-resolution structural and biophysical studies suggest that the N-terminal domain of RHAU interacts with the G-quartet face on the top or bottom plane of the G4 for both RNA and DNA [31]. Recently, a high-resolution structure of a short N-terminal peptide in complex with a DNA G4 has reinforced this mode of recognition, but also suggested that certain basic amino acid residues mediate specificity through interaction with the phosphodiester backbone [30]. Recognition of the Q2RNA G4 by the N-terminal region of RHAU (containing the RSM) uses nearly identical amino acid residues to those previously observed as important with another RNA G4 [31]. The elongated solution structure of Q2RNA/RHAU53-105 by SAXS is consistent with the same protein truncation in complex with another RNA G4 [31], and the superimposition of individual Q2RNA and RHAU53-105 models onto the complex model are consistent with recognition of the G-quartet face as the primary site of recognition.
Mechanistic studies have also suggested the importance of a parallel orientation for the recognition by RHAU [24, 30]. The parallel G4 specific dye used in this study (NMM) interacts by stacking on the G-tetrad faces [53] and we have observed a significant reduction in the staining intensity of the dye where the G4 is bound to RHAU53-105 as opposed to free G4 (data not shown). This suggests that the protein occupies the tetrad face. A previous study investigating a G4 from the human telomerase RNA (hTR) and its DNA counterpart has revealed that both adopt a parallel orientation and that both interact with RHAU53-105 by means of the RSM. However, the DNA G4 made additional interactions with RHAU that were not observed in the RNA G4 [31]. DNA G4s generally demonstrate lower affinity for RHAU than their RNA counterparts [28, 30, 31], and whether the 2’-OH, a parallel arrangement, or both are important remains to be determined for RNA binding. Given these observations, it was not surprising that different strand orientations adopted by Q2RNA and Q2DNA significantly impact their affinity for RHAU. In the absence of a high-resolution RHAU-RNA G4 structure, our results strongly support the previously observed mode of recognition where strand directionality is key to presenting a parallel G4 face for RHAU binding. High-resolution structural studies of RNA G4s in complex with RHAU will likely confirm the hypothesis that both electrostatic and steric impacts of the 2’-OH also fulfill an important role.
While the importance of the N-terminal domain of RHAU has clearly been established, the mechanism whereby full-length protein binds and unwinds G4 structures remains to be elucidated. Binding of truncated RHAU53-105 to RNA or DNA G4 does not attain the full binding affinity observed in full-length RHAU nor does it induce unwinding [27, 28, 31]. These features are clearly confirmed again in this study as full-length RHAU has higher affinity than the N-terminal fragment for Q2RNA, and comparison of the CD spectra of Q2RNA free and in complex with RHAU53-105 indicates no G4 unwinding. Therefore, future studies geared towards an understanding of G4 helicase activity in the context of the full-length protein, remain a priority. The work presented here, while focused specifically on the in vitro study of a purified RNA-protein complex, provide the template for an eventual mechanistic understanding of G4 impact on translational regulation of mRNAs, including PITX1.
Acknowledgments
The authors would like to thank the Manitoba Institute for Materials for providing access to SAXS instrumentation and Dr. Joe O’Neil (University of Manitoba) for access to the spectropolarimeter.
Author Contributions
Conceived and designed the experiments: JS SAM. Performed the experiments: EOA EPB. Analyzed the data: EOA TRP ED MM. Contributed reagents/materials/analysis tools: EPB EKM. Wrote the paper: EOA TRP MM SAM. Provided technical support for data collection of SAXS data: KM.
References
- 1. Hoogsteen K. Crystal and Molecular Structure of a Hydrogen-Bonded Complex between 1-Methylthymine and 9-Methyladenine. Acta Crystallogr. 1963;16(9):907. WOS:A19631326A00007.
- 2. Burge S, Parkinson GN, Hazel P, Todd AK, Neidle S. Quadruplex DNA: sequence, topology and structure. Nucleic Acids Res. 2006;34(19):5402–15. pmid:17012276; PubMed Central PMCID: PMC1636468.
- 3. Bochman ML, Paeschke K, Zakian VA. DNA secondary structures: stability and function of G-quadruplex structures. Nat Rev Genet. 2012;13(11):770–80. pmid:23032257; PubMed Central PMCID: PMC3725559.
- 4. Sacca B, Lacroix L, Mergny JL. The effect of chemical modifications on the thermal stability of different G-quadruplex-forming oligonucleotides. Nucleic Acids Res. 2005;33(4):1182–92. pmid:15731338; PubMed Central PMCID: PMC549566.
- 5. Zhang AY, Bugaut A, Balasubramanian S. A sequence-independent analysis of the loop length dependence of intramolecular RNA G-quadruplex stability and topology. Biochemistry. 2011;50(33):7251–8. pmid:21744844; PubMed Central PMCID: PMC3522851.
- 6. Collie GW, Haider SM, Neidle S, Parkinson GN. A crystallographic and modelling study of a human telomeric RNA (TERRA) quadruplex. Nucleic Acids Res. 2010;38(16):5569–80. pmid:20413582; PubMed Central PMCID: PMC2938214.
- 7. Frees S, Menendez C, Crum M, Bagga PS. QGRS-Conserve: a computational method for discovering evolutionarily conserved G-quadruplex motifs. Hum Genomics. 2014;8:8. pmid:24885782; PubMed Central PMCID: PMC4017754.
- 8. Verma A, Halder K, Halder R, Yadav VK, Rawal P, Thakur RK, et al. Genome-wide computational and expression analyses reveal G-quadruplex DNA motifs as conserved cis-regulatory elements in human and related species. J Med Chem. 2008;51(18):5641–9. pmid:18767830.
- 9. Huppert JL, Balasubramanian S. Prevalence of quadruplexes in the human genome. Nucleic Acids Res. 2005;33(9):2908–16. pmid:15914667; PubMed Central PMCID: PMC1140081.
- 10. Huppert JL, Balasubramanian S. G-quadruplexes in promoters throughout the human genome. Nucleic Acids Res. 2007;35(2):406–13. pmid:17169996; PubMed Central PMCID: PMC1802602.
- 11. Biffi G, Tannahill D, McCafferty J, Balasubramanian S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat Chem. 2013;5(3):182–6. pmid:23422559; PubMed Central PMCID: PMC3622242.
- 12. Brooks TA, Kendrick S, Hurley L. Making sense of G-quadruplex and i-motif functions in oncogene promoters. FEBS J. 2010;277(17):3459–69. pmid:20670278; PubMed Central PMCID: PMC2971675.
- 13. Todd AK, Johnston M, Neidle S. Highly prevalent putative quadruplex sequence motifs in human DNA. Nucleic Acids Res. 2005;33(9):2901–7. pmid:15914666; PubMed Central PMCID: PMC1140077.
- 14. Hershman SG, Chen Q, Lee JY, Kozak ML, Yue P, Wang LS, et al. Genomic distribution and functional analyses of potential G-quadruplex-forming sequences in Saccharomyces cerevisiae. Nucleic Acids Res. 2008;36(1):144–56. pmid:17999996; PubMed Central PMCID: PMC2248735.
- 15. Kumari S, Bugaut A, Huppert JL, Balasubramanian S. An RNA G-quadruplex in the 5' UTR of the NRAS proto-oncogene modulates translation. Nat Chem Biol. 2007;3(4):218–21. pmid:17322877; PubMed Central PMCID: PMC2206252.
- 16. Huppert JL, Bugaut A, Kumari S, Balasubramanian S. G-quadruplexes: the beginning and end of UTRs. Nucleic Acids Res. 2008;36(19):6260–8. pmid:18832370; PubMed Central PMCID: PMC2577360.
- 17. Eddy J, Maizels N. Conserved elements with potential to form polymorphic G-quadruplex structures in the first intron of human genes. Nucleic Acids Res. 2008;36(4):1321–33. pmid:18187510; PubMed Central PMCID: PMC2275096.
- 18. Crabbe L, Verdun RE, Haggblom CI, Karlseder J. Defective telomere lagging strand synthesis in cells lacking WRN helicase activity. Science. 2004;306(5703):1951–3. pmid:15591207.
- 19. Sarkies P, Reams C, Simpson LJ, Sale JE. Epigenetic instability due to defective replication of structured DNA. Mol Cell. 2010;40(5):703–13. pmid:21145480; PubMed Central PMCID: PMC3145961.
- 20. Sun H, Karow JK, Hickson ID, Maizels N. The Bloom's syndrome helicase unwinds G4 DNA. J Biol Chem. 1998;273(42):27587–92. pmid:9765292.
- 21. Wu Y, Shin-ya K, Brosh RM Jr. FANCJ helicase defective in Fanconia anemia and breast cancer unwinds G-quadruplex DNA to defend genomic stability. Mol Cell Biol. 2008;28(12):4116–28. pmid:18426915; PubMed Central PMCID: PMC2423121.
- 22. Brazda V, Haronikova L, Liao JC, Fojta M. DNA and RNA quadruplex-binding proteins. Int J Mol Sci. 2014;15(10):17493–517. pmid:25268620; PubMed Central PMCID: PMC4227175.
- 23. Gao X, Ma W, Nie J, Zhang C, Zhang J, Yao G, et al. A G-quadruplex DNA structure resolvase, RHAU, is essential for spermatogonia differentiation. Cell Death Dis. 2015;6:e1610. pmid:25611385.
- 24. Chen MC, Murat P, Abecassis K, Ferre-D'Amare AR, Balasubramanian S. Insights into the mechanism of a G-quadruplex-unwinding DEAH-box helicase. Nucleic Acids Res. 2015;43(4):2223–31. pmid:25653156; PubMed Central PMCID: PMC4344499.
- 25. Rogers GW Jr., Richter NJ, Merrick WC. Biochemical and kinetic characterization of the RNA helicase activity of eukaryotic initiation factor 4A. J Biol Chem. 1999;274(18):12236–44. pmid:10212190.
- 26. Booy EP, Howard R, Marushchak O, Ariyo EO, Meier M, Novakowski SK, et al. The RNA helicase RHAU (DHX36) suppresses expression of the transcription factor PITX1. Nucleic Acids Res. 2014;42(5):3346–61. pmid:24369427; PubMed Central PMCID: PMC3950718.
- 27. Booy EP, Meier M, Okun N, Novakowski SK, Xiong S, Stetefeld J, et al. The RNA helicase RHAU (DHX36) unwinds a G4-quadruplex in human telomerase RNA and promotes the formation of the P1 helix template boundary. Nucleic Acids Res. 2012;40(9):4110–24. pmid:22238380; PubMed Central PMCID: PMC3351167.
- 28. Creacy SD, Routh ED, Iwamoto F, Nagamine Y, Akman SA, Vaughn JP. G4 resolvase 1 binds both DNA and RNA tetramolecular quadruplex with high affinity and is the major source of tetramolecular quadruplex G4-DNA and G4-RNA resolving activity in HeLa cell lysates. J Biol Chem. 2008;283(50):34626–34. pmid:18842585; PubMed Central PMCID: PMC2596407.
- 29. Giri B, Smaldino PJ, Thys RG, Creacy SD, Routh ED, Hantgan RR, et al. G4 resolvase 1 tightly binds and unwinds unimolecular G4-DNA. Nucleic Acids Res. 2011;39(16):7161–78. pmid:21586581; PubMed Central PMCID: PMC3167620.
- 30. Heddi B, Cheong VV, Martadinata H, Phan AT. Insights into G-quadruplex specific recognition by the DEAH-box helicase RHAU: Solution structure of a peptide-quadruplex complex. Proc Natl Acad Sci U S A. 2015;112(31):9608–13. pmid:26195789.
- 31. Meier M, Patel TR, Booy EP, Marushchak O, Okun N, Deo S, et al. Binding of G-quadruplexes to the N-terminal recognition domain of the RNA helicase associated with AU-rich element (RHAU). J Biol Chem. 2013;288(49):35014–27. pmid:24151078; PubMed Central PMCID: PMC3853254.
- 32. Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, et al. CDD: NCBI's conserved domain database. Nucleic Acids Res. 2015;43(Database issue):D222–6. pmid:25414356; PubMed Central PMCID: PMC4383992.
- 33. Lattmann S, Giri B, Vaughn JP, Akman SA, Nagamine Y. Role of the amino terminal RHAU-specific motif in the recognition and resolution of guanine quadruplex-RNA by the DEAH-box RNA helicase RHAU. Nucleic Acids Res. 2010;38(18):6219–33. pmid:20472641; PubMed Central PMCID: PMC2952847.
- 34. Lamonerie T, Tremblay JJ, Lanctot C, Therrien M, Gauthier Y, Drouin J. Ptx1, a bicoid-related homeo box transcription factor involved in transcription of the pro-opiomelanocortin gene. Genes Dev. 1996;10(10):1284–95. pmid:8675014.
- 35. Lanctot C, Lamolet B, Drouin J. The bicoid-related homeoprotein Ptx1 defines the most anterior domain of the embryo and differentiates posterior from anterior lateral mesoderm. Development. 1997;124(14):2807–17. pmid:9226452.
- 36. Shang J, Li X, Ring HZ, Clayton DA, Francke U. Backfoot, a novel homeobox gene, maps to human chromosome 5 (BFT) and mouse chromosome 13 (Bft). Genomics. 1997;40(1):108–13. pmid:9070926.
- 37. Szeto DP, Ryan AK, O'Connell SM, Rosenfeld MG. P-OTX: a PIT-1-interacting homeodomain factor expressed during anterior pituitary gland development. Proc Natl Acad Sci U S A. 1996;93(15):7706–10. pmid:8755540; PubMed Central PMCID: PMC38811.
- 38. Klopocki E, Kahler C, Foulds N, Shah H, Joseph B, Vogel H, et al. Deletions in PITX1 cause a spectrum of lower-limb malformations including mirror-image polydactyly. Eur J Hum Genet. 2012;20(6):705–8. pmid:22258522; PubMed Central PMCID: PMC3355260.
- 39. Chen Y, Knosel T, Ye F, Pacyna-Gengelbach M, Deutschmann N, Petersen I. Decreased PITX1 homeobox gene expression in human lung cancer. Lung Cancer. 2007;55(3):287–94. pmid:17157953.
- 40. Knosel T, Chen Y, Hotovy S, Settmacher U, Altendorf-Hofmann A, Petersen I. Loss of desmocollin 1–3 and homeobox genes PITX1 and CDX2 are associated with tumor progression and survival in colorectal carcinoma. Int J Colorectal Dis. 2012;27(11):1391–9. pmid:22438068.
- 41. Lord RV, Brabender J, Wickramasinghe K, DeMeester SR, Holscher A, Schneider PM, et al. Increased CDX2 and decreased PITX1 homeobox gene expression in Barrett's esophagus and Barrett's-associated adenocarcinoma. Surgery. 2005;138(5):924–31. pmid:16291394.
- 42. Booy EP, Meng H, McKenna SA. Native RNA purification by gel filtration chromatography. Methods Mol Biol. 2012;941:69–81. pmid:23065554.
- 43. Booy EP, McRae EK, McKenna SA. Biochemical characterization of G4 quadruplex telomerase RNA unwinding by the RNA helicase RHAU. Methods Mol Biol. 2015;1259:125–35. pmid:25579584.
- 44. Patel TR, Morris GA, Zwolanek D, Keene DR, Li J, Harding SE, et al. Nano-structure of the laminin gamma-1 short arm reveals an extended and curved multidomain assembly. Matrix Biol. 2010;29(7):565–72. pmid:20688161.
- 45. Mergny JL, Li J, Lacroix L, Amrane S, Chaires JB. Thermal difference spectra: a specific signature for nucleic acid structures. Nucleic Acids Res. 2005;33(16):e138. pmid:16157860; PubMed Central PMCID: PMC1201377.
- 46. Dzananovic E, Patel TR, Deo S, McEleney K, Stetefeld J, McKenna SA. Recognition of viral RNA stem-loops by the tandem double-stranded RNA binding domains of PKR. RNA. 2013;19(3):333–44. pmid:23329698; PubMed Central PMCID: PMC3677244.
- 47. Konarev PV, Volkov VV, Sokolova AV, Koch MHJ, Svergun DI. PRIMUS: a Windows PC-based system for small-angle scattering data analysis. J Appl Crystallogr. 2003;36:1277–82. WOS:000185178600026.
- 48. Svergun DI. Determination of the regularization parameter in indirect-transform methods using perceptual criteria. J Appl Crystallogr. 1992;25:495–503.
- 49. Svergun DI. Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing. Biophys J. 1999;76(6):2879–86. pmid:10354416; PubMed Central PMCID: PMC1300260.
- 50. Franke D, Svergun DI. DAMMIF, a program for rapid ab-initio shape determination in small-angle scattering. J Appl Crystallogr. 2009;42:342–6. WOS:000264292800030.
- 51. Volkov VV, Svergun DI. Uniqueness of ab initio shape determination in small-angle scattering. J Appl Crystallogr. 2003;36:860–4.
- 52. de la Torre JG, Huertas ML, Carrasco B. Calculation of hydrodynamic properties of globular proteins from their atomic-level structure. Biophysical Journal. 2000;78(2):719–30. WOS:000085249300016. pmid:10653785
- 53. Nicoludis JM, Miller ST, Jeffrey PD, Barrett SP, Rablen PR, Lawton TJ, et al. Optimized end-stacking provides specificity of N-methyl mesoporphyrin IX for human telomeric G-quadruplex DNA. J Am Chem Soc. 2012;134(50):20446–56. pmid:23181361.
- 54. Kypr J, Kejnovska I, Renciuk D, Vorlickova M. Circular dichroism and conformational polymorphism of DNA. Nucleic Acids Res. 2009;37(6):1713–25. pmid:19190094; PubMed Central PMCID: PMC2665218.
- 55. Ambrus A, Chen D, Dai J, Bialis T, Jones RA, Yang D. Human telomeric sequence forms a hybrid-type intramolecular G-quadruplex structure with mixed parallel/antiparallel strands in potassium solution. Nucleic Acids Res. 2006;34(9):2723–35. pmid:16714449; PubMed Central PMCID: PMC1464114.
- 56. Lam EY, Beraldi D, Tannahill D, Balasubramanian S. G-quadruplex structures are stable and detectable in human genomic DNA. Nat Commun. 2013;4:1796. pmid:23653208; PubMed Central PMCID: PMC3736099.
- 57. Randazzo A, Spada GP, da Silva MW. Circular dichroism of quadruplex structures. Top Curr Chem. 2013;330:67–86. pmid:22752576.
- 58. Arora A, Dutkiewicz M, Scaria V, Hariharan M, Maiti S, Kurreck J. Inhibition of translation in living eukaryotic cells by an RNA G-quadruplex motif. RNA. 2008;14(7):1290–6. pmid:18515550; PubMed Central PMCID: PMC2441988.
- 59. Wieland M, Hartig JS. RNA quadruplex-based modulation of gene expression. Chem Biol. 2007;14(7):757–63. pmid:17656312.
- 60. Fry M. Tetraplex DNA and its interacting proteins. Front Biosci. 2007;12:4336–51. pmid:17485378.
- 61. Egli M, Portmann S, Usman N. RNA hydration: a detailed look. Biochemistry. 1996;35(26):8489–94. pmid:8679609.