Figures
Abstract
A strong genetic predictor of outcome following untreated HIV-1 infection is the carriage of specific alleles of human leukocyte antigens (HLAs) that present viral epitopes to T cells. Residual variation in outcome measures may be attributed, in part, to viral adaptation to HLA-restricted T cell responses. Variants of the endoplasmic reticulum aminopeptidases (ERAPs) influence the repertoire of T cell epitopes presented by HLA alleles as they trim pathogen-derived peptide precursors to optimal lengths for antigen presentation, along with other functions unrelated to antigen presentation. We investigated whether ERAP variants influence HLA-associated HIV-1 adaptation with demonstrable effects on overall HIV-1 disease outcome. Utilizing host and viral data of 249 West Australian individuals with HIV-1 subtype B infection, we identified a novel association between two linked ERAP2 single nucleotide polymorphisms (SNPs; rs2248374 and rs2549782) with plasma HIV RNA concentration (viral load) (P adjusted = 0.0024 for both SNPs). Greater HLA-associated HIV-1 adaptation in the HIV-1 Gag gene correlated significantly with higher viral load, lower CD4+ T cell count and proportion; P = 0.0103, P = 0.0061, P = 0.0061, respectively). When considered together, there was a significant interaction between the two ERAP2 SNPs and HLA-associated HIV-1 adaptation on viral load (P = 0.0111). In a comprehensive multivariate model, addition of ERAP2 haplotypes and HLA associated adaptation as an interaction term to known HLA and CCR5 determinants and demographic factors, increased the explanatory variance of population viral load from 17.67% to 45.1% in this dataset. These effects were not replicated in publicly available datasets with comparably sized cohorts, suggesting that any true global epistasis may be dependent on specific HLA-ERAP allelic combinations. Our data raises the possibility that ERAP2 variants may shape peptide repertoires presented to HLA class I-restricted T cells to modulate the degree of viral adaptation within individuals, in turn contributing to disease variability at the population level. Analyses of other populations and experimental studies, ideally with locally derived ERAP genotyping and HLA-specific viral adaptations are needed to elucidate this further.
Author summary
HIV infection outcome is variable between individuals and understanding the factors that impact this variation is important for efforts towards a HIV cure or vaccine. Here, we found that the level of HIV in the blood is affected by whether an individual carries a specific form of ERAP2, a molecule that influences processing and presentation of the virus to the immune system, as well as the degree to which HIV has mutated to adapt to immune responses. We also show that the interaction of ERAP2 and other known genetic factors explains greater variation in infection outcome than these factors alone. These findings expand our knowledge of the potential importance of viral processing and presentation in the immune response to HIV.
Citation: Al-kaabi M, Deshpande P, Firth M, Pavlos R, Chopra A, Basiri H, et al. (2024) Epistatic interaction between ERAP2 and HLA modulates HIV-1 adaptation and disease outcome in an Australian population. PLoS Pathog 20(7): e1012359. https://doi.org/10.1371/journal.ppat.1012359
Editor: Daniel C. Douek, Vaccine Research Center, UNITED STATES OF AMERICA
Received: October 17, 2023; Accepted: June 19, 2024; Published: July 9, 2024
Copyright: © 2024 Al-kaabi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: GenBank accession's numbers of HIV sequences and individual host genetic data are provided in 'S1 Data and S2 Data' supplementary information files.
Funding: This work was funded by the National Health and Medical Research Council (APP1148284) grant awarded to SG, MJ, SK and SM; National Institutes of Health-funded Tennessee Center for AIDS Research grant (P30AI110527) awarded to SM and SK; University of Western Australia grant (2022/GR000787) awarded to SG; Government of Western Australia grant (2021/GR000223) awarded to SG; BioZone Collaboration Grant awarded to EA; BioZone Completion Scholarship awarded to EA; Australian Government Research Training Program awarded to EA; Australian Government Research Training Program Fees Offset awarded to MA; The Swiss HIV Cohort Study (201369) awarded to JF. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Human immunodeficiency virus-1 (HIV-1) infection continues to present a global health challenge with no existing curative therapy or effective vaccine. Understanding the impact of host and viral factors on the natural course of HIV-1 infection remains an ongoing area of investigation. Importantly, exploring the influence of genetic factor interactions, such as epistasis, rather than individual single nucleotide polymorphisms (SNPs) on HIV-1 infection outcome will allow a better understanding of disease pathogenesis [1].
A major host immune-related genetic factor that impacts plasma HIV RNA concentration (viral load) are the human leukocyte antigen (HLA) class I loci [2–4], which encode proteins that act as antigen presenting molecules to T cells and as NK cell receptor ligands. The HLA class I alleles B*57 and B*27 have been shown to be strong predictors of lower viral load and slower disease progression [5]. However, about 30% of individuals with HIV infection who maintain robust viral control do not carry these protective HLA-B alleles [6]. Furthermore, the combined variations in the HLA class I region and in CCR5 (32 bp deletion within the receptor for HIV-1 cell binding and entry) only explain ~25% of set-point viral load (spVL) in genome wide association studies (GWAS), where set-point refers to a point of equilibrium following the peak in viral load after acute infection [4]. This highlights the need to identify additional factors associated with HIV-1 infection outcome and/or factors that modulate the effect of HLA alleles.
The endoplasmic reticulum aminopeptidases 1 (ERAP1) and 2 (ERAP2) are polymorphic genes that have been found to interact with HLA class I genes and influence clinical outcome in several cancers, autoimmune and infectious diseases [7–9]. These genes encode enzymes that trim N-terminal peptide precursors into an optimal length to accommodate the highly variable HLA binding groove [10,11]. Thereby, ERAP1 and ERAP2 define the peptide repertoire presented to T cells restricted by specific HLA genotypes [12]. As genetic variations in the ERAP loci mark a broad range of enzymatic activities that vary in terms of catalytic efficiency and substrate affinity [13], greater clarity is needed on how variation in these ERAP genes impact HLA-restricted T cell immune responses. While polymorphisms in the ERAP1 and ERAP2 genes are found to alter peptide repertoire [9,14,15], recent evidence also suggests an influence of ERAP2 on cytokine expression [16] and other factors involved in modulating anti-viral immune responses [17]. ERAP molecules may therefore affect HIV-1 infection outcome by mechanisms that are independent of peptide repertoire effects on antigen presentation.
Viral genetic diversity is another factor that contributes to HIV-1 infection outcome. The observed genetic diversity of HIV-1 includes specific polymorphisms (adaptations) selected by HLA-restricted T cell immune pressure [18], with the level of adaptation in the autologous virus associated with infection outcome [19,20]. The selection of de novo adaptations or reversion of transmitted viral adaptations during HIV-1 infection reflects the cost of the specific polymorphism to the replicative fitness of the virus in the selective immune environment of the host [20,21]. Shifts in the targeted peptide repertoire would be predicted to alter where adaptations occur and in turn, the overall balance between immune escape and replicative capacity cost as reflected by viral load.
In this study, we aimed to investigate viral adaptations and host genetic factors and their association with viral load and other disease markers in a well characterized cohort of people living with HIV-1 infection from Western Australia (WAHIV) [22]. The historical nature of this cohort enabled the examination of host and viral factors on pre-treatment viral load. Access to pre-treatment viral load data is increasingly challenging in cohort studies due to early or immediate initiation of antiretroviral therapy (ART) becoming standard of care [23]. We sought to investigate the potential role of ERAP2 in altering HIV-1 infection outcome and if there was evidence of additive or synergistic interaction between ERAP and HLA.
Results
ERAP2 significantly influences HIV-1 infection outcome
Our previous study of the WAHIV cohort has shown that carriage of the HLA class I alleles B*57:01 and B*27:05 are associated with lower pre-treatment viral load [24]. We extended our earlier study to determine whether variation in ERAP1 and ERAP2 influences infection outcome. Initially, we investigated the non-independent impact of two main ERAP2 SNPs rs2549782 and rs2248374 [25] on HIV-1 infection outcome. Both SNPs encode for notable changes at the functional level of the enzyme: rs2549782 is a missense variant [25] that switches enzyme specificity and activity [26] and rs2248374 is an intron variant that disrupts RNA splicing leading to the loss of ERAP2 expression [25]. The two SNPs are in strong linkage disequilibrium (LD; r2 = 0.936, D`= 0.975; Fig 1A and 1B). Association analysis using different genetic inheritance models showed a strong significant association with viral load for both SNPs, in which the presence of the G allele of the rs2248374 SNP and the T allele of the rs2549782 SNP associate with lower viral load (P adjusted Codominant = 0.0081, P adjusted Dominant = 0.0024 for both SNPs; S1 Table). A trend with CD4+ T cell count was identified for both SNPs (P adjusted Dominant = 0.05), while no association was detected with CD4+ T cell proportion after false discovery rate (FDR) correction (S1 Table).
A) Manhattan plot for ERAP1 and ERAP2 SNPs and association with HIV-1 disease outcome measured by log viral load (RNA copies/ml). Negative log10 FDR adjusted P values are plotted against the physical position of each SNP on chromosome 5. The blue dashed line represents the nominal significant level 0.05. The dominant model of inheritance is applied comparing the homozygous major alleles versus the heterozygous and the homozygote minor alleles together. The SNPs are coloured according to their linkage strength measured by correlation coefficient (r2) with the tag SNP rs2248374. The ANOVA statistical analysis was performed using the SNPassoc R package. Sex, race and CCR5Δ32 deletion were included in the statistical model as covariates. The analysis included 249 treatment naïve HIV-1 infected participants. The plot was generated in RStudio software. B) Linkage disequilibrium plot showing the linkage patterns of the ERAP1 and ERAP2 SNPs measured by D`. The ERAP1 and ERAP2 SNPs are ordered according to their position in the human genome. Confidence bounds colour scheme is used to represents the strength of the LD patterns among ERAP1 and 2 SNPs, where the dark grey colour reflects strong evidence of LD and the white reflects independent genotypes. The values of the D`are displayed in each square and the dark grey squares without numbers indicate D`equals to 1. Plot was constructed using HaploView 4.2 software [74].
HIV-1 infection outcome is not affected by ERAP1 genotype
Previous studies suggest that variations in ERAP1 can alter clinical outcome in certain infections [9]. We sought to investigate whether this effect can be detected in participants with HIV infection. We screened for an independent association signal for 10 ERAP1 SNPs with HIV-1 infection outcome; eight of the SNPs are the most frequent ERAP1 SNPs (with >1% frequency) reported [13,27] and shown to be a risk factor for many diseases [28,29]. Three ERAP1 SNPs (rs26653, rs27895 and rs3734016) showed a significant association with viral load (rs26653: P Codominant = 0.0462, P Recessive = 0.0131; rs27895: P Codominant = 0.0411, P Recessive = 0.0172; rs3734016: P Codominant = 0.0483, P Dominant = 0.0163, P Overdominant = 0.0138). However, these associations did not reach significance following FDR correction (S1 Table; trend for rs3734016, P adjusted = 0.0652). When investigating the association with CD4+ T cell count, rs26653 also had a significant association under the recessive model (P = 0.0392) and rs26618 under the dominant model (P = 0.0442) but did not reach significance following FDR correction. A similar association with CD4+ T cell proportion was observed for rs26653 and rs3734016 (rs26653: P Codominant = 0.0279, P Recessive = 0.0076; rs3734016: P Dominant = 0.0399, P Overdominant = 0.0438) (S1 Table). Both SNPs are in strong LD with other SNPs in the ERAP1 gene region (Fig 1A and 1B).
ERAP2 haplotype B, carrying a truncated version of ERAP2, predicts better HIV-1 infection outcome
ERAP haplotypes were constructed to investigate the additive effect of ERAP1 and ERAP2 variants on disease outcome. ERAP2 haplotypes are defined based on the alteration in gene transcriptional length caused by the SNP rs2248374. Haplotype A is associated with a full-length ERAP2 protein, whereas haplotype B is associated with a shortened protein that is subject to nonsense-mediated decay of the mRNA [25]. The majority of the cohort carry either haplotype A (rs2549782G and rs2248374A, 51.1%) or haplotype B (rs2549782 T and rs2248374 G, 47.4%), with a few participants carrying different combinations (i.e. rs2549782 G and rs2248374 G or rs2549782 T and rs2248374 A, 1.5%; S2 Table). The frequency of the SNP combinations is comparable to the frequencies observed by others [13].
Overall, ERAP2 haplotypes associated significantly with differences in viral load (P = 0.0007, ηp2 = 0.06, ANOVA; Fig 2A) and CD4+ T cell count (P = 0.0088, ηp2 = 0.04; Fig 2B) more so than CD4+ T cell proportion (P = 0.097, ηp2 = 0.02; Fig 2C) (S3 Table). In particular, carriage of haplotype B was associated with a lower viral load (P adjusted = 0.0002, ηp2 = 0.06, ANOVA), higher CD4+ T cell count (P adjusted = 0.0030, ηp2 = 0.04) and higher CD4+ T cell proportion (P adjusted = 0.0363, ηp2 = 0.02) (S3 Table).
A) Participants carrying Haplotype B, the truncated type of ERAP2, have lower viral load than participants with Haplotype A. B) High CD4+ T cell counts is associated with Haplotype B. C) No significant difference in CD4+ T cell proportion in participants with different ERAP2 haplotype combinations. Analysis performed using ANOVA model adjusted for sex, race and CCR5Δ32 deletion.
We utilized the genotype data of eight ERAP1 SNPs to construct haplotypes for the study participants. The frequency of the estimated ERAP1 haplotypes in the cohort reflected frequencies observed in different populations [13] (S4 Table). However, due to the high number of ERAP1 haplotype patterns in this cohort, this study was underpowered to investigate the association of ERAP1 haplotypes with disease outcome.
Viral adaptations to HLA-restricted T cell immune responses in Gag correlate significantly with clinical outcome
Viral adaptations to HLA-restricted CD8+ T cell immune responses have been shown to influence HIV-1 infection outcome [19,20]. Using each individual’s autologous HIV-1 sequence and HLA repertoire, we found that high levels of viral adaptation in Gag to host HLA alleles correlated significantly with high viral load and lower CD4+ T cell count and proportion (P = 0.0103, P = 0.0061, P = 0.0061, respectively; S5 Table and Fig 3). This remained significant when limiting the analysis to subtype B only (P = 0.0250, P = 0.0129, P = 0.0144, respectively), as the major HIV subtype in the cohort (Table 1). In contrast, Pol and Nef adaptation scores did not show a correlation with the clinical markers, even when restricted to subtype B. (S5 Table).
A) Significant correlation of the proportion of viral adaptation in Gag with log viral load RNA copies/ml (P = 0.0103, ρ = 0.3; Spearman correlation test). B) Correlation of CD4+ T cell count with Gag adaptation score (P = 0.0061, ρ = -0.33; Spearman correlation test). C) Correlation of CD4+ T cell proportion with Gag adaptation score (P = 0.0061, ρ = -0.33; Spearman correlation test). The adaptation score ranges from 0 = non-adapted to 1 = fully adapted. Only sequences with ~100% coverage of protein sequence are included in this analysis (n = 70).
Examining the level of autologous viral adaptation scores across the HIV-1 genome did not show significant differences between the genes examined in this study (P = 0.162, Kruskal-Wallis test; S1 Fig).
Interaction effect of ERAP2 with viral adaptation may shape outcome following HIV-1 infection
We further sought to examine the interaction effect of ERAP2 haplotypes and viral adaptations on HIV-1 infection outcome. Although there is limited data showing the impact of ERAP2 on antigen presentation, it has been shown that viral adaptations in the flanking region of CD8+ T cell epitopes can prevent optimal peptide trimming by ERAP1 [30]. Such changes to the repertoire of T cell epitopes presented by HLA molecules may impact the sites of HIV-1 adaptation. To investigate this relationship, we fitted an ANOVA model with an interaction term adjusted for sex, race, CCR5Δ32 and HIV-1 subtype to investigate the possible interaction effect of adaptation score with ERAP2 haplotypes. Seventy participants with full sequence coverage of the known adaptation sites for specific HLA alleles [31] across Gag were used in this analysis. There was a significant interaction effect between variants of ERAP2 and the level of the adaptation score in Gag on viral load (P = 0.0404, Fig 4A); this effect was most evident when applying a dominant model (haplotype AA vs carriers of haplotype B, P = 0.0111; Fig 4B). This interaction effect was lost when additional participants were added to the analysis with less than 90% sequence coverage of the adaptation sites in Gag (P = 0.4895) and may be due to missing data for key adaptation sites, as it has been shown that a single epitope in the Gag gene can reduce viral load by log10 0.21 [32]. There was no indication of a bias in the location of unsequenced regions of Gag (S2 Fig). Note, carriage of the ERAP2 haplotype B was associated with lower Gag adaptation score (P = 0.0409, ANOVA), but this was driven by a small number of data points. There were no statistically significant interactions of ERAP2 haplotypes and adaptation score in Gag on CD4+ T cell count or proportion.
A) Shows adaptation patterns in participants vary in their ERAP2 haplotypes with Gag adaptation score can result in significantly different outcome using the codominant model. B) Distinct adaptation patterns are seen with carriers of haplotype B using dominant inheritance model. Analysis was performed by fitting ANOVA model with an interaction term adjusted for sex, race, CCR5Δ32 deletion, HIV-1 subtype for 70 participants with complete adaptation profile for Gag. Hap A = ERAP2 haplotype A, Hap B = ERAP2 haplotype B.
ERAP2 haplotypes associate with different sites under putative immune selection pressure
As an approach to indirectly assess if different ERAP2 haplotype combinations alter peptide repertoire, we separated the participants into those that have the ERAP2 AA haplotype (n = 68) and those that carry the B haplotype (n = 180) and then examined the Gag sequences in each group to identify HIV-1 polymorphisms associated with specific HLA alleles. Here, all available Gag sequences were used as individual sites were examined. Polymorphisms at 66 positions in the Gag protein were associated with 102 HLA alleles in haplotype AA group and 58 positions were associated with 83 HLA alleles in the group that carry haplotype B (P < 0.05; fisher exact test). The associations for the two groups overlap in only six positions (positions 12, 26, 28, 242, 357). Correction for multiple comparisons was not applied here due to the small number of participants.
To investigate the likely relevance of the HLA associations found, we determined if the associated residue fell within experimentally defined epitopes restricted by the same HLA allele (https://www.hiv.lanl.gov/content/immunology) or if the region is predicted to bind to the specific HLA allele using NetMHCpan 4.1 [33] (S6 Table and S3 Fig). For the ERAP2 haplotype AA group, 17.6% of the HLA-associated viral polymorphisms fell within known T cell epitopes and 29.4% within a predicted T cell epitope with the correct HLA restriction. Similarly, for the participants carrying the ERAP2 haplotype B, 22.9% of the HLA-associated viral polymorphism sites fell within known T cell epitopes and 32.5% within a predicted T cell epitope with the correct HLA restriction. For the five positions that were predicted to fall in regions with strong binding affinity to HLA, two sites were associated with a reduced binding affinity score with change from the consensus amino acid. It should be noted that some of the association sites (33.3% in AA group and 22.9% in AB/BB group) were found to be in the flanking region of epitopes (within 10 amino acid positions of the N- and C-terminal flanking regions), potentially affecting peptide processing [30, 34, 35]. The results suggest some overlap in the presented peptide repertoire by individuals with either ERAP2 haplotype AA or carrying haplotype B, but also distinct peptide presentation by specific HLA and ERAP2 allelic combinations.
Pattern of T cell immune responses in hosts with different ERAP2 haplotypes suggests overlapping immune targets
Another indirect measure we used to assess if carriage of ERAP2 haplotypes affects the peptide repertoire was to examine ex-vivo HIV-1-specific T cell responses in a cohort with known HLA and ERAP genotypes. We re-examined the HIV-1-specific T cell responses in 15 individuals with HIV infection in our previous studies [36–38], in which those with the same HLA genotype were tested with a series of peptides representing known T cell epitopes across the HIV-1 proteome, including Gag. In these previous studies, antigen-specific recognition and/or responses were identified using specific HLA class I allele-peptide tetramers or IFNγ capture via intracellular cytokine staining. Of the 15 participants, two carried the haplotype AA combination with only one peptide eliciting an IFNγ response above background from eight different peptides (12.5%). For the five participants with haplotype BB there were four IFNγ responses to peptides from 13 different tests (30.77%). Interestingly, for participants with the ERAP2 haplotype AB, we observed 14 positive IFNγ responses out of 20 tests (70%) [36] (S7 Table). These results suggest overlap in the T cell immune targets for participants with different ERAP2 haplotype combinations.
ERAP2 variants and viral adaptation in Gag gene explain a substantial proportion of the variation observed in viral load
Previous studies have shown that variations at the CCR5 locus and HLA loci are significantly associated with viral load and explained a proportion of the variance observed in viral load ranging from 13% (increasing to 22% when including demographic factors) [2] to 14.5% (increases to 25% when including additive host genetic effects) [4]. In line with these findings, sex, CCR5Δ32 and carriage of HLA protective alleles (HLA-B*57 and -B*27) showed a significant association with differences in viral load in this cohort, explaining 17.67% of the observed variation (S8 Table). Adding ERAP2 haplotypic variation increased the total explained variance by 5.2%. Furthermore, in the cohort subset (n = 70) with full viral sequence coverage at the adaptation sites, HIV-1 subtype, viral adaptation score in Gag and ERAP2 haplotypes increased the explained proportion of the variance to 39.34%. Adding the interaction effect between ERAP2 haplotypes and adaptation score in Gag into the model increased the explained variance in viral load to 45.1%, in keeping with the interaction between ERAP 2 and HLA being epistatic and mediated by modifying HLA-associated viral adaptation patterns.
Examination of the impact of ERAP2 SNPs alone and the interaction effect with viral adaptation on spVL in other HIV-1 cohorts
We sought to replicate these findings in independent datasets, where host genetic and linked clinical data were collected and available. We obtained data derived from multiple cohorts collected as part of 12 genome-wide association studies of HIV-1 outcome, eight combined under the International Collaboration for the Genomics of HIV (ICGH) as described in [39] (S10 Table). It should be noted that several of these cohorts were of non-European ancestry, where HLA and other genetic loci display significant variation in allele frequencies to the WAHIV cohort. Furthermore, these studies did not include HIV sequencing, so analyses of HIV adaptation were not possible. ERAP2 rs2549782 SNP was genotyped in 14 out of the 21 cohorts only and there was no genotyping for the tag SNP of haplotype B, rs2248374. Using an ANOVA analysis adjusted for sex where applicable, no significant association was found between ERAP2 rs2549782 genotypes and spVL in these cohorts (S10 Table).
In the remaining seven cohorts that did not include ERAP2 rs2549782 genotyping, we used proxy SNPs of rs2248374. In six of those cohorts, we found proxy SNPs that had high LD with rs2248374 (S11 Table). After performing an ANOVA analysis adjusted for sex where applicable, no significant association was found between the proxy SNPs and spVL.
Data from the Swiss HIV Cohort Study (SHCS) had broadly matching ancestry, genome wide SNPs, HIV subtype B sequences and a derived value for spVL in 163 participants [40]. We performed HLA-class I imputation to 4 digit resolution using the “HIBAG” package in RStudio [41]. Utilising host HLA-class I genotypes and autologous Gag HIV-1 sequence for each participant in this cohort, we calculated HLA-associated HIV-1 adaptation score of Gag while also considering viral sequence coverage at the possible adaptation sites. There was no significant association between Gag adaptation score and spVL (P = 0.1477, ρ = -0.16; Spearman correlation test). We next fitted an ANOVA model including sex as a covariate to investigate the interaction effect between ERAP2 rs2549782 genotypes and viral adaptation score of Gag on spVL. There was no significant interaction effect detected in this analysis (P = 0.6583; ANOVA).
Discussion
Many studies have emphasized the dominant role of HLA on HIV-1 disease outcome [42] and the impact of HLA-restricted T cell immune pressure on the evolution of HIV-1 [18]. There are comparatively few studies on ERAP 1 and 2, though these enzymes are fundamental to the HLA genotype-specific immunopeptidome. In this study of an Australian population, where the effects of known HLA and CCR5 determinants are evident, we show that genetic variation in ERAP2 additionally influences HIV-1-infection outcome. Furthermore, we showed a significant interaction effect on the impact of HIV Gag viral adaptation on pre-treatment viral load among 70 participants carrying different versions of the ERAP2 gene. Research in other disease settings has revealed epistatic interactions between HLA and ERAP [14,43]. ERAPs can alter the peptide presented by a specific HLA allele or redirect the response to novel T cell epitopes [9]. With regards to HIV, a previous study showed that viral adaptations can affect epitope processing, including trimming by ERAPs in commonly targeted regions in Gag (P17 and P24), thereby affecting the immunodominance of CD8+ T cell responses [44]. In addition, ERAP1 and ERAP2 haplotype A trimming activity has been found to be significantly affected by the internal sequence of the targeted peptide [14,45], and high viral adaptation will likely impair peptide trimming by ERAP.
The interaction effect observed in this study was significant in participants who carried ERAP2 haplotype B (AB and BB haplotypes) showing a linear relationship such that the viral load increases as the level of adaptation in the Gag gene increases, but this pattern was not observed for participants with ERAP2 haplotype AA. As the adaptation score is calculated based on a list of known HLA-associated polymorphisms [31], it is plausible that subdominant or novel T cell epitopes are targeted in hosts who carry haplotype A. This is supported by experimental evidence showing that ERAP2 haplotype A enzymatic activity is influenced by variation in the rs2549782 SNP that causes changes in the specificity and activity of the ERAP2 enzyme [26]. Importantly, this change is equivalent to two orders of magnitude, resulting in an alteration in the peptide repertoire and impairment of the capacity of this enzyme to complement epitope processing mediated by ERAP1 [26]. In another study that investigated the underlying cause of Birdshot Uveitis, an autoimmune disease associated with ERAP2 haplotype A [46], the authors found that ERAP2 haplotype A generates an HLA-A29-specific antigen repertoire by increasing the abundance of a distinct motif (with F and Y amino acids at P2) [14]. Our analysis of HLA-associated polymorphism in the Gag showed limited overlap in the associated amino acid positions between ERAP2 haplotypes, suggesting ERAP2 variation likely influences the repertoire of HLA-peptide presentation.
We also showed that ERAP2 haplotype B, which encodes truncated ERAP2 transcripts, predicts a lower HIV-1 viral load. Earlier studies have reported that the ERAP2 protein encoded by haplotype B undergoes nonsense-mediated decay [25], but more recent work has shown that virally-infected cells (with influenza or HIV-1) can trigger distinctive short transcripts of ERAP2 (termed ERAP2/Iso3 and ERAP2/Iso4) from haplotype B that could partially replenish the loss of ERAP2 expression [17,47]. The RNA expression of ERAP2/Iso3 can be translated into protein and correlates with the abundance of viral antigens (P24 antigen for HIV-1 and overall gene segments for the flu virus) in vitro [17,47]. Although the biological effect of ERAP2/Iso3 is still unclear, ERAP1 and ERAP2 interact to form heterodimers that enhance peptide trimming efficacy [48,49] and a possible interaction of ERAP2/Iso3 with ERAP2/haplotype A or with ERAP1 has been proposed (reviewed in [50]). Ye et al. [47] suggested that the translation of these short isoforms that lack the aminopeptidase domain might result in a “dominant negative” effect to increase the immunogenicity of the peptide repertoire in the case of viral infection.
The role of ERAP2 in biological processes other than antigen presentation (reviewed in [50]) suggest other immunological pathways involved in HIV-1 pathogenesis may also be affected by variation in ERAP2. A recent report presented evidence that ERAP2 haplotype A conferred protection during the Black Death [16], while haplotype B predicts better prognosis in HIV-1 infection and lowers risk of chronicity in Hepatitis C [51], suggesting that bacterial and viral pathogens may elicit distinct ERAP expression profiles, as shown for haplotype B (ERAP2/Iso3) expression [17,47].
The ERAP2 haplotype A may exert effects relevant to HIV-1 infection but not for chronic T cell responses. Studies of HIV-1 exposed seronegative individuals have found that ERAP2 genotype AA is overrepresented, suggesting possible effects operating at acquisition of infection and distinct from antigen processing for antiviral T cell responses [52,53]. In addition, in-vitro infection of peripheral blood mononuclear cells (PBMCs) with HIV-1 showed lower p24 levels in ERAP2 AA than in AB and BB genotype backgrounds; however, relevant interactions with other factors such as ERAP/iso3 heterodimers and global T cell responses are not captured in such in-vitro studies [53].
Our study does confirm previous literature regarding the impact of viral adaptation on disease prognosis [19,20]. While there was no correlation between clinical measurements and the level of viral adaptation of Pol and Nef genes, there was a correlation signal detected for the Gag gene. This result supports previous findings that Gag is an important immunogen [54] and the breadth of Gag-specific CD8+ T cell responses are associated with low levels of viremia [32]. Furthermore, the presence of Gag-specific CD8+ T cell responses (> two T cell targets) delays onset to AIDS [32]. Targeting specific Gag epitopes likely imposes strong selection pressure on HIV-1 [32], and adaptation occurring in this region likely results in sufficient loss of immune control that would affect clinical measurements [19].
We did analyse other datasets derived from GWAS with comparable data, though in ethnically distinct populations. While GWAS have contributed largely to understand the role of HLA in controlling HIV-1 infection [2–4,40,55], there are limitations in detection of determinants with moderate effect size in reaching genome-wide significance, and with interactive effects such as epistasis (reviewed in [56]). Furthermore, the effects of genes subject to significant transcriptional regulation under certain biological conditions such as inflammation, may be underestimated in GWAS. The impact of this limitation in GWAS is apparent in studies examining Crohn’s disease in which applying a genetic model that allows epistasis interactions could account for around 80% of heritability in this disease, whereas the many loci identified via GWAS explain only a minor percentage of disease heritability [1]. Of note, ERAP2 SNPs (rs2549782 and rs2248374) have not been identified through previous GWAS examining the role of host genetic factors on HIV-1disease outcomes [2–4,40,55]. Our analysis with GWAS HIV-1 datasets reported in [39,40] did not show significance for ERAP2 rs2549782 SNP associations with spVL. This may be attributed to differences in the definition of spVL. While in our study, the viral load is defined as the first pre-treatment viral load measurement taken at clinical presentation; in the replicated cohorts, spVL was defined as the mean of at minimum two viral load measurements taken during the chronic phase of infection [2,39,40,55] with some including the criteria of using VL data over a period of three years and diverging by no more than 0.5 log [2]. Another possible reason might be related to different combinations of ERAP2 haplotypes in the other cohorts and WAHIV cohort. ERAP2 is a highly polymorphic genetic region with two major haplotypes that branches further into other haplotypes with low frequencies [52,57]. A study conducted to investigate the lack of association of the rs2549782 SNP with preeclampsia in Chileans showed that, unlike other populations, there is a lack of LD between rs2549782 and rs2248374 SNPs, reflecting different ERAP2 haplotypes [58]. Our analysis of the proxy SNPs (S11 Table) shows that although the two proxy SNPs rs2548533 and rs2549797 are in complete LD with rs2248374, there was a markedly different minor allele frequency in a cohort of European ancestry compared with others reported in the 1000 Genomes project [59], suggesting there may be different ERAP2 haplotype structures in different populations. Furthermore, evidence of the distinctive influence of ERAP2 based on HLA binding specificities in Ankylosing Spondylitis studies [60,61] suggests the likelihood of different HLA alleles and/or ERAP1 and 2 haplotypes in our study and other cohorts limits the ability to replicate results with existing data. In addition, several of the GWAS were in populations of non-European ancestry in which the prevalence of HLA alleles and ERAP variants differ significantly and this may influence the list of relevant SNPs and viral adaptations used in the analyses.
The association between overall autologous HLA-associated HIV-1 adaptation score (calculated for near full-length of HIV-1 genome) and HIV-1 clinical outcome has been well documented [19]. The fact that we did not find a significant association between spVL and the adaptation score in the SHCS cohort is not unexpected. Carlson et al. [19] reported that Gag-specific adaptation score was significantly associated with VL (defined as a cross-sectional single measurement viral load taken mostly upon first presentation) in the Southern Africa (HIVC) cohort (P = 0.03) but not in the British Columbia (HIVB) cohort (P = 0.51), although in both cohorts the overall adaptation score was significantly associated with VL (P = 1*10−16, Southern Africa (HIVC) cohort; P = 0.03, British Columbia (HIVB) cohort) [19]. Notwithstanding these multiple and significant limitations in replication analyses in other cohort datasets, further population analyses and experimental studies are required to validate our findings and assess the generalizability in other chronic infections. We had insufficient statistical power to examine interaction between different combinations of specific HLA alleles, ERAPs and T cell epitopes, which may underpin global epistatic effects and account for population differences. Given the strong likelihood that epistasis and other interactions between genetic loci operate in-vivo, further studies are important not just for understanding HIV pathogenesis but also the broader ways in which antigen processing contributes to immune defenses against human pathogens.
Materials and methods
Study design and participants
Participants (n = 249) for this study are a subset of participants from the WAHIV cohort; a prospective observational study that commenced in 1992 [22]. The stratification criteria included participants who were treatment naïve, had well documented demographic information and pre-ART clinical data (viral load and CD4+ T cell count and/or proportion) (Table 1). Plasma and peripheral blood mononuclear cells (PBMCs) were isolated from whole blood and stored at -80°C. The participants in this study were predominantly male (78.78%), of Caucasian origin (73.62%) and infected with HIV-1 subtype B (76.73%). Race was self-reported.
Ethics statement
Ethics approval for the conduct of this study was obtained from the Royal Perth Hospital Human Ethics Committee (EC2004/005) and Murdoch University Human Research Committee (2014/048; 2017/242). The protocol and the procedures of the study were conducted in conformity with the ethical guidelines of the World Medical Association Declaration of Helsinki. Participants enrolled in this study gave written informed consent prior to study commencement.
DNA and viral RNA extraction
Genomic DNA was extracted from PBMCs using the DNA Isolation Kit (Beckman Coulter, Brea, CA, USA) and viral RNA was extracted from plasma samples using the MagMAX-96 viral RNA Isolation kit (Thermo Fisher Scientific, Waltham, MA, USA), according to the manufacturers’ instructions.
Viral sequencing
Viral RNA was converted to cDNA using the SuperScript III One-Step RT-PCR System with Platinum Taq High Fidelty Kit (Life Technologies, Carlsbad, CA, USA), as recommended in the manufacturer’s instructions. Samples were then amplified using a nested PCR approach to target the HIV-1 genes Pol, Gag and Nef as previously described [20]. Samples were sequenced either via the sanger-based method or using the 2×300bp paired-end chemistry kit on the Illumina MiSeq sequencer (Illumina, San Diego, CA, USA). Raw sequencing reads were filtered and analysed using the visual genomics analysis studio (VGAS) tool [62]. HIV sequences have been submitted to GenBank and the relevant accession numbers are available in S2 Data.
HLA genotyping
Genotyping of the HLA class I (HLA-A, -B and -C) and class II (HLA-DRB1) loci was based on exon 2 and 3 using locus-specific PCR amplification primers [63]. Where the HLA type was ambiguous, primers specific for HLA subtypes were used to perform allele-specific PCR followed by sequencing to resolve typing ambiguity. Assign (Conexio Genomics, WA, Australia) software was used to interpret sequencing chromatograms.
ERAP genotyping
Fourteen ERAP1 (rs3734016, rs73148308, rs26653, rs27895, rs2287987, rs27434, rs73144471, rs27529, rs78649652, rs30187, rs10050860, rs111363347, rs17482078, rs375081137) and two ERAP2 SNPs (rs2248374 and rs2549782) were genotyped using next generation sequencing as previously described [20]. An additional genotyping step was performed for the ERAP1 SNP rs26618 using the predesigned rhAmp genotyping assay [64] (Integrated DNA Technologies, Inc., USA). Instruction of the rhAmp assay reaction mixture and PCR cycling conditions were according to the manufacture’s protocol. Controls for the assay included genomic DNA extracted from cell lines (CIR, RML (IHW09016) and WT100BIS (IHW09006)) and an amplified genomic DNA sample (ACH-2) with confirmed rs26618 genotype combinations using next generation sequencing. Data were analyzed using the CFX manager 3.1 software (Bio-Rad Laboratories Pty Ltd).
ERAP1 and ERAP2 haplotype construction
ERAP1 and ERAP2 haplotypes were constructed based on the genotyping of 10 non-synonymous variants using the Haplo.Stats package version 1.8.9 [65] in RStudio version 4.2.3 that computes maximum likelihood estimates of haplotype probabilities using the expectation maximization algorithm (EM) [66,67]. The estimated haplotypes and their frequencies were compared to previously reported haplotypes [13], with estimated haplotypes with frequencies lower than 0.5% excluded from the analysis (S2 and S4 Table). The estimated haplotypes were used to infer the haplotype combination for each participant in the cohort and the haplotype pair with the highest posterior probability was selected for subsequent analysis.
HIV-1 subtype assignment
HIV-1 subtype was obtained from clinical records and confirmed using a combination of bioinformatic tools that infer subtype using different methodologies: REGA (phylogenetic-based tool) [68], COMET (statistical-based tool) [69], QC LANL (sequence quality check tool, https://www.hiv.lanl.gov/content/sequence/QC/index.html) and MEGA X (software for manual check of the possible subtype by constructing phylogenetic trees) [70]. HIV-1 subtype was assigned where there was concordance between the clinical records and the different bioinformatic tools.
Calculation of viral adaptation score
HIV-1 adaptation scores were calculated as previously described [20] and were used to assess the level of HLA-associated HIV-1 adaptation. Adaptation was based on a previously identified list of variations within the HIV-1 genome that were statistically associated with carriage of specific HLA alleles at the population level [31]. Adaptation scores were calculated for each participant by dividing the number of adaptations present within their viral sequence for the Gag, Pol and Nef genes, separately, by the total number of possible adaptations based on an individual’s HLA alleles. Only those sequences that included 100% of the potential adaptations for each HIV-1 gene were selected to perform the association analysis, as this removes any bias in the sequence coverage across a gene and provides equal coverage of putative adaptations for all participants. It should be noted that the adaptation list contains sites identified for HIV-1 subtype B (76.73% of infections in this cohort), but it is likely that there will be overlaps in T cell targets between subtypes [21,71–73]. However, to account for a possible bias in the association analysis, viral subtype (coded as subtype B and non-B) was included as a covariate whenever the adaptation score was used in an analysis. Furthermore, results were confirmed by limiting the analysis to subtype B only when the sample size was sufficient to perform the analysis.
Estimating linkage disequilibrium
To avoid the confounding effect of linkage disequilibrium (LD) in the association analysis, we first sought to identify the LD patterns in the host immune genetic data in the cohort. For HLA class I (HLA-A, -B, -C) and class II (HLA-DRB1), an online tool was used to explore LD patterns using fisher exact test comparisons (https://www.hiv.lanl.gov/content/immunology). For investigating LD among ERAP1 and ERAP2 SNPs, HaploView version 4.2 software [74] was used to compute D’ and r2 values for SNPs of interest. Fig 1 shows the results of the analysis. A cut-off D’ value >0.80 was set to indicate LD.
Hardy-Weinberg equilibrium test
Deviation from Hardy-Weinberg equilibrium (HWE) was examined for the HLA alleles and the ERAP1 and ERAP2 SNPs with an alpha value less than 0.05 considered statistically significant for HWE deviation. For the HLA data, the analysis was done in RStudio version 4.2.3 using the midasHLA package version 1.4.0 [75] that is specifically designed for HLA data. Testing the ERAP1 and ERAP2 SNPs for HWE deviation was performed using the SNPassoc package [76] based on a method that computes an exact test for HWE and controls for type I error regardless of sample size [77]. Deviation from HWE was detected in 13.6% of HLA class I and II alleles, which is expected as the HLA loci are under selection [78]. All the tested ERAP1 and ERAP2 SNPs were in HWE (S9 Table).
Statistical analysis
Normality
To approximate normal distribution some variables were transformed; viral load was transformed using log10 transformation and CD4+ T cell counts were transformed using the Box-Cox method [79] with an estimated transformation parameter (lambda) equal to 0.4628. Following transformation, log10 viral load remained slightly left-skewed and CD4+ T cell counts were slightly right-skewed. Therefore, a non-parametric test was applied where appropriate and model assumptions were checked for every model implemented using diagnostic plots, particularly for assumption of the normality of the residuals of the model.
Covariates.
Heterogenicity in pre-treatment clinical markers of HIV-1 infection has been shown to be affected by sex and race [80] as well as a deletion in the CCR5 gene (CCR5Δ32) [2,4]. Therefore, to avoid cofounding effects on the analysis, the above-mentioned variables were included as covariates in models where disease outcome was used as a response variable in addition to controlling for the HIV-1 subtype whenever adaptation score was used in the model. Race factor was coded as Caucasians and ‘other’, as the majority of the cohort was of Caucasian origin (73.62%) and other racial groups were in low frequencies (26.38%). An alpha value <0.05 was considered significant. P values were corrected for multiple comparisons using the FDR approach. All data analyses were performed in RStudio, version 4.2.3
Tests.
The independent association of ERAP1 and ERAP2 SNPs with disease outcome was done using ANOVA and adjusted as above with different genetic inheritance models in the SNPassoc package [76] in RStudio. Loci that were monomorphic or showed genotype variability <1% were excluded from the analysis. Also, SNPs that had <2% of the minor allele frequency for any genetic inheritance model were excluded from the analysis.
ERAP2 haplotype association with either disease outcome or viral adaptation score was done using ANOVA analysis and model assumption checked using diagnostic plots. Interaction effect of ERAP2 haplotypes and viral adaptation score on disease outcome was done by fitting the ANOVA model with an interaction term.
Differences in the level of adaptation score across different HIV-1 genomic regions was examined using the Kruskal–Wallis test followed by Dunn’s Test of multiple comparisons for post-hoc analysis. A Spearman’s rank correlation coefficient test was used to examine the association of gene-specific adaptation score with disease outcome. Fisher exact test was done to examine the association of variation in the HIV Gag gene (consensus vs non-consensus sequence) with HLA among different ERAP2 haplotypes at the 0.05 significance level.
The total variance explained by host and viral genetic factors (sex, race, carriage of CCR5Δ32, HLA-B*57 and B*27, carriage of different ERAP2 haplotypes, adaptation score of Gag gene, HIV-1 subtype) was estimated by fitting multiple regression models and assessing the coefficient of determination (R2) values. In addition, the change in adjusted R2 values were compared for each model after adding additional covariates. S8 Table shows all the regression models used, the R2 and the adjusted R2 values.
Supporting information
S1 Table. Association of ERAP1 and ERAP2 SNPs with HIV-1 clinical markers
https://doi.org/10.1371/journal.ppat.1012359.s001
(XLSX)
S2 Table. Estimated ERAP2 haplotype distribution.
https://doi.org/10.1371/journal.ppat.1012359.s002
(XLSX)
S3 Table. Association of ERAP2 haplotypes with disease outcome in study participants using a codominant inheritance model.
https://doi.org/10.1371/journal.ppat.1012359.s003
(XLSX)
S4 Table. Estimated ERAP1 haplotype distribution.
https://doi.org/10.1371/journal.ppat.1012359.s004
(XLSX)
S5 Table. Correlation of Gag, Pol and Nef autologous adaptation score with disease markers.
https://doi.org/10.1371/journal.ppat.1012359.s005
(XLSX)
S6 Table. Association of HLA alleles with Gag sequence among individuals carrying different ERAP2 haplotypes.
https://doi.org/10.1371/journal.ppat.1012359.s006
(XLSX)
S7 Table. IFN-γ responses to immunodominant HIV-1 T cell epitopes among individuals with different ERAP2 haplotypes.
https://doi.org/10.1371/journal.ppat.1012359.s007
(XLSX)
S8 Table. Proportion of variance explained by host and viral genetic factors.
https://doi.org/10.1371/journal.ppat.1012359.s008
(XLSX)
S9 Table. Description of ERAP1 and ERAP2 SNPs and HWE outcomes.
https://doi.org/10.1371/journal.ppat.1012359.s009
(XLSX)
S10 Table. Description of HIV-1 GWAS dataset reported by McLaren et al. [39].
https://doi.org/10.1371/journal.ppat.1012359.s010
(XLSX)
S11 Table. ERAP2 Proxy SNPs with near complete LD with rs2248374.
https://doi.org/10.1371/journal.ppat.1012359.s011
(XLSX)
S1 Fig. No significant difference in the level of viral adaptation to HLA-restricted immune response across HIV genome.
The adaptation list used in this calculation is based on statistical association of specific adaptation sites to HLA-restricted immune response described by [31]. Viral sequences with complete coverage for the adaptation sites were used in this analysis. Kruskal-Wallis test was performed to examine the difference between viral genes and the level of the adaptations to HLA class I-restricted immune responses.
https://doi.org/10.1371/journal.ppat.1012359.s012
(TIF)
S2 Fig. Lack of bias in sequence coverage of the Gag region.
This map represents sequence coverage for 70 individuals (highlighted with the green side ribbon) with complete coverage of the possible adaptation sites, and additional 31 (highlighted with the yellow side ribbon) that have between 90 to 96% of sequence coverage at the adaptation sites. Missing sequences in the map colored in red, whereas blue represents the presence of sequence.
https://doi.org/10.1371/journal.ppat.1012359.s013
(TIF)
S3 Fig. The distribution of the top significant associations of HLA alleles with Gag sequence among individuals carrying different ERAP2 haplotypes derived from S6 Table.
A threshold of <0.01 p-value was applied to filter the top significant associations in both groups. EV = experimentally verified. NP = not predicted. WB = weak binder. N-terminal and C-terminal indicates sites within 10 amino acids flanking epitopes.
https://doi.org/10.1371/journal.ppat.1012359.s014
(TIF)
Acknowledgments
We thank Professor Paul J. McLaren at University of Manitoba for providing an ICGH dataset for analysis. We thank the authors colleagues at the Vanderbilt University Medical Center, TN, USA and the Institute for Immunology and Infectious Diseases, Murdoch University, WA, Australia.
References
- 1. Zuk O, Hechter E, Sunyaev SR, Lander ES. The mystery of missing heritability: Genetic interactions create phantom heritability. Proceedings of the National Academy of Sciences. 2012;109(4):1193–8. pmid:22223662
- 2. Fellay J, Ge D, Shianna KV, Colombo S, Ledergerber B, Cirulli ET, et al. Common genetic variation and the control of HIV-1 in humans. PLoS Genet. 2009;5(12):e1000791. pmid:20041166
- 3. Pereyra F, Jia X, McLaren PJ, Telenti A, de Bakker PI, Walker BD, et al. The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science. 2010;330(6010):1551–7. pmid:21051598
- 4. McLaren PJ, Coulonges C, Bartha I, Lenz TL, Deutsch AJ, Bashirova A, et al. Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load. Proc Natl Acad Sci U S A. 2015;112(47):14658–63. pmid:26553974
- 5. Gao X, O’Brien TR, Welzel TM, Marti D, Qi Y, Goedert JJ, et al. HLA-B alleles associate consistently with HIV heterosexual transmissi, viral load, and progression to AIDS, but not susceptibility to infection. AIDS. 2010;24(12):1835–40.
- 6. Emu B, Sinclair E, Hatano H, Ferre A, Shacklett B, Martin JN, et al. HLA Class I-Restricted T-Cell Responses May Contribute to the Control of Human Immunodeficiency Virus Infection, but Such Responses Are Not Always Necessary for Long-Term Virus Control. Journal of Virology. 2008;82(11):5398–407. pmid:18353945
- 7. Strange A, Capon F, Spencer CC, Knight J, Weale ME, Allen MH, et al. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1. Nat Genet. 2010;42(11):985–90. pmid:20953190
- 8. Jiang P, Veenstra RN, Seitz A, Nolte IM, Hepkema BG, Visser L, et al. Interaction between ERAP Alleles and HLA Class I Types Support a Role of Antigen Presentation in Hodgkin Lymphoma Development. Cancers (Basel). 2021;13(3).
- 9. Kemming J, Reeves E, Nitschke K, Widmeier V, Emmerich F, Hermle T, et al. ERAP1 allotypes shape the epitope repertoire of virus-specific CD8(+) T cell responses in acute hepatitis C virus infection. J Hepatol. 2019;70(6):1072–81. pmid:30769005
- 10. Komlosh A, Momburg F, Weinschenk T, Emmerich N, Schild H, Nadav E, et al. A role for a novel luminal endoplasmic reticulum aminopeptidase in final trimming of 26 S proteasome-generated major histocompatability complex class I antigenic peptides. J Biol Chem. 2001;276(32):30050–6. pmid:11373290
- 11. Chen H, Li L, Weimershaus M, Evnouchidou I, van Endert P, Bouvier M. ERAP1-ERAP2 dimers trim MHC I-bound precursor peptides; implications for understanding peptide editing. Scientific Reports. 2016;6(1):28902. pmid:27514473
- 12. Blanchard N, Kanaseki T, Escobar H, Delebecque F, Nagarajan NA, Reyes-Vargas E, et al. Endoplasmic reticulum aminopeptidase associated with antigen processing defines the composition and structure of MHC class I peptide repertoire in normal and virus-infected cells. J Immunol. 2010;184(6):3033–42. pmid:20173027
- 13. Hutchinson JP, Temponeras I, Kuiper J, Cortes A, Korczynska J, Kitchen S, et al. Common allotypes of ER aminopeptidase 1 have substrate-dependent and highly variable enzymatic properties. J Biol Chem. 2021;296:100443. pmid:33617882
- 14. Venema WJ, Hiddingh S, de Boer JH, Claas FHJ, Mulder A, den Hollander AI, et al. ERAP2 Increases the Abundance of a Peptide Submotif Highly Selective for the Birdshot Uveitis-Associated HLA-A29. Frontiers in Immunology. 2021;12. pmid:33717175
- 15. Evnouchidou I, Kamal RP, Seregin SS, Goto Y, Tsujimoto M, Hattori A, et al. Cutting Edge: Coding Single Nucleotide Polymorphisms of Endoplasmic Reticulum Aminopeptidase 1 Can Affect Antigenic Peptide Generation In Vitro by Influencing Basic Enzymatic Properties of the Enzyme. The Journal of Immunology. 2011;186(4):1909–13. pmid:21242517
- 16. Klunk J, Vilgalys TP, Demeure CE, Cheng X, Shiratori M, Madej J, et al. Evolution of immune genes is associated with the Black Death. Nature. 2022;611(7935):312–9. pmid:36261521
- 17. Saulle I, Vanetti C, Goglia S, Vicentini C, Tombetti E, Garziano M, et al. A New ERAP2/Iso3 Isoform Expression Is Triggered by Different Microbial Stimuli in Human Cells. Could It Play a Role in the Modulation of SARS-CoV-2 Infection? Cells. 2020;9(9).
- 18. Moore CB, John M, James IR, Christiansen FT, Witt CS, Mallal SA. Evidence of HIV-1 adaptation to HLA-restricted immune responses at a population level. Science. 2002;296(5572):1439–43. pmid:12029127
- 19. Carlson JM, Du VY, Pfeifer N, Bansal A, Tan VY, Power K, et al. Impact of pre-adapted HIV transmission. Nat Med. 2016;22(6):606–13. pmid:27183217
- 20. Currenti J, Chopra A, John M, Leary S, McKinnon E, Alves E, et al. Deep sequence analysis of HIV adaptation following vertical transmission reveals the impact of immune pressure on the evolution of HIV. PLoS Pathog. 2019;15(12):e1008177. pmid:31821379
- 21. Alves E, Al-Kaabi M, Keane NM, Leary S, Almeida C-AM, Deshpande P, et al. Adaptation to HLA-associated immune pressure over the course of HIV infection and in circulating HIV-1 strains. PLOS Pathogens. 2022;18(12):e1010965. pmid:36525463
- 22. Mallal SA. The Western Australian HIV Cohort Study, Perth, Australia. J Acquir Immune Defic Syndr Hum Retrovirol. 1998;17 Suppl 1:S23–7. pmid:9586648
- 23. Panel on Antiretroviral Guidelines for Adults and Adolescents. Guidelines for the Use of Antiretroviral Agents in Adults and Adolescents with HIV. Department of Health and Human Services [updated Dec. 18, 2019. Available from: https://clinicalinfo.hiv.gov/en/guidelines/hiv-clinical-guidelines-adult-and-adolescent-arv/initiation-antiretroviral-therapy.
- 24. Gaudieri S, DeSantis D, McKinnon E, Moore C, Nolan D, Witt CS, et al. Killer immunoglobulin-like receptors and HLA act both independently and synergistically to modify HIV disease progression. Genes Immun. 2005;6(8):683–90. pmid:16121209
- 25. Andrés AM, Dennis MY, Kretzschmar WW, Cannons JL, Lee-Lin SQ, Hurle B, et al. Balancing selection maintains a form of ERAP2 that undergoes nonsense-mediated decay and affects antigen presentation. PLoS Genet. 2010;6(10):e1001157. pmid:20976248
- 26. Evnouchidou I, Birtley J, Seregin S, Papakyriakou A, Zervoudi E, Samiotaki M, et al. A common single nucleotide polymorphism in endoplasmic reticulum aminopeptidase 2 induces a specificity switch that leads to altered antigen processing. J Immunol. 2012;189(5):2383–92. pmid:22837489
- 27. Ombrello MJ, Kastner DL, Remmers EF. Endoplasmic reticulum-associated amino-peptidase 1 and rheumatic disease: genetics. Curr Opin Rheumatol. 2015;27(4):349–56. pmid:26002026
- 28. Zhang S, Liu S, Liu N, Li C, Wang H, Shi L, et al. Polymorphisms in ERAP1 and ERAP2 Genes Are Associated With Tuberculosis in the Han Chinese. Front Genet. 2020;11:566190. pmid:33250919
- 29. Saulle I, Vicentini C, Clerici M, Biasin M. An Overview on ERAP Roles in Infectious Diseases. Cells. 2020;9(3). pmid:32183384
- 30. Draenert R, Le Gall S, Pfafferott KJ, Leslie AJ, Chetty P, Brander C, et al. Immune selection for altered antigen processing leads to cytotoxic T lymphocyte escape in chronic HIV-1 infection. J Exp Med. 2004;199(7):905–15. pmid:15067030
- 31. Carlson JM, Brumme CJ, Martin E, Listgarten J, Brockman MA, Le AQ, et al. Correlates of protective cellular immunity revealed by analysis of population-level immune escape pathways in HIV-1. Journal of virology. 2012;86(24):13202–16. pmid:23055555
- 32. Kiepiela P, Ngumbela K, Thobakgale C, Ramduth D, Honeyborne I, Moodley E, et al. CD8+ T-cell responses to different HIV proteins have discordant associations with viral load. Nature Medicine. 2007;13(1):46–53. pmid:17173051
- 33. Reynisson B, Alvarez B, Paul S, Peters B, Nielsen M. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020;48(W1):W449–w54. pmid:32406916
- 34. Ranasinghe SR, Kramer HB, Wright C, Kessler BM, di Gleria K, Zhang Y, et al. The antiviral efficacy of HIV-specific CD8+ T-cells to a conserved epitope is heavily dependent on the infecting HIV-1 isolate. PLoS Pathog. 2011;7(5):e1001341.
- 35. Milicic A, Price DA, Zimbwa P, Booth BL, Brown HL, Easterbrook PJ, et al. CD8+ T cell epitope-flanking mutations disrupt proteasomal processing of HIV-1 Nef. J Immunol. 2005;175(7):4618–26. pmid:16177107
- 36. Currenti J, Law BMP, Qin K, John M, Pilkinton MA, Bansal A, et al. Cross-Reactivity to Mutated Viral Immune Targets Can Influence CD8+ T Cell Functionality: An Alternative Viral Adaptation Strategy. Frontiers in Immunology. 2021;12. pmid:34764960
- 37. Conrad JA, Ramalingam RK, Duncan CB, Smith RM, Wei J, Barnett L, et al. Antiretroviral Therapy Reduces the Magnitude and T Cell Receptor Repertoire Diversity of HIV-Specific T Cell Responses without Changing T Cell Clonotype Dominance. Journal of Virology. 2012;86(8):4213–21. pmid:22258246
- 38. Sadagopal S, Lorey SL, Barnett L, Basham R, Lebo L, Erdem H, et al. Enhancement of human immunodeficiency virus (HIV)-specific CD8+ T cells in cerebrospinal fluid compared to those in blood among antiretroviral therapy-naive HIV-positive subjects. J Virol. 2008;82(21):10418–28. pmid:18715919
- 39. McLaren PJ, Porreca I, Iaconis G, Mok HP, Mukhopadhyay S, Karakoc E, et al. Africa-specific human genetic variation near CHD1L associates with HIV-1 load. Nature. 2023;620(7976):1025–30. pmid:37532928
- 40. Bartha I, Carlson JM, Brumme CJ, McLaren PJ, Brumme ZL, John M, et al. A genome-to-genome analysis of associations between human genetic variation, HIV-1 sequence diversity, and viral control. eLife. 2013;2:e01123. pmid:24171102
- 41. Zheng X, Shen J, Cox C, Wakefield JC, Ehm MG, Nelson MR, et al. HIBAG—HLA genotype imputation with attribute bagging. Pharmacogenomics J. 2014;14(2):192–200. pmid:23712092
- 42. Carrington M O’Brien SJ. The Influence of HLA Genotype on AIDS. Annual Review of Medicine. 2003;54(1):535–51.
- 43. D’Amico S, D’Alicandro V, Compagnone M, Tempora P, Guida G, Romania P, et al. ERAP1 Controls the Interaction of the Inhibitory Receptor KIR3DL1 With HLA-B51:01 by Affecting Natural Killer Cell Function. Front Immunol. 2021;12:778103. pmid:34917091
- 44. Tenzer S, Wee E, Burgevin A, Stewart-Jones G, Friis L, Lamberth K, et al. Antigen processing influences HIV-specific cytotoxic T lymphocyte immunodominance. Nature Immunology. 2009;10(6):636–46. pmid:19412183
- 45. Evnouchidou I, Momburg F, Papakyriakou A, Chroni A, Leondiadis L, Chang SC, et al. The internal sequence of the peptide-substrate determines its N-terminus trimming by ERAP1. PLoS One. 2008;3(11):e3658. pmid:18987748
- 46. Kuiper JJ, Van Setten J, Ripke S, Van TSR, Mulder F, Missotten T, et al. A genome-wide association study identifies a functional ERAP2 haplotype associated with birdshot chorioretinopathy. Hum Mol Genet. 2014;23(22):6081–7. pmid:24957906
- 47. Ye CJ, Chen J, Villani AC, Gate RE, Subramaniam M, Bhangale T, et al. Genetic analysis of isoform usage in the human anti-viral response reveals influenza-specific regulation of ERAP2 transcripts under balancing selection. Genome Res. 2018;28(12):1812–25. pmid:30446528
- 48. Evnouchidou I, Weimershaus M, Saveanu L, van Endert P. ERAP1-ERAP2 dimerization increases peptide-trimming efficiency. J Immunol. 2014;193(2):901–8. pmid:24928998
- 49. Saveanu L, Carroll O, Lindo V, Del Val M, Lopez D, Lepelletier Y, et al. Concerted peptide trimming by human ERAP1 and ERAP2 aminopeptidase complexes in the endoplasmic reticulum. Nat Immunol. 2005;6(7):689–97. pmid:15908954
- 50. Kuśnierczyk P. To Be or Not to Be: The Case of Endoplasmic Reticulum Aminopeptidase 2. Front Immunol. 2022;13:902567. pmid:35769458
- 51. Liu S, Cao D, Shen Y, Li Y, Li Y, Shi L, et al. The ERAP gene is associated with HCV chronic infection in a Chinese Han population. Hum Immunol. 2017;78(11–12):731–8. pmid:29037997
- 52. Cagliani R, Riva S, Biasin M, Fumagalli M, Pozzoli U, Lo Caputo S, et al. Genetic diversity at endoplasmic reticulum aminopeptidases is maintained by balancing selection and is associated with natural resistance to HIV-1 infection. Hum Mol Genet. 2010;19(23):4705–14. pmid:20843824
- 53. Biasin M, Sironi M, Saulle I, de Luca M, la Rosa F, Cagliani R, et al. Endoplasmic reticulum aminopeptidase 2 haplotypes play a role in modulating susceptibility to HIV infection. Aids. 2013;27(11):1697–706. pmid:23435305
- 54. Martinez-Picado J, Prado JG, Fry EE, Pfafferott K, Leslie A, Chetty S, et al. Fitness cost of escape mutations in p24 Gag in association with control of human immunodeficiency virus type 1. J Virol. 2006;80(7):3617–23. pmid:16537629
- 55. Fellay J, Shianna KV, Ge D, Colombo S, Ledergerber B, Weale M, et al. A whole-genome association study of major determinants for host control of HIV-1. Science. 2007;317(5840):944–7. pmid:17641165
- 56. Tam V, Patel N, Turcotte M, Bossé Y, Paré G, Meyre D. Benefits and limitations of genome-wide association studies. Nature Reviews Genetics. 2019;20(8):467–84. pmid:31068683
- 57. Raja A, Kuiper JJW. Evolutionary immuno-genetics of endoplasmic reticulum aminopeptidase II (ERAP2). Genes & Immunity. 2023. pmid:37925533
- 58. Vanhille DL, Hill LD, Hilliard DD, Lee ED, Teves ME, Srinivas S, et al. A Novel ERAP2 Haplotype Structure in a Chilean Population: Implications for ERAP2 Protein Expression and Preeclampsia Risk. Mol Genet Genomic Med. 2013;1(2):98–107. pmid:24040622
- 59. Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. pmid:26432245
- 60. Martín-Esteban A, Guasp P, Barnea E, Admon A, López de Castro JA. Functional Interaction of the Ankylosing Spondylitis-Associated Endoplasmic Reticulum Aminopeptidase 2 With the HLA-B*27 Peptidome in Human Cells. Arthritis Rheumatol. 2016;68(10):2466–75. pmid:27110896
- 61. Lorente E, Redondo-Antón J, Martín-Esteban A, Guasp P, Barnea E, Lauzurica P, et al. Substantial Influence of ERAP2 on the HLA-B*40:02 Peptidome: Implications for HLA-B*27-Negative Ankylosing Spondylitis. Mol Cell Proteomics. 2019;18(11):2298–309. pmid:31530632
- 62. Leary S, Cooper D, Chopra A, Watson M. IIID VGAS—A Visual Genome Analysis Studio incorporating high performance next generation sequence analysis2013.
- 63. Witt CS, Price P, Kaur G, Cheong K, Kanga U, Sayer D, et al. Common HLA-B8-DR3 haplotype in Northern India is different from that found in Europe. Tissue Antigens. 2002;60(6):474–80. pmid:12542740
- 64. Beltz K, Tsang DCW, Wang J, Rose SD, Bao Y, Wang Y, et al. A High-Performing and Cost-Effective SNP Genotyping Method Using rhPCR and Universal Reporters. Advances in Bioscience and Biotechnology. 2018;9:497–512.
- 65. Sinnwell JP, Schaid DJ. haplo.stats: Statistical Analysis of Haplotypes with Traits and Covariates when Linkage Phase is Ambiguous. R package version 1.8.9. ed2022.
- 66. Long JC, Williams RC, Urbanek M. An E-M algorithm and testing strategy for multiple-locus haplotypes. Am J Hum Genet. 1995;56(3):799–810. pmid:7887436
- 67. Excoffier L, Slatkin M. Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Mol Biol Evol. 1995;12(5):921–7. pmid:7476138
- 68. de Oliveira T, Deforche K, Cassol S, Salminen M, Paraskevis D, Seebregts C, et al. An automated genotyping system for analysis of HIV-1 and other microbial sequences. Bioinformatics. 2005;21(19):3797–800. pmid:16076886
- 69. Struck D, Lawyer G, Ternes AM, Schmit JC, Bercoff DP. COMET: adaptive context-based modeling for ultrafast HIV-1 subtype identification. Nucleic Acids Res. 2014;42(18):e144. pmid:25120265
- 70. Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9. pmid:29722887
- 71. Gupta SB, Mast CT, Wolfe ND, Novitsky V, Dubey SA, Kallas EG, et al. Cross-clade reactivity of HIV-1-specific T-cell responses in HIV-1-infected individuals from Botswana and Cameroon. J Acquir Immune Defic Syndr. 2006;42(2):135–9. pmid:16760794
- 72. Geels MJ, Dubey SA, Anderson K, Baan E, Bakker M, Pollakis G, et al. Broad cross-clade T-cell responses to gag in individuals infected with human immunodeficiency virus type 1 non-B clades (A to G): importance of HLA anchor residue conservation. J Virol. 2005;79(17):11247–58. pmid:16103177
- 73. Gudmundsdotter L, Bernasconi D, Hejdeman B, Sandstrom E, Alaeus A, Lidman K, et al. Cross-clade immune responses to Gag p24 in patients infected with different HIV-1 subtypes and correlation with HLA class I and II alleles. Vaccine. 2008;26(40):5182–7. pmid:18479789
- 74. Barrett JC, Fry B, Maller J, Daly MJ. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005;21(2):263–5. pmid:15297300
- 75. Migdal M, Ruan DF, Forrest WF, Horowitz A, Hammer C. MiDAS—Meaningful Immunogenetic Data at Scale. PLOS Computational Biology. 2021;17(7):e1009131. pmid:34228721
- 76. González JR, Armengol L, Solé X, Guinó E, Mercader JM, Estivill X, et al. SNPassoc: an R package to perform whole genome association studies. Bioinformatics. 2007;23(5):654–5.
- 77. Wigginton JE, Cutler DJ, Abecasis GR. A note on exact tests of Hardy-Weinberg equilibrium. Am J Hum Genet. 2005;76(5):887–93. pmid:15789306
- 78. Hughes AL, Nei M. Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection. Nature. 1988;335(6186):167–70. pmid:3412472
- 79. Box GEP, Cox DR. An Analysis of Transformations. Journal of the Royal Statistical Society: Series B (Methodological). 1964;26(2):211–43.
- 80. Anastos K, Gange SJ, Lau B, Weiser B, Detels R, Giorgi JV, et al. Association of race and gender with HIV-1 RNA levels and immunologic progression. J Acquir Immune Defic Syndr. 2000;24(3):218–26. pmid:10969345