Printer Friendly

Computer-Aided Design of an Epitope-Based Vaccine against Epstein-Barr Virus.

1. Introduction

Epstein-Barr virus (EBV), or human herpesvirus 4, is a large enveloped virus that belongs to the family herpesviruses y. It has a size of 120-180 nm and a double-stranded linear DNA genome (~171 Kb long), encoding ~90 genes [1]. The genome is enclosed within a nucleocapsid protein which is in turn surrounded by a lipid envelope that contains the viral surface proteins essential for infection [2]. According to its expression, EBV genes are divided into immediate early (expressed very early during lytic infection, coding for transcription factors), early (interfere with the host metabolism and DNA synthesis), and late genes (including structural and nonstructural glycoproteins). There are two major subtypes of EBV (type 1 and type 2), which mainly differ in their nuclear antigen-3 gene (EBNA-3). Both types are detected all over the world, yet type 1 is dominant in most populations [3].

EBV is present in over 90% of the adult world population [4]. Most people become infected with EBV during childhood and develop little or no symptoms. However, if the infection occurs later in life, it can cause infectious mononucleosis (IM) in about 30-50% of the cases [5]. Viral transmission is primarily through saliva; hence, the nickname of kiss disease for IM. The virus can infect and replicate in epithelial and B cells. Infection of epithelial cells of the oropharynx has a relevant role in EBV expansion during primary infection [2]. However, B cells are the main targets of the virus. They are fundamental to establish an EBV infection--X-linked agammaglobulinemic patients are not infected by the virus [6]--and can pass the virus to epithelial cells by direct contact [7]. Moreover, it is in memory B cells that the virus persists as a long-term latent infection [8]. Tropism of EBV for B lymphocytes is mediated by cell surface molecules CD21 (i.e., complement receptor 2 (CR2)) and HLA-II that serve as receptors of the viral envelope glycoproteins gp350 and gp42, respectively [9]. Infection of B cells by EBV does not usually release viral progeny. Instead, the virus activates the cell cycle driving the expansion of latently infected B cells, inducing its own proliferation, thus getting persistently established in the lymphoid system [7, 8]. Latency is not permanent though, as EBV can periodically switch between latent and lytic states. Reactivation from latency is triggered by environmental stimuli and the process is tightly controlled by the immune system [10].

Immunity against EBV has been studied extensively [10, 11]. Natural killer (NK) cells play an important role in the innate immune response, delaying or preventing the EBV transformation of B cells through the production of interferon gamma (IFN-[gamma]) [12]. Subsequently, the virus elicits strong adaptive immune responses, primarily mediated by cytotoxic CD8 T cells. CD8 T cell responses eliminate viral-infected cells upon recognition of EBV peptide antigens bound to MHC I molecules in the surface of target cells. Cytotoxic CD8 T cell response against EBV infection is so dramatic that, in IM patients, up to 50% of CD8 T cells recognize EBV-specific CD8 T cell epitopes, most derived from immediate early or early antigens [13, 14]. In contrast, CD4 T cell responses against the virus are less dramatic and focused [13]. CD4 T cells recognize peptide antigens bound to MHC II molecules and commit into different phenotypes of cytokine-producing T helper cells (Th) that control the immune response. Most EBV-specific CD4 T cells produce IFN-y and tumor necrosis factor alpha (TNF-a), with a smaller number producing IL-2 which is the usual and expected Th1 antiviral response [15]. Regarding the humoral immune response, EBV infection triggers a potent reaction against various viral antigens. The acute primary infection is associated with the induction of IgM antibodies against the virus capsid antigen (VCA), which switches to an IgG isotype. IgG anti-VCA antibodies are not neutralizing and remain for life. Neutralizing IgG antibodies targeting viral major glycoprotein gp350 arise only after the resolution of the primary infection [16]. Other antibodies targeting nonneutralizing antigens (e.g., viral proteins located intracellularly) also appear sometime after the resolution of the primary infection [16, 17].

The immune system is capable of controlling EBV primary infection and reactivation phases, forcing the virus to stay latent in memory B cells. Such a control likely has a toll in the immune system. In fact, after extended periods of latency and being facilitated by its potent growth transforming capability, EBV appears to promote an increasing number of human cancers. Frequent cancers linked to EBV include several B cell malignancies, such as Burkitt's lymphoma (BL) and Hodgkin's lymphoma (HL), and epithelial cell malignancies, notably nasopharyngeal carcinoma (NPC) [18]. Furthermore, EBV infection has been implicated with autoimmunity and it is clearly a risk factor for developing multiple sclerosis and to a lesser systemic lupus erythematosus [19].

Currently, no medicine can cure EBV infection and there is no prophylactic or therapeutic vaccine against it. Clearly, a prophylactic vaccine against EBV will have a major impact in public health as it will prevent both EBV infection and related diseases [20]. In this study, we explored a reverse-vaccinology approach to design a prophylactic vaccine against EBV based on CD8 and CD4 T cell epitopes and B cell epitopes. For designing the T cell epitope vaccine component, we relied on combining legacy experimentation with bioinformatics analysis aimed to identify conserved and highly promiscuous T cell epitopes [21-23]. Given the size and complexity of EBV, we also introduced expression criteria to reduce the number of T cell epitopes and focus on those from early antigens with acknowledged function at the initial steps of primary infection [23,24]. As for the B cell component, we included highly conserved experimentally determined B cell epitopes from EBV gp350 protein as well as potential B cell epitopes predicted in flexible solvent-exposed regions of other envelope proteins important for infection like gp42, gB, and gL. We are confident that our epitope vaccine ensemble poses a basis for developing a powerful and effective vaccine against EBV. Moreover, we trust that the approach and methods introduced in this work ought to become a paradigm of general use in reverse vaccinology.

2. Materials and Methods

2.1. Collection of EBV-Specific Epitopes. We retrieved experimentally defined EBV-specific T and B epitope sequences from the EPIMHC [25] and IEDB [26]. As inclusion criteria, we considered positive assays (excluding low-positive responses) and epitopes being linked to the course of a natural infection in humans for T cell epitopes and any human disease for B cell epitopes. We discarded duplicate peptides and when available, we also retrieved the MHC restriction elements of T cell epitopes. For B cell epitopes, we considered all unique sequences that were not included as part of longer peptides. In total, we obtained 247 unique B cell epitopes and 109 unique T cell epitopes (88 CD8 T cell epitopes and 21 CD4 T cell epitopes). These epitopes are available as supplementary data in Additional File S1 available online at, including Tables S1A, S1B, and S1C for CD8, CD4, and B cell epitopes, respectively. Perl scripts used to identify unique B and T cell epitopes from IEDB search outputs can be obtained from the corresponding author.

2.2. Generation of Clusters and Multiple Sequence Alignments of EBV Protein Sequences. We used CD-HiT [27] with default settings to generate clusters from 13,899 EBV protein sequences that included 89 translated coding DNA sequences (CDS) from a reference genome virus (accession: NC_007605). The protein sequences were downloaded following the links in the NCBI taxonomy database (TAX ID: 10376) [28]. We processed CD-HIT clusters with reference EBV proteins, removed identical sequences, and subsequently generated multiple sequence alignments (MSA) using MUSCLE [29]. As a result, we obtained 85 referenced MSA of EBV proteins that were used for further analysis. Software for clustering the sequences will be provided by the corresponding author upon written request.

2.3. Generation of EBV-Reference Proteome with Variable Sites Masked and Identification of Conserved Epitopes. We generated EBV-reference sequences with variable sites masked upon sequence variability analyses on the referenced MSA of EBV proteins. Briefly, we calculated the sequence variability in the MSA of EBV proteins using the Shannon entropy [30], H, as a variability metric [21, 24, 31]. Shannon entropy per site in a MSA is given by

H = [M.summation over (i=1)][P.sub.i][log.sub.2][P.sub.i], (1)

where Pi is the fraction of residues of amino acid type i and M is equal to 20, the number of amino acid types. H ranges from 0 (total conservation, only one amino acid type is present at that position) to 4.322 (all 20 amino acids are equally represented in that position). We considered gaps as no data. To generate reference EBV consensus sequences, we assigned the computed variability, H, to the EBV-reference proteins included in the MSA and subsequently masked all positions with a variability, H, greater than 0.5 [32, 33]. We used this reference sequence to discard epitope sequences that did not match entirely with it. Hence, the epitopes that we considered conserved did not have a single residue with H > 0.5.

2.4. Prediction of Peptide HLA Presentation Profiles and Computation of Population Protection Coverage. T cells only recognize peptides when presented in the cell surface of antigen-presenting cells bound to HLA molecules (MHC molecules in humans). Therefore, we anticipated HLA presentation profiles of peptides by predicting peptide-HLA binding. For CD8 T cell epitopes, we predicted peptide binding using 55 HLA I-specific motif profiles [34-36]. A top 2% rank percentile was used to consider binding to the relevant HLA I molecule. For CD4 T cell epitopes, we predicted peptide binding to 15 reference HLA-DR molecules [37] using the IEDB binding tool [38]. We used a 5% percentile rank cutoff to consider that binding had occurred. The population protection coverage (PPC) of a set of epitopes is the proportion of the population that could elicit an immune response against any of them and can be computed by knowing the gene frequencies of the HLA I alleles that can present the epitopes [21]. For HLA I-restricted T cell epitopes, we used EPISOPT to compute epitope PPC [39]. EPISOPT uses HLA I allele frequencies for 5 distinct ethnic groups in the USA population (Caucasian, Hispanic, Black, Asian, and North American natives) [40] and can identify combinations of epitopes reaching a determined PPC in each of the population groups. We aimed to identify epitope combinations reaching a PPC of 95% in the 5 ethnic groups. For HLA II-restricted epitopes, we used IEDB PPC tool [41] to compute PPC for the world population using the epitope-HLA II presentation profiles predicted previously. We identified combinations of CD4 T cell epitopes reaching a maximum PPC by introducing into the IEDB PPC tool different combinations of epitopes with their corresponding HLA II binding profiles.

2.5. B Cell Epitope Prediction and Calculation of Flexibility and Solvent Accessibility. We considered flexible protein fragments identified in available 3D structures of the relevant antigens with relative solvent accessibility > 50% as potential B cell epitopes. As residue flexibility values, we used normalized B factors, [Z.sub.B] (2):

[Z.sub.B] = B - [[mu].sub.B]/[[partial derivative].sub.B], (2)

where B is the residue B factor from the relevant PDB, [[mu].sub.B] is the mean of the [C.sub.[alpha]] residue of B factors, and [[partial derivative].sub.B] is the standard deviation of [C.sub.[alpha]] B factors. Flexible regions, potential B cell epitopes, consisted of 9 consecutive residues or more with flexibility equal or greater than the computed [[partial derivative].sub.B] (1.0). For each selected protein fragment, we obtained a flexibility score consisting of the average flexibility of the fragment residues and a solvent accessibility value consisting of the average relative solvent accessibility (RSA) of the residues. We obtained residue RSAs from the relevant PDB coordinates using NACCESS [42]. Solvent accessibility values and flexibility scores were computed in the same manner for experimental B cell epitopes.

2.6. Blast Searches, Protein Annotation, and Analysis Procedures. We mapped epitopes onto three-dimensional (3D) structures and retrieved UniProtKB [43] entries upon BLAST searches [44] against the PDB and Swissprot databases at NCBI ( We also carried out BLAST searches with conserved epitope sequences as query against human proteins and human microbiome proteins to detect epitope identity to human or human microbiome proteins. These BLAST searches were carried out locally with standalone programs using an expectation value (-e) of 10,000. Human microbiome protein sequences for BLAST searches were obtained from the NIH Human Microbiome Project [45] at NCBI ( As human protein sequences, we used all human proteins available in the nonredundant (NR) collection at NCBI. We used PyMOL Molecular Graphics System, Version 1.8 Schrodinger, to visualize B cell epitopes on 3D structures. We identified function, subcellular localization, and temporal expression of selected EBV proteins (developmental stage) from UniProtKB [43].

3. Results

3.1. Reference EBV Proteome with Variable Residues Masked. Epitope-based vaccines can force the immune system to recognize conserved antigen regions. Therefore, a key step in our approach to epitope vaccine design is to carry out sequence variability analyses enabling the selection of conserved epitopes. To that end, we clustered all available EBV protein sequences around a reference EBV proteome (NC_007605), obtaining 85 protein clusters with EBV reference proteins on them (details in Materials and Methods). Upon aligning the sequences in the clusters, we subjected them to sequence variability analyses using the Shannon entropy, H, as variability metric. As a result, we identified that only 960 residue sites of the 42,998 evaluated had H [greater than or equal to] 0.5 and generated reference consensus EBV sequences with those variable sites masked. A variability of H < 0.5 is a very stringent threshold for low variability and that only a few sites (960 residue sites) with H [greater than or equal to] 0.5 were found indicates that EBV, as most dsDNA viruses, has a low mutation rate [1]. By matching EBV epitopes with this reference EBV proteome, we were able to select only those epitopes consisting of conserved residues (H < 0.5).

3.2. CD8 T Cell Epitope Component. To design the CD8 T cell vaccine component, we started with 88 unique EBV-specific CD8 T cell epitope sequences that were experimentally verified to be recognized in the course of a natural infection by EBV in humans. That set was reduced to 58 epitopes when we selected only those with a length of 9 residues (9 mers). We selected 9 mer peptides because most peptides presented by MHC I molecules are of that size [36]. Among those, we found 40 epitopes that did not have a single residue with H > 0.5 and none were 100% identical to human proteins or human microbiome proteins (sequences and identity data included in Additional File S2, Table S2A). A strong Cd8 T cell response to early antigens is key to clear the virus [14]. Therefore, after identifying the function and developmental stage of the relevant antigens in UniprotKB, we selected 16 CD8 T cell epitopes that were present in early antigens and had a reported functionality in primary EBV infection (Table 1). For each selected CD8 T cell epitope, we predicted its potential HLA I presentation profile (see Materials and Methods) and subsequently computed the population protection coverage (PPC) for 5 distinct ethnic groups present in the USA population (see Materials and Methods). PPC of CD8 T cell epitopes ranged from 5.08% to 57.84% (Table 1). Epitopes ARYAYYLQF and VSFIEFVGW had little PPC and were discarded for further analysis. Subsequently, we used EPISOPT [39] to identify epitope combinations within the remaining 14 CD8 T cell epitopes that could provide a PPC of 95% in each one of the ethnic groups. We found that just 5 epitopes were required to reach it. Moreover, we identified 40 different epitope combinations, 3 with 5 epitopes and 37 with 6 epitopes, that reached PPC [greater than or equal to] 95% (data not shown). EPISOPT did not report more numerous epitope combinations because adding more epitope sequences did not increase the PPC [39]. The combination with only 5 epitopes that reached the largest PPC (96.0%) consisted of epitopes YVLDHLIVV, VLKDAIKDL, RVRAYTYSK, LPCVLWPVL, and AYSSWMYSY. However, the epitope combination that provided the highest PPC (97.1%) included 6 CD8 T cell epitopes: YVLDHLIVV, YRSGIIAVV, SVRDRLARL, RVRAYTYSK, LPCVLWPVL, and RRIYDLIEL. All the 14 CD8 T cell epitopes were found in at least one of the epitope combinations reaching 95% PPC. Subsequently, we considered all the 14 CD8 T cell epitopes for inclusion in the CD8 T cell vaccine component. The selected epitopes originate from 6 different viral antigens, including EBNA3, BRLF1, EBNA6, EBNA1, BMRF1, and BZLF1 (Table 1), and thus will also contribute to a multiantigenic response.

3.3. CD4 T Cell Epitope Component. We identified a total of 21 EBV-specific CD4 T cell epitopes from the relevant epitope databases that were elicited in the course of a natural infection by EBV in humans (Table S1B in Additional File S1). Of those, we selected 10 epitopes that were conserved (Table 2) and none were 100% identical to human proteins or human microbiome proteins (see Table S2B in Additional File 2). The size of the conserved CD4 T cell peptides ranged from 15 to 20 residues long. We next identified their HLA II presentation profile by predicting peptide-MHC II binding to 15 distinct HLA-DR molecules that are frequently expressed in the population (see Materials and Methods). We chose to target HLA-DR molecules for two reasons: the alpha chain is nonpolymorphic [32] and HLA-DR are expressed at a much higher density in the cell surface of antigen-presenting cells than any other HLA II molecules [46] and thus are more relevant for epitope vaccine design [47].

Upon determining epitope HLA II presentation profiles, we computed the PPC for the world population as indicated in Materials and Methods. The maximum PPC that could be reached by considering the entire set of HLA-DR molecules is 81.81%. The PPC of selected CD4 T cell epitopes ranged from 0% (QKRAAPPTVSPSDTG) to 69.85% (MLGQDDFIKFKSPLV). The PPC that could be reached by combining all distinct HLA-DR molecules that were found to bind the selected CD4 T cell epitopes was 81.81% (Table 2). This PPC was reached by considering only the epitopes MLGQDDFIKFKSPLV, AGLTLSLLVICSYLFISRG, SRDELLHTRAASLLY, and PPVVRMFMRERQLPQ derived from antigens BFRF1, BHRF1, BARF1 and EBNA6, respectively. Antigens BFRF1 and EBNA6 are nuclear proteins, whereas BARF1 is a secreted protein and BHRF1 is a membrane-bound antigen. We considered this 4-epitope combination as the optimal CD4 T cell vaccine component.

3.4. B Cell Epitope Component. We assembled the B cell epitope vaccine component from a set of 247 EBV-specific unique linear B cell epitope sequences ranging from 4 to 38 amino acids (Table S1C in Additional File S1). From those, we discarded B cell epitopes shorter than 9 residues and kept 117 that were conserved with no single residue with H > 0.5 (details in Materials and Methods). Moreover, none of these 117 B cell epitopes were identical to human proteins or to human microbiome proteins (data provided in Additional File S2, Table S2C). We analyzed the subcellular location of selected antigens to identify those that are expressed in the viral surface, accessible for antibody recognition. We found that the vast majority of the selected epitopes originated from viral intracellular antigens and therefore have no interest for B cell epitope vaccine design. We only found 9 B cell epitopes that were present in viral envelope glycoproteins: 7 from the major surface antigen gp350, the main viral determinant mediating viral attachment to B cells [48] and 2 from the envelope glycoprotein B (gB), key for the fusion of viral and host cell membranes during viral entry [49] (Table 3). However, only the 7 gp350 B cell epitopes mapped on the protein ectodomain and were further considered for the B cell epitope vaccine component. The 2 gB epitopes, QKRAAQRAAGPSVAS and VSGFISFFKNPFGGM, mapped onto the inner and transmembrane regions, respectively (Table 3).

Flexible and accessible linear B cell epitopes are often cross-reactive with antibodies against native antigens and are thereby of prime interest for epitope vaccine design [50]. Therefore, to further analyze the suitability for vaccine design of the 7 remaining gp350 B cell epitopes, we devised a system to quantify the flexibility and solvent accessibility of B cell epitopes from the known 3D structures. Briefly, we used normalized B factors and relative residue solvent accessibility computed from the relevant PDBs as measures of flexibility and accessibility (details in Materials and Methods). Following these criteria, we discarded the gp350 B cell epitope PSTSSKLRPRWTFTSPPVTT, for it mapped onto a region of the gp350 without a 3D structure and we could not readily evaluate its flexibility and accessibility. Of the 6 gp350 B cell epitopes that mapped onto the available gp350 3D structure (PDB: 2H6O), only 3 of them, SKAPESTTTSPTLNTTGFA, YVFYSGNGPKASGG DYCIQS, and QNPVYLIPETVPYIKWDN, had flexibility and solvent accessibility values supporting that they were readily accessible for antibody recognition (Table 3). In fact, visual inspection of epitopes SVKTEMLGNEID and QVSLESVDVYFQDVFGTMWC in the gp350 3D structure revealed that they were buried and thus not accessible for antibody recognition, while B cell epitope TNTTDI TYVGD though accessible (60%) was located in a rigid region of the protein (Figure S1 in Additional File S3). These epitopes will likely induce antibodies that will be unable to recognize native antigens and were discarded from the B cell vaccine component.

Following the hypothesis that highly flexible protein regions are suitable B cell epitopes for epitope vaccine design, we identified inner antigenic regions in the gp350 B cell epitopes SKAPESTTTSPTLNTTGFA, YVFYSGNGPKASG GDYCIQS, and QNPVYLIPETVPYIKWDN (APESTTTSP TLNTTGFA, GNGPKASGGD, and ETVPYIKWDN, resp.), encompassing only residues with a high degree of flexibility ([greater than or equal to] 1.0) and solvent accessibility greater than 50% (Table 3). Visual inspection of the gp350 B cell epitopes in the 3D structure clearly showed that the selected core fragments were located in highly flexibly and accessible regions of the structure while some parts of the remaining epitope were buried or semiburied (Figure 1). Therefore, we regarded the antigenic core regions (APESTTTSPTLNTTGFA, GNGPKA SGGD, and ETVPYIKWDN) identified in the gp350 B cell epitopes as the experimental B cell component of the EBV epitope vaccine ensemble.

As all experimental B cell epitopes suitable for epitope vaccine design were in gp350, we sought to identify potential B cell epitopes from the 3D structures of EBV envelope proteins gp42 (PDB: 3FD4), gB (PDB: 3FVC) and the heterodimer conformed by gH and gL (PDB: 5T1D). These proteins have been described to participate in the viral attachment and/or fusion to the host cell membrane required for viral entry [49, 51, 52]. We considered as potential B cell epitopes, antigen fragments in the relevant 3D structures consisting of 9 or more consecutive residues with flexibility [greater than or equal to] 1.0 and an average accessibility > 50% (details in Materials and Methods). As a result, we identified a potential B cell epitope in gp42 protein (KLPHWTPTLH), two at the gB protein (NTTVGIELPDA and SSHGDLFRFSSDIQCP), and one in the gL monomer (FSVEDLFGAN) (Table 4). No epitopes fulfilling the required criteria were identified at the gH protein. These predicted B cell epitopes were mapped to their corresponding 3D structures to confirm that they were in readily accessible regions for antibody recognition (Figure 2). KLPHWTPTLH mapped at the N-terminal region of gp42, which is involved in gH interaction and sits opposite to the HLA-DR binding site of the molecule (colored in red in Figure 2(a)). The gB epitopes mapped onto two distinct regions, domains II and III, that are likely relevant for interaction with other glycoproteins involved in viral entry [49] (Figures 2(a) and 2(b)). The single gL epitope mapped in a region in close proximity to gH and the binding site of a monoclonal antibody (mAb) EID1 that interferes with EBV infection of epithelial cells [52] (Figure 2(d)). We also verified that none of the predicted B cell epitopes were identical to human proteins or human microbiome proteins (Table 4).

4. Discussion

Over 90% of human adults are infected with EBV. Most infections occur in childhood and are asymptomatic or course with nonspecific symptoms. Nonetheless, EBV is the primary cause of IM when infection occurs in early adulthood. Furthermore, the viral infection is associated with autoimmunity and a number of lymphocyte and epithelial cell malignancies [18, 19]. Despite its wide impact, there is no treatment available, hence the growing interest in finding a prophylactic and/or therapeutic EBV vaccine.

The target population for an EBV prophylactic vaccine in the developed world would be 10- or 11-year-old children, before they are susceptible to most severe IM symptomatologies. It is acknowledged that by precluding the initial viral infection, the risks of developing EBV-associated autoimmune and cancer disorders would also be reduced [53]. In sub-Saharan Africa and southern China, where Burkitt's lymphoma and nasopharyngeal carcinoma are major public health problems and children are infected by EBV earlier in life, the vaccine target would be much younger infants. EBV-naive transplant recipients susceptible to suffer posttransplant lymphoproliferative disorders (PTLD) would also benefit from a prophylactic vaccine [54].

Currently, the most advanced EBV vaccine clinically tested consists of a gp350 subunit that was administered with AS04 adjuvant to virus-naive young adults [55]. The gp350 subunit vaccination strategy follows the approach successfully used in other viral infections, that is, induction of neutralizing antibodies (nAbs) against the most abundant glycoprotein on the virus, which also represents the main target of naturally occurring nAbs [16]. In this regards, a microneutralization assay based on an EBV expressing green-fluorescent protein has been very recently developed to provide measurement of humoral EBV vaccine responses in large clinical trials [56]. Another EBV vaccine trial was designed to control the expansion of EBV-infected B cells, based on the generation of CD8 T cell immunity to EBNAs [57]. Specifically, the vaccine consisted of a single EBNA3A epitope restricted by HLA-B08 administered as a peptide along with tetanus toxoid as adjuvant [57].

A major outcome of the Sokal et al. [55] clinical trial was that immunization with gp350 did not protect from new viral infections [55]. Therefore, it has been suggested that a prophylactic vaccine against EBV should elicit B cell responses also against all 5 major viral envelope proteins involved in host-cell attachment and entry, including gp42, gH, gL, BMRF2 (gp350), and gB [58]. Among these, at least the first four are known to elicit neutralizing antibodies [59]. The induction of cytotoxic T cell responses against early viral antigens has been as well suggested in order to destroy recently infected B cells [14, 53, 59]. Attaching to these premises, we used a computer-assisted strategy to design a prophylactic epitope vaccine ensemble against EBV infection.

The strategy that we followed to design the EBV vaccine relied on combining legacy experimentation consisting of experimentally defined epitopes with immunoinformatics predictions. This strategy was first conceived to assemble CD8 T cell epitope vaccines [21, 39] and latter extended to include CD4 T cell epitope vaccines [22]. The main advantage of this approach is that of saving time and resources as it mainly relies on experimentally validated epitopes, not on predicted epitopes, using immunoinformatics to identify those that are more suitable for epitope vaccine design. It is worth noting that epitope prediction is not a precise science and epitope prediction methods only facilitate epitope discovery by providing candidates that need to be validated experimentally. Therefore, our strategy ought to gain widening acceptance as a vaccine design tool whenever ample experimental epitope data is readily available. Key criteria for epitope inclusion/selection are conservation and binding to multiple MHC molecules for maximum population protection coverage. Here, we also added that the source of CD8 T cell epitopes had to be from early EBV antigens with defined function in the primary infection process. Moreover, we checked that peptides were nonself and did not have exact matches with human proteins or human microbiome proteins and extended the approach to B cell epitopes. To that end, we devised a system to select from experimentally defined B cell epitopes those that were conserved, nonself and located on the ectodomains of viral envelope antigens and consisted of highly flexible and solvent-accessible residues (Figure 3). Note that we are not discriminating B cell epitopes from non-B cell epitopes in primary sequences. In fact, solvent accessibility or flexibility alone cannot discriminate B cell epitopes from non-B cell epitopes in primary sequences [60]. Instead, we are selecting known B cell mapping in the antigen surface that isolated from the antigen context can elicit antibodies cross-reacting with the native antigens and hence are worth for epitope vaccine design [61].

The composition of the epitope vaccine ensemble designed in this study includes 14 CD8 T cell epitopes, 4 CD4 T cell epitopes, and 7 B cell epitopes (Table 5). None of these epitopes matched exactly to human proteins or human microbiome proteins. This result is somewhat predictable for we focused mostly on epitopes that have been verified experimentally and it should be expected that the immune system selected nonself targets for recognition. Nonetheless, a few of the selected epitopes have a high identity with human microbiome proteins (around 88.9%, Table 5). Whether this high identity to human microbiome proteins could be a source of trouble is arguable: detection of epitope identity to self-proteins required using BLAST with expectation values of 10000, epitope matches may not be available for recognition, and epitope recognition can be disrupted by single amino acid changes.

According to some authors, the ideal EBV CD8 T cell epitope component should include antigens EBNA2, EBNA-LP, and BHRF1, which are abundant at the very initial stage of B cell infection [14]. Our epitope vaccine ensemble does not include CD8 T cell epitopes from these three antigens. However, it includes CD8 T cell epitopes from other EBV early antigens, such as EBNA1, EBNA3, EBNA6, BMRF1, BRLF1, and BZLF1 (Tables 1 and 5). Although a 95% PPC was reached with just 5 CD8 T cell epitopes, the key importance of a broad multiantigenic cytotoxic response prompted us to incorporate 14 CD8 T cell epitopes. For the CD4 T cell component, our proposed vaccine ensemble includes 4 epitopes reaching the maximum PPC possible of 81.8% provided by the reference set of HLA II molecules targeted for binding predictions [37]. The PPC of the CD4 T cell component is likely an underestimation. HLA II molecules are very promiscuous [62] and the selected epitopes will surely bind and be presented by other HLA II molecules not included in the selected reference set [37].

For the B cell epitope vaccine component, we included 7 B cell epitopes consisting of 3 experimental B cell epitopes from gp350 plus 4 other predicted B cell epitopes from EBV envelope proteins gp42, gB, and gL, all of them continuous and with high flexibility and solvent accessibility. We focused on linear B cell epitopes because they can be delivered isolated from their antigen context to induce selective humoral responses. We sought to predict B cell epitopes on gp42, gB, and gL that can be used to elicit antibodies that are cross-reactive with the native antigens. To that end, we needed to identify solvent-exposed B cell epitopes in the mentioned antigens and we could have used a number of methods to predict conformational B cell epitopes from the available 3D structures (reviewed in [63]). However, conformational B cell epitopes can not be isolated from their protein context and used as immunogens. Therefore, we turned our attention to linear B cell epitopes as they can be delivered isolated from the antigen and induce selective humoral responses. There are also a number of methods to predict linear B cell epitopes from primary sequences (reviewed in [60, 64]), but the predicted epitopes seldom match in solvent-accessible regions and are notoriously unreliable [60, 65, 66]. Hence, in this study, we assumed that highly flexible and solvent-accessible fragments in protein surfaces are potential linear B cell epitopes [50] and devised a system to identify them from the relevant 3D structures (details in Materials and Methods). Specifically, predicted B cell epitopes consisted of conserved fragments with at least 9 consecutive residues with flexibility (normalized B factor) > 1 and an average relative solvent-exposed accessibility [greater than or equal to] 50%.

Analysis of the structural mapping of the selected B cell epitopes onto the relevant 3D structure can reveal their importance for epitope vaccine design. The gp42 B cell epitope (KLPHWTPTLH) is located in the N-terminal portion of the protein far and opposite from the HLA-DR contact region (Figure 2(a)). Therefore, antibodies against this gp42 B cell epitope will unlikely block the gp42 interaction with HLA-DR required for viral entry into B cells. The gp42 Nterminal region, where KLPHWTPTLH maps, interact with gH at a site in close proximity to the [beta]1-integrin-binding motif "KGD" [52]. Both gp42 and peptides from the Nterminal region of gp42 that binds to gH interfere with [beta]1-integrin interaction and viral entry in epithelial cells [52]. In this context, the role of antibodies against this gp42 epitope with regard to viral entry in epithelial cells is unclear. Binding of antibodies to the epitope when gp42 is in complex with gH could prevent epithelial infection by EBV. However, such prevention is unlikely if antibodies against the epitope block the interaction between gp42 and gH. Despite poor neutralizing qualities of the gp42 B cell epitope KLPHWTPTLH, antibodies against it could still contribute to viral clearance by promoting complement activation and phagocytosis. The two predicted B cell epitopes in gB, NTTVGIELPDA, and SSHGDLFRFSSDIQCP, mapped onto two distinct protein domains (Figures 2(b) and 2(c)) that are thought to be relevant in the mechanism of EBV fusion to host membranes [49]. Hence, antibodies binding at this region could interfere in the vital fusion step required for viral entry. The B cell epitope predicted in gL, FSVEDLFGAN, mapped onto a region intertwined with gH and is in close proximity to the binding site of mAb E1D1 [52]. This antibody has been described to inhibit gH fusion to epithelial cells despite locating far from the gH integrin binding site (KGD). Whether an antibody against gL-protruding epitope FSVEDLFGAN might also exert a similar distant effect is unknown but remains a possibility.

Flexibility and accessibility were also key criteria to select and refine experimental B cell epitopes, leading to the selection of the gp350 B cell epitopes ETVPYIKWDN, GNGPKASGGD, and APESTTTSPTLNTTGFA (Table 3 and Figure 1). Two of these B cell epitopes, ETVPYIKWDN and GNGPKASGGD, mapped onto the glycan-free region of gp350 described to interact with the CR2 receptor [48]. Furthermore, residues E155, I160, and W162 from ETVPYIKWDN and D296 from GNGPKASGGD have been shown to contact the CR2 receptor (Figure 4) [67]. Noteworthy, the well-characterized EBV nAb 72A1 binds to gp350 in this glycan-free region [67]. Therefore, B cell epitopes ETVPYIKWDN and GNGPKASGGD have a great potential to induce neutralizing antibodies. In fact, GNGPKASGGD and ETVPYIKWDN are within peptide fragments that have been shown already to elicit antibodies that block binding of mAb 72A1 to gp350 [68]. Lastly, epitope APESTTTSPTLNTTGFA mapped onto the Cterminal end of the solved structure of gp350 (Figure 1). Mutagenesis of its E425 and S426 residues did not inhibit binding of gp350 to mAb 72A1 [48]. Although initially far from the receptor interaction region and containing a glycosylated asparagine residue (N435), it cannot be discarded that an antibody targeting it could help to control viral infection, for example through antibody-mediated complement activation and phagocytosis. Overall, these results validate the conservancy, flexibility, and accessibility criteria followed for the selection and prediction of B cell epitopes.

We trust that the application of the knowledge-based approach depicted in this work to design an epitope vaccine ensemble against EBV can save time and effort developing such a vaccine, as most of the components consist on experimentally defined EBV-specific epitopes. However, our epitope-based vaccine ensemble is theoretical, and extra validations will be required prior to formulating a vaccine that can actually be tested. For example, T cell epitopes used in our vaccine have been shown to be immunogenic in the context of experimentally defined HLA restriction elements (see Tables 1 and 2). However, we predicted that these epitopes will be also immunogenic in the context of different HLAs. To test that, T cells from subjects expressing the relevant HLA molecules can be expanded using dendritic cells loaded with the corresponding epitope peptides and cloned. Subsequently, T cell clone immunoreactivity can be checked through a number of assays (ELISPOT, intracellular cytokine staining, etc) using B-LCL 721.221 cells expressing single HLA molecules as described elsewhere [21, 69]. Selected B cell epitopes should also be subjected to extra validations, in particular to test whether they elicit antibodies cross-reacting with native antigens. To that end, sera from immunized mice with B cell epitope peptides could be used to check whether they recognize native antigens in ELISA assays and/or interfere with EBV infection of epithelial and B cells as described elsewhere [68]. Once the individual components of the epitope vaccine ensemble had passed experimental validation, it will still remain to elucidate how to formulate such a vaccine for delivering the epitopes.

There are several choices to formulate epitope vaccines ranging from peptide-based formulations to genetic formulations. Regardless of the choice, CD4 T cell epitopes need to be physically linked with the other selected epitopes, particularly B cell epitopes, to elicit productive Th cells [70]. A peptide-based vaccine has already been tested for the delivery of an EBV CD8 T cell epitope fused with tetanus toxoid to increase immunogenicity and elicit Th responses [57]. Similarly, a polymeric epitope concatemer in the form of a "string-of-beads" could be chemically synthesized or formulated as a genetic construct [71]. In either cases, the order of the epitopes and the presence of cleavage sites between them are crucial features to address [71]. Concatenating epitopes can result in toxic products and tools to predict toxicity can also be used to optimize epitope concatemers [72]. Toxicity of epitope vaccine formulations should nevertheless be checked in cellular assays prior to carrying out any immunization studies. In general, poor immunogenicity is an important issue with peptide-based formulations [22]. A recent development in vaccine formulation that increases the immunogenicity of the epitope-peptide components consists in the use of nanoparticles of diverse nature [73]. For example, Kuai et al. [74] used high-density lipoprotein-mimicking nanodiscs coupled with peptides to stimulate potent tumor-specific CD8 T cell responses that inhibited tumor growth in a murine model of colon carcinoma. Nanoparticles have also been used to deliver genetic constructs, particularly RNA constructs. RNA-based vaccine formulations offer lower safety concerns and enhanced immunogenicity with regard to those based on DNA, and inherent RNA instability can be overcame using nanoparticles for delivery [75].

Ideally, the B cell response should only be focused on B cell epitopes. To that end, a solution would be formulating the epitope vaccine as liposomal or virosome-like particles, where the selected T cell epitopes, either alone or concatenated, ought to be placed encapsulated inside the particle and the B cell epitopes displayed linked in the outer part of the particle [76, 77]. These liposomal vaccine formulations are also more immunogenic than those consisting of genetic or synthetic peptide-based constructs [76, 77]. Moreover, immunogenicity can be further enhanced by the inclusion of appropriated adjuvants [78].

Epitope vaccine formulations, as any vaccine candidate, should be evaluated in preclinical animal models prior to clinical testing in humans. However, in the case of EBV, this stands as a major drawback as there is a lack of appropriate animal models that recapitulate EBV infection and its immune control [79]. Thus, EBV vaccine immunogenicity and protection capabilities have to be assessed in clinical studies. Although this is very informative and may accelerate the developmental process, it also carries high associated costs early in the discovery path and involves enrollment of participants, which is not at the reach of many research groups. The clinical status of the target population to test EBV prophylactic vaccine candidates should also be considered. For instance, the phase II study by Sokal et al. [55], the most advanced of any EBV vaccine tested so far [54], involved a total of 181 EBV-seronegative, healthy, young volunteers between 16 and 25 years of age that were randomized in a double-blind fashion to receive either placebo or a recombinant EBV subunit glycoprotein 350.

5. Conclusions and Limitations

EBV infection is associated with a number of human diseases, including cancer and autoimmunity. Currently, it is unclear why some individuals with apparently proper responses to EBV develop associated diseases while others do not, but surely genetic and environmental factors, including life style and past pathogen encounters, play a role [80-82]. In any case, a prophylactic EBV vaccine will be beneficial in preventing EBV-associated diseases [53, 59]. We herein provide an epitope ensemble that would serve to develop an epitope-based prophylactic vaccine against EBV infection, eliciting both adaptive cellular and humoral immunity. The T cell component consists of highly conserved experimental EBV-specific epitopes capable of eliciting cellular responses in virtually the whole population. The B cell component consists of conserved experimental and predicted B cell epitopes from EBV envelope proteins gp350, gp42, gB, and gL. These epitopes were selected from the relevant 3D structures applying a novel structure-based reverse vaccinology approach that includes calculation of flexibility and solvent accessibility values. As a result, we identified B cell epitopes that could elicit antibodies interfering with EBV entry in epithelial and B cells. Whether our epitope vaccine ensemble has also any therapeutic value is arguable but, clearly, it is harder to combat EBV once it has established a latent infection.

This study has limitations that may handicap its translation into an EBV vaccine. Appropriate antigen processing is a key limiting factor in the immunogenicity of T cell epitopes [83]. Therefore, we selected experimental T cell epitopes that were shown to be processed and presented in the course of a natural infection with EBV and assumed that T cell epitope immunogenicity will be then only determined by their binding to MHC molecules. This assumption has not been thoroughly tested and it is very sensitive to possible errors in the databases where we collected the data. In the same line, population coverage estimates for the T cell component need to be tested as they are inferred from peptide binding predictions to MHC molecules. Nonetheless, the reliability of peptide-MHC binding predictions has been widely proved [84]. With regard to the B cell component, we deliberately failed to include conformational epitopes as they cannot be isolated from their context and solely focused on linear B cell epitopes. Whether these B cell epitopes are able to elicit antibodies recognizing the native protein conformations needs to be tested.

MHC:   Major histocompatibility complex
HLA:   Human leukocyte antigens
gp:    Glycoprotein
nAb:   Neutralizing antibody.

Conflicts of Interest

Julio Alonso-Padilla is a postdoctoral researcher at ISGlobal supported by the Juan de la Cierva Program (MINECO, Spain) and a visiting scientist at the Laboratory of Immunomedicine, Faculty of Medicine, UCM, led by Pedro A. Reche. ISGlobal is a member of the CERCA Programme, Generalitat de Catalunya. The authors declare that they have no conflict of interests.


The authors wish to thank Inmunotek S.L. and the Spanish Department of Science at MINECO for supporting the research of the Immunomedicine Group through Grants SAF2006:07879, SAF2009:08301, and BIO2014:54164-R to Pedro A. Reche. Julio Alonso-Padilla acknowledges the support provided by Joaquim Gascon, director of the ISGlobal Chagas Disease Program.


[1] Z. Lin, X. Wang, M. J. Strong et al., "Whole-genome sequencing of the Akata and Mutu Epstein-Barr virus strains," Journal of Virology, vol. 87, no. 2, pp. 1172-1182, 2013.

[2] K. Sathiyamoorthy, J. Jiang, Y. X. Hu et al., "Assembly and architecture of the EBV B cell entry triggering complex," PLoS Pathogens, vol. 10, no. 8, article e1004309, 2014.

[3] M. Neves, J. Marinho-Dias, J. Ribeiro, and H. Sousa, "EpsteinBarr virus strains and variations: geographic or disease-specific variants?," Journal of Medical Virology, vol. 89, no. 3, pp. 373-387, 2017.

[4] L. S. Young and A. B. Rickinson, "Epstein-Barr virus: 40 years on," Nature Reviews Cancer, vol. 4, no. 10, pp. 757-768, 2004.

[5] E. K. Vetsika and M. Callan, "Infectious mononucleosis and Epstein-Barr virus," Expert Reviews in Molecular Medicine, vol. 6, no. 23, pp. 1-16, 2004.

[6] G. C. Faulkner, S. R. Burrows, R. Khanna, D. J. Moss, A. G. Bird, and D. H. Crawford, "X-linked agammaglobulinemia patients are not infected with Epstein-Barr virus: implications for the biology of the virus," Journal of Virology, vol. 73, no. 2, pp. 1555-1564, 1999.

[7] C. D. Shannon-Lowe, B. Neuhierl, G. Baldwin, A. B. Rickinson, and H. J. Delecluse, "Resting B cells as a transfer vehicle for Epstein-Barr virus infection of epithelial cells," Proceedings of the National Academy of Sciences of the United States of America, vol. 103, no. 18, pp. 7065-7070, 2006.

[8] B. Kempkes and E. S. Robertson, "Epstein-Barr virus latency: current and future perspectives," Current Opinion in Virology, vol. 14, pp. 138-144, 2015.

[9] M. M. Mullen, K. M. Haan, R. Longnecker, and T. S. Jardetzky, "Structure of the Epstein-Barr virus gp42 protein bound to the MHC class II receptor HLA-DR1," Molecular Cell, vol. 9, no. 2, pp. 375-385, 2002.

[10] G. S. Taylor, H. M. Long, J. M. Brooks, A. B. Rickinson, and A. D. Hislop, "The immunology of Epstein-Barr virus-induced disease," Annual Review of Immunology, vol. 33, pp. 787-821, 2015.

[11] D. A. Thorley-Lawson, "Epstein-Barr virus: exploiting the immune system," Nature Reviews Immunology, vol. 1, no. 1, pp. 75-82, 2001.

[12] T. Strowig, F. Brilot, F. Arrey et al., "Tonsilar NK cells restrict B cell transformation by the Epstein-Barr virus via IFN-gamma," PLoS Pathogens, vol. 4, no. 2, article e27, 2008.

[13] A. D. Hislop, G. S. Taylor, D. Sauce, and A. B. Rickinson, "Cellular responses to viral infection in humans: lessons from Epstein-Barr virus," Annual Review of Immunology, vol. 25, pp. 587-617, 2007.

[14] J. M. Brooks, H. M. Long, R. J. Tierney et al., "Early T cell recognition of B cells following Epstein-Barr virus infection: identifying potential targets for prophylactic vaccination," PLoS Pathogens, vol. 12, no. 4, article e1005549, 2016.

[15] E. Amyes, C. Hatton, D. Montamat-Sicotte et al., "Characterization of the CD4+ T cell response to Epstein-Barr virus during primary and persistent infection," The Journal of Experimental Medicine, vol. 198, no. 6, pp. 903-911, 2003.

[16] W. Bu, G. M. Hayes, H. Liu et al., "Kinetics of Epstein-Barr virus (EBV) neutralizing and virus-specific antibodies after primary infection with EBV," Clinical and Vaccine Immunology, vol. 23, no. 4, pp. 363-369, 2016.

[17] M. De Paschale and P. Clerici, "Serological diagnosis of Epstein-Barr virus infection: problems and solutions," World Journal of Virology, vol. 1, no. 1, pp. 31-43, 2012.

[18] M. P. Thompson and R. Kurzrock, "Epstein-Barr virus and cancer," Clinical Cancer Research, vol. 10, no. 3, pp. 803-821, 2004.

[19] A. Ascherio and K. L. Munger, "EBV and autoimmunity," Current Topics in Microbiology and Immunology, vol. 390, Part 1, pp. 365-385, 2015.

[20] J. I. Cohen, "Epstein-Barr virus vaccines," Clinical & Translational Immunology, vol. 4, no. 1, article e32, 2015.

[21] P. A. Reche, D. B. Keskin, R. E. Hussey, P. Ancuta, D. Gabuzda, and E. L. Reinherz, "Elicitation from virus-naive individuals of cytotoxic T lymphocytes directed against conserved HIV-1 epitopes," Medical Immunology, vol. 5, p. 1, 2006.

[22] Q. M. Sheikh, D. Gatherer, P. A. Reche, and D. R. Flower, "Towards the knowledge-based design of universal influenza epitope ensemble vaccines," Bioinformatics, vol. 32, no. 21, pp. 3233-3239, 2016.

[23] M. Molero-Abraham, J. P. Glutting, D. R. Flower, E. M. Lafuente, and P. A. Reche, "EPIPOX: immunoinformatic characterization of the shared T-cell epitome between variola virus and related pathogenic Orthopoxviruses," Journal of Immunology Research, vol. 2015, Article ID 738020, 11 pages, 2015.

[24] C. M. Diez-Rivero and P. A. Reche, "CD8 T cell epitope distribution in viruses reveals patterns of protein biosynthesis," PLoS One, vol. 7, no. 8, article e43674, 2012.

[25] P. A. Reche, H. Zhang, J. P. Glutting, and E. L. Reinherz, "EPIMHC: a curated database of MHC-binding peptides for customized computational vaccinology," Bioinformatics, vol. 21, no. 9, pp. 2140-2141, 2005.

[26] Q. Zhang, P. Wang, Y. Kim et al., "Immune epitope database analysis resource (IEDB-AR)," Nucleic Acids Research, vol. 36, Web Server issue, pp. W513-W518, 2008.

[27] W. Li and A. Godzik, "Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences," Bioinformatics, vol. 22, no. 13, pp. 1658-1659, 2006.

[28] S. Federhen, "Type material in the NCBI taxonomy database," Nucleic Acids Research, vol. 43, Database issue, pp. D1086-D1098, 2015.

[29] R. C. Edgar, "MUSCLE: multiple sequence alignment with high accuracy and high throughput," Nucleic Acids Research, vol. 32, no. 5, pp. 1792-1797, 2004.

[30] C. E. Shannon, "The mathematical theory of communication," The Bell System Technical Journal, vol. 27, pp. 379-423, 1948, 623-656.

[31] M. Garcia-Boronat, C. M. Diez-Rivero, E. L. Reinherz, and P. A. Reche, "PVS: a web server for protein sequence variability analysis tuned to facilitate conserved epitope discovery," Nucleic Acids Research, vol. 36, Web Server issue, pp. W35-W41, 2008.

[32] P. A. Reche and E. L. Reinherz, "Sequence variability analysis of human class I and class II MHC molecules: functional and structural correlates of amino acid polymorphisms," Journal of Molecular Biology, vol. 331, no. 3, pp. 623-641, 2003.

[33] J. J. Stewart, C. Y. Lee, S. Ibrahim et al., "A Shannon entropy analysis of immunoglobulin and T cell receptor," Molecular Immunology, vol. 34, pp. 1067-1082, 1997.

[34] P. A. Reche, J.-P. Glutting, and E. L. Reinherz, "Enhancement to the RANKPEP resource for the prediction of peptide binding to MHC molecules using profiles," Immunogenetics, vol. 56, pp. 405-419, 2004.

[35] P. A. Reche, J. P. Glutting, and E. L. Reinherz, "Prediction of MHC class I binding peptides using profile motifs," Human Immunology, vol. 63, no. 9, pp. 701-709, 2002.

[36] P. A. Reche and E. L. Reinherz, "Prediction of peptide-MHC binding using profiles," Methods in Molecular Biology, vol. 409, pp. 185-200, 2007.

[37] J. Greenbaum, J. Sidney, J. Chung, C. Brander, B. Peters, and A. Sette, "Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes," Immunogenetics, vol. 63, no. 6, pp. 325-335, 2011.

[38] P. Wang, J. Sidney, Y. Kim et al., "Peptide binding predictions for HLA DR, DP and DQ molecules," BMC Bioinformatics, vol. 11, p. 568, 2010.

[39] M. Molero-Abraham, E. M. Lafuente, D. R. Flower, and P. A. Reche, "Selection of conserved epitopes from hepatitis C virus for pan-populational stimulation of T-cell responses," Clinical & Developmental Immunology, vol. 2013, Article ID 601943, 10 pages, 2013.

[40] K. Cao, J. Hollenbach, X. Shi, W. Shi, M. Chopek, and M. A. Fernandez-Vina, "Analysis of the frequencies of HLA-A, B, and C alleles and haplotypes in the five major ethnic groups of the United States reveals high levels of diversity in these loci and contrasting distribution patterns in these populations," Human Immunology, vol. 62, no. 9, pp. 1009-1030, 2001.

[41] H. H. Bui, J. Sidney, K. Dinh, S. Southwood, M. J. Newman, and A. Sette, "Predicting population coverage of T-cell epitope-based diagnostics and vaccines," BMC Bioinformatics, vol. 7, p. 153, 2006.

[42] S. J. Hubbard and J. M. Thornton, NACCESS, Computer Program, Department of Biochemistry and Molecular Biology, University College London, London, England, UK, 1993.

[43] M. Magrane, "UniProt Knowledgebase: a hub of integrated protein data," Database: The Journal of Biological Databases and Curation, vol. 2011, article bar009, 2011.

[44] S. F. Altschul, T. L. Madden, A. A. Schaffer et al., "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs," Nucleic Acids Research, vol. 25, no. 17, pp. 3389-3402, 1997.

[45] J. Peterson, S. Garges, M. Giovanni et al., "The NIH Human Microbiome Project," Genome Research, vol. 19, no. 12, pp. 2317-2323, 2009.

[46] L. H. Glimcher and C. J. Kara, "Sequences and factors: a guide to MHC class-II transcription," Annual Review of Immunology, vol. 10, pp. 13-49, 1992.

[47] L. J. Stern and J. M. Calvo-Calle, "HLA-DR: molecular insights and vaccine design," Current Pharmaceutical Design, vol. 15, no. 28, pp. 3249-3261, 2009.

[48] G. Szakonyi, M. G. Klein, J. P. Hannan et al., "Structure of the Epstein-Barr virus major envelope glycoprotein," Nature Structural & Molecular Biology, vol. 13, no. 11, pp. 996-1001, 2006.

[49] M. Backovic, R. Longnecker, and T. S. Jardetzky, "Structure of a trimeric variant of the Epstein-Barr virus glycoprotein B," Proceedings of the National Academy of Sciences of the United States of America, vol. 106, no. 8, pp. 2880-2885, 2009.

[50] E. Westhof, D. Altschuh, D. Moras et al., "Correlation between segmental mobility and the location of antigenic determinants in proteins," Nature, vol. 311, no. 5982, pp. 123-126, 1984.

[51] A. N. Kirschner, J. Sorem, R. Longnecker, and T. S. Jardetzky, "Structure of Epstein-Barr virus glycoprotein 42 suggests a mechanism for triggering receptor-activated virus entry," Structure, vol. 17, no. 2, pp. 223-233, 2009.

[52] K. Sathiyamoorthy, Y. X. Hu, B. S. Mohl, J. Chen, R. Longnecker, and T. S. Jardetzky, "Structural basis for Epstein-Barr virus host cell tropism mediated by gp42 and gHgL entry glycoproteins," Nature Communications, vol. 7, article 13557, 2016.

[53] C. Smith and R. Khanna, "The development of prophylactic and therapeutic EBV vaccines," Current Topics in Microbiology and Immunology, vol. 391, pp. 455-473, 2015.

[54] H. H. Balfour Jr., "Progress, prospects, and problems in Epstein-Barr virus vaccine development," Current Opinion in Virology, vol. 6, pp. 1-5, 2014.

[55] E. M. Sokal, K. Hoppenbrouwers, C. Vandermeulen et al., "Recombinant gp350 vaccine for infectious mononucleosis: a phase 2, randomized, double-blind, placebo-controlled trial to evaluate the safety, immunogenicity, and efficacy of an Epstein-Barr virus vaccine in healthy young adults," The Journal of Infectious Diseases, vol. 196, no. 12, pp. 1749-1753, 2007.

[56] R. Lin, D. Heeke, H. Liu et al., "Development of a robust, higher throughput green fluorescent protein (GFP)-based Epstein-Barr virus (EBV) micro-neutralization assay," Journal of Virological Methods, vol. 247, pp. 15-21, 2017.

[57] S. L. Elliott, A. Suhrbier, J. J. Miles et al., "Phase I trial of a CD8+ T-cell peptide epitope-based vaccine for infectious

mononucleosis," Journal of Virology, vol. 82, no. 3, pp. 14481457, 2008.

[58] L. M. Hutt-Fletcher, "EBV glycoproteins: where are we now?," Future Virology, vol. 10, no. 10, pp. 1155-1162, 2015.

[59] V. Dasari, K. H. Bhatt, C. Smith, and R. Khanna, "Designing an effective vaccine to prevent Epstein-Barr virus-associated diseases: challenges and opportunities," Expert Review of Vaccines, vol. 16, no. 4, pp. 377-390, 2017.

[60] J. Ponomarenko and M. Van Regenmortel, "B-cell epitope prediction," in Structural Bioinformatics, pp. 849-879, John Wiley & Sons, Hoboken, NJ, USA, 2009.

[61] M. H. Van Regenmortel, "What is a B-cell epitope?," Methods in Molecular Biology, vol. 524, pp. 3-20, 2009.

[62] P. A. Reche and E. L. Reinherz, "Definition of MHC supertypes through clustering of MHC peptide-binding repertoires," Methods in Molecular Biology, vol. 409, pp. 163-173, 2007.

[63] P. Sun, H. Ju, Z. Liu et al., "Bioinformatics resources and tools for conformational B-cell epitope prediction," Computational and Mathematical Methods in Medicine, vol. 2013, Article ID 943636, 11 pages, 2013.

[64] L. Potocnakova, M. Bhide, and L. B. Pulzova, "An introduction to B-cell epitope mapping and in silico epitope prediction," Journal of Immunology Research, vol. 2016, Article ID 6760830,11 pages, 2016.

[65] M. J. Blythe and D. R. Flower, "Benchmarking B cell epitope prediction: underperformance of existing methods," Protein Science, vol. 14, no. 1, pp. 246-248, 2005.

[66] J. Gao and L. Kurgan, "Computational prediction of B cell epitopes from antigen sequences," Methods in Molecular Biology, vol. 1184, pp. 197-215, 2014.

[67] K. A. Young, A. P. Herbert, P. N. Barlow, V. M. Holers, and J. P. Hannan, "Molecular basis of the interaction between complement receptor type 2 (CR2/CD21) and Epstein-Barr virus glycoprotein gp350," Journal of Virology, vol. 82, no. 22, pp. 11217-11227, 2008.

[68] J. E. Tanner, M. Coincon, V. Leblond et al., "Peptides designed to spatially depict the Epstein-Barr virus major virion glycoprotein gp350 neutralization epitope elicit antibodies that block virus-neutralizing antibody 72A1 interaction with the native gp350 molecule," Journal of Virology, vol. 89, no. 9, pp. 4932-4941, 2015.

[69] V. Litwin, J. Gumperz, P. Parham, J. H. Phillips, and L. L. Lanier, "Specificity of HLA class I antigen recognition by human NK clones: evidence for clonal heterogeneity, protection by self and non-self alleles, and influence of the target cell type," The Journal of Experimental Medicine, vol. 178, no. 4, pp. 1321-1336, 1993.

[70] R. Arnon and T. Ben-Yedidia, "Old and new vaccine approaches," International Immunopharmacology, vol. 3, no. 8, pp. 1195-1204, 2003.

[71] J. L. Whitton, N. Sheng, M. B. Oldstone, and T. A. McKee, "A "string-of-beads" vaccine, comprising linked minigenes, confers protection from lethal-dose virus challenge," Journal of Virology, vol. 67, no. 1, pp. 348-352, 1993.

[72] S. Gupta, P. Kapoor, K. Chaudhary, A. Gautam, R. Kumar, and G. P. Raghava, "In silico approach for predicting toxicity of peptides and proteins," PLoS One, vol. 8, no. 9, article e73957, 2013.

[73] E. M. Varypataki, N. Benne, J. Bouwstra, W. Jiskoot, and F. Ossendorp, "Efficient eradication of established tumors in mice with cationic liposome-based synthetic long-peptide vaccines," Cancer Immunology Research, vol. 5, no. 3, pp. 222-233, 2017.

[74] R. Kuai, L. J. Ochyl, K. S. Bahjat, A. Schwendeman, and J. J. Moon, "Designer vaccine nanodiscs for personalized cancer immunotherapy," Nature Materials, vol. 16, no. 4, pp. 489-496, 2017.

[75] M. Brazzoli, D. Magini, A. Bonci et al., "Induction of broad-based immunity and protective efficacy by self-amplifying mRNA vaccines encoding influenza virus hemagglutinin," Journal of Virology, vol. 90, no. 1, pp. 332-344, 2015.

[76] R. A. Schwendener, "Liposomes as vaccine delivery systems: a review of the recent advances," Therapeutic Advances in Vaccines, vol. 2, no. 6, pp. 159-182, 2014.

[77] D. Felnerova, J. F. Viret, R. Gluck, and C. Moser, "Liposomes and virosomes as delivery systems for antigens, nucleic acids and drugs," Current Opinion in Biotechnology, vol. 15, no. 6, pp. 518-529, 2004.

[78] Y. Perrie, D. Kirby, V. W. Bramwell, and A. R. Mohammed, "Recent developments in particulate-based vaccines," Recent Patents on Drug Delivery & Formulation, vol. 1, no. 2, pp. 117-129, 2007.

[79] C. Gujer, B. Chatterjee, V. Landtwing, A. Raykova, D. McHugh, and C. Munz, "Animal models of Epstein Barr virus infection," Current Opinion in Virology, vol. 13, pp. 6-10, 2015.

[80] C. Calcagno, R. Puzone, Y. E. Pearson et al., "Computer simulations of heterologous immunity: highlights of an interdisciplinary cooperation," Autoimmunity, vol. 44, no. 4, pp. 304-314, 2011.

[81] Proceedings of the IARC working group on the evaluation of carcinogenic risks to humans. Epstein-Barr virus and Kaposi's sarcoma herpesvirus/human herpesvirus 8. Lyon, France, 17-24 June 1997," IARC Monographs on the Evaluation of Carcinogenic Risks to Humans, vol. 70, pp. 1-492, 1997.

[82] A. W. Lee, W. Foo, O. Mang et al., "Changing epidemiology of nasopharyngeal carcinoma in Hong Kong over a 20-year period (1980-99): an encouraging reduction in both incidence and mortality," International Journal of Cancer, vol. 103, no. 5, pp. 680-685, 2003.

[83] W. Zhong, P. A. Reche, C. C. Lai, B. Reinhold, and E. L. Reinherz, "Genome-wide characterization of a viral cytotoxic T lymphocyte epitope repertoire," The Journal of Biological Chemistry, vol. 278, no. 46, pp. 45135-45144, 2003.

[84] E. M. Lafuente and P. A. Reche, "Prediction of MHC-peptide binding: a systematic and comprehensive overview," Current Pharmaceutical Design, vol. 15, no. 28, pp. 3209-3220, 2009.

Julio Alonso-Padilla, (1) Esther M. Lafuente, (2) and Pedro A. Reche (2)

(1) Barcelona Institute for Global Health (ISGlobal), Centre for Research in International Health (CRESIB), Hospital Clinic-University of Barcelona, Barcelona, Spain

(2) Laboratory of Immunomedicine, Faculty of Medicine, University Complutense of Madrid, Ave Complutense S/N, 28040 Madrid, Spain

Correspondence should be addressed to Pedro A. Reche;

Received 19 May 2017; Revised 7 August 2017; Accepted 20 August 2017; Published 28 September 2017

Academic Editor: Peirong Jiao

Caption: Figure 1: Structural mapping of selected experimental EBV-specific B cell epitopes. Conserved EBV epitopes map onto two different regions of the 3D structure of gp350 (PDB code: 2H6O): QVNYLIPETVPYIKWDN and YVFYSGNGPKASGGDYCIQS map at the glycan-free surface of the CR2 receptor binding site; SKAPESTTTSPTLNTTGFA maps at the C-term tail of the PDB. (a) General view of gp350 featured as ribbon with B cell epitopes highlighted in red, blue, and purple. Protein regions of the selected epitopes are zoomed in panels (b, c, d). We show in sticks the part of the epitopes that exhibited greater flexibility and accessibility which was ultimately selected for the proposed vaccine ensemble. In ribbon, we show the B cell epitope residues that do not comply with the flexibility and accessibility criteria (typed in a minor case in the corresponding sequence indicated at the bottom of each panel). Figures were rendered using PyMOL.

Caption: Figure 2: Structural mapping of predicted B cell epitopes in EBV envelope proteins. (a) KLPHWTPTLH in EBV gp42 3D structure (PDB: 3FD4 chain A); epitope shown as sticks and gp42 region interaction with HLA-DR is shown in red. (b) NTTVGIELPDA and (c) SSHGDLFRFSSDIQCP at EBV gB 3D structure (PDB: 3FVC) map, respectively, in its domain II and domain III; epitopes shown as sticks. (d) FSVEDLFGAN at gL 3D structure (PDB: 5T1D chain B) in its domain I (colored in blue); gH is colored in pale green. In (b, c, d), the corresponding whole structure is shown minimized at the bottom left of each panel; the magnified epitope mapping region is circled in them. In (a, b, c), the protein backbone is featured as pale green ribbon. Figures were rendered using PyMOL.

Caption: Figure 3: Strategy for experimental B cell epitope selection. Overview of the approach devised to select invariant experimental EBV-specific B cell epitopes for the B cell component of an epitope-based vaccine against EBV. The approach comprises 5 steps: (1) selection of unique epitopes from databases; (2) sequence variability filtering and testing for self-peptides; (3) selection of epitopes from viral envelope antigens; (4) progression of epitopes located to envelope protein ectodomains; (5) final output of epitopes that fulfill the flexibility and accessibility criteria established in the text. None of the epitopes that we selected were identical to human proteins or proteins from the human microbiome.

Caption: Figure 4: The EBV gp350 contact region with CR2. EBV B cell epitopes ETVPYIKWDN and GNGPKASGGD map onto a gp350 region that interacts with CR2; epitopes colored blue and red and the gp350 backbone featured as pale green ribbon. Side chains of the residues described to interact with CR2 receptor by Young et al. [67] are shown as sticks. Figure was rendered using PyMOL.
Table 1: Conserved EBV-specific CD8 T cell epitopes from early

Epitope     Antigen    AN (1)         HLA I
              gene               restriction (2)

RPPIFIRRL    EBNA3     P12977   B*07, B*08, B*0702

SVRDRLARL    EBNA3     P12977         A*0201

YVLDHLIVV    BRLF1     P03209      A*0201, A*02

QPRAPIRPI    EBNA6     P03204         B*0702

LPCVLWPVL    BZLF1     P03206         B*0702

RVRAYTYSK    BRLF1     P03209      A*0301, A*03

AYSSWMYSY    EBNA3     P12977          A*30

VLKDAIKDL    EBNA1     P03211         A*0203

QAKWRLQTL    EBNA3     P12977          B*08

RRIYDLIEL    EBNA6     P03204         B*2705

RLRAEAQVK    EBNA3     P12977      A*03, A*0301

CYDHAQTHL    BMRF1     P03191         A*2402

SENDRLRLL    BZLF1     P03206      B*4002, B60

YRSGIIAVV    BMRF1     P03191      B*3906, Cw6

ARYAYYLQF     DBP      P03227         B*2705

VSFIEFVGW    EBNA3     P12977          B*58

Epitope                Predicted HLA I profile              PPC% (3)

RPPIFIRRL      B*0702, B*0801, B*3501, B*5101, B*5102,       57.84
                   B*5103, B*5301, B*5401, C*0102

SVRDRLARL      A*0201, A*0203, A*0206, A*0214, B*0702,       56.66
                           B*0801, B*1517

YVLDHLIVV   A*0201, A*0202, A*0203, A* 0204, A* 0205, A*     47.34
                0206, A*0209, A*0214, B*1517, B* 5701

QPRAPIRPI      B*0702, B*3501, B*5101, B*5102, B*5103,       43.56
                       B*5301, B*5401, B*5502

LPCVLWPVL      B*0702, B*3501, B*5101, B*5102, B*5103,        42.4
                            B*5301, B5401

RVRAYTYSK      A*0301, A*1101, A*3101, A*3301, A*6801        41.46

AYSSWMYSY              A*0101, B*2701, C*0702                36.38

VLKDAIKDL      A*0203, A*0204, A*0205, A*0206, A*0207,       33.72
                       A*0214, B*0801, C*0304

QAKWRLQTL          B*0702, B*0801, B*1400, C*0102            32.48

RRIYDLIEL      B*1400, B*2702, B*2703, B*2704, B*2705,       30.42
                       B*2706, B*2709, C*0702

RLRAEAQVK              A*0301, A*1101, B*1513                 28.7

CYDHAQTHL              A*0207, A*2402, B*3801                 27.3

SENDRLRLL                   B*4002, B4402                    14.18

YRSGIIAVV      A*0202, A*0203, A*0204, A*0205, A*0209,       12.82
               B*1509, B*1510, B*1516, B*2709, B*3801,
                          B*39,011, B*3909

ARYAYYLQF      B*1400, B*1517, B*2701, B*2702, B*2703,        7.56
                   B*2704, B*2705, B*2706, B*2709

VSFIEFVGW                  B*5701, B*5702                     5.08

(1) Antigen accession number from the UniProtKB database. (2)
Experimental restriction profile obtained from epitope databases. (3)
Average population protection coverage (PPC) of PPCs computed for 5
ethnic groups in the USA population (Black, Caucasian, Hispanic,
North American natives, and Asians) using the relevant HLA I genetic
frequencies [40]. The combination that reached the largest PPC
(97.1%) included the CD8 T cell epitopes YVLDHLIVV, YRSGIIAVV,

Table 2: Conserved EBV-specific CD4 T cell epitopes.

Epitope                Antigen gene   AN (1)       HLA II
                                               restriction (2)

MLGQDDFIKFKSPLV           BFRF1       P03185      DRB1*0701

AGLTLSLLVICSYLFISRG       BHRF1       P03182         DR2

LEKQLFYYIGTMLPNTRPHS      BXLF2       P03231        DR51

SRRFSWTLFLAGLTLSLLVI      BHRF1       P03182         DR2

SRDELLHTRAASLLY           BARF1       P0CAP6      DRB1*0701

PPVVRMFMRERQLPQ           EBNA6       P03204    HLA class II

QQRPVMFVSRVPAKK           EBNA6       P03204    HLA class II

PAQPPPGVINDQQLHHLPSG      EBNA2       P12978      DRB1*0301

VKLTMEYDDKVSKSH           BMRF1       P03191      DRB1*0301

QKRAAPPTVSPSDTG           EBNA6       P03204    HLA class II

Epitope                     Predicted HLA II profile         PPC (3)

MLGQDDFIKFKSPLV         DRB1*0901, DRB1* 1501, DRB1*0701,     69.85
                        DRB1*0405, DRB1*0101, DRB1*0301,
                              DRB5*0101, DRB1*0401

AGLTLSLLVICSYLFISRG     DRB1* 1501, DRB5*0101, DRB1*1101,     57.97
                        DRB1*0405, DRB1*0401, DRB1*0301,
                              DRB1*1201, DRB1*0802

LEKQLFYYIGTMLPNTRPHS    DRB5*0101, DRB1*1101, DRB1*0401,      57.97
                       DRB1*0405, DRB1* 1201, DRB1* 1501,
                              DRB1*0301, DRB1*0802

SRRFSWTLFLAGLTLSLLVI    DRB1*0401, DRB1*0101, DRB1*0901,      55.25
                        DRB1*0301, DRB1*0701, DRB1* 1201

SRDELLHTRAASLLY         DRB1*0701, DRB1*0101, DRB1*1201,      42.9
                        DRB3*0202, DRB1*0901, DRB1* 1302

PPVVRMFMRERQLPQ         DRB1*1101, DRB5*0101, DRB1*0301,      36.88
                              DRB1*0401, DRB4*0101

QQRPVMFVSRVPAKK         DRB5*0101, DRB1*0802, DRB1*1101,      29.35

PAQPPPGVINDQQLHHLPSG          DRB1*0301, DRB4 *0101           17.84

VKLTMEYDDKVSKSH                     DRB1*0301                 17.84

QKRAAPPTVSPSDTG                        --                       0

(1) Antigen accession number from the UniProtKB database. (2)
Experimental HLA II restriction profile obtained from epitope
databases. (3) Population protection coverage (PPC) was computed for
the world population using the IEDB Analysis Resources tool with the
HLA-DR allele reference set provided by the tool [37]. The italicized
sequence is shared by the two epitopes that contain it.

Table 3: Experimentally defined conserved EBV-specific B cell epitopes.

Epitope                 Antigen (gene)   AN (1)   Epitope location

SKAPESTTTSPTLNTTGFA     gp350 (BLLF1)    P03200      Ectodomain
YVFYSGNGPKASGGDYCIQS    gp350 (BLLF1)    P03200      Ectodomain
QNPVYLIPETVPYIKWDN      gp350 (BLLF1)    P03200      Ectodomain
SVKTEMLGNEID            gp350 (BLLF1)    P03200      Ectodomain
QVSLESVDVYFQDVFGTMWC    gp350 (BLLF1)    P03200      Ectodomain
TNTTDITYVGD             gp350 (BLLF1)    P03200      Ectodomain
PSTSSKLRPRWTFTSPPVTT    gp350 (BLLF1)    P03200      Ectodomain
QKRAAQRAAGPSVAS          gpB (BALF4)     P03188     Inner domain
VSGFISFFKNPFGGM          gpB (BALF4)     P03188    Transmembrane

Epitope                  PDB hit (2)     Flexibility (3)   Access. (4)

SKAPESTTTSPTLNTTGFA     2H6O [422-440]    2.486 (2.672)    59.2 (63.4)
YVFYSGNGPKASGGDYCIQS    2H6O [282-301]    1.102 (2.004)    31.7 (51.2)
QNPVYLIPETVPYIKWDN      2H6O [147-164]    0.618 (1.191)    62.4 (77.5)
SVKTEMLGNEID            2H6O [197-208]       -0.347           19.8
QVSLESVDVYFQDVFGTMWC    2H6O [122-141]       -0.575           17.5
TNTTDITYVGD             2H6O [317-327]        0.121           60.1
PSTSSKLRPRWTFTSPPVTT          No               N/A             N/A
QKRAAQRAAGPSVAS               No               N/A             N/A
VSGFISFFKNPFGGM               No               N/A             N/A

(1) Accession number from UniProtKB database. (2) Epitope hit with
corresponding PDBs (in bracket sequence hit). Values of (3) flexibility
(arbitrary units) and (4) solvent accessibility (%) were calculated as
explained in Materials and Methods. N/A: not applicable; gp:
glycoprotein. We show the italicized regions in B cell epitopes
consisting of 9 or more consecutive residues with flexibility > 1 and
we show in brackets the corresponding flexibility and accessibility
values of these regions.

Table 4: Predicted conserved B cell epitopes from EBV envelope

Epitope              Antigen      Accession        PDB (2)
                      (gene)      number (1)

KLPHWTPTLH         gp42 (BZLF2)     P03205     3FD4:A [45-54]

NTTVGIELPDA        gpB (BALF4)      P03188     3FVC [307-317]

SSHGDLFRFSSDIQCP   gpB (BALF4)      P03188      3FVC [32-47]

FSVEDLFGAN          gL (BKRF2)      P03212     5T1D:B [95-104]

Epitope            Flex. (3)   Acc. (%) (4)       BLAST hit
                                                 HMP (%) (5)

KLPHWTPTLH           2.256         80.0       EJZ65106.1 (70.00)

NTTVGIELPDA          1.890         67.0       EHM53795.1 (72.73)

SSHGDLFRFSSDIQCP     1.369         69.8       KGF26221.1 (50.00)

FSVEDLFGAN           1.505         53.1       EKB09257.1 (65.00)

Epitope            BLAST hit human (%) (6)

KLPHWTPTLH           AAH22472.1 (60.00)

NTTVGIELPDA        XP_011519547.1 (63.64)

SSHGDLFRFSSDIQCP   XP_011520599.1 (50.00)

FSVEDLFGAN         XP_005271219.1 (70.00)

(1) Accession number from the UniProtKB database. (2) Epitope
location in their corresponding PDBs is shown in brackets. The
specific chain is indicated along with the PDB code. (3) Values of
flexibility (arbitrary units) and (4) solvent accessibility (%) were
calculated as explained in Materials and Methods. (5,6) Accession
number of closest epitope BLAST hit in human microbiome proteins
and human proteins, respectively (percentage of identity
in parenthesis).

Table 5: Proposed epitope vaccine ensemble for EBV.

CD8 T cell epitope vaccine component

Sequence                 Antigen      AN (1)   BLAST hit HMP (2)

RPPIFIRRL                 EBNA3       P12977   EFI49553.1 (55.56)
SVRDRLARL                 EBNA3       P12977       No hit (-)
YVLDHLIVV                 BRLF1       P03209   EPH07203.1 (88.89)
QPRAPIRPI                 EBNA6       P03204   EEZ70880.1 (66.67)
LPCVLWPVL                 BZLF1       P03206   ETN46892.1 (77.78)
RVRAYTYSK                 BRLF1       P03209   EEY91922.1 (66.67)
AYSSWMYSY                 EBNA3       P12977   EKB85112.1 (77.78)
VLKDAIKDL                 EBNA1       P03211   KXB56071.1 (88.89)
QAKWRLQTL                 EBNA3       P12977   EHR35488.1 (66.67)
RRIYDLIEL                 EBNA6       P03204   EDS12420.1 (77.78)
RLRAEAQVK                 EBNA3       P12977       No hit (-)
CYDHAQTHL                 BMRF1       P03191   EFF75621.1 (77.78)
SENDRLRLL                 BZLF1       P03206   EGG37664.1 (77.78)
YRSGIIAVV                 BMRF1       P03191   OFQ99895.1 (88.89)

CD4T cell epitope vaccine component

Sequence                 Antigen      AN (1)   BLAST hit HMP (2)

MLGQDDFIKFKSPLV           BFRF1       P03185   EIY33207.1 (46.67)
AGLTLSLLVICSYLFISRG       BHRF1       P03182   EKN19533.1 (47.37)
SRDELLHTRAASLLY           BARF1       P0CAP6   EPB87510.1 (66.67)
PPVVRMFMRERQLPQ           EBNA6       P03204   EFV04068.1 (46.67)

B cell epitope vaccine component

Sequence                 Antigen      AN (1)   BLAST hit HMP (2)

APESTTTSPTLNTTGFA     gp350 (BLLF1)   P03200   EGY79509.1 (58.82)
GNGPKASGGD            gp350 (BLLF1)   P03200   EHM51909.1 (70.00)
ETVPYIKWDN            gp350 (BLLF1)   P03200   EET62946.1 (50.00)
KLPHWTPTLH            gp42 (BZLF2)    P03205   EJZ65106.1 (70.00)
NTTVGIELPDA            gpB (BALF4)    P03188   EHM53795.1 (72.73)
SSHGDLFRFSSDIQCP       gpB (BALF4)    P03188   KGF26221.1 (50.00)
FSVEDLFGAN             gL (BKRF2)     P03212   EKB09257.1 (80.00)

CD8 T cell epitope vaccine component

Sequence               BLAST hit humans (3)    PPC% (4)

RPPIFIRRL             NP_001182344.1 (66.67)     >95
SVRDRLARL                  3HR0 (55.56)
YVLDHLIVV             XP_011535331.1 (66.67)
QPRAPIRPI               AFC01212.1 (55.56)
LPCVLWPVL             XP_011511695.1 (55.56)
RVRAYTYSK               CAE46202.1 (55.56)
AYSSWMYSY               EAW88404.1 (66.67)
VLKDAIKDL               EAX00446.1 (66.67)
QAKWRLQTL             XP_005255827.1 (77.78)
RRIYDLIEL               EAW88480.1 (66.67)
RLRAEAQVK             XP_011507142.1 (77.78)
CYDHAQTHL               CAH10644.1 (66.67)
SENDRLRLL               EAW88969.1 (77.78)
YRSGIIAVV               BAC03504.1 (66.67)

CD4T cell epitope vaccine component

Sequence               BLAST hit humans (3)    PPC% (4)

MLGQDDFIKFKSPLV       NP_001284364.1 (53.33)    >81.8
SRDELLHTRAASLLY       XP_011514101.1 (66.67)
PPVVRMFMRERQLPQ         AAP34452.1 (60.00)

B cell epitope vaccine component

Sequence               BLAST hit humans (3)    PPC% (4)

APESTTTSPTLNTTGFA     NP_001276932.1 (52.94)      E
GNGPKASGGD             NP_055501.2 (70.00)        E
ETVPYIKWDN            NP_001193968.1 (50.00)      E
KLPHWTPTLH              AAH22472.1 (60.00)        P
NTTVGIELPDA           XP_011519547.1 (63.64)      P
SSHGDLFRFSSDIQCP      XP_011520599.1 (50.00)      P
FSVEDLFGAN            XP_005271219.1 (70.00)      P

(1) Accession number from UniProtKB database. (2,3) Accession number
of the closest epitope BLAST hit to human microbiome proteins and
human proteins, respectively (percentage of identity in parenthesis).
(4) Population protection coverage (PPC) of the CD8 and CD4 T cell
epitope ensemble. (5) Src., source, whether the epitope derived from
an experimental B cell epitope (E) or it was predicted (P).
COPYRIGHT 2017 Hindawi Limited
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2017 Gale, Cengage Learning. All rights reserved.

Article Details
Printer friendly Cite/link Email Feedback
Author:Alonso-Padilla, Julio; Lafuente, Esther M.; Reche, Pedro A.
Publication:Journal of Immunology Research
Date:Jan 1, 2017
Previous Article:Getting "Inside" Type I IFNs: Type I IFNs in Intracellular Bacterial Infections.
Next Article:Propyl Gallate Exerts an Antimigration Effect on Temozolomide-Treated Malignant Glioma Cells through Inhibition of ROS and the NF-[kappa]B Pathway.

Terms of use | Privacy policy | Copyright © 2021 Farlex, Inc. | Feedback | For webmasters