Jump to content

SSHHPS

From Wikipedia, the free encyclopedia

SSHHPS is an acronym for short stretches of homologous host pathogen sequences. The acronym was first coined by Legler in a 2019 publication.[1] Legler used BLAST to search for host protein substrates for the nsP2 protease of the Venezuelan equine encephalitis virus (VEEV) and the protease from Zika virus. These viruses are Group 4 (+)ssRNA viruses. Short ~20-25 amino acid sequences from the viral polyprotein [2] containing the scissile bond were used to search the human proteome. Many of the sequence alignments were spurious, while some matched well with the residues surrounding the scissile bond. When all known host proteins shown to be cut by viral proteases were consolidated into a table, it became clear that the targets were not random.[3] Most were related to innate immunity while others appeared to be related to viral pathogenesis and the virus-induced phenotype. Some hits were related to both. The list of experimentally confirmed host targets of Group IV viral proteases included key proteins involved in innate immunity e.g MAVS, RIG-I, STING, TRIF, and TRIM14.[3] In 1984, one of the first host proteins shown to be cut by a viral protease was histone H3 by foot-and-mouth disease virus.[4][5] The histone tails are strategic targets of the viral proteases, the cleavage can shut down host cell transcription[6][7] and the many effects of interferon.

The VEEV nsP2 cysteine protease recognizes a motif. The subsite tolerances can be extracted from the polyprotein.

Viral proteases recognize sequence motifs. The subsite tolerances in the protease can vary, leading to the recognition of many sequences. The protease is a complement to many peptides.

Silence

[edit]

Silencing can occur at the level of DNA, RNA, and protein. The 3rd mechanism of silencing would involve proteases and proteins.[1] SSHHPS cleavage is a type of target specific co- or post-translational silencing.

SSHHPS2.pdf

Silencing can occur at the level of DNA, RNA, and Protein. SSHHPS are short stretches of homologous host pathogen sequences. These sequences can be found at the viral protease cleavage sites, they correspond to specific proteins in the host. The cleavage of these sequences can be co- or post-translational. Original figure can be found in Morazzani, et al.[1]

SSHHPS Predicted for SARS-CoV-2

[edit]

Using PHI-BLAST and a sequence pattern (e.g. L[RK]GG) a shorter list of host targets could be obtained, however the searches still produced hundreds of host targets. To sort them and rank order them Legler used clustering. Plotting 'percent positives' vs. 'alignment length' from the PHI-BLAST output file, the cleavable proteins were found to cluster and localize to the right of the graph.[8] The hit lists could now be sorted by alignment length and percent positives and a rank-ordered list could be produced. At the top of the list are the most likely substrates and at the bottom the less likely substrates. This and experimental data[9][10][11][1] became the basis for the first sequence-to-symptom software for viruses.[8] An example of the software output can be found here.

Clustering of potential host targets of viral proteases. Plot of PHI-BLAST search results for the SARS-CoV-2 papain-like protease. Hits can be found in a file that can be downloaded on the BLAST website. On the right side of the graph are the sequences that have the strongest alignment to residues at the cleavage site over the longest continuous stretch. For SARS-CoV-2 papain-like protease (PLpro) the host proteins with the highest similarity were the cardiac myosins, myomesin, and PROS1. A distribution can be extracted as there are multiple proteins under each point. ADGRA2 (GPR124) is a common hit among neuroinvasive viruses. The alignments on the left side of the graph were more spurious and gapped. Points in blue correspond to proteins experimentally shown to be cut.

After sorting the hits, Legler found that the hits at the top of the list had similarities to the virus-induced phenotype.[3] For the COVID-19 SARS-CoV-2 papain-like protease (PLpro), cardiac myosins were the strongest predicted hit (MYH6, MYH7); MYOM1, POT1, VWF, PROS1, HER4, and FOXP3 were also predicted and the sequences were shown to be cleavable.[8] A group at UCSF, showed the cleavage of myofibrils in cardiomyocytes after infection with SARS-CoV-2.[12][13] Fragments of the sarcomere are still visible showing that the cleavage of the myofibrils occurs post-translationally and after the assembly of the myofibril. The viral proteases have also been suspected in COVID coagulopathy.[14] The PLpro of SARS-CoV-2 was able to cut sequences in PROS1 and VWF.

SSHHPS Predicted for Zika Virus

[edit]

Zika virus has been associated with microcephaly and anencephaly. Using the sorting and graphical method described above, hits related to these phenotypes emerged, such as GIT1, FOXG1, and SFRP1. GIT1 knockout mice develop microcephaly.[15] Mice and rats have not been shown to develop microcephaly after infection with Zika virus (ZIKV). However, Goodfellow, et al[16] showed that chickens can produce microcephaly when infected with ZIKV. Both humans and chickens have the same sequence at the predicted cleavage site in SFRP1.[1] SFRP1 is a predicted host protein substrate for the Zika viral protease. The sequence is identical in humans and chickens, two species which both produce microcephaly after infection with Zika virus.

Sequence alignment of SFRP1 across species. The predicted cleavage site sequence of humans and chickens match.
Predicted SSHHPS for Zika Virus. GIT1 and FOXG1 were predicted and are associated with microcephaly. The GIT1 knockout mouse has the phenotype of a small brain. This was shown by Hong and Mah, and their original figure can be found in PMID 25792865 [15]

The Uncleavable Mouse Experiment

[edit]

In 1996, Blom, et al.[10] created a neural network to predict the host targets of picornaviral proteases. One of the predicted hits was dystrophin (DMD). Badorff, et al[9] confirmed that dystrophin could be cleaved by the enteroviral 2A protease. Lim, et al[11] went one step further and generated a transgenic mouse. The knock-in mice had a mutation at the predicted 2A protease cleavage site in dystrophin. When the viral protease was expressed in cardiomyocytes the cleavage-resistant dystrophin inhibited the cardiomyopathy induced by the viral protease. This experiment brought the idea full circle, i.e. that the viral protease is related to the virus-induced phenotype (i.e. cardiomyopathy). Moreover, the experiment indicated that the clinical presentation could be predicted directly from the viral genome sequence.[8] While Blom's predictions were accurate and could be confirmed by others, a common hit was never found across family or genus.

A Common Hit

[edit]

Using Python, the PHI-BLAST searches and UniProt descriptions could be combined and automated.[8] The search could be repeated several times. Running the searches for 9 neuroinvasive viruses, Legler found that if the viruses were clustered by a common virus-induced phenotype (e.g. neuroinvasiveness) a common hit emerged. One protein common to all 9 hit lists was the orphan G-protein coupled receptor ADGRA2 (also known as GPR124). When ADGRA2 is knocked-out in mouse models of ischemia and glioblastoma blood-brain barrier (BBB) disruption is observed.[17][18] The cleavage sites for the viral proteases of 9 neuroinvasive viruses were all found on this one protein, in some cases the cleavage sites were predicted to be on the outside of the cell, in other cases the cleavage site is predicted to be in the cytoplasm.

Predicted viral protease cleavage sites in ADGRA2 for 9 neuroinvasive viruses (WNV, SARS, SARS-2, MERS, ZIKV, VEEV, EEEV, EV71, JEV).

Interestingly, the software did not predict a specific cleavage site sequence or a particular type of protease (e.g. serine, cysteine, aspartyl) but rather a general pathway and common target. A strategy to enter the brain may have been preserved during viral evolution.[8]

Where do SSHHPS come from?

[edit]

RNA viruses are known to acquire host sequences. Host genomes serve as the largest source of foreign genetic material.[19] Using the RNA sequence of a viral protease cleavage site for SARS-CoV-2 and the bat genome, sequence matches can be found.[3] In Group 4 viruses, the protein sequence of the SSHHPS matches the host and reservoir, while the RNA sequences matches sequences in the reservoir species suggesting that they were acquired.[3]

When do these cleavages occur?

[edit]

Host protein cleavages can occur between 2 to 8 h post-infection.[5] Not all viral proteases produce stable cleavage products in cells, in fact in many cases a smear or just a transient cleavage product may exist and then disappear as the cell identifies it as 'damaged' making it difficult to trap. Co-translational cleavage can also occur. Antibodies may or may not recognize the cut products that persist and cleavage products may be further degraded by other mechanisms. The viral proteases can be cytotoxic. Protease inhibitors can be used to stabilize the cut product from cell lysates. Cells should be treated with protease inhibitors and frozen 1 to 8 h post-infection without trypsin. N-termini can also be acetylated and C-termini amidated to prevent amino and carboxypeptidases from destroying the cut proteins.

Where is the information about the virus-induced phenotype encoded in other viruses?

[edit]

The sequences associated with the virus-induced phenotypes for other viruses may be hidden in transcription factors, endonuclease cleavage sites, phosphorylation sites, etc. For Group IV (+)ssRNA viruses, the information can be found in the protease cleavage sites (the SSHHPS). For Group VI (+)ssRNA retroviruses the information may be in the protease cleavage sites [20] and elsewhere.

What makes the sequence a SSHHPS?

[edit]

Evidence of sequence homology between host and pathogen and a host-pathogen interaction. The sequence in the viral genome may not be identical to a host sequence, but highly similar. If a protein is found in another species and shares a common evolutionary origin with the protein in the first species, then it is considered a "homologue" of that protein; essentially meaning they are both derived from the same ancestral gene in a common ancestor. SSHHPS appear to be acquired rather than products of accumulated random mutations. David Baltimore proposed a Copy Choice mechanism for RNA recombination in RNA viruses where the viral RNA-dependent RNA polymerase switches templates during negative strand synthesis.[21] RNA has secondary structure and pauses in replication may occur. As to whether RNA-binding proteins or enzymes[22] in the reservoir species (e.g. bats) affect or promote RNA recombination is still unclear.

References

[edit]
  1. ^ a b c d e Morazzani, EM; Compton JR; Leary DH; Berry AV; Hu X; Marugan JJ; Glass PJ; Legler PM. (2019). "Proteolytic cleavage of host proteins by the Group IV viral proteases of Venezuelan equine encephalitis virus and Zika virus". Antiviral Res. 164: 106–122. doi:10.1016/j.antiviral.2019.02.001. PMC 9575189. PMID 30742841.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  2. ^ Hu, X; Legler, PM (2019). "Analysis of Group IV Viral SSHHPS Using In Vitro and In Silico Methods". JoVE. 154 (154). doi:10.3791/60421. PMID 31904018.
  3. ^ a b c d e Reynolds, ND; Aceves, NM; Liu, JL; Compton, JR; Leary, DH; Freitas, BT; Pegan, SD; Doctor, KZ; Wu, FY; Hu, X; Legler, PM (2021). "The SARS-CoV-2 SSHHPS Recognized by the Papain-like Protease". ACS Infect Dis. 7 (6): 1483–1502. doi:10.1021/acsinfecdis.0c00866. PMC 8171221. PMID 34019767.
  4. ^ Grigera, PR; Tisminetzky, SG (1984). "Histone H3 modification in BHK cells infected with foot-and-mouth disease virus". Virology. 136 (1): 10–19. doi:10.1016/0042-6822(84)90243-5. PMID 6330987.
  5. ^ a b Falk, MM; Grigera, PR; Bergmann, IE; Zibert, A; Multhaup, G; Beck, E (1990). "Foot-and-mouth disease virus protease 3C induces specific proteolytic cleavage of host cell histone H3". J Virol. 64 (2): 748–756. doi:10.1128/jvi.64.2.748-756.1990. PMC 249169. PMID 2153239.
  6. ^ Tesar, M; Marquardt, O (1990). "Foot-and-mouth disease virus protease 3C inhibits cellular transcription and mediates cleavage of histone H3". Virology. 174 (2): 364–374. doi:10.1016/0042-6822(90)90090-e. PMID 2154880.
  7. ^ Yi, S-J; Kim, K (2018). "Histone tail cleavage as a novel epigenetic regulatory mechanism for gene expression". BMB Rep. 51 (5): 211–218. doi:10.5483/BMBRep.2018.51.5.053. PMC 5988574. PMID 29540259.
  8. ^ a b c d e f Doctor, K; Gilmour E; Recarte M; Beatty TR; Shifa I; Stangel M; Schwisow J; Leary DH; Legler PM. (2023). "Automated SSHHPS Analysis Predicts a Potential Host Protein Target Common to Several Neuroinvasive (+)ssRNA Viruses". Viruses. 15 (2): 542. doi:10.3390/v15020542. PMC 9961674. PMID 36851756.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  9. ^ a b Badorff, C; Lee, GH; Lamphear, BJ; Martone, ME; Campbell, KP; Rhoads, RE; Knowlton, KU (1999). "Enteroviral protease 2A cleaves dystrophin: evidence of cytoskeletal disruption in an acquired cardiomyopathy". Nat. Med. 5 (3): 320–326. doi:10.1038/6543. PMID 10086389.
  10. ^ a b Blom, N; Hansen, J; Blaas, D; Brunak, S (1996). "Cleavage site analysis in picornaviral polyproteins: Discovering cellular targets by neural networks". Protein Science. 5 (11): 2203–2216. doi:10.1002/pro.5560051107. PMC 2143287. PMID 8931139.
  11. ^ a b Lim, B-K; Peter, AK; Xiong, D; Narezkina, A; Yung, A; Dalton, ND; Hwang, K-K; Yajima, T; Chen, J; Knowlton, KU (2013). "Inhibition of Coxsackievirus-associated dystrophin cleavage prevents cardiomyopathy". J Clin Invest. 123 (12): 5146–5151. doi:10.1172/JCI66271. PMC 3859391. PMID 24200690.
  12. ^ Perez-Bermejo, Juan A. "Novel Coronavirus Could be Slicing Heart Muscles Cells into Pieces". Weather Channel. The Weather Channel.
  13. ^ Perez-Bermejo, Juan A. (2020). "SARS-CoV-2 infection of human iPSC-derived cardiac cells predicts novel cytopathic features in hearts of COVID-19 patients". BioRxiv: The Preprint Server for Biology. Cold Spring Harbor Laboratory. doi:10.1101/2020.08.25.265561. PMC 7491510. PMID 32935097.
  14. ^ Baroni, M; Beltrami, S; Schiuma, G; Ferraresi, P; Rizzo, S; Passaro, A; Molina, JMS; Rizzo, R; Di Luca, D; Bortolotti, D (2024). "In Situ Endothelial SARS-CoV-2 Presence and PROS1 Plasma Levels Alteration in SARS-CoV-2-Associated Coagulopathies". Life. 14 (2): 237. Bibcode:2024Life...14..237B. doi:10.3390/life14020237. PMC 10890393. PMID 38398746.
  15. ^ a b Hong, ST; Mah, W (2015). "A Critical Role of GIT1 in Vertebrate and Invertebrate Brain Development". Exp Neurobiol. 24 (1): 8–16. doi:10.5607/en.2015.24.1.8. PMC 4363336. PMID 25792865.
  16. ^ Goodfellow, FT (2016). "Zika Virus Induced Mortality and Microcephaly in Chicken Embryos". Stem Cells Dev. 25 (22): 1691–1697. doi:10.1089/scd.2016.0231. PMC 6453490. PMID 27627457.
  17. ^ Chang, J (2017). "Gpr124 is essential for blood-brain barrier integrity in central nervous system disease". Nat Med. 23 (4): 450–460. doi:10.1038/nm.4309. PMC 5559385. PMID 28288111.
  18. ^ Posokhaova, E; Shukla, A; Seaman, S; Volate, S; Hilton, MB; Wu, B; Morris, H; Swing, DA; Zhou, M; Zudaire, E; Rubin, JS; St. Croix, B (2015). "GPR124 Functions as a WNT7-Specific Coactivator of Canonical β-Catenin Signaling". Cell Reports. 10 (2): 123–130. doi:10.1016/j.celrep.2014.12.020. PMC 4331012. PMID 25558062.
  19. ^ Gorbalenya, A (1992). "Host-related sequences in RNA viral genomes". Seminars in Virology. 3: 359. doi:10.1016/1044-5773(92)050359 (inactive 5 December 2024).{{cite journal}}: CS1 maint: DOI inactive as of December 2024 (link)
  20. ^ Nie, Z; Phenix, BN; Lum, JJ; Alam, A; Lynch, DH; Beckett, B; Krammer, PH; Sekaly, RP; Badley, AD (2002). "HIV-1 protease processes procaspase 8 to cause mitochondrial release of cytochrome c, caspase cleavage and nuclear fragmentation". Cell Death & Differentiation. 9 (11): 1172–1184. doi:10.1038/sj.cdd.4401094. PMID 12404116.
  21. ^ Kirkegaard, K; Baltimore, D (2004). "The mechanism of RNA recombination in poliovirus". Cell. 47 (3): 433–443. doi:10.1016/0092-8674(86)90600-8. PMC 7133339. PMID 3021340.
  22. ^ Lindley, SR (2024). "Ribozyme-activated mRNA trans-ligation enables large gene delivery to treat muscular dystrophies". Science. 386 (6723): 762–767. Bibcode:2024Sci...386..762L. doi:10.1126/science.adp8179. PMID 39541470.