A Quantitative Analysis of the Similarities and Differences of the HIV/FIV Gag Genome
Authors: Charlotte Siu*
*Corresponding Author: Charlotte Siu, Miami University Oxford, 501 E High St, Oxford, OH 45056, USA
Received Date: 31 January 2022
Accepted Date: 07 February, 2022
Published Date: 01 March 2022
Citation: Siu C (2022) A Quantitative Analysis of the Similarities and Differences of the HIV/FIV Gag Genome. J Community Med Public Health 6: 235. DOI: https://doi.org/10.29011/2577-2228.100235
Abstract
Among viruses, Human Immunodeficiency Virus (HIV) presents the greatest challenge to humans. Here, we retrieved genome sequences from NCBI and were then run through LALIGN bioinformatics software to compute the E value, bit score, Waterman eggert score, and percent identity, which are four important indicators of how similar the sequences are. These 4 values are 3.1 x 109, 51.9, 241 and 55.4%. Bases 1600 to 1990 in HIV and bases 800 to 910 in FIV have a higher than normal similarity. This reflects that while the genomic sequences of the gag region of both the HIV and FIV genomes are rather similar, it is unlikely that this similarity is due to random chance; therefore, there are a noticeable number of differences. A better understanding of the level of similarity and differences in the gag region of the genome sequence would facilitate our understanding of structural and cellular behavioural differences between FIV and HIV, and in the long term, it will provide new insights into the differences observed in previous studies or even facilitate the development of an effective HIV treatment.
Keywords: HIV; FIV; Gag
Introduction
irst discovered in 1983, HIV has infected approximately 80 million people worldwide so far according to data published by the WHO. In 2020, over 680,000 people died from the virus. Despite its deadliness, there is still no effective and specific treatment for this virus. Three years after HIV was first sequenced, scientists were able to discover a similar virus in cats, feline immunodeficiency virus or FIV [1,2]. Like many viruses, both HIV and FIV have gag proteins [3]. The gag protein is known to play an important role in many stages of the replication cycle of a retrovirus. For example, they play an important role in viral assembly, interact with numerous host cell proteins, and regulate viral gene expression. They also provide the main driving force for virus intracellular trafficking and budding and are involved in pathogenicity [4]. Past studies have suggested that the genomic sequences in both viruses are similar, but it was not clear to what extent the gag genome similarity is. When computed in the DNA analysis and alignment software FASTA, the E value and bit score are good indications of similarity between two sequences. The lower the E value is, the more similar the nucleotide sequences are, and the less likely this “match” in the nucleotide sequence is due to random chance.
Generally, an E value below 0.01 is considered low. A bit score of 50 or above almost always indicates that the match between two nucleotide sequences is very significant and similar [5]. The percent identity is the percent of nucleotides that match exactly and is adjusted for the length of the nucleotide sequence. A percent identity of 50 percent or more would mean that a majority of the nucleotides match when adjusted for the length of the nucleotide sequences. The waterman eggert score is a good indicator of sequence similarity too, and previous articles have cited that any score of 37 or above is considered as significant [6]. Since previous studies have shown that HIV and FIV share similar pathogenesis and the gag protein coded by the gag genome plays an important role in pathogenesis [7], it is hypothesized that the gag region of the FIV and HIV genome should be similar in sequence as defined by the aforementioned standards regarding the E value, bit score, waterman eggert score and percent identity.
Materials and Methods
Preparing the PCR templates
Before the gag genome could be sequenced, the samples needed to be amplified by PCR. This could increase the number of copies of the same nucleotide sequences available for sequencing [8]. In this experiment, the master mix of the PCR consisted of 1 µL of big dye terminator (Table 1), 1.5 µL of big dye dilution buffer (Table 1), 0.5 µL of the primer (Table 1), 4.5 µL of the gag genes. (Table 1), and 5.5 µL of molecular grade water (Table 1). This adds up to a total volume of 10 µL. The same recipe was used for both the HIV and FIV gags.
The PCR run: After preparing the master mix, we programmed the standard cycle sequencing protocol on the thermocycler. Step one of the cycles lasted for 1 minute at 96°C, step 2 lasted 10 seconds at 96°C, step 3 lasted 5 seconds at 50°C, and step 4 lasted 4 minutes at 60°C. The cycler was repeated for 35 times.
Purification of the PCR products: After completing the PCR run, the master mix was transferred to a 1.5 mL Eppendorf tube, to which 1 µL of 1.5 M NaOAC/EDTA and 80 µL 95% ethanol were added. After that, the master mix was centrifuged for 15 minutes at 12,000 g. All supernatants were removed with a pipette, and 100 µL of 70% ethanol was added. The sample was stored at -20°C.
The next generation sequencing machine: After the purification and PCR process, the sample was handed over to a technician to run on an ABI 3100 machine.
Checking the accuracy: The gag genome sequences received were “nucleotide blasted” on NCBI. The BLAST results showed that the sequences were accurate because they showed a 100% match to the HIV and FIV gag genome sequences in their records.
Bioinformatics software analysis: The HIV and FIV gag sequences obtained were entered into FASTA software, a software designed to compare DNA sequences. FASTA software was used to compute the E value, bit score and percent identity of the two sequences. The results were recorded and presented below.
Results
The E value of the overall sequences was 3.1x109, with a bit score of 51.9. The percent identity of the overall sequences was 55.4% whereas the waterman eggert score is 241 (Table 2). In addition to calculating the four values we mentioned as “indicators of degree of similarity” in our introduction/ hypothesis, we have one unexpected finding when doing the study. The software alerted us that there is a region in the HIV/FIV genome that special attention is needed, since it shows a striking degree of similarity that can be said as one of the highest in the overall gag genome. It should be somewhere between the base pair 800 to 1,200 in FIV and 1,500 to 2,500 in HIV .With the software’s alert in hand, a FASTA analysis was performed. The “region of concern” is base 1600 to 1990 in HIV and base 800 to 910 in FIV (Figure 1). The “region of concern” shows a 3.4x10^-10 E value and a 55.1 bit score (Figure 1). Both of these are higher than the overall gag genome.
Discussion
This discovery of the exact degree of similarity of the HIV and FIV gag genome in this study could pave the way for explaining many other scientific observations referred to in other studies as well as some other new studies to understand more about HIV and FIV. One example is, the high degree of similarity we mentioned in our study could explain why previous studies have concluded that the FIV and HIV generally share a similar pathogenesis pathway. However, little is known about where in the gag genome is the capsid protein for the FIV and HIV coded for, as well as the capsid protein. It is possible that the “region of significant concern” mentioned above is where the FIV capsid protein’s fundamental structure is coded for, so that might explain why there is evidence that the FIV capsid protein retains the same fundamental structure as the HIV protein [9]. In subsequent studies, we can investigate whether this hypothesis is correct or not. A second example is, other studies have suggested that the FIV matrix is not found as a trimer in the crystal structure [9]. Existing research shows that there is little similarity in the amino acid sequence of the FIV and HIV protein matrix [9]. HIV-1 and FIV Gag differ strikingly in a key intracellular trafficking property. FIV Gag is a nuclear shuttling protein that utilizes the CRM1 nuclear export pathway, while HIV1 Gag is excluded from the nucleus [10,11]. It is possible that the genome that codes for the matrix’s structure or the matrix in HIV and FIV are really different in sequence and could be within the “46.6 percent of dissimilar genomic sequences”. Again, this is what future studies could look at. In addition to the above, future studies could also try and investigate whether there are regions of the gag genome that show an unusual pattern such as repetition, or even if there is a particular region of the FIV and HIV gag genome that shows unusual differences. Therefore, it is clear that this study has given a clearer direction as to where to start or continue with when studying more about the similarities and differences regarding the behaviour of gag protein in HIV and FIV, as well as things the gag protein plays a role in such as the matrix structure, capsid structure etc. It is important for us to try and understand more about the gag genome of the HIV and FIV on a molecular level because the more we understand about the gag, the more we understand about the pathogenicity as well as structure of the HIV and FIV, and subsequently how or whether it is possible to apply the kind of treatment we use for the FIV virus to the HIV virus.
Conclusion
The gag genomes of HIV and FIV, on a whole, are very similar. While they are generally similar, approximately 46.6 percent of the nucleotide bases do not exactly match. Therefore, we can reasonably conclude that our hypothesis is correct.
Acknowledgement
We acknowledge Miami University Microbiology department as well as CBFG for providing us with the funding, help and resources needed to complete this study.
Figures
Tables
Ingredient |
Quantity (µL) |
Big dye terminator |
1 |
Big dye dilution buffer |
1.5 |
HIV/ FIV primer |
0.5 |
HIV/ FIV gag DNA |
4.5 |
Molecular grade water |
5.5 |
Table 1: The ingredients used to make the PCR master mix for both HIV and FIV gag and their respective quantities.
Indicator |
Value |
E value |
3.1x109 |
Bit score |
51.9 |
Percent identity |
55.40% |
Waterman eggert score |
241 |
Table 2: The LALIGN results of the HIV/ FIV gag genome.
References
- World Health Organization (2022) Number of deaths due to HIV/AIDS. World Health Organization.
- Pedersen NC, Ho EW, Brown ML, Yamamoto JK (1987) Isolation of a T-lymphotropic virus from domestic cats with an immunodeficiencylike syndrome. Science 235: 790-793.
- Coffin JM, Hughes SH, Varmus HE (1997) Retroviruses. Cold Spring
- Müllers E (2013) The foamy virus gag proteins: What makes them different? Viruses 5: 1023-1041.
- Pearson WR (2013) An introduction to sequence similarity (“homology”) Current Protocols in Bioinformatics 42.
- Root-Bernstein R, Churchill B (2021) Co-evolution of opioid and adrenergic ligands and receptors: Shared, complementary modules explain evolution of functional interactions and suggest novel engineering possibilities. Life 11: 1217.
- Friedman H, Specter S, Bendinelli M (2006) FIV as a model for HIV: An In Vivo Models of HIV Disease and Control 2006: 149-237.
- Casbon JA, Osborne RJ, Brenner S, Lichtenstein CP (2011) A method for counting PCR template molecules with application to nextgeneration sequencing. Nucleic Acids Research 39: e81.
- González S, Affranchino J (2018) Properties and functions of feline immunodeficiency virus gag domains in Virion Assembly and budding. Viruses 10: 261.
- Kemler I, Saenz D, Poeschla E (2012) Feline immunodeficiency virus gag is a nuclear shuttling protein. Journal of Virology 86: 8402-8411.
- Miller C, Abdo Z, Ericsson A, Elder J, VandeWoude S (2018) Applications of the FIV model to study HIV pathogenesis. Viruses 10:
© by the Authors & Gavin Publishers. This is an Open Access Journal Article Published Under Attribution-Share Alike CC BY-SA: Creative Commons Attribution-Share Alike 4.0 International License. Read More About Open Access Policy.