Open Access

A comparative analysis of viral matrix proteins using disorder predictors

Virology Journal20085:126

DOI: 10.1186/1743-422X-5-126

Received: 05 October 2008

Accepted: 23 October 2008

Published: 23 October 2008

Abstract

Background

A previous study (Goh G.K.-M., Dunker A.K., Uversky V.N. (2008) Protein intrinsic disorder toolbox for comparative analysis of viral proteins. BMC Genomics. 9 (Suppl. 2), S4) revealed that HIV matrix protein p17 possesses especially high levels of predicted intrinsic disorder (PID). In this study, we analyzed the PID patterns in matrix proteins of viruses related and unrelated to HIV-1.

Results

Both SIVmac and HIV-1 p17 proteins were predicted by PONDR VLXT to be highly disordered with subtle differences containing 50% and 60% disordered residues, respectively. SIVmac is very closely related to HIV-2. A specific region that is predicted to be disordered in HIV-1 is missing in SIVmac. The distributions of PID patterns seem to differ in SIVmac and HIV-1 p17 proteins. A high level of PID for the matrix does not seem to be mandatory for retroviruses, since Equine Infectious Anemia Virus (EIAV), an HIV cousin, has been predicted to have low PID level for the matrix; i.e. its matrix protein p15 contains only 21% PID residues. Surprisingly, the PID percentage and the pattern of predicted disorder distribution for p15 resemble those of the influenza matrix protein M1 (25%).

Conclusion

Our data might have important implications in the search for HIV vaccines since disorder in the matrix protein might provide a mechanism for immune evasion.

Background

The viral matrix protein underlies the envelope of a virion, representing essentially a link between the envelope and the nucleocapsid [1, 2]. The functions of matrix proteins are usually multifaceted, and not completely understood [35]. They are however known to be involved in the viral assembly and stabilization of the lipid envelope [6]. Matrix proteins of different viral types are often structurally, functionally, and evolutionarily related [4]. For instance, the influenza M1 and HIV p17 proteins are known to be related and both have similar RNA and membrane binding domains [4].

Lentivirinae is among the genii of viruses that possess a matrix layer [7, 8]. Viruses that belong to this genus include Human Immunodeficiency Virus (HIV), Simian Immunodeficiency Virus (SIV), and Equine Infectious Anemia Virus (EIAV). The viruses in this family have different characteristics [7, 9, 10]. This is especially so with respect to the onset of diseases such as AIDS, the viral loads and the success or failure in finding vaccines.

There are three known HIV viruses in the world today, HIV-0, HIV-1, and HIV-2 [8, 10, 11]. The latter two are of the most interest to our study. The HIV-1 is the predominant virus spreading around the globe. HIV-2, by contrast, is predominantly spread in certain parts of Africa, being found in about 10% of HIV cases in West Africa, and has recently been found to be spreading in some parts of India [8, 11]. While the onset of AIDS usually occurs within an average of 6 years of virtually all HIV-1 infections, those infected with HIV-2 are allowed a much longer time before the AIDS symptoms appear, if at all [8, 11, 12]. As for SIV, a few strains such as SIVcpz are more closely related to HIV-1, whereas most of the others, especially SIVsm, and SIVmac, are closer to HIV-2 [10]. It should be noted that SIV does not usually cause AIDS among African non-human primates [13]. It does however cause AIDS among Asia monkeys [14].

Similar to HIV and SIV, EIAV is another retrovirus [8, 9], which, however, spreads by insects, and the host targets are non-cd4 white blood cells such as macrophages and monocytes [8, 15]. The disease caused by EIAV is not usually as fatal to its host as that of HIV and ~90% of infected equine recovers from an initial onset of symptoms [8]. While the search for vaccines for HIV continues to be difficult and elusive, effective vaccine for EIAV had been found 20 years ago in China [10, 15, 16]. A major difficulty facing the search for HIV vaccines is a puzzling problem of the inability of HIV protein-binding antibodies in eliciting effective broad immune response [17]. While the reason for this remains largely unknown [18], a finding of high levels of intrinsically disordered proteins at the surface, envelope or perhaps, matrix could provide a mechanism by which the HIV virus evades the immune response.

Therefore, these data clearly show that related viruses might affect their hosts differently, possessing variable virulence and different modes of interaction with their host's immune systems. A question then arises is whether some of the mentioned variability in the behavior can be reflected in some peculiar features of the corresponding viral proteins. This paper examines matrix proteins of several related viruses using computational tools such as intrinsic disorder predictors to search for the crucial differences in the levels and distributions of intrinsic disorder in the matrix proteins.

The concept of protein intrinsic disorder is used in this paper to investigate characteristics pertaining to the various viral matrix proteins. Intrinsically disordered proteins have been described by other names such as "intrinsically unstructured" [19, 20], "natively unfolded" [21, 22], and "natively disordered" [23] among others. Historically, the investigation of intrinsic disorder began with finding and characterizing several proteins-exceptions from the paradigm stating that unique rigid protein structure is an unavoidable prerequisite for the specific protein function. Although such counterexamples were periodically observed, it was not till the end of the last century when researchers started to pay significant attention to this phenomenon [24]. As a result, the last decade witnessed the real rise of unfoldomics, a new field of protein science dealing with the various aspects of IDPs. It is recognized now that many crucial biological functions are performed by proteins which lack ordered tertiary and/or secondary structure; i.e., by IDPs [1921, 2331]. The fact that amino acid sequences/compositions of IDPs and ordered proteins are rather different was utilized to develop numerous disorder predictors, which became instrumental in the pursuit of a greater understanding of intrinsic disorder. Access to important information on many of these predictors is provided via the DisProt database [32]. In this paper, we utilized two members of the PONDR® family of disorder predictors, VLXT and VL3 [3338], to examine the matrix proteins of the various viruses especially those related to HIV. PONDR® VL3 was chosen because of its high accuracy in the prediction of long disordered regions [36], whereas PONDR® VLXT was shown to be extremely sensitive for finding function-related disordered regions [35, 39, 40]. Uniqueness of this study is in the fact that we applied disorder predictors to proteins with known 3D-structure. This approach revealed some peculiar patterns of PID that can be used to better understand behavior of the HIV matrix proteins.

Results

Quantifying Disorder by Calculating the Percentage of Predicted Disordered Residues

Table 1 represents the estimations of the percentage of predicted disordered residues in the analyzed matrix and capsid proteins. Even though influenza virus is quite unrelated to lentiviruses, its M1 matrix protein is placed here for comparison. It is important to remember also that the M1 protein is believed to be evolutionarily and structurally related to the p17 matrix protein of HIV. Table 1 shows that the amount of intrinsic disorder varies from 20 to 61%, and from 0 to 40%, being evaluated by PONDR® VLXT and VL3 respectively. Data for the four matrix proteins with known 3-D structures are further illustrated by Figure 1 showing the results of the PONDR® VLXT analysis as bar chart. High level of predicted intrinsic disorder in SIV and HIV-1 matrix proteins is clearly seen. In our earlier paper [41], the following classification of proteins characterized by X-ray crystallography but possessing various levels of predicted disorder was introduced: proteins with percentage of residues predicted to be disordered by PONDR® VLXT between 20–29% were considered moderately disordered; those in the range of 30–39% were considered as quite disordered by prediction; whereas, proteins that were disordered 40% and above were considered as very disordered by prediction. Therefore, the influenza M1 protein and EIAV p15 should be considered as moderately disordered by prediction. By the same rule, the SIVmac and HIV-1 p17 matrix proteins have to be considered as highly disordered by prediction.
Table 1

A summary of matrix proteins and their percentages of disordered residues in a chain.

Protein

Virus

Accession

% Predicted Disorder

M1

Influenza A

1ea3

25 (0)

Matrix (p17)

SIVmac

1ed1

52 (40)

Matrix (p17)

HIV-1

1hiw

61 (39)

Matrix(p15)

EIAV

1hek

21 (12)

Capsid

HIV-1

1afv

48 (0)

Capsid

EIAV

1eia

30 (12)

All of the samples analyzed were structurally characterized using X-ray crystallography. Percentage of predicted disorder corresponds to values produced by the PONDR® VLXT (VL3) analyses

Figure 1

A bar chart comparing matrix proteins across virus types.

PONDR/B-Factor Plots and Contact Points

While Figure 1 and Table 1 represent the predicted disorder of whole polypeptide chains, the PONDR® VLXT plots in Figure 2 represent per-residue distributions of disorder scores. They can be used to measure and compare factors that are not easily quantifiable. For example, Figure 2 allows us to correlate the protein-protein contact sites (when such data are available) with the disorder score profiles. It also compares the normalized B-factor values [42] with the PONDR® VLXT plots.
Figure 2

PONDR/B-factor plots of matrix proteins of various viruses A) the influenza A M1 protein B) the SIV mac P17 protein. C) HIV-1 p17 Matrix protein D) The EIAV p15 Matrix protein. Protein-protein contacts between chains are annotated by thick gray horizontal spots. The normalized B-factor values are seen in the light gray curves. PONDR-VLXT scores are seen in the black curve in each plot.

Analysis of Figure 2 shows that contact sites (shown by thick horizontal gray lines) always correlate either with high B-factors or with high PONDR® VLXT scores suggesting that highly flexible regions of matrix proteins are responsible for protein-protein interactions. For example, contacts between the subunits of HIV-1 p17 are located near or within regions predicted to be disordered, whereas contact sites of the EIAV p15 are mostly located in regions with high B-factor. These observations are in a good agreement with earlier studies which established the usefulness of intrinsic disorder for protein-protein interaction [1921, 23, 25, 2931, 35, 39, 40, 4346].

3-D Structures with Predicted Disorder

Figure 3 provides 3-D representations of the matrix proteins from various viruses. The areas in magenta are the protein regions predicted to be disordered by PONDR® VL3 (and probably PONDR® VLXT also), whereas the regions marked in red are those predicted to be disordered by PONDR® VLXT. Different colors such as yellow and green are used to denote different subunit regions. This presentation of structured proteins allows visualization of regions with the intrinsic propensity for being highly flexible.
Figure 3

The three dimensional structures of the matrix proteins of the various viruses with predicted disorder annotation by red and magenta colors. A) The influenza m1 protein B) SIVmac p17 Protein C) HIV-1 p17 protein shown as a monomer D) The EIAV p15 E) The HIV-1 p17 shown as a multimer. The regions in magenta are regions predicted to be disordered by VL3 (and probably also by VLXT). By contrast, the regions in red are areas predicted to be disordered by VLXT.

Discussion

HIV-1 Versus HIV-2 and SIVmac: Missing Regions Predicted to be Disordered

SIVmac is Very Similar to HIV-2

The HIV-2 and HIV-1 viruses, while related, differ in substantial ways in term of immune response, infection, and the onset of AIDS [8, 11, 12]. SIVmac is a subtype of SIV, which was first found in macaques and is known to be very closely related to HIV-2 [10]. While development of AIDS symptoms are seen in virtually all HIV-1 infected patients, AIDS symptoms of HIV-2 infection appears only in a small percentage of patients. We believe that a comparative analysis of PID in related viral proteins could shed some light on the reasons behind these behaviors.

PID Rates of Matrix Proteins Correlate with the Difficulties in Finding Vaccines

A brief glance at Table 1 and Figure 1 shows that the PID rates of SIVmac and HIV-1 are quite similar, even though the percentage of PID in SIVmac p17 (50% by PONDR® VLXT) is smaller than that in HIV-1 p17 (61%). The similarity in the level of PID is likely indicative of the ability of both viruses to evade the immune system. Further support for this hypothesis can be retrieved by analyzing the level of predicted disorder in the influenza M1 protein and in the EIAV p15 protein. Matrix proteins of both of the viruses have low percentage disorder rates, 25% in M1 and 21% in p15. Interestingly, effective vaccines were developed for both of these viruses, even though the mutation rates of the influenza virus is extremely high causing well-known difficulties in the development of new vaccines. Apparently, the PID rate is a good predictor of the ease of vaccine development of a virus. This should not be surprising as our earlier study [41] suggested that the viral matrix likely helps viruses to evade detection by the immune system due to its highly dynamic nature and constant motions. This dynamic behavior is correlated with the high propensity of matrix proteins for intrinsic disorder. Furthermore, it has been hypothesized that the role may be intertwined with the glycoprotein on the surface acting as a broom in a sweeping motion provided by the matrix [41]. This highly dynamic nature of the viral surface may explain the difficulties in the development of vaccine for HIV.

Qualitative Differences in Predicted Disorder and Protein-Protein Interactions

Even though the rates of predicted disorder in the SIVmac and HIV-1 p17 proteins seem to be similarly high, the PONDR® VLXT plots revealed subtle differences in the disorder distribution within the protein sequences. Figures 2B and 2C show that a long region predicted to be disordered by HIV-1 p17 (53–76 fragment) is missing in SIVmac p17. Figures 3B, 3C, and 3E illustrate that this fragment in HIV-1 p17 forms an α-helix and is involved in protein-protein interactions between the subunits. In fact, residues 70–73 from one subunit contact with residues 71, 60, 40, and 46 from another subunit. Analysis of Figure 2C revealed that all these inter-subunit contact sites are located within the PID regions. Therefore, intrinsic disorder plays a crucial role in the inter-subunit interactions, which can be classified as disorder-disorder type of contact. The lack of a predicted to be disordered segment in HIV-2 and SIVmac which seems to be crucial for inter-subunit contacts suggests that disorder-disorder protein-protein interactions are replaced by the order-disorder or order-order interactions.

Predicted Disorder Patterns Correlate with High B-Factors

Figure 2 shows that, in general, there is a rather good correlation between the predicted disorder patterns and the normalized B-factor curves. For example, the 79–95 fragment of the HIV-1 matrix protein is both predicted to be disordered and is characterized by the high normalized B-factor values (Figure 2C, 1hiw.pdb). In several occasions, there are noticeable lags between the PONDR® VLXT and B-factor curves, as it is seen, e.g. in Figure 2A (M1 matrix proteins of the influenza A virus, 1ea3.pdb), where large B-factor peaks are seen in the 70–90 region, whereas the corresponding PID fragment is located in the 90–105 region.

HIV versus EIAV: Higher Predicted Disorder in HIV

Matrix of EIAV Is Relatively Ordered

Matrix protein of EIAV was predicted to be less disordered than that of HIV (see Table 1 and Figure 1). However, even in this case less abundant PID regions could be crucial for the inter-subunit interactions. In fact, analysis of the crystal structure of the p15 protein revealed that residues 46 and 78 of the chain A are involved in interaction with the residues 114 and 105 of the chain B. All these interaction sites are shown as thick gray lines in Figure 2D, which clearly indicates that the interactions between the 15 subunits are less rigorous than that of HIV-1 p17 subunits and can be ascribed to the order-disorder contact type. Since EIAV is from the same genus as HIV, that is, lentiviranae [9, 15], these data suggest that the high PID levels are not a common characteristics of the retroviradae family, or even the lentiviranae genus. Apparently, the high level of intrinsic disorder in the matrix proteins is a characteristic feature of HIV-1 and its closest relatives, SIV and HIV-2. These differences in the abundance of disorder seem to be largely constrained to the matrix proteins as the capsids of both HIV-1 and EIAV viruses are quite disordered by prediction (48% and 30% by PONDR® VLXT, see Table 1).

Predicted Disorder Patterns of EIAV Are Closer to Those of Influenza than of disorder patterns of HIV/SIV

Analysis of Figure 2 reveals that the pattern of the predicted disorder in EIAV matrix protein is closer to that of the influenza virus than to the disorder profiles of the EIAV's cousins HIV and SIV. Furthermore, EIAV and Influenza A matrix proteins are similar in their relatively low percentages of the predicted disorder (21% in EIAV and 25% in Influenza). The other similarity has to do with the interaction mode between the matrix protein subunits. In fact, contact sites of both Influenza A and EIAV matrix proteins can be classified as disorder-order contacts. In the case of HIV-1, most of the contact sites between the subunits are predicted disorder-disorder interactions. Comparison of the disorder and B-factor profiles of the HIV-1 and SIVmac p17 proteins allows extrapolation to be made of the potential modes of inter-subunit interactions in SIVmac p17. In fact, if potential interaction sites are distributed similarly within the amino acid sequences of HIV-1 and SIVmac p17 proteins, then at least some of the SIVmac p17 inter-subunit interactions site can be assigned as disorder-disorder interactions (e.g. if the residue 111 of one SIVmac p17 subunit is in contact with the residue 97 from another subunit, then disorder-disorder contact takes place as both of these residues are predicted to be disordered, as seen in Figure 2B).

High Intrinsic Disorder and Immune Response

Potential Implications of More Disordered Matrix Proteins: Immune Evasion

The question then arose: What are the potential implications of more rigid or more disordered matrix proteins? It is likely that more rigid p17 proteins may be less effective in evading immune response. This may be a reason why HIV-2 and SIVmac are less pathogenic than HIV-1. It is generally assumed that HIV-2 is less pathogenic than HIV-1 because of the fact that HIV-2 has lesser affinity for CD4 than HIV-1. On the other hand, our data show that there are subtle but important differences between HIV-1 and SIVmac (HIV-2) in their patterns of predicted disorder distribution, which also might contribute to the virus's ability to evade the host immune system.

Implication to the Search for HIV Vaccines

Our findings might also have some implications to the search for HIV vaccines. One possibility is related to the use animal models and SIVmac as in the search for HIV vaccination and drugs. SIVmac and SIVstm were the first subtypes found in laboratory macaques [13]. Asian primates such as macaques, unlike their African cousins, developed AIDS on the average of 10 years after infection [8]. For this reason, the use of SIV on Asian monkeys has become the standard animal model [47]. However, the extrapolation of data from animal models to HIV in human remains a challenge. Our results suggest that some of these challenges could be explained by the differences in disorder prediction between HIV-1 and SIV (or HIV-2). It is also important to remember that although the high levels of mutation caused difficulties in the development of vaccines against new strains of the influenza, there are effective vaccines against specific strains of the virus. Similarly, there are also effective vaccines available of EIAV. Note, matrix proteins of both influenza virus and EIAV are shown in our study to contain less amount of intrinsic disorder.

Joint Role of Glycoproteins and Matrix Disorder

It is established that the HIV envelope glycoprotein gp120 is one of the most glycosylated proteins in nature [48]. Oligosaccharide moieties of viral glycoproteins often hide them from recognition by immune agents such as antibodies [16]. We propose that abnormally disordered matrix proteins might help the surface glycoprotein in eluding immune responses. In other words, intrinsic disorder (read high dynamics) underneath the envelope would work in a tandem with envelope glycoproteins to help viruses in the avoiding of the induction of immune response. The questions then arose: How and why would surface glycoprotein and matrix disorder work in cooperation? A likely scenario is shown in Figure 4. Here, the oligosaccharide moieties of the glycoproteins act as an entropic brush that protects viral surface proteins such as gp120 and gp41 from contacts with immune agents such as antibodies. The matrix protein could then provide the additional motion to the sweep. An advantage of motions that resemble a broom in a sweep is that it enables some regulatory roles via the matrix protein. Earlier it has been already observed that the envelope proteins are very sensitive to the behavior of the matrix proteins [3].
Figure 4

Schematic diagram: glycoconjugate acts as a broom with sweeping motion arising from matrix. The striped arrows depict the motions of oligosaccharides arising from the bobbing of the lipid bilayer. The motions of the membrane is also dependent on the matrix for stability or lack of it. The motions of the oligosaccharide may allow and prevent the binding of CD4 and antibodies respectively.

Conclusion

Matrix Disorder of Retroviruses Varies with Nature of the Virus

A peculiar finding of this paper is the pattern of predicted disorder of EIAV p15 matching more closely the disorder profile of the influenza M1 protein than those of the matrix proteins of its closer relatives, namely the HIV-1 and SIVmac p17 proteins. This feature may be attributed to the ways the viruses are evolved and are transmitted to their hosts. It should be reminded that EIAV is transmitted between horses via insect vectors. In other words, the virus experience dramatic change in the environment during the transmission. It is likely that this mode of transmission has evolutionary requirements similar to those of the influenza virus, which is transmitted via respiratory tract and mucus. HIV and SIV, on the other hand, spread by blood contact or sexual activities. Since it there lesser chance for the exposure to the outside environment in the transmission mode, there is hence lesser evolutionary pressure for the matrix proteins to be ordered. This highlights a role for the matrix protein in many viruses. In many instances, the matrix acts as an encasement for the virion, thereby protecting the virion from damage especially in adverse environments. We have also seen that disorder at the matrix is not an absolute characteristic of retroviruses.

Implication for the Immune System Invisibility Puzzle of HIV

A single nagging puzzle in the search for vaccines against HIV is the unknown mechanisms helping the virus to evade immune response. Our study suggests that this ability might arise from the abnormal levels of intrinsic disorder at the viral matrix. This hypothesis is supported by the fact that the matrix proteins of other viruses, where vaccines have been more easily found, were predicted to be more ordered. Therefore, there are several ways how disorder predictions can be utilized in the future strategies of the vaccine development. Particularly, one of the new directions in the anti-HIV drug development could be a search for the therapeutic agents able to stabilize the HIV matrix protein.

Another puzzle of HIV viruses is the inability of virologists to account for the waves of the HIV strains seen, even after taking into account the fact that the mutation rate of HIV-1 is 25-times that of influenza. Yet another HIV puzzle is the greater pathogenicity of HIV-1 as compared to HIV-2. It has been generally understood that this is due to the fact that the HIV-1 affinity for CD4 is 28 times greater than that of HIV-2 [49]. Our data suggest that it is not just the affinity for CD4 that give rise to a greater pathogenesis or viral load in HIV-1. Perhaps, it is also the differences in the abilities of the viruses in evading the immune system via disorder at the matrix. This also explains a related observation among epidemiologists that the more easily that HIV-1 spreads sexually the more virulent it becomes [8], since the ease of transmission via blood or sexual intercourse lessens the requirements for a rigid encasement of the virion, which is used in other viruses to prevent virion damage due to harsh environmental factors.

Potential implications for the Immune Evasion of Cancer Cells and Oncolysis

While this paper has been largely focused on the study of immune evasion as applied to HIV and HIV-related viruses, it may provide a model for immune evasion by other entities, such as cancer cells. There are either very few or no studies done in this area. Perhaps, our results could invigorate interest in this area, given the models and approach used. Furthermore, the results of this paper likely have novel strategic implications for experimental studies on the use of viruses as oncolytic agents, which have often been observed to be rendered ineffective by the immune system. In fact, one of the greatest problems in using the oncolytic viruses is that they are detected by the immune system very quickly so they are only useful for localized treatment of tumors [50]. Our data suggest that this does not have to be always the case and new oncolytic viruses with disordered matrix should be considered.

Methods

PDB Accessions

A full description of implementation techniques can be found in a previous paper [41]. The search for important proteins suitable for analysis was done using the Entrez website [51]. Proteins from retroviruses and relatives of HIV were carefully reviewed. The accession codes were grouped into two classes containing proteins whose structures were elucidated using NMR or X-ray diffraction. It should be also noted that suitable data were unavailable for HIV-2. SIVmac was used in lieu of HIV-2 since the two are genetically close and the X-ray diffraction data for EIAV matrix and capsid proteins were readily available.

Query Language

Given the appropriate accessions selected, JAVA programs were used to automatically place the necessary information into the MYSQL database. The data were often checked using the SQL (Sequel Query Language) [52].

PONDR® VLXT and PONDR® VL3

PONDR® (P redictor O f N atural D isordered R egions) is a set of neural network predictors of disordered regions on the basis of local amino acid composition, flexibility, hydropathy, coordination number and other factors. These predictors classify each residue within a sequence as either ordered or disordered. PONDR® VL-XT integrates three feed forward neural networks: the Variously characterized Long, version 1 (VL1) predictor from Romero et al. 2001 [33], which predicts non-terminal residues, and the X-ray characterized N- and C-terminal predictors (XT) from [53], which predicts terminal residues. Output for the VL1 predictor starts and ends 11 amino acids from the termini. The XT predictors output provides predictions up to 14 amino acids from their respective ends. A simple average is taken for the overlapping predictions; and a sliding window of 9 amino acids is used to smooth the prediction values along the length of the sequence. Unsmoothed prediction values from the XT predictors are used for the first and last 4 sequence positions.

PONDR® VL3 combines the predictions of 30 neural networks for the entire protein sequence and was trained using disordered regions from more than 150 proteins characterized by the methods mentioned above plus circular dichroism, limited proteolysis and other physical approaches [36].

Protein-Protein Contacts and PONDR Plots

In order to detect the locations of protein-protein contacts between the different chains of proteins (i.e., when atoms of neighboring chains are within 3.0 Å from each other), a JAVA program was written to check the interchain atom-atom distance. The program generated graphs with PONDR plots with locations of the protein-protein contacts.

Three Dimensional Analysis with Disorder Prediction

The JAVA programming language was used to generate codes readable by the molecular 3D software, Jmol [54]. In resulting structures, regions of predicted disorder were annotated by red (VLXT) or magenta (VL3). Areas shaded by magenta were also regions likely predicted to be disordered by VLXT.

B-Factor

B-Factor values were imported directly from the PDB files into the table access_seq [41] that was modified to accommodate for the B-factor values [42].

Declarations

Acknowledgements

This work was supported in part by the grants R01 LM007688-01A1 and GM071714-01A2 from the National Institutes of Health. We gratefully acknowledge the support of the IUPUI Signature Centers Initiative.

Authors’ Affiliations

(1)
Center for Computational Biology and Bioinformatics, Indiana University School of Medicine
(2)
Institute for Intrinsically Disordered Protein Research, Indiana University School of Medicine
(3)
Institute for Biological Instrumentation, Russian Academy of Sciences
(4)
Institute of Molecular Cell Biology

References

  1. Turner BG, Summers MF: Structural biology of HIV. J Mol Biol 1999, 285: 1-32. 10.1006/jmbi.1998.2354View ArticlePubMedGoogle Scholar
  2. Cannon PM, Matthews S, Clark N, Byles ED, Iourin O, Hockley DJ, Kingsman SM, Kingsman AJ: Structure-function studies of the human immunodeficiency virus type 1 matrix protein, p17. J Virol 1997, 71: 3474-3483.PubMed CentralPubMedGoogle Scholar
  3. Dorfman T, Mammano F, Haseltine WA, Gottlinger HG: Role of the matrix protein in the virion association of the human immunodeficiency virus type 1 envelope glycoprotein. J Virol 1994, 68: 1689-1696.PubMed CentralPubMedGoogle Scholar
  4. Harris A, Sha B, Luo M: Structural similarities between influenza virus matrix protein M1 and human immunodeficiency virus matrix and capsid proteins: an evolutionary link between negative-stranded RNA viruses and retroviruses. J Gen Virol 1999,80(Pt 4):863-869.View ArticlePubMedGoogle Scholar
  5. Hearps AC, Jans DA: Regulating the functions of the HIV-1 matrix protein. AIDS Res Hum Retroviruses 2007, 23: 341-346. 10.1089/aid.2006.0108View ArticlePubMedGoogle Scholar
  6. Lyles DS, McKenzie M, Parce JW: Subunit interactions of vesicular stomatitis virus envelope glycoprotein stabilized by binding to viral matrix protein. J Virol 1992, 66: 349-358.PubMed CentralPubMedGoogle Scholar
  7. Clements JE, Zink MC: Molecular biology and pathogenesis of animal lentivirus infections. Clin Microbiol Rev 1996, 9: 100-117.PubMed CentralPubMedGoogle Scholar
  8. Goudsmit J: Viral Sex: The Nature of AIDS. Oxford University Press, New York; 1997.Google Scholar
  9. Leroux C, Cadore JL, Montelaro RC: Equine Infectious Anemia Virus (EIAV): what has HIV's country cousin got to tell us? Vet Res 2004, 35: 485-512. 10.1051/vetres:2004020View ArticlePubMedGoogle Scholar
  10. Marx PA, Li Y, Lerche NW, Sutjipto S, Gettie A, Yee JA, Brotman BH, Prince AM, Hanson A, Webster RG, et al.: Isolation of a simian immunodeficiency virus related to human immunodeficiency virus type 2 from a west African pet sooty mangabey. J Virol 1991, 65: 4480-4485.PubMed CentralPubMedGoogle Scholar
  11. Jurriaans S, van Gemen B, Weverling GJ, van Strijp D, Nara P, Coutinho R, Koot M, Schuitemaker H, Goudsmit J: The natural history of HIV-1 infection: virus load and virus phenotype independent determinants of clinical course? Virology 1994, 204: 223-233. 10.1006/viro.1994.1526View ArticlePubMedGoogle Scholar
  12. Morgan D, Mahe C, Mayanja B, Whitworth JA: Progression to symptomatic disease in people infected with HIV-1 in rural Uganda: prospective cohort study. BMJ 2002, 324: 193-196. 10.1136/bmj.324.7331.193PubMed CentralView ArticlePubMedGoogle Scholar
  13. Chen Z, Telfer P, Reed P, Zhang L, Getti A, Ho DD, Marx PA: Isolation and characterization of the first simian immunodeficiency virus from a feral sooty mangabey (Cercocebus atys) in West Africa. J Med Primatol 1995, 24: 108-115.View ArticlePubMedGoogle Scholar
  14. Apetrei C, Lerche NW, Pandrea I, Gormus B, Silvestri G, Kaur A, Robertson DL, Hardcastle J, Lackner AA, Marx PA: Kuru experiments triggered the emergence of pathogenic SIVmac. AIDS 2006, 20: 317-321.View ArticlePubMedGoogle Scholar
  15. Beyrer C: Injecting drug users and HIV vaccine trials: What does the science say? AIDScience 2002, 2: 1-6.Google Scholar
  16. Burton DR, Stanfield RL, Wilson IA: Antibody vs. HIV in a clash of evolutionary titans. Proc Natl Acad Sci USA 2005, 102: 14943-14948. 10.1073/pnas.0505126102PubMed CentralView ArticlePubMedGoogle Scholar
  17. McMichael A, Mwau M, Hanke T: Design and tests of an HIV vaccine. Br Med Bull 2002, 62: 87-98. 10.1093/bmb/62.1.87View ArticlePubMedGoogle Scholar
  18. Burton DR: Antibodies, viruses and vaccines. Nat Rev Immunol 2002, 2: 706-713. 10.1038/nri891View ArticlePubMedGoogle Scholar
  19. Wright PE, Dyson HJ: Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. J Mol Biol 1999, 293: 321-331. 10.1006/jmbi.1999.3110View ArticlePubMedGoogle Scholar
  20. Tompa P: Intrinsically unstructured proteins. Trends Biochem Sci 2002, 27: 527-533. 10.1016/S0968-0004(02)02169-2View ArticlePubMedGoogle Scholar
  21. Uversky VN, Gillespie JR, Fink AL: Why are "natively unfolded" proteins unstructured under physiologic conditions? Proteins 2000, 41: 415-427. 10.1002/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7View ArticlePubMedGoogle Scholar
  22. Weinreb PH, Zhen W, Poon AW, Conway KA, Lansbury PT Jr: NACP, a protein implicated in Alzheimer's disease and learning, is natively unfolded. Biochemistry 1996, 35: 13709-13715. 10.1021/bi961799nView ArticlePubMedGoogle Scholar
  23. Daughdrill GW, Pielak GJ, Uversky VN, Cortese MS, Dunker AK: Natively disordered proteins. In Protein Folding Handbook. Edited by: Buchner J, Kiefhaber T. Wiley-VCH, Verlag GmbH & Co. KGaA, Weinheim, Germany; 2005:271-353.Google Scholar
  24. Dunker AK, Oldfield CJ, Meng J, Romero P, Yang JY, Cheng JW, Vacic V, Obradovic Z, Uversky VN: The unfoldomics decade: An update on intrinsically disordered proteins. BMC Genomics 2008, 9: S1. 10.1186/1471-2164-9-S2-S1View ArticleGoogle Scholar
  25. Uversky VN, Oldfield CJ, Dunker AK: Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling. J Mol Recognit 2005, 18: 343-384. 10.1002/jmr.747View ArticlePubMedGoogle Scholar
  26. Xie H, Vucetic S, Iakoucheva LM, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN: Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. J Proteome Res 2007, 6: 1917-1932.PubMed CentralView ArticlePubMedGoogle Scholar
  27. Vucetic S, Xie H, Iakoucheva LM, Oldfield CJ, Dunker AK, Obradovic Z, Uversky VN: Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. J Proteome Res 2007, 6: 1899-1916.PubMed CentralView ArticlePubMedGoogle Scholar
  28. Xie H, Vucetic S, Iakoucheva LM, Oldfield CJ, Dunker AK, Uversky VN, Obradovic Z: Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J Proteome Res 2007, 6: 1882-1898.PubMed CentralView ArticlePubMedGoogle Scholar
  29. Dunker AK, Brown CJ, Lawson JD, Iakoucheva LM, Obradovic Z: Intrinsic disorder and protein function. Biochemistry 2002, 41: 6573-6582. 10.1021/bi012159+View ArticlePubMedGoogle Scholar
  30. Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS, Oldfield CJ, Campen AM, Ratliff CM, Hipps KW, Ausio J, Nissen MS, Reeves R, Kang C, Kissinger CR, Bailey RW, Griswold MD, Chiu W, Garner EC, Obradovic Z: Intrinsically disordered protein. J Mol Graph Model 2001, 19: 26-59. 10.1016/S1093-3263(00)00138-8View ArticlePubMedGoogle Scholar
  31. Dyson HJ, Wright PE: Intrinsically unstructured proteins and their functions. Nat Rev Mol Cell Biol 2005, 6: 197-208. 10.1038/nrm1589View ArticlePubMedGoogle Scholar
  32. Sickmeier M, Hamilton JA, LeGall T, Vacic V, Cortese MS, Tantos A, Szabo B, Tompa P, Chen J, Uversky VN, Obradovic Z, Dunker AK: DisProt: the Database of Disordered Proteins. Nucleic Acids Res 2007, 35: D786-793. 10.1093/nar/gkl893PubMed CentralView ArticlePubMedGoogle Scholar
  33. Romero P, Obradovic Z, Li X, Garner EC, Brown CJ, Dunker AK: Sequence complexity of disordered protein. Proteins 2001, 42: 38-48. 10.1002/1097-0134(20010101)42:1<38::AID-PROT50>3.0.CO;2-3View ArticlePubMedGoogle Scholar
  34. Vucetic S, Brown CJ, Dunker AK, Obradovic Z: Flavors of protein disorder. Proteins 2003, 52: 573-584. 10.1002/prot.10437View ArticlePubMedGoogle Scholar
  35. Garner E, Romero P, Dunker AK, Brown C, Obradovic Z: Predicting binding regions within disordered proteins. Genome Inform Ser Workshop Genome Inform 1999, 10: 41-50.PubMedGoogle Scholar
  36. Obradovic Z, Peng K, Vucetic S, Radivojac P, Brown CJ, Dunker AK: Predicting intrinsic disorder from amino acid sequence. Proteins 2003,53(Suppl 6):566-572. 10.1002/prot.10532View ArticlePubMedGoogle Scholar
  37. Dunker AK, Garner E, Guilliot S, Romero P, Albrecht K, Hart J, Obradovic Z, Kissinger C, Villafranca JE: Protein disorder and the evolution of molecular recognition: theory, predictions and observations. Pac Symp Biocomput 1998, 473-484.Google Scholar
  38. Romero P, Obradovic Z, Kissinger CR, Villafranca JE, Garner E, Guilliot S, Dunker AK: Thousands of proteins likely to have long disordered regions. Pac Symp Biocomput 1998, 437-448.Google Scholar
  39. Oldfield CJ, Cheng Y, Cortese MS, Brown CJ, Uversky VN, Dunker AK: Comparing and combining predictors of mostly disordered proteins. Biochemistry 2005, 44: 1989-2000. 10.1021/bi047993oView ArticlePubMedGoogle Scholar
  40. Cheng Y, Oldfield CJ, Meng J, Romero P, Uversky VN, Dunker AK: Mining alpha-helix-forming molecular recognition features with cross species sequence alignments. Biochemistry 2007, 46: 13468-13477. 10.1021/bi7012273PubMed CentralView ArticlePubMedGoogle Scholar
  41. Goh GK-M, Dunker AK, Uversky VN: Protein intrinsic disorder toolbox for comparative analysis of viral proteins. BMC Genomics 2008, 9: S4. 10.1186/1471-2164-9-S2-S4PubMed CentralView ArticlePubMedGoogle Scholar
  42. Radivojac P, Obradovic Z, Smith DK, Zhu G, Vucetic S, Brown CJ, Lawson JD, Dunker AK: Protein flexibility and intrinsic disorder. Protein Sci 2004, 13: 71-80. 10.1110/ps.03128904PubMed CentralView ArticlePubMedGoogle Scholar
  43. Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN: Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J 2005, 272: 5129-5148. 10.1111/j.1742-4658.2005.04948.xView ArticlePubMedGoogle Scholar
  44. Uversky VN: What does it mean to be natively unfolded? Eur J Biochem 2002, 269: 2-12. 10.1046/j.0014-2956.2001.02649.xView ArticlePubMedGoogle Scholar
  45. Uversky VN: Natively unfolded proteins: a point where biology waits for physics. Protein Sci 2002, 11: 739-756. 10.1110/ps.4210102PubMed CentralView ArticlePubMedGoogle Scholar
  46. Uversky VN: Protein folding revisited. A polypeptide chain at the folding-misfolding-nonfolding cross-roads: which way to go? Cell Mol Life Sci 2003, 60: 1852-1871. 10.1007/s00018-003-3096-6View ArticlePubMedGoogle Scholar
  47. Hulskotte EG, Geretti AM, Osterhaus AD: Towards an HIV-1 vaccine: lessons from studies in macaque models. Vaccine 1998, 16: 904-915. 10.1016/S0264-410X(97)00292-2View ArticlePubMedGoogle Scholar
  48. Vigerust DJ, Shepherd VL: Virus glycosylation: role in virulence and immune interactions. Trends Microbiol 2007, 15: 211-218. 10.1016/j.tim.2007.03.003View ArticlePubMedGoogle Scholar
  49. Chakravarty J, Mehta H, Parekh A, Attili SV, Agrawal NR, Singh SP, Sundar S: Study on clinico-epidemiological profile of HIV patients in eastern India. J Assoc Physicians India 2006, 54: 854-857.PubMedGoogle Scholar
  50. Chernajovsky Y, Layward L, Lemoine N: Fighting cancer with oncolytic viruses. BMJ 2006, 332: 170-172. 10.1136/bmj.332.7534.170PubMed CentralView ArticlePubMedGoogle Scholar
  51. Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res 2000, 28: 235-242. 10.1093/nar/28.1.235PubMed CentralView ArticlePubMedGoogle Scholar
  52. Pratt PJ, Adamski JJ: Concepts of Database Management. 4th edition. Thomson Course Technology, Boston, MA; 2002.Google Scholar
  53. Li X, Romero P, Rani M, Dunker AK, Obradovic Z: Predicting protein disorder for N-, C-, and internal regions. Genome Inform Ser Workshop Genome Inform 1999, 10: 30-40.PubMedGoogle Scholar
  54. Herráez A: Biomolecules in the computer: Jmol to the rescue. Biochemistry and Molecular Biology Education 2006, 34: 255-261. 10.1002/bmb.2006.494034042644View ArticlePubMedGoogle Scholar

Copyright

© Goh et al; licensee BioMed Central Ltd. 2008

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Advertisement