Reference gene selection for quantitative real-time PCR analysis in virus infected cells: SARS corona virus, Yellow fever virus, Human Herpesvirus-6, Camelpox virus and Cytomegalovirus infections

Ten potential reference genes were compared for their use in experiments investigating cellular mRNA expression of virus infected cells. Human cell lines were infected with Cytomegalovirus, Human Herpesvirus-6, Camelpox virus, SARS coronavirus or Yellow fever virus. The expression levels of these genes and the viral replication were determined by real-time PCR. Genes were ranked by the BestKeeper tool, the GeNorm tool and by criteria we reported previously. Ranking lists of the genes tested were tool dependent. However, over all, β-actin is an unsuitable as reference gene, whereas TATA-Box binding protein and peptidyl-prolyl-isomerase A are stable reference genes for expression studies in virus infected cells.


Background
Quantitative real-time PCR (QPCR) has become the favoured tool in mRNA expression analysis and also in virus diagnostics [1]. Real-time PCR has outperformed classical and semi-quantitative PCR methods in terms of accuracy, reproducibility, safety and convenience for the precise monitoring of viral load in clinical material, as well as for the investigation of the expression of cellular genes in response to virus infection. However, the most prominent problem in quantitative mRNA expression analysis is the selection of an appropriate control gene. For years, the glyceraldehyde 3-phosphate dehydrogenase (GAP) gene and the β-actin (Act) gene were used as control genes in classical molecular methods for RNA detection. Recently, evidence accumulated that especially these two genes, GAP and Act, are unsuitable controls in quantitative mRNA expression analysis due to setting depend-ent variations in expression [2][3][4]. Recently, we have confirmed these results by investigating the expressional stability of 13 potential reference genes in 16 different tissues and presented more suitable genes like the RNA polymerase II gene [5]. However, an evaluation of reference genes in virus infected cells has not been performed so far. Therefore, the selection of the 10 most promising reference genes, GAP, Act, peptidyl prolyl isomerase A (PPI), glucose 6-phosphate dehydrogenase (G6P), TATA-Box binding protein (TBP), β2-microglobulin (β2M), αtubulin (Tub), ribosomal protein L13 (L13), phospholipase A2 (PLA) and RNA polymerase II (RPII) were evaluated in cell lines infected with members of different virus families: coronavirus (SARS-coronavirus), flavivirus (yellow fever virus, (YF)), herpesvirus (Human herpesvirus-6 (HHV-6) and cytomegalovirus (CMV)) and orthopoxvirus camelpox (CAMP), covering also DNA and RNA viruses.
Quantification of viral RNA was performed to proof and monitor infection. Thereafter the candidate reference genes were evaluated by the BestKeeper tool [6], the GeNorm tool [7] and the algorithm we described previously [5].

Results
An efficient infection could be evidenced by a significant increase of viral RNA or DNA for all 5 viruses over time (table 1). Despite progressing viral replication, the expression of some of the reference genes remained constant, while other genes were varying in expression according to accumulation of infected cells.
The experimentally obtained data for each virus and each gene were analysed using three different methods. The reference gene evaluation of the BestKeeper tool is shown in table 2. A low standard deviation (SD) of the C T values should be expected for useful reference genes and a high SD for genes that are susceptible to virus replication. Corresponding to the recent estimation the SD of the C T value was highest for Act in 4 of 5 viruses, indicating that Act is no reliable reference gene in this setting. In contrast, TBP and PPI displayed the highest expressional stability for 4 of 5 viruses. To find a general conclusion, the total of all SD values from all virus experiments (sum V ) was calculated for each reference gene. As shown in table 2, TBP and PPI seemed to be the least regulated genes in this analysis (sum v = 2.29 for both), followed by GAP (sum v = 3.49) and β2M (sum v = 3.96). All other genes showed moderate total SD values (sum v > 4.58), except Act (sum v = 11.28), confirming to be the most inappropriate reference gene. It is remarkable that the obtained BestKeeper index values are low, despite the inclusion of Act in the calculation. Calculating BestKeeper vs. each reference gene using Pearson correlation displayed very inconsistent results (table 3). Act showed the highest SD values in all virus infections, but a significantly high correlation. In contrast TBP displayed low correlation that was statistically not significant in most cases. When summing up the SD values of all reference genes for each virus infection (sum HRG ), it seems that CAMP infection caused the highest variations in reference gene expression.
Analysing the expression data with the GeNorm tool showed slightly deviant results (table 4). First, the value sum V , representing the SD of a reference gene over all viruses, was lowest for PPI (sum V = 6.08) confirming the results obtained by the Bestkeeper tool. However, β2M (sum V = 6.11), GAP (sum V = 6.19) and TBP (sum V = 6.29) turned out to be comparably reliable as reference genes. Second, also the GeNorm tool showed that Act is by far the worst reference gene (sum V = 14.20).
Applying the calculation mode presented previously [5], that is based on the calculation of ∆∆C T values (table 5), Act was most susceptible to virus infection for 3 of 5 viruses and displayed the highest ∆∆C T value over all viruses (sum V = 45.23). The two genes with the lowest ∆∆C T value were TBP (sum V = 9.82) and PPI (sum V =

Discussion
To date, it is generally accepted, that the selection of the ideal reference gene in gene expression analysis has to be done for each individual experimental setting by evaluat-ing several genes and using the best two or three of these genes as reference. Obviously there is no "one good gene for all experiments" recommendation. However, it is helpful to find putative candidates that can be shortlisted when setting up a new experimental design. Therefore, we determined the expression of previously tested reference genes in a setting of virus infected human cell lines. Capa-   ble reference genes were evaluated using three independent methods: Bestkeeper, GeNorm and the ∆∆C T method, and their results were compared.
All three tools ranked actin at the last position, indicating that it is an unsuitable reference gene in virus infected cells. The actin gene shows significant variations with increasing degree of infection.
The best genes obtained from all three calculation tools were TBP and PPI. TBP seems to be a relative stable expressed gene during the course of virus replication of different viruses in different cells. However, as previously shown [5] TBP is not expressed in all tissues and therefore its use may be limited.
Interestingly, classical reference genes like β2M and GAP were also acceptable regarding to a stable expression in virus infected cells. All other genes showed moderate expression stability. However, one problem still remains to be solved; both tools, the BestKeeper and the GeNorm, can not compare paired probes. This is the great advantage of the ∆∆C T method, or any other method which directly compares paired samples. From this point of view the use of a method like the ∆∆C T should be applied first before considering additional tools for further elucidation of the acquired data.

Conclusions
In summary, TBP and PPI turned out to be the best reference genes in virus infected cells. These genes are a good point to start reference gene selection in gene expression studies in virus infection experiments.
The respective MOI and time of cell culture are shown in table 1 and were chosen to allow maximal infection as determined by immunofluorescence and real-time PCR [8][9][10][11]. For kinetic studies, cells were harvested at several time points (table 1) and RNA was extracted. The RNA transcription level of putative reference genes was determined by quantitative real-time PCR as described below.

Extraction of RNA
Total RNA from 1 × 10 6 cells was prepared using the QIAamp RNA Blood Mini Kit and RNase-free DNase set (Qiagen, Hilden, Germany) according to the manufacturer's recommendations for cultured cells. RNA solution was treated with DNA-free (Ambion, Huntingdon, United Kingdom).
cDNA synthesis cDNA was produced using the Superscript III RT-PCR System (Invitrogen, Karlsruhe, Germany) according to the manufacturer's recommendations for oligo(dT) 20 primed cDNA-synthesis. cDNA synthesis was performed using 1 µg of RNA, at 50°C. Finally, cDNA was diluted 1:5 before use in QPCR.

Quantitative TaqMan PCR
Primers, TaqMan probes and QPCR conditions for reference gene analysis were used as previously described [5]. PCR was performed in a Perkin Elmer 7700 Sequence Detection System in 96-well microtiter plates using a final volume of 25 µl.

Calculations
Analysis was performed with the BestKeeper [6] and GeNorm [7] tools. The ∆∆C T value was calculated as follows: First the ∆C T for each time point of probe assessment between virus and Mock infected cells was calculated. In a second step the maximal differences between the time points were calculated as ∆∆C T .