- Open Access
Identification of stage-related and severity-related biomarkers and exploration of immune landscape for Dengue by comprehensive analyses
Virology Journal volume 19, Article number: 130 (2022)
At present, there are still no specific therapeutic drugs and appropriate vaccines for Dengue. Therefore, it is important to explore distinct clinical diagnostic indicators.
In this study, we combined differentially expressed genes (DEGs) analysis, weighted co-expression network analysis (WGCNA) and Receiver Operator Characteristic Curve (ROC) to screen a stable and robust biomarker with diagnosis value for Dengue patients. CIBERSORT was used to evaluate immune landscape of Dengue patients. Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and Gene set enrichment analysis (GSEA) were applied to explore potential functions of hub genes.
CD38 and Plasma cells have excellent Area Under the Curve (AUC) in distinguishing clinical stages for Dengue patients, and activated memory CD4+ T cells and Monocytes have good AUC for this function. ZNF595 has acceptable AUC in discriminating dengue hemorrhagic fever (DHF) from dengue fever (DF) in whole acute stages. Analyzing any serotype, we can obtain consistent results. Negative inhibition of viral replication based on GO, KEGG and GSEA analysis results, up-regulated autophagy genes and the impairing immune system are potential reasons resulting in DHF.
CD38, Plasma cells, activated memory CD4+ T cells and Monocytes can be used to distinguish clinical stages for dengue patients, and ZNF595 can be used to discriminate DHF from DF, regardless of serotypes.
Dengue was listed by the World Health Organization (WHO) as one of the top ten global health threats announced at the beginning of 2019 . In the past few decades, Dengue has become the fastest-growing mosquito borne disease in the world [2,3,4], seriously endangering human health [5, 6]. Vaccine research and development continue to make progress [7,8,9,10,11,12], but the antibody-dependent enhancement (ADE) limits the effectiveness of vaccines [13,14,15,16,17]. Asymptomatic infections increase incidence of dengue [16, 18] and effective treatments have not been identified. Therefore, it’s urgent to explore the pathogenic mechanism of dengue fever and screen out molecular markers for better diagnosis and treatments.
Autophagy, a catabolic process that degrades damaged or abnormal intracellular components to recover nutrients, is essential for maintaining cell and body homeostasis [19, 20], and benefits to proliferation and infection of the Dengue virus (DENV) [21,22,23,24]. In DENV-ADE infection, cross-reactive antibodies mediate infection by inducing autophagy related proteins, and then suppress the innate immunity mediated by the mitochondria antiviral protein (MAVS) . Immune response affects directly or indirectly host response to DENV in varying degrees, including symptomatic infection, asymptomatic infection [26, 27], dengue shock syndrome (DSS) and dengue hemorrhagic fever (DHF) [28,29,30]. Therefore, it is essential to explore autophagy and immune response during DENV infection.
Transcriptomics researches are beneficial to assist researchers in better understanding disease causes  and locating biomarkers [32,33,34]. Our precious transcriptomics researches contributed to understand viral evolution and its impact on pathogenicity and vaccine development of DENV [35,36,37,38]. However, studies [39,40,41] published focused on multi-gene researches, and single analytical method (differentially expressed genes (DEGs) analyses), and did not link genomics with immune landscapes. In this study, we used a combination of DEG analyses, weighted co-expression network analysis (WGCNA) and Receiver Operator Characteristic Curve (ROC) to identify, validate and test biomarkers with diagnostic value of stages and severity in independent datasets, and applied the CIBERSORT website to analyze immune landscape differences among three stages and between DHF and Dengue Fever (DF) and explore correlations between genes and immune cells.
Materials and methods
Gene expression datasets of Dengue patients were selected from a public database named the Gene Expression Omnibus databases (GEO, http://www.ncbi.nlm.nih.gov/geo). We followed these reference points: 1. datasets analyzed whole blood samples of different stages from Dengue patients including DF and DHF, and normal samples; 2. Datasets should include at least 20 dengue patients. Based on these criteria, we obtained GSE43777 dataset analyzing Two chip types (affymetrix HG-U133 plus 2 in GLP570 platform and HG-Focus in GLP201 platform) , GSE28405 dataset  and GSE51808 dataset .
In GSE43777, more than 200 samples (one sample for each stage) were collected from 51 DF and 13 DHF subjects in Maracay, Venezuela, and demographics and clinical, immunological and hematological characteristics of participants were preciously summarized . In GSE28405, 31 clinically undifferentiated DENV RT-PCR positive patient samples were selected in Singapore within 72 h (Early Acute stage, EA), 4–7 days (Late Acute stage, LA) and 3–4 weeks (Convalescence stage, C) after self-reported fever onset, and other clinical information had been recorded as preciously described . In GSE51808, whole blood samples were obtained from 47 Dengue patients (DF n = 31, DHF = 16) hospitalized at the Siriraj Hospital in Bangkok within days 2 and at 4 weeks or later after discharge (the Convalescence stage) and 9 normal subjects were analyzed， and detail clinical information were also showed .
Regarding gene expression levels in C or EA stages as baselines, these patients were divided into 3 groups, C vs EA, C vs LA and EA vs LA, as precious studies . The Principal Component Analysis (PCA) analysis  was used to explore whether different stages can be distinguished clearly.
DEGs analysis and functional annotation of DEGs
A website (http://sangerbox.com/Tool) based on a R package “Limma” was applied to analyze DEGs among three phases (we showed the analysis regardless of serotypes and also performed separate analysis for each serotype) and between DHF and DF for Dengue patients. A R package “clusterProfiler” in R version 4.1.0 and a gene set enrichment analysis (GSEA) software (4.1.0)  were used to performed Gene Ontology (GO) enrichment analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis and GSEA to explore potential bio-function and enrichment pathways for DEGs.
WGCNA analysis 
We obtained a total of 4843 genes in the C vs EA group, 4472 genes in the C vs the LA group and 4463 genes in the EA vs LA group by a “WGCNA” package. We converted the adjacency matrix into the topological overlap matrix (TOM) when the power of β were equal to 5, 4 and 12. According to a height cutoff of 0.25, we merged similar modules and the module displaying the highest conformity with the development disease of Dengue was used to intersect with DEGs to identify more stable DEGs.
Evaluation immune cells of the whole blood samples
we estimated fractions of immune cells in whole blood samples by the “CIBERSORT” website (https://cibersortx.stanford.edu/index.php) which can be used to input gene expression data and then obtain a fraction estimate of 22 immune cell types in whole blood samples .
ROC  was applied to identify, valid and test value of biomarkers and immune cells in distinguish stages and severity based on Area Under The Curve (AUC).
R software (version 4.1.0) and R. Studio (version 1.9.0) were used to analyze data and visualize results. We applied the t-test, Mann–Whitney U test and the chi-square test to identify differences between two groups. Correlation analyses were showed according to the Pearson and Spearman theories. P values < 0.05 was regarded as statistical significance.
Figure 1 shows illustration for this study.
Acquisition of DEGs and mechanisms analyses for dengue
The Principal Component Analysis (PCA) shows three stage samples can be distinguished (Additional file 1: Figure S1 A–C). After normalizing the expression data from GSE43777 dataset on GLP201 platform, we identify 147 DEGs (121 up-regulated genes and 26 down-regulated genes) between the C and the EA stages (Fig. 2A, F and Additional file 7: Table S1), 124 DEGs (102 up-regulated genes and 22 down-regulated genes) between the C and the LA stages (Fig. 2B, G and Additional file 8: Table S2) and 195 DEGs (74 up-regulated genes and 121 down-regulated genes) between the EA and LA stages (Fig. 2C, H and Additional file 9: Table S3) by a “Limma” package based on |Log2FC|≥ 1, and adjusted P value < 0.05. Likewise, in the EA stage we obtained 30 DEGs (16 up-regulated genes and 14 down-regulated genes) (Fig. 2D, I and Additional file 10: Table S4) between DF (n = 15) and DHF (n = 11) samples from GSE43777 dataset on GLP570 platform, and 58 DEGs (48 up-regulated genes and 10 down-regulated genes) (Fig. 2E, J and Additional file 11: Table S5) between DF (n = 25) and DHF (n = 26) samples in the LA stage, based on |Log2FC|≥ 1 and P value < 0.05.
GO enrichment analysis results show that 147 DEGs (between the C and the EA stages) are mainly enriched in defense response to viruses and type I interferon (Fig. 2K); DNA replication is a common bio-activity for 124 DEGs (between the C and the LA stages) (Fig. 2L); the Fig. 2M shows that neutrophil activation and defense response to virus are dominating enrichment biofunctions for 195 DEGs between the EA and LA stages; bio-function involved in negative regulation of viral genome replication are more active for 30 DEGs between DF and DHF samples in Fig. 2N; neutrophil and humoral immune response are activated for DHF samples in the LA stage (Fig. 2O).
GO enrichment analysis results (down-regulated genes in DHF samples enriched in negative regulation of viral replication) in Fig. 2I and N caught our attention, because it suggested it was difficult to control viral replication in DHF patients in the EA stage. Relative studies [21, 24, 50] demonstrated DENV inhibited cell apoptosis through autophagy, thereby promoting its replication. Therefore, we suspecte related-autophagy genes may be activated in EA stage and LA stage. As the Additional file 1: Figure S1 J shown, in the LA stage, we identify definitely 1 different expression autophagy-related gene (CCL2) by intersecting DEGs between DHF and DH samples from GSE43777 analyzed on GLP570 platform with 222 autophagy-related genes downloaded from Human Autophagy Database (HADb) which provides a complete and an up-to-date list of human genes and proteins involved directly or indirectly in autophagy. The boxplot diagram (Additional file 1: Figure S1 K) shows that the expression levels of CCL2 increases significantly in DHF samples in the LA stage, suggesting that DENV replicates more strongly in DHF than DF which may be one reason resulting in DHF in the LA stage. While we do not identify different expression of autophagy-related genes between DF and DHF in the EA stage.
KEGG analysis results reveal that pathogen infection and NOD − like receptor signaling pathway are obvious enrichment pathways for 147 DEGs (between the C and the EA stages) (Fig. 2P); Cell replication, Legionellosis and Malaria, are major enrichment pathways for 124 DEGs (between the C and the LA stages) (Fig. 2Q); 195 DEGs (between the EA and LA stages) are enriched in pathogen infection and Cell replication (Fig. 2R); NOD − like receptor signaling pathway and pathogen infection are more active for 30 DEGs between DF (n = 15) and DHF (n = 11) samples in Fig. 2S; inflammatory pathways are activated for DHF samples in LA stage in Fig. 2T.
Acquisition of key modules by WGCNA
Using the GSE43777 dataset analyzed on GLP201 platform, we performed a “WGCNA” package to obtain key modules associated with procession features of Dengue. We select 5, 4 and 12 (Fig. 3A–C) as the soft-threshold power respectively and 0.25 (Additional file 1: Figure S1 D-F) as the cutoff height in the C vs EA group, in the C vs LA group and in the EA vs LA group. Modules consist of genes with similar expression properties (Additional file 1: Figure S1 G–I). In the cluster tree, we use each branch to represent a gene, and one color to show a co-expression module (Fig. 3D–F).
The association of module eigengene (ME) values and clinical stages is applied to assess the module-stage feature correlation. WGCNA results show 14, 12 and 8 modules, respectively (Fig. 3G–I). Compared the C with the EA group, the turquoise module is the most matched strongly with the EA group (Pearson co-efficient = − 0.89, P = 1e−33) and the C group (Pearson co-efficient = 0.89, P = 1e−33) in Fig. 3G. Compared the C with the LA group, the most significant correlation can be observed between the LA group (Pearson co-efficient = 0.71, P = 5e−20) and the blue module, and between the C group (Pearson co-efficient = − 0.71, P = 5e−20) and the blue module in Fig. 3H. While comparing the LA group with the EA group, the black module and the green module show the same highest correlation (Pearson co-efficient = 0.72) in Fig. 3I. We regard these modules mentioned above as key modules.
Acquisition of key genes
To filter reliable and strong hub genes (shared genes) accurately, Venn diagrams were used to visualize hub genes between DEGs and key modules by a website (http://www.ehbio.com/test/venn/#/) . The results illustrate 143, 99 and 187 hub genes by interacting the turquoise module with 147 DEGs (between the C and the EA stages) (Fig. 4A), interacting the blue module with 124 DEGs (between the C and the LA stages) (Fig. 4B), interacting the black and green modules with 195 DEGs (between the EA and LA stages) (Fig. 4C), respectively. CD38 and CDKN1C are shared by the three phases (Fig. 4D). When considering serotypes, CD38 and CDKN1C still express differentially in three comparison groups (C vs EA, C vs LA, EA vs LA) for each serotype and log|FC|> 1 or log|FC| is extremely closed to 1 (Additional file 12: Table S6). The Fig. 4G shows the expression levels of CD38 firstly increase and then decrease from day 0 to the C stage (***P < 0.001). While the expression of CDKN1C is firstly down-regulated and subsequently up-regulated from the day 0 to the C stage (***P < 0.001) (Fig. 4G). The GSEA shows CD38 is enriched in cellar proliferation and metabolic pathways (Fig. 4E) and CDKN1C is enriched in lipid metabolism and inflammation pathways (Fig. 4F). Correlation analyses show in Fig. 4H expression levels of CD38 are negatively correlated with CDKN1C (spearman correlation = − 0.5). We select CD38 and CDKN1C for follow-up analyses.
Verification of key genes value in staging diagnosis
Because CD38 and CDKN1C express differently in three stages (Fig. 5A–B), we speculate that they can be used as biomarkers with staging characteristic to distinguish clinical stages for Dengue patients. To identify, verify and test the value of CD38 and CDKN1C, we regard GSE43777 datasets analyzed on the GPL201 platform (C stage, n = 48; EA stage, n = 47; LA stage, n = 73), GSE43777 datasets analyzed on the GPL570 platform (C stage, n = 24; EA stage, n = 26; LA stage, n = 51) and GSE28405 (C stage, n = 31; EA stage, n = 57; LA stage, n = 31) as a training dataset, a verifying dataset and a testing dataset, respectively. When analyzing single serotype, we regard GSE43777 datasets analyzed on the GPL201 platform (Serotype I: C stage, n = 26; EA stage, n = 26; LA stage, n = 44. Serotype II: C stage, n = 6; EA stage, n = 7; LA stage, n = 5. serotype III: C stage, n = 9; EA stage, n = 9; LA stage, n = 15. Serotype IV: C stage, n = 6; EA stage, n = 4; LA stage, n = 8.), GSE43777 datasets analyzed on the GPL570 platform (Serotype I: C stage, n = 7; EA stage, n = 3; LA stage, n = 14. Serotype II: C stage, n = 11; EA stage, n = 16; LA stage, n = 23. serotype III: C stage, n = 3; EA stage, n = 3; LA stage, n = 8. Serotype IV: C stage, n = 3; EA stage, n = 3; LA stage, n = 6.) as a training dataset and verifying dataset, respectively.
In the training group, CD38 has 0.960, 0.976, and 0.829 of AUC in three comparing groups (C vs EA, C vs LA, EA vs LA) respectively and CDKN1C has 0.852, 0.868 and 0.955 of AUC in three comparing groups (C vs EA, C vs LA, EA vs LA) respectively (Fig. 5D–F). In the verifying group, the staging diagnostic value of CD38 and CDKN1C in staging for Dengue is as follows (Fig. 5G–I): CD38 (AUC: 0.897, C vs EA; AUC: 0.957, C vs LA; AUC: 0.827, EA vs LA), CDKN1C (AUC: 0.575, C vs EA; AUC: 0.929 C vs LA; AUC: 0.876, EA vs LA). Figure 5J–L show high diagnostic values in the testing dataset: CD38 (AUC: 0.822, C vs EA; AUC: 0.996 C vs LA; AUC: 0.926, EA vs LA), CDKN1C (AUC: 0.744, C vs EA; AUC: 0.724 C vs LA; AUC: 0.842, EA vs LA). The analytical results above show that CD38 can distinguish admirably different phases for Dengue patients and has a higher distinguishable value than CDKN1C. Although we consider serotypes (serotype I–IV), results are similar among three comparison groups (C vs EA, C vs LA, EA vs LA) (Additional file 2: Figure S2, Additional file 3: Figure S3).
In addition, as Fig. 5C shown, CD38 distinguishes perfectly Dengue samples from normal samples with an AUC of 100% in GSE51808 dataset (9 normal samples and 28 Dengue samples). Therefore, CD38 shows the high value in distinguishing stages for Dengue patients. We selected CD38 as a biomarker for staging diagnosis of Dengue.
There is no significant difference in expression levels of CD38 and CDKN1C between DH and DHF samples (Additional file 1: Figure S1 L), and different serotypes of Dengue also do not show obvious difference in expression levels of CD38 and CDKN1C (Additional file 1: Figure S1 M).
Exploration of biomarkers for DHF
After intersecting 30 DEGs between DF (n = 15) and DHF (n = 11) samples from GSE43777 dataset on GLP570 platform in the EA stage with 58 DEGs between DF (n = 25) and DHF (n = 26) samples in the LA stage, we identify 6 shared DEGs (LOC101928288, TCN1, DEFA4, FRG1B, LOC286087 and ZNF595) (Fig. 6A). We speculate that LOC101928288, TCN1, DEFA4, FRG1B, LOC286087 and ZNF595 can be used as biomarkers to discriminate DHF patients from DF patients due to different expression of them between DHF and DF. The GSE43777 dataset analyze on the GPL570 platform in the EA stage (DF = 15; DHF = 11) (Fig. 6B) and in the LA stage (DF = 25; DHF = 26) (Fig. 6C) are regarded as a training dataset; GSE43777 dataset analyzed on the GPL201 platform in Acute stage (DF = 40; DHF = 37) is regarded as a verifying dataset; GSE51808 (DF = 18; DHF = 10) and GSE18090 (DF = 8; DHF = 10) datasets are regarded as testing datasets. The ROC results (Fig. 6B–F) show ZNF595 always has acceptable diagnostic value in distinguishing DHF patients from DF patients. Therefore, ZNF595 is selected as a biomarker to predict DHF patients in Acute stages.
Immune landscape related to characteristics of dengue patients
GO enrichment analyses for 58 DEGs between DHF samples and DF samples show neutrophil and humoral immune response are activated and KEGG pathway enrichment analyses show rich inflammatory pathways. Therefore, we analyze fraction changes for 22 types of immune cells during DENV infection by CIBERSORT. The GSE43777 dataset analyze on the GLP201 platform is used to explore fractions of 22 types of immune cells in whole blood samples from Dengue patients in three phases and the GSE43777 dataset analyzed on the GLP570 platform is applied to explore differently infiltrating-immune characteristic between DF and DHF samples.
As Fig. 7A–C and 7F shown, in the EA phase, fractions of activated dendritic cells, Neutrophils, Monocytes, and M1 Macrophages (P < 0.05) increase significantly compared to the LA and C phases. Compared the LA phase with the EA phase and the C phase, fractions of Plasma cells and activated memory CD4+ T cells (P < 0.05) increase clearly (Fig. 7A–C and F). While Compared with the EA phase and the LA phase, the increments in fractions of memory B cells, resting memory CD4+ T cells, resting dendritic cells, Eosinophils, and naïve B cells (P < 0.05) are more obvious in the C phase (Fig. 7A–C and F). Analytical results above mean that immune cells with antigen presentation, phagocytosis and chemotaxis, immune cells involved in humoral immunity and memory cells are more obviously active in the EA, LA and C phase, respectively. When analyzing in a single serotype, we can still get above results (Additional file 4: Figure S4).
In the LA stage, fractions of activated NK cells (P < 0.05) increase in DHF samples (Fig. 7E and H), but fractions of CD8+ T cells, gamma delta T cells, resting NK cells, M2 Macrophages and resting mast cells decrease (Fig. 7E and H) (P < 0.05), which imply immune response are damaged in DHF samples. In the EA stage, the immune response of DF patients is similar to that of DHF patients (Fig. 7D and G). The immune response of DHF patients in the LA stage is significantly impaired which explain rapid deterioration of DHF patients in the LA stage.
Correlations between gene and immune cells
We analyze correlations between CD38 and immune cells in GSE43777 dataset (Fig. 7I). The infiltration level of Plasma cells, activated memory CD4+ T cells, gamma delta T cells and CD8+ T cells are positively related to CD38 (Fig. 7I); the infiltration level of naïve B cells, Monocytes, resting dendritic cells, resting memory CD4+ T cells, naïve CD4+ T cells, memory B cells, Eosinophils and activated mast cells are negatively related to CD38 (Fig. 7I). Therefore, the analytical results above show there is strong co-expression relationship between CD38 and plasma cells and between CD38 and activated memory CD4+ T cells (Pearson’s correlation > 0.5). In individual serotype, we can still draw this conclusion (Additional file 5: Figure S5). Correlation between immune cells and ZNF595 is not significant (Fig. 7J).
Analyzing the staging characteristic of immune cells
Because of high co-expression correlation between CD38 and plasma cells, we speculated that infiltrating-immune plasma cells could also show distinct differences in three stages. To explore this characteristic, we regarded GSE43777 analyzed by GPL201 as a training set and GSE43777 analyzed by GPL570 as a test set. As the Fig. 8A–C shown in the training group, the distinguishing value of Plasma cells, activated memory CD4+ T cells, and Monocytes in staging were good and Plasma cells have the highest distinguishing value (AUC:0.827, C vs EA; AUC:0.964, C vs LA; AUC: 0.832, EA vs LA). In the test group, the distinguishing value of three types of immune cells in stage for Dengue was still valuable and Plasma cells have the highest distinguishing value in Fig. 8D–F (AUC: 0.905, C vs EA; AUC: 0.949, C vs LA; AUC: 0.824, EA vs LA. Plasma cells (AUC = 0.968), activated memory CD4 + T cells (AUC = 0.845) and Monocytes (AUC = 0.869) can excellently distinguish Dengue samples from normal samples in GSE51808 (Fig. 8G). When considering different serotypes, above results were still obtained (Additional file 6: Figure S6). Therefore, we can discriminate three stages based on the fraction of Plasma cells, activated memory CD4 + T cells and Monocytes in Dengue patients.
This study combined DEGs analysis, WGCNA and ROC to identify, valid and test potential biomarkers associated with the staging and severity of Dengue, and used GO enrichment analysis, KEGG analysis and GSEA analysis to explore potential reasons resulting in DHF. The CIBERSORT website was also applied to explore immune differences during Dengue infection. Our research was the first to show that CD38 and ZNF595 had clinical significance in the stage and severity of Dengue and that they could be used as biomarkers to distinguish clinical stages and severity for Dengue patients. It was worth noting that Plasma cells, activated memory CD4+ T cells, and Monocytes also showed distinguishing value.
CD38 which was identified, verified and tested in independent datasets could distinguish three clinical stages of Dengue fever and was significantly associated with plasma cells, but it could not use to predict severity which was similar to precious results  and distinguish different serotypes. Our results showed expression levels of CD38 and fractions of plasma cells were similar in four serotypes. We identified, verified and tested ZNF595 in independent datasets that could predict DHF. Mechanistically, EA-DEGs which inhibited viral replication were down-regulated in DHF, and a related-autophagy gene (CCL2) expressed differently and increased significantly in DHF, all of which suggested DENV regulated autophagy to lead to DHF.
In addition, we analyzed systematically immune differences among three stages and between DHF and DF in the acute stage, and correlation between immune cells and genes. Immune analysis results showed fractions of monocytes, activated mast cells, M1 macrophages and neutrophils referred to innate immune response increased obviously in the EA stage compared with the C stage; significant increments can be observed in fractions of plasma cells and activated memory CD4+ T cells comparing the LA stage with the C stage; increasing fractions of plasma cells, CD8+ T cells, activated memory CD4+ T cells, follicular helper T cells, regulatory T cells (Tregs) and gamma delta T cells can be discovered in the LA stage compared with the EA stage. Above results were suitable for any serotype, suggesting different serotypes did not have obvious immune differences. Increasing immune response eliminated DENV which avoided Dengue symptoms  and immune response kinetics dependent on initial lymphocyte numbers, were observed to correspond with illness severity . Our studies showed neutrophil and humoral immune response are activated in DHF in the LA stage, but the whole immune system was damaged in DHF compared with DF, which was a possible reason leading to DHF.
Interestingly, KEGG enrichment showed DEGs (between the EA stage and the C stage, and between the EA stage and the LA stage) were enriched in Coronavirus disease —COVID-19, Influenza A, Hepatitis C and Measles pathways, implying Coronavirus , Alphainfluenzavirus influenzae , Hepatitis C virus  and Measles morbillivirus also up regulated these genes during infection and had shared pathogenic mechanism.
Gene expression profiles from public databases (GEO) have been applied to explore Dengue biomarkers by researchers. Several studies have used multiple genes to distinguish Dengue patients from healthy samples [39, 40], DHF from DF  and different stages . However, multi-gene lists can limit the sensitivity and specificity of DEGs as disease biomarkers [58, 59].
Compared with previous studies [39,40,41], our study had several advantages. First of all, this study was based on single gene analyses which allowed us to identify a stable and robust biomarker and these biomarkers were identified and verified on independent datasets which increased the accuracy of our study. Secondly, we combined three methods, including DEG, WGCNA and ROC to screen, verify and test biomarkers for Dengue diagnoses which increase the accuracy of our results, differing from precious studies only depending on the DEG analysis. Thirdly, we analyzed 22 types of immune cells including less attention immune cells based on RNA-sequence which contributed to increase understand of the whole immune response during DENV infection and explore correlations between genes and immune cells. A previous study showed that peripheral lymphocyte subset alteration can be independent predictors for clinical characteristics and treatment efficacy of COVID-19 . Interestingly, we found that the fraction of Plasma cells, activated memory CD4+ T cells and Monocytes in Dengue patients also had clinical characteristics and can distinguish these clinical stages for Dengue patients. Finally, not just for single serotype, our study included four serotypes which was involved in separate and combined analysis.
There is still a shortcoming of this study, because the study only uses public datasets to analyze, verify and test, without clinical verification and test. Our research has provided help in distinguishing the stage and severity of Dengue infection and understanding pathogenic mechanism of different serotypes, and will help analyze mechanisms of DHF and benefit to clinical treatment in the future.
In conclusion, based on expression levels of CD38 and fractions of Plasma cells, activated memory CD4+ T cells and Monocytes, we can discriminate admirably three clinical stages for Dengue patients (results are suitable for any serotype); ZNF595 can better distinguish dengue hemorrhagic fever (DHF) from Dengue fever (DF). Up-regulated autophagy-related genes contribute to understand mechanisms of DHF.
Availability of data and materials
The datasets supporting the conclusions of this article are available in the Gene Expression Omnibus (GEO) database (GSE43777, GSE51808 and GSE28405).
Differentially expressed genes
Weighted co-expression network analysis
Receiver operator characteristic curve
Kyoto Encyclopedia of Genes and Genomes
Gene set enrichment analysis
Area under the curve
Dengue hemorrhagic fever
World Health Organization WHO
Mitochondria antiviral protein
Dengue shock syndrome
Principal component analysis
Early acute stage
Late acute stage
National Center of Biotechnology Information
Peripheral blood mononuclear cell
Topological overlap matrix
Zhang J, Shu Y, Shan X, Li D, Ma D, Li T, Long S, Wang X, Pan Y, Chen J, et al. Co-circulation of three dengue virus serotypes led to a severe dengue outbreak in Xishuangbanna, a border area of China, Myanmar, and Laos, in 2019. Int J Infect Dis. 2021;107:15–7.
Carbajo AE, Cardo MV, Vezzani D. Is temperature the main cause of dengue rise in non-endemic countries? the case of Argentina. Int J Health Geogr. 2012;11:26.
Hiatt T, Nishikiori N. Epidemiology and control of tuberculosis in the Western Pacific Region: analysis of 2012 case notification data. Western Pac Surveill Response J. 2014;5:25–34.
Brady OJ, Gething PW, Bhatt S, Messina JP, Brownstein JS, Hoen AG, Moyes CL, Farlow AW, Scott TW, Hay SI. Refining the global spatial limits of dengue virus transmission by evidence-based consensus. PLoS Negl Trop Dis. 2012;6: e1760.
Inizan C, Minier M, Prot M, O’Connor O, Forfait C, Laumond S, Marois I, Biron A, Gourinat AC, Goujart MA, et al. Viral evolution sustains a dengue outbreak of enhanced severity. Emerg Microbes Infect. 2021;10:536–44.
Thadchanamoorthy V, Dayasiri K. Postdengue chronic fatigue syndrome in an adolescent boy. BMJ Case Rep. 2021;14:e238605.
Kanesa-Thasan N, Sun W, Kim-Ahn G, Van Albert S, Putnak J, King A, Raengsakulsrach B, Christ-Schmidt H, Gilson K, Zahradnik J. Safety and immunogenicity of attenuated dengue virus vaccines (Aventis Pasteur) in human volunteers. Vaccine. 2001;19:3179–88.
Kitchener S, Nissen M, Nasveld P, Forrat R, Yoksan S, Lang J, Saluzzo J-F. Immunogenicity and safety of two live-attenuated tetravalent dengue vaccine formulations in healthy Australian adults. Vaccine. 2006;24:1238–41.
Guy B, Briand O, Lang J, Saville M, Jackson N. Development of the Sanofi Pasteur tetravalent dengue vaccine: one more step forward. Vaccine. 2015;33:7100–11.
Osorio JE, Velez ID, Thomson C, Lopez L, Jimenez A, Haller AA, Silengo S, Scott J, Boroughs KL, Stovall JL. Safety and immunogenicity of a recombinant live attenuated tetravalent dengue vaccine (DENVax) in flavivirus-naive healthy adults in Colombia: a randomised, placebo-controlled, phase 1 study. Lancet Infect Dis. 2014;14:830–8.
Martinez LJ, Lin L, Blaylock JM, Lyons AG, Bauer KM, De La Barrera R, Simmons M, Jarman RG, Currier JR, Friberg H. Safety and immunogenicity of a dengue virus serotype-1 purified-inactivated vaccine: results of a phase 1 clinical trial. Am J Trop Med Hyg. 2015;93:454.
Raviprakash K, Wang D, Ewing D, Holman DH, Block K, Woraratanadharm J, Chen L, Hayes C, Dong JY, Porter K. A tetravalent dengue vaccine based on a complex adenovirus vector provides significant protection in rhesus monkeys against all four serotypes of dengue virus. J Virol. 2008;82:6927–34.
Shukla R, Ramasamy V, Shanmugam RK, Ahuja R, Khanna N. Antibody-dependent enhancement: a challenge for developing a safe dengue vaccine. Front Cell Infect Microbiol. 2020;10: 572681.
Narayan R, Tripathi S. Intrinsic ADE: the dark side of antibody dependent enhancement during dengue infection. Front Cell Infect Microbiol. 2020;10: 580096.
Bournazos S, Gupta A, Ravetch JV. The role of IgG Fc receptors in antibody-dependent enhancement. Nat Rev Immunol. 2020;20:633–43.
Malavige GN, Jeewandara C, Ghouse A, Somathilake G, Tissera H. Changing epidemiology of dengue in Sri Lanka-Challenges for the future. PLoS Negl Trop Dis. 2021;15: e0009624.
Deng S-Q, Yang X, Wei Y, Chen J-T, Wang X-J, Peng H-J. A Review on Dengue Vaccine Development. Vaccines 2020;8(1):63. https://doi.org/10.3390/vaccines8010063
Eder M, Cortes F, Teixeira de Siqueira Filha N, Araújo de França GV, Degroote S, Braga C, Ridde V, Turchi Martelli CM. Scoping review on vector-borne diseases in urban areas: transmission dynamics, vectorial capacity and co-infection. Infect Dis Poverty. 2018;7:90.
Mizushima N. A brief history of autophagy from cell biology to physiology and disease. Nat Cell Biol. 2018;20:521–7.
Kroemer G, Mariño G, Levine B. Autophagy and the integrated stress response. Mol Cell. 2010;40:280–93.
Chu LW, Yang CJ, Peng KJ, Chen PL, Wang SJ, Ping YH. TIM-1 as a signal receptor triggers dengue virus-induced autophagy. Int J Mol Sci. 2019;20:4893.
Lu ZY, Cheng MH, Yu CY, Lin YS, Yeh TM, Chen CL, Chen CC, Wan SW, Chang CP. Dengue nonstructural protein 1 maintains autophagy through retarding caspase-mediated cleavage of beclin-1. Int J Mol Sci. 2020;21:9702.
Sun P, Nie K, Zhu Y, Liu Y, Wu P, Liu Z, Du S, Fan H, Chen CH, Zhang R, et al. A mosquito salivary protein promotes flavivirus transmission by activation of autophagy. Nat Commun. 2020;11:260.
Heaton NS, Randall G. Dengue virus-induced autophagy regulates lipid metabolism. Cell Host Microbe. 2010;8:422–32.
Huang X, Yue Y, Li D, Zhao Y, Qiu L, Chen J, Pan Y, Xi J, Wang X, Sun Q, Li Q. Antibody-dependent enhancement of dengue virus infection inhibits RLR-mediated Type-I IFN-independent signalling through upregulation of cellular autophagy. Sci Rep. 2016;6:22303.
Yeo AS, Azhar NA, Yeow W, Talbot CC Jr, Khan MA, Shankar EM, Rathakrishnan A, Azizan A, Wang SM, Lee SK. Lack of clinical manifestations in asymptomatic dengue infection is attributed to broad down-regulation and selective up-regulation of host defence response genes. PLoS ONE. 2014;9: e92240.
Simon-Lorière E, Duong V, Tawfik A, Ung S, Ly S, Casadémont I, Prot M, Courtejoie N, Bleakley K, Buchy P, et al. Increased adaptive immune responses and proper feedback regulation protect against clinical dengue. Sci Transl Med. 2017;9:eaal5088.
OhAinle M, Balmaseda A, Macalalad AR, Tellez Y, Zody MC, Saborío S, Nuñez A, Lennon NJ, Birren BW, Gordon A, et al. Dynamics of dengue disease severity determined by the interplay between viral genetics and serotype-specific immunity. Sci Transl Med. 2011;3:114ra128.
Garcia-Bates TM, Cordeiro MT, Nascimento EJ, Smith AP, de Melo KMS, McBurney SP, Evans JD, Marques ET, Barratt-Boyes SM. Association between magnitude of the virus-specific plasmablast response and disease severity in dengue patients. J Immunol. 2013;190:80–7.
Duangchinda T, Dejnirattisai W, Vasanawathana S, Limpitikul W, Tangthawornchaikul N, Malasit P, Mongkolsapaya J, Screaton G. Immunodominant T-cell responses to dengue virus NS3 are associated with DHF. Proc Natl Acad Sci. 2010;107:16922–7.
Mlera L, Offerdahl DK, Dorward DW, Carmody A, Chiramel AI, Best SM, Bloom ME. The liver X receptor agonist LXR 623 restricts flavivirus replication. Emerg Microbes Infect. 2021;10:1378–89.
Hong M, Tao S, Zhang L, Diao L-T, Huang X, Huang S, Xie S-J, Xiao Z-D, Zhang H. RNA sequencing: new technologies and applications in cancer research. J Hematol Oncol. 2020;13:166.
Hill SR, Taparia T, Ignell R. Regulation of the antennal transcriptome of the dengue vector, Aedes aegypti, during the first gonotrophic cycle. BMC Genomics. 2021;22:71.
Azlan A, Obeidat SM, Theva Das K, Yunus MA, Azzam G. Genome-wide identification of Aedes albopictus long noncoding RNAs and their association with dengue and Zika virus infection. PLoS Negl Trop Dis. 2021;15: e0008351.
Jiang L, Ma D, Ye C, Li L, Li X, Yang J, Zhao Y, Xi J, Wang X, Chen J, et al. Molecular characterization of dengue virus serotype 2 cosmospolitan genotype from 2015 dengue outbreak in Yunnan, China. Front Cell Infect Microbiol. 2018;8:219.
Wen S, Ma D, Lin Y, Li L, Hong S, Li X, Wang X, Xi J, Qiu L, Pan Y, et al. Complete genome characterization of the 2017 dengue outbreak in Xishuangbanna, a Border City of China, Burma and Laos. Front Cell Infect Microbiol. 2018;8:148.
Wang X, Ma D, Huang X, Li L, Li D, Zhao Y, Qiu L, Pan Y, Chen J, Xi J, et al. Complete genome analysis of dengue virus type 3 isolated from the 2013 dengue outbreak in Yunnan, China. Virus Res. 2017;238:164–70.
Jiang L, Sun Q. The role of autophagy-mediated dengue virus antibody-dependent enhancement infection of THP-1 cells. Intervirology. 2020;63:57–65.
Zhong XL, Liao XM, Shen F, Yu HJ, Yan WS, Zhang YF, Ye JJ, Lv ZP. Genome-wide profiling of mRNA and lncRNA expression in dengue fever and dengue hemorrhagic fever. FEBS Open Bio. 2019;9:468–77.
Xie LM, Yin X, Bi J, Luo HM, Cao XJ, Ma YW, Liu YL, Su JW, Lin GL, Guo XG. Identification of potential biomarkers in dengue via integrated bioinformatic analysis. PLoS Negl Trop Dis. 2021;15: e0009633.
Sun P, García J, Comach G, Vahey MT, Wang Z, Forshey BM, Morrison AC, Sierra G, Bazan I, Rocha C, et al. Sequential waves of gene expression in patients with clinically defined dengue illnesses reveal subtle disease phases and predict disease severity. PLoS Negl Trop Dis. 2013;7: e2298.
Tolfvenstam T, Lindblom A, Schreiber MJ, Ling L, Chow A, Ooi EE, Hibberd ML. Characterization of early host responses in adults with dengue disease. BMC Infect Dis. 2011;11:209.
Soares-Schanoski A, Baptista Cruz N, de Castro-Jorge LA, de Carvalho RVH, Santos CAD, Rós ND, Oliveira Ú, Costa DD, Santos C, Cunha MDP, et al. Systems analysis of subjects acutely infected with the Chikungunya virus. PLoS Pathog. 2019;15: e1007880.
Kwissa M, Nakaya HI, Onlamoon N, Wrammert J, Villinger F, Perng GC, Yoksan S, Pattanapanyasat K, Chokephaibulkit K, Ahmed R, Pulendran B. Dengue virus infection induces expansion of a CD14(+)CD16(+) monocyte population that stimulates plasmablast differentiation. Cell Host Microbe. 2014;16:115–27.
Wold S, Esbensen K, Geladi P. Principal component analysis. Chemom Intell Lab Syst. 1987;2:37–52.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005;102:15545–50.
Zhang B, Horvath S: A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics & Molecular Biology 2005, 4:Article17.
Newman AM, Steen CB, Liu CL, Gentles AJ, Chaudhuri AA, Scherer F, Khodadoust MS, Esfahani MS, Luca BA, Steiner D, et al. Determining cell type abundance and expression from bulk tissues with digital cytometry. Nat Biotechnol. 2019;37:773–82.
Nettleman MD. Receiver operator characteristic (ROC) curves. Infect Control Hosp Epidemiol. 1988;9:374–7.
Heaton NS, Randall G. Dengue virus and autophagy. Viruses. 2011;3:1332–41.
Chen T, Zhang H, Liu Y, Liu YX, Huang L. EVenn: Easy to create repeatable and editable Venn diagrams and Venn networks online. J Genet Genomics. 2021;48:863–6.
Castañeda DM, Salgado DM, Narváez CF. B cells naturally induced during dengue virus infection release soluble CD27, the plasma level of which is associated with severe forms of pediatric dengue. Virology. 2016;497:136–45.
Nguyen HD, Chaudhury S, Waickman AT, Friberg H, Currier JR, Wallqvist A. Stochastic model of the adaptive immune response predicts disease severity and captures enhanced cross-reactivity in natural dengue infections. Front Immunol. 2021;12: 696755.
Zheng W, Wu H, Liu C, Yan Q, Wang T, Wu P, Liu X, Jiang Y, Zhan S. Identification of COVID-19 and dengue host factor interaction networks based on integrative bioinformatics analyses. Front Immunol. 2021;12: 707287.
Zhou A, Dong X, Liu M, Tang B. Comprehensive transcriptomic analysis identifies novel antiviral factors against influenza A virus infection. Front Immunol. 2021;12: 632798.
Wei J, Wang B, Gao X, Sun D. Prognostic value of a novel signature with nine hepatitis C virus-induced genes in hepatic cancer by mining GEO and TCGA databases. Front Cell Dev Biol. 2021;9: 648279.
Cougnoux A, Dalmasso G, Martinez R, Buc E, Delmas J, Gibold L, Sauvanet P, Darcha C, Déchelotte P, Bonnet M, et al. Bacterial genotoxin colibactin promotes colon tumour growth by inducing a senescence-associated secretory phenotype. Gut. 2014;63:1932–42.
Deng M, Yin Y, Zhang Q, Zhou X, Hou G. Identification of inflammation-related biomarker Lp-PLA2 for patients with COPD by comprehensive analysis. Front Immunol. 2021;12: 670971.
Tang BM, Shojaei M, Parnell GP, Huang S, Nalos M, Teoh S, O’Connor K, Schibeci S, Phu AL, Kumar A, et al. A novel immune biomarker IFI27 discriminates between influenza and bacteria in patients with suspected respiratory infection. Eur Respir J. 2017;49:1602098.
Wang F, Nie J, Wang H, Zhao Q, Xiong Y, Deng L, Song S, Ma Z, Mo P, Zhang Y. Characteristics of peripheral lymphocyte subset alteration in COVID-19 pneumonia. J Infect Dis. 2020;221:1762–9.
We would like to thank Mr. Yinde Huang for his support with bioinformatics analyses and pay tribute to the contributions of public databases such as GEO to human medicine.
The study was sponsored by the Foundation of the CAMS Initiative for Innovative Medicine (2021-I2M-1-036); the National Natural Science Foundation of China (31970868); Yunnan health training project of high-level talents (L-2019030); Innovation Team Project of Yunnan Science and Technology Department (202105AE160020).
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1. Figure S1.
(A–C) PCA shows that different stages can be distinguished. Weighted co-expression network analysis (WGCNA). (D–F) Showing the cutoff height. (G–I) Sample clustering. (J) Shared gene (CCL2) in the LA stage between DEGs (between Dengue Hemorrhagic Fever (DHF) and Dengue Fever (DF)) and 232 autophagy-related genes, and (K) its expression levels. Similar expression levels of CD38 and CDKN1C between DH and DHF (L) and among different serotypes (M). S1, serotype I; S2, serotype II; S3, serotype III; S4, serotype IV. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Additional file 2. Figure S2.
Analyzing staging diagnosis value of CD38 and CDKN1C for four serotypes among three comparison groups by Area Under the Curve (AUC) in the training set (GSE43777 dataset of GLP201 platform), respectively. (A) C vs EA in serotype I; (B) C vs LA in serotype I; (C) EA vs LA in serotype I; (D) C vs EA in serotype II; (E) C vs LA in serotype II; (F) EA vs LA in serotype II; (G) C vs EA in serotype III; (H) C vs LA in serotype III; (I) EA vs LA in serotype III; (J) C vs EA in serotype IV; (K) C vs LA in serotype IV; (L) EA vs LA in serotype IV. (C, Convalescent stages; EA, Early Acute stage; LA, Late Acute stage).
Additional file 3. Figure S3.
Analyzing staging diagnosis value of CD38 and CDKN1C for four serotypes among three comparison groups by Area Under the Curve (AUC) in the test set (GSE43777 dataset of GLP570 platform), respectively. (A) C vs EA in serotype I; (B) C vs LA in serotype I; (C) EA vs LA in serotype I; (D) C vs EA in serotype II; (E) C vs LA in serotype II; (F) EA vs LA in serotype II; (G) C vs EA in serotype III; (H) C vs LA in serotype III; (I) EA vs LA in serotype III; (J) C vs EA in serotype IV; (K) C vs LA in serotype IV; (L) EA vs LA in serotype IV. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Additional file 4. Figure S4.
Violin diagrams and heatmaps display immune differences of immune cells in different comparing groups. (A) C vs EA stages in the serotype I. (B) C vs LA stages in the serotype I. (C) EA vs LA stages in the serotype I. (D) Immune difference heatmap of the serotype I in three stages. (E) C vs EA stages in the serotype II. (F) C vs LA stages in the serotype II. (G) EA vs LA stages in the serotype II. (H) Immune difference heatmap of the serotype II in three stages. (I) C vs EA stages in the serotype III. (J) C vs LA stages in the serotype III. (K) EA vs LA stages in the serotype III. (L) Immune difference heatmap of the serotype III in three stages. (M) C vs EA stages in the serotype IV. (N) C vs LA stages in the serotype IV. (O) EA vs LA stages in the serotype IV. (P) Immune difference heatmap of the serotype IV in three stages. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Additional file 5. Figure S5.
Correlation between CD38 and 22 types of immune cells (red rectangular: statistically significant (P < 0.05)). (A) In serotype I; (B) in serotype II; (C) in serotype III; (D) in serotype IV. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Additional file 6. Figure S6.
Analyzing staging diagnosis value of immune cells for Dengue by Area Under the Curve (AUC) in each serotype. (A) C vs EA stages in the serotype I. (B) C vs LA stages in the serotype I. (C) EA vs LA stages in the serotype I. (D) C vs EA stages in the serotype II. (E) C vs LA stages in the serotype II. (F) EA vs LA stages in the serotype II. (G) C vs EA stages in the serotype III. (H) C vs LA stages in the serotype III. (I) EA vs LA stages in the serotype III. (J) C vs EA stages in the serotype IV. (K) C vs LA stages in the serotype IV. (L) EA vs LA stages in the serotype IV. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Additional file 7. Table S1.
Differentially expressed genes (DEGs) in the C vs EA group. (C, Convalescent stage; EA, Early Acute stage).
Additional file 8. Table S2.
Differentially expressed genes (DEGs) in the C vs LA group. (C, Convalescent stage; LA, Late Acute stage).
Additional file 9. Table S3.
Differentially expressed genes (DEGs) in the EA vs LA group. (EA, Early Acute stage; LA, Late Acute stage).
Additional file 10. Table S4
. Differentially expressed genes (DEGs) from comparing Dengue Hemorrhagic Fever (DHF) with Dengue Fever (DF) in EA group. (EA, Early Acute stage).
Additional file 11. Table S5
. Differentially expressed genes (DEGs) from comparing Dengue Hemorrhagic Fever (DHF) with Dengue Fever (DF) in LA group. (LA, Late Acute stage).
Additional file 12. Table S6
. Differentially expressed fold of genes in individual serotype for three comparison groups. (C, Convalescent stage; EA, Early Acute stage; LA, Late Acute stage).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Xiong, N., Sun, Q. Identification of stage-related and severity-related biomarkers and exploration of immune landscape for Dengue by comprehensive analyses. Virol J 19, 130 (2022). https://doi.org/10.1186/s12985-022-01853-8