Relationship between polymorphism of receptor SCARB2 gene and clinical severity of enterovirus-71 associated hand-foot-mouth disease

Background To investigate the relationship between polymorphism of scavenger receptor class B member 2 (SCARB2) gene and clinical severity of enterovirus (EV)-71 associated hand-foot-mouth disease (HFMD). Methods Among the 100 recruited cases, 56 were in the severe HFMD group (case group) and 44 were in the general HFMD group (control group). By screening functional single nucleotide polymorphisms (SNPs) and hot SNPs, and performing SNP site optimization, some SNP sites of SCARB2 gene were selected for analysis. Genotyping was performed using a MassArray platform. PLINK software was used for statistical processing and analysis of the correlation differences between the mutant genotypes in the severe and general HFMD groups. The relationship between the SNPs and clinical severity of enterovirus (EV)-71 associated HFMD was assessed. Results 28 SNPs in SCARB2 were selected by site optimization. Then three loci were not in agreement with the minor allele frequency (MAF) in the 1000 Han Chinese in Beijing (CHB) dataset. Another three loci could not be detected. Nine loci were not suitable for further analysis (MAF < 0.01 and Hardy–Weinberg [HWE] P < 0.001). A total of 13 sites were subsequently analyzed. Through Fisher analysis, the frequency of the rs6812193 T allele was 0.134 and 0.034 in the severe and general HFMD groups, respectively (P 0.023 < 0.05, odds ratio [OR] 4.381 > 1). Logistic regression analysis of rs6812193 T alleles between the severe and general HFMD groups, respectively (P 0.023 < 0.05, OR 4.412 > 1, L95 1.210 > 1). Genotype logistic regression analysis of the rs6812193 alleles CT + TT versus CC gave an OR of 4.56 (95% confidence interval [95% CI] 1.22–17.04, P = 0.012). Conclusion The rs6812193 T allele was a susceptibility SNP for SHFMD, and the rs6812193 polymorphism might be significantly associated with the susceptibility to EV-71 infection.


Introduction
Hand, foot, and mouth disease (HFMD) is an infectious disease caused by a variety of enteroviruses which belongs to the small RNA virus family [1]. HFMD is common in children under 5 years of age, and it is mainly manifested as herpes and maculopapules on the hands, feet, mouth, and other areas. A few patients progress rapidly and develop neurogenic pulmonary edema, circulatory disturbance, and even death at 1-5 days after disease onset [2]. The following indicators should alert the clinician of possible deterioration and impending critical type of severe case: persistent high fever, nervous system involvement, abnormal respiratory rate and rhythm, circulatory dysfunction, elevated peripheral white blood cell count, elevated blood glucose, elevated blood lactic acid [3]. HFMD is a global disease with a variety of causes. The main causes of HFMD are enterovirus (EV)-71 and Coxsackievirus A16 infection; however, EV-71 is responsible for most of the severe cases and fatal cases [4].
EV-71 is another important neuroenterophilic virus after poliovirus elimination. An analysis shows that EV-71 was circulating in the Netherlands as early as 1963 [5], but it was first reported in 1969 [6]. It was global distribution and occasionally concentrated outbreak. Since 1997, outbreaks of HFMD caused by EV-71 have occurred in the Asia-Pacific region, such as Malaysia [7], Taiwan [8], Singapore [9], etc. In 1998, the epidemic outbreaks began in Chinese mainland, especially in 2007, Shandong province [10] and 2008 Anhui province [11], which resulted in a large number of severe and dead children. Therefore, in 2008, the Ministry of Health of China listed HFMD in the list of Class C infectious diseases. In recent years, the incidence of HFMD has been 37.01-205.06 per 100,000, with a fatality rate of 6.46-51.00 per 100,000 [3]. HFMD has the highest number of cases and deaths of all Class C infectious diseases in China and represents a serious threat to the health of children.
As a major cause of severe and fatal cases, the pathogenesis of EV-71 has attracted more and more researchers' attention. In clinical work, it is not difficult to find that the severity of clinical symptoms and prognosis of different children with the same infection with EV-71 are significantly different. As the first important portal for the virus to enter the human body, the virus receptor determines the host range of a virus and tissue specificity. The influence of individual differences on the severity of clinical symptoms is worth further studying. Yamayoushi et al. [12,13] confirmed that SCARB2 is the receptor of all EV-71 strains in cell experiments. The guidelines for the diagnosis and treatment of HFMD issued by the Ministry of Health of China (2018) clearly indicate that SCARB2 is the main receptor of EV-71 virus. Studies have found that people with different genotypes and alleles have different probability of disease and severity [14]. Single nucleotide polymorphisms (SNPs) are the most common form of variation in human genomic DNA. The SNPs of SCARB2 gene have naturally attracted great attention. At present, most studies focus on the relationship between the severity of EV-71 infection and the polymorphism of cytokines such as TNF-ɑ, IL-6, IL-10 [15,16], chemokine IP-10, MCP-1(CCl2) [17], immune-related factors OAS1, OAS2, OAS3 and MXA [18,19]. To the best of our knowledge, few studies on the relationship between SCARB2 SNP and EV-71 HFMD have been performed. In this study, 28 SNP sites in SCARB2 were selected as study loci, expecting to further clarify the pathogenesis of EV71 infection, and to provide a strong research basis for the early warning of critical disease and the reduction of case fatality rate.

Clinical data and sample collection
We recruited 100 children with HFMD admitted to our hospital from April 2018 to October 2020 who were positive after EV-71 nucleic acid test. Diagnosis was based on the "Guidelines for the Diagnosis and Treatment of Hand, Foot, and Mouth Disease" (2018) [3]. According to the occurrence and development process of the disease, HFMD is divided into general HFMD and severe HFMD. General HFMD is usually in the eruption stage, but severe HFMD includes nervous system involvement stage, early cardiopulmonary failure stage, and cardiopulmonary failure stage, based on the degree of danger and heavy. There were 56 cases with severe HFMD (as the case group) and 44 cases with general HFMD (as the control group). This study was approved by the Ethics Committee of Tianjin Second People's Hospital, and informed consent was obtained from the patients' parents or family members.
HFMD samples were collected before treatment on the day of admission; 3 mL peripheral venous blood was collected and stored at − 80 °C for later use.

SNP site optimization method
According to the linkage disequilibrium analysis results, the completely linked loci with R 2 = 1 were discarded, and the loci in the promoter region with R 2 > 0.8 were retained (Haplotypes in this region are important for gene expression); however, the strong association sites with R 2 > 0.8 identified in other regions and literature studies, are meaningless and omitted.

DNA extraction
DNA was extracted from the blood samples using a Tiangen kit. DNA samples were analyzed on a Nan-oDrop2000, and 1.25% agarose gel electrophoresis was performed. DNA was quantified and transferred to a 96-well plate for storage at − 20 °C for later use.

SNP typing Primer design and synthesis
Assay Designer 3.1 software was used to design the primers, and the primers were synthesized by the company [BGI Tech Solutions (Beijing Liuhe) Co., Ltd]. The primer sequences are shown in Table 1.

Primer dilution and extension mix configuration
The single-tube PCR masters were diluted to 100 μM, and deionized water was added to achieve a final PCR master mix concentration of 0.5 μM. The single tube extension primers were diluted to a final concentration of 500 μM. Each primer was diluted to 8 μM, 10 μM, and 15 μM. According to the instructions of the DNA synthesis products, the molecular weight and number of moles, the amount of deionized water to be added were calculated according to the required concentration. According to the molecular weight of the mixed single-tube extension primers, 1 time (< 6300 Da), 1.2 times (6300-7200 Da), and 1.5 times (> 7200 Da) were taken for mixing.

MassArray reactions
PCR amplification was conducted in 5 µL reactions,

Statistical analysis
PLINK software was used for statistical processing and analysis of the correlation differences between the mutant genotypes in the case and control groups. Case: severe HFMD group; Control: general HFMD group. A1: mutant; A2: wild-type (the default is the variant with the lowest allele frequency). A1 frequency is the MAF value. According to the Hardy-Weinberg equilibrium, the selected samples were from a random population. Fisher test was used to compare the genotype frequency between the case group and the control group. P-value represents the statistical difference between both groups, P < 0.05 indicates that there is a significant difference in A1 between the case and control groups; OR < 1 indicates that A1 is protective; OR = 1 indicates that A1 has no relationship with disease; OR > 1 indicates that A1 has a pathogenic effect. The differences of alleles and genotypes were compared by Logistic regression analysis. 95% confidence interval = L95 -U95. L95 and U95 represent the lower and upper limits of the confidence interval, respectively. OR > 1 and L95 > 1 indicate that the allele has a pathogenic effect, while OR < 1 and U95 < 1 indicate that the allele has a protective effect.

Characteristics
The characteristics of the subjects are shown in Table 2.
There is no statistically significant difference between the case and control groups in terms of age and sex (P > 0.05).
The MAFs of the selected SNPs were greater than 0.01, and the P-values of the Hardy-Weinberg equilibrium test were greater than 0.001. Nine SNPs (rs121909118, rs121909119, rs200053119, rs727502772, rs727502781, rs755903502, rs886041074, rs886041076, and rs886041078) did not fulfill these criteria and were excluded from further analysis. At last, a total of 13 sites were subsequently analysed (Table 3).

Fisher analysis
As shown in Table 4, the frequencies of the rs6812193 T allele was 0.134 and 0.034 in the case and control group, respectively. P value 0.023 < 0.05, indicating a significant difference of A1 between the case and control groups; the OR of 4.381 > 1 indicates that A1 has a pathogenic effect. The remaining 12 SNPs may not be related to the pathogenicity of EV-71. Therefore, the rs6812193 T genotype is a susceptibility SNP.

Allele logistic regression analysis
As shown in Table 5, the P value of the rs6812193 T allele was 0.0245 < 0.05, indicating a significant difference between the case and control groups; the OR of 4.412 > 1 and L95 value 1.210 > 1 indicate that the allele had a pathogenic effect. The remaining 12 SNPs may not be related to the pathogenicity of EV-71. Therefore, the rs6812193 T genotype is a susceptibility SNP.

rs6812193 genotype logistic regression analysis
As shown in Table 6, in the dominant model, the rs6812193 T allele was associated with a risk of severe disease. CT + TT genotype carriers had an increased risk of severe disease compared with CC genotype carriers (OR = 4.56, 95% confidence interval = 1.22-17.04, P = 0.012).

Discussion
The human SCARB2 gene is located on chromosome 4 and encodes a peptide chain containing 478 amino acids. SCARB2 is a transmembrane sialic acid glycoprotein with a relative molecular mass of 85 kDa, and belongs to the family of CD36 molecules [20]. SCARB2 is mainly located in lysosomes and endosome, and widely present on the membrane of most human cells including nerve cells [21]. This protein is also called lysosomal integral membrane protein 2. It is a type of specific glucose cerebral fat enzyme combined with ligands, involved in the lysosomal pathway. The related research fields are mostly Parkinson's disease with abnormal lysosomal metabolism [22,23], Gaucher's disease and myoclonic epilepsy [24,25]. Yamayoshi et al. [12,13] found that the tissue distribution of EV-71 virus antigen was well correlated with SCARB2, and further found that this receptor was involved in the endocytosis and membrane transport of pathogenic bacteria. This study speculated that the expression level of SCARB2 might be related to virus sensitivity and infection rate. Therefore, the 28 selected sites were all functional sites related to the expression level, including exons, promoters and introns.
Choi M et al. [26] found that exons contain the vast majority of protein coding synthesis, and about 85% of pathogenic mutations are located in the exon region. In 2009, Ng SB et al. [27] used exome sequencing for the first time to find point mutations located in MYH3 in 4 patients with Freeman Sheldon syndrome (autosomal dominant genetic disease), showing the powerful effect of exome sequencing in identifying pathogenic genes of Mendelian genetic disease. Many complex diseases have been identified by exome sequencing, such as genetic disease OHDO syndrome (KAT6B) [28], CTNNB1 mutation in craniopharyngioma patients [29], point mutation of dilated cardiomyopathy GATAD1 [30], etc. Jenny Do et al. [25] found that 3'-UTR mutations in SCARB2 may be associated with Gaucher disease and myoclonic epilepsy. Yock-Ping Chow et al. [31] found that SCARB2 exon mutation was associated with Pendred syndrome. Yamayoshi et al. [13] found that amino acids at position 142-204 of SCARB2 played an important role in promoting the binding of virus particles to cells and susceptibility to EV-71. However, the study of Ting-Yu Yen et al. did not find the correlation between amino acids at position 142-204 and clinical severity [32]. There are 12 exon sites selected in this study: rs1051326, rs3733255, rs3733256, rs6811781, rs6841815, and rs8475 belong to 3' UTR region, and its function was predicted as miRNA binding site. rs11547135 and rs1465922 belong to 5' UTR region, and its function was predicted to be a TFBS transcription factor binding region. rs121909118 rs200053119 rs755903502 and rs886041078 are NCBI pathogenic clinical significance sites. Among these exons, rs6811781 and rs6841815 (not in CHB); rs11547135 and rs1465922 (not detected); rs121909118, rs200053119, rs755903502, rs886041078 (not in line with MAF value). Finally, rs8475, rs1051326, rs3733255 and rs3733256 were included in the study, but no correlation was found between these four exon loci and the severity of clinical infection.
Promoter is an important cis-element in gene expression regulation and the core region of gene transcriptional regulation. In this study, two promoter loci were selected: rs1465923 and rs78737354. Finally, no correlation between these two promoters and clinical infection was found.   Previously, it was often believed that introns do not encode proteins and do not have biological functions in organisms. However, studies have found that the expression profiles of the same gene with and without introns are significantly different [33]. In many cases of transgenic expression, the addition of a universal intron to cDNA results in a significant increase in gene expression [34,35]. The optimal expression of many endogenous genes has been demonstrated in mammalian tissue culture cells, transgenic mice, insects, and plant systems requiring the presence of one or more introns. Therefore, a variety of introns in organisms are an important part of eukaryotic genome and are closely related to the construction and dynamic changes of cytoskeleton of gene expression [36]. Ting-Yu Yen studied the relationship between SCARB2, PSGL-1, ANXA2 polymorphisms and clinical severity, and found that rs11097262 was associated with rs6824953 located in the intron region of SCARB2 gene, considering that it may regulate the function or expression of SCARB2 and thus affect the susceptibility to EV-71 [32]. In this study, 14 introns were selected: rs121909119, rs727502772, rs727502781, rs886041074, and rs886041076 were considered to be the sites with pathological clinical significance on NCBI website; rs6812193 is a hot spot site that can be found in the literatures [23,[37][38][39]; rs1470194, rs2119733, rs2869851, rs57374265, rs6824953, rs72857048, rs75285019, and rs7697073 are the TAGSNP sites. Among these introns of this study, rs72857048 is not in CHB; rs2869851 is not detected; rs121909119, rs727502772, rs727502781, rs886041074, and rs886041076 are not in line with MAF value. Finally, rs1470194, rs2119733, rs57374265, rs6812193, rs6824953, rs75285019 and rs7697073 were included in the final study, but only rs6812193 was correlated with the severity of clinical infection, and no correlation was found for the other 6 introns. By using Fisher analysis and allele logistic regression analysis, the rs6812193 T allele was shown to have a pathogenic effect. rs6812193 genotype logistic regression analysis in a dominant model showed that CT + TT genotype carriers had an increased risk of severe HFMD compared with CC genotype carriers. Therefore, the rs6812193 T genotype is considered to be a susceptibility SNP, and the rs6812193 polymorphism may be related to susceptibility to EV-71.
rs6812193 is actually a hot SNP close to SCARB2. A 2011 Web-based Genome-wide Association (GWA) study found that a nucleotide polymorphism rs6812193 close to SCARB2 was significantly associated with Parkinson's disease (PD) in people of European ancestry [37]. In 2012, Shuai Chen et al. conducted a genotyping study on rs6812193 in 449 PD patients and 452 control patients in mainland China, and found that there is no statistically significant differences in allele and genotype distribution between the patients and the control group [38]. In 2013, Kallirhoe Kalinderi et al. studied 210 Greek patients with sporadic PD and 133 control subjects in Greece. It was found that there was no difference in genotype or allele frequency between PD patients and controls [39]. In 2021, T.S. Usenko et al. rs6812193 of the SCARB2 gene does not confer a significant risk for PD in Russian population [23]. At present, to the best of our knowledge, there is no research related to the role of rs6812193 in HFMD. How does rs6812193 affect the expression and function of SCARB2? This will be examined in the future by increasing the sample size to verify the association of rs6812193 with susceptibility to EV-71 HFMD, and to study the function of rs6812193.
The present study also has some limitations, such as the small number of cases, which may make it difficult to find significant differences between low-frequency SNPs. In addition, with sufficient funds and time, genome-wide tests can be performed to avoid screening for missing sites of interest. Clarifying the relationship between gene polymorphisms and disease will enable us to analyze disease pathogenesis further, to explore the nature of the diversity of disease phenotypes, and to develop more individualized treatment measures.

Conclusion
The rs6812193 T genotype was identified as a susceptibility SNP. CT + TT genotype carriers have an increased risk of severe HFMD compared with CC genotype carriers. Therefore, the rs6812193 polymorphism might be considerably related to clinical severity of enterovirus (EV)-71 associated HFMD, which can support doctors to make evidence-based health recommendations to patients.