Academic Editor

Article Metrics

  • Fig. 1.

    View in Article
    Full Image
  • Fig. 2.

    View in Article
    Full Image
  • Fig. 3.

    View in Article
    Full Image
  • Fig. 4.

    View in Article
    Full Image
  • Fig. 5.

    View in Article
    Full Image
  • Fig. 6.

    View in Article
    Full Image
  • Fig. 7.

    View in Article
    Full Image
  • Information

  • Download

  • Contents

Abstract

Background:

Heat shock proteins (HSPs) play a critical role in the molecular mechanisms of ischemic stroke (IS). A possible role for HSP40 family proteins in atherosclerosis progression has already been revealed; however, to date, molecular genetic studies on the involvement of genes encoding proteins of the HSP40 family in IS have not yet been carried out.

Aim:

We sought to determine whether nine single nucleotide polymorphisms (SNPs) in genes encoding HSP40 family proteins (DNAJB1, DNAJB2, DNAJA1, DNAJA2, DNAJA3 and DNAJC7) are associated with the risk and clinical features of IS.

Methods:

Using TaqMan-based polymerase chain reaction (PCR) and the MassArray-4 system, DNA samples of 2551 Russians — 1306 IS patients and 1245 healthy individuals — were genotyped.

Results:

SNP rs2034598 DNAJA2 decreased the risk of IS exclusively in male patients (odds ratio = 0.81, 95% confidence interval 0.78–0.98, p = 0.028); rs7189628 DNAJA2 increased the brain infarct size (p = 0.04); and rs6500605 DNAJA3 lowered the age of onset of IS (p = 0.03). SNPs rs10448231 DNAJA1, rs7189628 DNAJA2, rs4926222 DNAJB1 and rs2034598 DNAJA2 were involved in the strongest epistatic interactions linked to IS; SNP rs10448231 DNAJA1 is characterised by the most essential mono-effect (2.96% of IS entropy); all of the top SNP–SNP interaction models included the pairwise combination rs7189628 DNAJA2×rs4926222 DNAJB1, which was found to be a key factor determining susceptibility to IS. In interactions with the studied SNPs, smoking was found to have multidirectional effects (synergism, antagonism or additive effect) and the strongest mono-effect (3.47% of IS entropy), exceeding the mono-effects of rs6500605 DNAJA3, rs10448231 DNAJA1, rs2034598 DNAJA2, rs7189628 DNAJA2 and rs4926222 DNAJB1, involved in the best G×E models and determining 0.03%–0.73% of IS entropy.

Conclusions:

We are the first to discover polymorphisms in genes encoding HSP40 family proteins as a major risk factor for IS and its clinical manifestations. The comprehensive bioinformatics analysis revealed molecular mechanisms, underscoring their significance in the pathogenesis of IS, primarily reflecting the regulation of heat stress, proteostasis and cellular signalling.

1. Introduction

Ischemic stroke (IS) poses a substantial threat to human health worldwide, ranking among leading causes of disability and death [1]. IS occurs when an atherosclerotic plaque or thrombus obstructs an artery in cerebral vessels, thereby leading to brain tissue ischemia [2]. This occlusion deprives the affected brain region of oxygen and nutrients, resulting in hypoxia, oxidative stress, excitotoxicity and ultimately neuronal death [3, 4]. Heat shock proteins (HSPs) and other proteins with chaperone-like activity [5, 6] play a crucial part in the mitigation of these processes to ensure cell survival [7].

HSPs are a highly conserved superfamily of chaperones involved in protein folding, quality control and transport [8]. Among various HSP families, roles of HSP90 [9], HSP70 [10, 11, 12], HSP60 [13, 14] and HSP27 [15] are well established in both IS progression and neuroprotection. Of note, despite extensive global research on HSPs in relation to IS, the role of the HSP40 family remains virtually uninvestigated and poorly understood.

Essential and functionally conserved HSP40s, characterised by the presence of the J-domain, are also known as J-domain–containing proteins [16]. HSP40s accelerate ATP hydrolysis by HSP70s, and many HSP40s directly bind unfolded or partially folded polypeptides, thereby delivering these substrates to HSP70s for proper folding and stabilisation [17, 18, 19]. This intricate cooperation between HSP70 and HSP40 is vital for proteostasis, particularly under stressful conditions such as those induced by IS.

Thus, we aimed to investigate whether HSP40 polymorphisms are related to IS risk, to assess their association with the risk of IS in terms of inter-gene and gene–environment interactions and to determine their influence on clinical manifestations of IS. Additionally, we sought to analyse molecular mechanisms of involvement of HSP40 family genes’ SNPs in the development of IS through comprehensive bioinformatic analysis.

2. Materials and Methods

The scheme of the study design is presented in Fig. 1.

Fig. 1.

The basic outline of our study design. HSP, heat shock protein; SNPs, single nucleotide polymorphisms; MDR, multifactor dimensionality reduction; MB-MDR, model-based MDR; CVDKP, Cardiovascular Disease Knowledge Portal; CDKP, Cerebrovascular Disease Knowledge Portal; PCR, polymerase chain reaction; IS, ischemic stroke; eQTL, expression Quantitative Trait Loci.

A total of 2551 unrelated individuals of Russian ancestry (1306 IS patients and 1245 healthy controls) from Central Russia participated in this study. The study received ethical approval. Participants had to be born in Central Russia and self-identify as being of Russian descent in order to be eligible. Supplementary Table 1 lists the study cohort’s baseline and clinical characteristics.

The study involved two separate enrolment periods: the first took place in the neurology clinics of Kursk Emergency Medicine Hospital [20], and the second one took place in the Regional Vascular Centre of Kursk Regional Clinical Hospital [21] between 2015 and 2017. A group of skilled neurologists evaluated every case. Using the results of brain computed tomography and/or magnetic resonance imaging, the diagnosis of IS during the acute phase of stroke was verified. The vertebrobasilar arteries and the middle cerebral arteries on the left or right were among the impacted regions. The following exclusion criteria were applied when recruiting patients: traumatic brain injury, hepatic or renal failure, cancer, autoimmune diseases, cerebral ischemia of haematological origin, intracerebral haemorrhage, haemodynamic or dissection-related stroke and other conditions that could result in acute cerebrovascular events. Every patient with IS had a medical history of hypertension and was taking antihypertensive drugs.

Low fruit and vegetable intake was defined as per WHO guidelines [22]. During routine medical examinations at governmental institutions and in industrial settings, controls were chosen from Kursk hospitals [23, 24]. They were chosen at the same time and from the same group of people as the IS patients. Healthy volunteers in the control group had normal blood pressure, did not take antihypertensive medication and did not exhibit any clinical signs of major disorders such as cardiovascular or cerebrovascular.

2.1 Genetic Analysis

Genotyping was conducted by the Laboratory of Genomic Research at the Research Institute for Genetic and Molecular Epidemiology of Kursk State Medical University (Kursk, Russia). Each participant’s venous blood samples were stored at –20 °C in tubes coated with EDTA until processing. From thawed blood samples, genomic DNA was isolated using phenol/chloroform extraction and ethanol precipitation techniques. The extracted DNA was assessed for purity, quality and concentration using a Nanodrop spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA).

Genotyping of two SNPs (rs2034598 DNAJA2 and rs7189628 DNAJA2) was performed using allele-specific probe-based polymerase chain reaction (PCR) in accordance with our protocols. For the primer design, Primer3 software version 4.1.0 (https://primer3.ut.ee/) was utilised. SNPs were excluded if we faced challenges in designing allele-specific TaqMan probes owing to methodological issues, including a low CG content in the DNA region flanking an SNP, the presence of GC clamps, runs of identical nucleotides and difficulties with optimising primer annealing conditions. The PCR reaction solution (25 µL) consisted of ~10 ng of DNA, 1.5 units of Hot Start Taq DNA polymerase (Biolabmix, Novosibirsk, Russia) and the following reagent concentrations: 0.25 µM each primer; 0.1 µM each probe; 250 µM each dNTP; 3.5 mM MgCl2 for rs2034598, 5.5 mM MgCl2 for rs7189628; 1× PCR buffer [67 mM Tris-HCl, pH 8.8, 16.6 mM (NH4)2SO4, 0.01% of Tween 20]. The PCR procedure comprised an initial denaturation for 10 min at 95 °C, followed by 39 cycles of 92 °C for 30 s and 61/64 °C for 1 min (for rs2034598 and rs7189628, respectively). Supplementary Figs. 1,2 show allelic discrimination plots for rs2034598 DNAJA2 and rs7189628 DNAJA2 assays designed for this study.

Agena Bioscience’s (San Diego, CA, USA) MassARRAY-4 genetic analyser was used to genotype rs10448231 DNAJA1, rs6500605 DNAJA3, rs4926222 DNAJB1, rs3962158 DNAJB1, rs755892 DNAJB1, rs3731896 DNAJB2 and rs4796756 DNAJC7. The methodology of the genotyping was described in detail in our previous study [25].

To ensure quality control, 10% of the DNA samples were genotyped twice, with the case-control status blinded. The results showed over 99% concordance between the repeated tests.

2.2 Bioinformatics and Statistical Analyses

We used the STATISTICA software (version 13.3, Informer Technologies, Inc., Santa Clara, CA, USA) for the statistical analysis. Because most of continuous variables were characterised by a non-normal distribution, we present these parameters as medians with first and third quartiles [Q1, Q3] and used the Mann–Whitney test for intergroup comparison of continuous variables. Using Pearson’s chi-squared test and Yates’s correction for continuity, categorical variables were examined for statistical significance.

Using Fisher’s exact test, genotype distributions were examined for Hardy–Weinberg equilibrium. The SNPStats software was used to analyse genotype frequencies and their relationships to IS risk [26]. In the analysis of genotype association, the log-additive model was employed. Covariates, which comprised factors indicating variations in the overall biological characteristics of the studied groups, were taken into account when adjusting associations in the entire group of patients and controls.

Two-, three- and four-level genotype combinations (G×G) and genotype–environment combinations (G×E) were investigated using the model-based multifactor dimensionality reduction (MB-MDR) approach. A permutation test was used to assess the empirical p-value (pperm) for each model; models were deemed statistically significant if pperm < 0.05. All calculations were adjusted for covariates, and the most robust models (averaging 2–3 models per level) based on Wald statistics and significance levels were included in subsequent analysis. The MB-MDR approach also revealed specific genetic combinations linked to the phenotypes under investigation (p < 0.05). The MB-MDR tool for the R software (version 3.6.3, R Foundation for Statistical Computing, Vienna, Austria) was used to do the calculations.

The multifactor dimensionality reduction (MDR) approach was used to further analyse the most significant G×G and G×E models in MDR (v3.0.2) (http://sourceforge.net/projects/mdr). The MDR method assessed interaction mechanisms (synergy, antagonism or additive interactions) and interaction strength (individual genes’ and environmental factors’ contributions to trait entropy and interacting effects). The interactions were demonstrated with the use of graphs created from the MDR analysis results.

Utilising the following tools, functional implications of the studied SNPs were examined:

• Genotype-Tissue Expression (GTEx) Portal: this tool was used to analyse the expression quantitative trait loci (eQTLs) effects of the studied SNPs in peripheral blood, vessels and brain [27].

• eQTLGen (https://www.eqtlgen.org/; accessed on May 10, 2024): This resource was used to examine associations of the HSP40 family gene SNPs with eQTLs in peripheral blood [28].

• HaploReg (v4.2), a bioinformatics tool available at https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php (accessed on May 10, 2024) was employed to assess histone modifications associated with HSP40 family gene SNPs. Histone H3 protein lysine residues at positions 27 and 9 (H3K27ac and H3K9ac, respectively) as well as mono- and tri-methylation at position 4 (H3K4me1) and position 4 (H3K4me3) were among the alterations that were examined [29].

• atSNP Function Prediction (http://atsnp.biostat.wisc.edu/search; accessed on May 10, 2024): We used this Web-based tool to evaluate how the presence of reference or alternative alleles affected the effects of SNPs of the HSP40 family genes on DNA affinity to transcription factors (TFs) [30].

• Gene Ontology (GO, https://geneontology.org/): This tool enabled the analysis of joint involvement of TFs linked to the reference or SNP alleles in overrepresented biological processes directly related to the pathogenesis of IS [31]. Biological functions controlled by TFs associated with HSP40 family genes’ SNPs were used as functional groups.

• The STRING database’s bioinformatic tools were utilised to analyse the main functional partners of DNAJA2 and DNAJA3 (https://string-db.org/, accessed on May 11, 2023) [32]. Additionally, the STRING database was utilised to analyse biological processes and molecular functions, focusing on protein–protein interactions between DNAJA2 and DNAJA3 as well as their functionally significant partner proteins.

• Cerebrovascular Disease Knowledge Portal (CDKP, https://cd.hugeamp.org/, accessed on May 11, 2023) was employed to analyse associations of the HSP40 family gene SNPs with IS and IS-associated phenotypes, such as heart rate, hypertension and coronary artery disease [33].

3. Results
3.1 Association of HSP40 Family Gene SNPs with IS Risk

Every SNP under study in the control group had genotype frequencies that were in line with Hardy–Weinberg equilibrium, except for rs3731896 DNAJB2, rs3962158 DNAJB1 and rs755892 DNAJB1, which were excluded from the further analysis.

No associations were found with IS in the entire study population (Table 1).

Table 1. Relationship between HSP40 family SNPs and IS risk (entire study population).
Genetic variant Effect allele Other allele N OR [95% CI] p
rs4926222 DNAJB1 G A 2030 1.08 [0.89–1.30] 0.45
rs2034598 DNAJA2 A G 2065 0.94 [0.82–1.09] 0.41
rs7189628 DNAJA2 T C 1991 1.00 [0.72–1.40] 0.98
rs10448231 DNAJA1 G A 1487 1.06 [0.88–1.27] 0.54
rs4796756 DNAJC7 C T 1513 0.82 [0.66–1.02] 0.075
rs6500605 DNAJA3 G A 1519 0.87 [0.74–1.03] 0.12

All calculations were performed relative to the minor alleles (Effect allele) with adjustment for sex, age and smoking status; statistically significant differences are marked in bold. OR, odds ratio; 95% CI, 95% confidence interval; SNPs, single nucleotide polymorphisms; IS, ischemic stroke; HSP, heat shock protein.

Further analysis stratified by sex revealed that rs2034598 DNAJA2 lowered the risk of IS exclusively in males (odds ratio [OR] = 0.81, 95% confidence interval [CI] 0.78–0.98, p = 0.028) (Table 2).

Table 2. Sex-dependent relationships between HSP40 family SNPsand IS.
Genetic variant Effect allele Other allele N OR [95% CI]1 p2 N OR [95% CI]1 p2
Males Females
rs4926222 DNAJB1 G A 1025 1.13 [0.88–1.46] 0.34 1032 1.04 [0.80–1.35] 0.75
rs2034598 DNAJA2 A G 1048 0.81 [0.67–0.98] 0.028 1040 1.13 [0.92–1.37] 0.24
rs7189628 DNAJA2 T C 997 1.24 [0.78–1.96] 0.36 1018 0.76 [0.48–1.20] 0.23
rs10448231 DNAJA1 G A 719 1.00 [0.79–1.28] 0.99 793 1.09 [0.86–1.37] 0.49
rs4796756 DNAJC7 C T 729 0.92 [0.69–1.24] 0.59 807 0.82 [0.62–1.11] 0.19
rs6500605 DNAJA3 G A 732 0.81 [0.65–1.02] 0.072 812 0.88 [0.70–1.10] 0.27

1: odds ratio and 95% confidence interval; 2: p value; differences that are statistically significant are indicated in bold.

Additionally, to exclude the influence of smoking on the association between rs2034598 DNAJA2 and IS in males, we performed an analysis in subgroups of smoking/non-smoking males and females (Supplementary Table 2). Thus, we found that rs2034598 DNAJA2 lowered the risk of IS exclusively in non-smoking males (OR = 0.60, 95% CI 0.42–0.86, p = 0.004). This suggests that the male sex specifically influences the association between SNP rs2034598 DNAJA2 and IS.

Furthermore, we analysed the association between HSP40 SNPs and stroke localisation. Our findings revealed that the rs2034598 DNAJA2 reduced the risk of stroke in the vertebrobasilar system (OR = 0.74, 95% CI 0.55–0.99, p = 0.04) (Supplementary Table 3).

The next step of our study was to analyse the influence of SNPs on clinical and biochemical parameters. We found that rs7189628 DNAJA2 affected the brain infarct size (pKruskal-Wallis = 0.04; the largest brain infarct size was observed in carriers of the T/T genotype) (Fig. 2A), while rs6500605 DNAJA3 influenced the age of onset of IS (pKruskal-Wallis = 0.03; a later age of manifestation of IS was observed in carriers of the A/A genotype) (Fig. 2B).

Fig. 2.

Association of HSP40 SNPs with clinical characteristics. (A) Brain infarct size for rs7189628 DNAJA2 (p = 0.02 in Mann–Whitney test: C/T vs T/T genotypes), (B) age of onset for rs6500605 DNAJA3 (p = 0.009 in Mann–Whitney test: A/A vs G/A genotypes). SNPs, single nucleotide polymorphisms.

3.2 Gene–Gene Interactions of Polymorphic Loci of the HSP40 Family Genes Associated with the Risk of IS

Applying the MB-MDR methodology, three most important gene–gene interactions associated with the risk of IS were identified (pperm 0.05) (Table 3). The four polymorphic loci involved in the best G×G interactions were as follows: rs10448231 DNAJA1, rs7189628 DNAJA2, rs4926222 DNAJB1 and rs2034598 DNAJA2.

Table 3. Intergenic interactions related to IS risk (obtained using the MB-MDR approach).
Gene-gene interaction models NH beta H WH NL beta L WL Wmax pperm
The best three-locus models of intergenic interactions (for G×G models with pmin < 0.002, 1000 permutations)
rs10448231 DNAJA1 × rs7189628 DNAJA2 × rs4926222 DNAJB1 1 0.2696 7.423 1 –0.1990 10.62 10.62 0.021
rs7189628 DNAJA2 × rs2034598 DNAJA2 × rs4926222 DNAJB1 2 0.2429 10.661 0 NA NA 10.66 0.037
The best four-locus models of gene-gene interactions (for G×G models with pmin < 5.5 × 10-4, 1000 permutations)
rs10448231 DNAJA1 × rs7189628 DNAJA2 × rs2034598 DNAJA2 × rs4926222 DNAJB1 1 0.38158 12.375 1 –0.2228 8.732 12.375 0.048

NH, the quantity of interacting high-risk genotypes; beta H, regression coefficient for interactions with high risk; WH, Wald statistics for interactions with high risk; NL, quantity of interacting low-risk genotypes; beta L, regression coefficient for interactions with low risk; WL, Wald statistics for interactions with low risk; NA, not applicable; pperm, permutational significance levels (with adjustment for sex, age and smoking). Loci involved in at least two best models of intergenic interactions are highlighted in bold.

The MDR method (Fig. 3), first, revealed that SNPs in the best G×G interactions involved mainly moderate and pronounced antagonism, except for the interactions of SNPs rs7189628 DNAJA2 vs rs4926222 DNAJB1/rs10448231 DNAJA1 vs rs4926222 DNAJB1, which exhibited independent (cumulative) effects (in interaction with one another). Second, rs10448231 DNAJA1 was characterised by the strongest mono-effect (2.96% of the entropy of IS), and rs4926222 DNAJB1 was distinguished by the weakest mono-effect (0.04% of entropy). Third, the mono-effect of rs10448231 DNAJA1 was slightly higher than the effects of intergenic interactions (0.10%–0.79%). Fourth, the mono-effect of rs2034598 DNAJA2, which showed a pronounced independent effect on the risk of developing IS when analysing individual genetic variants, when analysing intergenic interactions showed a mono-effect equal to 0.72% and similar to the outcomes of intergenic interactions (0.58%–0.79% contribution to entropy). Fifth, the following combinations of genotypes possessed the most robust connections to IS: rs10448231 DNAJA1 A/A×rs7189628 DNAJA2 C/T×rs4926222 DNAJB1 A/A (Beta = –0.19897; p = 0.001); rs7189628 DNAJA2 C/T×rs2034598 DNAJA2 A/A×rs4926222 DNAJB1 A/G (Beta = 0.22273; p = 0.004); and rs10448231 DNAJA1 A/A×rs7189628 DNAJA2 C/T×rs2034598 DNAJA2 A/A×rs4926222 DNAJB1 A/G (Beta = 0.381584; p = 0.0004) (Supplementary Table 4).

Fig. 3.

A graph of the most significant G×G interactions of HSP40 family genes’ SNPs associated with IS. The type of interaction is indicated by the colour of the lines: blue and green means strong and middling antagonism; brown denotes independent (cumulative) effects; % indicates the direction and strength of the gene–gene interaction’s phenotypic effect (as a percentage of entropy).

3.3 Gene–Environment Interactions of Polymorphic Loci of the HSP40 Family Genes Associated with the Risk of IS (MB-MDR and MDR Modelling)

The four most important models of gene–environment interactions connected to IS were identified using the MB-MDR methodology (Table 4). Overall, smoking in combination with five SNPs (rs6500605 DNAJA3, rs10448231 DNAJA1, rs2034598 DNAJA2, rs7189628 DNAJA2 and rs4926222 DNAJB1) was present in the best models of G×E interactions. Next, we used the MDR method to examine the mechanisms of relationship between smoking and these genetic variants (Fig. 4).

Fig. 4.

The graph of the most significant G×E interactions of HSP40 family SNPs associated with IS. The type of interaction is indicated by the colour of the lines: red denotes a powerful synergy, blue and green: strong and middling antagonism, respectively; brown denotes independent (cumulative) effects; % indicates the direction and strength of the gene–gene interaction’s phenotypic effect (as a percentage of entropy).

Table 4. Gene–environment interactions related to IS (obtained using the MB-MDR approach).
Gene-gene interaction models NH beta H WH NL beta L WL Wmax pperm
The best two-factor models of gene-smoking interactions (for G×E models with pmin < 1 × 10-22, 1000 permutations)
SMOKE × rs6500605 DNAJA3 2 0.2543 101.93 2 –0.2230 84.44 101.93 <0.001
SMOKE × rs10448231 DNAJA1 2 0.2506 101.28 2 –0.2506 101.28 101.28 <0.001
The best three-factor models of gene- interactions (for G×E models with pmin < 1 × 10-23, 1000 permutations)
SMOKE × rs10448231 DNAJA1 × rs2034598 DNAJA2 5 0.2545 102.37 6 –0.2573 105.86 105.86 <0.001
The best four-factor models of gene- interactions (for G×E models with pmin < 1 × 10-24, 1000 permutations)
SMOKE × rs10448231 DNAJA1 × rs7189628 DNAJA2 × rs4926222 DNAJB1 7 0.2696 111.6 5 –0.2293 86.37 111.6 <0.001

pperm, permutational significance levels (with adjustment for sex, age and smoking); Loci involved in at least two best models of intergenic interactions are highlighted in bold.

First, MDR revealed that smoking had the most pronounced single effect (3.47% contribution to IS entropy), exceeding mono-effects of SNPs involved in the best of G×E models (0.03%–0.73% contribution to entropy) and exceeding the effects of G×E interactions (0.02%–0.96% of the entropy). Second, the strongest G×E models contained SNPs having multidirectional effects in their interactions with smoking: significant synergy when interacting with rs6500605 DNAJA3 and rs10448231 DNAJA1; moderate antagonism in interaction with rs4926222 DNAJB1; and independent (cumulative) impacts in interaction with rs2034598 DNAJA2 and rs7189628 DNAJA2. Third, every SNP present in the top G×E models featured strong and middling antagonism in interaction with each other, except for SNPs rs6500605 DNAJA3 and rs10448231, which exhibited additive (independent) effects. Fourth, the rs10448231 genetic variant interacting with smoking formed the basis of three of the four best gene–environment interaction models. Fifth, IS had the strongest associations with the following gene–environment interactions: smoking×rs6500605 DNAJA3 A/A (Beta = 0.211652, p = 2.73 × 10-11); non-smoking×rs6500605 DNAJA3 G/A (Beta = –0.16535, p = 4.14 × 10-9); smoking×rs10448231 DNAJA1 A/A (Beta = 0.1822, p = 6.70 × 10-10); smoking×rs10448231 DNAJA1 G/G×rs2034598 DNAJA2 A/A (Beta = 0.21936; p = 2.64 × 10-6); smoking×rs10448231 DNAJA1 A/A×rs7189628 DNAJA2 C/C×rs4926222 DNAJB1 A/A (Beta = 0.192881; p = 3.54 × 10-8); and non-smoking×rs10448231 DNAJA1 A/G×rs7189628 DNAJA2 C/C×rs4926222 DNAJB1 A/A (Beta = –0.158, p = 2.36 × 10-6) (Supplementary Table 5).

The summary of the effects of HSP40 family SNPs on the contribution to IS is provided in Table 5.

Table 5. Summarised results of the analysis of G×G and G×E interactions using the MB-MDR and MDR methods (analysis of mono-effects of SNPs, G×G and G×E interactions relative to the contribution to the entropy of IS).
SNP G×G interactions G×E interactions
Mono-effect GG-effect Mono-effect SNP-smoking-effect
rs10448231 DNAJA1 2.96% 1.26% 0.03% 0.62%
rs7189628 DNAJA2 0.21% 1.14% 0.41% 0.18%
rs2034598 DNAJA2 0.72% 2.03% 0.62% 0.15%
rs4926222 DNAJB1 0.04% 0.79% 0.73% 0.27%
rs6500605 DNAJA3 - 0.33% 0.62%

MB-MDR, model-based multifactor dimensionality reduction; MDR, multifactor dimensionality reduction.

3.4 Functional Annotation of HSP40 Family Genes’ SNPs
3.4.1 QTL Effects

Data from the eQTLGen Browser revealed that the C allele of rs7189628 DNAJA2 lowers the expression levels of RP11-169E6.1, DNAJA2 and PHKB while increasing the expression levels of NETO2 and GPT2 in whole blood (Table 6). The A allele of s6500605 DNAJA3 increases the expression of NMRAL1, DNAJA3, HMOX2 and CDIP1 and lowers the expression of PAM16, MGRN1, CORO7, ROGDI and C16orf96 in whole blood (Table 6).

Table 6. Detected cis-eQTL effects of HSP40 family genes’ SNPs (eQTL Gen browser data).
SNP Effect Allele Gene expressed Z-score p-value
rs7189628 DNAJA2(C/T) C RP11-169E6.1 (–7.9132) 2.51 × 10-15
NETO2 (7.0376) 1.96 × 10-12
DNAJA2 (–5.6849) 1.31 × 10-8
PHKB (–4.995) 5.88 × 10-7
GPT2 (4.5014) 6.7507 × 10-6
rs6500605 DNAJA3(A/G) A NMRAL1 (44.1326) 4.82 × 10-54
DNAJA3 (15.4788) 8.29 × 10-47
PAM16 (–14.3674) 7.91 × 10-36
MGRN1 (–12.4954) 1.11 × 10-22
HMOX2 (9.8015) 1.35 × 10-10
CORO7 (–6.4214) 6.97 × 10-10
ROGDI (–6.1666) 3 × 10-6
CDIP1 (4.6605) 3.4173 × 10-6
C16orf96 (–4.6438) 4.82 × 10-54

eQTL, expression quantitative trait loci.

Additionally, data from the GTEx portal (Supplementary Table 5) revealed that allele A of rs6500605 DNAJA3 affects expression levels of eight genes. Specifically, CDIP1 has lower expression in brain tissues but increased expression in arteries and whole blood. CORO7 shows increased expression in brain tissues, while HMOX2 has lower expression in arteries and brain. NMRAL1 has lower expression in arteries, brain and whole blood. MGRN1 and PAM16 exhibit increased expression in whole blood. RP11-295D4.3 shows increased expression in the aorta, pituitary and blood, and VASN has increased expression in the brain (Supplementary Table 6).

Interestingly, rs2034598 DNAJA2 did not show any cis-eQTL effects in blood, brain tissues or vessels.

3.4.2 Transcription Factors

Allele A of rs2034598 DNAJA2 creates DNA-binding sites for 20 TFs involved in artery morphogenesis (GO:0048844), while allele G of rs2034598 DNAJA2 generates DNA-binding sites for 44 TFs involved in five biological processes: interleukin-9-mediated signaling pathway (GO:0038113), cellular response to estrogen stimulus (GO:0071391), intracellular estrogen receptor signaling pathway (GO:0033146), regulation of angiogenesis (GO:0016525) and regulation of apoptotic process (GO:0042981) (Supplementary Table 7).

Allele A of rs6500605 DNAJA3 creates DNA-binding sites for 32 TFs that participate in neuron differentiation (GO:0030182), generation of neurons (GO:0048699) and neurogenesis (GO:0022008) (Supplementary Table 8). Allele G of rs6500605 DNAJA2 creates DNA-binding sites for 26 TFs involved in positive regulation of smooth muscle cell apoptotic process (GO:0034393) and negative regulation of angiogenesis (GO:0016525) (Supplementary Table 8).

Allele C of rs7189628 DNAJA2 generates DNA-binding sites for 64 TFs related to three GO terms: cellular response to reactive oxygen species (GO:0034614), epigenetic regulation of gene expression (GO:0040029) and regulation of apoptotic process (GO:0042981) (Supplementary Table 9). Allele T of rs7189628 DNAJA2 created DNA-binding sites for 28 TFs involved in 14 biological processes: negative regulation of interleukin-4 production (GO:0032713), forebrain radial glial cell differentiation (GO:0021861), negative regulation of interleukin-5 production (GO:0032714), negative regulation of oxidative stress-induced neuron intrinsic apoptotic signaling (GO:1903377), positive regulation of vascular endothelial growth factor production (GO:0010575), cytokine production (GO:0001816), regulation of transforming growth factor beta production (GO:0071634), negative regulation of gliogenesis (GO:0014014), cellular response to hypoxia (GO:0071456), angiogenesis (GO:0001525), response to oxidative stress (GO:0006979), response to growth factor (GO:0070848), cellular response to cytokine stimulus (GO:0071345) and chromatin organization (GO:0006325) (Supplementary Table 9).

3.4.3 Protein–Protein Interactions of IS-Associated Genes

The STRING database identified 10 proteins having the most significant interactions with DNAJA2: HSPA8, HSPA1B, HSPA1L, DNAJA1, HSPA2, DNAJB1, HSPA4, HSP90AA1, DNAJA4 and STIP1 (Fig. 5, Supplementary Table 10).

Fig. 5.

Predicted functional partners of DNAJA2. PPI enrichment p-value: 1.91 × 10-14.

DNAJA2 and its main partners in function are involved in 21 GO terms, primarily representing proteostasis [e.g., protein folding (GO:0006457, false discovery rate {FDR} = 5.53 × 10-18), protein refolding (GO:0042026, FDR = 3.17 × 10-17), chaperone-mediated protein complex assembly (GO:0051131; FDR = 0.0003), response to unfolded protein (GO:0006986, FDR = 2.44 × 10-12) and cellular response to unfolded protein (GO:0034620, FDR = 0.0001)], mitochondrial homeostasis [protein insertion into mitochondrial outer membrane (GO:0045040, FDR = 0.017), protein localization to mitochondrion (GO:0070585, FDR = 0.01), mitochondrial membrane organization (GO:0007006, FDR = 0.022) and mitochondrial transport (GO:0006839, FDR = 0.04)] as well as cellular response to stress (GO:0033554, FDR = 0.04), response to heat (GO:0009408, FDR = 2.2 × 10-8), cellular response to steroid hormone stimulus (GO:0071383, FDR = 0.04), chaperone-mediated autophagy (GO:0061684, FDR = 0.01) and negative regulation of transcription from RNA polymerase II promoter in response to stress (GO:0097201, FDR = 0.02) (the full list of biological processes associated with DNAJA2 and its main functional partners is presented in Supplementary Table 11).

The analysis of the primary functional partners of DNAJA3 using the STRING database revealed 10 proteins with the most prominent interactions: HSPA9, HSPA8, HSPA1B, HSPA1A, HSPA4, HSPA5, GRPEL1, GRPEL2, HSPA1L and CLPP (Fig. 6, Supplementary Table 12).

Fig. 6.

Predicted functional partners of DNAJA3. PPI enrichment p-value: 9.4 × 10-14.

Supplementary Table 12 presents the complete list of 54 biological events involving DNAJA3 and its partners. Here are some key processes that seem to be most important for the pathogenesis of IS: response to heat (GO:0009408; FDR = 0.0063), response to stress (GO:0006950; FDR = 0.0156), negative regulation of apoptotic process (GO:0043066; FDR = 0.0156), protein refolding (GO:0042026; FDR = 3.26 × 10-11), chaperone cofactor-dependent protein refolding (GO:0051085; FDR = 6.85 × 10-11), response to unfolded protein (GO:0006986; FDR = 3.68 × 10-10), mitochondrial membrane organization (GO:0007006; FDR = 0.00047), ATP metabolic process (GO:0046034; FDR = 0.0178), positive regulation of tumor necrosis factor-mediated signaling pathway (GO:1903265; FDR = 0.0068) and others (Supplementary Table 13).

3.4.4 Phenome-wide Association Study (PheWAS) Associations

According to CDKP data, the examined SNPs were linked to a decreased risk of cerebrovascular diseases and stroke severity (e.g., large artery stroke, white matter hyperintensities in cerebral small vessel disease, intracerebral haemorrhage or small-vessel IS and modified Rankin scale score 0–1 vs 2–6 adj stroke severity) and improvement in related traits (e.g., lower systolic blood pressure, lower heart rate and mean arterial pressure). The full list of phenotypes related to IS is presented in Table 7.

Table 7. Association analysis of HSP40 family gene SNPs with cerebrovascular diseases and related phenotypes (CDKP data).
SNP Phenotype p-value Beta (OR) Sample size
1. rs2034598 DNAJA2 (G/A) Systolic blood pressure 0.0017 Beta –0.0052 1,914,060
2. Pulse pressure 0.002 Beta –0.0051 953,984
3. Coronary artery calcification 0.02 Beta –0.0210 9584
4. White matter hyperintensities in cerebral small vessel disease 0.02 Beta –0.0247 18,381
5. Heart rate 0.03 Beta –0.0049 484,178
6. rs7189628 DNAJA2(C/T) Large artery stroke 0.018 OR 0.9012 36,651
7. Intracerebral hemorrhage or small-vessel ischemic stroke 0.03 OR 0.9012 24,366
8. rs6500605 DNAJA3(A/G) Pulse pressure 0.0001 Beta –0.0047 929,653
9. Systolic blood pressure 0.0005 Beta –0.0051 1,879,660
10. Myocardial infarction 0.0009 OR 0.9771 302,705
11. Mean arterial pressure 0.008 Beta –0.0107 147,482
12. Coronary artery disease (CAD) 0.02 OR 0.9893 1,250,750
13. Modified Rankin scale score 0–1 vs 2–6 adj stroke severity 0.03 OR 0.8843 4226
14. Hypertension 0.03 OR 0.9940 603,130

Effect alleles are marked in bold.

3.4.5 Histone Modifications

HaploReg data revealed a substantial effect of rs6500605 DNAJA3 on histone tags in brain tissues and blood. This SNP is in a DNA region binding to H3K4me1 and H3K4me3, with the effect being further enhanced by H3K27ac and H3K9ac in most brain regions and in whole blood (Table 8).

Table 8. Effects of SNPs on histone modifications in brain tissue and whole blood.
SNP (Ref/Alt allele) Tissues Marks Brain Blood
(1) (2) (3) (4) (5) (6) (7) (8)
rs2034598 DNAJA2 (G/A) H3K4me1 - - Enh Enh Enh - Enh Enh
H3K27ac - - - Enh - Enh - Enh
H3K9ac - - - - - - - Pro
rs7189628 DNAJA2 (C/T) H3K4me1 - - - - - - - Enh
H3K27ac - - Enh Enh - - - Enh
rs6500605 DNAJA3 (A/G) H3K4me1 Enh Enh Enh Enh Enh Enh Enh Enh
H3K4me3 - - - - Pro - Pro Pro
H3K27ac - - Enh Enh Enh Enh Enh Enh
H3K9ac - - Pro Pro Pro Pro Pro Pro

1: Brain hippocampus middle; 2: brain substantia nigra; 3: brain anterior caudate; 4: brain cingulate gyrus; 5: brain inferior temporal lobe; 6: brain angular gyrus; 7: brain dorsolateral prefrontal cortex; 8: cells from peripheral blood (any). Effect alleles are marked in bold.

SNP rs2034598 DNAJA2 is in a DNA region binding to H3K4me1. The effect is increased by H3K27ac in some brain tissues and in blood, but only in blood is the effect also enhanced by H3K9ac (Table 8).

Lastly, SNP rs7189628 DNAJA2 is marked by H3K4me1 in blood and by H3K27ac in brain anterior caudate and brain cingulate gyrus, as well as in blood (Table 8).

4. Discussion

This is the first paper to show a link between the IS risk, IS clinical characteristics and SNPs in genes encoding HSP40 family proteins. Namely, in our study, SNP rs7189628 DNAJA2 increased brain infarct size, rs6500605 DNAJA3 influenced the age of IS onset, and SNP rs2034598 DNAJA2 had protective effects against IS exclusively in males. The observed sexual dimorphism may be linked to an influence of sex hormones on DNAJA2expression levels. For instance, animal studies have found significant changes in the expression levels of DNAJA2 between males and females [34, 35]. Moreover, in our work, SNP rs2034598 DNAJA2 was associated with a reduced risk of stroke in vertebrobasilar arteries. We demonstrated that SNPs rs10448231 DNAJA1, rs7189628 DNAJA2, rs4926222 DNAJB1 and rs2034598 DNAJA2 were involved in the top epistatic interactions related to IS; SNP rs10448231 DNAJA1 manifested the strongest mono-effect (2.96% of IS entropy); pairwise combination rs7189628 DNAJA2×rs4926222 DNAJB1 was an important determinant of the IS risk (characterised all the best SNP–SNP interaction models). In correlations with the SNPs, cigarette smoking was shown to have multifaceted effects (synergism, antagonism or additive effects) and the largest mono-effect (3.47% of IS entropy) exceeding the mono-effects of rs6500605 DNAJA3, rs10448231 DNAJA1, rs2034598 DNAJA2, rs7189628 DNAJA2 and rs4926222 DNAJB1; they were involved in the best G×E models and determined 0.03%–0.73% of IS entropy.

The role of HSPs in diseases associated with misfolded and aggregated proteins, including IS, is well established. HSPs are critical for proteostasis, especially under brain injury scenarios. Notably, HSP70 and its co-chaperone HSP40 are significantly induced in response to the brain injury [36]. Although the involvement of HSP70 in ischemia is well documented due to its abundance and its functions in the repair, refolding and trafficking of damaged proteins, the participation of HSP40 in ischemia has not been extensively researched.

HSP40 co-chaperone proteins regulate the formation of complexes between HSP70 and client proteins. The J-domain, present in all HSP40 family members, is crucial for regulation of HSP70’s ATPase activity [37]. Together, HSP70 and HSP40 constitute a system essential for proteostasis [38].

In cardiovascular diseases, HSP40 expression is notably increased in atherosclerotic plaques, thus surpassing the levels of HSP70 or HSP60 [39]. Nguyen et al. [39] have proposed that HSP40 member DNAJA1 can be responsible for T-cell activation during the immunopathogenesis of atherosclerotic lesions. Additionally, previous research has revealed DNAJA2 underexpression in angiotensin II–induced endothelial dysfunction [40]. The involvement of HSP40 in atherosclerosis is confirmed by recent findings from Kumari et al. [41], who identified DNAJA1 and DNAJB1 as key players in the biogenesis of ApoB100. Because apolipoprotein B–containing lipoproteins are major drivers of atherosclerosis and related conditions, including IS [42], the participation of HSP40 proteins in Apolipoprotein B (ApoB) biogenesis highlights their potential role in increased IS risk. Additionally, elevated DNAJC2 antibody (DNAJC2-Ab) levels have been linked to an elevated risk of stroke [43]. This association is explained by those authors as an initial response to bacterial infection [44], with this response being further amplified by overexpression of DnaJ family proteins in atherosclerotic plaques as the disease progresses [39].

Because to date, there has been no research on effects of the HSP40 SNPs under study on IS pathogenesis, we conducted a comprehensive bioinformatic analysis, employing a variety of tools to annotate the potential effects these SNPs can have on the initiation and progression of IS.

A summary of the overall molecular effects of HSP40 SNPs on IS pathology is depicted in Fig. 7.

Fig. 7.

An outline of involvement of HSP40 genes’ polymorphisms in molecular mechanisms of IS (results of comprehensive annotation of the SNPs). GO, Gene Ontology; ROS, reactive oxygen species.

First, analysis of DNAJA2’s protein–protein interactions uncovered its interactions with numerous chaperones, including HSPA8 and adapter proteins, which we have previously found to be associated with IS risk [45, 46]. The STRING database analysis revealed that DNAJA2 and DNAJA3, along with their functional partners, participate in various biological processes essential for cell function and survival, such as proteostasis, mitochondrial homeostasis, autophagy and the cellular response to stress. Notably, DNAJA2 takes part in the cellular response to a steroid hormone stimulus, thereby providing an additional link to the sex-specific properties of SNP rs2034598 DNAJA2 (Fig. 7).

Second, TF-associated biological processes of the studied SNPs include angiogenesis, regulation of apoptotic process, response to oxidative stress, regulation of inflammation and cytokine production. Angiogenesis — a critical defensive reaction to hypoxia — plays a major part in the modulation of long-term neurological recovery after IS [47]. Regulation of apoptosis is crucial for improving post-stroke recovery [48]. The oxidative stress that develops under ischemic conditions [49] not only leads to cell death but also contributes to the pathogenesis of IS by breaking down the blood–brain barrier and causing focal destructive lesions [50]. Inflammation further exacerbates neuronal cell death [51] and is central to the development of atherosclerosis: a major risk factor for IS [52].

Third, data from the CDKP indicate that the analysed SNPs are associated with large-artery stroke, IS-related phenotypes (e.g., blood pressure, heart rate or hypertension) and white-matter hyperintensities in cerebral small vessel disease as well as other atherosclerosis-related disorders (e.g., coronary artery disease, coronary artery calcification and myocardial infarction).

Fourth, our examination of cis-eQTL effects revealed that the DNAJA2 and DNAJA3 SNPs influence expression levels of several genes, implying possible mechanisms by which HSP40 SNPs may affect IS development. Among those genes with altered expression are NMRAL1, HMOX2, CDIP1, CORO7, MGRN1, PAM16, VASN, PHKB and GPT2 (Fig. 7) [53].

NMRAL1 regulates redox homeostasis by suppressing reactive oxygen species (ROS) and nitric oxide (NO) production. Intensive and persistent oxidative stress leads to translocation of NMRAL1 from the cytoplasm to the nucleus, where it regulates the DNA damage response while linking cellular redox status to other signalling pathways and fine-tuning the cellular response to redox changes [54]. Considering the leading role of oxidative stress in the risk of atherosclerosis-associated diseases [23, 55, 56], including IS, as evidenced by numerous articles [20, 57], the regulation of oxidative stress by NMRAL1 is particularly noteworthy. In hypoxia, a critical factor of cardiovascular pathologies [58, 59, 60] that is called HMOX2 senses the lower O2 concentration and reduces the formation of carbon monoxide (CO) [61]. HMOX2-deficient mice exhibit increased apoptosis around the infarct area following focal transient ischemia, indicating a protective effect of HMOX2 against IS [62]. Silencing of CDIP1 inhibits apoptosis and improves angiogenesis in myocardial infarction [63], thus highlighting its participation in apoptotic regulation and a vascular response to ischemic injury [64]. CORO7 is required for the activation of the Hippo pathway, which controls tissue homeostasis through the modulation of cell proliferation and apoptosis [65]. The hippo pathway takes part in pathological processes of IS, including oxidative stress, an inflammatory response, blood–brain barrier damage, mitochondrial disorders and neural cell death [66]. Loss of MGRN1 has chronic and age-dependent effects on mitochondrial homeostasis in the brain [67]. PAM16 encodes a mitochondrial protein involved in granulocyte-macrophage colony-stimulating factor (GM-CSF) signal transduction; this signalling pathway enhances collateral artery growth and reduces infarct volume in a mouse model of brain ischemia [68]. This protein also participates in the import of nuclear-encoded mitochondrial proteins into the mitochondrial matrix and may be important in ROS homeostasis [69]. Following arterial injury, neointimal development is facilitated by the down-regulation of VASN (vasorin), indicating that it modifies the arterial wall’s biological responses to pathogenic stimuli [70]. PHKB encodes the β-subunit of glycogen phosphorylase kinase, which seems to be inactivated during ischemia, thereby leading to impaired metabolism of glycogen and its accumulation in astrocytes, thus worsening the outcomes of stroke [71, 72]. Glutamate pyruvate transaminase 2 (GPT2) has glutamate-scavenging activity in the blood: lowering of glutamate levels. This reduction is linked to decreased mortality, smaller infarcted brain volume and oedema as well as improved neurological outcome following artery occlusion [73].

Future research should focus on the functional effects of the HSP40 family SNPs not covered in this work and their potential roles in IS pathogenesis. Moreover, understanding how HSP40 family SNPs regulate gene expression and protein function in the context of IS is crucial. Longitudinal studies are also needed to track changes in HSP40 genes’ expression and proteins’ levels in IS patients, through examination of associations with disease progression and outcomes. Finally, our findings may help to identify therapeutic targets within HSP40 pathways and thus could lead to the development of novel interventions for the prevention and treatment of IS.

5. Conclusions

Our study is the first to demonstrate a role of HSP40 SNPs in the risk of IS and its clinical manifestations. We analysed intergenic and gene–environment interactions of HSP40 SNPs towards IS risk and performed a comprehensive bioinformatic analysis to reveal the underlying molecular mechanisms. This approach provides new insights into the genetic factors influencing IS and highlights the significance of HSP40 polymorphisms in the disease’s pathogenesis.

6. Study Limitations

First, we were unable to include other HSP40 SNPs in the study owing to methodological challenges; this limitation could have led to loss of information about the associations of these SNPs with the risk of IS. Second, the absence of data on physical activity and fruit and vegetable intake in the control group limited our ability to conduct MB-MDR and MDR modelling that would have incorporated these environmental factors. Third, we did not measure HSP40 gene expression, thereby precluding analysis of differences in expression levels between males and females and assessment of sex-specific differences as well as evaluation of the effect of DNAJA2 genotypes on gene expression levels.

Availability of Data and Materials

All data reported in this paper will be shared by the lead contact upon request from corresponding author.

Author Contributions

OYB designed the research study. KAK, DEG, VMP and EAP performed the research. AVP analyzed the data. KAK, DEG and OYB wrote the manuscript. All authors contributed to editorial changes in the manuscript. All authors read and approved the final manuscript. All authors have participated sufficiently in the work and agreed to be accountable for all aspects of the work.

Ethics Approval and Consent to Participate

The study partially used biological material from the biobank of the Research Institute of Genetic and Molecular Epidemiology, collected from 2010 to 2012 as part of previously conducted studies (with permission from the Ethical Review Committee of Kursk State Medical University, Protocol No. 1 from 21.01.2010). Additional collection of biological material for the purpose of this study was carried out in the period 2015–2017 (Protocol No. 12 from 7.12.2015). The study was conducted according to the guidelines of the Declaration of Helsinki. All the participants or their families/legal guardians gave written informed consent before the enrollment in this study.

Acknowledgment

Not applicable.

Funding

This research was funded by Russian Science Foundation (№ 22-15-00288, https://rscf.ru/en/project/22-15-00288/).

Conflict of Interest

The authors declare no conflict of interest.

Supplementary Material

Supplementary material associated with this article can be found, in the online version, at https://doi.org/10.31083/j.jin2312211.

References

Publisher’s Note: IMR Press stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.