Assessing known chronic kidney disease associated genetic variants in Saudi Arabian populations

Background Genome wide association studies of patients with European descent have identified common variants associated with risk of reduced estimated glomerular filtration rate (eGFR). A panel of eight variants were selected to evaluate their association and prevalence in a Saudi Arabian patient cohort with chronic kidney disease (CKD). Methods Eight genetic variants in four genes (SHROOM3, MYH9, SLC7A9, and CST3) were genotyped in 160 CKD patients and 189 ethnicity-matched healthy controls. Genetic variants were tested for association with the development of CKD (eGFR < 60 ml/min/1.73m2) and effects were compared with results obtained from 133,413 participants in the CKD genetics consortium. Multivariable regression was used to evaluate the role of these eight variants in improving prediction of CKD development. Results All eight variants were present in Saudi populations with minor allele frequency ranging from 16 to 46%. The risk variant in all four genes demonstrated the same direction of effect as observed in European populations. One variant, rs4821480, in MYH9 was significantly associated with increased risk of development of CKD (OR = 1.69, 95% CI 1.22–2.36, P = 0.002), but the additional variants were not statistically significant given our modest sample size. Conclusions CKD risk variants identified in European populations are present in Saudis. We did not find evidence to suggest heterogeneity of effect size compared to previously published estimates in European populations. Multivariable logistic regression analysis showed a statistically significant improvement in predicting the CKD using models with either FGF23 and vitamin D or FGF23, vitamin D level, and MYH9 genotypes (AUC = 0.93, 95% CI 0.90–0.95, P <  0.0001). Electronic supplementary material The online version of this article (10.1186/s12882-018-0890-9) contains supplementary material, which is available to authorized users.


Background
Chronic Kidney disease (CKD) has been rising in prevalence globally in recent years with shifting demographics including an aging population, lifestyle changes associated with urbanization, obesity and type 2 diabetes (T2D). The morbidity and mortality costs of CKD on society are significant, and are projected to become an even larger healthcare burden [1,2]. In the Kingdom of Saudi Arabia (KSA), CKD is estimated to affect approximately 1.72 million Saudis [3], equating to approximately 6% of the population. Furthermore, in KSA there is a consistent annual increase in the rate of CKD patients undergoing hemodialysis after development of end stage renal disease (ESRD) [4]. In a 2010 KSA investigation only 7.1% of Saudis with CKD were aware of their disease status and such under-and late-diagnoses of CKD often results in poorer outcomes in such patients [3].
A strong heritable component to CKD and its risk factors exists [5,6], independent of its largest risk factors hypertension and T2D [7,8]. Monogenic pediatric renal diseases, such as congenital and infantile nephrotic syndromes, appear significantly more common in KSA than in the western world [9]. Over the last decade, genome wide association studies (GWAS) have proven to be an invaluable tool for identifying common variants associated with complex human diseases with small to moderate effects in agnostic hypothesis free approaches [10]. The identified variants and genes can yield novel insights into disease pathogenesis. GWAS of European white populations examining estimated glomerular filtration rate (eGFR) and CKD have identified greater than 50 associated loci [5,[11][12][13][14]. Variants in Shroom Family Member 3 (SHROOM3) have been identified as a CKD susceptibility locus through GWAS. SHROOM3 is regulator of epithelial cellular arrangement and planar remodeling [15], which contributes to glomerular filtration barrier integrity [16]. One CKD-associated SHROOM3 variant, rs17319721, was shown to impact cis-expression and renal allograft fibrosis [17]. Polymorphisms in Myosin Heavy Chain 9 (MYH9) have also been shown to be associated with CKD risk in admixed nondiabetic nephropathy, and focal segmental glomerulosclerosis (FSGS) [18,19]. Genetic polymorphisms in Solute Carrier Family 7 Member 9 (SLC7A9), an amino acid transporter known to be expressed in renal proximal tubule cells, cause cystinuria [20], are associated with GFR [5,21], and have been identified as a risk factor for CKD patients of European ancestry [12]. Variants in Cystatin C (CST3), have been also shown to impact altered eGFR and kidney disease [22].
While ethnicity is thought to play a large role in CKD genetics, very few genotyping studies of CKD loci have been performed in Saudi patients to date. In this study, we genotyped eight SNPs in four genes which have been shown to be associated with CKD mainly in populations of European descent. Herein we present the allele frequencies of said SNPs in 160 Saudi CKD and 189 non-CKD subjects from KSA, and study associations with CKD. Additionally, we employed multivariate analyses to examine the utility of SNPs and CKD-related biomarkers in CKD risk progression.

Methods
Following research ethics approval from the University of Dammam Institutional Research Board (IRB), blood samples from 160 Saudi Arabian patients with CKD were collected in the outpatient Nephrology department at King Fahd University Hospital. All participants gave informed consent. To confirm disease associations, 189, ethnically-matched donors with no prior medical or family history of CKD were recruited for comparison to a healthy population. Informed consent for DNA analysis was obtained from all research participants under the supervision of the IRB. DNA was extracted from whole blood using QIAmp Blood DNA mini kit (Qiagen, Germany). Eight polymorphisms (SHROOM3: rs9992101, rs17319721; SLC7A9: rs4805834; MYH9: rs4821480, rs4821481, rs2032487, rs3752462; CST3: rs13038305) were genotyped using TaqMan® assays (ThermoFisher, USA). Call rates greater than 95% and assessment for Hardy-Weinberg Equilibrium (HWE) were established for quality control.
Allele frequencies and genetic variant association with eGFR from European descent populations were obtained from the CKD genetics consortium (CKDGen) [13]. The CKDGen cohort is composed of including data from 67 studies and up to 133,814 participants in populationbased and randomized control trials. The beta coefficients of the linear regression were corrected for age [2], sex, and population stratification in CKDGen. Kidney function was reported as sex-and age-adjusted log transformed eGFR (n = 133,814, overall mean = 87.12 ml/ min/1.73 m2, standard deviation = 23.09), with 14% of participants having an eGFR < 60 ml/min/1.73m 2 . Vitamin D3 and Fibroblast growth factor 23 (FGF23) levels were obtained using the RECIPE ClinRep® HPLC Complete Kit (RECIPE Chemicals, GmbH, Munich, Germany) and the Human FGF-23 ELISA assay kit (Millipore, USA) respectively. Parathyroid hormone (PTH), serum phosphorus, serum calcium, creatinine, alkaline phosphatase (ALP), albumin and urine protein were assessed using the Flex® reagent cartridge on Siemens Dimension RxL chemistry system (Siemens Healthcare, GmbH, Erlangen, Germany). Based on clinical diagnostic criteria, patients were stratified by clinical comorbidities including Type 2 Diabetes (fasting glucose level ≥ 121 mg/dL) and Hypertension (systolic BP ≥ 130 mmHg and diastolic BP ≥ 85 mmHg).
Clinical and demographic variables were assessed for correlation with CKD status by using 2-sided, unpaired student's t-test, Mann Whitney and chi-square tests. Association testing and Multivariate logistic regression and AUC analyses were performed using an R Companion for the Handbook of Biological Statistics (https://rcompanion.org/rcompanion/e_07.html).

Results
The baseline characteristics of the study participants demonstrated are shown on Table 1. The median age of CKD patients and healthy donors were 47.7 years and 32.1 years, respectively, with a larger proportion of healthy donors being male (76.6% versus 53.1%, P < 5 × 10 − 4 ). There were no significant differences in the BMI between cases and controls (mean of 27.7 and 27.9, respectively, P = 0.824). CKD cases included 64 patients with CKD stage 3, 20 patients with CKD stage 4, and 76 patients with CKD stage 5 (N = 160). Glomerulonephritis was present in 62 patients including 45 with lupus nephritis. Comorbidities in CKD patients included T2D (n = 57), hypertension (n = 37), coronary disease (n = 29), and congestive heart failure (n = 12). The mean estimated glomerular filtration rate (eGFR) in patients was 17 ml/min/1.73 m 2 with a standard deviation of 25 ml/min/1.73 m 2 . As expected, there were significant differences between the Vitamin D3 and FGF23 levels in patients with CKD as compared to healthy donors (Fig. 1b), 22.3 ng/mL vs 28.9 ng/ mL and 485.6 pg/mL vs 39.4 pg/mL (P < 5 × 10 − 4 ), respectively. CKD patients were then stratified on the basis of one or more comorbid conditions, including hypertension and T2D, and with hypertension and diabetic nephropathy, and evaluated for significant associations to biochemical laboratory test values. Patients with end-stage renal disease (ESRD) demonstrated significantly altered biochemical laboratory values compared to those with earlier stages of CKD ( Fig. 1), although no significant differences in vitamin D3 or serum calcium levels were observed (data not shown).  Fig. 1 Significant associations in Saudi CKD patients were revealed in eight of ten measured biochemical analytes stratified across CKD, ESRD, Type 2 Diabetes, and hypertension. a One-way ANOVA analysis of clinical analytes were evaluated for statistical significance across measured CKD comorbidities. Serum calcium and vitamin D were not significant. b Unpaired two-tailed Student's t-test of Vitamin D and FGF23 between healthy and CKD disease states. Significance is noted as follows: *P < 0.05, **P < 0.01, ***P < 0.001 and ****P < 0.0001 (one-way ANOVA) Comparatively, patients with T2D and hypertension had significantly higher creatinine and FGF23 values as compared to those patients with CKD but without said comorbidities. Genotyping and quality controlled was performed in the 349 study participants of Saudi descent. All eight variants in the four tested genes were observed in appreciable frequencies in Saudis (see Table 2), along with those observed in CKDGen reference populations. Among the eight SNPs genotyped, rs4821480, a SNP mapping to the MYH9-APOL1 locus was most strongly associated with presence of CKD stage 3 or worse (OR = 1.69, 95% CI 1.22-2.36, P = 0.002). Linkage disequilibrium (LD) was observed between genotyped variants located in the same genes (r > 0.9). The direction of effects of the lead SNP from each gene were the same as those reported from the largest published cohort of European descent including 133,780 participants.
A multivariate model was built using clinical demographic information including age and gender, which demonstrated notably correlation with CKD disease status (AUC =0.80). To further strengthen regression analysis, biochemical analyte values for FGF23 and Vitamin D3, and MYH9 SNPs, rs4821480 and rs4821481 were tested in their capacity to enhance regression score. While FGF23 plus vitamin D levels improved association values, the combinatory enhancement of both biochemical analytes and MYH9 rs4821480 and rs4821481 genotypes generated the greatest predicative capacity and served as the best statistical model (AUC = 0.93, 95% CI 0.90-0.95, P < 0.0001; Table 3). Using all ten of the above described biochemical analytes across patient populations stratified for end stage renal disease (ESRD), type 2 diabetes (T2D) and hypertension (HTN) a number of significant findings were observed (Additional file 1: Table S1). Similarly, when stratifying patient population by CKD, T2D and hypertension status significant associations were found (Additional file 1: Table S2).

Discussion
We tested the prevalence and CKD risk associated with alleles identified in European populations in Saudi patients. All eight variants were present in Saudi populations at appreciable frequencies, and one variant in MYH9 was statistically significantly associated with CKD in our Saudi study population. While the remainder of alleles were not associated at statistically significant levels, our study is limited in power due to our relatively modest sample size. We did however demonstrate direction of effects that were concordant with previous reports from the larger CKDGen consortium. Genetic variants in MYH9 are associated with non-diabetic chronic kidney disease, FSGS and HIV nephropathy, however whether the association is due to a functional variant in MYH9 or due to LD with nearby Apolipoprotein L1 (APOL1) remains a point of controversy [23]. Functional variants in APOL1 are found on two different G1 and G2 haplotypes that are common in African populations (10-25%), but not observed to date in other populations [24,25]. The function of the G1 and G2 APOL1 haplotypes have been recapitulated in a recently produced transgenic mouse strain [26]. It remains possible that both MYH9 and APOL1 are important in CKD progression, and multiethnic populations and model organisms will play key roles in untangling their relationships with CKD.
GWAS have predominantly been performed in participants of European descent, and discovery and replication cohorts in populations from around the world are needed to further enhance our understanding of complex genetic traits. The generally held belief is that a functional risk allele should exhibit the same direction of effect in all populations. However, the genetic variant tested is often not the responsible functional variant but Odds ratio (OR) is used to quantify the presence of analyzed genes (column 1) with each population (Saudi, CKDGen). Beta is the standardized regression coefficient from CKDGen simply associated with, termed in LD with, the truly responsible variant. As such, ethnic-and population-based differences in regional linkage disequilibrium can create associations with opposite directions of effect, especially if the genetic distance between the marker and the responsible variant is great. Moreover, as the effect is often small, genetic drift and random chance allows the minor allele frequency to vary greatly between populations, resulting in significant differences in the power to detect associations. Novel, or de novo, mutations may result in private mutations that are only found in specific populations. Disease prevalence can vary between populations, as observed with pediatric nephrotic syndromes in Saudis, which is likely a result of the prevalence of underlying genetic risk factors. Finally, differences in environment and genetic background produce the possibility of gene-environment and gene-gene interactions. Over 50 loci have now been implicated in CKD progression [13] and a larger sample will be required to obtain adequate power to confidently test in Saudi populations, and overcome the multiple testing required as the number of known CKD loci grows. Large collaborative efforts such as the Saudi 100,000 genomes project [27] will facilitate larger aggregations of CKD risk loci in larger number of individuals which will greatly improve CKD risk stratification of individuals. When we combined variants in our most associated CKD locus with known CKD biochemical markers we observed a very significant AUC value (0.93). As better Saudi specific CKD genetic risk loci become available, along with more powerful CKD disease progression biomarkers, we may be able to prioritize which individuals may be suitable for interventions such as the renoprotective agents, Nisoldipine and Lisinopril. Indeed, specific targeting of specific modifiable risk factors such as hypertension and T2D, may ameliorate the majority of the CKD genetic risk loci in some subsets of individuals.

Conclusions
In conclusion, we observed a significant association between variants in MYH9 and CKD in Saudi Arabia, and use of multivariable logistic regression analysis showed a strong statistically significant improvement in predicting CKD using FGF23, Vitamin D and MYH9 variants. Our power to replicate previously reported associations in three other loci was limited, but we identified appreciable allele frequencies and identical direction of effect to previous reports. Further evaluation of CKD risk alleles in Saudi populations has the potential to increase our understanding of the pathogenic basis of CKD.

Additional file
Additional file 1: Table S1. One-way ANOVA comparing the various Biochemical parameters in patients stratified for ESRD, T2D and HTN. Availability of data and materials Data sharing is not applicable to this article as no datasets were generated or analysed. However, the data used and/or analysed during the current study are available from the corresponding author on reasonable request.
Authors' contributions CC, CV, and SC designed the study, performed the assay, and drafted the manuscript. SAM and FAM provided the CKD patient samples; HQ and AAR provided the age and sex matched controls. RA and KA collected all medical data of the individual participant from the hospital records. BK, YRL and MBL were involved in drafting the manuscript for important intellectual content. AAA and FAM provided critical review of the manuscript. BM drafted the requested major revisions to the manuscript including data quality control, English corrections, and reformatting of figures/table legends and charts. All authors read and approved the final manuscript.
Ethics approval and consent to participate This study was approved by the Ethical Committee of the University of Dammam (IRB # 2014-08-046) in accordance with the 1964 Helsinki Declaration and its later amendments. Written informed consent, in native Arabic, was duly signed from each participant.

Competing interests
The authors declare that they have no competing interests.