| 其他摘要 |
阿尔茨海默病(Alzheimer's disease,AD),俗称老年痴呆症,是最常见的一种老年期原发性神经退行性疾病,临床上以记忆障碍、执行功能障碍以及人格和行为改变等全面性痴呆表现为特征。65岁以前发病者,称早老性痴呆;65岁以后发病者称散发性痴呆。AD的分子病理学特征包括神经元胞外β淀粉样蛋白 (β-amyloid, Aβ)异常聚集形成老年斑 (neuritic plaques),神经元胞内高度磷酸化的微管相关蛋白tau(microtubule-associated protein tau, MAPT)聚集形成神经纤维缠结 (neurofibrillary tangles, NFTs),小胶质细胞激活,突触与神经元损伤,颞顶叶和扣带回等脑区的萎缩。AD是一组异质性疾病,其病因学还不是很清楚,其发生发展受到遗传因素和环境因素共同影响,已报道AD的遗传力为0.58~0.79。对AD的遗传研究开始于上世纪八九十年代,通过连锁分析把早发性家族型AD的致病基因定位到APP(β淀粉样蛋白前体)基因。随着关联分析、全基因组关联分析(Genome Wide Association Study,GWAS)、全基因组测序等技术的发展,一大批遗传因子与AD的统计相关性被揭示。然而,已发现的AD相关遗传因子,大部分是功能未知的非编码区变异。从已揭示的AD统计学相关的遗传因子中,鉴定出真正的疾病相关的功能基因与位点,一直是AD遗传学研究中亟待解决的问题。目前,由于前期积累和大数据时代的来临,整合各层面数据,通过系统的分析,从遗传关联(DNA变异)、表观遗传调控(DNA甲基化)、表达调控(DNA调控元件、mRNA表达)、作用通路(蛋白水平、pathway水平)、生理功能(生化、细胞水平)等多层面,把DNA、蛋白、细胞、个体疾病联系起来,共同指向真正疾病相关的功能基因与位点这一策略,得以实现。本论文以整合分析策略,对AD相关遗传数据、表观遗传数据、表达数据、生物标记物、脑影像等各层面数据进行整合,发现了若干潜在的AD相关功能基因。我们对AD的GWAS研究提示潜在相关的位点进行了系统的功能注释,找到一些新的潜在的编码区致病性错义突变和具有调控功能的非编码变异。通过整合GWAS信号和表达数量性状(Expression Quantitative Trait Loci,eQTL)信号,我们筛选到10余个潜在的功能基因,如MADD、SPATA7、STOX2、GHDC、DYNC1H1、EDC4、ZNF20、APBB3等。这些基因的表达水平显著的受到AD风险相关位点的调控,其表达水平在AD病理相关的海马、颞叶等组织中极显著的发生改变。我们的分析提示,AD易感SNP位点通过调节这些基因的表达水平改变,从而促进AD的发生发展。基因的表达水平改变除了受遗传变异影响外,还受表观遗传修饰如DNA甲基化、组蛋白乙酰化等调控。最近,两个大规模的AD脑组织全基因组甲基化研究给出了AD病灶老年斑含量和神经纤维缠结程度相关的甲基化位点的全貌。然而,仅仅甲基化水平的证据,限制了对AD病理相关基因的进一步解读。我们通过整合两个甲基化研究结果与AD GWAS研究、表达数据、神经影像学以及AD相关亚表型数据,发现一个核糖体亚基基因RPL13同时受上游甲基化和3’UTR区SNP调控,导致其表达水平在AD脑组织中发生改变,从而促进了AD的发生发展。我们结果提示,RPL13可能是一个重要的新的AD相关功能基因。Aβ过度产生激发的病灶级联免疫反应被认为是非常重要的病理过程。目前,已确认的AD遗传易感相关的基因或遗传变异中,免疫相关基因(CD33、CLU、CR1、TREM2等),尤其是补体系统基因,如CR1(补体受体1)和CLU(凝聚素,补体抑制因子),已被确定为欧洲人群最重要的AD易感基因之一。但是,这些AD易感相关的重要免疫基因是否也是中国人群的易感基因,是否有中国人群特异的重要免疫基因,它们是怎么在AD病理过程中发挥作用,在中国人群中还没有得到系统的研究。本研究中,我们首先在中国AD病例对照群体中扫描了5个免疫基因或区域,包括欧洲人群中鉴定的重要的AD易感免疫基因区域(CR1、CR2、CLU、CD33、TREML2)以及补体因子H(CFH)基因,然后对中国人群中影响AD遗传易感的免疫基因进行独立群体的遗传关联验证,对于得到证实的基因与位点,在基因表达、脑影像、AD亚表型等层面对其进行多阶段系统研究。我们的结果显示,欧洲人群中的重要AD易感免疫基因在中国人群中作用非常微弱,而CFH基因则在中国群体中充当重要的AD易感免疫基因。我们的研究发现,CFH遗传变异对AD的遗传易感贡献可通过多方面实现:通过基因表达水平和活性改变,作用于脑结构和功能改变,并影响疾病进展中的免疫反应,从而促进AD的发生发展。最重要的是,CFH风险等位基因对AD相关的病理改变在成年早期即已发生,提示其可作为疾病早期诊断的标记物。但是,这些结果需要基于人群对神经影像学和生物标志物指标进行纵向跟踪研究,以及动物实验的进一步证实。综上所述,本论文以整合分析策略,对AD相关遗传数据、表观遗传数据、表达数据、生物标记物、脑影像等各层面数据进行整合,发现MADD、RPL13、CFH等可能是潜在的AD相关功能基因。这些基因都受功能性遗传变异调控,在AD相关组织中发生表达改变,与AD相关的亚表型和遗传易感风险显著相关。但是,其遗传效应大小,还有待更多来自大样本量的独立人群的验证;其具体作用机制,也有待进一步功能实验的确认。本研究结果证明了整合分析的有效性和必要性。目前AD的相关大规模数据都还不够充分,也使得本研究不够深入全面。在可以预见的将来,随着AD相关大数据的积累,将开启利用整合分析策略研究AD疾病机理的新时代。; Alzheimer's disease (AD) is the most common neurodegenerative disease in the elderly, characterized by memory impairment, executive dysfunction, as well as personality and behavioral changes. Molecular pathological features of AD include extracellular β-amyloid protein (Aβ) accumulation (neuritic plaques), neurofibrillary tangles within neurons, microglia activation, synapses and neuronal damage, atrophy of temporal lobe and other brain regions.AD is a complex heterogeneous disease with unclear etiology. Its occurrence and development may be determined by genetic and environmental factors. It’s been reported that heritability of AD is 0.58 to 0.79. Genetic research on AD began with the identification of the causative gene APP (β-amyloid precursor protein) for the early-onset familial AD through linkage analysis. Subsequent genetic association analyses of candidate genes, genome-wide association study (GWAS), whole-genome sequencing identified a large number of genetic factors related to AD susceptibility. However, most of the known AD-related loci have been found to be non-coding variants with unknown function. The remaining question is to identify the functional causal variants. Integrated analysis combining data from genetic association study (DNA variation), epigenetic study (DNA methylation), expression and regulation analysis (DNA regulatory element, mRNA expression ), pathway (protein level), physiological functions (biochemical, cellular level), and other individual – level data together, by virtue of strategy of systems biology, is a good approach to pinpoint the true disease-related functional genes and loci. In this thesis, we performed integrated analyses with genetic association data, epigenetic data, expression profiling, biomarkers, brain imaging and other funcational evidence, with an intention to learn more about the genetic baisis of AD.With a full annotation of the AD GWAS summary statictics from 17008 cases and 37154 controls, we suggested several new potential damaging missense mutations and non-coding regulatory variations. Through integrating of GWAS signals and expression quantitative trait (eQTL) signals using the sherlock method, we identified more than 10 functional genes, e.g. MADD, SPATA7, STOX2, GHDC, DYNC1H1, EDC4, ZNF20, APBB3. Expression level of these genes were regulated by AD-related functional variants, and their expression levels significantly changes in AD pathology related tissues such as hippocampus and temporal cortex. The differentially expressed genes were well validated in independent datasets and presented a co-expression network. Our analysis suggested that AD susceptibility loci may contribute to the development of AD by regulating gene expression.In addition to the influence of genetic variations, gene expression level is also modified by epigenetic variations such as DNA methylation, histone acetylation, etc. AD is a complex neurodegenerative disorder with epigenetic modifications largely unknown. Recently, two large-scale genome-wide DNA methylation profiling studies of AD brain tissues offered the pictures of AD endophenotype (senile plaques and neurofibrillary tangles content) related DNA methylation pattern. However, evidence at only the methylation level limites further interpretation of the AD pathology related genes. Through the integration of the two methylation profiling and AD GWAS findings, with expression, neuroimaging, and AD-related phenotypic data, we found a functional gene, the ribosomal subunit RPL13 gene, which showed an association with AD genetic susceptibility at the gene-based level (P = 0.029). The top AD-associated RPL13 SNPs were eQTLs for the RPL13 gene. In addition, we found that RPL13 top SNPs/eQTLs rs3751691 and rs12709089 were significantly associated with cerebrospinal fluid (CSF) tau and Aβ levels and mini-mental state examination (MMSE) scores. mRNA expression change was observed in AD brain tissues relative to control samples. These methylomic, genetic, expression and eQTL data suggest an essential role of RLP13 expression changes in AD pathology, as shown to be regulated by both methylation and SNPs. These findings indicated that RPL13, usually considered to be a house-keeping gene, may be an important new AD-related functional gene that plays a complex and essential role in brain aging, AD pathology and cognitive decline.Aβ induced immune response is considered to be an essential pathological process in AD development. At the present, most of the confirmed AD susceptibility genes or genetic variations were immune-related genes, eg. CD33, CLU, CR1, TREM2, etc. Complement system genes such as CR1 and CLU were in the top list of AD risk gene in European populations. However, it is unknown whether these important immune-related AD susceptibility genes were involved in Chinese. In this study, we firstly screened five immune genes or regions that were identified in Europeans in Chinese subjects, followed by subsequent independent verification, gene expression analysis, brain imaging, and subphenotype analyses. Our results showed that AD-related top immune genes identified in European populations have weak effects in Chinese population, whilst the complement gene, the CFH gene, has a strong effect in Chinese population. In particular, rs1061170 (Pmeta =5.0x10-4) and rs800292 (Pmeta = 1.3x10-5) showed robust associations with AD, which were confirmed in multiple world-wide sample sets (4,317 cases and 16,795 controls). SNP rs1061170 (P = 2.5x10-3) and rs800292 (P = 4.7x10-4) risk-allele carriers have an increased entorhinal thickness in their young age and a higher atrophy rate as the disease progress. rs800292 risk-allele carriers have higher CSF tau and Aβ levels and severe cognitive decline. CFH expression level, which was affected by the risk-alleles, was increased in AD brains and cellular experiments. These comprehensive analyses suggested that CFH is an important immune factor in AD and CFH genetic variations contribute to AD susceptibility through 1) changing the gene expression levels and activitie, 2) affecting the structure and function of the brain, and 3) affecting the immune response in disease progression. Most importantly, CFH risk allele carriers have AD-related pathological changes in early adulthood, suggesting a potential role as an early diagnostic marker for the disease. Population-based longitudinal follow-up study and animal experiments are needed to further confirm the results.Taken together, we found that MADD, RPL13, and CFH may be the potentially functional genes related to AD through a integrative strategy of genetic, epigenetic, expression, biomarkers, and brain imaging data. However, more independent validation and further functional experiments are needed to confirm the effect size and to characterize the mechanism. The results of this study demonstrate the effectiveness and necessity of integrative analysis. In the future, with the accumulation of AD-related big data, a new era of AD mechanism research will come. |
修改评论