Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Identification of coding variant associations for complex diseases offers a direct route to biological insight, but is dependent on appropriate inference concerning the causal impact of those variants on disease risk. We aggregated coding variant data for 81,412 type 2 diabetes (T2D) cases and 370,832 controls of diverse ancestry, identifying 40 distinct coding variant association signals (at 38 loci) reaching significance (p<2.2×10−7). Of these, 16 represent novel associations mapping outside known genome-wide association study (GWAS) signals. We make two important observations. First, despite a threefold increase in sample size over previous efforts, only five of the 40 signals are driven by variants with minor allele frequency <5%, and we find no evidence for low-frequency variants with allelic odds ratio >1.29. Second, we used GWAS data from 50,160 T2D cases and 465,272 controls of European ancestry to fine-map these associated coding variants in their regional context, with and without additional weighting to account for the global enrichment of complex trait association signals in coding exons. At the 37 signals for which we attempted fine-mapping, we demonstrate convincing support (posterior probability >80% under the “annotation-weighted” model) that coding variants are causal for the association at 16 (including novel signals involvingPOC5p.His36Arg,ANKHp.Arg187Gln,WSCD2p.Thr113Ile,PLCB3p.Ser778Leu, andPNPLA3p.Ile148Met). However, at 13 of the 37 loci, the associated coding variants represent “false leads” and naïve analysis could have led to an erroneous inference regarding the effector transcript mediating the signal. Accurate identification of validated targets is dependent on correct specification of the contribution of coding and non-coding mediated mechanisms at associated loci.

Original publication

DOI

10.1101/144410

Type

Journal article

Publication Date

31/05/2017