Comprehensive functional annotation of susceptibility variants identifies genetic heterogeneity between lung adenocarcinoma and squamous cell carcinoma
Qin, Na; Li, Yuancheng; Wang, Cheng; Zhu, Meng; Dai, Juncheng; Hong, Tongtong; Albanes, Demetrius; Lam, Stephen; Tardón, Adonina; Chen, Chu; Goodman, Gary; Bojesen, Stig E.; Landi, Maria Teresa; Johansson, Mattias; Risch, Angela; Wichmann, H-Erich; Bickeboller, Heike; Rennert, Gadi; Arnold, Susanne; Brennan, Paul; Field, John K.; Shete, Sanjay; Le Marchand, Loic; Melander, Olle; Brunnstrom, Hans; Liu, Geoffrey; Hung, Rayjean J.; Andrew, Angeline; Kiemeney, Lambertus A.; Zienolddiny, Shanbeh; Grankvist, Kjell; Johansson, Mikael; Caporaso, Neil; Woll, Penella; Lazarus, Philip; Schabath, Matthew B.; Aldrich, Melinda C.; Stevens, Victoria L.; Jin, Guangfu; Christiani, David C.; Hu, Zhibin; Amos, Christopher I.; Ma, Hongxia; Shen, Hongbing
Peer reviewed, Journal article
Published version
Permanent link
https://hdl.handle.net/11250/3147415
Publication date
2020
Original version
10.1007/s11684-020-0779-4
Abstract
Although genome-wide association studies have identified more than eighty genetic variants associated with non-small cell lung cancer (NSCLC) risk, the biological mechanisms of these variants remain largely unknown. By integrating large-scale genotype data from 15 581 lung adenocarcinoma (AD) cases, 8350 squamous cell carcinoma (SqCC) cases, and 27 355 controls with multiple transcriptome and epigenomic databases, we conducted histology-specific meta-analyses and functional annotations of both reported and novel susceptibility variants. We identified 3064 credible risk variants for NSCLC, which were overrepresented in enhancer-like and promoter-like histone modification peaks as well as DNase I hypersensitive sites. Transcription factor enrichment analysis revealed that USF1 was AD-specific while CREB1 was SqCC-specific. Functional annotation and gene-based analysis implicated 894 target genes, including 274 specific to AD and 123 specific to SqCC, which were overrepresented in somatic driver genes (ER = 1.95, P = 0.005). Pathway enrichment analysis and Gene-Set Enrichment Analysis revealed that AD genes were primarily involved in immune-related pathways, while SqCC genes were related to homologous recombination deficiency. Our results illustrate the molecular basis of both well-studied and new susceptibility loci of NSCLC, providing not only novel insights into the genetic heterogeneity between AD and SqCC but also a set of plausible gene targets for post-GWAS functional experiments.