TY - JOUR
T1 - Global Biobank analyses provide lessons for developing polygenic risk scores across diverse cohorts
AU - BBJ
AU - BioMe
AU - BioVU
AU - Canadian Partnership for Tomorrow's Health/OHS
AU - China Kadoorie Biobank Collaborative Group
AU - Colorado Center for Personalized Medicine
AU - deCODE Genetics
AU - ESTBB
AU - FinnGen
AU - Generation Scotland
AU - Genes & Health
AU - LifeLines
AU - Mass General Brigham Biobank
AU - Michigan Genomics Initiative
AU - QIMR Berghofer Biobank
AU - Taiwan Biobank
AU - The HUNT Study
AU - UCLA ATLAS Community Health Initiative
AU - UKBB
AU - Wang, Ying
AU - Namba, Shinichi
AU - Lopera, Esteban
AU - Kerminen, Sini
AU - Tsuo, Kristin
AU - Läll, Kristi
AU - Kanai, Masahiro
AU - Zhou, Wei
AU - Wu, Kuan Han
AU - Favé, Marie Julie
AU - Bhatta, Laxmi
AU - Awadalla, Philip
AU - Brumpton, Ben
AU - Deelen, Patrick
AU - Hveem, Kristian
AU - Lo Faro, Valeria
AU - Mägi, Reedik
AU - Murakami, Yoshinori
AU - Sanna, Serena
AU - Smoller, Jordan W.
AU - Uzunovic, Jasmina
AU - Wolford, Brooke N.
AU - Wu, Kuan Han H.
AU - Rasheed, Humaira
AU - Hirbo, Jibril B.
AU - Bhattacharya, Arjun
AU - Zhao, Huiling
AU - Surakka, Ida
AU - Lopera-Maya, Esteban A.
AU - Chapman, Sinéad B.
AU - Karjalainen, Juha
AU - Kurki, Mitja
AU - Mutaamba, Maasha
AU - Partanen, Juulia J.
AU - Chavan, Sameer
AU - Chen, Tzu Ting
AU - Daya, Michelle
AU - Ding, Yi
AU - Feng, Yen Chen A.
AU - Gignoux, Christopher R.
AU - Graham, Sarah E.
AU - Hornsby, Whitney E.
AU - Ingold, Nathan
AU - Johnson, Ruth
AU - Laisk, Triin
AU - Lin, Kuang
AU - Lv, Jun
AU - Millwood, Iona Y.
AU - Palta, Priit
AU - Pandit, Anita
N1 - Publisher Copyright:
© 2022 The Author(s)
PY - 2023/1/11
Y1 - 2023/1/11
N2 - Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.
AB - Polygenic risk scores (PRSs) have been widely explored in precision medicine. However, few studies have thoroughly investigated their best practices in global populations across different diseases. We here utilized data from Global Biobank Meta-analysis Initiative (GBMI) to explore methodological considerations and PRS performance in 9 different biobanks for 14 disease endpoints. Specifically, we constructed PRSs using pruning and thresholding (P + T) and PRS-continuous shrinkage (CS). For both methods, using a European-based linkage disequilibrium (LD) reference panel resulted in comparable or higher prediction accuracy compared with several other non-European-based panels. PRS-CS overall outperformed the classic P + T method, especially for endpoints with higher SNP-based heritability. Notably, prediction accuracy is heterogeneous across endpoints, biobanks, and ancestries, especially for asthma, which has known variation in disease prevalence across populations. Overall, we provide lessons for PRS construction, evaluation, and interpretation using GBMI resources and highlight the importance of best practices for PRS in the biobank-scale genomics era.
KW - accuracy heterogeneity
KW - Global-Biobank Meta-analysis Initiative
KW - multi-ancestry genetic prediction
KW - polygenic risk scores
UR - http://www.scopus.com/inward/record.url?scp=85147104921&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147104921&partnerID=8YFLogxK
U2 - 10.1016/j.xgen.2022.100241
DO - 10.1016/j.xgen.2022.100241
M3 - Article
C2 - 36777179
AN - SCOPUS:85147104921
SN - 2666-979X
VL - 3
JO - Cell Genomics
JF - Cell Genomics
IS - 1
M1 - 100241
ER -