Summary
The All of Us Research Program released genomic data from 245,388 clinical-grade genome sequences with exceptional demographic diversity (77% from historically under-represented communities, 46% from under-represented racial and ethnic minorities). The dataset identified over 1 billion genetic variants, including 275 million novel variants, and demonstrated high replication rates for disease associations across ancestry groups. This resource aims to advance genomic medicine equity by providing publicly accessible summary-level data and researcher access through a streamlined approval pathway.
UK applicability
Whilst this cohort is United States-based, the findings on genetic variant diversity and disease replication across ancestry groups have potential applicability to UK precision medicine initiatives and NHS genomics programmes. However, UK researchers would need to assess whether findings replicate in UK populations and consider ancestry-specific differences in allele frequencies.
Key measures
Number of genome sequences (245,388); genetic variants identified (>1 billion); previously unreported variants (>275 million); coding variants (>3.9 million); genetic variants associated with disease (3,724); diseases evaluated (117); replication rates across European and African ancestry participants; time to data access (median 29 hours)
Outcomes reported
The study released clinical-grade genome sequences from 245,388 participants and identified over 1 billion genetic variants, including 275 million previously unreported variants. Using linkage with electronic health records, the researchers evaluated 3,724 genetic variants associated with 117 diseases, achieving high replication rates across ancestry groups.
Topic tags
Dig deeper with Pulse AI.
Pulse AI has read the whole catalogue. Ask about this record, its theme, or how the findings apply to UK farming and policy — every answer cites the underlying studies.