Curated Data Repository (CDR) version 7 Release Notes

  • Updated

The All of Us Research Program released the Curated Data Repository (CDR) version 7 for Controlled Tier and Registered Tier data.

The CDRv7 for both tiers (Controlled Tier C2022Q4R9 and Registered Tier R2022Q4R9) includes participant data with a cutoff date of July 1, 2022.

Overview of Registered Tier updates

  • Participant number increased to over 413,000 with at least one type of associated Registered Tier data.
  • Physical measurement data from over 337,000 participants.
  • EHR data from over 287,000 participants.
  • Fitbit data from over 15,600 participants. These data now include sleep and device data in addition to activity, step counts, and heart rate.
  • New Year Minute survey data are available for over 33,000 participants and a revised Personal, Family Medical History (PFMH) survey.

Overview of Controlled Tier updates

  • Short read whole genome sequences (srWGS) for over 245,000 participants and microarray data from over 312,000 participants.
  • Long read sequencing variants from over 1,000 participants and short read structural variants from over 11,000 participants.
  • Introduction of the Hail VDS file format for srWGS data along with VCFs, Hail MatrixTables, and PLINK files for smaller callsets.

For detailed information regarding the CDR updates, read the full CDRv7 release notes (Registered Tier and Controlled Tier).

Was this article helpful?

2 out of 4 found this helpful

Have more questions? Submit a request

Comments

0 comments

Article is closed for comments.