On February 8, 2024, the All of Us Researcher Workbench team published an incremental data release for the long-read whole genome sequencing (lrWGS) data in the Controlled Tier of the All of Us Curated Data Repository (CDR) version 7 (v7). The release ensures that the data are consistent with our internal data procedures.
Our team discovered that the original Telomere-to-Telomere (T2T) Hail MatrixTable (MT) had not followed the correct data quality procedures and had an aberrantly high no-call rate. The issue affected only the small variant (single nucleotide polymorphism and insertion-deletion) joint-called Hail MT on the T2T v2.0 reference genome. No other file formats or reference versions were affected.
As a result of the incremental data release, the lrWGS small variant Hail MT on the T2T v2.0 reference genome has been updated. You can access the updated dataset using the same file path and environment variable as before.
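Because the file path and environment variable are unchanged, existing notebooks should pick up the updated MT without modification. If you would like to confirm the no-call rate yourself, the following is a minimal sketch rather than an official procedure. It assumes Hail is available in your environment, that the genotype entry field is named `GT`, and that no-calls are stored as missing genotypes. The variable name `LRWGS_T2T_SMALL_VARIANT_MT_PATH` is hypothetical and used only for illustration; substitute the name listed in the Controlled CDR Directory.

```python
import os

# Hypothetical variable name, used only for illustration. Replace it with
# the environment variable listed in the Controlled CDR Directory.
ENV_VAR = "LRWGS_T2T_SMALL_VARIANT_MT_PATH"


def resolve_mt_path(env_var: str = ENV_VAR) -> str:
    """Return the MatrixTable path stored in the given environment variable."""
    path = os.environ.get(env_var)
    if not path:
        raise KeyError(f"Environment variable {env_var} is not set")
    return path


def no_call_rate(mt_path: str) -> float:
    """Return the fraction of genotype entries that are missing (no-calls)."""
    # Imported lazily so resolve_mt_path can be used without Hail installed.
    import hail as hl

    mt = hl.read_matrix_table(mt_path)
    # Assumption: the genotype entry field is named GT and a no-call is
    # represented as a missing value.
    return mt.aggregate_entries(hl.agg.fraction(hl.is_missing(mt.GT)))
```

In a notebook, `no_call_rate(resolve_mt_path())` would return a single fraction for the whole MT. The aggregation scans every entry, so it may take some time on the full dataset.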
The changes are reflected in the Controlled CDR Directory and in the release notes.
If you have any questions regarding the incremental data release, please contact us at support@researchallofus.org.