Phased whole genome genetic risk in a family quartet using a major allele reference sequence.

Posted by on September 5, 2011 in Uncategorized

Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (< 1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.

http://www.ncbi.nlm.nih.gov/pubmed/21935354

Dewey FE, Chen R, Cordero SP, Ormond KE, Caleshu C, Karczewski KJ, Carrillo MW, Wheeler MT, Dudley JT, Bynes JK, Cornejo OE, Knowles JW, Woon M, Sangkuhl K, Gong L, Thorn CF, Hebert JM, Capriotti E, David SP, Pavlovic A, West A, West JS, Thakuria JV, Ball MP, Zaranek AW, Rehm HL, Church GM, Bustamante CD, Snyder MP, Altman RB, Klein TE, Butte AJ, Ashley EA. Phased whole genome genetic risk in a family quartet using a major allele reference sequence. PLoS Genet. 2011 Sep;7(9):e1002280.