23andMe's coverage of rare prion diseases
Sonia and I got back our 23andMe results a few weeks ago. For those who don’t know, 23andMe offers direct-to-consumer genetic testing. For $99 you can spit into a tube, mail it to them and they’ll tell you your genotype at about 900,000 locations across your genome. They don’t just give you the data, they help interpret it for you – information about your own health, about whether you carry a recessive disease that could affect your kids, your ancestry, and more. In my next post I review our experience with it and tell you why it’s a preposterously good deal and good idea and everyone should do it.
But first, this post is to discuss the technical matters of 23andMe’s coverage of the rare genetic prion diseases. In browsing our raw data, we found that they have an admirably broad coverage of the polymorphisms and prion disease mutations in PRNP. Here’s a list of the exonic SNPs they genotype:
mutation or polymorphism | disease association | dbSNP id | 23andMe id (if different) |
---|---|---|---|
S97N | suspected cause of disease [Zheng 2008] | rs56362942 | - |
P102L | GSS | rs74315401 | i5004359 |
P105L | GSS | rs11538758 | - |
A117V | GSS | rs74315402 | i5004358 |
G127V | resistance to kuru | not in dbSNP | i5004345 |
129M/V | it's complicated* | rs1799990 | - |
G131V | GSS | rs74315410 | i5004351 |
A133V | GSS | rs74315415 | i5004347 |
N171S | it's complicated** | rs16990018 | - |
D178N | FFI or CJD | rs74315403 | i5004357 |
V180I | CJD | rs74315408 | i5004353 |
T183A | suspected cause of disease | rs74315411 | - |
F198V | suspected cause of disease | rs55871421 | - |
F198S | GSS | rs74315405 | i5004356 |
E200K | CJD | rs28933385 | - |
R208C | CJD | rs55826236 | - |
R208H | CJD | rs74315412 | i5004349 |
V210I | CJD | rs74315407 | i5004354 |
G217R | GSS [Hsiao 1992] | rs74315406 | i5004355 |
E219K | resistance to sporadic CJD [Shibuya 1998] | rs1800014 | i3002436 |
M232R | it's complicated*** | rs74315409 | i5004352 |
*Codon 129 does not cause any prion diseases, but heterozygosity is associated with reduced risk for sporadic and variant CJD and later onset or longer disease course for some genetic prion diseases [reviewed in Lukic & Mead 2011]. Codon 129 also largely determines the phenotype of the D178N mutation – FFI if in cis with 129M and CJD if in cis with 129V [Goldfarb 1992] – though there is in fact a phenotypic spectrum between these two [Zarranz 2005].
**There is debate over whether N171S is a disease mutation or neutral, and it’s also been suggested to increase schizophrenia risk [Tsai 2001, Appleby 2010, Beck 2010, Streibel 2011]
***M232R has been suggested to increase sporadic CJD risk but it might be neutral [Beck 2012 (ft)].
N.B. The disease associations without citations above are reviewed in PRNP’s OMIM entry.
Now that’s an awful lot of coverage for PRNP, especially considering how rare most of these mutations are. It is super awesome that 23andMe decided to include all of these on their chip. In the short run, though, don’t expect 23andMe to necessarily tell you whether you carry a genetic prion disease. In our case, we know from Sanger sequencing conducted by the National Prion Disease Pathology Surveillance Center at Case Western that Sonia carries the D178N mutation, i.e. her rs74315403 genotype is AG. Yet that SNP came up ‘no call’ in her 23andMe data:
Is it a pure coincidence that this one most important SNP in all of her genome, the SNP that has driven us to become scientists, was one of the few that 23andMe could not call confidently?
Probably not. I did some Googling and asking around and learned that ‘no call’ on SNP chips is especially common for very rare SNPs [Ritchie 2011]. The issue is that those two dots of light on the SNP chip that represent the A allele and the G allele are not equally intense for all SNPs in the genome. It varies according to how well the probes hybridize and how many other sites in the genome have some homology with the area in question. If other parts of the genome look a lot like the neighborhood around rs74315403, they’ll hybridize to those probes too, and might favor one allele over the other, confounding the readout. Therefore algorithms need to see several people of each genotype in order to learn how to turn those visual light intensities into a genotype call. For literally 1-in-a-million SNPs like the fatal familial insomnia mutation, it is eminently possible that 23andMe never saw anyone else with it before Sonia, and so their algorithm hasn’t yet learned whether her greater light intensity for the ‘A’ allele probe actually means she has the mutation.
And after browsing more of the raw data, I see more reasons why SNP chip data need to be interpreted with care. Sonia also came up ‘no call’ for rs74315402, the A117V GSS mutation, which we know (again, from Sanger sequencing) that she does not have. So (though this is probably obvious) you can’t just assume every ‘no call’ means something important is going on here. In fact, on further examination, I see why that SNP came up ‘no call’. Sonia has a synonymous polymorphism (rs8124214 A117A) right next to the A117V SNP, which would interfere with her DNA hybridizing to the A117V probe and complicate the readout. This A117A SNP is not disease-causing at all, yet if we went around trying to read too much into ‘no calls’ without any Sanger validation, we might have freaked out when we saw her results.
All this just calls for a bit of caution, and I think 23andMe tries to send that message loud and clear as well. False positives and false negatives do happen, and their web interface is accordingly littered with messages that you should talk to a health care professional about this and that. I now see that those aren’t just obligatory CYA disclaimers. If you’re wondering if you have a PRNP mutation, 23andMe might be able to tell you and might not, but before you change your life in response to it, it’s probably best to get it validated by the National Prion Disease Pathology Surveillance Center.
23andMe’s ability to call these rare SNPs will only improve with time as they add more people to their database. And by the way, they not only add annotations over time as new genotype-phenotype associations are discovered (all of which is included in the up-front price you pay), they also change your raw data over time as they get better at converting those dots of light on the SNP chip into called genotypes.