• Posts
  • RSS
  • ◂◂RSS
  • Contact

  • Interpreting genetic testing

    December 15th, 2018
    genetics  [html]
    Several years ago I participated in a study where my DNA was sequenced, and while I ended up not getting the sequence data [1] I did get a file of 23andme-style SNP variant calls. I loaded it into Promethease, and excluded mutations with magnitude below 2 ("looks interesting enough to be worth reading"). I saw 139 mutations marked as "bad," 41 as "good," and 26 as "not set."

    Initially I interpreted this to mean that I should be more pessimistic about my health than I was before getting the report, since more of the mutations are bad (2x risk of something) than good (0.5x risk of something else). To figure out how your beliefs should change, though, you need to know how many bad vs good mutations people typically have. For example, if someone might normally have 200 bad mutations and 10 good ones then my report is good news, but if instead normal is 100 bad mutations and 70 good ones then my report is bad news.

    In general, I would expect most people to have more negative mutations than positive ones, simply because most mutations with an effect are negative. Randomly changing something is much more likely to break things than make them better.

    This also applies when determining total risk of something. For example, lets say I have SNPs that individually give me 3x, 1.5x, 2x, and 0.5x risk for heart disease. I could naively multiply them together, ignoring that they don't stack perfectly, [2] and conclude that I had 4.5x the risk of the general population. But most people will probably have some mutations that increase their risk of heart disease. I think the proper way to handle this is for each case where you have the normal value of a variant you count that as slightly improving your risk, and when you consider all of these tiny improvements you get back to the average person having the average risk. Alternatively, and probably more accurately, you could just naively compute each person's risk, and then normalize.

    Is this just a Promethease problem? Do other places that give health reports handle this better? Or do places just avoid giving consumer health information because this is both really hard to do well and highly regulated?

    (It's also definitely possible I'm misinterpreting Promethease, or not thinking well about how the stats work here.)


    [1] This was really frustrating. They confirmed receipt of my sample in September 2012, and in March 2015 they said they had the full 26GB sequence data available for me to transfer. Unfortunately they only ever uploaded the first few 200MB chunks, and then stopped responding to my emails in mid-April. I wrote to them a few more times, and eventually gave up about a year later.

    [2] This caveat about stacking is pretty serious, though. Imagine mutations A and B both give you a 3x risk of some condition. If they act completely independently then a "stacked" 9x risk from having both A and B is reasonable. But if instead A and B act exactly the same way, breaking something that has multiple ways to be rendered fully inoperable, then having them both is no worse than having just one. I don't know which end of this is closer to how things usually work.

    Comment via: facebook, lesswrong

    Recent posts on blogs I like:

    More on the Deutschlandtakt

    The Deutschlandtakt plans are out now. They cover investment through 2040, but even beforehand, there’s a plan for something like a national integrated timetable by 2030, with trains connecting the major cities every 30 minutes rather than hourly. But the…

    via Pedestrian Observations July 1, 2020

    How do cars fare in crash tests they're not specifically optimized for?

    Any time you have a benchmark that gets taken seriously, some people will start gaming the benchmark. Some famous examples in computing are the CPU benchmark specfp and video game benchmarks. With specfp, Sun managed to increase its score on 179.art (a su…

    via Posts on Dan Luu June 30, 2020

    Quick note on the name of this blog

    When I was 21 a friend introduced me to a volume of poems by the 14th-century Persian poet Hafiz, translated by Daniel Ladinsky. I loved them, and eventually named this blog for one of my favorite ones. At some point I read more and found that Ladinsky’s …

    via The whole sky June 21, 2020

    more     (via openring)


  • Posts
  • RSS
  • ◂◂RSS
  • Contact