Superintelligence Risk Project

July 3rd, 2017
airisk, ea
I've decided to make a larger project out of looking into AI risk. I don't think we really know why there's such a disconnect between people who think we should strongly prioritize it and the mainstream ML perspective that it's not useful to work on (at least not currently). I've applied for an EA grant, and am thinking I'll spend about a month on this.

Here's where I currently am:

  • I've read nearly all of the reading people suggested if you count by words, or about two thirds of it if you count by individual pieces. The gap is mostly because Superintelligence is very long.

  • I've had conversations with one person in each camp, have a few more scheduled, and am working on lining up more.

Here are some very preliminary thoughts on where I think the disagreement might be:

  • How likely is it that current approaches are all we need for AGI, with relatively straightforward extensions and a lot of scaling?

  • How valuable is it to work on solving problems that are probably not the right ones? For example, even if we think AGI will not look like current systems, might trying to solve the control problem for current systems teach us enough about the underlying problem, and about how to do this kind of work, that we'll be in a better position once we have a clearer picture of what AGI will actually look like?

  • How useful is it to have a strong theoretical foundation, vs just understanding the technology enough from an engineering perspective that we can make it do things for us?

  • How similar is this to normal engineering? Companies want their AI systems to do what they intend them to do; how much should we expect that desire alone to work out?

  • As we get closer to AGI, how likely is the ML community to take superintelligence risk seriously? Is it just that they don't think it can be productively worked on now, or do they think it will never be a real problem?
