|June 26th, 2017|
When considering what to work on, I recently wrote that many people I respect have started to think AI risk is the most valuable place to focus EA efforts. For example, 80000 Hours ranks it first on their "list of global issues", taking into account "scale, neglectedness, and solvability". On the other hand, I have a lot of friends working in machine learning, and none of them think AI risk is worth working on now. This level of disagreement is very strange, and kind of worrying.
I'm planning to spend the next few days getting a better understanding of where this disagreement comes from. I think I'm in a good position to do this: I'm close to both groups, have some technical background as a programmer, and have some time. I see two ways this could go:
If after looking into it more I still think AI risk is not a valuable place to be working, I may be able to convince others of this. Since 80000 Hours and other EAs are currently suggesting a lot of people go into this field, if it turns out we're overvaluing it, then those people could work on other things.
If I change my mind and start thinking AI risk is something we should be working on, I may convince some of my friends in machine learning. It's also likely that something in this direction would be close enough to my skills to be a good career fit and I should consider working on it.
Rough plan: read a bunch of stuff to get background, talk to a lot of people, write things up. Things I'm planning to read:
- 80000 Hours' Artificial intelligence and the 'control problem'
- WaitButWhy's The AI Revolution: The Road to Superintelligence
- Nick Bostrom's Superintelligence
- Dario Amodei and others' Concrete Problems in AI Safety
- Eliezer Yudkowsky's AI Alignment: Why It's Hard, and Where to Start
- Paul Christiano's Three impacts of machine intelligence and Technical and social approaches to AI safety [EDIT: also Prosaic AI alignment]
- Scott Alexander's AI Researchers on AI Risk and No Time Like the Present for AI Safety Work
- EDIT: also Holden Karnofsky's Some Background on Our Views Regarding Advanced Artificial Intelligence and Potential Risks from Advanced Artificial Intelligence: The Philanthropic Opportunity
- EDIT: also Luke Muehlhauser's What Do We Know about AI Timelines? and Replies to people who argue against worrying about long-term AI safety risks today
- EDIT: also Kaj Sotala and Roman Yampolskiy's Responses to catastrophic AGI risk: a survey
- EDIT: also Dario Amodei and others' Learning from Human Preferences
- EDIT: also Brian Tomasik's Artificial Intelligence and Its Implications for Future Suffering
- Maybe some of 80000 Hours' AI safety syllabus, where it looks relevant.
- Maciej Ceglowski's Superintelligence: The Idea That Eats Smart People
- EDIT: also Ernest Davis' Ethical Guidelines for A Superintelligence
Aside from Ceglowski's post, the list above is entirely people who think AI risk should be prioritized, so I'm especially interested to read pieces (if they exist) where machine learning experts explain why they don't think AI risk is a high priority. I'm also interested in other general AI risk background reading, and suggestions of people to talk to.