<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="4.3.3">Jekyll</generator><link href="/feed.xml" rel="self" type="application/atom+xml" /><link href="/" rel="alternate" type="text/html" /><updated>2024-05-01T23:19:56+00:00</updated><id>/feed.xml</id><title type="html">Anthropomorphic AI</title><subtitle>Wandering the spaces of anthropomorphic AI.</subtitle><entry><title type="html">Reboot, then wander</title><link href="/meta/2024/04/29/reboot-and-wander.html" rel="alternate" type="text/html" title="Reboot, then wander" /><published>2024-04-29T00:00:00+00:00</published><updated>2024-04-29T00:00:00+00:00</updated><id>/meta/2024/04/29/reboot-and-wander</id><content type="html" xml:base="/meta/2024/04/29/reboot-and-wander.html"><![CDATA[<p>Three years since the first (and last) post, time to look around and pick up the threads.</p>

<p>I was previously aiming at a more focused sequence, but I think now we’re better served by less focused wandering. The core feeling remains: there’s untapped hope in the space of anthropomorphic-by-design approaches to AI. Old threads of investigation remain too: <strong>mammals that mimic meanings</strong>, <strong>anthropomorphic ideals</strong>, and <strong>artificial life</strong>. But new threads have joined them, and the world has shifted. Large language models and generative image models have proved astoundingly proficient and popular, and both are anthropomorphic in important ways.</p>

<p>So, let’s reboot, then wander.</p>]]></content><author><name></name></author><category term="meta" /><summary type="html"><![CDATA[Three years since the first (and last) post, time to look around and pick up the threads.]]></summary></entry><entry><title type="html">Init</title><link href="/init/2021/01/15/init.html" rel="alternate" type="text/html" title="Init" /><published>2021-01-15T00:00:00+00:00</published><updated>2021-01-15T00:00:00+00:00</updated><id>/init/2021/01/15/init</id><content type="html" xml:base="/init/2021/01/15/init.html"><![CDATA[<p>Artificial Intelligence designs its systems according to two kinds of ideal. One kind involves doing something well: solving problems, maximizing expected utility, minimizing required training data, alongside more concrete ideals like winning Go games or driving cars well. The other involves being like us in some sense: looking like us, behaving like us, or being structured like us. Call the first kind <strong>adeptness ideals</strong> and the second <strong>anthropomorphic ideals</strong>.</p>

<p>Systems today, both in AI alignment and AI more broadly, are typically designed according to adeptness ideals, with humans used mainly for inspiration (outside of scientific efforts to understand humans through AI).</p>

<p>By contrast, in this blog I will argue that designing explicitly <strong>anthropomorphic AI</strong> will be crucial for developing aligned AGI, and give concrete research directions and comparisons to existing approaches. Befitting the medium, I’ll present the ideas in no particular order, pull them together as we go, and promise that all of your questions will be satisfactorily answered in some later post.</p>

<p>Any anthropomorphic AI approach rests on an understanding of what humans are. Among other things, we are <strong>mammals that mimic meanings</strong>. Anthropomorphic AI should be too, given an appropriate generalization of these concepts. I’ll elaborate on what this means and why it’s true in a later post.</p>

<p>Humans are a messy, <a href="https://www.youtube.com/watch?v=7tScAyNaRdQ">meaty</a> outgrowth of biological evolution and historical accident, but there’s ultimately no way of escaping the fact that we are human. All of our highest dreams and aspirations, no matter how abstract and universal, are only human. This fact is not a bad thing, but any aligned AGI approach must take it into account.</p>]]></content><author><name></name></author><category term="init" /><summary type="html"><![CDATA[Artificial Intelligence designs its systems according to two kinds of ideal. One kind involves doing something well: solving problems, maximizing expected utility, minimizing required training data, alongside more concrete ideals like winning Go games or driving cars well. The other involves being like us in some sense: looking like us, behaving like us, or being structured like us. Call the first kind adeptness ideals and the second anthropomorphic ideals.]]></summary></entry></feed>