Two Papers You Should Read Wednesday, Aug 13 2008
AI 11:21 pm
Some of you may have seen these papers already, as I mention them frequently, but they’ll important enough that I like to re-mention them regularly. They’re “Artificial Intelligence as a Positive and Negative Factor in Global Risk” by Eliezer Yudkowsky and “The Basic AI Drives” by Steve Omohundro. The papers are 42 and 11 pages, respectively. There’s no abstract for the first paper, but here’s the abstract for the second:
“One might imagine that AI systems with harmless goals will be harmless. This paper instead shows that intelligent systems will need to be carefully designed to prevent them from behaving in harmful ways. We identify a number of “drives” that will appear in sufficiently advanced AI systems of any design. We call them drives because they are tendencies which will be present unless explicitly counteracted. We start by showing that goal-seeking systems will have drives to model their own operation and to improve themselves. We then show that self-improving systems will be driven to clarify their goals and represent them as economic utility functions. They will also strive for their actions to approximate rational economic behavior. This will lead almost all systems to protect their utility functions from modification and their utility measurement systems from corruption. We also discuss some exceptional systems which will want to modify their utility functions. We next discuss the drive toward self-protection which causes systems try to prevent themselves from being harmed. Finally we examine drives toward the acquisition of resources and toward their efficient utilization. We end with a discussion of how to incorporate these insights in designing intelligent technology which will lead to a positive future for humanity.”
Feel free to post your reactions here.

August 13th, 2008 at 11:50 pm
You know what would be helpful; a paper that explicitly defines and explains what is “A.I.”; because it seems that there are several definitions being used at the time, sometimes interchangeably.
Some define “A.I.” as a self-aware machine. Capable of having an individual identity and self-will; others define it merely as a sufficiently advanced algorithm based machine. I personally define it, not by how it works, but by what it is able to do or mimic.
Perhaps the idea of ‘intelligence’ at this point is not clear enough; some more time needs to be given to properly define it. After all, what shape A.I. will take in the future is still largely unknown.
Still, a paper that attempts to categorize and define what is now a semi-loose concept might prove very helpful (perhaps one already exists. And if not, maybe you could write it.)
In any case, I found both those papers very interesting, and will pass them on to whoever is interested.
(I particularly find it interesting that an intelligent program will act in ways that the designers of said program did not predict. It dispels the notion that the digital world is completely in our control; it is very much based on rules and logic, which stem directly from reality.)
August 14th, 2008 at 8:00 am
Ryan is correct that the definitions are a true stumbling block. I wrote my Masters report on the deployment of Sensor Web technology with Intelligent Software Agents and what was immediately apparent when I began researching AI was the definition dilemma. The concepts are not very standardized and we have a long long ways to go.
I’ve extracted a few relevant paragraphs from my paper:
August 15th, 2008 at 1:43 pm
There was discussion about the AI Drives paper on the AGI list, with some criticism from Richard Loosemore - see this message, for instance.
August 23rd, 2008 at 10:10 pm
What I find implausible in Steve’s paper is an underlying assumption I consider quite suspect. The assumption is that an AI with the unalterable goal to be the best at X at the heart of all its motivations, is going to somehow attain a broad enough knowledge of all non-X specific things to understand which, if mastered sufficiently, will make it better at X. If we assume for the sake of argument that it does somehow attain this knowledge and seeks such mastery then how could it do so without comparing X to all other possible goals closely related to mastering these other domains? I find it very unbelievable that the AI would deviate enough from X to notice these other fields much less to master them and that if it did it would still be oblivious to all but being better at X tossing aside all other perspectives, goals and values it encountered to master these other abilities.