<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Response to &#8220;What is friendly?&#8221;</title>
	<atom:link href="http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/</link>
	<description>Transhumanism, AI, nanotechnology, the Singularity, and extinction risk.</description>
	<lastBuildDate>Mon, 13 Feb 2012 02:08:02 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Michael Anissimov</title>
		<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/#comment-5399</link>
		<dc:creator>Michael Anissimov</dc:creator>
		<pubDate>Tue, 10 Oct 2006 15:48:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.acceleratingfuture.com/michael/blog/?p=206#comment-5399</guid>
		<description>Nick: I prefer the way that anthropomorphism is dispelled in CFAI, in the section, &quot;Beyond anthropomorphism&quot;.  CFAI talks about &quot;conservative for AI&quot; while &quot;AGI and Global Risk&quot; doesn&#039;t.  CFAI presents the important Sysop idea, while AGIGR doesn&#039;t.  CFAI has &quot;the story of a blob&quot;, AGIGR doesn&#039;t.  CFAI talks explicitly about observer-biased beliefs and observer-centric goal systems, whereas AGIGR does not.  CFAI is more technical, proposing design characteristics for Friendly AI, while AGIGR does not.  CFAI covers a range of design challenges barely even broached in AGIGR.  CFAI is more theoretically rich, conceptualizing humans as three layers of functional complexity, and discussing strategies we might use to ground a normatively altruistic AI in that complexity.  CFAI describes normatively altruistic Friendly AI, and people approaching Friendly AI for the first time should be able to answer the question, &quot;Do you agree or disagree with the following statement? &quot;Creating Friendly AI as a normative altruist is a good idea.&quot;  Explain why or why not&quot;.  CFAI is longer, framing the problem in more detail.  Anyone who wants to be qualified to talk about Friendly AI will need to read CFAI sooner or later anyway, and they might as well read it first and AGIGR afterwards, rather than just reading AGIGR and saying, &quot;hey, I&#039;m done, I can go argue my point of view now&quot;.

The existing literature on Friendly AI is not large.  It&#039;s something like 500 pages, and can be read in a week or two by anyone who thinks the problem is important enough to spend some time on.  Might as well start with the oldest, largest piece of work first, then work your way down to the newer, shorter stuff.</description>
		<content:encoded><![CDATA[<p>Nick: I prefer the way that anthropomorphism is dispelled in CFAI, in the section, &#8220;Beyond anthropomorphism&#8221;.  CFAI talks about &#8220;conservative for AI&#8221; while &#8220;AGI and Global Risk&#8221; doesn&#8217;t.  CFAI presents the important Sysop idea, while AGIGR doesn&#8217;t.  CFAI has &#8220;the story of a blob&#8221;, AGIGR doesn&#8217;t.  CFAI talks explicitly about observer-biased beliefs and observer-centric goal systems, whereas AGIGR does not.  CFAI is more technical, proposing design characteristics for Friendly AI, while AGIGR does not.  CFAI covers a range of design challenges barely even broached in AGIGR.  CFAI is more theoretically rich, conceptualizing humans as three layers of functional complexity, and discussing strategies we might use to ground a normatively altruistic AI in that complexity.  CFAI describes normatively altruistic Friendly AI, and people approaching Friendly AI for the first time should be able to answer the question, &#8220;Do you agree or disagree with the following statement? &#8220;Creating Friendly AI as a normative altruist is a good idea.&#8221;  Explain why or why not&#8221;.  CFAI is longer, framing the problem in more detail.  Anyone who wants to be qualified to talk about Friendly AI will need to read CFAI sooner or later anyway, and they might as well read it first and AGIGR afterwards, rather than just reading AGIGR and saying, &#8220;hey, I&#8217;m done, I can go argue my point of view now&#8221;.</p>
<p>The existing literature on Friendly AI is not large.  It&#8217;s something like 500 pages, and can be read in a week or two by anyone who thinks the problem is important enough to spend some time on.  Might as well start with the oldest, largest piece of work first, then work your way down to the newer, shorter stuff.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Miron's Weblog</title>
		<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/#comment-5374</link>
		<dc:creator>Miron's Weblog</dc:creator>
		<pubDate>Tue, 10 Oct 2006 07:29:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.acceleratingfuture.com/michael/blog/?p=206#comment-5374</guid>
		<description>&lt;strong&gt;What is Friendly?&lt;/strong&gt;

Bob Mottram argues in a post on the Streeb-Greebling Diaries that Friendly AI suffers from a critcal flaw.  The definition of Friendly is lacking:
The elephant in the room here though is that there really is no good definition for what qualifies as &amp;#8...</description>
		<content:encoded><![CDATA[<p><strong>What is Friendly?</strong></p>
<p>Bob Mottram argues in a post on the Streeb-Greebling Diaries that Friendly AI suffers from a critcal flaw.  The definition of Friendly is lacking:<br />
The elephant in the room here though is that there really is no good definition for what qualifies as &amp;#8&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Hay</title>
		<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/#comment-5198</link>
		<dc:creator>Nick Hay</dc:creator>
		<pubDate>Mon, 09 Oct 2006 00:34:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.acceleratingfuture.com/michael/blog/?p=206#comment-5198</guid>
		<description>Michael:  Why do you point to CFAI rather than AI and global risks?  Is the former a better introduction?

randpost:  The default is for the AI to do nothing.  Only where it can find some sort of coherence in human wishes need it do anything e.g. protect against rogue AIs.  Defining &quot;coherence in human wishes&quot; is a fairly complex technical problem, but is orthogonal to contemporary moral disputes.  It&#039;s a different kind of problem.

Riley:  You&#039;re quite right that you need to solve both problems.  This is explicitly mentioned in the original CEV proposal (http://www.singinst.org/friendly/extrapolated-volition.html , see the list of 3 life-or-death problems).</description>
		<content:encoded><![CDATA[<p>Michael:  Why do you point to CFAI rather than AI and global risks?  Is the former a better introduction?</p>
<p>randpost:  The default is for the AI to do nothing.  Only where it can find some sort of coherence in human wishes need it do anything e.g. protect against rogue AIs.  Defining &#8220;coherence in human wishes&#8221; is a fairly complex technical problem, but is orthogonal to contemporary moral disputes.  It&#8217;s a different kind of problem.</p>
<p>Riley:  You&#8217;re quite right that you need to solve both problems.  This is explicitly mentioned in the original CEV proposal (<a href="http://www.singinst.org/friendly/extrapolated-volition.html" rel="nofollow">http://www.singinst.org/friendly/extrapolated-volition.html</a> , see the list of 3 life-or-death problems).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Riley Gutzeit</title>
		<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/#comment-5195</link>
		<dc:creator>Riley Gutzeit</dc:creator>
		<pubDate>Sun, 08 Oct 2006 23:33:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.acceleratingfuture.com/michael/blog/?p=206#comment-5195</guid>
		<description>Imagine you&#039;re in a field with a long-range sniper rifle, one kilometer from a fence. On the fence, every meter, is a beer can. Over your eyes is a blindfold with a very complex knot. You have one bullet. If you hit a beer can on your first shot, the human race is put in a future determined by which beer can you hit and where exactly on the can the bullet pierces. If you miss, we all die. Worrying about what exactly we want an FAI to do before figuring out how to make an AI that will do what we tell it is like debating which can is better to hit before untying the knot on the blindfold.

Even if we decided that CEV or some competing proposal is /the/ thing that we want an FAI to do, we don&#039;t have an AI that can do it. There are a lot of optimization targets for a superintelligence that result in the immediate termination of the human race. There needn&#039;t be specific mention of the human race--it just goes poof by default. That&#039;s half the problem. The other half is we have no way of aiming the superintelligence at any particular target. That&#039;s the other half. Put them together and we die.

I think in communicating FAI, Singluaritarians tend to stress how hard it is to choose the right goal, but the more fundamental problem gets ignored. The examples of the solar system being turned into paperclips or nanoscale smiley faces unfortunately imply that the problem was just the poor specification of the target: The problem isn&#039;t that the AI didn&#039;t maximize paperclips, it&#039;s that it didn&#039;t understand the tacit assumption that not overwriting the galaxy takes precedence. A perhaps better example: If we were to create an AI that responded positively to smiling faces and saught to make people smile, then, upon reaching superintelligence, it would rearrange every atom in the solar system into Alka-Seltzer (except, obviously, those atoms currently composing introductory economics textbooks) and spend the next several million years arranging the Milky Way into a scale model of Texas State Highway 82.</description>
		<content:encoded><![CDATA[<p>Imagine you&#8217;re in a field with a long-range sniper rifle, one kilometer from a fence. On the fence, every meter, is a beer can. Over your eyes is a blindfold with a very complex knot. You have one bullet. If you hit a beer can on your first shot, the human race is put in a future determined by which beer can you hit and where exactly on the can the bullet pierces. If you miss, we all die. Worrying about what exactly we want an FAI to do before figuring out how to make an AI that will do what we tell it is like debating which can is better to hit before untying the knot on the blindfold.</p>
<p>Even if we decided that CEV or some competing proposal is /the/ thing that we want an FAI to do, we don&#8217;t have an AI that can do it. There are a lot of optimization targets for a superintelligence that result in the immediate termination of the human race. There needn&#8217;t be specific mention of the human race&#8211;it just goes poof by default. That&#8217;s half the problem. The other half is we have no way of aiming the superintelligence at any particular target. That&#8217;s the other half. Put them together and we die.</p>
<p>I think in communicating FAI, Singluaritarians tend to stress how hard it is to choose the right goal, but the more fundamental problem gets ignored. The examples of the solar system being turned into paperclips or nanoscale smiley faces unfortunately imply that the problem was just the poor specification of the target: The problem isn&#8217;t that the AI didn&#8217;t maximize paperclips, it&#8217;s that it didn&#8217;t understand the tacit assumption that not overwriting the galaxy takes precedence. A perhaps better example: If we were to create an AI that responded positively to smiling faces and saught to make people smile, then, upon reaching superintelligence, it would rearrange every atom in the solar system into Alka-Seltzer (except, obviously, those atoms currently composing introductory economics textbooks) and spend the next several million years arranging the Milky Way into a scale model of Texas State Highway 82.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bob Mottram</title>
		<link>http://www.acceleratingfuture.com/michael/blog/2006/10/response-to-what-is-friendly/#comment-5189</link>
		<dc:creator>Bob Mottram</dc:creator>
		<pubDate>Sun, 08 Oct 2006 21:10:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.acceleratingfuture.com/michael/blog/?p=206#comment-5189</guid>
		<description>It&#039;s true that human-like brains are constrained within their goal attractors, but these constrains aren&#039;t especially narrow.  The range of human goal systems is fairly wide, from Mother Teresa to Adolf Hitler and everything in between.  Admittedly these are the statistical outliers and most people fit within the hump of a normal distribution of dispositions.

It&#039;s possible that some AIs may be constructed with inflexible supergoals or fixed utility programs.  However, these machines will probably not be able to graduate to AGI status, and will remain as sophisticated but narrowly circumscribed optimisation systems.  Such systems will be like the guy at cocktail parties whose conversation always gravitates towards some monotonous pet subject, no matter what the initial topic may have been.  For a true AGI, capable of self modification, it&#039;s hard to see how such fixed algorithms would remain unchanged for very long.

I agree with Goertzel and others that there will be a first mover advantage for whoever manages to come up with a workable AGI design.  However, I&#039;m not sure that the takeoff will necessarily be hard.  The first AGIs created will almost certainly be of sub-human level intelligence.  It may take some time to bring these up to speed, and there may be technical difficulties involved in supplying them with knowledge or having them directly experience the world (the notorious grounding problem).  If the initial advantage is significant but still not of global proportions it&#039;s likely that we&#039;ll see various AGI designs produced by different research groups around the world for varying reasons and being put to various uses.  I think this more heterogenous &quot;Gates scenario&quot; is more likely in the short term than a single all-powerful AGI presiding omnipotently like Deep Thought over world affairs, if only for the reason that human tribalism will tend to work against such a situation.

Tribalism is bound to play a role in the development of any future superintelligences.  An AGI constructed by well-intentioned first movers to obey the zeroth law may generate problems for the human population which were unforseen at design time.  Does the good of the many always outweight the interests of the few?    Should production of a new vaccine cease to divert resources to poverty relief?  Even if there were an all-powerful AGI calling the shots, it&#039;s unlikely that all human leaders would agree with its &quot;friendly&quot; decisions.</description>
		<content:encoded><![CDATA[<p>It&#8217;s true that human-like brains are constrained within their goal attractors, but these constrains aren&#8217;t especially narrow.  The range of human goal systems is fairly wide, from Mother Teresa to Adolf Hitler and everything in between.  Admittedly these are the statistical outliers and most people fit within the hump of a normal distribution of dispositions.</p>
<p>It&#8217;s possible that some AIs may be constructed with inflexible supergoals or fixed utility programs.  However, these machines will probably not be able to graduate to AGI status, and will remain as sophisticated but narrowly circumscribed optimisation systems.  Such systems will be like the guy at cocktail parties whose conversation always gravitates towards some monotonous pet subject, no matter what the initial topic may have been.  For a true AGI, capable of self modification, it&#8217;s hard to see how such fixed algorithms would remain unchanged for very long.</p>
<p>I agree with Goertzel and others that there will be a first mover advantage for whoever manages to come up with a workable AGI design.  However, I&#8217;m not sure that the takeoff will necessarily be hard.  The first AGIs created will almost certainly be of sub-human level intelligence.  It may take some time to bring these up to speed, and there may be technical difficulties involved in supplying them with knowledge or having them directly experience the world (the notorious grounding problem).  If the initial advantage is significant but still not of global proportions it&#8217;s likely that we&#8217;ll see various AGI designs produced by different research groups around the world for varying reasons and being put to various uses.  I think this more heterogenous &#8220;Gates scenario&#8221; is more likely in the short term than a single all-powerful AGI presiding omnipotently like Deep Thought over world affairs, if only for the reason that human tribalism will tend to work against such a situation.</p>
<p>Tribalism is bound to play a role in the development of any future superintelligences.  An AGI constructed by well-intentioned first movers to obey the zeroth law may generate problems for the human population which were unforseen at design time.  Does the good of the many always outweight the interests of the few?    Should production of a new vaccine cease to divert resources to poverty relief?  Even if there were an all-powerful AGI calling the shots, it&#8217;s unlikely that all human leaders would agree with its &#8220;friendly&#8221; decisions.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

