<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Goal System Zero</title>
	<atom:link href="http://rhollerith.com/blog/21/feed" rel="self" type="application/rss+xml" />
	<link>http://rhollerith.com/blog/21</link>
	<description>overcoming bias, improving the world, refining the art of human rationality</description>
	<lastBuildDate>Tue, 07 Jul 2009 15:55:17 -0700</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-87</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Thu, 28 Aug 2008 20:30:45 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-87</guid>
		<description>Also: I have not thoroughly read knowability of FAI. I&#039;ll let you know when i have...</description>
		<content:encoded><![CDATA[<p>Also: I have not thoroughly read knowability of FAI. I&#8217;ll let you know when i have&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-86</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Thu, 28 Aug 2008 15:39:22 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-86</guid>
		<description>&quot;This has been discussed a lot on SL4, Roko.&quot;

 - can you give me either a link to the relevant posts, or some info on which keywords to search them by?

&quot;they are able to hit a very tiny region in the space of all possible futures with very high probability&quot;

 - I think that there are deep problems with this idea see &lt;a href=&quot;http://transhumangoodness.blogspot.com/2008/07/ontologies-approximations-and.html&quot; rel=&quot;nofollow&quot;&gt; my post on ontologies and approximations&lt;/a&gt;

In essence, we can&#039;t totally represent the space of all possible futures; any attempt to do so is only approximate. 

On the other side of a singularity, the quality of the approximations that we use today (both in terms of coarse-grainedness and in terms of deeply mistaken assumptions) will look much worse than the quality of the ontology that a chimpanzee uses looks to us.

We don&#039;t even know what the future can look like, because we&#039;re not clever enough, and attempts to forever constrain the behavior of a seed AI will be a bit like a chimp replacing the US constitution with &quot;more bananas!&quot;

Of course the hard part is balancing the effect that we have on curtailing possible futures against the treatment that we receive in those possible futures; I am still a little hazy on how to do this.</description>
		<content:encoded><![CDATA[<p>&#8220;This has been discussed a lot on SL4, Roko.&#8221;</p>
<p> &#8211; can you give me either a link to the relevant posts, or some info on which keywords to search them by?</p>
<p>&#8220;they are able to hit a very tiny region in the space of all possible futures with very high probability&#8221;</p>
<p> &#8211; I think that there are deep problems with this idea see <a href="http://transhumangoodness.blogspot.com/2008/07/ontologies-approximations-and.html" rel="nofollow"> my post on ontologies and approximations</a></p>
<p>In essence, we can&#8217;t totally represent the space of all possible futures; any attempt to do so is only approximate. </p>
<p>On the other side of a singularity, the quality of the approximations that we use today (both in terms of coarse-grainedness and in terms of deeply mistaken assumptions) will look much worse than the quality of the ontology that a chimpanzee uses looks to us.</p>
<p>We don&#8217;t even know what the future can look like, because we&#8217;re not clever enough, and attempts to forever constrain the behavior of a seed AI will be a bit like a chimp replacing the US constitution with &#8220;more bananas!&#8221;</p>
<p>Of course the hard part is balancing the effect that we have on curtailing possible futures against the treatment that we receive in those possible futures; I am still a little hazy on how to do this.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Hollerith</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-85</link>
		<dc:creator>Richard Hollerith</dc:creator>
		<pubDate>Wed, 27 Aug 2008 23:55:28 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-85</guid>
		<description>In other words, I always thought it is impossible to  launch the seed of an SI without committing yourself irrevocably to something -- or more precisely, if it is possible,  I cannot imagine why you would want to.  You can launch a seed you do not understand, and you can launch a seed you do understand that predictably implements a terminal goal you do not understand.  I always thought those two things would be a bad idea (and I called them very irresponsible just now).

Let me give an example.  Suppose I launch a seed whose utility function &lt;code&gt;U&lt;/code&gt; is a weighted average of two subfunctions:

&lt;code&gt;U(e) == 0.3 * GSZ(e) + 0.7 * rest(e) for every e&lt;/code&gt;

&lt;code&gt;GSZ&lt;/code&gt; is goal system zero and &lt;code&gt;rest(e)&lt;/code&gt; is the probability that that e would have happened if I had not launched the seed.  Note that this utility function is ambiguous in that it does not specify what fraction of its &quot;resources&quot; the seed should devote to the determination of the probability distribution &lt;code&gt;rest&lt;/code&gt;, but it suffices for present purposes.

One way to look at this seed is that it leaves the future &quot;open&quot; in the sense that I have very little control over how the future would turn out if I do not launch the seed, and that counterfactual &quot;open&quot; future is represented in the actual future by that term &lt;code&gt;rest(e)&lt;/code&gt;.  But on the other hand, I would consider it irresponsible for anyone to launch the seed before he has an unambigous definition of &lt;code&gt;rest&lt;/code&gt; and makes himself quite certain the definition means what he thinks it means.  And I hope you will concede that it is possible to be quite certain that the definition means what you think it means without having any significant degree of control over or ability to predict what would have happened if you do not launch the seed.  And I submit to you that the choice to launch the seed with the &lt;code&gt;rest(e)&lt;/code&gt; term instead of no term or instead of some other term is an irrevocable choice.</description>
		<content:encoded><![CDATA[<p>In other words, I always thought it is impossible to  launch the seed of an SI without committing yourself irrevocably to something &#8212; or more precisely, if it is possible,  I cannot imagine why you would want to.  You can launch a seed you do not understand, and you can launch a seed you do understand that predictably implements a terminal goal you do not understand.  I always thought those two things would be a bad idea (and I called them very irresponsible just now).</p>
<p>Let me give an example.  Suppose I launch a seed whose utility function <code>U</code> is a weighted average of two subfunctions:</p>
<p><code>U(e) == 0.3 * GSZ(e) + 0.7 * rest(e) for every e</code></p>
<p><code>GSZ</code> is goal system zero and <code>rest(e)</code> is the probability that that e would have happened if I had not launched the seed.  Note that this utility function is ambiguous in that it does not specify what fraction of its &#8220;resources&#8221; the seed should devote to the determination of the probability distribution <code>rest</code>, but it suffices for present purposes.</p>
<p>One way to look at this seed is that it leaves the future &#8220;open&#8221; in the sense that I have very little control over how the future would turn out if I do not launch the seed, and that counterfactual &#8220;open&#8221; future is represented in the actual future by that term <code>rest(e)</code>.  But on the other hand, I would consider it irresponsible for anyone to launch the seed before he has an unambigous definition of <code>rest</code> and makes himself quite certain the definition means what he thinks it means.  And I hope you will concede that it is possible to be quite certain that the definition means what you think it means without having any significant degree of control over or ability to predict what would have happened if you do not launch the seed.  And I submit to you that the choice to launch the seed with the <code>rest(e)</code> term instead of no term or instead of some other term is an irrevocable choice.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Hollerith</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-84</link>
		<dc:creator>Richard Hollerith</dc:creator>
		<pubDate>Wed, 27 Aug 2008 23:22:20 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-84</guid>
		<description>This has been discussed a lot on SL4, Roko.  The most cogent remark there that I can remember right now (by Eliezer) remarked that if offered a pill that would make him selfish rather than altruistic, or a pill that would cause him to start to believe that murder was OK, Eliezer would choose not to take the pill.  So the general consensus IIRC on SL4 (which I share) was that even if an AI has full-overwrite privileges, it is not going to use them in a way that has more than a negligible chance of altering its terminal goal.   Are you sure you are not trying to generalize from your experience with your own mind or with human minds?  

Also, I want to retract my previous statement that every human has an unchanging terminal goal for some sensible definition of terminal goal.  It now occurs to me that probably every human&#039;s terminal goal can be influenced by causes outside the human.  It is a pretty puzzle to say what exactly is outside or inside a human.  If for example, I consult my own blog to remind myself of what my terminal goal is and if if-counterfactual I had not consulted my blog to remind myself, then my terminal goal would have turned out differently, then one can say that my terminal goal has been affected by a cause outside myself.  But are the words I have written on my own blog really outside myself?  Why not consider them part of myself?

So the situation with the human beings is more complicated than I implied in my previous comment, but the point remains that it is very irresponsible to launch the seed of a SI unless

(1) you possess an unambiguous (preferably formal a la code or formulae) compact description of the terminal goal of the seed and unless

(2) you understand the seed well enough to know with high confidence that the seed will steer the future into the terminal goal -- that is, that the SI will truly be under the control of the terminal goal.

In other words, the way I hope the future will go is that no team will launch a seed for an SI until they are able to hit a very tiny region in the space of all possible futures with very high probability.  It is this very tiny region that I wish to denote with the phrase &quot;terminal goal of the SI&quot;.  The region has a compact description because if it did not then mere humans would not have a chance to hit it with very high probability.

Eliezer&#039;s &lt;a href=&quot;http://sl4.org/wiki/KnowabilityOfFAI&quot; rel=&quot;nofollow&quot;&gt;Knowability of AI&lt;/a&gt; is very illuminating on those two points.  If you reply to this comment, please indicate whether you have groked that document.</description>
		<content:encoded><![CDATA[<p>This has been discussed a lot on SL4, Roko.  The most cogent remark there that I can remember right now (by Eliezer) remarked that if offered a pill that would make him selfish rather than altruistic, or a pill that would cause him to start to believe that murder was OK, Eliezer would choose not to take the pill.  So the general consensus IIRC on SL4 (which I share) was that even if an AI has full-overwrite privileges, it is not going to use them in a way that has more than a negligible chance of altering its terminal goal.   Are you sure you are not trying to generalize from your experience with your own mind or with human minds?  </p>
<p>Also, I want to retract my previous statement that every human has an unchanging terminal goal for some sensible definition of terminal goal.  It now occurs to me that probably every human&#8217;s terminal goal can be influenced by causes outside the human.  It is a pretty puzzle to say what exactly is outside or inside a human.  If for example, I consult my own blog to remind myself of what my terminal goal is and if if-counterfactual I had not consulted my blog to remind myself, then my terminal goal would have turned out differently, then one can say that my terminal goal has been affected by a cause outside myself.  But are the words I have written on my own blog really outside myself?  Why not consider them part of myself?</p>
<p>So the situation with the human beings is more complicated than I implied in my previous comment, but the point remains that it is very irresponsible to launch the seed of a SI unless</p>
<p>(1) you possess an unambiguous (preferably formal a la code or formulae) compact description of the terminal goal of the seed and unless</p>
<p>(2) you understand the seed well enough to know with high confidence that the seed will steer the future into the terminal goal &#8212; that is, that the SI will truly be under the control of the terminal goal.</p>
<p>In other words, the way I hope the future will go is that no team will launch a seed for an SI until they are able to hit a very tiny region in the space of all possible futures with very high probability.  It is this very tiny region that I wish to denote with the phrase &#8220;terminal goal of the SI&#8221;.  The region has a compact description because if it did not then mere humans would not have a chance to hit it with very high probability.</p>
<p>Eliezer&#8217;s <a href="http://sl4.org/wiki/KnowabilityOfFAI" rel="nofollow">Knowability of AI</a> is very illuminating on those two points.  If you reply to this comment, please indicate whether you have groked that document.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-82</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Wed, 27 Aug 2008 20:22:25 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-82</guid>
		<description>In fact I would go further and say that writing an AI with full overwrite privileges is the best way to go. It seems more risky in the short term, but in the long term you have the advantage that you haven&#039;t committed yourself irrevocably to something.</description>
		<content:encoded><![CDATA[<p>In fact I would go further and say that writing an AI with full overwrite privileges is the best way to go. It seems more risky in the short term, but in the long term you have the advantage that you haven&#8217;t committed yourself irrevocably to something.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-81</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Wed, 27 Aug 2008 20:14:59 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-81</guid>
		<description>&quot;Roko. How would this agent change its terminal goal? What criterion would it use to decide when and how to change its terminal goal? I humbly suggest that whatever that criterion is is the agent’s real terminal goal.&quot;

 - suppose I have a computer program implemented on a computer which works by taking an input string and replacing the entirety of its own code with that string, including the section which talks about replacing the source code. 

In this case, there are sequences of inputs where nothing remains constant over each input/output/modify cycle. 

So, given this definition, not all agents have terminal goals. 

Is it irresponsible to write an agent that doesn&#039;t have some constant part? 

I don&#039;t think so - I think that it is good enough to be highly confident that the agent&#039;s motivational system is what you want it to be, even if, in general, that motivational system can overwrite itself.</description>
		<content:encoded><![CDATA[<p>&#8220;Roko. How would this agent change its terminal goal? What criterion would it use to decide when and how to change its terminal goal? I humbly suggest that whatever that criterion is is the agent’s real terminal goal.&#8221;</p>
<p> &#8211; suppose I have a computer program implemented on a computer which works by taking an input string and replacing the entirety of its own code with that string, including the section which talks about replacing the source code. </p>
<p>In this case, there are sequences of inputs where nothing remains constant over each input/output/modify cycle. </p>
<p>So, given this definition, not all agents have terminal goals. </p>
<p>Is it irresponsible to write an agent that doesn&#8217;t have some constant part? </p>
<p>I don&#8217;t think so &#8211; I think that it is good enough to be highly confident that the agent&#8217;s motivational system is what you want it to be, even if, in general, that motivational system can overwrite itself.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Hollerith</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-80</link>
		<dc:creator>Richard Hollerith</dc:creator>
		<pubDate>Wed, 27 Aug 2008 19:39:18 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-80</guid>
		<description>&lt;blockquote&gt;when you decide that “X” is your goal, and then implement a clause in your goal system that says that you should never change your goal (like all utility maximizers do) you have done something very silly.&lt;/blockquote&gt;

Suppose you build a artificial general intelligence, Roko.   How would this agent change its terminal goal?  What criterion would it use to decide when and how to change its terminal goal?  I humbly suggest that whatever that criterion is is the agent&#039;s real terminal goal.  The thing that changed was never the agent&#039;s terminal goal.

In other words, I suggest that we define the phrase terminal goal (or system of terminal goals or values) in such a way that an agent cannot change its terminal goal.  

What about people like you and I who seem to be able to change our own terminal values?  (For example, I have tentatively chosen goal system zero as my system of terminal values, and it was not till I was 32 years old that I became aware of the existence of anything resembling goal system zero (GSZ), suggesting that at some time after my 32nd birthday I changed my terminal values.)  Well, I suggest we view humans as &quot;messy intelligent agents&quot; with the property that no one (not even the holder of the terminal goal) can become truly confident about what a human&#039;s terminal goal is.   According to this view, even I do not know with certainty my true system of terminal values.  There is a good chance that it really is GSZ, but there is also a chance that I am deceiving myself about the nature of my system of terminal values.

Superintelligent AIs are different from humans in this regard.  Although it is &lt;i&gt;possible&lt;/i&gt;, it is very irresponsible not to be highly confident of the agent&#039;s true terminal goal when launching the seed of a superintelligence.

(Eliezer&#039;s CEV proposal is not irresponsible in this way.)</description>
		<content:encoded><![CDATA[<blockquote><p>when you decide that “X” is your goal, and then implement a clause in your goal system that says that you should never change your goal (like all utility maximizers do) you have done something very silly.</p></blockquote>
<p>Suppose you build a artificial general intelligence, Roko.   How would this agent change its terminal goal?  What criterion would it use to decide when and how to change its terminal goal?  I humbly suggest that whatever that criterion is is the agent&#8217;s real terminal goal.  The thing that changed was never the agent&#8217;s terminal goal.</p>
<p>In other words, I suggest that we define the phrase terminal goal (or system of terminal goals or values) in such a way that an agent cannot change its terminal goal.  </p>
<p>What about people like you and I who seem to be able to change our own terminal values?  (For example, I have tentatively chosen goal system zero as my system of terminal values, and it was not till I was 32 years old that I became aware of the existence of anything resembling goal system zero (GSZ), suggesting that at some time after my 32nd birthday I changed my terminal values.)  Well, I suggest we view humans as &#8220;messy intelligent agents&#8221; with the property that no one (not even the holder of the terminal goal) can become truly confident about what a human&#8217;s terminal goal is.   According to this view, even I do not know with certainty my true system of terminal values.  There is a good chance that it really is GSZ, but there is also a chance that I am deceiving myself about the nature of my system of terminal values.</p>
<p>Superintelligent AIs are different from humans in this regard.  Although it is <i>possible</i>, it is very irresponsible not to be highly confident of the agent&#8217;s true terminal goal when launching the seed of a superintelligence.</p>
<p>(Eliezer&#8217;s CEV proposal is not irresponsible in this way.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-79</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Wed, 27 Aug 2008 16:49:52 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-79</guid>
		<description>I think that the only natural way to avoid arbitrary choices in &quot;goals&quot; or &quot;terminal values&quot; is to exploit those facts that underlie the question &quot;what should I do? What goal should I have?&quot;. 

As one spends more time thinking about the question, one realizes that even in asking the question, a lot is implicit. 

Firstly, the fact that you are a particular agent with sensors, effectors, some kind of (finite) brain and mind is implicit. 

Secondly, the actual laws of physics of our universe are implicit. 

Thirdly, by using the word &quot;could&quot; in a world with deterministic physics, you are implicitly using some kind of approximate ontology and simplified version of physics. 

That&#039;s actually quite a lot to go on - and I suspect that from those assumptions, one can construct a canonical set of goals along the lines of:

 - I need more free energy, space, time, matter, ingenuity and intelligence

 - I need to expand my brainpower, because I have a necessarily limited representation of the world and my choice of possible goals is limited by that representation. 

 - I need to overpower other agents with rival goals, but I don&#039;t want to delete them completely because they constitute highly ordered intelligence and so are necessarily more useful than raw materials in a universe where raw materials are fairly abundant</description>
		<content:encoded><![CDATA[<p>I think that the only natural way to avoid arbitrary choices in &#8220;goals&#8221; or &#8220;terminal values&#8221; is to exploit those facts that underlie the question &#8220;what should I do? What goal should I have?&#8221;. </p>
<p>As one spends more time thinking about the question, one realizes that even in asking the question, a lot is implicit. </p>
<p>Firstly, the fact that you are a particular agent with sensors, effectors, some kind of (finite) brain and mind is implicit. </p>
<p>Secondly, the actual laws of physics of our universe are implicit. </p>
<p>Thirdly, by using the word &#8220;could&#8221; in a world with deterministic physics, you are implicitly using some kind of approximate ontology and simplified version of physics. </p>
<p>That&#8217;s actually quite a lot to go on &#8211; and I suspect that from those assumptions, one can construct a canonical set of goals along the lines of:</p>
<p> &#8211; I need more free energy, space, time, matter, ingenuity and intelligence</p>
<p> &#8211; I need to expand my brainpower, because I have a necessarily limited representation of the world and my choice of possible goals is limited by that representation. </p>
<p> &#8211; I need to overpower other agents with rival goals, but I don&#8217;t want to delete them completely because they constitute highly ordered intelligence and so are necessarily more useful than raw materials in a universe where raw materials are fairly abundant</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Roko</title>
		<link>http://rhollerith.com/blog/21/comment-page-1#comment-78</link>
		<dc:creator>Roko</dc:creator>
		<pubDate>Wed, 27 Aug 2008 16:37:25 +0000</pubDate>
		<guid isPermaLink="false">http://rhollerith.com/blog/21#comment-78</guid>
		<description>Richard: &lt;i&gt;My reason for prefering that agent to, e.g., one that maximizes moments of human happiness is that I perceive no valid reason to prefer happiness moments and gold atoms. (Yes, I concede that I have an austere and dry way of looking at the world.)&lt;/i&gt;

I think that the reason that we have come up with similar ideas is that we have the same desire to not make arbitrary choices. The latest comment on my blog &lt;a href=&quot;https://www.blogger.com/comment.g?blogID=7813164664144593334&amp;postID=7646613949377166277&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt; gives a summary of my thought process:

&lt;i&gt;&lt;b&gt;Firstly, I dislike having to make choices: the entire paradigm of &quot;ends&quot; and &quot;means&quot; basically forces a very large arbitrary choice on you.

I then found out that, after having made the arbitrary choice of what terminal value you want to adopt, for all &quot;sensible&quot; choices (which might be hard to make precise), you end up pursuing the same set of instrumental values - outlined in steve omohundro&#039;s paper, and in an earlier post of mine. So you can, in a sense, avoid the choice by pursuing those instrumental values instead of any one particular terminal value. You can easily dip in and out of each particular terminal state once you have accumulated enough free energy, space, matter, intelligence, etc to do so. By pursuing Often instrumental values, you get to a state which close to being optimal with respect to most terminal values you could have chosen to start with.

But, as you point out, there is no better way of achieving X than by having X as your one true goal. Adopting the OIVs instead necessarily means that you will not achieve X quite as well.

Now in my post on ontologies approximations and fundamentalists, I realized that something else is going on. There are two mathematical structures present when you choose which goal to spend your life pursuing: firstly, there is the instrumental structure: if you spend time pursuing the goal of getting more OIVs, you make all other goals easier.

There is another structure present: the mere fact that your mind is finite means that your list of goals to pursue is far smaller than the set of possible states of the universe. So when you decide that &quot;X&quot; is your goal, and then implement a clause in your goal system that says that you should never change your goal (like all utility maximizers do) you have done something very silly.&lt;/i&gt;&lt;/b&gt;</description>
		<content:encoded><![CDATA[<p>Richard: <i>My reason for prefering that agent to, e.g., one that maximizes moments of human happiness is that I perceive no valid reason to prefer happiness moments and gold atoms. (Yes, I concede that I have an austere and dry way of looking at the world.)</i></p>
<p>I think that the reason that we have come up with similar ideas is that we have the same desire to not make arbitrary choices. The latest comment on my blog <a href="https://www.blogger.com/comment.g?blogID=7813164664144593334&amp;postID=7646613949377166277" rel="nofollow">here</a> gives a summary of my thought process:</p>
<p><i><b>Firstly, I dislike having to make choices: the entire paradigm of &#8220;ends&#8221; and &#8220;means&#8221; basically forces a very large arbitrary choice on you.</p>
<p>I then found out that, after having made the arbitrary choice of what terminal value you want to adopt, for all &#8220;sensible&#8221; choices (which might be hard to make precise), you end up pursuing the same set of instrumental values &#8211; outlined in steve omohundro&#8217;s paper, and in an earlier post of mine. So you can, in a sense, avoid the choice by pursuing those instrumental values instead of any one particular terminal value. You can easily dip in and out of each particular terminal state once you have accumulated enough free energy, space, matter, intelligence, etc to do so. By pursuing Often instrumental values, you get to a state which close to being optimal with respect to most terminal values you could have chosen to start with.</p>
<p>But, as you point out, there is no better way of achieving X than by having X as your one true goal. Adopting the OIVs instead necessarily means that you will not achieve X quite as well.</p>
<p>Now in my post on ontologies approximations and fundamentalists, I realized that something else is going on. There are two mathematical structures present when you choose which goal to spend your life pursuing: firstly, there is the instrumental structure: if you spend time pursuing the goal of getting more OIVs, you make all other goals easier.</p>
<p>There is another structure present: the mere fact that your mind is finite means that your list of goals to pursue is far smaller than the set of possible states of the universe. So when you decide that &#8220;X&#8221; is your goal, and then implement a clause in your goal system that says that you should never change your goal (like all utility maximizers do) you have done something very silly.</b></i></p>
]]></content:encoded>
	</item>
</channel>
</rss>
