Misalignment and misuse: whose values are manifest?


By Katja Grace, 18 November 2020, Crossposted from world spirit sock puppet.

AI-related disasters are sometimes categorized as involving misaligned AI, or misuse, or accident. Where:

  • misuse means the bad outcomes were wanted by the people involved,
  • misalignment means the bad outcomes were wanted by the AI (and not by its human creators), and
  • accident means that the bad outcomes were not wanted by those in power but happened anyway due to error.

In thinking about specific scenarios, these concepts seem less helpful.

I think a likely scenario leading to bad outcomes is that AI can be made which gives a set of people things they want, at the expense of future or distant resources that the relevant people do not care about or do not own.

For example, consider autonomous business-strategizing AI systems that are profitable additions to many companies, but in the long run accrue resources and influence and really just want certain businesses to nominally succeed, resulting in a worthless future. Suppose Bob is considering whether to get a business-strategizing AI for his business. It will make the difference between his business thriving and struggling, which will change his life. He suspects that within several hundred years, if this sort of thing continues, the AI systems will control everything. Bob probably doesn't hesitate, in the way that companies don't hesitate to use gas vehicles even if the people involved genuinely think that climate change will be a massive catastrophe in hundreds of years.


When the business-strategizing AI systems finally plough all of the resources in the universe into a few thriving 21st-century businesses, was this misuse or misalignment or accident? The strange new values that were satisfied were those of the AI systems, but the whole outcome only happened because people like Bob chose it knowingly (let's say). Bob liked it more than the long great human future in which his business was less good. That sounds like misuse. Yet in a system of many people, letting this decision fall to Bob may well have been an accident on the part of others, such as the technology's makers or legislators.

Outcomes are the result of the interplay of choices, driven by different values. Thus it isn't necessarily sensible to think of them as flowing from one entity's values or another's. Here, AI technology created a better option for both Bob and some newly minted misaligned AI values that it also created ('Bob has a great business, AI gets the future'), and that option was worse for the rest of the world. They chose it together, and the choice needed both Bob to be a misuser and the AI to be misaligned. But this isn't a weird corner case; it is a natural way for the future to be destroyed in an economy.

Thanks to Joe Carlsmith for conversation leading to this post.
