Counter Arts
Published in

Counter Arts

Are Unfriendly Values Unstable?

Source: The Moral Economist

In artificial intelligence (AI) safety, there is a concept known as human friendly values. In short, if an AI has human friendly values, then it will do things that the humans wants them to do.

This is complex for a number of reasons. First, it is difficult to define human friendly values. Second, it is hard to program complex concepts accurately. Third, humans don’t always know what they want. Finally, do…

--

--

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store