New paper: The Incentives that Shape Behaviour

Ryan Carey
Jan 22

Ryan Carey and Eric Langlois, introducing The Incentives that Shape Behaviour.

Machine learning algorithms are often highly effective, but it can be difficult to establish their safety and fairness. Typically, the properties of a machine learning system are established by testing. However, even if a system behaves safely in a testing environment, it may behave unsafely or unfairly when it is deployed. Alternatively, the properties of a model can be investigated by analysing input perturbations, individual decisions, or network activations, but this…


Written by Ryan Carey, AI researcher at the Future of Humanity Institute, University of Oxford, https://www.fhi.ox.ac.uk/team/ryan-carey/.
