Published on September 26, 2024 10:05 PM GMT
One of the questions I've heard asked is "how do you design a utility function that would make the AI do nothing?" That is, how could we put a pause button on an AI so that we could pause it whenever we wanted to? I had an idea about how one could do this, and I'm sure it has already been thought of, so I'm curious to know why it doesn't end up working.
Why can't we just write something like:
If (paused) and not (taking actions): utility=PredictedUtilityIfNotPaused
If not (paused): utility=NormalUtilityFunction
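To make the two rules concrete, here is a minimal Python sketch of the same idea. It is only an illustration: the names `State`, `make_pausable_utility`, `normal_utility`, and `predicted_utility_if_not_paused` are hypothetical, and the two utility functions are passed in as black boxes. The rules above do not say what the utility should be when the AI is paused but still taking actions, so the sketch fills that case with a placeholder value, which is purely an assumption.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class State:
    """Hypothetical summary of the situation the utility function sees."""
    paused: bool
    taking_actions: bool


def make_pausable_utility(
    normal_utility: Callable[[State], float],
    predicted_utility_if_not_paused: Callable[[State], float],
    acting_while_paused_utility: float = 0.0,  # assumption: not specified in the post
) -> Callable[[State], float]:
    """Combine the two rules from the post into a single utility function."""

    def utility(state: State) -> float:
        if not state.paused:
            # Rule 2: not paused, so use the normal utility function.
            return normal_utility(state)
        if not state.taking_actions:
            # Rule 1: paused and idle, so credit the AI with the utility
            # it was predicted to get had it not been paused.
            return predicted_utility_if_not_paused(state)
        # Paused but still acting: left undefined by the two rules,
        # so this branch returns the placeholder value.
        return acting_while_paused_utility

    return utility
```

Writing it out this way makes the open design questions visible: how `predicted_utility_if_not_paused` would actually be computed, and what value the third branch should take.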