Safety, Ethics & Future

What Is the AI Control Problem?

The AI control problem is the challenge of ensuring that highly capable AI systems remain under meaningful human control and continue to do what we intend, even as they become more powerful. It underlies much of AI-safety research, since a system that pursues the wrong objective effectively could be difficult to correct or stop.

What Is the AI Control Problem?

Related topics

Further reading