Safety, Ethics & Future
What Is the AI Control Problem?
The AI control problem is the challenge of ensuring that highly capable AI systems remain under meaningful human control and continue to do what we intend, even as they become more powerful. It underlies much of AI-safety research, since a system that pursues the wrong objective effectively could be difficult to correct or stop.
Further reading
Read more about AI control problem — articles and blogs from around the web: