BlueDot Narrated
Audio versions of the core readings, blog posts, and papers from BlueDot courses.
BlueDot Narrated
Introduction to AI Control
•
BlueDot Impact
Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.
Audio versions of blogs and papers from BlueDot courses.
By Sarah Hastings-Woodhouse
AI Control is a research agenda that aims to prevent misaligned AI systems from causing harm. It is different from AI alignment, which aims to ensure that systems act in the best interests of their users. Put simply, aligned AIs do not want to harm humans, whereas controlled AIs can't harm humans, even if they want to.
Source:
https://bluedot.org/blog/ai-control
A podcast by BlueDot Impact.
Why might AI Control be useful?
How could we control AIs?
Limitations of AI Control